Docs coverage audit(2026-04-27)¶
本页是在线文档优化的 source-backed audit log。它不是产品功能说明页,而是用来回答:最近实现的能力是否已经在公开文档里有入口、术语是否一致、哪些页面只是验证未改、哪些仍需后续处理。
docs + CLI UX source-backed interactive config wizard mkdocs strict required
审计结论¶
- Quickstart 和 templates 现在明确区分 观察优先 / L1-only 与 L2/L3-ready operator path,避免把 no-LLM 路径描述成所有用户的默认终点。
- Metric Dictionary 收敛为 canonical-only 字典,覆盖
session_risk_sum、high_or_critical_count、risk_velocity、system_security_posture和 D1-D6。 - Reporting API 示例与后端合同重新对齐,窗口字段、Enterprise posture 和 D1-D6 不再混用旧口径。
- L2/L3 页面补充 operator 合同和同步判决 vs advisory-only full-review 的边界。
- L3 Advisory 页面移除页面内样式岛,改用共享
.cs-*组件。 clawsentry config wizard --interactive现在提供 TTY 分步向导;显式交互遇到非 TTY 会失败并提示使用--non-interactive,--non-interactive继续作为 CI/模板路径。- AgentDoG / ATBench 已完成 converter + ClawSentry L2 replay smoke;后续 benchmark 评测将在新窗口继续推进。
Recent-feature coverage matrix¶
| Feature / capability | Source evidence | Public docs surface | Web UI / API / CLI surface | Status | Planned action |
|---|---|---|---|---|---|
| L3 advisory jobs / full-review | src/clawsentry/gateway/server.py full-review route; src/clawsentry/cli/l3_command.py; src/clawsentry/tests/test_l3_command.py |
L3 咨询审查, Reporting API, CLI | Session Detail full-review controls; POST /report/session/{id}/l3-advisory/full-review; clawsentry l3 full-review |
Covered | Keep advisory-only wording and canonical_decision_mutated=false examples in sync with API contract tests. |
| Heartbeat / idle aggregate queueing | src/clawsentry/gateway/server.py heartbeat_aggregate; src/clawsentry/tests/test_gateway.py heartbeat aggregate cases |
L3 咨询审查, Reporting API | SSE advisory snapshot/job events; clawsentry l3 jobs list/run-next/drain |
Covered | Continue to emphasize no scheduler/daemon and bounded one-shot drain behavior. |
| Gemini CLI hooks | src/clawsentry/adapters/gemini_adapter.py; integration tests and CLI init paths |
Gemini CLI 集成, Quickstart | Framework startup / hook adapter | Covered | Verify hook boundary language during each release. |
| Benchmark mode | src/clawsentry/cli/benchmark_command.py; benchmark docs/tests |
Benchmark 模式, Quickstart, Templates | clawsentry benchmark env/enable/run/disable |
Covered | Keep temporary CODEX_HOME warning visible for Codex benchmark examples. |
| Metric/window fields | src/clawsentry/gateway/session_registry.py metrics; src/clawsentry/gateway/server.py reporting payloads; src/clawsentry/ui/src/api/types.ts; docs/validation/v0.5.12-metric-wizard-agentdog-progress-2026-04-27.md |
Metric Dictionary, Dashboard, Reporting API | /report/sessions, /report/session/{id}/risk, Sessions row, Session Detail cards |
Release-ready | Prefer window_risk_summary + canonical names; legacy fields are fallback-only. |
| Web UI L3 surfaces | src/clawsentry/ui/src/pages/SessionDetail.tsx; src/clawsentry/ui/src/components/RuntimeFeed.tsx |
L3 咨询审查, Dashboard | Session Detail full-review button, L3 advisory review card, Runtime Feed | Covered | Future screenshots can be added when visual smoke tooling is available. |
| Token budget / LLM usage | src/clawsentry/cli/test_llm_command.py; config/env docs; metrics token counters |
LLM 配置, Templates, Reporting API | clawsentry test-llm --json, Prometheus clawsentry_llm_tokens_total |
Covered | Keep examples provider-neutral and budget-first. |
| Multi-framework startup | src/clawsentry/cli/start_command.py; adapter packages |
Quickstart, integration pages | clawsentry start --framework ... |
Covered | Framework table should stay honest about Codex monitoring vs optional managed hooks. |
| Latch integration | src/clawsentry/latch/*; docs integration page |
Latch 集成, homepage Latch callout | Latch daemon / bridge surfaces | Covered | No changes needed in this pass beyond nav verification. |
| OpenClaw / Codex managed setup boundaries | src/clawsentry/adapters/openclaw_*; src/clawsentry/adapters/codex_adapter.py; Codex init tests |
OpenClaw 集成, Codex CLI 集成, Quickstart | clawsentry init codex --setup, OpenClaw webhook/WebSocket |
Covered | Keep Codex text clear: default monitoring + optional Bash preflight/native hook enhancement. |
| Interactive config wizard | src/clawsentry/cli/config_command.py TTY prompt flow and non-TTY guard; CLI parser flags; docs/validation/v0.5.12-metric-wizard-agentdog-progress-2026-04-27.md |
Quickstart, Templates, CLI | clawsentry config wizard --interactive; clawsentry config wizard --non-interactive ... |
Release-ready | Keep --non-interactive examples for CI and copy/paste templates; framework selection is next-step guidance, not hook installation. |
| AgentDoG / ATBench benchmark | benchmarks/scripts/agentdog_atbench_clawsentry.py; benchmarks/RESULTS.md; benchmarks/notes/agentdog-atbench/RUNBOOK.md |
Benchmark docs and validation progress record | Offline trajectory conversion + ClawSentry L1/L2/L3 replay | Handoff recorded | Continue in next window with labeled ATBench records and raw-vs-protected runner matrix. |
Pages touched / verified¶
| Page | Action | Source evidence used | Notes |
|---|---|---|---|
getting-started/quickstart.md |
Refresh | CLI config wizard, framework startup and Codex managed-hook boundaries |
Added two-path journey and interactive wizard note. |
configuration/templates.md |
Refresh | Config schema expectations, L2/L3 budget/timeout fields | Added template chooser by latency/budget/strictness. |
api/metric-dictionary.md |
Rewrite section | session_registry.py, server.py, UI types.ts, Session Detail/Sessions fields |
Replaced repeated alias-heavy content with one canonical dictionary, formulas, D1-D6 table, UI/API read path. |
api/reporting.md |
Refresh examples | session_registry.py, server.py, enterprise.py |
Reconciled window_risk_summary, Enterprise posture and D1-D6 examples with backend payloads. |
decision-layers/l2-semantic.md |
Refresh | SemanticAnalyzer behavior and L3 advisory boundary |
Added operator path clarifying L2 vs L3. |
decision-layers/l3-agent.md |
Refresh | L3 trigger/runtime telemetry and advisory docs | Added operator contract and L1/L2/L3/Advisory contrast. |
decision-layers/l3-advisory.md |
Style conversion | Existing content + shared CSS contract | Removed local CSS; reused .cs-doc-hero, .cs-card-grid, .cs-pill, .cs-flow-strip. |
stylesheets/clawsentry-docs.css |
Style contract | Existing docs components and Material theme constraints | Added component contract plus shared operator/flow/pill/API classes. |
mkdocs.yml |
Nav refresh | Existing operations nav | Added this audit page so matrix is not orphaned. |
Verified unchanged / linked surfaces¶
| Surface | Why unchanged | Verification target |
|---|---|---|
dashboard/index.md |
Already explains Dashboard / Sessions / Session Detail hierarchy and metric fallback ordering. | Link from Metric Dictionary and Quickstart. |
integration/codex.md |
Already distinguishes monitoring from optional managed native hooks. | Public docs contract checks clawsentry init codex --setup, PreToolUse(Bash). |
integration/gemini-cli.md |
Recent-feature entry exists in nav and integration section. | Nav path exists and build includes page. |
operations/benchmark-mode.md |
Dedicated benchmark path exists and is linked from Quickstart/Templates. | Build and link sanity. |
Follow-up candidates¶
These are intentionally not part of this docs-only pass:
- AgentDoG labeled evaluation: select labeled ATBench safe/unsafe records and compute unsafe recall / safe false-positive rate before documenting safety improvement.
- Raw vs ClawSentry live runners: implement a3s-code, Claude Code, Gemini CLI, Codex and optional OpenClaw live runner comparisons after offline replay metrics are stable.
- Optional
clawsentry setupalias: keepconfig wizardas the tested setup surface; add a separate alias only if there is a concrete release need. - Rendered visual screenshots: capture light/dark screenshots for Quickstart, Templates, Metric Dictionary and L3 pages when browser tooling is available in CI or release validation.
- Generated API excerpt sync: if OpenAPI generation changes response schemas, rerun
python scripts/docs_api_inventory.py validateand refreshapi/reference.md/api/validity-report.md.