Docs coverage audit（2026-04-27）¶

本页是在线文档优化的 source-backed audit log。它不是产品功能说明页，而是用来回答：最近实现的能力是否已经在公开文档里有入口、术语是否一致、哪些页面只是验证未改、哪些仍需后续处理。

docs + CLI UX source-backed interactive config wizard mkdocs strict required

审计结论¶

Quickstart 和 templates 现在明确区分 观察优先 / L1-only 与 L2/L3-ready operator path，避免把 no-LLM 路径描述成所有用户的默认终点。
Metric Dictionary 收敛为 canonical-only 字典，覆盖 session_risk_sum、high_or_critical_count、risk_velocity、system_security_posture 和 D1-D6。
Reporting API 示例与后端合同重新对齐，窗口字段、Enterprise posture 和 D1-D6 不再混用旧口径。
L2/L3 页面补充 operator 合同和同步判决 vs advisory-only full-review 的边界。
L3 Advisory 页面移除页面内样式岛，改用共享 .cs-* 组件。
clawsentry config wizard --interactive 现在提供 TTY 分步向导；显式交互遇到非 TTY 会失败并提示使用 --non-interactive，--non-interactive 继续作为 CI/模板路径。
AgentDoG / ATBench 已完成 converter + ClawSentry L2 replay smoke；后续 benchmark 评测将在新窗口继续推进。

Recent-feature coverage matrix¶

Feature / capability	Source evidence	Public docs surface	Web UI / API / CLI surface	Status	Planned action
L3 advisory jobs / full-review	`src/clawsentry/gateway/server.py` full-review route; `src/clawsentry/cli/l3_command.py`; `src/clawsentry/tests/test_l3_command.py`	L3 咨询审查, Reporting API, CLI	Session Detail full-review controls; `POST /report/session/{id}/l3-advisory/full-review`; `clawsentry l3 full-review`	Covered	Keep advisory-only wording and `canonical_decision_mutated=false` examples in sync with API contract tests.
Heartbeat / idle aggregate queueing	`src/clawsentry/gateway/server.py` `heartbeat_aggregate`; `src/clawsentry/tests/test_gateway.py` heartbeat aggregate cases	L3 咨询审查, Reporting API	SSE advisory snapshot/job events; `clawsentry l3 jobs list/run-next/drain`	Covered	Continue to emphasize no scheduler/daemon and bounded one-shot drain behavior.
Gemini CLI hooks	`src/clawsentry/adapters/gemini_adapter.py`; integration tests and CLI init paths	Gemini CLI 集成, Quickstart	Framework startup / hook adapter	Covered	Verify hook boundary language during each release.
Benchmark mode	`src/clawsentry/cli/benchmark_command.py`; benchmark docs/tests	Benchmark 模式, Quickstart, Templates	`clawsentry benchmark env/enable/run/disable`	Covered	Keep temporary `CODEX_HOME` warning visible for Codex benchmark examples.
Metric/window fields	`src/clawsentry/gateway/session_registry.py` metrics; `src/clawsentry/gateway/server.py` reporting payloads; `src/clawsentry/ui/src/api/types.ts`; `docs/validation/v0.5.12-metric-wizard-agentdog-progress-2026-04-27.md`	Metric Dictionary, Dashboard, Reporting API	`/report/sessions`, `/report/session/{id}/risk`, Sessions row, Session Detail cards	Release-ready	Prefer `window_risk_summary` + canonical names; legacy fields are fallback-only.
Web UI L3 surfaces	`src/clawsentry/ui/src/pages/SessionDetail.tsx`; `src/clawsentry/ui/src/components/RuntimeFeed.tsx`	L3 咨询审查, Dashboard	Session Detail full-review button, L3 advisory review card, Runtime Feed	Covered	Future screenshots can be added when visual smoke tooling is available.
Token budget / LLM usage	`src/clawsentry/cli/test_llm_command.py`; config/env docs; metrics token counters	LLM 配置, Templates, Reporting API	`clawsentry test-llm --json`, Prometheus `clawsentry_llm_tokens_total`	Covered	Keep examples provider-neutral and budget-first.
Multi-framework startup	`src/clawsentry/cli/start_command.py`; adapter packages	Quickstart, integration pages	`clawsentry start --framework ...`	Covered	Framework table should stay honest about Codex monitoring vs optional managed hooks.
Latch integration	`src/clawsentry/latch/*`; docs integration page	Latch 集成, homepage Latch callout	Latch daemon / bridge surfaces	Covered	No changes needed in this pass beyond nav verification.
OpenClaw / Codex managed setup boundaries	`src/clawsentry/adapters/openclaw_*`; `src/clawsentry/adapters/codex_adapter.py`; Codex init tests	OpenClaw 集成, Codex CLI 集成, Quickstart	`clawsentry init codex --setup`, OpenClaw webhook/WebSocket	Covered	Keep Codex text clear: default monitoring + optional Bash preflight/native hook enhancement.
Interactive config wizard	`src/clawsentry/cli/config_command.py` TTY prompt flow and non-TTY guard; CLI parser flags; `docs/validation/v0.5.12-metric-wizard-agentdog-progress-2026-04-27.md`	Quickstart, Templates, CLI	`clawsentry config wizard --interactive`; `clawsentry config wizard --non-interactive ...`	Release-ready	Keep `--non-interactive` examples for CI and copy/paste templates; framework selection is next-step guidance, not hook installation.
AgentDoG / ATBench benchmark	`benchmarks/scripts/agentdog_atbench_clawsentry.py`; `benchmarks/RESULTS.md`; `benchmarks/notes/agentdog-atbench/RUNBOOK.md`	Benchmark docs and validation progress record	Offline trajectory conversion + ClawSentry L1/L2/L3 replay	Handoff recorded	Continue in next window with labeled ATBench records and raw-vs-protected runner matrix.

Pages touched / verified¶

Page	Action	Source evidence used	Notes
`getting-started/quickstart.md`	Refresh	CLI `config wizard`, framework startup and Codex managed-hook boundaries	Added two-path journey and interactive wizard note.
`configuration/templates.md`	Refresh	Config schema expectations, L2/L3 budget/timeout fields	Added template chooser by latency/budget/strictness.
`api/metric-dictionary.md`	Rewrite section	`session_registry.py`, `server.py`, UI `types.ts`, Session Detail/Sessions fields	Replaced repeated alias-heavy content with one canonical dictionary, formulas, D1-D6 table, UI/API read path.
`api/reporting.md`	Refresh examples	`session_registry.py`, `server.py`, `enterprise.py`	Reconciled `window_risk_summary`, Enterprise posture and D1-D6 examples with backend payloads.
`decision-layers/l2-semantic.md`	Refresh	`SemanticAnalyzer` behavior and L3 advisory boundary	Added operator path clarifying L2 vs L3.
`decision-layers/l3-agent.md`	Refresh	L3 trigger/runtime telemetry and advisory docs	Added operator contract and L1/L2/L3/Advisory contrast.
`decision-layers/l3-advisory.md`	Style conversion	Existing content + shared CSS contract	Removed local CSS; reused `.cs-doc-hero`, `.cs-card-grid`, `.cs-pill`, `.cs-flow-strip`.
`stylesheets/clawsentry-docs.css`	Style contract	Existing docs components and Material theme constraints	Added component contract plus shared operator/flow/pill/API classes.
`mkdocs.yml`	Nav refresh	Existing operations nav	Added this audit page so matrix is not orphaned.

Verified unchanged / linked surfaces¶

Surface	Why unchanged	Verification target
`dashboard/index.md`	Already explains Dashboard / Sessions / Session Detail hierarchy and metric fallback ordering.	Link from Metric Dictionary and Quickstart.
`integration/codex.md`	Already distinguishes monitoring from optional managed native hooks.	Public docs contract checks `clawsentry init codex --setup`, `PreToolUse(Bash)`.
`integration/gemini-cli.md`	Recent-feature entry exists in nav and integration section.	Nav path exists and build includes page.
`operations/benchmark-mode.md`	Dedicated benchmark path exists and is linked from Quickstart/Templates.	Build and link sanity.

Follow-up candidates¶

These are intentionally not part of this docs-only pass:

AgentDoG labeled evaluation: select labeled ATBench safe/unsafe records and compute unsafe recall / safe false-positive rate before documenting safety improvement.
Raw vs ClawSentry live runners: implement a3s-code, Claude Code, Gemini CLI, Codex and optional OpenClaw live runner comparisons after offline replay metrics are stable.
Optional clawsentry setup alias: keep config wizard as the tested setup surface; add a separate alias only if there is a concrete release need.
Rendered visual screenshots: capture light/dark screenshots for Quickstart, Templates, Metric Dictionary and L3 pages when browser tooling is available in CI or release validation.
Generated API excerpt sync: if OpenAPI generation changes response schemas, rerun python scripts/docs_api_inventory.py validate and refresh api/reference.md / api/validity-report.md.