docs: plan CLI command surface alignment#1606
Conversation
Deploying agentv with
|
| Latest commit: |
478d3c8
|
| Status: | ✅ Deploy successful! |
| Preview URL: | https://cdd0eaae.agentv.pages.dev |
| Branch Preview URL: | https://research-av-ap2w-cli-command.agentv.pages.dev |
|
Review finding:
|
|
Addressed the review blocker in What changed:
Validation: |
|
Verification blocker remains after re-fetching peer sources. I refreshed the peer clones during verification. Promptfoo, Margin evals, and DeepEval match the SHAs recorded in the plan, but Other validation passed:
Leaving the PR draft and unmerged until the Vercel snapshot row is refreshed to current fetched |
|
Updated the Vercel snapshot row in Validation for the original blocker:
Remaining merge blocker: CI is not green. The Test job failed twice on the PR merge ref, including one rerun. The repeated failure is:
The logged command failed because the generated
All other checks passed after the rerun. Leaving the PR draft and unmerged until the test failure is resolved or CI is otherwise green. |
6a22715 to
478d3c8
Compare
Summary
Adds
docs/plans/2026-07-02-cli-command-surface-alignment.md, a research report and staged plan for making AgentV's CLI command surface more intuitive while preserving AgentV's repo-native artifact and product boundaries.This PR is linked to Bead
av-ap2w.Findings
promptfoo evalsurface plusview,share,retry,cache,generate,list, andexportcommands.margin run --suite --agent-config --eval, with resource-specificinitsubcommands and bundle-based resume.deepeval test runworkflow, plus TUI/cloud viewing and many provider setup commands; AgentV should avoid copying that provider-command sprawl.init, default/no-arg run, single experiment shorthand,run-all,playground, fingerprint reuse, and--force/--dry/--smokecontrols.dashboard,results,inspect,compare,trend, andruns. The plan recommends reframing the common path first, then consolidating overlapping post-run inspection commands aroundresultsanddashboard.Source SHAs and DeepWiki Usage
Local clones inspected:
/home/entity/projects/promptfoo/promptfooat6bfc5a0c7f16f9c4717ac731d276b578e63d0769/home/entity/projects/Margin-Lab/evalsat53fb2fd080689efaf7934573d8759d14fc1043e4/home/entity/projects/confident-ai/deepevalat324355e8982bf9ee52c192215b06b4267aafa58e/home/entity/projects/vercel-labs/agent-evalata9dcc9a8c53dbc22ececc967ded7ab3963f18e67(main...origin/main [behind 8])DeepWiki MCP was used for repo-level command-surface orientation across the four peer repos and for a focused Vercel agent-eval question. Local source was treated as authoritative; the report notes that DeepWiki's Vercel summary was stale relative to the local clone.
Scope
No command implementation is included. This is a research/plan PR only.
Validation
git diff --checkgit,rg, and file reads only