ae-framework: Agent-Neutral Assurance Control Plane for Agent-Driven SDLC

docRole	narrative
lastVerified	2026-06-21

ae-framework: Agent-Neutral Assurance Control Plane for Agent-Driven SDLC

🌍 Language / 言語: English | 日本語 | Documentation / ドキュメント

English

ae-framework is an agent-neutral assurance control plane for agent-driven SDLC. Coding agents, human maintainers, CI jobs, and formal tools produce changes and raw signals; ae-framework standardises the specifications, verification evidence, policy gates, and release judgments used to decide whether those changes can be trusted.

Product thesis: Bring your own agent. Keep your assurance plane. Codex, Claude Code, GitHub Copilot, Gemini-family coding tools, human maintainers, CI jobs, and formal tools are replaceable producers; ae-framework keeps judgment artifacts, policy gates, and release decisions stable across those producer choices.

Preview launch material: docs/product/LAUNCH-KIT.md, docs/product/ONE-PAGE-PITCH.md, and docs/product/DEMO-SCRIPT.md. Start the offline demo with pnpm run demo:agent-assurance.

Current evidence route for first-time product evaluation: run the 15-minute quickstart, read the ACP-097 pilot report, then check launch-kit limitations and the controlled-comparison protocol. Internal dogfooding and pilot evidence are separate: the current external pilot report is dry-run only with 0 live external PRs collected, and the controlled comparison has not been executed. Public claims should therefore stay limited to review traceability and evidence routing, not review-speed, safety, adoption-impact, or agent-vendor superiority.

Two-layer model

flowchart TB
  subgraph H[Harness layer]
    H1[lint / test / hooks]
    H2[E2E / adapters / runners]
  end
  subgraph C[Assurance control plane]
    C1[Context Pack / evidence aggregation]
    C2[policy gate / review / release judgment]
  end
  H --> C

Baseline stabilises the harness layer with first-run / verify:lite.
Structured assurance connects Context Pack, property/MBT/conformance, and change evidence into the control plane.
High-assurance critical core strengthens the control plane for selected high-risk changes with formal/model/proof lanes.

What this repository provides

Agent-neutral assurance control plane: Context Pack, formal/conformance summaries, artifact validation, policy gates, and PR/release automation that turn producer outputs into reviewable assurance evidence.
Assurance orchestration for agent-driven SDLC: Ready-to-run GitHub Actions (PR verify / verify-lite, nightly heavy tests, Slack alerts) and CLI scripts that keep requirements, tests, and regression signals aligned without turning ae-framework into a coding-agent runtime.
Spec & Verification Kit: Traceable spec format, mutation/MBT/property verification pipelines, and formal runners for Alloy/TLA/SMT/Apalache/Kani/SPIN/CSP(cspx)/Lean4 with unified summaries.
Project scaffolding & policies: pnpm workspace layout, lint/test/type-coverage gates, label gating (typecov, flake), and TDD-friendly Git hooks.
Cacheable heavy test artifacts: scripts/pipelines/sync-test-results.mjs to restore/store/snapshot mutation + MBT results; heavy-test-trends artifacts for CI triage.
Producer integration guidance: Playbooks and connectors for Claude Code / Codex plus producer-boundary guidance for Copilot, Gemini-family tools, humans, and CI jobs; JSON-first outputs and AJV validation keep producer artifacts safe before they enter judgment.

What this is not

Not a single-model code generator — code generation is one producer, not the system of record.
Not an agent runtime or IDE plugin — bring your own agent and keep agent choice outside the judgment contracts.
Not a general-purpose Next.js UI kit or design system starter.
Not a hosted CI/CD service — workflows are provided for self-hosted GitHub runners or forks.
Not mandatory formal proof for every change — routine changes stay on the fast lane unless risk or policy selects heavier assurance.

Adoption profiles

Baseline: verify:lite, schema/AJV validation, PR gates for routine application delivery.
Structured assurance: Context Pack, property/MBT/conformance, richer traceability and change evidence.
High-assurance critical core: formal/model/proof lanes plus proof-carrying change packages for selected high-risk components.

Quick start (local)

# Prereqs: Node.js 20.11+ (<23), pnpm 10
corepack enable
corepack prepare pnpm@10.0.0 --activate
pnpm install
pnpm run first-run
pnpm run setup-hooks

# Offline BYO-agent assurance demo (no GitHub token or hosted LLM API)
pnpm run demo:agent-assurance
# The demo already writes artifacts/review/agent-assurance-demo/assurance-review.md.
# This no-arg renderer targets the default CI/local artifact paths; pass explicit
# --*-summary options from docs/guides/byo-agent-assurance-quickstart.md for demo artifacts.
pnpm run assurance:review-surface

# Fast feedback
pnpm run lint
pnpm run test:fast

# Mutation quick run (mktemp-based; supports STRYKER_TEMP_DIR)
STRYKER_TIME_LIMIT=0 pnpm run pipelines:mutation:quick
# If report generation is intentionally optional, set MUTATION_REPORT_STRICT=0

# Formal smoke (non-blocking summary; cspx backend preferred)
pnpm run verify:formal
pnpm run verify:csp -- --file spec/csp/cspx-smoke.cspm --mode typecheck

# Heavy test cache & trend snapshot
node scripts/pipelines/sync-test-results.mjs --store
node scripts/pipelines/sync-test-results.mjs --snapshot
node scripts/pipelines/compare-test-trends.mjs --json-output reports/heavy-test-trends.json

npm install is intentionally blocked by preinstall because this repository uses pnpm workspace dependencies (workspace:*). pnpm run doctor:env writes artifacts/doctor/env.json and returns 0 (ok) / 2 (warning) / 1 (error) / 3 (invalid arguments). pnpm run first-run runs doctor:env -> build -> verify:lite and writes summary JSON/Markdown files under the artifacts/first-run directory.

Documentation pointers

Overview & nav: docs/README.md, docs/project-organization.md
Maintenance operations: docs/maintenance/branch-cleanup-runbook.md
Worktree maintenance operations: docs/maintenance/worktree-cleanup-runbook.md
TODO triage operations: docs/maintenance/todo-triage-runbook.md
Current architecture snapshot: docs/architecture/CURRENT-SYSTEM-OVERVIEW.md
Zero-based ideal redesign blueprint: docs/architecture/ZERO-BASED-IDEAL-DESIGN.md
Product fit (what to input/output, which tools to use): docs/product/PRODUCT-FIT-INPUT-OUTPUT-TOOL-MAP.md
Assurance control plane positioning: docs/product/ASSURANCE-CONTROL-PLANE.md
Assurance control plane policy: docs/product/ASSURANCE-CONTROL-PLANE-POLICY.md
Agent-neutral assurance roadmap: docs/product/AGENT-NEUTRAL-ASSURANCE-ROADMAP.md
Public preview launch kit: docs/product/LAUNCH-KIT.md, docs/product/ONE-PAGE-PITCH.md, docs/product/DEMO-SCRIPT.md
Product evidence and limitations: docs/product/EFFECTIVENESS-METRICS.md, docs/product/DOGFOODING-REPORT-2026Q3.md, docs/product/PILOT-REPORT-2026Q3-01.md, docs/product/CONTROLLED-COMPARISON-PROTOCOL.md
Agent PR assurance metrics collector: docs/ci/agent-pr-assurance-metrics.md
BYO-agent assurance onboarding: docs/guides/byo-agent-assurance-onboarding.md
15-minute BYO-agent assurance quickstart: docs/guides/byo-agent-assurance-quickstart.md
Assurance model (claim / level / lane / evidence): docs/quality/ASSURANCE-MODEL.md
Assurance operations runbook: docs/quality/assurance-operations-runbook.md
Assurance onboarding checklist: docs/guides/assurance-onboarding-checklist.md
PR automation (Copilot -> auto-fix -> auto-merge): docs/ci/pr-automation.md
Release engineering (release verify / post-deploy): docs/operate/release-engineering.md
CI/quality gates: docs/ci/phase2-ci-hardening-outline.md, docs/ci/label-gating.md, docs/ci/harness-taxonomy.md
Development deep dives: Enhanced State Manager, Circuit Breaker
Docs consistency lint: pnpm run check:doc-consistency (docs/quality/doc-consistency-lint.md)
Heavy test observability: docs/ci/heavy-test-trend-archive.md, docs/ci/heavy-test-alerts.md, docs/ci/heavy-test-album.md
Specification & verification: docs/quality/, docs/quality/formal-csp.md, docs/ci-policy.md, docs/testing/integration-runtime-helpers.md
Context Pack v1 (spec contract): docs/spec/context-pack.md
Context Pack onboarding checklist: docs/guides/context-pack-onboarding-checklist.md
Context Pack Phase5+ cookbook: docs/guides/context-pack-phase5-cookbook.md
Context Pack troubleshooting runbook: docs/operations/context-pack-troubleshooting.md
Contract catalog (input/decision/evidence/operation): docs/reference/CONTRACT-CATALOG.md
Spec & Verification Kit (minimal activation guide): docs/reference/SPEC-VERIFICATION-KIT-MIN.md
Connectors & agent workflows: docs/integrations/CLAUDE-CODE-TASK-TOOL-INTEGRATION.md, docs/integrations/CODEX-INTEGRATION.md
License scope and notice management: LICENSE, LICENSE-SCOPE.md, TRADEMARKS.md, THIRD_PARTY_NOTICES.md

Japanese

ae-framework は エージェント協調型SDLCのための、エージェント非依存の assurance control plane です。coding agent、人間のmaintainer、CI job、formal tool は変更やraw signalを生成する producer であり、ae-framework はその変更を信頼して merge / release できるかを、仕様・検証・証跡・policy gate・release judgment に基づいて判断可能にします。

Product thesis: Bring your own agent. Keep your assurance plane. Codex、Claude Code、GitHub Copilot、Gemini系tool、人間のmaintainer、CI job、formal tool は交換可能な producer であり、ae-framework は producer の選択に依存しない judgment artifact、policy gate、release decision を維持します。

Preview launch material: docs/product/LAUNCH-KIT.md, docs/product/ONE-PAGE-PITCH.md, docs/product/DEMO-SCRIPT.md。offline demo は pnpm run demo:agent-assurance から開始します。

初見の product evidence 導線は、15分 quickstart → ACP-097 pilot report → launch kit の limitations → controlled-comparison protocol の順です。内部 dogfooding と pilot evidence は別扱いです。現在の external pilot report は dry-run only で live external PR の収集数は0件、controlled comparison は未実施です。そのため公開claimは review traceability と evidence routing に限定し、未測定の review-speed、安全性、導入効果、agent vendor 優位性は主張しません。

二層モデル

flowchart TB
  subgraph H[Harness layer]
    H1[lint / test / hooks]
    H2[E2E / adapters / runners]
  end
  subgraph C[Assurance control plane]
    C1[Context Pack / evidence aggregation]
    C2[policy gate / review / release judgment]
  end
  H --> C

Baseline は first-run / verify:lite で harness layer を安定化させる段階です。
Structured assurance は Context Pack、property/MBT/conformance、change artifact を control plane に接続する段階です。
High-assurance critical core は selected high-risk change に対して control plane を強化する段階です。

提供するもの

Agent-neutral assurance control plane: Context Pack、形式検証/Conformance要約、artifact validation、policy gate、PR/release 自動化を束ね、producer output を判断可能な証跡へ変換。
AIエージェント協調型SDLCの証跡・検証オーケストレーション: PR Verify／夜間ヘビーテスト／Slack通知などのGitHub ActionsとCLIスクリプトで、要件・テスト・退行検知を review / release judgment 向けの証跡として整列させる。
仕様・検証キット: トレーサブルな仕様フォーマット、mutation/MBT/Propertyテストのパイプライン、scripts/pipelines/compare-test-trends.mjs によるトレンド比較。
プロジェクト骨子とポリシー: pnpmワークスペース、Lint/Test/型カバレッジのゲート、ラベルゲーティング（typecov・flake）、TDDフック。
ヘビーテスト成果物のキャッシュ: scripts/pipelines/sync-test-results.mjs による store/restore/snapshot、heavy-test-trends アーティファクトでCIトリアージを高速化。
Producer統合指針: Claude Code / Codex 向けプレイブックに加え、Copilot、Gemini系tool、人間、CI job を producer として扱う境界を示し、JSON-first成果物とAJV検証で judgment に入る前の生成物を安全に扱う。

提供しないもの

単一モデル依存のコード生成専用ツール。codegen は producer の一つであり、SSOT は spec / contract / artifact に置く。
エージェント実行ランタイムやIDEプラグイン（各自のエージェントを利用し、agent選択を judgment contract から分離）。
汎用のNext.js UIスターターやデザインシステム配布物。
ホスト型CI/CDサービス（GitHub Actionsの定義を提供）。
すべての変更に formal proof を強制する運用。通常変更は fast lane に留め、risk や policy が必要とする場合だけ heavy lane へ昇格する。

導入プロファイル

Baseline: verify:lite、schema/AJV、PRゲートで通常の業務アプリを安定化。
Structured assurance: Context Pack、property/MBT/conformance、change evidence を追加し、仕様と検証の対応を明示。
High-assurance critical core: 重要コンポーネントに対して formal/model/proof lane と proof-carrying change package を適用。

すぐ試す

# 前提: Node.js 20.11+ (<23), pnpm 10
corepack enable
corepack prepare pnpm@10.0.0 --activate
pnpm install
pnpm run first-run
pnpm run setup-hooks
pnpm run demo:agent-assurance
# demo は artifacts/review/agent-assurance-demo/assurance-review.md を生成します。
# 引数なしの renderer は既定の CI/local artifact path 向けです。demo artifact を再生成する場合は
# docs/guides/byo-agent-assurance-quickstart.md の --*-summary 指定を使ってください。
pnpm run assurance:review-surface

pnpm run lint
pnpm run test:fast

# Mutation quick（mktemp利用、STRYKER_TEMP_DIR対応）
STRYKER_TIME_LIMIT=0 pnpm run pipelines:mutation:quick
# レポート生成失敗を許容する場合のみ MUTATION_REPORT_STRICT=0 を付与

# ヘビーテスト結果のキャッシュ運用
node scripts/pipelines/sync-test-results.mjs --store
node scripts/pipelines/sync-test-results.mjs --snapshot
node scripts/pipelines/compare-test-trends.mjs --json-output reports/heavy-test-trends.json

このリポジトリは workspace:* を使うため、npm install は preinstall ガードで意図的に失敗させています。pnpm install を使用してください。 pnpm run doctor:env は artifacts/doctor/env.json を出力し、終了コードは 0（正常）/2（警告）/1（要修正）/3（引数不正）です。 pnpm run first-run は doctor:env -> build -> verify:lite を順に実行し、artifacts/first-run ディレクトリに summary の JSON/Markdown を出力します。

ドキュメントへの入り口

全体概要: docs/README.md, docs/project-organization.md
現行アーキテクチャ全体像: docs/architecture/CURRENT-SYSTEM-OVERVIEW.md
ゼロベース再設計の理想像: docs/architecture/ZERO-BASED-IDEAL-DESIGN.md
適用対象/入力/出力/ツール適性: docs/product/PRODUCT-FIT-INPUT-OUTPUT-TOOL-MAP.md
Assurance control plane の位置付け: docs/product/ASSURANCE-CONTROL-PLANE.md
Assurance control plane policy: docs/product/ASSURANCE-CONTROL-PLANE-POLICY.md
Agent-neutral assurance roadmap: docs/product/AGENT-NEUTRAL-ASSURANCE-ROADMAP.md
Public preview launch kit: docs/product/LAUNCH-KIT.md, docs/product/ONE-PAGE-PITCH.md, docs/product/DEMO-SCRIPT.md
Product evidence と limitations: docs/product/EFFECTIVENESS-METRICS.md, docs/product/DOGFOODING-REPORT-2026Q3.md, docs/product/PILOT-REPORT-2026Q3-01.md, docs/product/CONTROLLED-COMPARISON-PROTOCOL.md
Agent PR assurance metrics collector: docs/ci/agent-pr-assurance-metrics.md
BYO-agent assurance onboarding: docs/guides/byo-agent-assurance-onboarding.md
15分 BYO-agent assurance quickstart: docs/guides/byo-agent-assurance-quickstart.md
Assurance model（claim / level / lane / evidence）: docs/quality/ASSURANCE-MODEL.md
PR自動化（Copilot→auto-fix→auto-merge）: docs/ci/pr-automation.md
リリース運用（release verify / post-deploy verify）: docs/operate/release-engineering.md
CI/品質ゲート: docs/ci/phase2-ci-hardening-outline.md, docs/ci/label-gating.md, docs/ci/harness-taxonomy.md
ドキュメント検証ポリシー: docs/ci/docs-doctest-policy.md
開発向け設計ドキュメント: Enhanced State Manager, Circuit Breaker
ドキュメント整合チェック: pnpm run check:doc-consistency（docs/quality/doc-consistency-lint.md）
ヘビーテスト観測: docs/ci/heavy-test-trend-archive.md, docs/ci/heavy-test-alerts.md, docs/ci/heavy-test-album.md
仕様と検証: docs/ci-policy.md, docs/testing/integration-runtime-helpers.md, docs/quality/, docs/quality/formal-csp.md
Context Pack v1（仕様入力契約）: docs/spec/context-pack.md
Context Pack 導入チェックリスト: docs/guides/context-pack-onboarding-checklist.md
Context Pack Phase5+ 実践ガイド: docs/guides/context-pack-phase5-cookbook.md
Context Pack 障害対応ランブック: docs/operations/context-pack-troubleshooting.md
契約カタログ（input/decision/evidence/operation）: docs/reference/CONTRACT-CATALOG.md
Spec & Verification Kit（最小パッケージ・有効化手順）: docs/reference/SPEC-VERIFICATION-KIT-MIN.md
エージェント統合: docs/integrations/CLAUDE-CODE-TASK-TOOL-INTEGRATION.md, docs/integrations/CODEX-INTEGRATION.md
ライセンス適用範囲と notice 管理: LICENSE, LICENSE-SCOPE.md, TRADEMARKS.md, THIRD_PARTY_NOTICES.md

🔒 TypeScript Policy / TypeScript ポリシー

@ts-expect-error Usage Policy

When using @ts-expect-error comments, follow this structured format:

// @ts-expect-error owner:@username expires:YYYY-MM-DD reason: detailed description
const _example: number = 'type mismatch for policy example';

Core fields:

owner:@username - GitHub handle responsible for the suppression
expires:YYYY-MM-DD - Date when this suppression should be reviewed/removed
reason: description - Detailed explanation (minimum 12 characters)

Each suppression entry still carries all three fields.

Examples:

// @ts-expect-error owner:@alice expires:2027-12-31 reason: narrowing todo for complex union
const result = complexUnion as string;

// @ts-expect-error owner:@bob expires:2027-06-15 reason: legacy API compatibility until v2 migration
legacyApi.unsafeMethod(data);

Policy enforcement:

✅ CI validates all @ts-expect-error comments
⚠️ Local development shows warnings only
🔍 Script: node scripts/ci/check-expect-error.mjs

Documentation / ドキュメント

Full navigation: docs/README.md
Quick starts: docs/getting-started/QUICK-START-GUIDE.md (baseline -> structured assurance -> high-assurance PR), docs/getting-started/PHASE-6-GETTING-STARTED.md
CLI Reference: docs/reference/CLI-COMMANDS-REFERENCE.md, API: docs/reference/API.md
CLI entrypoints in this repo: main CLI src/cli/index.ts (ae / ae-framework), benchmark CLI src/cli/benchmark-cli.ts (ae-benchmark), legacy compatibility shim src/cli.ts

🤝 Contributing / 貢献

We welcome contributions! See CONTRIBUTING.md. 貢献を歓迎します！詳細はCONTRIBUTING.mdをご覧ください。

📄 License / ライセンス

Apache-2.0 - see LICENSE. Scope, trademark, and third-party notice handling are documented in LICENSE-SCOPE.md, TRADEMARKS.md, and THIRD_PARTY_NOTICES.md.

🙏 Acknowledgments

Built with: MCP SDK, Claude/Codex task tools, pnpm workspace, Vitest, AJV, GitHub Actions.

ae-framework — automating agentic specification, verification, and CI quality gates.

Name		Name	Last commit message	Last commit date
Latest commit History 5,313 Commits
.ae		.ae
.devcontainer		.devcontainer
.github		.github
api		api
apps		apps
artifacts		artifacts
benchmarks/baseline		benchmarks/baseline
codex		codex
config		config
configs		configs
contracts		contracts
docker		docker
docs		docs
examples		examples
fixtures		fixtures
infra		infra
observability		observability
packages		packages
plans		plans
podman		podman
policies		policies
policy		policy
presets/formal		presets/formal
proofs/lean		proofs/lean
samples		samples
schema		schema
scripts		scripts
security/sbom		security/sbom
spec		spec
src		src
templates		templates
test-cassettes		test-cassettes
tests		tests
types		types
.dependency-cruiser.js		.dependency-cruiser.js
.dockerignore		.dockerignore
.editorconfig		.editorconfig
.env.example		.env.example
.gitattributes		.gitattributes
.gitignore		.gitignore
.gitleaks.toml		.gitleaks.toml
.npmrc		.npmrc
.nycrc.json		.nycrc.json
.tool-versions		.tool-versions
AGENTS.md		AGENTS.md
CLAUDE.md		CLAUDE.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
LICENSE-SCOPE.md		LICENSE-SCOPE.md
NOTICE		NOTICE
README.md		README.md
SECURITY.md		SECURITY.md
THIRD_PARTY_NOTICES.md		THIRD_PARTY_NOTICES.md
TRADEMARKS.md		TRADEMARKS.md
eslint.config.js		eslint.config.js
package.json		package.json
pnpm-lock.yaml		pnpm-lock.yaml
pnpm-workspace.yaml		pnpm-workspace.yaml
tsconfig.json		tsconfig.json
vitest.config.ts		vitest.config.ts

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

ae-framework: Agent-Neutral Assurance Control Plane for Agent-Driven SDLC

English

Two-layer model

What this repository provides

What this is not

Adoption profiles

Quick start (local)

Documentation pointers

Japanese

二層モデル

提供するもの

提供しないもの

導入プロファイル

すぐ試す

ドキュメントへの入り口

🔒 TypeScript Policy / TypeScript ポリシー

@ts-expect-error Usage Policy

Documentation / ドキュメント

🤝 Contributing / 貢献

📄 License / ライセンス

🙏 Acknowledgments

About

Licenses found

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

ae-framework: Agent-Neutral Assurance Control Plane for Agent-Driven SDLC

English

Two-layer model

What this repository provides

What this is not

Adoption profiles

Quick start (local)

Documentation pointers

Japanese

二層モデル

提供するもの

提供しないもの

導入プロファイル

すぐ試す

ドキュメントへの入り口

🔒 TypeScript Policy / TypeScript ポリシー

@ts-expect-error Usage Policy

Documentation / ドキュメント

🤝 Contributing / 貢献

📄 License / ライセンス

🙏 Acknowledgments

About

Topics

Resources

License

Licenses found

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages