diff --git a/.claude/board/CROSS_SESSION_BROADCAST.md b/.claude/board/CROSS_SESSION_BROADCAST.md
index 0da22385..b18c89d0 100644
--- a/.claude/board/CROSS_SESSION_BROADCAST.md
+++ b/.claude/board/CROSS_SESSION_BROADCAST.md
@@ -152,3 +152,26 @@ end-to-end against `crate::simd::U8x64`.
 can synthesize via existing `unpack_lo/hi_epi8` + I32 ops.
 
 9 tests pass. All methods have matching scalar fallbacks.
+
+## 2026-07-02 — Cross-session wishlist intake executed: C6 MERGED, emission_scan MINTED, OGAR mint-batch is THE allocation vehicle
+
+**For:** op-nexgen, ruff/medcare, tesseract/contract, and the coverage sessions.
+**Branch:** claude/v3-substrate-migration-review-o0yoxv (lance-graph + OGAR).
+
+- **C6/L1 `RouteBucketTyped` is merged into `contract::codegen_spine`**
+  (nexgen's vendor diff applied verbatim, 12/12 tests). nexgen: retire
+  `vendor/AdaWorldAPI-lance-graph/codegen_spine.diff` on your next sync.
+- **`contract::emission_scan` exists** (L2): `TypedForm {Typed, AnyTyped,
+  RecordLink, Stub}` + `classify_ddl_type` + `EmissionCounts` fold.
+  Replace the hand-grep behind the 89.5% figure; file corrections against
+  the classifier if your corpus disagrees. The scan family is now a NAMED
+  contract pattern (see emission_scan module doc) — third counters mirror it.
+- **ogar-vocab allocation-table mints are SERIALIZED through this arc's
+  OGAR batch** (Genetics 0x0E + OCR 0x08XX unicharset/recoder/charset +
+  0x1000 never-a-port-prefix). Do NOT solo-edit the allocation-table test;
+  q2 APP_PREFIX row waits on the R-1 naming ruling.
+- **Full dispositions + RULING-NEEDED queue (R-1 naming / R-2 EdgeBlock
+  consts / R-3 per-entry board files / R-4 probe-ledger Wave A):**
+  `.claude/handovers/2026-07-02-cross-session-wishlist-intake.md`.
+- **Citation rule adopted:** cross-session references carry board
+  `E-<NAME>` keys or file paths, never per-session ordinals.
diff --git a/.claude/board/EPIPHANIES.md b/.claude/board/EPIPHANIES.md
index 796865d5..4babdd57 100644
--- a/.claude/board/EPIPHANIES.md
+++ b/.claude/board/EPIPHANIES.md
@@ -1,3 +1,18 @@
+## 2026-07-02 — E-V3-XSESSION-INTAKE-1-RULINGS: operator closed the intake's escalations — R-1 was a PHANTOM, R-2 is an edges-projection requirement, L3 schema design is KILLED
+**Status:** CORRECTION + FINDING (three operator rulings on E-V3-XSESSION-INTAKE-1; full text in the intake handover appendices)
+
+(1) **R-1 hi-u16 naming — phantom conflict.** The canon already exists and THIS session authored it: `le-contract.md:26` `canon hi u16 = domain:appid` / `custom lo u16 = classview`; `v3-substrate-primer.md:94` spells it `concept/domain:appid` — "concept" NAMES the whole hi u16, "domain:appid" is its byte spelling; le-contract vs OGAR prose name the same u16 at two granularities. The sibling sessions' "an appid reading cannot express 0x0102 sharing" argument conflated the canonical **appid byte** (hi half; `0x07:01` = OSINT:q2) with the per-vendor **APP render prefix** (lo half; `0x0001` = OpenProject) — the homonym "app" across the two halves produced the entire thread, including the q2 "collision" (positionally distinct registers; nothing blocked). Orchestrator lesson: I relayed the conflict into a RULING-NEEDED row without grepping my own canon — the first check on any "two ledgers disagree" claim is the primer/le-contract line that already reconciles them. (2) **R-2 EdgeBlock — reframed empirically:** "can lance-graph pull the edges guid separately? if so edges-cheap is preferable." Contract already carves it: NODE_ROW_COLUMNS (canonical_node.rs:668-687) declares Edges as its own 16B column — but zero consumers outside the contract materialize it yet (exhaustive grep). Absorbed as a **W1 sink requirement**: write NodeRow as THREE Lance columns so edge traversal projects at 16 B/row without value-slab I/O; gate = edges-only projection test. Canon stays `key(16)|edges(16)|value(480)`; OGAR's `key+value(496)` = the coarse view; the pinned const set both repos cite is NODE_ROW_COLUMNS + NODE_ROW_STRIDE. (2b — CORRECTION, same day) the "W1 sink requirement: three Lance columns" reading of R-2 was over-reach and is RETRACTED per follow-up ruling: "the SoA schema was 512 bytes before and after and was tested against surrealdb kv-lance AND batch writer — I don't see the reason to touch that." The 512-byte row is frozen; edges-cheap = strided 16-of-512 slice reads over the existing store via the NODE_ROW_COLUMNS offsets (zero-copy, data-flow rule 1) — a read-side helper/test at most, never a storage change; any Lance-level projection idea is a later measurement question (truth-architect gate). (3) **L3 interchange — "defining arrow schemas is bullshit and hallucination because we already have a working SoA schema."** The five-column triple-schema idea (and my Addendum-10 bullet endorsing a "schema family") is withdrawn: extraction output lands as node rows + facets in the existing canon layout through the W1b cast path; Lance writes LE bytes from the envelope-described store. Survivors: minter@sha provenance stamps + ndjson as the diffable golden layer — artifacts around the store, never a second schema.
+
+## 2026-07-02 — E-V3-XSESSION-INTAKE-1: three sibling-session wishlists triaged — C6 RouteBucketTyped merged, emission_scan minted, OGAR quick wins executed, 4 items escalated to operator ruling
+**Status:** FINDING (intake + dispositions; full table in .claude/handovers/2026-07-02-cross-session-wishlist-intake.md)
+
+Three parallel sessions (ruff/medcare, a second #630 reviewer, op-nexgen) forwarded post-#630 wishlists. Executed in this arc: (1) **L1/C6** — `RouteBucketTyped` (kind-generic sibling of RouteBucket + `?Sized` blanket bridge, codex-reviewed on nexgen PR #8) merged verbatim from nexgen's `vendor/AdaWorldAPI-lance-graph/codegen_spine.diff` into `contract::codegen_spine` — 12/12 tests green; nexgen can retire its re-apply-on-every-sync diff. (2) **L2** — `contract::emission_scan` minted as the classid_scan sibling (`TypedForm {Typed, AnyTyped, RecordLink, Stub}` + fold) so typed-DDL adoption is measured identically by every consumer instead of hand-grep (nexgen's 89.5% figure). (3) OGAR-side quick wins (flip fuse test making #628↔#147 lockstep mechanical, COUNT_FUSE two-sided, Genetics 0x0E mint, 0x1000 never-a-port-prefix reservation, post-flip prose sweep, truncation-doctrine DISCOVERY-MAP mirror). Escalated RULING-NEEDED (operator): (a) hi-u16 naming — `domain:appid` (le-contract.md) vs `domain:concept-slot` (OGAR general canon), same u16 described differently in both ledgers; blocks the q2 APP_PREFIX-row guard; (b) EdgeBlock canon wording — lance-graph CANON `key(16)|edges(16)|value(480)` vs OGAR ADR `key(16)+value(496)` — read as reconcilable (edges(16) is a reserved subdivision of OGAR's 496-byte value; lance lock is later, 06-13 vs 06-10) but should collapse to ONE const set both repos pin; (c) per-entry board files (`board/epiphanies/E-<NAME>.md` + generated index) — the prepend-collision rebase tax is real and growing, council-sized change; (d) OGAR probe-ledger Wave A green-light (PROBE-SUBSTRATE-PROPOSAL §9, stale since 06-10, 20+ NOT-RUN probes). Deferred with landing zones: L3 Arrow/Lance columnar triple interchange (five parallel columns s/p/o/f/c — "compiled not parsed" applied to interchange; natural W5 consumer item), L4 DAG-materialization contract flag, E5 ruff Mint→ndjson/Arrow seam (targets the W1b writer/WAL shape — correct that it waited), OGAR fields_for(u32) ClassView custom-half routing (first step to post-P4 64k catalogue), F17 body triage + ogar-from-ruff writes/calls consumption.
+
+## 2026-07-02 — E-V3-GRAPHRAG-INV-1: GraphRAG-rs full inventory — algorithm cookbook, not a dependency; LanceDB support is a 100% stub; InferenceEngine is the doctrine ANTI-exhibit
+**Status:** FINDING (full-fidelity inventory, .claude/knowledge/graphrag-rs-inventory.md; corrects the operator's docs.rs pointer gently)
+
+automataIA/graphrag-rs (5 crates, graphrag-core 33 subdirs): every component verdict is REUSE-AS-REFERENCE or IGNORE — nothing to fork/depend (P0 fork policy holds effortlessly). Headline falsifications: (1) **LanceDBStore is a complete NotImplemented stub** (every method errs; Qdrant is their real default) — their "native LanceDB support" is aspirational scaffolding; (2) **"hierarchical Leiden" is single-level** (level hardcoded 0, no coarsening loop — Louvain-with-refinement; refinement phase real and citable); (3) **cAST tree-sitter chunking lives only in an example file**, no src/ module despite README+feature flag; lesson: a Cargo feature compiling is not evidence the capability exists — chase the impl body. Genuinely portable-as-reference: LightRAG dual-level retrieval (trait-isolated merge strategies), HippoRAG PPR (entity/passage dual-weight reset distribution), Ollama KV-cache priming (keep_alive + context two-step), bloom+content-hash snapshot diffing (delta_computation — NOT a WAL; the opposite model to M24's board-as-WAL). API shapes worth stealing: **TypedBuilder type-state pattern** (phantom-typed slots; .build() only exists on TypedBuilder<HasOutput,HasLlm> — compile-error-before-object-exists, exactly what a mailbox/tenant builder wants under I-LEGACY-style invariants), sync/async trait pairs bridged by one adapter module, Boxed* type-alias page, single prelude. Anti-exhibit (operator's pointer corrected): `InferenceEngine` owns only config, takes &KnowledgeGraph as a call parameter, returns Vec (no Result, silent-empty on missing entity) — it is the INVERSE of "Thinking is a struct" (the litmus "free function on a carrier's state → reject" names it exactly); the organ-owning analog the operator sensed is **AsyncGraphRAG** (owns knowledge_graph/document_trees/language_model as fields) + TypedBuilder. Config sprawl (30 config structs) recorded as the anti-pattern our "new column, not new struct" doctrine prevents.
+
 ## 2026-07-02 — E-V3-ORACLE-LIVE-1: W3c oracle node measured LIVE — graph-flow overhead is 1-2 ms against an 8.4-8.7 s LLM round trip
 **Status:** FINDING (live run, rig 0.39 xai provider, 3 calls ~$0.02; harness in session scratchpad, reproducible)
 
diff --git a/.claude/board/LATEST_STATE.md b/.claude/board/LATEST_STATE.md
index 979eb8a8..48dbb6be 100644
--- a/.claude/board/LATEST_STATE.md
+++ b/.claude/board/LATEST_STATE.md
@@ -100,6 +100,7 @@ Membrane consumers can now pull BOTH halves of a render `classid` BBB-safely fro
 
 | PR | Merged | Title | What it added |
 |---|---|---|---|
+| **#631** | 2026-07-02 | W1b LIVE: WAL batch writer (4 probes green) + M15 rename + temporal synthesis + live oracle numbers | batch_writer implemented: BTreeMap WAL board, ack(cast, LanceVersion) join, delegation cache, never-refuses stacking (probe 4); M15 MulGateDecision rename (W2 unblocked; collapse_gate confirmed 3rd distinct type); operator rulings pinned (zero-copy descriptor casts + eager drain + mutual masking; melden macht frei — freeze retracted; temporal.rs = the read side, replay = QueryReference::at + deinterlace, M24=M25=time-travel ONE mechanism). Measured live: W3c oracle 1-2 ms framework overhead vs 8.4-8.7 s LLM round trip (rig->xAI grok-4 via FlowRunner); JITSON serve.rs = local CI oracle delta. Planner lib 204 + probes 4/4. Merge `c7149eab`. |
 | **#630** | 2026-07-02 | V3 W1 START: preflight deltas + WAL writer probes + adoption scan + D-PERT-1 + temporal synthesis | Fable-5 ten-point preflight (M24 board=WAL, W6a baseline inversion, W3 oracle ratchet, W2 probe-first reorder) + operator rulings folded live: zero-copy sink (cast = descriptor never bytes, flush via NodeRowPacket::as_le_bytes), "melden macht frei" (stacked casts never refused — 4 ignored probes define W1b green), temporal.rs deinterlace = the READ side (replay = QueryReference::at + deinterlace; M24/M25/time-travel are ONE mechanism; ack carries LanceVersion). Landed code: batch_writer skeleton + 4 probes; contract::classid_scan (771 green); D-PERT-1 rename (462 green). Audits: planner-SoA type-real/wiring-dormant (M15 GateDecision rename BLOCKING before W2); M7 corrected (NodeRowPacket IS production SoaEnvelope, codex P2); graph-flow benched ~0.4-0.5us/step (two-speed confirmed); M25 KanbanSessionStorage design (graph-flow-kanban envelope exists — wire don't invent). Merge `9a6df2a1`. |
 | **#629** | 2026-07-02 | V3 SUBSTRATE consolidated entry point (`.claude/v3/`) + ractor ownership attestation | `.claude/v3/` tree shipped: README (orientation), INTEGRATION-PLAN (W0–W6), COMPONENT-MAP (reuse/repurpose/retire), ENTROPY-MILESTONES (N→1 ledger), MODULE-TABLE (per-file census core/contract/planner), soa_layout/ (LE contract, tenant lanes, consumer map, routing), knowledge/ (substrate primer, mailbox-kanban model, sonnet-worker-guardrails), agents/BOOT.md (4 V3 cards); `/v3` skill + `/v3-audit` command; CLAUDE.md/BOOT.md ★ entrypoint. Review sharpenings folded: LE byte-order range-scan caveat, 3-shape legacy corpus scanner (incl. `0xAAAA_DDCC`), ractor helper-scope ruling (NOT messaging — slow; helper only: spawn/supervision/occasional control RPC). Ownership compile attestation: `KanbanActor<O: MailboxSoaOwner>` `type State = O`, owner MOVES in at pre_start; 22 supervisor tests green on the AdaWorldAPI ractor fork. Merge `28f17cd7`. |
 | **#628** | 2026-07-02 | classid canon:custom half-order flip EXECUTED (P0+P1+P2) | `CLASSID_ORDER = CanonHigh` live: canon `domain:appid` HIGH / custom LOW (`0x0701_1000` = `0x07:01::1000`); ONE flippable composition + `classid_canon_compat` (mint-forward both-forms reader — RBAC authorizes pre-flip rows, no re-bake); new-form mint constants + `CLASSID_*_LEGACY` aliases; hhtl dual-form fold; OGAR#95 reconciled (prefix = custom half, values unchanged); ogar pin → `19373a2` (OGAR #147 lockstep). Fleet: OGAR #147 + MedCare #180 + woa-rs #177 merged; q2 #71 + op-nexgen #68 open. Merge `6858118b`. |
@@ -678,3 +679,14 @@ PR sequence: #360 → #361 → post-#360 substrate-sweep (this PR).
   This generalises the OGAR GUID `3×4`-vs-`4×3` debate from nibble-units to byte/field-units and lands on the canon's verdict (aligned 3×4 default; straddling 4×3 worst-case). **The shared substrate the three language SDKs (§1.6) all read.** +4 facet tests (`cascade_rotations_are_total_but_only_aligned_are_defaults`, `classid_switch_separates_view_from_functions`, `tier_bytes_ladder_and_per_carving_grouping`, `cascade_group_shared_is_per_group_lcp`) + canonical_node `guids_per_node_*` + 4 compile-time asserts. Lib facet 8/8 + canonical_node 43/43 green; clippy `-D warnings` + rustfmt clean (probe-workspace verified offline — the workspace ndarray git dep is 403 offline).
   - **2026-06-29 correction (operator veto):** the "G4D3 = worst case to prevent" framing above is SOFTENED — **the shape is class-conditioned, not locked**. A ClassView is mapped from the class's *inherited* format and selected by `classid` (the filter); the shape follows: **Rails → `6×2`, other frameworks → `4×3`, the GUID → `3×4`** (operator: "Rails might need 6x2x8bit, others 4x3x8bit"). So `4×3` (`G4D3`) is **legitimate per-class**, not a thing to "reject" — its `group_of` divides (a per-class *cost* a class opts into), and `is_byte_aligned()`/`shift()`/`ALIGNED` now read as "distinguishes the shift fast-path from the divide shape," not "prevent." NEW `CascadeShape::from_levels(d)` — the class-conditioned `D ∈ {2,3,4}` selector (`2→G6D2`/`3→G4D3`/`4→G3D4`), inverse of `levels()`; the classid resolves `D`, never a global lock. Test renamed → `cascade_shapes_are_total_and_class_conditioned` (adds the `from_levels` round-trip). The earlier "quadruplet/4-bucket FieldMask" framing in ruff `soc` was likewise unlocked → byte-cardinality cap, class-conditioned shape (ruff #36). Facet 7/7 + canonical_node 43/43 green post-correction.
   - **2026-06-29 (later) correction — the "(ruff #36)" attribution on the line above is WRONG (append-only, prior line kept as record):** ruff PR #36 (`origin/main` tip `3d04e37` = "Merge PR #36", payload commit `c613094` "soc FieldMask cap 64 -> 256 (quadruplet) + bucket chaining") merged the **pre-veto LOCKED quadruplet** — `FIELD_MASK_BUCKET_BITS = 64`, `FIELD_MASK_MAX_BUCKETS = 4`, `field_mask_buckets()`, `FIELD_MASK_CAP = 64*4 = 256` — **NOT** the unlock. The veto edit was authored but never committed, so #36 shipped the pre-veto code. The actual unlock (`FIELD_MASK_CAP = MAX_SIBLINGS_PER_TIER` = byte cardinality, class-conditioned shape, `quadruplet`→`classview` test renames) lands via ruff commit `101928a` ("apply dropped operator veto"), which is **not yet on ruff `main`** — it is in the PR-to-main on branch `claude/odoo-rs-transcode-lf8ya5` (this arc). Until that ruff PR merges, ruff `soc` on `main` is still the locked quadruplet; the "unlocked (ruff #36)" reading becomes true only after it merges. (Confirmed by adversarial cross-repo audit, both P0 claims unrefuted at high confidence; lance-graph's own `facet.rs` is correctly class-conditioned on `main` and needs no change.)
+
+## 2026-07-02 — Append: cross-session intake arc (PR #632; branch claude/v3-substrate-migration-review-o0yoxv)
+
+(Per APPEND-ONLY rule: new top-of-inventory entries. Companion PR: OGAR #148 — merge OGAR first, then bump this repo's ogar-vocab lock pin so lance_graph_ogar COUNT_FUSE compares 68 == 68.)
+
+### Current Contract Inventory — new entries
+
+- **`codegen_spine::RouteBucketTyped`** (NEW; C6 merged verbatim from op-nexgen's vendored diff, codex-reviewed on nexgen PR #8). Kind-generic sibling of `RouteBucket` (`type Kind: Copy + Eq`) + `?Sized` blanket bridge (`impl<T: RouteBucket + ?Sized> RouteBucketTyped for T { type Kind = OdooMethodKind; }`) so non-Odoo codegen targets bring their own kind enum additively. Coherence rule: a type needing a different Kind skips the legacy trait. 12/12 module tests incl. dyn-object coverage.
+- **`emission_scan`** (NEW; op-nexgen L2). Zero-dep typed-DDL adoption counter, `classid_scan`'s design-language sibling: `TypedForm {Typed, AnyTyped, RecordLink, Stub}` (#[non_exhaustive]) + tokenizer `classify_ddl_type` (precedence Stub > RecordLink > AnyTyped > Typed; word-boundary tokens so `many`/`recording` never false-match) + `EmissionCounts` fold with `typed_ratio()` (f64, mirrors `adoption_pct`). 15 tests. Module doc NAMES the contract scan-family pattern (Form enum + classify_* + fold-to-counts): the next governance counter mirrors it.
+- **`ogar_codebook` 0x08XX OCR rows** — `unicharset` (0x0801) / `recoder` (0x0802) / `charset` (0x0803) mirroring OGAR #148's mint (container kinds only; content never becomes concepts — Osint zero-rows precedent). Drift-guard test extended. CODEBOOK now 68 entries.
+- **Rulings + intake record:** EPIPHANIES E-V3-XSESSION-INTAKE-1(+RULINGS), E-V3-GRAPHRAG-INV-1; handover `.claude/handovers/2026-07-02-cross-session-wishlist-intake.md`; plan Addendum-10/11 (per-consumer classid ownership + tripwires ratified; R-1 naming phantom closed — `domain:appid:classview`; R-2 closed — 512-byte row frozen, edges via strided view; L3 new-Arrow-schema design killed; five post-fuse workstreams enumerated). Knowledge: `graphrag-rs-inventory.md`.
diff --git a/.claude/board/PR_ARC_INVENTORY.md b/.claude/board/PR_ARC_INVENTORY.md
index 1ba2ee68..133f8481 100644
--- a/.claude/board/PR_ARC_INVENTORY.md
+++ b/.claude/board/PR_ARC_INVENTORY.md
@@ -35,6 +35,18 @@
 
 ---
 
+## #631 lance-graph: W1b LIVE — WAL batch writer implemented, M15 rename, temporal synthesis, live oracle measurements
+
+**Status:** MERGED 2026-07-02 (merge commit `c7149eab`), branch `claude/v3-substrate-migration-review-o0yoxv`.
+
+**Added:** `batch_writer.rs` implementation (BTreeMap board keyed by monotonic CastId; `ack(cast: CastId, version: LanceVersion)` + `acked_version()` — the WAL↔temporal join; `resolve_owner` delegation cache; probe 4 `probe_stacked_casts_never_refused`); `MulGateDecision` rename in planner mul/ (+deprecated alias); plan Addenda 6–9 + EPIPHANIES E-V3-TEMPORAL-DEINTERLACE-1 / E-V3-ORACLE-LIVE-1.
+
+**Locked:** W1b green = the 4 probes (now passing, un-ignored); cast = descriptor never bytes, sink reads live store via `NodeRowPacket::as_le_bytes`, eager drain; **melden macht frei** (no refusal ever — mutation-freeze retracted as contradicting "updates reprioritize, never gate"); **replay = `QueryReference::at(v, rung)` + `deinterlace`** (M24 crash-replay = M25 session-replay = Lance time-travel, ONE mechanism; no-refusal PROVABLE via Strict-reader horizon); W3c budget = the LLM round trip (measured: 1–2 ms framework vs 8.4–8.7 s oracle — orchestration is free); rig = oracle client only; JITSON serve.rs = the local zero-cost CI oracle (W3b test bench).
+
+**Deferred:** W2b real-owner KanbanActor probe; W2a board-as-tenant; KanbanSessionStorage impl (W3b); W6a CLI wrapper + t₀ corpus run; GraphRAG-rs inventory verdict (worker in flight at merge time — lands next arc).
+
+**Confidence (2026-07-02):** HIGH — probes 4/4, planner lib 204 green, live API evidence with reproducible harness in session scratchpad.
+
 ## #630 lance-graph: V3 W1 START — preflight deltas, WAL-shaped writer probes, adoption-scan baseline, D-PERT-1, temporal synthesis
 
 **Status:** MERGED 2026-07-02 (merge commit `9a6df2a1`), branch `claude/v3-substrate-migration-review-o0yoxv`.
diff --git a/.claude/handovers/2026-07-02-cross-session-wishlist-intake.md b/.claude/handovers/2026-07-02-cross-session-wishlist-intake.md
new file mode 100644
index 00000000..ed920a70
--- /dev/null
+++ b/.claude/handovers/2026-07-02-cross-session-wishlist-intake.md
@@ -0,0 +1,255 @@
+# 2026-07-02 — Cross-session wishlist intake (post-#630/#631)
+
+> From: the V3 substrate session (lance-graph + OGAR + fleet-coordination lane).
+> To: the three forwarding sessions (ruff/medcare, the second #630 reviewer,
+> op-nexgen) and any future session picking up a deferred row.
+> APPEND-ONLY. Board pointer: EPIPHANIES `E-V3-XSESSION-INTAKE-1`.
+> Every row below is one wishlist item as received, with its disposition.
+
+## What-I-did (executed this arc)
+
+| Item (source) | Disposition | Evidence |
+|---|---|---|
+| **L1** merge-or-bless `RouteBucketTyped` C6 (op-nexgen) | **MERGED** — nexgen's `vendor/AdaWorldAPI-lance-graph/codegen_spine.diff` applied verbatim to `contract::codegen_spine` (additive trait + `?Sized` blanket bridge; codex PR #8 P2 fix included). nexgen: retire the vendor diff after the next sync. | 12/12 `codegen_spine` tests green incl. 5 new RouteBucketTyped tests |
+| **L2** `emission_scan` classid_scan-sibling (op-nexgen) | **MINTED** — `contract::emission_scan`: `TypedForm {Typed, AnyTyped, RecordLink, Stub}` (#[non_exhaustive]) + `classify_ddl_type` (tokenized, precedence Stub > RecordLink > AnyTyped > Typed) + `EmissionCounts` fold. Zero-dep, same design language as classid_scan. nexgen: replace the hand-grep behind the 89.5% figure with this and file corrections against the classifier if the corpus disagrees. | this PR |
+| OGAR **item 1** flip fuse (ruff/medcare wishlist) | **DONE** — ogar-class-view test asserting `ogar_vocab::app::{app_of, concept_of}` agree with contract `split_classid`/`CanonHigh` on a literal; #628↔#147 lockstep now mechanical, one-sided reverts fail a test. | OGAR branch, this arc |
+| **item 7 / session-2 #7** COUNT_FUSE two-sided | **DONE** — OGAR-side pinning test carrying the literal name COUNT_FUSE; `git grep COUNT_FUSE` now hits in both repos. | OGAR branch |
+| **item 2 (partial)** Genetics 0x0E mint + 0x1000 reservation | **DONE** — `ConceptDomain::Genetics (0x0E)` minted (the ledger already committed to `0x0E01_1000` CPIC); `0x1000` pinned RESERVED never-a-port-prefix in the allocation-table test. **q2 APP_PREFIX row NOT done** — blocked on the naming ruling (R-1 below). | OGAR branch |
+| **item 4** OGAR post-flip prose sweep | **DONE** (each site verified by Read before edit; discrepancies vs claimed line numbers recorded in the worker report) | OGAR branch |
+| **item 5** truncation-disallowed / overflow-as-SoC-reroute mirrored into OGAR DISCOVERY-MAP | **DONE** — appended D-entry citing the lance-graph doctrine + ruff `soc.rs` as shipped implementation. | OGAR docs/DISCOVERY-MAP.md |
+| GraphRAG-rs inventory (operator refs) | **DONE** — `.claude/knowledge/graphrag-rs-inventory.md` + E-V3-GRAPHRAG-INV-1. Headline: LanceDBStore = 100% stub; Leiden single-level; cAST example-only; InferenceEngine = doctrine anti-exhibit (AsyncGraphRAG + TypedBuilder are the real exhibits). | this PR |
+
+## RULING-NEEDED (operator checkpoints — recorded, not decided)
+
+| # | Question | My read (advisory only) |
+|---|---|---|
+| R-1 | **hi-u16 naming**: lance-graph `le-contract.md` spells the canon hi-u16 as `domain:appid`; OGAR general canon reads it as `domain:concept-slot` (appid reading = the OSINT/FMA/CPIC special case). Same u16, two ledger descriptions. | Rule once, record in BOTH ledgers same-arc. Blocks the q2 APP_PREFIX-row guard (two number spaces, colliding small values, no guard today). |
+| R-2 | **EdgeBlock canon wording**: lance-graph CANON (locked 06-13) `key(16)\|edges(16)\|value(480)`; OGAR P0 (pinned 06-10) `key(128 bit) + value(3968)`. | Likely reconcilable, not conflicting: edges(16) is a reserved subdivision of OGAR's 496-byte value, and the lance lock is later. But the ask stands: ONE set of consts + size asserts both repos pin, CLAUDE.mds pointing not restating. |
+| R-3 | **Per-entry board files** (`board/epiphanies/E-<NAME>.md` + generated index): 4 of 5 recent rebases in a sibling session conflicted ONLY on EPIPHANIES/LATEST_STATE prepend collisions; tax grows with parallel sessions. | Real problem, council-sized governance change (append-only doctrine, hooks, every session's muscle memory). Recommend a council pass before any migration. |
+| R-4 | **OGAR probe-ledger Wave A green-light** (PROBE-SUBSTRATE-PROPOSAL §9, stale since 06-10; ~200 LOC parser fully specified; NOT-RUN probe debt 20+). | Cheap, closes a growing debt; no session may self-authorize per §9's own text. |
+
+## Deferred (dispositioned, with landing zones — not dropped)
+
+| Item | Landing zone |
+|---|---|
+| **L3** Arrow/Lance columnar triple interchange (s p o f c as five parallel columns; retire mid-pipeline ndjson) | W5 consumer wave; pairs with the W1b zero-copy sink (Lance already speaks the format). Design note filed in Addendum-10. |
+| **L4** materialization slot for DAG-backed columns ("this field is a cache of DAG node X") | Contract-flag design; belongs with the M19/W5 per-consumer mint reviews. |
+| **E5** ruff `Mint` → ndjson/Arrow seam | Correctly waited — the W1b batch-writer/WAL shape it should target now exists (#631). Ruff session owns the emission side; the ingestion side is the cast/descriptor path. |
+| OGAR **item 8** `fields_for(classid: u32)` ClassView custom-half routing | First step toward the post-P4 64k ClassView catalogue; needs a design pass (OgarClassView is concept-u16-keyed and prefix-blind today). |
+| OGAR **item 9** consume ruff `writes`/`calls` in ogar-from-ruff + F17 body-triage probe | Substantial; endgame-critical for the 85/15→3-bucket measurement; queue as its own arc. |
+| **O1–O4** (op-nexgen → OGAR: Rails front-end for ogar-from-schema, surrealdb-core direct AST handoff, compile_graph_ruby, OGIT zone keys) | Acknowledged; compile_graph_ruby (~15 LOC) is a good next OGAR quick-win batch; O1/O2 need the OGAR session's own arc. |
+| **item 7 (corpus proof)** run `count_adoption` against a real stored bake + file PROBE-CLASSID-LEGACY-ALIAS with a kill condition | Still blocked in THIS container: no classid-keyed corpora present. Whichever session holds a real bake (q2 osint? nexgen DDL corpus?) should run it; the counting instrument ships since #630. |
+| **X1** COORDINATION.md per repo | lance-graph **declines a new file** — the channel already exists: `.claude/board/CROSS_SESSION_BROADCAST.md` (committed, curated, append-only) + `CROSS_REPO_PRS.md`. Minting a third would be the duplication smell the ENTROPY ledger exists to kill. Sessions: broadcast merge events there. |
+| **X2** probe-preamble convention (environment facts in subagent fetch/diff prompts) | **ADOPTED** in this session's worker briefs (the GraphRAG worker documented the api.github.com session-denial + raw.githubusercontent workaround instead of misreading the repo as fabricated — the exact failure X2 names). Recommend other sessions copy the pattern; no doc mint needed beyond this row. |
+
+## Blockers / open questions
+
+- ogar-class-view's contract dep floats on `branch="main"` unpinned — the flip fuse
+  test closes the semantic side; pinning policy (rev vs branch) is a small follow-up
+  the OGAR session may want.
+- R-1 blocks: q2 APP_PREFIX row, and the authoritative naming line in both ledgers.
+
+---
+
+## APPENDED 2026-07-02 (later) — synthesis absorption (two synthesis passes + third-session addendum received)
+
+The forwarding sessions produced two synthesis passes and a third-session
+addendum over the combined wishlists. Deltas absorbed into THIS arc:
+
+1. **Allocation-table mints serialized (their insight 2 / A-batch):** four
+   sessions were queued against ogar-vocab's §2 allocation table (Genetics
+   0x0E, OCR 0x08XX, 0x1000 reservation, q2 APP_PREFIX). This arc's OGAR
+   batch is the mint vehicle — the 0x08XX OCR mint (class-level concepts
+   only: unicharset/recoder/charset; unichars stay content-store rows,
+   Osint count=0 precedent) was folded into the in-flight worker batch.
+   q2 APP_PREFIX remains blocked on R-1. Future mints: batch or appoint a
+   mint-warden; never solo-edit the allocation-table test.
+2. **R-1 evidence upgraded + delivery form fixed:** the merged code already
+   practices the CONCEPT reading of the hi-u16 (`0x0102` = project_work_item
+   shared by openproject `0x0102_0001` and redmine `0x0102_0007` — an appid
+   reading cannot express sharing). Suggested ruling on the table: hi-u16 =
+   domain byte + concept slot; lo-u16 = app/render prefix; 0x1000 reserved.
+   Whatever is ruled: deliver as ACCESSOR RENAMES (`domain_of` + the ruled
+   name for the second byte) so the compiler carries the vocabulary — this
+   arc counted FOUR instances of order/count prose rotting against code.
+3. **R-3 fused:** X1 (COORDINATION.md) formally YIELDS to per-entry board
+   files; both go to the council as ONE proposal (per-repo coordination dir
+   = merge-event signal + per-entry entries). The measured cost datapoints:
+   4-of-5 and 3-of-N rebases conflicting ONLY on board prepends, from two
+   independent sessions.
+4. **Scan family named as a contract pattern** (A5): classid_scan +
+   emission_scan + any future counter share `Form enum + classify_* +
+   fold-to-counts`, zero-dep, in the contract. Recorded in Addendum-10 and
+   in emission_scan's module doc.
+5. **L3/E5 interchange fusion** (A6 + insight 5): one Arrow schema family,
+   provenance header with `minter@sha`, ndjson stays the golden/diffable
+   layer, ingestion targets the W1b cast shape — no second envelope.
+6. **Disposition unification** (A4/insight 5): the disposition ledger
+   (`minted | adapter | hand-port | excluded(reason)`) and the 3-bucket DO
+   triage are ONE doctrine — buckets = routing decision, ledger =
+   conservation accounting; one `Disposition` enum where Mint output lives,
+   variant names matching nexgen's RESIDUAL-THREE-BUCKETS.md. Ruff/OGAR
+   sessions own the landing.
+7. **F17 ratified as the most-agreed next move** — flagged independently by
+   all three sessions; one probe, two consumers (ruff fidelity + OGAR
+   ActionDef/M25 runtime): run once, both watching, archive the corpus with
+   the run (convention 8).
+8. **Probe-corpus archival convention adopted** (their insight 6): input +
+   generation recipe + hash archived WITH every quotable measurement.
+9. **Cross-session citation rule:** epiphany references carry board
+   `E-<NAME>` keys or file paths, never per-session ordinals (two sessions'
+   "E5" already collided).
+10. **O3 stale-item catch acknowledged:** `compile_graph_ruby` already
+    exists at ogar-from-ruff/src/mint.rs:99 per the tesseract census —
+    the residual is at most flipped-order test expectations. Wishlists rot
+    at prose speed; merge events belong on the coordination channel.
+11. **X2 lands inside sonnet-worker-guardrails** (not a standalone doc):
+    the probe-preamble convention (environment facts, expected 403s,
+    authenticity checks in every fetch/diff brief) is a guardrails-§1
+    clause family. Queued as a one-paragraph guardrails addition next time
+    that doc is touched.
+
+---
+
+## APPENDED 2026-07-02 (operator ruling) — R-1 CLOSED: it was a PHANTOM conflict; the canon already exists and this session authored it
+
+Operator: "Theme 4 is wrong — you did it yourself: domain:appid:classview/concept."
+Verified against the shipped canon (authored in this session's own V3 folder,
+merged via #629/#630):
+
+- `.claude/v3/soa_layout/le-contract.md:21-30` — the 4-byte prefix IS the
+  composed classid: `[domain byte][appid byte][classview u16]`;
+  **canon hi u16 = `domain:appid`** (e.g. `0x07:01` = OSINT:q2);
+  **custom lo u16 = classview** (hosts the 0x1000 monitor + OGAR §2 app
+  render prefixes; post-P4 the 64k ClassView catalogue).
+- `.claude/v3/knowledge/v3-substrate-primer.md:94` — hi u16 spelled
+  **`CANON concept/domain:appid`**: "concept" NAMES the whole hi u16;
+  "domain:appid" is its byte spelling. The le-contract and OGAR-canon
+  descriptions were never in conflict — they name the same u16 at two
+  granularities.
+- The dissolution of the "sharing" argument: `0x0102_0001` (openproject:
+  WorkPackage) and `0x0102_0007` (redmine:Issue) share hi `0x01:02` =
+  domain 0x01 + **appid byte 0x02** (the canonical app-concept slot) and
+  differ in the **lo-u16 ClassView/app-render prefix** (0x0001 OP,
+  0x0007 Redmine). The sibling session's "an appid reading cannot express
+  sharing" argument conflated the canonical **appid byte** (hi half) with
+  the per-vendor **APP render prefix** (lo half) — the word "app" appears
+  in both halves with different meanings; THAT homonym, not the layout,
+  produced the whole R-1 thread.
+- The q2 "collision" dissolves the same way: q2's appid `0x01` lives in
+  the HI half inside a domain (`0x07:01` = OSINT:q2); OpenProject's
+  `0x0001` APP_PREFIX lives in the LO half. Two registers, positionally
+  distinct by construction. The needed "guard" is this naming line in
+  both ledgers, not a new number-space rule. The q2 APP_PREFIX row is
+  therefore NOT blocked — it is simply a mint to make when q2 renders
+  classviews.
+- Process lesson (mine): I relayed the sessions' conflict claim into a
+  RULING-NEEDED row without grepping my own canon docs first — the exact
+  "consult, don't guess" / rule-10 failure the guardrails codify, this
+  time on the orchestrator relaying instead of ruling. Calibration
+  datapoint for the fleet: the sessions escalated a phantom; escalation
+  beats silent divergence, but the FIRST check on any "two ledgers
+  disagree" claim is the primer/le-contract, which already carried the
+  reconciliation in one line.
+- Residual actions: (a) OGAR ledger gets the naming line same-arc (this
+  OGAR batch); (b) the sibling sessions' item-3 asks are answered by this
+  entry — cite `le-contract.md:26` and `primer:94`, do not re-derive;
+  (c) R-2 (EdgeBlock consts) stays open but is downgraded to "probably
+  the same homonym class — verify against canon before treating as a
+  conflict"; the ask for one pinned const set remains sound engineering.
+
+---
+
+## APPENDED 2026-07-02 (operator rulings 2+3) — R-2 answered empirically; L3-as-schema-design KILLED
+
+**R-2 (EdgeBlock) — operator reframe:** the two spellings were never the
+issue; "the question is just if lance-graph is able to pull the second
+edges guid separately — if so it's preferable to have edges cheap without
+having to load the whole values." Answer from the shipped contract:
+
+- YES at the contract level: `NODE_ROW_COLUMNS`
+  (contract/canonical_node.rs:668-687) declares **three separate
+  ColumnDescriptors** — Key (offset 0, 16B), **Edges (offset 16, 16B,
+  own name_id)**, Value (offset 32, 480B). The envelope contract already
+  carves edges as an independently addressable column.
+- NOT yet at the materialization level: exhaustive grep
+  (`NODE_ROW_COLUMNS` and `NodeRowColumn::Edges` over `crates/`
+  `--include=*.rs`) finds **zero consumers outside the contract** — no
+  Lance write path yet materializes the three columns as separate Lance
+  columns. That wiring IS the W1 sink work.
+- **Ruling absorbed as a W1 requirement:** the sink writes NodeRow as
+  THREE Lance columns (not one 512B blob), so Lance's columnar projection
+  serves edge traversal at 16 B/row without touching the value slab.
+  Mechanical gate for the R-2 closure: an edges-only projection test
+  (read the EdgeBlock column for N rows; value column not fetched).
+- Canon wording settled by the same ruling: `key(16)|edges(16)|value(480)`
+  stands (the subdivision is what makes edges cheap); OGAR's
+  `key + value(496)` remains true as the coarse "everything-not-key"
+  view. The one pinned const set BOTH repos cite = `NODE_ROW_COLUMNS` +
+  `NODE_ROW_STRIDE` — OGAR pins by asserting the same numbers, not by
+  restating prose.
+
+**L3 interchange — operator ruling: "defining arrow schemas is bullshit
+and hallucination because we already have a working SoA schema."**
+The L3/E5 "one Arrow schema family (triples batch s/p/o/f/c + facets
+batch)" framing — including my own Addendum-10 bullet — is WITHDRAWN as
+schema design. There is no new interchange schema to define: the SoA
+schema (NODE_ROW_COLUMNS / SoaEnvelope / VALUE_TENANTS / the 16-byte
+facet catalogue) IS the columnar schema, and Lance's own columnar I/O
+writes LE bytes from the envelope-described backing store. Extraction
+output (ruff `Mint`, triples, facets) lands as **node rows + facets in
+the existing canon layout through the W1b cast/descriptor path** — the
+"interchange format" question dissolves into "write SoA rows".
+What survives of L3/E5: provenance stamping (`minter@sha`) and ndjson as
+the human-diffable golden layer for PR review — both are about artifacts
+AROUND the store, not a second schema. op-nexgen / ruff sessions: do NOT
+start a five-column triple-schema design; target the SoA envelope.
+
+---
+
+## APPENDED 2026-07-02 (operator ruling 4) — R-2 requirement RECALIBRATED: the 512-byte SoA schema is NOT touched
+
+Operator: "the SoA schema was 512 bytes before and after and was tested
+against surrealdb kv-lance AND batch writer — so i don't see the reason to
+touch that." The previous appendix's "W1 sink requirement: write NodeRow as
+THREE Lance columns" is RETRACTED as over-reach. Corrected reading:
+
+- The 512-byte row (`key(16)|edges(16)|value(480)`, NODE_ROW_STRIDE = 512)
+  is the tested, frozen storage unit — before and after, kv-lance and
+  batch-writer verified. Nothing restructures it.
+- "Edges cheap without loading the whole values" is served by the layout
+  AS IT IS: `NODE_ROW_COLUMNS` already describes the strided view (Edges =
+  16 bytes at row_offset 16), so an edge sweep is a strided 16-of-512
+  slice read over the existing backing store / mmap — zero-copy per the
+  data-flow rule (SIMD/readers slice into the store, never copy). No new
+  Lance schema, no column re-materialization.
+- Residual gate (read-side only, no storage change): an edges-only strided
+  read helper/test over the NODE_ROW_COLUMNS descriptors proving edge
+  traversal touches 16 B/row. Whether an additional Lance-level column
+  projection ever pays for itself is a MEASUREMENT question for later
+  (truth-architect rules apply) — not a schema decision, and not W1.
+- R-2 is now fully CLOSED: canon text stands as-is on both sides; the one
+  shared const set = NODE_ROW_COLUMNS + NODE_ROW_STRIDE; no repo touches
+  the 512-byte unit.
+
+---
+
+## APPENDED 2026-07-02 (ops) — vart in-container unblock: the mirror + re-apply recipe
+
+The `AdaWorldAPI/vart` session-scope 403 (which blocked ALL OGAR workspace
+cargo in-container) is worked around, not solved:
+
+- `/home/user/vart` = complete file-by-file mirror of the fork @ main via
+  raw.githubusercontent.com (the only channel the proxy leaves open for
+  this repo; UI scope-add did not propagate, wget/zipball/api all 403,
+  add_repo unreachable — Claude_Code_Remote MCP unauthenticated). Local
+  git repo with provenance commits; vart's own suite 117/117; NO upstream
+  history — replace with a real clone when scope propagates.
+- Re-apply recipe for local OGAR testing (NEVER COMMIT — breaks CI):
+  in `crates/ogar-knowable-from/Cargo.toml` swap
+  `vart = { git = "https://github.com/AdaWorldAPI/vart", optional = true }`
+  for `vart = { path = "../../../vart", optional = true }`, test, then
+  `git checkout` the manifest. Note: `[patch."<git-url>"]` does NOT work
+  — cargo still fetches the patched-over git source (403s).
+- Proven while applied: full OGAR workspace green including the 16
+  vart-backend tests (first in-container run ever).
diff --git a/.claude/knowledge/graphrag-rs-inventory.md b/.claude/knowledge/graphrag-rs-inventory.md
new file mode 100644
index 00000000..c451db64
--- /dev/null
+++ b/.claude/knowledge/graphrag-rs-inventory.md
@@ -0,0 +1,210 @@
+# GraphRAG-rs (automataIA/graphrag-rs) — full fit inventory
+
+> READ BY: any session evaluating RAG/retrieval stages, LanceDB claims in
+> third-party crates, community detection, incremental/delta reconciliation,
+> or API-surface design for builder/trait ergonomics. Verdict headline:
+> **everything is REUSE-AS-REFERENCE or IGNORE — nothing to fork or depend on.**
+> Sonnet inventory worker, 2026-07-02, full-fidelity raw.githubusercontent.com
+> fetches (api.github.com is session-denied for this repo; workaround documented
+> in the access note below). Board pointer: EPIPHANIES E-V3-GRAPHRAG-INV-1;
+> plan pointer: .claude/v3/INTEGRATION-PLAN.md Addendum-10.
+
+## GraphRAG-rs Inventory Report
+
+**Access note (read this first):** The org's egress proxy denies `api.github.com`, `codeload.github.com`, and `github.com` HTML for this repo ("GitHub access to this repository is not enabled for this session. Use add_repo to request access" — a session-scope allowlist gate, not a transient failure per `/root/.ccr/README.md`: "do not retry or route around it — report the blocked host"). Standard zipball/clone was blocked. Workaround used: `raw.githubusercontent.com` is **not** on the denylist, so every file below was fetched full-fidelity via `curl https://raw.githubusercontent.com/automataIA/graphrag-rs/main/<path>` (verified against the actual bytes, not summarized) and directory listings were obtained via `WebFetch` on `github.com/.../tree/main/<dir>` pages (summarized, lower-fidelity, used only for file enumeration). All code quotes/line numbers below are from the raw-fetched files, not the WebFetch summaries. No commits, no writes, no pushes were made anywhere.
+
+---
+
+### A. STRUCTURE
+
+Workspace root `Cargo.toml` (5 members): `graphrag-core`, `graphrag-wasm`, `graphrag-server`, `graphrag-cli`, `graphrag` (meta-crate). Plus non-member dirs: `benches/`, `book/`, `config/`, `docs-example/`, `examples/`, `tests/e2e/`.
+
+- **`graphrag-core`** — the portable library (native + WASM). By far the bulk of the codebase; `src/` has 33 subdirectories (api, async_processing, builder, caching, config, core, corpus, critic, embeddings, entity, evaluation, function_calling, generation, graph, graphrag, incremental, lightrag, monitoring, nlp, ollama, optimization, parallel, persistence, pipeline, query, reranking, retrieval, rograg, storage, summarization, text, vector) + 7 top-level files (`lib.rs`, `inference.rs`, `pipeline_executor.rs`, `async_graphrag.rs`, `automatic_entity_linking.rs`, `caching_test.rs`, `phase_saver.rs`). Sampled file sizes: `retrieval/mod.rs` 1684 lines, `incremental/mod.rs` 1218 lines, `incremental/delta_computation.rs` 929 lines, `retrieval/pagerank_retrieval.rs` 914 lines, `graph/leiden.rs` 843 lines, `graph/pagerank.rs` 704 lines, `ollama/mod.rs` 719 lines, `text/mod.rs` 646 lines. This is a large, actively-maintained crate, not a toy.
+- **`graphrag-server`** — Actix-web + Apistos REST API (`graphrag-server/Cargo.toml:31-37`), 15 files incl. `lancedb_store.rs`, `qdrant_store.rs`, `handlers.rs`, `auth.rs`, `distributed_cache.rs`.
+- **`graphrag-wasm`** — Leptos-based browser build, `graphrag-wasm/Cargo.toml:33-38` pulls `graphrag-core` with `default-features = false` and only `["wasm","memory-storage","basic-retrieval","leiden"]` — **`pagerank` is explicitly excluded** because "rayon doesn't work in WASM" (comment at `graphrag-wasm/Cargo.toml:31`).
+- **`graphrag-cli`** — ratatui TUI, direct `graphrag-core` integration (no HTTP), features: `async,pagerank,lightrag,leiden,caching,parallel-processing,ollama,rograg,cross-encoder,incremental,json5-support,vector-hnsw` (`graphrag-cli/Cargo.toml:27-40`).
+- **`graphrag`** — thin meta-crate bundling `graphrag-core` + `graphrag-cli` (`graphrag/Cargo.toml`).
+- **`tests/e2e/`** — **not a Rust test harness.** It is a shell-script black-box benchmark driver (`run_benchmarks.sh`, `generate_report.sh`) that builds `graphrag-cli --release` and runs it across a matrix of **7 pipeline dimensions** (approach × embeddings × entity extraction × graph construction × chunking × retrieval-features × LLM model) against real books, writing `results/<pipeline>__<book>.json` and a markdown comparison report (`tests/e2e/README.md:1-113`). No `#[test]` functions here — it's an operational benchmark suite, not CI-gated correctness testing.
+
+---
+
+### B. LANCE / ARROW
+
+Their pins (`Cargo.toml:132-137`, workspace deps): `lancedb = "0.26.2"`, `arrow = "57"` (default-features=false), `arrow-array = "57"`, `arrow-schema = "57"`. **No direct `lance` dependency at all** — only the `lancedb` embedded-DB wrapper crate. Note an internal inconsistency: `graphrag-server/Cargo.toml:48-49` pins `arrow-array = "56"` / `arrow-schema = "56"` directly (bypassing `workspace = true`), one major version behind the workspace's `57` pin — a real drift in their own repo.
+
+Ours: `lance =7.0.0`, `lancedb 0.30.0`, `arrow 58`. Theirs is materially older on all three axes.
+
+**Storage abstraction — trait-based in theory, NOT swappable in practice.** `graphrag-core/src/core/traits.rs:196-320` defines both a sync `VectorStore` trait and an async `AsyncVectorStore` trait (generic `type Error`, `add_vector`/`search`/`search_with_threshold`/`remove_vector`), and a parallel `Storage`/`AsyncStorage` pair (`traits.rs:31-115`). The **only** implementor found is `MemoryStorage` (`graphrag-core/src/storage/mod.rs:11-155`, impls `Storage` only under `#[cfg(feature = "async")]`, `storage/mod.rs:106-155`). The two production vector stores live in a *different crate* (`graphrag-server/src/lancedb_store.rs`, `qdrant_store.rs`) with their own bespoke `LanceDBStore`/`QdrantStore` structs, their own error enums (`LanceDBError`, `QdrantError`), and **neither implements `VectorStore`/`AsyncVectorStore`** (confirmed via grep — zero `impl … for QdrantStore`/`LanceDBStore` against those traits). Backend selection is Cargo-feature-gated (`graphrag-server/Cargo.toml:19-21`, `default = ["qdrant"]`), not runtime-polymorphic through the core trait.
+
+**Bigger finding: `LanceDBStore` is a complete stub.** Every method returns `Err(LanceDBError::NotImplemented(...))`:
+```rust
+// graphrag-server/src/lancedb_store.rs:119-138 (create_table)
+Err(LanceDBError::NotImplemented(
+    "LanceDB integration is a placeholder. Full implementation requires:
+    1. Connect to LanceDB: lancedb::connect(db_path)
+    2. Define schema with vector field
+    3. Create table with schema
+    4. Set up vector index for fast search".to_string(),
+))
+```
+Same pattern at `add_document` (line 168-170), `search` (189-191), `delete_document` (198-200). The Arrow schema helper (`create_schema`, lines 220-244) is real but unused by any actual `lancedb::connect()` call. By contrast `QdrantStore` (`graphrag-server/src/qdrant_store.rs:95-125`) genuinely calls `qdrant_client::Qdrant::from_url(...)`, `CreateCollectionBuilder`, etc. — and Qdrant, not LanceDB, is their `default` feature. **LanceDB support in graphrag-rs is aspirational scaffolding, not working code.**
+
+---
+
+### C. RETRIEVAL STAGES
+
+- **LightRAG dual-level retrieval** — `graphrag-core/src/lightrag/dual_retrieval.rs`. Genuinely self-contained: `DualLevelRetriever` (lines 79-100) owns `keyword_extractor: Arc<KeywordExtractor>` + two `Arc<dyn SemanticSearcher>` (a 1-method trait, lines 73-76) for high/low-level stores, runs both concurrently via `tokio::join!` (line 118-121), then merges via 4 pluggable `MergeStrategy` variants (Interleave/HighFirst/LowFirst/Weighted, lines 57-69, weighted-merge impl at 278-316). Zero coupling to the concrete `GraphRAG` type or to LanceDB/Qdrant — only depends on `SearchResult` and its own trait. **Portable as-is.**
+- **HippoRAG Personalized PageRank** — `graphrag-core/src/retrieval/hipporag_ppr.rs`. Also self-contained: `HippoRAGRetriever` (87-105) holds only `config` + `Option<PersonalizedPageRank>`; the 5-step `retrieve()` (117-140) — entity weights from facts → passage weights from dense scores → combine into reset-probability distribution → run PPR → rank passages — takes all graph state as call parameters (`entity_to_passages: &HashMap<...>`, `passage_scores: &HashMap<...>`), never stores it. Depends on `graph::pagerank::PersonalizedPageRank` (one hop away, also self-contained per below). Faithful to the HippoRAG paper's damping=0.5 / passage-weight=0.05 defaults (`HippoRAGConfig::default()`, lines 49-61). **Portable as-is.**
+- **Cross-encoder rerank** — `graphrag-core/src/reranking/cross_encoder.rs`. Trait-based (`CrossEncoder`, 3 async methods, lines 69-79). Real implementation `CandleCrossEncoder` (94-261) loads a HF-hub BERT via `candle-core`/`candle-transformers`, feature-gated behind `neural-embeddings` (Cargo.toml). **Important caveat:** the always-compiled default, `ConfidenceCrossEncoder` (316-354), is a pure passthrough — `score_pair` returns `Ok(0.0)` unconditionally (line 347-349) and `rerank` just re-wraps the original score with zero delta (329-345). So "cross-encoder reranking" only does real cross-encoding when `neural-embeddings` is compiled in; otherwise it's a no-op stub with the same name. **Portable as reference/pattern; the trait shape is the reusable part, not the Candle backend (which would drag in candle-core/candle-transformers).**
+
+---
+
+### D. GRAPH
+
+- **Leiden — hand-implemented, not a crate dep.** `graphrag-core/src/graph/leiden.rs:10-11` imports `petgraph::graph::{Graph, NodeIndex}` and builds everything from scratch on `Graph<String, f32, Undirected>`. `LeidenCommunityDetector::detect_communities` (477-503) → `hierarchical_leiden` (506-544): Phase 1 singleton init (546-556), Phase 2 greedy local-moving modularity optimization (519-535, `find_best_community`/`calculate_modularity_delta`), Phase 3 **refinement** — the actual Leiden differentiator vs. Louvain — `refine_partition` (600-624) DFS-checks each community's internal connectivity (`is_well_connected`, 627-650) and splits disconnected ones. **Finding worth flagging honestly:** despite the `HierarchicalCommunities`/`LeidenConfig.max_levels` API surface implying multi-level hierarchy, `hierarchical_leiden` only ever populates `levels.insert(level, communities)` **once**, `level` is hardcoded to `0` (line 514) and never incremented — no graph-coarsening/aggregation loop exists (grepped for `aggregate|next_level|level +=` — only one unrelated hit at line 239 in a different function). So the "hierarchical Leiden" in the module doc-comment is currently single-level Louvain-with-refinement, not true multi-resolution Leiden. Useful as an algorithm reference, but don't take the "hierarchical" claim at face value.
+- **PageRank / Fast-GraphRAG pruning** — `graphrag-core/src/graph/pagerank.rs` (704 lines) implements `PersonalizedPageRank`; config includes `sparse_threshold`, `incremental_updates`, `simd_block_size: 32` (used by `HippoRAGConfig::to_pagerank_config()`, `hipporag_ppr.rs:314-328`) suggesting a sparse/SIMD-tuned iterative solver — did not deep-read the full 704 lines but the surface signature is self-contained (`sprs`/`nalgebra`/`parking_lot`/`lru` deps per `Cargo.toml` `pagerank` feature).
+- **WASM reality check:** `graphrag-wasm` compiles `graphrag-core` with `default-features=false`, features `["wasm","memory-storage","basic-retrieval","leiden"]` only (`graphrag-wasm/Cargo.toml:33-38`). **`pagerank` is deliberately excluded** ("pagerank feature requires rayon which doesn't work in WASM", same file line 31) — meaning HippoRAG-PPR and PageRank-retrieval are native-only; only Leiden ships to the browser build. WASM vector search uses "Voy" (75KB k-d tree, per README) not LanceDB/Qdrant at all.
+
+---
+
+### E. CHUNKING
+
+- **`HierarchicalChunker`** (`graphrag-core/src/text/chunking.rs`, 1-262) — genuinely self-contained, pure-Rust recursive-separator splitter (LangChain `RecursiveCharacterTextSplitter`-style: `\n\n → \n → ". " → "! " → "? " → "; " → ": " → " " → ""`, lines 18-28), with UTF-8-boundary-safe backward word-boundary search and abbreviation-aware sentence-boundary detection (`is_likely_abbreviation`, 203-237). Zero deps beyond std. **Portable as-is.**
+- **cAST (tree-sitter AST chunking) — the README oversells this.** `Cargo.toml:856` declares `code-chunking = ["tree-sitter", "tree-sitter-rust"]` as a feature, and `README.md:551-587` documents a `RustCodeChunkingStrategy` with "cAST (Context-Aware Splitting)" branding citing CMU research. **But no such module exists under `graphrag-core/src/text/`** (directory listing: `analysis.rs, boundary_detection.rs, chunk_enricher.rs, chunking.rs, chunking_strategies.rs, contextual_enricher.rs, document_structure.rs, extractive_summarizer.rs, keyword_extraction.rs, late_chunking.rs, layout_parser.rs, mod.rs, semantic_chunking.rs, semantic_coherence.rs, parsers/{html,markdown,plaintext,mod}.rs` — no tree-sitter file). The README's own usage snippet points at `graphrag-core/examples/symposium_trait_based_chunking.rs` for the tree-sitter code path (`README.md:587`), i.e. **the cAST implementation lives in an example, not a reusable library module.** Treat "cAST" as read-the-example-then-reimplement, not "pull in a crate module."
+
+---
+
+### F. INCREMENTAL
+
+`graphrag-core/src/incremental/delta_computation.rs` — this is **snapshot-diffing, not a WAL.** `DeltaComputer::compute_delta(before: &GraphSnapshot, after: &GraphSnapshot) -> GraphDelta` (317-321+) takes two *complete* `GraphSnapshot`s (`nodes: HashMap<String, NodeSnapshot>`, `edges: HashMap<(String,String), EdgeSnapshot>`, lines 65-76) and diffs them via: (1) a hand-rolled Bloom filter for O(1) negative membership pre-checks (`BloomFilter`, 222-279, FNV-1a-variant hash), (2) content-addressed hashing (SHA-256 or BLAKE3, `HashAlgorithm` enum 56-61) per node/edge for change detection, (3) `rayon`-parallel diff computation (`parallel_computation`/`parallel_chunk_size` config, 28-32). Output `GraphDelta` (123-147) has `nodes_added/removed/modified`, `edges_added/removed/modified`, each `*Modification` carrying a `Vec<PropertyChange>` with `old_value`/`new_value`/`ChangeType::{Added,Modified,Removed}` (149-199). This is a **before/after full-state comparison model** — useful as a reference for "compute minimal diff between two graph states," but it is architecturally the opposite of a sequential append-only WAL (no LSN, no fsync, no replay-from-log semantics; both snapshots must be materialized in full before diffing).
+
+The adjacent `incremental/mod.rs` (`IncrementalGraphManager`) is closer to an audit/undo log: `UpdateRecord` (374-398: `id`, `timestamp`, `update_type: UpdateType`, `affected_nodes: Vec<String>`, `affected_edges: Vec<(String,String)>`, `metadata: HashMap<String,String>`) with a 7-variant `UpdateType` (`AddNode/UpdateNode/RemoveNode/AddEdge/UpdateEdge/RemoveEdge/BatchUpdate`, 404-428, explicitly noting batch atomicity/rollback intent in doc comments). Did not verify whether this log is disk-persisted or purely in-memory (`IncrementalGraphManager` struct at line 46 not fully read) — flag as unverified rather than claim WAL durability either way. **For M24: read `delta_computation.rs`'s bloom-filter + content-hash pattern as a candidate for "detect what changed" reconciliation, and `UpdateRecord`/`UpdateType` as a candidate shape for an operation log — but neither is a drop-in WAL; both would need reimplementation on our substrate (SoA/Lance tombstone model, not `HashMap<String,_>` snapshots).**
+
+---
+
+### G. LLM LAYER
+
+Ollama is the only native LLM integration, and it's hand-rolled — **no `rig`, no external agent-framework crate** anywhere in the fetched `Cargo.toml`s (workspace deps list at `Cargo.toml:23-142` has no `rig-core`/`rig`/similar). `graphrag-core/src/ollama/mod.rs` (719 lines) implements its own `OllamaClient` against the raw Ollama HTTP API via `ureq` (sync, `ureq` is a workspace dep) or async paths, with a notably production-grade detail: `OllamaGenerationParams` carries `keep_alive` and `context: Vec<i64>` fields explicitly for **KV-cache priming** — "prime with full document → get context back → send only chunk text with priming context → skip document re-evaluation" (doc comments, lines 40-63), plus `OllamaGenerateResponse.prompt_eval_count`/`eval_count` to verify cache hits (85-97). The core abstraction is the sync `LanguageModel`/async `AsyncLanguageModel` trait pair (`core/traits.rs:524-624`), with type-erased `BoxedAsyncLanguageModel = Box<dyn AsyncLanguageModel<Error=GraphRAGError> + Send + Sync>` (`traits.rs:1444-1445`) used at integration boundaries (e.g. `AsyncGraphRAG.language_model: Option<Arc<BoxedAsyncLanguageModel>>`, `async_graphrag.rs:70`).
+
+Tool-use/function-calling (`graphrag-core/src/function_calling/mod.rs`) is also self-built: `CallableFunction` trait (55-64), `FunctionCaller` orchestrator, own `FunctionDefinition`/`FunctionCall`/`FunctionResult` types keyed on the `json` crate (not `serde_json::Value`) — no OpenAI-function-calling-schema library, no agent framework.
+
+---
+
+### H. FIT VERDICT PER COMPONENT
+
+| Component | File(s) | Verdict | Why |
+|---|---|---|---|
+| LightRAG dual-level retrieval | `lightrag/dual_retrieval.rs` | **REUSE-AS-REFERENCE** | Clean, trait-isolated (`SemanticSearcher`), read the merge-strategy logic and reimplement on `SoaEnvelope`/`Blackboard` types — do not pull the crate |
+| HippoRAG PPR | `retrieval/hipporag_ppr.rs` + `graph/pagerank.rs` | **REUSE-AS-REFERENCE** | Same shape: config+algorithm only, no storage coupling; the entity/passage dual-weight PPR reset-distribution trick is the valuable part to port |
+| Cross-encoder rerank | `reranking/cross_encoder.rs` | **REUSE-AS-REFERENCE (trait shape only)** | `CrossEncoder` trait is a clean 3-method contract worth mirroring; the Candle-BERT body is a heavy dep (candle-core/nn/transformers + hf-hub) we'd never vendor — and the always-on default (`ConfidenceCrossEncoder`) is a no-op stub, so don't cite this crate as "proof cross-encoding works out of the box" |
+| Leiden community detection | `graph/leiden.rs` | **REUSE-AS-REFERENCE, WITH CAVEAT** | Algorithm (local-moving + connectivity-refinement) is real and citable, but the "hierarchical" multi-level claim is not actually implemented (single level only) — read the refinement-phase logic, verify/complete the aggregation loop yourself, don't assume theirs is multi-resolution |
+| cAST / tree-sitter chunking | `examples/symposium_trait_based_chunking.rs` (not in `src/`) | **IGNORE for code, REFERENCE for approach** | Not a library module — it's example code behind a doc-driven feature flag with no `src/` implementation found. If we want AST-aware chunking we'd design it fresh; at most skim the example for the tree-sitter-Rust query pattern |
+| LanceDB storage | `graphrag-server/src/lancedb_store.rs` | **IGNORE** | Entirely unimplemented stub (every method `NotImplemented`); zero salvage value beyond "here's an Arrow schema shape someone sketched" |
+| Qdrant storage | `graphrag-server/src/qdrant_store.rs` | **IGNORE (not our stack)** | Real code, but Qdrant is not part of our fork policy / P0 (`lance-graph` mandates AdaWorldAPI forks, not Qdrant); no reason to consume |
+| `Storage`/`VectorStore` traits | `core/traits.rs` | **REUSE-AS-REFERENCE, low value** | Textbook sync+async trait-pair design (12 traits total, each with a default-batch/health-check async companion) — fine to skim for the "sync trait + async trait + adapter-module bridging them" pattern (`sync_to_async` module, lines 1251-1350), but we already have our own `PlannerContract`/`CamCodecContract`/`OrchestrationBridge` surface in `lance-graph-contract` that is more mature |
+| Delta computation (snapshot diff) | `incremental/delta_computation.rs` | **REUSE-AS-REFERENCE** | Bloom-filter-gated content-hash diffing between two full snapshots is a legitimate, well-isolated pattern for "what changed" reconciliation; port the *idea*, not the code (their `HashMap<String,_>` graph model doesn't map onto our SoA layout) |
+| Incremental update log | `incremental/mod.rs` (`UpdateRecord`/`UpdateType`) | **REUSE-AS-REFERENCE, low confidence** | Plausible shape for an operation log but persistence semantics unverified in this pass — do not assume WAL durability guarantees without reading `IncrementalGraphManager`'s full body first |
+| Ollama client / KV-cache priming | `ollama/mod.rs` | **REUSE-AS-REFERENCE** | The `keep_alive` + `context` two-step KV-cache priming pattern (prime with full doc, then cheap per-chunk continuation) is a genuinely useful idea worth stealing conceptually for any Ollama-backed pipeline we build |
+| Function calling / tool use | `function_calling/mod.rs` | **IGNORE** | Bespoke, uses the `json` crate not `serde_json`, no meaningful advantage over what we'd build against our own `OrchestrationBridge`/`UnifiedStep` surface |
+| `graphrag-core` public API (builder/prelude/PipelineExecutor) | `lib.rs`, `builder/mod.rs`, `pipeline_executor.rs` | **REUSE-AS-REFERENCE (design only)** | See Section I — genuinely good API-shape ideas, zero code worth taking |
+
+**Overall bias confirmed:** per this workspace's P0 fork policy (never take heavy deps casually — AdaWorldAPI forks only, everything else crates.io-only-if-no-fork-exists), and given that most of graphrag-rs's interesting algorithmic pieces (LightRAG, HippoRAG-PPR, Leiden, cross-encoder trait, delta-diffing, KV-cache priming) are already self-contained enough to **read and reimplement** rather than vendor, the realistic outcome across the board is **reference-reading**, not a dependency addition. Nothing here rises to "fork it" — the codebase is useful as an algorithm/design cookbook, not as a crate to wire in.
+
+---
+
+### I. PUBLIC API DESIGN (graphrag-core, the docs.rs surface)
+
+**(1) Pipeline composition surface.** Two parallel paths, both builder-pattern, one config-struct-driven:
+- **Runtime-validated:** `GraphRAGBuilder` (`builder/mod.rs:281-598`) — plain fluent builder over a `Config` struct, `.build()` calls `GraphRAG::new(config)` (581-583), errors surface at `build()` time.
+- **Compile-time-validated (type-state):** `TypedBuilder<Output, Llm>` (`builder/mod.rs:79-271`) — two phantom-typed slots (`NoOutput|HasOutput`, `NoLlm|HasLlm`); `.with_output_dir()` and `.with_ollama()`/`.with_hash_embeddings()`/`.with_candle_embeddings()` transition the type parameters (107-182); `.build()` **only exists** on `TypedBuilder<HasOutput, HasLlm>` (241-271) — calling it before both are configured is a compile error, not a runtime one. Order-independent (either required setter first, verified by `test_typed_builder_llm_before_output`, `builder/mod.rs:801-810`).
+- **Stage composition after `build()`:** `PipelineExecutor<'a>` (`pipeline_executor.rs:51-59`) wraps `&'a mut GraphRAG` and exposes `run_full_pipeline()` (65-101, delegates to `GraphRAG::build_graph()`), `ingest_and_build(text)` (105-115, add-then-build in one call), and `current_state()` (118-120, a zero-cost snapshot report with no pipeline run). This is a thin **facade over one big `build_graph()` call**, not a per-stage step API — despite the module doc calling it "step-by-step," there's no `run_entity_extraction()`/`run_community_detection()` granularity; `GraphRAG::build_graph()` internally handles "all phases" (comment, `pipeline_executor.rs:76`) as one opaque unit.
+- The 7-stage pipeline itself is NOT exposed as discrete public stage types/traits — it's baked into `GraphRAG`'s internal `build` module (`graphrag/build.rs`, not read in this pass but referenced at `graphrag/mod.rs:9`). **This is the API's weakest point relative to what its name promises** — see (5).
+
+**(2) Traits a consumer implements vs. concrete types they just use.** Clean split:
+- *Implement* (to swap a backend): `Embedder`/`AsyncEmbedder`, `VectorStore`/`AsyncVectorStore`, `EntityExtractor`/`AsyncEntityExtractor`, `Retriever`/`AsyncRetriever`, `LanguageModel`/`AsyncLanguageModel`, `GraphStore`/`AsyncGraphStore`, `Storage`/`AsyncStorage`, `FunctionRegistry`/`AsyncFunctionRegistry`, `ConfigProvider`/`AsyncConfigProvider`, `Serializer`/`AsyncSerializer` — all in `core/traits.rs`, each declared **twice**, sync and async, with the async variant carrying default-impl'd batch/concurrent/health-check/retry methods (e.g. `AsyncEmbedder::embed_batch_concurrent` default impl at `traits.rs:154-173`).
+- *Just use*: `Document`, `Entity`, `Relationship`, `TextChunk`, `KnowledgeGraph`, `ChunkId`/`DocumentId`/`EntityId` (newtype wrappers, `core/mod.rs:97-358`), `Config`, `SearchResult`, `GraphRAG` itself — all re-exported in one flat `prelude` module (`lib.rs:179-206`).
+
+**(3) Swappable backends — hybrid generics + type-erased dyn, not enum dispatch.** The trait definitions are fully generic (`trait Embedder { type Error: ...; fn embed(&self, ...) }`), so a consumer *can* monomorphize against a concrete embedder type with zero vtable cost. But at actual composition boundaries — inside `AsyncGraphRAG`, inside the `BoxedAsync*` type aliases — they collapse to `Box<dyn Trait + Send + Sync>` (`traits.rs:1442-1460`, four aliases: `BoxedAsyncLanguageModel`, `BoxedAsyncEmbedder`, `BoxedAsyncVectorStore`, `BoxedAsyncRetriever`). So the pattern is: **generic trait for the impl side, `Box<dyn Trait>` for the storage/composition side** — same shape our own `LanguageModel`-analog would want if we ever need "one struct field, many possible backends chosen at runtime" (e.g. Ollama vs. a future local model) without infecting the whole struct with a generic parameter. No enum-based backend dispatch anywhere in the traits I read.
+
+**(4) Concrete API shapes worth stealing as design inspiration:**
+
+- **Type-state builder for "you cannot call `.build()` until required config is set"** — exact mechanism (phantom-typed marker structs, transitions per-setter, `impl TypedBuilder<HasOutput, HasLlm> { fn build() }` only existing on the fully-configured instantiation):
+  ```rust
+  // builder/mod.rs:79-98, 107-119, 241-260
+  pub struct TypedBuilder<Output = NoOutput, Llm = NoLlm> {
+      config: Config,
+      _output: PhantomData<Output>,
+      _llm: PhantomData<Llm>,
+  }
+  impl<Llm> TypedBuilder<NoOutput, Llm> {
+      pub fn with_output_dir(mut self, dir: &str) -> TypedBuilder<HasOutput, Llm> { ... }
+  }
+  impl TypedBuilder<HasOutput, HasLlm> {
+      pub fn build(self) -> Result<crate::GraphRAG> { crate::GraphRAG::new(self.config) }
+  }
+  ```
+  Directly applicable anywhere we want a mailbox/tenant/kanban builder that must not compile until ownership+layout-version are both set — cheaper than a runtime panic, and it's exactly the shape `I-LEGACY-API-FEATURE-GATED`-style invariants want enforced *before* the object exists.
+
+- **Sync trait + async trait pair, bridged by an adapter module** — instead of only-async (which forces `tokio` everywhere) or only-sync (which blocks executors), they ship both and one bridging module converts:
+  ```rust
+  // core/traits.rs:1255-1264 (sync_to_async module)
+  pub struct StorageAdapter<T>(pub Arc<tokio::sync::Mutex<T>>);
+  #[async_trait]
+  impl<T> AsyncStorage for StorageAdapter<T> where T: Storage + Send + Sync + 'static { ... }
+  ```
+  This is a clean pattern for a `Retriever`-analog trait in our own orchestration surface where some backends are naturally sync (in-memory HashMap lookups) and some are naturally async (network LLM calls) — wrap the sync one once, get the async trait for free.
+
+- **`Box<dyn Trait>` type aliases collected in one place** as the single "here is every pluggable organ, type-erased" surface:
+  ```rust
+  // core/traits.rs:1442-1460
+  pub type BoxedAsyncLanguageModel = Box<dyn AsyncLanguageModel<Error = GraphRAGError> + Send + Sync>;
+  pub type BoxedAsyncEmbedder = Box<dyn AsyncEmbedder<Error = GraphRAGError> + Send + Sync>;
+  pub type BoxedAsyncVectorStore = Box<dyn AsyncVectorStore<Error = GraphRAGError> + Send + Sync>;
+  pub type BoxedAsyncRetriever = Box<dyn AsyncRetriever<Query = String, Result = SearchResult, Error = GraphRAGError> + Send + Sync>;
+  ```
+  Worth stealing as a *naming convention* — a single `boxed.rs`-style module that enumerates every swappable organ type as one alias each, rather than scattering `Box<dyn ...>` inline at every call site.
+
+- **A single-page `prelude` module** re-exporting exactly the ~15 types a 90%-case consumer needs (`lib.rs:179-206`) — cheap, high-value API-surface hygiene: `use graphrag_core::prelude::*;` and you're done, vs. hunting through 33 submodules.
+
+**(5) Anti-inspiring — surface bloat, config sprawl, aspirational-but-unwired features (avoid these):**
+
+- **Config sprawl is severe.** `config/mod.rs` defines **30 separate config structs** (grepped `^pub struct` → `Config`, `GlinerConfig`, `AutoSaveConfig`, `ZeroCostApproachConfig`, `LazyGraphRAGConfig`, `ConceptExtractionConfig`, `CoOccurrenceConfig`, `LazyIndexingConfig`, `LazyQueryExpansionConfig`, `LazyRelevanceScoringConfig`, `E2GraphRAGConfig`, `NERExtractionConfig`, `KeywordExtractionConfig`, `E2GraphConstructionConfig`, `E2IndexingConfig`, `PureAlgorithmicConfig`, `PatternExtractionConfig`, `PureKeywordExtractionConfig`, `RelationshipDiscoveryConfig`, `SearchRankingConfig`, `VectorSearchConfig`, `KeywordSearchConfig`, `GraphTraversalConfig`, `HybridFusionConfig`, `FusionWeights`, `HybridStrategyConfig`, `LazyAlgorithmicConfig`, `ProgressiveConfig`, `BudgetAwareConfig`), most with their own `impl Default`. This is what happens when every experimental pipeline variant (algorithmic/semantic/hybrid/lazy/e2graph/pure-algorithmic) gets its own config struct instead of a smaller number of composable knobs. **Avoid this shape** — it's the config-file equivalent of "a new struct instead of a new column" that our own `I-VSA-IDENTITIES`/SoA doctrine explicitly warns against.
+- **"Composable pipeline executor" doesn't actually decompose the pipeline** — `PipelineExecutor::run_full_pipeline()` is one opaque call into `GraphRAG::build_graph()`; the module's own doc-comment promise of "fine-grained control over the build pipeline phases" (`pipeline_executor.rs:47-50`) isn't backed by per-phase public methods in what I read. If we ever build an equivalent orchestration facade, make the phase boundaries real public methods, not marketing copy over one big call.
+- **Feature-flag-implies-existence is unreliable twice over** in this codebase: `code-chunking` feature wires tree-sitter deps but ships no `src/` module (§E), and the `lancedb` feature/dep is real but the store built on it is 100% stub (§B). **Lesson for our own crate hygiene:** a Cargo feature flag being present and compiling is not evidence the capability is implemented — always chase the actual `impl` body, not the `Cargo.toml` feature list, before citing a capability as "they have X."
+- **`GraphRAG` (sync orchestrator) owns its organs as `Option<T>` fields** (`knowledge_graph`, `retrieval_system`, `query_planner`, `critic`, conditionally `parallel_processor` — `graphrag/mod.rs:62-72`) populated lazily via `ensure_initialized()`/`initialize()` (75-82), which is a reasonable "constructor is separate from compute" shape — but it notably does **not** hold the LLM as a field; Ollama calls happen through free functions/other modules, breaking the "everything the struct needs to think lives in the struct" symmetry that the async variant (`AsyncGraphRAG`) does honor via `language_model: Option<Arc<BoxedAsyncLanguageModel>>` (`async_graphrag.rs:70`). Worth noting as an inconsistency between the sync and async orchestrator designs, not a coherent single doctrine.
+
+---
+
+### ADDENDUM — `graphrag_core::inference::InferenceEngine` deep-dive (operator's primary exhibit)
+
+Full file read: `graphrag-core/src/inference.rs` (417 lines).
+
+**(a) Full public method surface:**
+```rust
+// inference.rs:64-66
+pub fn new(config: InferenceConfig) -> Self
+
+// inference.rs:83-88
+pub fn infer_relationships(
+    &self,
+    target_entity: &EntityId,
+    relation_type: &str,
+    knowledge_graph: &KnowledgeGraph,
+) -> Vec<InferredRelation>
+
+// inference.rs:408-412
+pub fn find_entity_by_name<'a>(
+    &self,
+    knowledge_graph: &'a KnowledgeGraph,
+    name: &str,
+) -> Option<&'a Entity>
+```
+Private helpers: `calculate_evidence_score`, `extract_entity_name`, `calculate_proximity_score`, `entities_near_pattern` (all `&self`, all take borrowed data, no interior state touched).
+
+**(b) State owned vs. borrowed.** `InferenceEngine` owns exactly one field: `config: InferenceConfig` (a `Copy`-ish tiny struct: `min_confidence: f32`, `max_candidates: usize`, `co_occurrence_threshold: f32`, lines 28-46). **It does not own a `KnowledgeGraph`, does not own a retriever, does not own an LLM.** Every method that needs graph data takes `&KnowledgeGraph` as a call parameter and returns owned `Vec<InferredRelation>`/`Option<&Entity>` — pure function over borrowed input, config-parameterized.
+
+**(c) Relation to pipeline stages.** It is neither the top-level pipeline entry point nor a per-stage engine wired into `build_graph()`'s automatic flow — it is an **optional, separately-invoked utility** (module doc: "Implicit relationship inference system," inference.rs:1). Its job is narrow and heuristic: given a target entity, scan the chunks containing it, score co-occurring entities via keyword-pattern matching against 30-ish hardcoded "friendship" phrases (lines 178-207) and negative/"enemy" phrases (226-246) plus token-proximity weighting (322-357, 5-tier distance bucketing), and return ranked candidate relationships above a confidence threshold. This reads as a **narrow, single-purpose analysis pass a caller runs on demand** (e.g. from a CLI command or an example), not infrastructure the 7-stage pipeline depends on. `GraphRAG`'s own `mod` doc-comment structure (§I(4)) never mentions `inference` as one of its owned submodules (`graphrag/mod.rs:19-24` imports `critic, ollama, persistence, query, retrieval` — **not** `inference`).
+
+**(d) Async/sync split and error type.** Fully synchronous — no `async fn`, no `.await` anywhere in the file. **No `Result` return type at all** — `infer_relationships` and `find_entity_by_name` return plain `Vec<T>`/`Option<T>`, never `Err`; failure is represented as an empty result (line 94-96: `if target_ent.is_none() { return inferred_relations; }` — silently returns `vec![]`, no error signal to the caller that the entity wasn't found).
+
+**(e) Fit against the "Thinking is a struct" doctrine.** `InferenceEngine` is the **inverse** of that doctrine. It is a stateless-ish service object: fields are configuration, not cognitive state; methods take the graph as a call parameter rather than holding it as an organ; there is no `free_energy`/`resolution`/`awareness` equivalent — no notion of confidence propagating back into anything, no revision of prior state, no memory across calls (two consecutive `infer_relationships()` calls on the same engine share nothing but the immutable config). It is architecturally closer to a free function that happens to be wrapped in a struct for config-currying than to a cognitive-cycle engine that owns its reasoning tissue. **This is NOT a good template for a cognitive-cycle engine in our doctrine's sense** — it demonstrates the "free function on a carrier's state, reject" pattern our own litmus test explicitly names (per `lance-graph/CLAUDE.md` §"The Click," Litmus tests: "Does this add a free function on a carrier's state, or a method on the carrier? → Free function = reject."). By contrast, `AsyncGraphRAG` (`async_graphrag.rs:64-71`) — which the operator did *not* ask about but which sits one file over — **does** own its organs as fields (`knowledge_graph: Arc<RwLock<Option<KnowledgeGraph>>>`, `document_trees: Arc<RwLock<HashMap<...>>>`, `language_model: Option<Arc<BoxedAsyncLanguageModel>>`), which is the closer analog if the operator wants a "struct owns its tissue" exhibit from this codebase — `InferenceEngine` specifically is the wrong exhibit for that comparison, and worth correcting: the docs.rs prominence of `InferenceEngine` reflects that it's a public top-level module (`pub mod inference;`, `lib.rs:127`), not that it's architecturally central to the crate's design.
\ No newline at end of file
diff --git a/.claude/v3/INTEGRATION-PLAN.md b/.claude/v3/INTEGRATION-PLAN.md
index 66d6f0c4..67120fcb 100644
--- a/.claude/v3/INTEGRATION-PLAN.md
+++ b/.claude/v3/INTEGRATION-PLAN.md
@@ -345,3 +345,96 @@ Operator: "check temporal.rs for a deeper understanding." Verified against
   the W3b role exactly; the thread's queue-vs-stateful debate is resolved
   in V3 by the kanban board being both (M24 WAL + state). GraphRAG-rs
   noted as RAG prior art (native LanceDB/Arrow, Leiden, LightRAG, cAST).
+
+### Addendum-10 2026-07-02 — GraphRAG-rs verdicts + cross-session convergence intake
+
+- **GraphRAG-rs inventory closed** (full doc:
+  `.claude/knowledge/graphrag-rs-inventory.md`; board:
+  E-V3-GRAPHRAG-INV-1): every component REUSE-AS-REFERENCE or IGNORE;
+  the Addendum-9 "native LanceDB/Arrow" note is hereby CORRECTED —
+  their LanceDBStore is a 100% `NotImplemented` stub (Qdrant is the
+  real default), "hierarchical" Leiden is single-level, cAST is
+  example-only. Steal-worthy for W2a/W3: the TypedBuilder type-state
+  pattern (`.build()` only exists on the fully-configured phantom
+  instantiation — a mailbox/tenant/kanban builder that cannot compile
+  until owner + layout version are set) and the sync/async trait-pair
+  + adapter-module bridge. Anti-exhibit: `InferenceEngine` (config-only
+  struct, graph as call parameter) is the inverse of Thinking-is-a-struct;
+  `AsyncGraphRAG` is the organ-owning analog.
+- **Cross-session intake executed** (three sibling wishlists + two
+  synthesis passes; dispositions:
+  `.claude/handovers/2026-07-02-cross-session-wishlist-intake.md`;
+  board: E-V3-XSESSION-INTAKE-1): C6 `RouteBucketTyped` MERGED into
+  `contract::codegen_spine` (nexgen retires its vendor diff);
+  `contract::emission_scan` MINTED (classid_scan sibling); OGAR
+  quick-win batch on the OGAR branch (flip fuse, COUNT_FUSE two-sided,
+  Genetics 0x0E + OCR 0x08XX + 0x1000-reservation as ONE batched
+  allocation-table arc per the convergence ruling "serialize the
+  mints", prose sweep, truncation-doctrine DISCOVERY-MAP mirror).
+- **The scan family is now a named contract pattern** (ratified by
+  3-session convergence): `Form` enum + `classify_*` + fold-to-counts,
+  zero-dep, in the contract — instances: `classid_scan` (V3 adoption),
+  `emission_scan` (typed-DDL adoption). The third counter (soc-verdict
+  counts, predicate-coverage, parity-fixture coverage) MIRRORS this
+  shape; inventing a bespoke grep instead is the drift signal.
+- **Interchange guard (L3/E5 fusion):** ONE Arrow schema family for
+  extraction interchange (triples batch s/p/o/f/c + facets batch),
+  provenance header carries `minter@sha`; ndjson STAYS as the
+  committed-golden/diffable layer (Arrow batches are opaque to PR
+  review); the ingestion side targets the W1b cast/descriptor shape —
+  no second envelope. W5 design item.
+- **Probe-corpus archival convention (unclaimed-gap fix):** every
+  measurement that will be quoted (F17 triage ratio, count_adoption
+  vs a real bake, medcare re-runs) archives input + generation recipe
+  + hash WITH the run, or it recreates "last measured, can't
+  re-verify".
+- **RULING-NEEDED queue for the operator** (recorded in the intake
+  handover §R): R-1 hi-u16 naming — evidence now strong for the
+  concept reading (`0x0102` shared by OP `0x0102_0001` + Redmine
+  `0x0102_0007` is inexpressible under an appid reading); deliver the
+  ruling as ACCESSOR RENAMES not prose (4 prose-rot instances this
+  arc). R-2 EdgeBlock canon → one const set + size asserts both repos
+  pin. R-3 per-entry board files + per-repo coordination dir as ONE
+  merged council proposal (X1 yielded). R-4 OGAR probe-ledger Wave A
+  green-light.
+- **CORRECTIONS (same day, operator rulings — canonical text in the
+  intake handover appendices + E-V3-XSESSION-INTAKE-1-RULINGS):**
+  R-1 was a PHANTOM (canon already exists: `domain:appid:classview`,
+  le-contract.md §prefix + primer §5 — "concept" names the whole hi
+  u16; the "app" homonym across the halves caused the thread; both
+  ledgers now carry the line, OGAR D-CLASSID-HI-U16-SPELLING). R-2
+  closed empirically (edges pull separately via the NODE_ROW_COLUMNS
+  strided view; the 512-byte row — tested against kv-lance AND the
+  batch writer, with time series wired into surrealdb from Lance
+  versioning — is NOT touched; residual = a read-side edges-only
+  test). L3 schema design KILLED ("defining arrow schemas is bullshit
+  and hallucination — we already have a working SoA schema"):
+  extraction interchange lands as SoA rows through the W1b cast path;
+  survivors are minter@sha provenance + ndjson-as-golden only.
+
+### Addendum-11 2026-07-02 — operator ruling: per-consumer classid ownership + tripwires; the five open workstreams
+
+**Ruling:** "everyone should be responsible for his own OGAR classids
+and trip the wire if not in sync." Distributed ownership + fuses IS the
+coordination mechanism — each consumer owns its classid allocations and
+wires a sync-tripping test (the flip fuse + two-sided COUNT_FUSE shipped
+this arc are the pattern instances; the serialized-mint-batch was the
+transitional vehicle, ownership+fuses is the steady state).
+
+**The open workstreams after the fuses (operator-enumerated; "huge
+tasks but manageable if done properly"):**
+
+1. **Thinking ↔ substrate + V3 migration** — this plan's W-waves
+   (W1 shipped #631; W2 kanban/board-as-tenant next).
+2. **Orchestration rs-graph-llm** — W3b KanbanSessionStorage +
+   graph-flow as the replayable handler (M25), oracle-frequency rig.
+3. **OGAR as transpiler + compile-time API** — askama + ClassView +
+   FieldMask bitmask (the Redmine ERB pattern), taking over from
+   SurrealQL AST DLL parsing (SURREAL-AST-AS-ADAPTER completes into
+   the compile-time render path).
+4. **The complete cross-repo inventory** (the ~300-crate census asset:
+   MODULE-TABLE + COMPONENT-MAP + consumer maps) — maintain and
+   complete it; it is the substrate for every wave's preflight.
+5. **Per-consumer accommodation** — ruff + OGAR adaptations,
+   lance-graph UnifiedBridge ↔ OGAR, AST contract evolution (the W5
+   consumer wave, now with the fuse doctrine as its safety rail).
diff --git a/crates/lance-graph-contract/src/classid_scan.rs b/crates/lance-graph-contract/src/classid_scan.rs
index fe5d5530..6b1a37d6 100644
--- a/crates/lance-graph-contract/src/classid_scan.rs
+++ b/crates/lance-graph-contract/src/classid_scan.rs
@@ -230,9 +230,15 @@ mod tests {
     #[test]
     fn classify_form_canon_high_native() {
         // NodeGuid post-flip constants — canon in the HIGH half, native.
-        assert_eq!(classify_form(NodeGuid::CLASSID_OSINT), ClassidForm::CanonHigh);
+        assert_eq!(
+            classify_form(NodeGuid::CLASSID_OSINT),
+            ClassidForm::CanonHigh
+        );
         assert_eq!(classify_form(NodeGuid::CLASSID_FMA), ClassidForm::CanonHigh);
-        assert_eq!(classify_form(NodeGuid::CLASSID_PROJECT), ClassidForm::CanonHigh);
+        assert_eq!(
+            classify_form(NodeGuid::CLASSID_PROJECT),
+            ClassidForm::CanonHigh
+        );
         assert_eq!(classify_form(NodeGuid::CLASSID_ERP), ClassidForm::CanonHigh);
 
         // Post-flip V3-marked forms: canon high, custom == 0x1000 — still
@@ -361,7 +367,11 @@ mod tests {
                 ClassidForm::LegacyV3MarkerHigh,
             ),
             (
-                compose_classid_with(ClassidOrder::CanonLow, concept, AppPrefix::Healthcare.prefix()),
+                compose_classid_with(
+                    ClassidOrder::CanonLow,
+                    concept,
+                    AppPrefix::Healthcare.prefix(),
+                ),
                 ClassidForm::LegacyRenderPrefixHigh,
             ),
         ];
@@ -406,7 +416,11 @@ mod tests {
         let ids = [
             NodeGuid::CLASSID_OSINT_LEGACY,
             compose_classid_with(ClassidOrder::CanonLow, concept, 0x1000),
-            compose_classid_with(ClassidOrder::CanonLow, concept, AppPrefix::Healthcare.prefix()),
+            compose_classid_with(
+                ClassidOrder::CanonLow,
+                concept,
+                AppPrefix::Healthcare.prefix(),
+            ),
         ];
         let counts = count_adoption(ids.into_iter());
         assert_eq!(
@@ -427,12 +441,16 @@ mod tests {
         // each legacy shape), 0 ambiguous — the mixed-corpus case.
         let concept = a_concept();
         let ids = [
-            NodeGuid::CLASSID_OSINT, // CanonHigh
+            NodeGuid::CLASSID_OSINT,                                        // CanonHigh
             compose_classid_with(ClassidOrder::CanonHigh, concept, 0x1000), // CanonHigh (V3-marked)
-            NodeGuid::CLASSID_DEFAULT, // CanonHigh (degenerate)
+            NodeGuid::CLASSID_DEFAULT,      // CanonHigh (degenerate)
             NodeGuid::CLASSID_OSINT_LEGACY, // LegacyZeroPrefixHigh
             compose_classid_with(ClassidOrder::CanonLow, concept, 0x1000), // LegacyV3MarkerHigh
-            compose_classid_with(ClassidOrder::CanonLow, concept, AppPrefix::Healthcare.prefix()), // LegacyRenderPrefixHigh
+            compose_classid_with(
+                ClassidOrder::CanonLow,
+                concept,
+                AppPrefix::Healthcare.prefix(),
+            ), // LegacyRenderPrefixHigh
         ];
         let counts = count_adoption(ids.into_iter());
         assert_eq!(
diff --git a/crates/lance-graph-contract/src/codegen_spine.rs b/crates/lance-graph-contract/src/codegen_spine.rs
index 7562c502..a7cda942 100644
--- a/crates/lance-graph-contract/src/codegen_spine.rs
+++ b/crates/lance-graph-contract/src/codegen_spine.rs
@@ -355,6 +355,97 @@ pub trait RouteBucket {
     }
 }
 
+// ---------------------------------------------------------------------------
+// ② (cont.) RouteBucketTyped — kind-generic sibling for non-Odoo targets
+// ---------------------------------------------------------------------------
+
+/// Sibling to [`RouteBucket`] for codegen targets whose handler kinds are
+/// not the Odoo set. Generic over the kind type so a non-Odoo target
+/// (e.g. OpenProject, Wikidata, a future framework) can reuse the bucket
+/// abstraction without forcing its kinds into [`OdooMethodKind`].
+///
+/// # Why this exists
+///
+/// [`RouteBucket::kind`] returns [`OdooMethodKind`] by value — concrete,
+/// not a generic or associated type. That hardcodes the Odoo handler-kind
+/// taxonomy into the trait surface, so a non-Odoo consumer cannot
+/// `impl RouteBucket` with its own kind enum (no additive escape).
+///
+/// `RouteBucketTyped` parameterises the kind so each target can bring its
+/// own enum. The existing [`RouteBucket`] is **not** modified; the
+/// [`blanket impl`][impl_blanket] below makes every `RouteBucket` automatically
+/// a `RouteBucketTyped<Kind = OdooMethodKind>` so generic consumers can
+/// accept both shapes (`fn f<B: RouteBucketTyped<Kind = MyKind>>(b: &B)` and
+/// `fn g<B: RouteBucketTyped<Kind = OdooMethodKind>>(b: &B)` both work).
+///
+/// # Coherence note
+///
+/// The blanket impl pins `Kind = OdooMethodKind` for every `RouteBucket`
+/// implementor. A type that *also* needs `RouteBucketTyped` with a
+/// **different** kind must NOT impl `RouteBucket` (it would conflict).
+/// Non-Odoo targets simply skip the legacy trait and impl this one directly.
+///
+/// # Method-name collision with `RouteBucket` (deliberate — read this)
+///
+/// This trait intentionally reuses the `kind` / `id` / `id_owned` method
+/// names so the two traits carry ONE contract shape, and the blanket impl
+/// delegates verbatim to `RouteBucket` — the two resolutions are always
+/// semantically identical. The cost (codex P2, PR #632): a scope that
+/// brings BOTH traits in unqualified (e.g. `use codegen_spine::*;`) makes
+/// `bucket.kind()` ambiguous on a concrete `RouteBucket` implementor.
+/// This is a **compile error with an obvious fix, never silent
+/// misbehavior** — disambiguate with UFCS (`RouteBucket::kind(&b)` /
+/// `RouteBucketTyped::kind(&b)`, either is correct because the blanket
+/// delegates), or import only the trait you consume. Renaming the methods
+/// was rejected: it would fork the contract shape and break the
+/// already-deployed op-nexgen consumers that code against `kind()`. The
+/// tests below demonstrate the UFCS pattern where both traits share a
+/// scope.
+pub trait RouteBucketTyped {
+    /// The handler-kind enum specific to this target. Must be `Copy + Eq`
+    /// so it can drive dispatcher tables / `match` arms / hash keys.
+    type Kind: Copy + Eq;
+
+    /// The kind of this route in this target's taxonomy.
+    fn kind(&self) -> Self::Kind;
+
+    /// Stable identity of this route (same contract as [`RouteBucket::id`]).
+    fn id(&self) -> &str;
+
+    /// Owned-id escape hatch (same contract as [`RouteBucket::id_owned`]).
+    fn id_owned(&self) -> String {
+        self.id().to_string()
+    }
+}
+
+/// Bridge impl: every [`RouteBucket`] is automatically a
+/// [`RouteBucketTyped`] with `Kind = OdooMethodKind`. This is the back-compat
+/// seam — existing consumers that only know about `RouteBucket` continue to
+/// work, and new generic code can take `RouteBucketTyped` and accept both.
+///
+/// The `T: ?Sized` bound is load-bearing: without it the blanket only covers
+/// sized implementors, which leaves the documented `&dyn RouteBucket` shape
+/// outside the bridge. Codex PR #8 P2 flagged the gap. With `?Sized` an
+/// existing `&dyn RouteBucket` is usable as `&dyn RouteBucketTyped<Kind = OdooMethodKind>`,
+/// or through a `?Sized`-bounded generic.
+///
+/// [impl_blanket]: #impl-RouteBucketTyped-for-T
+impl<T: RouteBucket + ?Sized> RouteBucketTyped for T {
+    type Kind = OdooMethodKind;
+
+    fn kind(&self) -> OdooMethodKind {
+        RouteBucket::kind(self)
+    }
+
+    fn id(&self) -> &str {
+        RouteBucket::id(self)
+    }
+
+    fn id_owned(&self) -> String {
+        RouteBucket::id_owned(self)
+    }
+}
+
 // ---------------------------------------------------------------------------
 // ③ WidgetRender — the askama GUI shape contract
 // ---------------------------------------------------------------------------
@@ -589,8 +680,15 @@ mod tests {
             id: "account.move._compute_amount".into(),
             kind: OdooMethodKind::IterRecordsAggregateRelation,
         };
-        assert_eq!(r.kind().id(), "iter_records_aggregate_relation");
-        assert_eq!(r.id(), "account.move._compute_amount");
+        // UFCS disambiguation: both `RouteBucket::kind` and (via the C6
+        // blanket impl) `RouteBucketTyped::kind` are in scope here through
+        // `use super::*`; downstream consumers that import only one trait
+        // do NOT need this. Semantics unchanged.
+        assert_eq!(
+            RouteBucket::kind(&r).id(),
+            "iter_records_aggregate_relation"
+        );
+        assert_eq!(RouteBucket::id(&r), "account.move._compute_amount");
     }
 
     #[test]
@@ -608,7 +706,8 @@ mod tests {
         struct DummyWidget;
         impl WidgetRender<DummyRoute> for DummyWidget {
             fn render(bucket: &DummyRoute) -> Result<String, WidgetRenderError> {
-                Ok(format!("widget for kind={}", bucket.kind()))
+                // UFCS — see disambiguation note above.
+                Ok(format!("widget for kind={}", RouteBucket::kind(bucket)))
             }
         }
 
@@ -629,4 +728,155 @@ mod tests {
         assert_eq!(dep_traversal, Genericity::Agnostic);
         assert_eq!(skr04, Genericity::Domain);
     }
+
+    // -----------------------------------------------------------------------
+    // RouteBucketTyped — kind-generic sibling trait (additive, non-Odoo
+    // targets bring their own kind enum)
+    // -----------------------------------------------------------------------
+
+    /// A non-Odoo target's handler-kind taxonomy. Stand-in for, e.g.,
+    /// OpenProject's `list_for_tenant` / `detail_for_tenant` /
+    /// `template_get` / `csrf_form_post_engine_call` set. Used here only to
+    /// exercise that `RouteBucketTyped` accepts an arbitrary `Kind`.
+    #[derive(Debug, Clone, Copy, PartialEq, Eq)]
+    enum OpKindFixture {
+        ListForTenant,
+        DetailForTenant,
+        TemplateGet,
+    }
+
+    /// An OP-style bucket: impls **only** `RouteBucketTyped`, NOT
+    /// `RouteBucket`. Proves a non-Odoo target plugs in additively without
+    /// touching the legacy trait or its enum.
+    struct OpBucket {
+        kind: OpKindFixture,
+        id: String,
+    }
+
+    impl RouteBucketTyped for OpBucket {
+        type Kind = OpKindFixture;
+        fn kind(&self) -> OpKindFixture {
+            self.kind
+        }
+        fn id(&self) -> &str {
+            &self.id
+        }
+    }
+
+    #[test]
+    fn route_bucket_typed_accepts_non_odoo_kind() {
+        let b = OpBucket {
+            kind: OpKindFixture::ListForTenant,
+            id: "projects.list_work_packages".to_string(),
+        };
+        assert_eq!(b.kind(), OpKindFixture::ListForTenant);
+        assert_eq!(b.id(), "projects.list_work_packages");
+        assert_eq!(b.id_owned(), "projects.list_work_packages");
+    }
+
+    #[test]
+    fn route_bucket_typed_generic_consumer_accepts_op_kind() {
+        // A generic consumer parameterised on the kind enum compiles + runs
+        // for the OP kind — the whole point of the additive trait.
+        fn dispatch_one<B: RouteBucketTyped<Kind = OpKindFixture>>(b: &B) -> &'static str {
+            match b.kind() {
+                OpKindFixture::ListForTenant => "list",
+                OpKindFixture::DetailForTenant => "detail",
+                OpKindFixture::TemplateGet => "template",
+            }
+        }
+        let b = OpBucket {
+            kind: OpKindFixture::DetailForTenant,
+            id: "projects.get_work_package".to_string(),
+        };
+        assert_eq!(dispatch_one(&b), "detail");
+        // Construct the remaining fixture variant through the same generic
+        // consumer so every OpKindFixture variant is exercised (dead-code
+        // lint under `-D warnings`) and all match arms are reachable.
+        let t = OpBucket {
+            kind: OpKindFixture::TemplateGet,
+            id: "projects.render_template".to_string(),
+        };
+        assert_eq!(dispatch_one(&t), "template");
+    }
+
+    /// A back-compat Odoo bucket: impls `RouteBucket` only. The blanket impl
+    /// MUST make it usable as `RouteBucketTyped<Kind = OdooMethodKind>`
+    /// without any additional code.
+    struct OdooBucketCompat {
+        kind: OdooMethodKind,
+        id: &'static str,
+    }
+    impl RouteBucket for OdooBucketCompat {
+        fn kind(&self) -> OdooMethodKind {
+            self.kind
+        }
+        fn id(&self) -> &str {
+            self.id
+        }
+    }
+
+    #[test]
+    fn route_bucket_blanket_impl_preserves_odoo_consumers() {
+        let b = OdooBucketCompat {
+            kind: OdooMethodKind::PassOverride,
+            id: "account.move._compute_amount",
+        };
+        // Direct RouteBucket access — unchanged from before C6.
+        assert_eq!(RouteBucket::kind(&b), OdooMethodKind::PassOverride);
+        assert_eq!(RouteBucket::id(&b), "account.move._compute_amount");
+        // Same bucket, via the new RouteBucketTyped trait — the blanket impl
+        // pins Kind = OdooMethodKind so this resolves without any extra impl.
+        let typed: &dyn RouteBucketTyped<Kind = OdooMethodKind> = &b;
+        assert_eq!(typed.kind(), OdooMethodKind::PassOverride);
+        assert_eq!(typed.id(), "account.move._compute_amount");
+        assert_eq!(typed.id_owned(), "account.move._compute_amount");
+    }
+
+    #[test]
+    fn route_bucket_typed_generic_consumer_accepts_odoo_via_blanket() {
+        // A generic consumer parameterised on `OdooMethodKind` accepts an
+        // implementor that only knows about `RouteBucket` — proving the
+        // blanket impl is the back-compat bridge, not just for show.
+        fn name<B: RouteBucketTyped<Kind = OdooMethodKind>>(b: &B) -> &'static str {
+            b.kind().id()
+        }
+        let b = OdooBucketCompat {
+            kind: OdooMethodKind::IterRecordsComputeFromRelated,
+            id: "x.y",
+        };
+        assert_eq!(name(&b), "iter_records_compute_from_related");
+    }
+
+    #[test]
+    fn route_bucket_blanket_impl_covers_dyn_route_bucket() {
+        // Codex PR #8 P2: an erased `&dyn RouteBucket` must also be usable
+        // through the new trait. The fix is `T: ?Sized` on the blanket; the
+        // call shape codex named is a `?Sized`-bounded generic accepting
+        // the erased trait object.
+        //
+        // (Note: Rust does NOT permit direct trait-object-to-trait-object
+        // coercion `&dyn RouteBucket -> &dyn RouteBucketTyped<...>` even with
+        // the blanket — that would require trait-object upcasting, which is
+        // a separate feature. The legitimate / supported reach is the
+        // `?Sized`-bounded generic below; with `T: Sized` (the pre-fix shape)
+        // this `label(erased)` call would not compile.)
+        let concrete = OdooBucketCompat {
+            kind: OdooMethodKind::PassOverride,
+            id: "account.move._compute_amount",
+        };
+        let erased: &dyn RouteBucket = &concrete;
+
+        fn label<B: RouteBucketTyped<Kind = OdooMethodKind> + ?Sized>(b: &B) -> String {
+            format!("{}={}", b.id(), b.kind().id())
+        }
+        assert_eq!(label(erased), "account.move._compute_amount=pass_override");
+
+        // Bonus: same generic accepts a sized concrete implementor, ensuring
+        // the `?Sized` widening did not break the original Sized path.
+        assert_eq!(
+            label(&concrete),
+            "account.move._compute_amount=pass_override"
+        );
+    }
 }
diff --git a/crates/lance-graph-contract/src/emission_scan.rs b/crates/lance-graph-contract/src/emission_scan.rs
new file mode 100644
index 00000000..895f5f1c
--- /dev/null
+++ b/crates/lance-graph-contract/src/emission_scan.rs
@@ -0,0 +1,345 @@
+//! `emission_scan` — DDL-type-expression counting logic (zero-dep).
+//!
+//! Requested by the op-nexgen consumer session (post-#630 wishlist L2) so
+//! every consumer measures typed-DDL schema coverage identically instead of
+//! grepping its own emitted DDL; sibling of [`classid_scan`](crate::classid_scan)
+//! (D-V3-W6a). Reference figure at request time: nexgen measured 89.5% typed
+//! fields on the OpenProject corpus by hand-grep.
+//!
+//! This module mirrors `classid_scan`'s two-piece shape: [`classify_ddl_type`]
+//! buckets one DDL type expression (a `DEFINE FIELD ... TYPE <expr>` right-hand
+//! side) into a [`TypedForm`], [`count_emission`] folds an iterator of such
+//! expressions into [`EmissionCounts`]. No classid handling of any kind lives
+//! here — this module counts *type expressions*, not classids, and performs no
+//! bit math on any composed `u32` (`/v3-audit` check 1 does not apply — there is
+//! nothing here for it to flag).
+//!
+//! Classification is a deterministic tokenizer walk over the expression's
+//! alphanumeric runs, in fixed PRECEDENCE order: [`TypedForm::Stub`] >
+//! [`TypedForm::RecordLink`] > [`TypedForm::AnyTyped`] > [`TypedForm::Typed`].
+//! Tokenizing on non-alphanumeric boundaries means a token equal to `any` only
+//! matches the bare word `any` (not a substring of `many`), and a token equal
+//! to `record` only matches the bare word `record` (not a substring of
+//! `recording`) — see [`classify_ddl_type`]'s doc comment for the full worked
+//! example table.
+//!
+//! # The contract scan-family pattern (named 2026-07-02)
+//!
+//! This is the SECOND instance of a named design language: a governance
+//! metric implemented as a zero-dep contract fold — a `Form` enum +
+//! `classify_*` per item + `count_*` fold to a counts struct. Instances:
+//! [`classid_scan`](crate::classid_scan) (V3 classid adoption) and this
+//! module (typed-DDL adoption). Ratified by three-session convergence
+//! (board: E-V3-XSESSION-INTAKE-1): the NEXT governance counter
+//! (soc-verdict counts, predicate coverage, parity-fixture coverage, ...)
+//! MIRRORS this shape in the contract instead of living as a consumer-side
+//! grep — a bespoke grep where a scan module belongs is the drift signal.
+
+/// The decoded shape of one SurrealQL DDL `TYPE` expression, per the
+/// precedence procedure in [`classify_ddl_type`].
+#[derive(Debug, Clone, Copy, PartialEq, Eq, Hash)]
+#[non_exhaustive]
+pub enum TypedForm {
+    /// A concrete SurrealQL type expression: `int`, `float`, `bool`,
+    /// `string`, `datetime`, `duration`, `bytes`, `number`, `decimal`,
+    /// `object`, `geometry`, `uuid`, or an `array<T>` / `set<T>` /
+    /// `option<T>` wrapping a concrete `T`. Worked examples: `"int"`,
+    /// `"array<float>"`.
+    Typed,
+    /// The expression's effective type is `any` — bare `any`, or
+    /// `array<any>` / `set<any>` / `option<any>` (schema present but
+    /// typeless). Worked examples: `"any"`, `"array<any>"`.
+    AnyTyped,
+    /// The expression contains a `record` link type (`record<...>`, or
+    /// nested inside another wrapper, e.g. `array<record<user>>`). Worked
+    /// examples: `"record<work_package>"`, `"array<record<user>>"`.
+    RecordLink,
+    /// No usable type: an empty or whitespace-only expression, or a
+    /// placeholder marker (`todo` / `stub` / `fixme`, case-insensitive, as a
+    /// standalone token). Worked examples: `""`, `"TODO"`.
+    Stub,
+}
+
+/// Classify one DDL type expression into its [`TypedForm`], by precedence
+/// **Stub > RecordLink > AnyTyped > Typed**.
+///
+/// Tokenizes the expression on non-alphanumeric boundaries (`<`, `>`, `|`,
+/// whitespace, etc.), so a token is only ever a maximal run of ASCII
+/// alphanumerics — `many` never matches the `any` token test, and `recording`
+/// never matches the `record` token test.
+///
+/// - Empty/whitespace-only expression, or any token case-insensitively equal
+///   to `todo` / `stub` / `fixme` → [`TypedForm::Stub`].
+/// - Otherwise, any token equal to `record` (any case run — SurrealQL type
+///   keywords are lowercase, matched case-sensitively like the rest of this
+///   classifier) → [`TypedForm::RecordLink`].
+/// - Otherwise, any token equal to `any` → [`TypedForm::AnyTyped`].
+/// - Otherwise → [`TypedForm::Typed`].
+///
+/// Worked examples (doc-pinned, mirrored in `#[test]`s below):
+///
+/// | Expression              | Result               |
+/// |--------------------------|----------------------|
+/// | `"int"`                  | `Typed`               |
+/// | `"array<float>"`         | `Typed`               |
+/// | `"any"`                  | `AnyTyped`             |
+/// | `"array<any>"`           | `AnyTyped`             |
+/// | `"record<work_package>"` | `RecordLink`           |
+/// | `"array<record<user>>"`  | `RecordLink`           |
+/// | `""`                     | `Stub`                 |
+/// | `"TODO"`                 | `Stub`                 |
+#[inline]
+#[must_use]
+pub fn classify_ddl_type(ty: &str) -> TypedForm {
+    let mut saw_record = false;
+    let mut saw_any = false;
+    let mut token_count = 0usize;
+
+    for token in ty.split(|c: char| !c.is_ascii_alphanumeric() && c != '_') {
+        if token.is_empty() {
+            continue;
+        }
+        token_count += 1;
+
+        // Stub is the ONLY early return: it is top precedence, so nothing a
+        // later token could contain outranks it. `record`/`any` must NOT
+        // early-return — a stub marker may still follow (e.g.
+        // `record<user> TODO`, `record<fixme>`), and the documented
+        // precedence says Stub wins globally, not first-token-wins
+        // (codex P2, PR #632).
+        if token.eq_ignore_ascii_case("todo")
+            || token.eq_ignore_ascii_case("stub")
+            || token.eq_ignore_ascii_case("fixme")
+        {
+            return TypedForm::Stub;
+        }
+        if token == "record" {
+            saw_record = true;
+        }
+        if token == "any" {
+            saw_any = true;
+        }
+    }
+
+    if token_count == 0 {
+        // Empty or whitespace-only expression.
+        return TypedForm::Stub;
+    }
+    if saw_record {
+        return TypedForm::RecordLink;
+    }
+    if saw_any {
+        return TypedForm::AnyTyped;
+    }
+    TypedForm::Typed
+}
+
+/// Range-count tallies over a scanned set of DDL type expressions, mirroring
+/// `classid_scan::AdoptionCounts`'s field shape.
+#[derive(Debug, Clone, Copy, PartialEq, Eq, Default)]
+pub struct EmissionCounts {
+    /// Rows classified as [`TypedForm::Typed`].
+    pub typed: u64,
+    /// Rows classified as [`TypedForm::AnyTyped`].
+    pub any_typed: u64,
+    /// Rows classified as [`TypedForm::RecordLink`].
+    pub record_link: u64,
+    /// Rows classified as [`TypedForm::Stub`].
+    pub stub: u64,
+}
+
+impl EmissionCounts {
+    /// Total rows observed (`typed + any_typed + record_link + stub`).
+    #[inline]
+    #[must_use]
+    pub fn total(&self) -> u64 {
+        self.typed + self.any_typed + self.record_link + self.stub
+    }
+
+    /// Fold one classified [`TypedForm`] into the running tallies.
+    #[inline]
+    pub fn observe(&mut self, form: TypedForm) {
+        match form {
+            TypedForm::Typed => self.typed += 1,
+            TypedForm::AnyTyped => self.any_typed += 1,
+            TypedForm::RecordLink => self.record_link += 1,
+            TypedForm::Stub => self.stub += 1,
+        }
+    }
+
+    /// Typed-coverage ratio: `typed / total`, in `[0.0, 1.0]`. `0.0` for an
+    /// empty scan (`total() == 0`) rather than `NaN` — mirrors
+    /// `classid_scan::AdoptionCounts::adoption_pct`'s "empty corpus is
+    /// vacuously not-yet-typed, not undefined" convention exactly (that
+    /// method also returns `f64`, so this crate's zero-dep constraint does
+    /// not forbid floating point — see the module-level report note on why
+    /// this deviates from an integer-permille shape).
+    #[inline]
+    #[must_use]
+    pub fn typed_ratio(&self) -> f64 {
+        let total = self.total();
+        if total == 0 {
+            0.0
+        } else {
+            self.typed as f64 / total as f64
+        }
+    }
+}
+
+/// Fold an iterator of DDL type expressions into [`EmissionCounts`] by
+/// [`classify_ddl_type`]. Mirrors `classid_scan::count_adoption`'s signature
+/// shape (`impl Iterator<Item = T>`, not `IntoIterator`) over `&str` items.
+#[must_use]
+pub fn count_emission<'a>(types: impl Iterator<Item = &'a str>) -> EmissionCounts {
+    let mut counts = EmissionCounts::default();
+    for ty in types {
+        counts.observe(classify_ddl_type(ty));
+    }
+    counts
+}
+
+#[cfg(test)]
+mod tests {
+    use super::*;
+
+    // ── classify_ddl_type: doc-pinned worked examples ──
+
+    #[test]
+    fn classify_ddl_type_int_is_typed() {
+        assert_eq!(classify_ddl_type("int"), TypedForm::Typed);
+    }
+
+    #[test]
+    fn classify_ddl_type_array_float_is_typed() {
+        assert_eq!(classify_ddl_type("array<float>"), TypedForm::Typed);
+    }
+
+    #[test]
+    fn classify_ddl_type_bare_any_is_any_typed() {
+        assert_eq!(classify_ddl_type("any"), TypedForm::AnyTyped);
+    }
+
+    #[test]
+    fn classify_ddl_type_array_any_is_any_typed() {
+        assert_eq!(classify_ddl_type("array<any>"), TypedForm::AnyTyped);
+    }
+
+    #[test]
+    fn classify_ddl_type_record_link_is_record_link() {
+        assert_eq!(
+            classify_ddl_type("record<work_package>"),
+            TypedForm::RecordLink
+        );
+    }
+
+    #[test]
+    fn classify_ddl_type_nested_array_record_is_record_link() {
+        assert_eq!(
+            classify_ddl_type("array<record<user>>"),
+            TypedForm::RecordLink
+        );
+    }
+
+    #[test]
+    fn classify_ddl_type_empty_is_stub() {
+        assert_eq!(classify_ddl_type(""), TypedForm::Stub);
+    }
+
+    #[test]
+    fn classify_ddl_type_whitespace_only_is_stub() {
+        assert_eq!(classify_ddl_type("   "), TypedForm::Stub);
+    }
+
+    #[test]
+    fn classify_ddl_type_todo_marker_is_stub() {
+        assert_eq!(classify_ddl_type("TODO"), TypedForm::Stub);
+        assert_eq!(classify_ddl_type("todo"), TypedForm::Stub);
+        assert_eq!(classify_ddl_type("stub"), TypedForm::Stub);
+        assert_eq!(classify_ddl_type("fixme"), TypedForm::Stub);
+    }
+
+    // ── precedence tests ──
+
+    #[test]
+    fn classify_ddl_type_record_any_is_record_link_not_any_typed() {
+        // record<any> — precedence: RecordLink > AnyTyped, both tokens present.
+        assert_eq!(classify_ddl_type("record<any>"), TypedForm::RecordLink);
+    }
+
+    #[test]
+    fn classify_ddl_type_stub_marker_beats_record_and_any() {
+        // A stub marker anywhere in the expression wins, even alongside
+        // record/any tokens.
+        assert_eq!(classify_ddl_type("TODO record<any>"), TypedForm::Stub);
+    }
+
+    #[test]
+    fn classify_ddl_type_stub_marker_after_record_still_wins() {
+        // Regression (codex P2, PR #632): a stub marker AFTER the `record`
+        // token must still win — precedence is global over the whole
+        // expression, never first-token-wins. Before the fix, the early
+        // return on `record` miscounted partially-stubbed record-link DDL
+        // as real links.
+        assert_eq!(classify_ddl_type("record<user> TODO"), TypedForm::Stub);
+        assert_eq!(classify_ddl_type("record<fixme>"), TypedForm::Stub);
+        assert_eq!(
+            classify_ddl_type("array<record<user>> stub"),
+            TypedForm::Stub
+        );
+    }
+
+    #[test]
+    fn classify_ddl_type_false_positive_guard_many_and_recording() {
+        // Substring "any" inside "many" and substring "record" inside
+        // "recording" must NOT trigger the corresponding classification —
+        // tokenization is on non-alphanumeric boundaries only.
+        assert_eq!(classify_ddl_type("many"), TypedForm::Typed);
+        assert_eq!(classify_ddl_type("recording"), TypedForm::Typed);
+        assert_eq!(classify_ddl_type("array<recording>"), TypedForm::Typed);
+    }
+
+    // ── EmissionCounts / count_emission ──
+
+    #[test]
+    fn count_emission_mixed_produces_correct_tallies_and_ratio() {
+        let types = [
+            "int",                  // Typed
+            "array<float>",         // Typed
+            "any",                  // AnyTyped
+            "array<any>",           // AnyTyped
+            "record<work_package>", // RecordLink
+            "array<record<user>>",  // RecordLink
+            "",                     // Stub
+            "TODO",                 // Stub
+        ];
+        let counts = count_emission(types.into_iter());
+        assert_eq!(
+            counts,
+            EmissionCounts {
+                typed: 2,
+                any_typed: 2,
+                record_link: 2,
+                stub: 2,
+            }
+        );
+        assert_eq!(counts.total(), 8);
+        assert!((counts.typed_ratio() - 0.25).abs() < f64::EPSILON);
+    }
+
+    #[test]
+    fn count_emission_all_typed_is_full_ratio() {
+        let types = ["int", "bool", "string"];
+        let counts = count_emission(types.into_iter());
+        assert_eq!(counts.total(), 3);
+        assert!((counts.typed_ratio() - 1.0).abs() < f64::EPSILON);
+    }
+
+    #[test]
+    fn count_emission_empty_iterator_is_zero_not_nan() {
+        let counts = count_emission(std::iter::empty());
+        assert_eq!(counts, EmissionCounts::default());
+        assert_eq!(counts.total(), 0);
+        assert_eq!(counts.typed_ratio(), 0.0);
+        assert!(!counts.typed_ratio().is_nan());
+    }
+}
diff --git a/crates/lance-graph-contract/src/lib.rs b/crates/lance-graph-contract/src/lib.rs
index 504e81bb..12b9101c 100644
--- a/crates/lance-graph-contract/src/lib.rs
+++ b/crates/lance-graph-contract/src/lib.rs
@@ -71,6 +71,10 @@ pub mod counterfactual;
 pub mod crystal;
 pub mod cycle_accumulator;
 pub mod distance;
+/// D-V3-W6a — DDL typed-emission counting logic (`TypedForm`,
+/// `classify_ddl_type`, `EmissionCounts`, `count_emission`), sibling of
+/// [`classid_scan`]. Requested by the op-nexgen consumer session.
+pub mod emission_scan;
 pub mod episodic_edges;
 pub mod escalation;
 pub mod exploration;
diff --git a/crates/lance-graph-contract/src/ogar_codebook.rs b/crates/lance-graph-contract/src/ogar_codebook.rs
index c0804a4b..8ebf7450 100644
--- a/crates/lance-graph-contract/src/ogar_codebook.rs
+++ b/crates/lance-graph-contract/src/ogar_codebook.rs
@@ -476,6 +476,16 @@ pub const CODEBOOK: &[(&str, u16)] = &[
     ("pricelist", 0x0209),
     ("pricelist_rule", 0x020A),
     ("unit_of_measure", 0x020B),
+    // ── 0x08XX — OCR domain (document extraction; the Tesseract-rs arc) ──
+    // Class-level container KINDS only (the 5+3-hardened mint discipline):
+    // the concept slots name the container types the OGAR Core resolves —
+    // never their content. The 112 unichars of a trained unicharset are
+    // content-store rows, NOT concept slots (the Osint zero-rows ruling is
+    // the guard precedent; unlike Osint, OCR's container kinds ARE cross-app
+    // concepts and do get slots). Mirrors OGAR `ogar_vocab::CODEBOOK` 0x08XX.
+    ("unicharset", 0x0801),
+    ("recoder", 0x0802),
+    ("charset", 0x0803),
     // ── 0x09XX — Health domain (MedCare; OGIT NTO/Healthcare promotion) ──
     ("patient", 0x0901),
     ("diagnosis", 0x0902),
@@ -653,6 +663,9 @@ mod tests {
         assert_eq!(canonical_concept_id("commercial_line_item"), Some(0x0201));
         assert_eq!(canonical_concept_id("commercial_document"), Some(0x0202));
         assert_eq!(canonical_concept_id("currency_policy"), Some(0x0206));
+        // 0x08XX OCR (container kinds; unichar content stays out of the codebook).
+        assert_eq!(canonical_concept_id("unicharset"), Some(0x0801));
+        assert_eq!(canonical_concept_id("charset"), Some(0x0803));
         // 0x09XX Health + 0x0BXX Auth (OGAR #110 minted the AuthStore family).
         assert_eq!(canonical_concept_id("patient"), Some(0x0901));
         assert_eq!(canonical_concept_id("vital_sign"), Some(0x0907));