Add Gemma 4 architecture support to TransformerBridge by punishell · Pull Request #1377 · TransformerLensOrg/TransformerLens

punishell · 2026-06-10T12:33:15Z

Description

Adds TransformerBridge support for Google's Gemma 4 family (released April 2026), which had no support in TransformerLens.

Fixes #1297

A single text-only adapter covers both architectures:

Gemma4ForConditionalGeneration — E2B / E4B / 31B / 26B-A4B
Gemma4UnifiedForConditionalGeneration — the encoder-free 12B (needs transformers >= 5.10)

Gemma 4 layers are heterogeneous, so the adapter delegates all math to HF and maps variant-specific submodules optional: KV-shared layers drop k/v projections, K==V layers drop v_proj, and Per-Layer-Embedding / MoE submodules appear only on some variants. Unlike Gemma 1-3, Gemma4RMSNorm has no (1 + weight) offset.

Adds DelegatedAttentionBlockBridge (drops the split-QKV fork aliases, mirroring MLABlockBridge) so hook-alias resolution stays clean when attention is delegated wholesale to HF.

google/gemma-4-E2B-it passes verify_models (P1 100%, P2 100%, P4 94.7%).

Type of change

New feature (non-breaking change which adds functionality)

Checklist:

I have commented my code, particularly in hard-to-understand areas
My changes generate no new warnings
I have added tests that prove my feature works
New and existing unit tests pass locally with my changes
I have not rewritten tests relating to key interfaces which would affect backward compatibility

Adds a text-only adapter covering both Gemma4ForConditionalGeneration (E2B/E4B/31B/26B-A4B) and Gemma4UnifiedForConditionalGeneration (12B), addressing TransformerLensOrg#1297. Gemma 4 layers are heterogeneous: KV-shared layers drop k/v projections, K==V layers drop v_proj, and per-layer-embedding / MoE submodules appear only on some variants -- all mapped optional and delegated to HF. Unlike Gemma 1-3, Gemma4RMSNorm has no (1+weight) offset. Adds DelegatedAttentionBlockBridge (drops the split-QKV fork aliases, as MLABlockBridge does) so hook-alias resolution stays clean when attention is delegated wholesale to HF. google/gemma-4-E2B-it passes verification (P1 100%, P2 100%, P4 94.7%). - New adapter + four-place registration + gemma4/gemma4_unified model_type mappings - 10 checkpoints added to the model registry - Unit + integration tests (logit parity vs HF on all three structural variants)

jlarson4 · 2026-06-10T14:12:10Z

@punishell We do have a different contributor actively working on this already. Once his implementation is ready I'll review both and determine which is correct for TransformerLens.

We will also want full multimodal support, not just text only (See Gemma3ForConditional's architecture adapter for details on how that works)

punishell · 2026-06-10T14:16:09Z

I needed it for my current project so made it and pushed :)

…

On Wed, Jun 10, 2026 at 4:12 PM Jonah Larson ***@***.***> wrote: *jlarson4* left a comment (TransformerLensOrg/TransformerLens#1377) <#1377 (comment)> @punishell <https://github.com/punishell> We do have a different contributor actively working on this already. Once his implementation is ready I'll review both and determine which is correct for TransformerLens. We will also want full multimodal support, not just text only (See Gemma3ForConditional's architecture adapter for details on how that works) — Reply to this email directly, view it on GitHub <#1377?email_source=notifications&email_token=AH45ASP2M6WRKVLVBPKCACD47FUFDA5CNFSNUABFM5UWIORPF5TWS5BNNB2WEL2JONZXKZKDN5WW2ZLOOQXTINRXGEYTKNJQGM4KM4TFMFZW63VHNVSW45DJN5XKKZLWMVXHJLDGN5XXIZLSL5RWY2LDNM#issuecomment-4671155038>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AH45ASKAPPNU6A5RKQR2NVL47FUFDAVCNFSNUABFKJSXA33TNF2G64TZHM2TEOJTHE2DGNJVHNEXG43VMU5TINRTGEZDEMRTG42KC5QC> . Triage notifications, keep track of coding agent tasks and review pull requests on the go with GitHub Mobile for iOS <https://github.com/notifications/mobile/ios/AH45ASPRAF3FOPHMBBAM7BL47FUFDA5CNFSNUABFM5UWIORPF5TWS5BNNB2WEL2JONZXKZKDN5WW2ZLOOQXTINRXGEYTKNJQGM4KM4TFMFZW63VHNVSW45DJN5XKKZLWMVXHJKTGN5XXIZLSL5UW64Y> and Android <https://github.com/notifications/mobile/android/AH45ASIOJKSMZMLP3RVNO5D47FUFDA5CNFSNUABFM5UWIORPF5TWS5BNNB2WEL2JONZXKZKDN5WW2ZLOOQXTINRXGEYTKNJQGM4KM4TFMFZW63VHNVSW45DJN5XKKZLWMVXHJLTGN5XXIZLSL5QW4ZDSN5UWI>. Download it today! You are receiving this because you were mentioned.Message ID: ***@***.***>

jlarson4 · 2026-06-10T14:35:22Z

@punishell Oh that's wonderful! Glad to hear that it's working for you, and thank you for using TransformerLens!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add Gemma 4 architecture support to TransformerBridge#1377

Add Gemma 4 architecture support to TransformerBridge#1377
punishell wants to merge 1 commit into
TransformerLensOrg:devfrom
punishell:gemma4-support

punishell commented Jun 10, 2026

Uh oh!

jlarson4 commented Jun 10, 2026

Uh oh!

punishell commented Jun 10, 2026 via email

Uh oh!

jlarson4 commented Jun 10, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

punishell commented Jun 10, 2026

Description

Type of change

Checklist:

Uh oh!

jlarson4 commented Jun 10, 2026

Uh oh!

punishell commented Jun 10, 2026 via email

Uh oh!

jlarson4 commented Jun 10, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants