[DO NOT MERGE] Feat/nnx set defaults true g3 test by ecnal-cienet · Pull Request #4146 · AI-Hypercomputer/maxtext

ecnal-cienet · 2026-06-11T20:06:43Z

Description

This PR is used for testing Google internal G3 tests.

Checklist

Before submitting this PR, please make sure (put X in square brackets):

I have performed a self-review of my code. For an optional AI review, add the gemini-review label.
I have necessary comments in my code, particularly in hard-to-understand areas.
I have run end-to-end tests tests and provided workload links above if applicable.
I have made or will make corresponding changes to the doc if needed, including adding new documentation pages to the relevant Table of Contents (toctree directive) as explained in our documentation.

The PR6-PR10 sequence promoted every routed-to-Linen feature to NNX-native (DPO/PR6, MaxEngine/PR7, LoRA+GRPO/PR8, QK-Clip + checkpoint utilities/PR9, AQT + serve-mode/PR9.5, vocab tiling custom_vjp/PR10). With those gaps closed, NNX is the production path; this commit makes it the default. Empirical break-test on CPU (pytest before/after the flip across tiling_test, train_compile_test, sharding_compare_test, maxtext_utils_test, maxengine_test) showed zero flip-induced failures - every CPU unit-test failure pre-existed on PR10 tip. TPU smoke verified end-to-end: gemma2-2b 3-step train under the new defaults logged "pure_nnx: True" in pyconfig and produced loss 13.04 -> 12.32 -> 11.82 (decreasing, no NaN/inf, no Traceback). Linen-only test files were already pinned in the prior commit so no per-test breakage from the flip. base.yml: enable_nnx, pure_nnx_decoder, pure_nnx all flip False -> True. No use_nnx_pipeline flag is added: PR10 tip has no NNX pipeline path to opt out of, so a one-valued flag would be dead weight. Pipeline tests keep their Linen pin from the prior commit; the eventual NNX pipeline work (PR11.5) will introduce its own opt-in if needed. Sharding goldens not regenerated: tests/unit/sharding_compare_test.py already pins enable_nnx=False, pure_nnx=False, pure_nnx_decoder=False explicitly when invoking the dump utility, so existing goldens at tests/utils/sharding_info/ stay valid against the flipped default.

codecov · 2026-06-11T20:16:37Z

Codecov Report

❌ Patch coverage is 50.00000% with 127 lines in your changes missing coverage. Please review.

Files with missing lines	Patch %	Lines
src/maxtext/checkpoint_conversion/utils/utils.py	0.00%	19 Missing ⚠️
src/maxtext/checkpoint_conversion/to_maxtext.py	0.00%	13 Missing ⚠️
src/maxtext/common/checkpointing.py	14.28%	11 Missing and 1 partial ⚠️
...axtext/trainers/post_train/sft/train_sft_native.py	7.69%	12 Missing ⚠️
src/maxtext/trainers/diloco/diloco.py	60.71%	8 Missing and 3 partials ⚠️
src/maxtext/trainers/pre_train/train.py	0.00%	11 Missing ⚠️
src/maxtext/trainers/pre_train/train_compile.py	58.33%	10 Missing ⚠️
...rc/maxtext/utils/generate_param_only_checkpoint.py	0.00%	10 Missing ⚠️
src/maxtext/utils/model_creation_utils.py	0.00%	7 Missing ⚠️
src/maxtext/utils/train_utils.py	0.00%	6 Missing ⚠️
... and 5 more

📢 Thoughts on this report? Let us know!

ecnal-cienet changed the title ~~Feat/nnx set defaults true g3 test~~ [DO NOT MERGE] Feat/nnx set defaults true g3 test Jun 11, 2026

ecnal-cienet force-pushed the feat/nnx-set-defaults-true-g3-test branch from b6622f3 to 152195e Compare June 11, 2026 20:12

ecnal-cienet marked this pull request as ready for review June 11, 2026 20:12

ecnal-cienet requested review from darisoy, dipannita08 and igorts-git as code owners June 11, 2026 20:12

ecnal-cienet added the pull ready label Jun 11, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[DO NOT MERGE] Feat/nnx set defaults true g3 test#4146

[DO NOT MERGE] Feat/nnx set defaults true g3 test#4146
ecnal-cienet wants to merge 1 commit into
mainfrom
feat/nnx-set-defaults-true-g3-test

ecnal-cienet commented Jun 11, 2026 •

edited

Loading

Uh oh!

codecov Bot commented Jun 11, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

ecnal-cienet commented Jun 11, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Checklist

Uh oh!

codecov Bot commented Jun 11, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

ecnal-cienet commented Jun 11, 2026 •

edited

Loading

codecov Bot commented Jun 11, 2026 •

edited

Loading