Gemma3 is Torch Exportable #37728

guangy10 · 2025-04-23T23:50:18Z

What does this PR do?

Initial effort to add torch.expport support for the Gemma3 model!
Gemma3 provides a 1b variant that is suitable for ExecuTorch to bring it for on-device use-case. This PR is focusing on creating the export recipe and validate the exported model can produce same output as eager.

Expand support to other models that utilize HybridCache as well including gemma2 and cohere2.

End2end Test Validation with Exported Graph

RUN_SLOW=1 pytest tests/models/gemma3/test_modeling_gemma3.py -s -v -k test_export_text_only_with_hybrid_cache

Export generated texts: 'What is the capital of France?

The capital of France is Paris.

Final Answer: The final answer is $\boxed{Paris'


Eager generated texts: 'What is the capital of France?

The capital of France is Paris.

Final Answer: The final answer is $\boxed{Paris'

PASSED
======================================================= 1 passed, 319 deselected, 108 warnings in 26.95s =======================================================

Ene2end Validation in Optimum-ExecuTorch

Before submitting

Did you read the contributor guideline,
Pull Request section?
Was this discussed/approved via a Github issue or the forum? Please add a link
to it if that's the case. Gemma3 is ExecuTorch compatible #37727
Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

@ArthurZucker @gante @qubvel

github-actions · 2025-04-23T23:50:34Z

Hi 👋, thank you for opening this pull request! The pull request is converted to draft by default. The CI will be paused while the PR is in draft mode. When it is ready for review, please click the Ready for review button (at the bottom of the PR page). This will assign reviewers and trigger CI.

tests/models/gemma3/test_modeling_gemma3.py

src/transformers/models/gemma3/modeling_gemma3.py

guangy10 · 2025-04-24T00:02:03Z

CC: @tugsbayasgalan for review as well

guangy10 · 2025-04-24T18:05:20Z

removed unused buffer

guangy10 · 2025-04-24T20:54:11Z

Fixed in the ExportableModule to make gemma3 lowerable to ExecuTorch

guangy10 · 2025-04-25T01:16:22Z

I've ran make fixup but it doesn't fixed the linter.

tests/models/gemma3/test_modeling_gemma3.py

Cyrilvallez · 2025-04-25T13:10:53Z

Hey @guangy10! Thanks for the PR! The changes LGTM, however the gemma3 change should be reflected in modular_gemma3.py, which is the source of modeling_gemma3.py! 🤗
Any way you could downstream the small change to other models using Hybridcache as well? They should be the models that were changed in #37447 👌

Cyrilvallez · 2025-04-25T13:11:27Z

It's the source of the issue in check_repo_consistency 😉

Cyrilvallez · 2025-04-25T13:12:45Z

cc @gante as well for viz!

guangy10 · 2025-04-25T17:58:54Z

Hey @guangy10! Thanks for the PR! The changes LGTM, however the gemma3 change should be reflected in modular_gemma3.py, which is the source of modeling_gemma3.py! 🤗 Any way you could downstream the small change to other models using Hybridcache as well? They should be the models that were changed in #37447 👌

Done. Could you take another look?

Cyrilvallez

LGTM, thanks a lot! Great work, super clean 🤗
Merging

* Gemma3 is Torch Exportable * Expand the support to other mdoels using HybridCache --------- Co-authored-by: Guang Yang <guangyang@fb.com>

github-actions bot marked this pull request as draft April 23, 2025 23:50

guangy10 mentioned this pull request Apr 23, 2025

Export to ExecuTorch #32253

Open

33 tasks

guangy10 force-pushed the gemma3_executorch branch from 41ccd3d to e120479 Compare April 23, 2025 23:53

guangy10 commented Apr 23, 2025

View reviewed changes

tests/models/gemma3/test_modeling_gemma3.py Outdated Show resolved Hide resolved

guangy10 commented Apr 23, 2025

View reviewed changes

src/transformers/models/gemma3/modeling_gemma3.py Show resolved Hide resolved

guangy10 force-pushed the gemma3_executorch branch from e120479 to c3a03f9 Compare April 24, 2025 00:01

guangy10 marked this pull request as ready for review April 24, 2025 00:09

guangy10 force-pushed the gemma3_executorch branch from c3a03f9 to c126d47 Compare April 24, 2025 18:04

guangy10 force-pushed the gemma3_executorch branch from c126d47 to 98df630 Compare April 24, 2025 20:53

guangy10 force-pushed the gemma3_executorch branch 6 times, most recently from 13dbffd to 6ec580f Compare April 25, 2025 01:09

guangy10 mentioned this pull request Apr 25, 2025

[Gemma3] compile ✨ #37447

Merged

2 tasks

guangy10 force-pushed the gemma3_executorch branch from 6ec580f to 6f389db Compare April 25, 2025 01:18

guangy10 commented Apr 25, 2025

View reviewed changes

tests/models/gemma3/test_modeling_gemma3.py Outdated Show resolved Hide resolved

Gemma3 is Torch Exportable

bff3c2f

guangy10 force-pushed the gemma3_executorch branch from 6f389db to bff3c2f Compare April 25, 2025 01:20

Expand the support to other mdoels using HybridCache

ae84d3f

guangy10 force-pushed the gemma3_executorch branch from 0729eff to ae84d3f Compare April 25, 2025 19:34

guangy10 mentioned this pull request Apr 25, 2025

Gemma3 Support huggingface/optimum-executorch#57

Open

Cyrilvallez approved these changes Apr 28, 2025

View reviewed changes

Cyrilvallez merged commit 816b370 into huggingface:main Apr 28, 2025
13 checks passed

This was referenced Apr 28, 2025

Allow override inputs to export recipe #37508

Open

Add support for Gemma3 huggingface/optimum-executorch#58

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Gemma3 is Torch Exportable #37728

Gemma3 is Torch Exportable #37728

guangy10 commented Apr 23, 2025 •

edited

Loading

github-actions bot commented Apr 23, 2025

guangy10 commented Apr 24, 2025

guangy10 commented Apr 24, 2025 •

edited

Loading

guangy10 commented Apr 24, 2025

guangy10 commented Apr 25, 2025

Cyrilvallez commented Apr 25, 2025

Cyrilvallez commented Apr 25, 2025

Cyrilvallez commented Apr 25, 2025

guangy10 commented Apr 25, 2025

Cyrilvallez left a comment

Gemma3 is Torch Exportable #37728

Gemma3 is Torch Exportable #37728

Conversation

guangy10 commented Apr 23, 2025 • edited Loading

What does this PR do?

End2end Test Validation with Exported Graph

Ene2end Validation in Optimum-ExecuTorch

Before submitting

Who can review?

github-actions bot commented Apr 23, 2025

guangy10 commented Apr 24, 2025

guangy10 commented Apr 24, 2025 • edited Loading

guangy10 commented Apr 24, 2025

guangy10 commented Apr 25, 2025

Cyrilvallez commented Apr 25, 2025

Cyrilvallez commented Apr 25, 2025

Cyrilvallez commented Apr 25, 2025

guangy10 commented Apr 25, 2025

Cyrilvallez left a comment

Choose a reason for hiding this comment

guangy10 commented Apr 23, 2025 •

edited

Loading

guangy10 commented Apr 24, 2025 •

edited

Loading