Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add Gemma 3 #9250

Open
jackzhxng opened this issue Mar 14, 2025 · 0 comments
Open

Add Gemma 3 #9250

jackzhxng opened this issue Mar 14, 2025 · 0 comments
Labels
module: examples Issues related to demos under examples/

Comments

@jackzhxng
Copy link
Contributor

jackzhxng commented Mar 14, 2025

🚀 The feature, motivation and pitch

Add support for the text decoder backbone of the new Gemma 3 (1B / 4B for edge) model.

The model architecture should slide right into our llama_transformer.py, with the exception of the interspersed sliding window local attention layers as specified in the technical report, which will require some modifications to our model code. Luckily the way this is done is through slicing the attention mask and not the kv cache, so we can keep using our static kv cache implementation. The local / global attention mechanism uses a ring buffer (dynamic kc cache) for the local layers. We will need to enable ring buffer on ET first.

Checkpoints are on HuggingFace:

Optional - after adding Gemma 3, it should be pretty quick to add Gemma 2 2B as well, which is a pretty popular edge model in the local LLM community.

RFC (Optional)

#8228

cc @mergennachin @iseeyuan @lucylq @helunwencser @tarun292 @kimishpatel

@jackzhxng jackzhxng added good first issue Good for newcomers module: examples Issues related to demos under examples/ labels Mar 14, 2025
@github-project-automation github-project-automation bot moved this to To triage in ExecuTorch Core Mar 14, 2025
@jackzhxng jackzhxng moved this from To triage to Backlog in ExecuTorch Core Mar 14, 2025
@jackzhxng jackzhxng moved this from Backlog to To triage in ExecuTorch Core Mar 14, 2025
@jackzhxng jackzhxng removed the good first issue Good for newcomers label Mar 17, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
module: examples Issues related to demos under examples/
Projects
Status: To triage
Status: No status
Development

No branches or pull requests

1 participant