Google: Gemma 3n 2B (free)
FreeGoogleID: google/gemma-3n-e2b-it:free
Gemma 3n E2B IT is a multimodal, instruction-tuned model developed by Google DeepMind, designed to operate efficiently at an effective parameter size of 2B while leveraging a 6B architecture. Based on the MatFormer architecture, it supports nested submodels and modular composition via the Mix-and-Match framework. Gemma 3n models are optimized for low-resource deployment, offering 32K context length and strong multilingual and reasoning performance across common benchmarks. This variant is trained on a diverse corpus including code, math, web, and multimodal data.
Pricing per 1M Tokens
| Input (Prompt) | Free |
| Output (Completion) | Free |
| Cache Read | Free |
| Cache Write | Free |
| Image | N/A |
Specifications
| Context Length | 8K |
| Max Output Tokens | 2K |
| Input Modalities | Text |
| Output Modalities | Text |
| Tokenizer | Other |
| Instruct Type | N/A |
| Top Provider Context | 8K |
| Top Provider Max Output | 2K |
| Moderated | No |
Compare this model
See how Google: Gemma 3n 2B (free) stacks up against other models.
More from Google
Last updated: March 23, 2026
First tracked: March 23, 2026