Google: Gemma 3n 2B (free)

Free
GoogleID: google/gemma-3n-e2b-it:free

Gemma 3n E2B IT is a multimodal, instruction-tuned model developed by Google DeepMind, designed to operate efficiently at an effective parameter size of 2B while leveraging a 6B architecture. Based on the MatFormer architecture, it supports nested submodels and modular composition via the Mix-and-Match framework. Gemma 3n models are optimized for low-resource deployment, offering 32K context length and strong multilingual and reasoning performance across common benchmarks. This variant is trained on a diverse corpus including code, math, web, and multimodal data.

Pricing per 1M Tokens

Input (Prompt)Free
Output (Completion)Free
Cache ReadFree
Cache WriteFree
ImageN/A

Specifications

Context Length8K
Max Output Tokens2K
Input ModalitiesText
Output ModalitiesText
TokenizerOther
Instruct TypeN/A
Top Provider Context8K
Top Provider Max Output2K
ModeratedNo

More from Google

Last updated: March 23, 2026

First tracked: March 23, 2026