Google: Gemma 3 12B

GoogleID: google/gemma-3-12b-it

Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities, including structured outputs and function calling. Gemma 3 12B is the second largest in the family of Gemma 3 models after [Gemma 3 27B](google/gemma-3-27b-it)

Pricing per 1M Tokens

Input (Prompt)$0.04
Output (Completion)$0.13
Cache ReadFree
Cache WriteFree
ImageN/A

Specifications

Context Length131K
Max Output TokensN/A
Input ModalitiesText + Image
Output ModalitiesText
TokenizerGemini
Instruct Typegemma
Top Provider Context131K
Top Provider Max OutputN/A
ModeratedNo

More from Google

Last updated: March 23, 2026

First tracked: March 23, 2026