Google has released its most cost-efficient model yet, priced at $0.25 per million input tokens.
Gemini 3.1 Flash-Lite delivers 2.5x faster time-to-first-token and 45% faster output than Gemini 2.5 Flash while matching its quality. It supports multimodal prompts with up to 1 million tokens of context.
For high-volume production workloads, the lower latency combined with the $0.25 per 1M input token price gives this model a strong price-to-performance ratio. That input price undercuts most competitors in its capability tier.
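To make the input-side arithmetic concrete, here is a minimal Python sketch. It uses only the $0.25 per 1M input token figure quoted above and assumes that rate is flat across prompt sizes. Output-token pricing is not stated here, so it is left out. The function name and the workload numbers (200,000 requests per day, 1,500 input tokens each) are hypothetical, chosen purely for illustration.

```python
from decimal import Decimal

# Input price quoted in the text: $0.25 per 1 million input tokens.
# Assumption: the rate is flat regardless of prompt size.
INPUT_PRICE_PER_MILLION_USD = Decimal("0.25")


def input_cost_usd(input_tokens: int) -> Decimal:
    """Return the input-token cost in USD for a given token count."""
    if input_tokens < 0:
        raise ValueError("input_tokens must be non-negative")
    return INPUT_PRICE_PER_MILLION_USD * input_tokens / Decimal(1_000_000)


# Hypothetical workload, not taken from the source.
requests_per_day = 200_000
avg_input_tokens_per_request = 1_500

daily_input_cost = input_cost_usd(requests_per_day * avg_input_tokens_per_request)
print(f"Estimated daily input cost: ${daily_input_cost:.2f}")
```

With these made-up numbers the workload sends 300 million input tokens per day, which comes to $75.00 of input cost. A real estimate would also need the output-token price and your actual traffic.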
Check our cost calculator to see how Flash-Lite compares for your usage patterns.