ReleaseMarch 3, 2026

Google Launches Gemini 3.1 Flash-Lite — Cheapest Frontier Model Yet

Google released its most cost-efficient model yet at just $0.25 per million input tokens.

Gemini 3.1 Flash-Lite delivers 2.5x faster time-to-first-token and 45% faster output than Gemini 2.5 Flash while matching its quality. It supports multimodal prompts with up to 1 million tokens of context.

For high-volume production workloads, this model offers an exceptional price-to-performance ratio. At $0.25/1M input tokens, it undercuts most competitors in its capability tier.

Check our cost calculator to see how Flash-Lite compares for your usage patterns.