Introducing Gemini 3.1 Flash-Lite, the fastest and most cost-effective model in the Gemini 3 series, built for high-volume developer workloads. The model is now available in preview through the Gemini API in Google AI Studio and, for enterprises, through Vertex AI.
Designed for cost efficiency without compromising quality, Gemini 3.1 Flash-Lite is priced at $0.25 per million input tokens and $1.50 per million output tokens. Compared with its predecessor, 2.5 Flash, it delivers a 2.5x faster Time to First Answer Token and 45% faster output speed, according to the Artificial Analysis benchmark, while maintaining similar or better quality. This low latency matters for high-frequency workflows and makes the model well suited to responsive, real-time applications.
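To make the pricing concrete, here is a minimal sketch of a cost estimate based only on the two per-token prices quoted above. The function name `estimate_cost_usd` and the example workload (request count and token sizes) are hypothetical, and the calculation assumes a bill is simply tokens multiplied by the listed rate; preview pricing may change, and any other pricing factors are ignored.

```python
# Prices in USD per one million tokens, as quoted in this announcement.
INPUT_PRICE_PER_MILLION = 0.25
OUTPUT_PRICE_PER_MILLION = 1.50


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate cost in USD, assuming cost scales linearly with token counts."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        input_tokens * INPUT_PRICE_PER_MILLION
        + output_tokens * OUTPUT_PRICE_PER_MILLION
    )
    return total / 1_000_000


# Hypothetical workload: 1,000,000 requests per day,
# each with 500 input tokens and 100 output tokens.
requests_per_day = 1_000_000
daily_cost = estimate_cost_usd(
    input_tokens=requests_per_day * 500,
    output_tokens=requests_per_day * 100,
)
print(f"Estimated daily cost: ${daily_cost:,.2f}")  # Estimated daily cost: $275.00
```

In this illustrative workload, output tokens account for more of the bill ($150) than input tokens ($125) even though there are five times fewer of them, because the output rate is six times the input rate.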
