Google has released new preview versions of its Gemini 2.5 Flash and Gemini 2.5 Flash-Lite models, tuned for better quality and efficiency. Both models respond faster, consume fewer tokens, and post better benchmark results, with Gemini 2.5 Flash scoring 54% on SWE-Bench Verified. Gemini 2.5 Flash-Lite follows complex instructions more reliably and gives shorter, more accurate answers, which lowers cost and latency. The update also improves handling of multimodal tasks such as audio transcription and image analysis. The larger Gemini 2.5 Flash model makes more effective use of external tools in complex, multi-step tasks. Both models are available through Google AI Studio and Vertex AI, and aliases such as gemini-flash-latest point to the newest version, so applications pick up updates without a code change. Pricing is unchanged, so the new versions cost no more to deploy than the previous ones. For applications that need stable behavior, Google recommends the fixed model names gemini-2.5-flash and gemini-2.5-flash-lite.
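The choice between a moving alias and a fixed model name can be kept in one place in application code. The following is a minimal sketch, not part of any Google SDK: the helper `choose_model` and the two lookup tables are hypothetical, and only the three model strings named above are used. The returned string would be passed as the model name to whichever client the application uses.

```python
# Hypothetical helper: pick a model name string for a deployment.
# Only the identifiers mentioned in the article are listed here.

# Fixed names, recommended when behavior must stay stable.
STABLE_MODELS = {
    "flash": "gemini-2.5-flash",
    "flash-lite": "gemini-2.5-flash-lite",
}

# Moving aliases that follow the newest release.
# The article names only the Flash alias, so only that one appears.
LATEST_ALIASES = {
    "flash": "gemini-flash-latest",
}


def choose_model(family: str, track_latest: bool = False) -> str:
    """Return the model name for a family ("flash" or "flash-lite").

    With track_latest=True the moving alias is returned when one is
    listed; otherwise the function falls back to the fixed name.
    """
    if family not in STABLE_MODELS:
        raise ValueError(f"unknown model family: {family!r}")
    if track_latest and family in LATEST_ALIASES:
        return LATEST_ALIASES[family]
    return STABLE_MODELS[family]


if __name__ == "__main__":
    print(choose_model("flash"))                     # fixed name for production
    print(choose_model("flash", track_latest=True))  # alias for experiments
```

Defaulting to the fixed name reflects the advice above: a deployment tracks the newest release only when it opts in explicitly.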