@GoogleAI
Smarter. Faster. Gemini 3.1 Flash-Lite is here⚡ The model offers uncompromising speed & intelligence at scale by focusing on: — Cost-efficiency: Priced at just $0.25/1M input and $1.50/1M output tokens, it gets work done faster at a fraction of the cost of larger models, including a 45% increase in output speed compared to Gemini 2.5 Flash — Compute control: 3.1 Flash-Lite comes with flexible thinking levels so you can actively select how much the model “thinks” for a given task, based on the complexity of your project — Scalable intelligence for building: The model can handle high volume tasks at scale while keeping costs down *and* can tackle complex workloads where in-depth reasoning is needed, like generating user interfaces or detailed instruction following 3.1 Flash-Lite is rolling out in preview today via the Gemini API in @GoogleAIStudio and Vertex AI.