Gemini 2.5 Flash-Lite

By Google
Model family: Gemini
Gemini 2.5 Flash-Lite is a cost- and latency-optimized member of Google's Gemini 2.5 family, designed for high-volume tasks that need rapid responses, such as translation and classification. It shares the family's core capabilities: a 1-million-token context window, multimodal inputs (text, images, audio, video), and tool integration such as Grounding with Google Search and code execution. Thinking is off by default for maximum speed; it can be enabled for more complex reasoning, at a cost. The model also posts strong benchmark results on coding, math, science, and reasoning tasks.
Released: July 22, 2025

Overview

Gemini 2.5 Flash-Lite is our most balanced Gemini model, optimized for low-latency use cases. It comes with the same capabilities that make other Gemini 2.5 models helpful: the ability to turn thinking on at different budgets, connections to tools such as Grounding with Google Search and code execution, multimodal input, and a 1-million-token context length.
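
As a rough illustration of "thinking on at different budgets" and tool connections, the sketch below only builds request bodies and sends nothing. The endpoint shape, the model identifier, and the field names (thinkingConfig, thinkingBudget, google_search) are assumptions based on the public Gemini REST API as commonly documented, and build_request is a hypothetical helper; check the current API reference before relying on any of them.

```python
import json

# Assumed model identifier and endpoint shape; verify both against the current API docs.
MODEL = "gemini-2.5-flash-lite"
ENDPOINT = (
    "https://generativelanguage.googleapis.com/v1beta/"
    f"models/{MODEL}:generateContent"
)


def build_request(prompt, thinking_budget=0, use_search=False):
    """Build a generateContent-style request body (hypothetical helper).

    thinking_budget=0 mirrors the default "thinking off" state; a positive
    value requests a reasoning budget, measured in tokens. Field names are
    assumptions, not confirmed by the page above.
    """
    body = {
        "contents": [{"parts": [{"text": prompt}]}],
        "generationConfig": {
            "thinkingConfig": {"thinkingBudget": thinking_budget},
        },
    }
    if use_search:
        # Assumed tool name for Grounding with Google Search.
        body["tools"] = [{"google_search": {}}]
    return body


# High-volume classification: keep thinking off for speed.
fast = build_request(
    "Classify this review as positive or negative: 'Great battery life.'"
)

# Harder question: allow some thinking and search grounding.
careful = build_request(
    "Which of these two proofs is valid, and why?",
    thinking_budget=1024,
    use_search=True,
)

print(json.dumps(fast, indent=2))
```

The two calls differ only in budget and tools, which matches the trade-off described above: the zero-budget request favors latency, while the second accepts extra cost for more reasoning.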

About Google

At Google, we think that AI can meaningfully improve people's lives and that the biggest impact will come when everyone can access it.

Industry: Research
Company Size: 182,000-190,000
Location: Mountain View, CA, US
Website: ai.google

Tools using Gemini 2.5 Flash-Lite

Last updated: February 26, 2026
#
Name
Task