How much does the Gemini 2.5 Flash-Lite API cost?

Gemini 2.5 Flash-Lite costs $0.10 per million input tokens and $0.40 per million output tokens, with cached input at $0.03 per million. A workload of 10M input and 1.5M output tokens per month costs about $1.60 (10 × $0.10 + 1.5 × $0.40), assuming none of the input is served from cache.
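
The arithmetic behind that estimate can be sketched in a few lines of Python. This is only an illustration: the three prices are the ones quoted above, while the function name `monthly_cost` and its `cached_input_tokens` parameter are hypothetical. It assumes cached tokens are simply billed at the cached rate instead of the regular input rate, and it ignores any other charges a real bill might include.

```python
# Prices in USD per million tokens, as quoted in the answer above.
INPUT_PRICE = 0.10
OUTPUT_PRICE = 0.40
CACHED_INPUT_PRICE = 0.03

TOKENS_PER_MILLION = 1_000_000


def monthly_cost(input_tokens, output_tokens, cached_input_tokens=0):
    """Estimate a monthly bill in USD (hypothetical helper, not an official API).

    cached_input_tokens is the portion of input_tokens assumed to be
    billed at the cached rate rather than the regular input rate.
    """
    if cached_input_tokens > input_tokens:
        raise ValueError("cached_input_tokens cannot exceed input_tokens")
    fresh_input_tokens = input_tokens - cached_input_tokens
    total = (
        fresh_input_tokens * INPUT_PRICE
        + cached_input_tokens * CACHED_INPUT_PRICE
        + output_tokens * OUTPUT_PRICE
    )
    return total / TOKENS_PER_MILLION


# The workload from the answer above: 10M input, 1.5M output, no caching.
print(f"${monthly_cost(10_000_000, 1_500_000):.2f}")  # prints $1.60
```

Passing a non-zero `cached_input_tokens` shows how much a cache-heavy workload would save under the same assumptions.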

What is the context window of Gemini 2.5 Flash-Lite?

Gemini 2.5 Flash-Lite supports a context window of 1,048,576 tokens (1.0M), with up to 66K output tokens per response.

Is Gemini 2.5 Flash-Lite open source?

No. Gemini 2.5 Flash-Lite is a proprietary model, available only through Google's API and partner platforms.

What are the best alternatives to Gemini 2.5 Flash-Lite?

The closest alternatives by overall capability are Qwen3-Max, GPT-5.5, GPT-5, and OpenAI o3. See the comparison pages for detailed head-to-head breakdowns.

Gemini 2.5 Flash-Lite

Google · Jul 2025 · proprietary

Gemini 2.5 Flash-Lite is the cheapest Gemini 2.5 variant, built for translation, classification and other high-frequency, cost-sensitive jobs. With multimodal input and the 1M context window intact at $0.10/$0.40, it competes head-on with GPT-5 nano at the absolute bottom of the price curve, and it remains attractive even after the Gemini 3.1 Flash-Lite release for purely budget-driven workloads.

Benchmark results

No verified benchmark results are tracked yet for Gemini 2.5 Flash-Lite. This page is updated as official evaluations are published.

Where it shines

Among the lowest prices of any major-lab model
Multimodal + 1M context at the price floor
High rate limits for bulk processing

Alternatives to Gemini 2.5 Flash-Lite

Qwen3-Max

Alibaba's trillion-parameter API flagship — frontier-adjacent quality with strong agentic tool use at mid-tier prices.

GPT-5.5

OpenAI's flagship reasoning model with a 1M-token context window, built for hard coding, science and long-horizon agentic work.

GPT-5

OpenAI's August 2025 unified flagship that merged the GPT and o-series reasoning lines into one model with selectable effort.

OpenAI o3

OpenAI's dedicated 2025 reasoning model that pioneered thinking-with-images and agentic tool use within chain-of-thought.
