How much does the Llama 4 Maverick API cost?

Llama 4 Maverick costs $0.27 per million input tokens and $0.85 per million output tokens. A workload of 10M input and 1.5M output tokens per month costs about $3.98.

What is the context window of Llama 4 Maverick?

Llama 4 Maverick supports a context window of 1,000,000 tokens (1M).

Is Llama 4 Maverick open source?

Yes — Llama 4 Maverick is an open-weights model released under the Llama 4 Community License license, so it can be downloaded and self-hosted.

What are the best alternatives to Llama 4 Maverick?

The closest alternatives by overall capability are Qwen3-Max, GPT-5.5, Gemini 3 Pro, Gemini 2.5 Pro. See the comparison pages for detailed head-to-head breakdowns.

Llama 4 Maverick

Name: Llama 4 Maverick
Price: 0.27 USD
Author: Meta

Meta·Apr 2025open weights · Llama 4 Community License

Llama 4 Maverick is Meta's flagship open(ish)-weights model: a 400B-parameter mixture-of-experts with 17B active, natively multimodal, with a 1M-token context. Its launch reception was mixed against benchmark expectations, and the 2025–26 Chinese open-weights wave has eclipsed it on raw capability — but its US origin, enormous host availability and enterprise-friendly story keep it a fixture in corporate evaluations. Note the community license restricts the very largest deployers.

Benchmark results

Where it shines

US-origin open weights — eases some enterprise compliance
Native image understanding
Served by virtually every inference host at low prices

Alternatives to Llama 4 Maverick

Qwen3-Max

Alibaba's trillion-parameter API flagship — frontier-adjacent quality with strong agentic tool use at mid-tier prices.

GPT-5.5

OpenAI's flagship reasoning model with a 1M-token context window, built for hard coding, science and long-horizon agentic work.

Gemini 3 Pro

Google's November 2025 frontier breakout — 91.9% GPQA Diamond and 37.5% HLE made it the reasoning leader of its generation.

Gemini 2.5 Pro

Google's 2025 workhorse flagship — first mainstream thinking model with a 1M context, still widely deployed.

Frequently asked questions

How much does the Llama 4 Maverick API cost?: Llama 4 Maverick costs $0.27 per million input tokens and $0.85 per million output tokens. A workload of 10M input and 1.5M output tokens per month costs about $3.98.
What is the context window of Llama 4 Maverick?: Llama 4 Maverick supports a context window of 1,000,000 tokens (1M).
Is Llama 4 Maverick open source?: Yes — Llama 4 Maverick is an open-weights model released under the Llama 4 Community License license, so it can be downloaded and self-hosted.
What are the best alternatives to Llama 4 Maverick?: The closest alternatives by overall capability are Qwen3-Max, GPT-5.5, Gemini 3 Pro, Gemini 2.5 Pro. See the comparison pages for detailed head-to-head breakdowns.