GPT-4o mini vs Qwen3-Max

Benchmarks, API pricing and specs, head to head. Data updated 2026-06-10.

GPT-4o mini

OpenAI · Jul 2024

The 2024 budget classic that defined the cheap-LLM tier; GPT-5 nano and mini have since taken its crown.

Qwen3-Max

Alibaba (Qwen) · Sep 2025

modhub Index: 80.5

Alibaba's trillion-parameter API flagship — frontier-adjacent quality with strong agentic tool use at mid-tier prices.

The verdict

These two models don't yet share verified results on the benchmarks we track, so judge them on specs, pricing and intended use. On price the gap is dramatic: GPT-4o mini works out roughly 9.1x cheaper per blended million tokens. Qwen3-Max, for its part, offers a larger context window: 262K tokens versus 128K for GPT-4o mini.
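The page does not say how the "blended" price is weighted. As a rough sketch, assuming a 3:1 input-to-output token mix (a common convention, but an assumption here, not something stated above), the list prices from this page reproduce a ratio of about 9.1x. The function name `blended_price` and its weight parameters are hypothetical.

```python
# Sketch only: the 3:1 input:output weighting is an assumption, not taken from the page.
def blended_price(input_price, output_price, input_weight=3.0, output_weight=1.0):
    """Weighted average price per 1M tokens for a given input:output mix."""
    total_weight = input_weight + output_weight
    return (input_price * input_weight + output_price * output_weight) / total_weight

# List prices per 1M tokens (input, output) as quoted on this page.
gpt4o_mini_blended = blended_price(0.15, 0.60)
qwen3_max_blended = blended_price(1.20, 6.00)

# How many times more expensive Qwen3-Max is per blended 1M tokens.
ratio = qwen3_max_blended / gpt4o_mini_blended
print(f"GPT-4o mini: ${gpt4o_mini_blended:.4f} per blended 1M tokens")
print(f"Qwen3-Max:   ${qwen3_max_blended:.4f} per blended 1M tokens")
print(f"Ratio: {ratio:.1f}x")
```

A different mix changes the figure: workloads that generate more output per request would push the ratio closer to the 10x gap in output prices.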

Specs & pricing

                         GPT-4o mini    Qwen3-Max
modhub Index             -              80.5
Input price / 1M         $0.15          $1.20
Output price / 1M        $0.60          $6.00
Context window           128K           262K
Max output               16K            66K
Open weights             no             no
Reasoning model          no             yes
Multimodal input         text, image    text
Knowledge cutoff         Oct 2023       Jun 2025
Released                 Jul 2024       Sep 2025
Example monthly cost*    $2.40          $21.00

* 10M input + 1.5M output tokens per month at list prices, no caching.
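The example monthly cost follows directly from the list prices and the workload in the footnote. A minimal sketch of that arithmetic, using only the figures on this page (the names `PRICES` and `monthly_cost` are illustrative):

```python
# List prices per 1M tokens as (input, output), copied from the table above.
PRICES = {
    "GPT-4o mini": (0.15, 0.60),
    "Qwen3-Max": (1.20, 6.00),
}

def monthly_cost(input_millions, output_millions, input_price, output_price):
    """Cost in USD for a month's token volume, given in millions of tokens."""
    return input_millions * input_price + output_millions * output_price

# Footnote workload: 10M input + 1.5M output tokens per month, no caching.
costs = {
    name: monthly_cost(10, 1.5, input_price, output_price)
    for name, (input_price, output_price) in PRICES.items()
}

for name, cost in costs.items():
    print(f"{name}: ${cost:.2f} per month")
```

Swapping in your own token volumes gives a closer estimate for a real workload; caching discounts are ignored here, as in the footnote.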

Frequently asked questions

Which is better, GPT-4o mini or Qwen3-Max?
These two models don't yet share verified results on the benchmarks we track, so judge them on specs, pricing and intended use. On price the gap is dramatic: GPT-4o mini works out roughly 9.1x cheaper per blended million tokens. Qwen3-Max, for its part, offers a larger context window: 262K tokens versus 128K for GPT-4o mini.
Which is cheaper, GPT-4o mini or Qwen3-Max?
GPT-4o mini costs $0.15 per million input tokens and $0.60 per million output tokens, while Qwen3-Max costs $1.20 and $6.00. For a typical workload of 10M input and 1.5M output tokens per month, that's $2.40 versus $21.00 at list prices.
Which model is better for coding, GPT-4o mini or Qwen3-Max?
We don't yet track SWE-bench Verified results for both models; check their individual pages for coding-related scores.
