Kimi K2 Thinking

Moonshot AI·Nov 2025reasoningopen weights · Modified MIT

67.7

modhub Index

Kimi K2 Thinking stunned the field in November 2025: an open-weights trillion-parameter MoE (32B active) that matched or beat closed flagships on agentic benchmarks like Humanity's Last Exam with tools, and could sustain hundreds of sequential tool calls without drifting. Trained for a reported $4.6M, it became the strongest argument that open models had caught the closed frontier. It remains a top choice for self-hosted deep-research and agent pipelines.

Benchmark results

Where it shines

  • Elite agentic search and tool-use endurance
  • Open weights at trillion-parameter scale
  • INT4 quantization-aware training for practical serving

Alternatives to Kimi K2 Thinking

Frequently asked questions

How much does the Kimi K2 Thinking API cost?
Kimi K2 Thinking costs $0.6 per million input tokens and $2.5 per million output tokens, with cached input at $0.15 per million. A workload of 10M input and 1.5M output tokens per month costs about $9.75.
What is the context window of Kimi K2 Thinking?
Kimi K2 Thinking supports a context window of 262,144 tokens (262K).
Is Kimi K2 Thinking open source?
Yes — Kimi K2 Thinking is an open-weights model released under the Modified MIT license, so it can be downloaded and self-hosted.
What are the best alternatives to Kimi K2 Thinking?
The closest alternatives by overall capability are DeepSeek V3.2, OpenAI o3, Claude Sonnet 4.5, GPT-5. See the comparison pages for detailed head-to-head breakdowns.