Department · Specimens34 flagship models
The flagship
models, indexed.
Origin, capabilities, context, $/M token, open or closed. Filter, sort, or browse the curated lineup by region.
Curated
By origin.
Claude Sonnet 4
USAThe workhorse Claude tier — extended thinking at a fraction of Opus pricing.
Anthropic200K ctx$3.00 / $15.00
Claude Opus 4
USAAnthropic's frontier model with extended thinking, leading SWE-bench Verified.
Anthropic200K ctx$15.00 / $75.00
Gemini 2.5 Pro
USAGoogle's deep-thinking flagship with a 1M-token context window.
Google DeepMind1M ctx$1.25 / $10.00
Grok 3
USAxAI's flagship with real-time X integration and a Think reasoning mode.
xAI128K ctx$3.00 / $15.00
China
All China modelsGLM-4.5
ChinaZhipu's flagship — agentic-first MoE with strong coding + tool-use benchmarks.
Zhipu AI128K ctx$0.60 / $2.20Open
Qwen3-Coder
ChinaOpen-weights coding specialist — 480B MoE, agentic by design.
Alibaba Cloud256K ctx$0.60 / $2.40Open
Kimi K2
ChinaOpen-weights 1T-parameter MoE — agentic, long-context, the model behind Kimi Chat.
Moonshot AI128K ctx$0.60 / $2.50Open
MiniMax-M1
ChinaOpen-weights 456B MoE with the largest free-tier context window in production: 1M tokens.
MiniMax1M ctx$0.30 / $1.65Open
FamilyAllBaichuanClaudeCodestralCommandDeepSeekDoubaoERNIEGLMGPTGPT-4GeminiGrokHunyuanKimiLlamaMiniMaxMistralPhiQwenSoraStepYio-series