Department · Specimens34 flagship models

The flagship
models, indexed.

Origin, capabilities, context, $/M token, open or closed. Filter, sort, or browse the curated lineup by region.

OriginAll USA14 China17 Europe2 UK0 Israel0 Canada0 Other1·ModalityAny text vision audio video Open weights

Curated

By origin.

USA

All USA models

Claude Sonnet 4

USA

The workhorse Claude tier — extended thinking at a fraction of Opus pricing.

Anthropic200K ctx$3.00 / $15.00

Claude Opus 4

USA

Anthropic's frontier model with extended thinking, leading SWE-bench Verified.

Anthropic200K ctx$15.00 / $75.00

Gemini 2.5 Pro

USA

Google's deep-thinking flagship with a 1M-token context window.

Google DeepMind1M ctx$1.25 / $10.00

Grok 3

USA

xAI's flagship with real-time X integration and a Think reasoning mode.

xAI128K ctx$3.00 / $15.00

China

All China models

GLM-4.5

China

Zhipu's flagship — agentic-first MoE with strong coding + tool-use benchmarks.

Zhipu AI128K ctx$0.60 / $2.20Open

Qwen3-Coder

China

Open-weights coding specialist — 480B MoE, agentic by design.

Alibaba Cloud256K ctx$0.60 / $2.40Open

Kimi K2

China

Open-weights 1T-parameter MoE — agentic, long-context, the model behind Kimi Chat.

Moonshot AI128K ctx$0.60 / $2.50Open

MiniMax-M1

China

Open-weights 456B MoE with the largest free-tier context window in production: 1M tokens.

MiniMax1M ctx$0.30 / $1.65Open

The Index

Every model, indexed.

Newest A–Z Cheapest

FamilyAll Baichuan Claude Codestral Command DeepSeek Doubao ERNIE GLM GPT GPT-4 Gemini Grok Hunyuan Kimi Llama MiniMax Mistral Phi Qwen Sora Step Yi o-series

GLM-4.5

China

Zhipu's flagship — agentic-first MoE with strong coding + tool-use benchmarks.

Zhipu AI128K ctx$0.60 / $2.20Open

Qwen3-Coder

China

Open-weights coding specialist — 480B MoE, agentic by design.

Alibaba Cloud256K ctx$0.60 / $2.40Open

Kimi K2

China

Open-weights 1T-parameter MoE — agentic, long-context, the model behind Kimi Chat.

Moonshot AI128K ctx$0.60 / $2.50Open

MiniMax-M1

China

Open-weights 456B MoE with the largest free-tier context window in production: 1M tokens.

MiniMax1M ctx$0.30 / $1.65Open

Claude Sonnet 4

USA

The workhorse Claude tier — extended thinking at a fraction of Opus pricing.

Anthropic200K ctx$3.00 / $15.00

Claude Opus 4

USA

Anthropic's frontier model with extended thinking, leading SWE-bench Verified.

Anthropic200K ctx$15.00 / $75.00

Qwen3

China

Alibaba's latest open-weights generation — dense + MoE variants with hybrid reasoning mode.

Alibaba Cloud128K ctx$0.50 / $2.00Open

GPT-4.1

USA

OpenAI's coding-focused refresh with a full 1M-token context window.

OpenAI1M ctx$2.00 / $8.00

Gemini 2.5 Pro

USA

Google's deep-thinking flagship with a 1M-token context window.

Google DeepMind1M ctx$1.25 / $10.00

ERNIE X1

China

Baidu's reasoning model — Chinese-language counterpart to DeepSeek R1 and OpenAI o-series.

Baidu128K ctx$0.28 / $1.10

ERNIE 4.5

China

Baidu's multimodal flagship — Chinese-language leader, integrated with the Baidu ecosystem.

Baidu128K ctx$0.55 / $2.20

Grok 3

USA

xAI's flagship with real-time X integration and a Think reasoning mode.

xAI128K ctx$3.00 / $15.00

o3-mini

USA

Cheap, fast reasoning — o1-level math/code at a fraction of the cost.

OpenAI200K ctx$1.10 / $4.40

Qwen 2.5-Max

China

Alibaba's frontier MoE — closed-weights, competitive with Claude 3.5 Sonnet on key benchmarks.

Alibaba Cloud32K ctx$1.60 / $6.40

Doubao 1.5 Lite

China

ByteDance's fast tier — the cheapest frontier-class API in China.

ByteDance32K ctx$0.03 / $0.09

Doubao 1.5 Pro

China

ByteDance's flagship — the LLM behind Doubao (China's most-used consumer AI app).

ByteDance256K ctx$0.11 / $0.28

DeepSeek R1

China

Open-weights reasoning model — o1-comparable quality with full chain-of-thought visible.

DeepSeek64K ctx$0.55 / $2.19Open

Codestral 25.01

Europe

Mistral's coding-specialist model — 256K context, fast autocomplete, 80+ languages.

Mistral AI256K ctx$0.30 / $0.90

DeepSeek V3

China

671B MoE (37B active) — frontier-class quality at a fraction of competitor pricing.

DeepSeek64K ctx$0.27 / $1.10Open

Phi-4

USA

14B small-language model — outperforms much larger models thanks to curated synthetic data.

Microsoft Research16K ctx$0.15 / $0.25Open

Gemini 2.0 Flash

USA

Fast, cheap, multimodal — 1M context at the lowest tier-1 pricing.

Google DeepMind1M ctx$0.10 / $0.40

Sora Turbo

USA

OpenAI's text-to-video model — up to 20-second 1080p clips.

OpenAI0 ctx— / —

Llama 3.3 70B

USA

70B-parameter open-weights model matching Llama 3.1 405B quality at a fraction of the cost.

Meta AI128K ctx$0.60 / $0.60Open

o1

USA

OpenAI's reasoning flagship — chain-of-thought trained via large-scale RL.

OpenAI200K ctx$15.00 / $60.00

Step-2

China

StepFun's trillion-parameter MoE flagship — one of China's most-watched closed labs.

Step Fun200K ctx$0.60 / $2.00

Hunyuan-Turbo

China

Tencent's flagship MoE — the engine behind Yuanbao and many internal Tencent products.

Tencent256K ctx$0.30 / $1.00Open

Claude 3.5 Haiku

USA

Anthropic's fast tier — sub-second responses for high-throughput workloads.

Anthropic200K ctx$0.80 / $4.00

Yi-Lightning

China

01.AI's fast tier — top-tier quality at consumer-app latency.

01.AI16K ctx$0.14 / $0.14

Qwen 2.5-72B Instruct

China

Open-weights 72B from the Qwen 2.5 family — Apache 2.0, runs on a single 8×A100 node.

Alibaba Cloud128K ctx$0.35 / $0.40Open

Command R+

Other

Cohere's RAG-first enterprise flagship — strong citations, on-prem, BYOC deployment.

Cohere128K ctx$2.50 / $10.00Open

Mistral Large 2

Europe

Mistral's flagship — 123B parameters, EU-resident, with research-only weights available.

Mistral AI128K ctx$2.00 / $6.00

Llama 3.1 405B

USA

Meta's largest open-weights model — 405B dense parameters under permissive license.

Meta AI128K ctx$3.50 / $3.50Open

Baichuan-4

China

Baichuan AI's flagship — strong Chinese-language and medical-domain performance.

Baichuan AI32K ctx$0.70 / $0.70

GPT-4o

USA

OpenAI's multimodal flagship — text, vision, audio, and image in one model.

OpenAI128K ctx$2.50 / $10.00

The flagshipmodels, indexed.

By origin.

🇺🇸USA

Claude Sonnet 4

Claude Opus 4

Gemini 2.5 Pro

Grok 3

🇨🇳China

GLM-4.5

Qwen3-Coder

Kimi K2

MiniMax-M1

Every model, indexed.

GLM-4.5

Qwen3-Coder

Kimi K2

MiniMax-M1

Claude Sonnet 4

Claude Opus 4

Qwen3

GPT-4.1

Gemini 2.5 Pro

ERNIE X1

ERNIE 4.5

Grok 3

o3-mini

Qwen 2.5-Max

Doubao 1.5 Lite

Doubao 1.5 Pro

DeepSeek R1

Codestral 25.01

DeepSeek V3

Phi-4

Gemini 2.0 Flash

Sora Turbo

Llama 3.3 70B

o1

Step-2

Hunyuan-Turbo

Claude 3.5 Haiku

Yi-Lightning

Qwen 2.5-72B Instruct

Command R+

Mistral Large 2

Llama 3.1 405B

Baichuan-4

GPT-4o

The flagship
models, indexed.

USA

China