Gemini 2.5 Pro

by Google DeepMind·USA·Released Mar 25, 2025

Google's deep-thinking flagship with a 1M-token context window.

textvisionaudiovideocodechatreasoningagentstoolslong-context

Vendor site

— · 0 reviews

About this model

Gemini 2.5 Pro (March 2025) was Google's first 'thinking' model — like OpenAI's o-series, it spends additional compute on internal reasoning before responding. Unlike o1 / o3-mini, the reasoning trace is visible to the user, which Google argues helps debug agent workflows.

The model ships with a 1M-token context window (2M in some configurations) and tops several reasoning benchmarks at release — 84% on GPQA Diamond, 86.7% on MATH. It also brings the strongest video understanding of any frontier model, courtesy of the multimodal-from-scratch Gemini architecture.

Pricing is tiered by context length: $1.25/M input for ≤200K tokens, $2.50/M for >200K. Output is $10/M (or $15/M past 200K). Google offers a generous free tier via AI Studio for prototyping.

Strengths

•1M-token context (2M in some configs) at competitive pricing
•Visible reasoning traces — easier to debug than OpenAI's o-series
•Top-of-leaderboard at launch on GPQA Diamond (84%)
•Strongest video understanding of any frontier model
•Generous AI Studio free tier

Limitations

•Tool-call format is Google-specific, not MCP
•Coding scores trail Claude 4 family on SWE-bench Verified
•Pricing structure complicates capacity planning (tier change at 200K tokens)

When to use it

→Whole-corpus document analysis (1M+ token inputs)
→Video analysis and content moderation at scale
→Multi-step reasoning where chain-of-thought visibility matters
→Workspace-native assistants (Docs, Gmail, Sheets)

Architecture & training

DeepMind has confirmed Gemini 2.5 uses a sparse Mixture-of-Experts architecture trained natively on interleaved text/image/audio/video tokens. The thinking capability was added in post-training via a process Google calls 'Gemini Thinking' — a variant of large-scale RL on chain-of-thought generation. Training infrastructure is Google's TPU v5p superpods.

Benchmarks

Benchmark	Score	Bar
GPQA	84.0
MATH	86.7
MMLU	85.8
SWE-bench Verified	63.8

Reviews · 0

Stories about Gemini 2.5 Pro

Google DeepMindJul 8, 2026

Google DeepMind Blog Lists May–June 2026 Updates on Models, Safety, and Research Partnerships

The Google DeepMind blog index features recent posts covering new Gemini and Gemma model releases, AI safety and responsibility initiatives, scientific research applications, and partnerships with A24 and Singapore.

Google DeepMindJul 2, 2026

Google DeepMind Blog Lists New Gemini and Gemma Models, AI Safety Initiatives, and Research Updates

The Google DeepMind blog features multiple announcements covering new Gemini and Gemma model releases, AI safety and responsibility programs, and scientific research tools. Updates span faster text generation, voice translation, computer-use capabilities, and international partnerships for housing, education, and robotics.

Google DeepMindJun 3, 2026

Google DeepMind Blog Lists New Gemini Models, Scientific AI Tools, and Global Partnerships

The Google DeepMind blog highlights recent posts announcing updates to the Gemini and Gemma model families, AI systems for scientific discovery and weather prediction, and new international partnerships focused on safety and research.

Google DeepMindMay 28, 2026

Google DeepMind Blog: New AI Models, Science Tools, and Global Partnerships

Google DeepMind's blog features multiple announcements including the Gemini Omni and Gemini 3.5 models, a new national AI partnership with Singapore, scientific research tools like Co-Scientist and AlphaEvolve, and the Google DeepMind Accelerator program for environmental risks in Asia Pacific.

Compare against

All models →

Gemini 2.0 Flash

USA

Fast, cheap, multimodal — 1M context at the lowest tier-1 pricing.

Google DeepMind1M ctx$0.10 / $0.40

GLM-4.5

China

Zhipu's flagship — agentic-first MoE with strong coding + tool-use benchmarks.

Zhipu AI128K ctx$0.60 / $2.20Open

Qwen3-Coder

China

Open-weights coding specialist — 480B MoE, agentic by design.

Alibaba Cloud256K ctx$0.60 / $2.40Open

Kimi K2

China

Open-weights 1T-parameter MoE — agentic, long-context, the model behind Kimi Chat.

Moonshot AI128K ctx$0.60 / $2.50Open

About this model

✓ Strengths

× Limitations

When to use it

Architecture & training

Benchmarks

Reviews · 0

Stories about Gemini 2.5 Pro

Google DeepMind Blog Lists May–June 2026 Updates on Models, Safety, and Research Partnerships

Google DeepMind Blog Lists New Gemini and Gemma Models, AI Safety Initiatives, and Research Updates

Google DeepMind Blog Lists New Gemini Models, Scientific AI Tools, and Global Partnerships

Google DeepMind Blog: New AI Models, Science Tools, and Global Partnerships

Compare against

Gemini 2.0 Flash

GLM-4.5

Qwen3-Coder

Kimi K2

Strengths

Limitations