Codestral 25.01

by Mistral AI · Europe · Released Jan 13, 2025

Mistral's coding-specialist model — 256K context, fast autocomplete, 80+ languages.

About this model

Codestral 25.01 (January 2025) is Mistral's coding-specialist model: 22B parameters, a 256K-token context window, and a focus on real-time autocomplete latency rather than chat-style coding. Codestral powers the inline completions in major IDE plugins (Cursor, Continue, Tabnine integrations, JetBrains AI).

Unlike general-purpose models, Codestral is optimised for the fill-in-the-middle (FIM) format that autocomplete engines use: the model receives the code before and after the cursor and generates only the missing span. This gives much lower latency, at the cost of being less suited to standalone chat workflows.
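To make the FIM format concrete, here is a minimal sketch of a fill-in-the-middle request using only the Python standard library. The endpoint URL, the `codestral-latest` model alias, the field names, and the response shape are assumptions based on Mistral's public API and should be checked against the current documentation. The `MISTRAL_API_KEY` environment variable is a hypothetical name chosen for this example.

```python
import json
import os
import urllib.request

# Assumed endpoint and model alias; verify against Mistral's current API docs.
FIM_URL = "https://api.mistral.ai/v1/fim/completions"
MODEL = "codestral-latest"


def build_fim_payload(prefix: str, suffix: str, max_tokens: int = 64) -> dict:
    """Build the JSON body for a fill-in-the-middle request.

    `prompt` carries the code before the cursor and `suffix` the code after
    it; the model is asked to generate only the missing middle.
    """
    return {
        "model": MODEL,
        "prompt": prefix,
        "suffix": suffix,
        "max_tokens": max_tokens,
        "temperature": 0.0,  # deterministic output suits autocomplete
    }


def complete(prefix: str, suffix: str) -> str:
    """Send the request. Requires MISTRAL_API_KEY in the environment."""
    body = json.dumps(build_fim_payload(prefix, suffix)).encode("utf-8")
    request = urllib.request.Request(
        FIM_URL,
        data=body,
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {os.environ['MISTRAL_API_KEY']}",
        },
    )
    with urllib.request.urlopen(request, timeout=30) as response:
        data = json.load(response)
    # Assumed response shape: a chat-completion-style list of choices.
    return data["choices"][0]["message"]["content"]
```

Calling `complete("def add(a, b):\n    ", "\n\nprint(add(1, 2))")` would ask the model for the function body that sits between the two fragments. An editor plugin would typically issue a request like this on each pause in typing.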

Strengths

• Best-in-class latency for IDE autocomplete (FIM format)
• 256K context: full files or multi-file context in a single request
• Supports 80+ programming languages
• Drop-in replacement for older Codestral in major IDE plugins

Limitations

• Not designed for chat-style coding; use Mistral Large for conversations
• Weights available only for non-commercial evaluation (Mistral NPL)
• Outperformed by Claude Sonnet 4 and GPT-4.1 on long-horizon agentic coding

When to use it

→ IDE autocomplete (Cursor inline, Continue, JetBrains AI)
→ Fill-in-the-middle code completion at scale
→ Inline refactoring suggestions
→ Multi-file code completion within a 256K window
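For the multi-file case, a client has to decide which files fit in the window before sending the request. The sketch below shows one simple way to do that. The four-characters-per-token estimate, the reserved headroom, and the `# File:` header format are all illustrative assumptions, not Codestral's tokenizer or any plugin's actual behaviour.

```python
from typing import Iterable

CONTEXT_TOKENS = 256_000   # window size stated on this page
RESERVED_TOKENS = 4_000    # hypothetical headroom for the current file and the completion
CHARS_PER_TOKEN = 4        # crude heuristic, not the real tokenizer


def estimate_tokens(text: str) -> int:
    """Rough token estimate; swap in a real tokenizer for production use."""
    return len(text) // CHARS_PER_TOKEN + 1


def pack_context(
    files: Iterable[tuple[str, str]],
    budget: int = CONTEXT_TOKENS - RESERVED_TOKENS,
) -> str:
    """Concatenate (path, source) pairs, most relevant first, within a token budget."""
    parts = []
    used = 0
    for path, source in files:
        chunk = f"# File: {path}\n{source}\n"
        cost = estimate_tokens(chunk)
        if used + cost > budget:
            continue  # this file does not fit; a smaller later one still might
        parts.append(chunk)
        used += cost
    return "".join(parts)
```

The packed string would be prepended to the prefix of a FIM request, so the model sees neighbouring files before the code around the cursor. Ordering the input by relevance matters because files are admitted greedily.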

Architecture & training

Codestral 25.01 is a 22B-parameter dense transformer trained on a code-heavy corpus spanning 80+ languages. Fill-in-the-middle examples are explicitly weighted in the training data so that the model is optimised for autocomplete rather than chat. Mistral's technical post emphasises latency per completion as the primary optimisation target.
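As a rough illustration of what a fill-in-the-middle training example looks like, the sketch below splits a source file at two random points and rearranges it so the model learns to predict the middle from the prefix and suffix. This is the generic technique, not Mistral's actual data pipeline. The sentinel strings `<PRE>`, `<SUF>`, and `<MID>` are hypothetical placeholders, since real tokenizers define their own special tokens.

```python
import random

# Hypothetical sentinel strings for illustration only; they are not
# Codestral's actual special tokens.
PRE, SUF, MID = "<PRE>", "<SUF>", "<MID>"


def make_fim_example(source: str, rng: random.Random) -> tuple[str, str]:
    """Split `source` at two random points and return (model_input, target).

    The model input presents the prefix and suffix; the target is the
    middle span the model should learn to generate.
    """
    i, j = sorted(rng.randint(0, len(source)) for _ in range(2))
    prefix, middle, suffix = source[:i], source[i:j], source[j:]
    model_input = f"{PRE}{prefix}{SUF}{suffix}{MID}"
    return model_input, middle
```

Weighting a corpus toward FIM then amounts to applying a transformation like this to a chosen fraction of documents and leaving the rest as ordinary left-to-right text.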

Benchmarks

Benchmark	Score
MBPP	80.2
HumanEval	86.6
RepoBench	38.0

Compare against

GLM-4.5

China

Zhipu's flagship — agentic-first MoE with strong coding + tool-use benchmarks.

Zhipu AI · 128K ctx · $0.60 / $2.20 · Open

Qwen3-Coder

China

Open-weights coding specialist — 480B MoE, agentic by design.

Alibaba Cloud · 256K ctx · $0.60 / $2.40 · Open

Kimi K2

China

Open-weights 1T-parameter MoE — agentic, long-context, the model behind Kimi Chat.

Moonshot AI · 128K ctx · $0.60 / $2.50 · Open

MiniMax-M1

China

Open-weights 456B MoE with the largest free-tier context window in production: 1M tokens.

MiniMax · 1M ctx · $0.30 / $1.65 · Open
