Claude 3.5 Haiku

by Anthropic·USA·Released Nov 4, 2024

Anthropic's fast tier — sub-second responses for high-throughput workloads.

textcodechattoolslong-context

Vendor site

— · 0 reviews

About this model

Claude 3.5 Haiku is Anthropic's fast tier — released October 2024 with sub-second median latency and aggressive pricing ($0.80/M input, $4/M output). Despite the small-tier branding, Haiku 3.5 actually outperforms the original Claude 3 Opus on several benchmarks including HumanEval (88.1%), which surprised the industry on release.

Anthropic has not yet released a Haiku 4 variant, so 3.5 remains the recommendation for latency-sensitive workloads. The model retains full tool-use support — there's no feature gap vs the larger tiers, just a quality gap on the hardest tasks.

Strengths

•Sub-second median latency for short prompts
•Cheapest Claude model — $0.80/M input
•Full tool-use support — no feature gap vs higher tiers
•Beats Claude 3 Opus on HumanEval coding benchmark

Limitations

•Quality gap vs Sonnet 4 on reasoning-heavy tasks
•No extended thinking mode
•No Haiku 4 yet — sits a generation behind the Claude 4 family

When to use it

→High-throughput classification and tagging
→Real-time chat with sub-second latency budgets
→Routing layer in multi-model architectures
→Embedding-stage extraction for RAG pipelines
→Batch jobs where cost per token dominates

Architecture & training

Released as part of the Claude 3.5 generation in October 2024, sharing the pretraining corpus and constitutional-AI post-training with Sonnet 3.5. Anthropic has confirmed Haiku 3.5 is a smaller distilled variant rather than a separately-trained model — which explains how it inherits the safety and tool-use behaviour of the larger Claude 3.5 tiers.

Benchmarks

Benchmark	Score	Bar
MMLU	75.0
HumanEval	88.1

Reviews · 0

Stories about Claude 3.5 Haiku

Import AIJul 19, 2026

Anthropic reports 8x surge in code output, sees early signs of recursive self-improvement

Anthropic says code merged into its codebase jumped 8x in 2026 compared to 2021–2024, which it views as preliminary evidence that AI is already beginning to accelerate the development of better AI.

InterconnectsJul 18, 2026

Z.ai's GLM-5.2 stuns AI community, rivaling Claude and OpenAI on top benchmarks

The open-weight GLM-5.2 model is the first to crack into top-tier coding agent performance alongside Anthropic and OpenAI's best — and it landed right after the U.S. effectively banned Claude Fable 5.

The DecoderJul 18, 2026

Anthropic cuts Claude Fable 5 limits, pushes Pro users to API pricing

Anthropic is slashing Fable 5 usage in its subscription plans by up to 50 percent while nudging Pro and Team Standard users toward pay-per-use API pricing. The move comes as competitive pressure mounts from cheaper models like OpenAI's GPT-5.6 Sol.

Anthropic NewsJul 6, 2026

Anthropic Details Claude Fable 5 Cyber Safeguards and Jailbreak Framework

Anthropic detailed the cybersecurity safety classifiers for its globally redeployed Claude Fable 5 model and released an early draft of an AI jailbreak severity framework developed with Glasswing partners. The company also launched a HackerOne program for researchers to submit potential cyber jailbreaks.

Compare against

All models →

Claude Sonnet 4

USA

The workhorse Claude tier — extended thinking at a fraction of Opus pricing.

Anthropic200K ctx$3.00 / $15.00

Claude Opus 4

USA

Anthropic's frontier model with extended thinking, leading SWE-bench Verified.

Anthropic200K ctx$15.00 / $75.00

GLM-4.5

China

Zhipu's flagship — agentic-first MoE with strong coding + tool-use benchmarks.

Zhipu AI128K ctx$0.60 / $2.20Open

Qwen3-Coder