Anthropic Claude Haiku 4.5 VS OpenAI GPT-5.4 Mini

Claude Haiku 4.5 vs GPT-5.4 Mini: which wins at real work?

22 task areas · same graded test runs · compared by rank only, so collections scored 0–100 and collections scored by Elo never have their raw scores mixed.

GPT-5.4 Mini wins 18 of the 22 task areas we tested; Claude Haiku 4.5 takes 4. GPT-5.4 Mini is also about 12.5% cheaper per token ($5.25 vs $6 per 1M tokens), so Claude Haiku 4.5 costs roughly 1.1× as much.

| | Claude Haiku 4.5 | GPT-5.4 Mini |
|---|---|---|
| Task areas won | 4 | 18 |
| Avg percentile | 39 | 64 |
| Top-3 finishes | 0 | 2 |
| Price / 1M tokens | $6.00 | $5.25 |
| Provider | Anthropic | OpenAI |


Task by task

| Task area | Claude Haiku 4.5 | GPT-5.4 Mini | Winner |
|---|---|---|---|
| Structured Output | #102 / 110 (Usable) | #1 / 110 (Excellent) | GPT-5.4 Mini |
| Customer Support | #93 / 113 (Usable) | #18 / 113 (Strong) | GPT-5.4 Mini |
| Presentations & Decks | #85 / 107 (Strong) | #14 / 107 (Excellent) | GPT-5.4 Mini |
| Knowledge & Docs | #68 / 107 (Usable) | #3 / 107 (Excellent) | GPT-5.4 Mini |
| Coding | #79 / 115 (Usable) | #21 / 115 (Strong) | GPT-5.4 Mini |
| RAG, Safety & Grounding | #84 / 110 (Strong) | #37 / 110 (Excellent) | GPT-5.4 Mini |
| Research & Competitive Analysis | #56 / 107 (Usable) | #11 / 107 (Excellent) | GPT-5.4 Mini |
| AI Strategy | #88 / 126 (Strong) | #45 / 126 (Strong) | GPT-5.4 Mini |
| Sales | #77 / 107 (Usable) | #38 / 107 (Usable) | GPT-5.4 Mini |
| Training & Education | #35 / 107 (Excellent) | #74 / 107 (Strong) | Claude Haiku 4.5 |
| Chef / Home Cooking | #109 / 126 (Needs editing) | #79 / 126 (Usable) | GPT-5.4 Mini |
| Creative & Comedy | #81 / 107 | #52 / 107 | GPT-5.4 Mini |
| Frontend & Landing Pages | #6 / 106 (Usable) | #35 / 106 (Needs editing) | Claude Haiku 4.5 |
| Legal & HR | #46 / 107 (Excellent) | #25 / 107 (Excellent) | GPT-5.4 Mini |
| Product & Project Management | #38 / 107 (Strong) | #19 / 107 (Excellent) | GPT-5.4 Mini |
| Landing Pages | #66 / 69 (Needs editing) | #49 / 69 (Usable) | GPT-5.4 Mini |
| Executive Assistant | #67 / 109 (Strong) | #81 / 109 (Usable) | Claude Haiku 4.5 |
| Data & Analytics | #51 / 110 (Excellent) | #63 / 110 (Excellent) | Claude Haiku 4.5 |
| Investor & Pitch | #58 / 63 (Needs editing) | #47 / 63 (Usable) | GPT-5.4 Mini |
| Summarization & Meeting Notes | #24 / 107 (Excellent) | #16 / 107 (Excellent) | GPT-5.4 Mini |
| Content & Brand | #89 / 124 (Usable) | #84 / 124 (Usable) | GPT-5.4 Mini |
| Translation & Localization | #26 / 107 (Excellent) | #21 / 107 (Excellent) | GPT-5.4 Mini |

Rank = position among every model config we tested in that task area (lower is better). Sorted by biggest gap first.
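
To make the rank rule concrete, here is a minimal Python sketch of how a per-area winner and the "biggest gap first" ordering could be derived from ranks alone. The `AreaResult` class and the `percentile` formula are illustrative assumptions, not Spring Prompt's actual code; in particular, the page does not state how its average percentile is computed.

```python
from dataclasses import dataclass

MODEL_A = "Claude Haiku 4.5"
MODEL_B = "GPT-5.4 Mini"


@dataclass(frozen=True)
class AreaResult:
    """One task area: each model's rank among `total` tested configs (hypothetical type)."""
    area: str
    rank_a: int  # rank of MODEL_A (lower is better)
    rank_b: int  # rank of MODEL_B (lower is better)
    total: int   # number of model configs ranked in this area

    @property
    def gap(self) -> int:
        # Absolute rank difference, used only for ordering the table.
        return abs(self.rank_a - self.rank_b)

    @property
    def winner(self) -> str:
        if self.rank_a == self.rank_b:
            return "Tie"
        return MODEL_A if self.rank_a < self.rank_b else MODEL_B


def percentile(rank: int, total: int) -> float:
    """Assumed definition: percentage of the field ranked below this model."""
    return 100.0 * (1 - rank / total)


# Three rows copied from the table above.
rows = [
    AreaResult("Translation & Localization", 26, 21, 107),
    AreaResult("Structured Output", 102, 1, 110),
    AreaResult("Training & Education", 35, 74, 107),
]

# Biggest rank gap first, as in the table.
by_gap = sorted(rows, key=lambda r: r.gap, reverse=True)
for r in by_gap:
    print(f"{r.area}: gap {r.gap}, winner {r.winner}")
```

Because only ranks enter the comparison, an area scored 0–100 and an area scored by Elo can sit in the same table without their raw scores ever being combined.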

Frequently asked

Is Claude Haiku 4.5 better than GPT-5.4 Mini?

Across 22 task areas we benchmarked, GPT-5.4 Mini ranks higher in 18 and Claude Haiku 4.5 in 4.

Which is cheaper, Claude Haiku 4.5 or GPT-5.4 Mini?

GPT-5.4 Mini is cheaper: $5.25 vs $6 per 1M tokens, about 12.5% less per token (Claude Haiku 4.5 costs roughly 1.1× as much).
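
The arithmetic behind that price comparison can be checked with a short Python sketch. The two prices come from this page; the `cost_usd` helper and the 2M-token workload are hypothetical, and the sketch assumes a single flat per-token price, since the page does not break the figure into input and output tokens.

```python
# USD per 1M tokens, as listed on this page.
PRICE_PER_1M = {"Claude Haiku 4.5": 6.00, "GPT-5.4 Mini": 5.25}


def cost_usd(model: str, tokens: int) -> float:
    """Cost of `tokens` tokens at a flat per-token price (an assumption)."""
    return PRICE_PER_1M[model] * tokens / 1_000_000


claude = PRICE_PER_1M["Claude Haiku 4.5"]
gpt = PRICE_PER_1M["GPT-5.4 Mini"]

ratio = claude / gpt                        # how many times pricier Claude Haiku 4.5 is
saving_pct = (claude - gpt) / claude * 100  # how much cheaper GPT-5.4 Mini is

print(f"ratio: {ratio:.2f}x, saving: {saving_pct:.1f}%")
print(f"2M tokens on GPT-5.4 Mini: ${cost_usd('GPT-5.4 Mini', 2_000_000):.2f}")
```

The ratio (about 1.14) and the saving (12.5%) describe the same price difference from two directions, which is why "1.1× as much" and "12.5% less" are both valid readings.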

What is Claude Haiku 4.5 better at?

Claude Haiku 4.5 out-ranks GPT-5.4 Mini in four task areas: Training & Education, Frontend & Landing Pages, Executive Assistant, and Data & Analytics.

What is GPT-5.4 Mini better at?

GPT-5.4 Mini out-ranks Claude Haiku 4.5 in 18 task areas. The three largest rank gaps are in Structured Output, Customer Support, and Presentations & Decks.


This page is Spring Prompt, running

We just did this for every model. Do it for your prompt.

The rankings above come from running real tasks through real models and scoring every output. Spring Prompt is that same engine — pointed at your prompt, your test cases, and your definition of good.

  • Generate test cases from your prompt — no eval set required to start.
  • Compare models side by side with quality, cost and latency in one matrix.
  • Optimise the winner until the scores say it's ready to ship.

Experiment · Cold outreach email

Prompt × model results (12 test cases · 3 evals)

| Prompt version | Claude Opus | GPT-5 | Gemini |
|---|---|---|---|
| v1 | 7.1 | 6.8 | 7.4 |
| v2 | 8.3 | 7.9 | 8.0 |
| v3 | 9.2 | 8.6 | 8.4 |

Best combo: v3 × Claude Opus (9.2 quality · $0.004/run · 1.8s)
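
As an illustration of how a "best combo" could be read off such a matrix, here is a minimal Python sketch that picks the highest-scoring prompt version and model pair. The `best_combo` function is hypothetical and looks at quality only, because the example shows cost and latency for the winning cell alone; a real selection might weigh all three.

```python
# Quality scores from the example experiment: prompt version -> model -> score.
scores = {
    "v1": {"Claude Opus": 7.1, "GPT-5": 6.8, "Gemini": 7.4},
    "v2": {"Claude Opus": 8.3, "GPT-5": 7.9, "Gemini": 8.0},
    "v3": {"Claude Opus": 9.2, "GPT-5": 8.6, "Gemini": 8.4},
}


def best_combo(matrix: dict[str, dict[str, float]]) -> tuple[str, str, float]:
    """Return (version, model, score) for the highest score in the matrix."""
    cells = (
        (version, model, score)
        for version, row in matrix.items()
        for model, score in row.items()
    )
    return max(cells, key=lambda cell: cell[2])


# Best model for each prompt version, then the overall winner.
best_per_version = {v: max(row, key=row.get) for v, row in scores.items()}
version, model, score = best_combo(scores)

print(best_per_version)
print(f"Best combo: {version} x {model} ({score} quality)")
```

On these numbers the best model changes between v1 and v2, which is the reason to compare prompt versions and models together rather than fixing one and tuning the other.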