Claude Haiku 4.5 vs GPT-5.4 Mini: which wins at real work?

22 task areas · same graded test runs · rank comparison only, so 0–100 and Elo collections never mix raw scores.

GPT-5.4 Mini wins 18 of 22 task areas we tested; Claude Haiku 4.5 takes 4. GPT-5.4 Mini costs 1.1× less per token ($5.25 vs $6 per 1M).

Claude Haiku 4.5

GPT-5.4 Mini

Task areas won

Avg percentile

Top-3 finishes

$6.0

Price / 1M tokens

$5.25

Anthropic

Provider

OpenAI

GPT-5.4 Mini costs 1.1× less per token ($5.25 vs $6 per 1M).

Task by task

Task area	Claude Haiku 4.5	GPT-5.4 Mini	Winner
Structured Output	#102 / 110 Usable	#1 / 110 Excellent	GPT-5.4 Mini
Customer Support	#93 / 113 Usable	#18 / 113 Strong	GPT-5.4 Mini
Presentations & Decks	#85 / 107 Strong	#14 / 107 Excellent	GPT-5.4 Mini
Knowledge & Docs	#68 / 107 Usable	#3 / 107 Excellent	GPT-5.4 Mini
Coding	#79 / 115 Usable	#21 / 115 Strong	GPT-5.4 Mini
RAG, Safety & Grounding	#84 / 110 Strong	#37 / 110 Excellent	GPT-5.4 Mini
Research & Competitive Analysis	#56 / 107 Usable	#11 / 107 Excellent	GPT-5.4 Mini
AI Strategy	#88 / 126 Strong	#45 / 126 Strong	GPT-5.4 Mini
Sales	#77 / 107 Usable	#38 / 107 Usable	GPT-5.4 Mini
Training & Education	#35 / 107 Excellent	#74 / 107 Strong	Claude Haiku 4.5
Chef / Home Cooking	#109 / 126 Needs editing	#79 / 126 Usable	GPT-5.4 Mini
Creative & Comedy	#81 / 107	#52 / 107	GPT-5.4 Mini
Frontend & Landing Pages	#6 / 106 Usable	#35 / 106 Needs editing	Claude Haiku 4.5
Legal & HR	#46 / 107 Excellent	#25 / 107 Excellent	GPT-5.4 Mini
Product & Project Management	#38 / 107 Strong	#19 / 107 Excellent	GPT-5.4 Mini
Landing Pages	#66 / 69 Needs editing	#49 / 69 Usable	GPT-5.4 Mini
Executive Assistant	#67 / 109 Strong	#81 / 109 Usable	Claude Haiku 4.5
Data & Analytics	#51 / 110 Excellent	#63 / 110 Excellent	Claude Haiku 4.5
Investor & Pitch	#58 / 63 Needs editing	#47 / 63 Usable	GPT-5.4 Mini
Summarization & Meeting Notes	#24 / 107 Excellent	#16 / 107 Excellent	GPT-5.4 Mini
Content & Brand	#89 / 124 Usable	#84 / 124 Usable	GPT-5.4 Mini
Translation & Localization	#26 / 107 Excellent	#21 / 107 Excellent	GPT-5.4 Mini

Rank = position among every model config we tested in that task area (lower is better). Sorted by biggest gap first.

Frequently asked

Is Claude Haiku 4.5 better than GPT-5.4 Mini?

Across 22 task areas we benchmarked, GPT-5.4 Mini ranks higher in 18 and Claude Haiku 4.5 in 4.

Which is cheaper, Claude Haiku 4.5 or GPT-5.4 Mini?

GPT-5.4 Mini costs 1.1× less per token ($5.25 vs $6 per 1M).

What is Claude Haiku 4.5 better at?

Claude Haiku 4.5 out-ranks GPT-5.4 Mini at Training & Education, Frontend & Landing Pages, Executive Assistant.

What is GPT-5.4 Mini better at?

GPT-5.4 Mini out-ranks Claude Haiku 4.5 at Structured Output, Customer Support, Presentations & Decks.

Full Claude Haiku 4.5 review → Full GPT-5.4 Mini review → Full model leaderboard →

This page is Spring Prompt, running

We just did this for every model. Do it for your prompt.

The rankings above come from running real tasks through real models and scoring every output. Spring Prompt is that same engine — pointed at your prompt, your test cases, and your definition of good.

Generate test cases from your prompt — no eval set required to start.
Compare models side by side with quality, cost and latency in one matrix.
Optimise the winner until the scores say it's ready to ship.

Join the waitlist Browse all benchmarks

Experiment · Cold outreach email

Prompt × model results

12 test cases · 3 evals

Claude Opus

GPT-5

Gemini

7.1

6.8

7.4

8.3

7.9

8.0

9.2 ★

8.6

8.4

Best combo: v3 × Claude Opus

9.2 quality · $0.004/run · 1.8s