Gemini 3.1 Pro Preview vs GPT-5.5: which wins at real work?

22 task areas · same graded test runs · compared by rank only, so raw scores from 0–100 and Elo collections are never mixed.

GPT-5.5 wins 13 of the 22 task areas we tested; Gemini 3.1 Pro Preview takes 9. Gemini 3.1 Pro Preview costs 60% less per token ($14 vs $35 per 1M tokens), which makes GPT-5.5 2.5× the price.

Metric	Gemini 3.1 Pro Preview	GPT-5.5
Task areas won	9	13
Avg percentile		
Top-3 finishes		
Price / 1M tokens	$14.0	$35.0
Provider	Google	OpenAI

Task by task

Task area	Gemini 3.1 Pro Preview	GPT-5.5	Winner
Training & Education	#4 / 107 Excellent	#59 / 107 Excellent	Gemini 3.1 Pro Preview
Coding	#55 / 115 Strong	#1 / 115 Excellent	GPT-5.5
Research & Competitive Analysis	#40 / 107 Strong	#7 / 107 Excellent	GPT-5.5
AI Strategy	#30 / 126 Strong	#59 / 126 Strong	Gemini 3.1 Pro Preview
Frontend & Landing Pages	#31 / 106 Needs editing	#10 / 106 Needs editing	GPT-5.5
Presentations & Decks	#22 / 107 Excellent	#2 / 107 Excellent	GPT-5.5
Product & Project Management	#25 / 107 Excellent	#8 / 107 Excellent	GPT-5.5
Knowledge & Docs	#19 / 107 Strong	#5 / 107 Excellent	GPT-5.5
Structured Output	#14 / 110 Excellent	#3 / 110 Excellent	GPT-5.5
Translation & Localization	#14 / 107 Excellent	#3 / 107 Excellent	GPT-5.5
Data & Analytics	#37 / 110 Excellent	#46 / 110 Excellent	Gemini 3.1 Pro Preview
Sales	#31 / 107 Strong	#40 / 107 Usable	Gemini 3.1 Pro Preview
Legal & HR	#17 / 107 Excellent	#9 / 107 Excellent	GPT-5.5
RAG, Safety & Grounding	#8 / 110 Excellent	#14 / 110 Excellent	Gemini 3.1 Pro Preview
Summarization & Meeting Notes	#3 / 107 Excellent	#8 / 107 Excellent	Gemini 3.1 Pro Preview
Content & Brand	#5 / 124 Strong	#2 / 124 Strong	GPT-5.5
Executive Assistant	#13 / 112 Strong	#10 / 112 Strong	GPT-5.5
Chef / Home Cooking	#6 / 126 Strong	#4 / 126 Strong	GPT-5.5
Creative & Comedy	#4 / 107	#2 / 107	GPT-5.5
Investor & Pitch	#10 / 63 Strong	#12 / 63 Strong	Gemini 3.1 Pro Preview
Landing Pages	#2 / 69 Strong	#4 / 69 Strong	Gemini 3.1 Pro Preview
Customer Support	#1 / 113 Strong	#2 / 113 Strong	Gemini 3.1 Pro Preview

Rank = position among every model config we tested in that task area (lower is better). Rows are sorted by the size of the rank gap between the two models, largest first.
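
The winner and sort order in the table can be reproduced from the two rank columns alone. The sketch below is a minimal illustration, not the site's actual pipeline: the `TaskResult` class and its field names are hypothetical, the handling of ties is an assumption (no tie appears in the table), and only four rows are copied in to keep it short.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TaskResult:
    """One table row. Names are illustrative, not taken from the site."""
    task: str
    gemini_rank: int  # rank of Gemini 3.1 Pro Preview (lower is better)
    gpt_rank: int     # rank of GPT-5.5 (lower is better)

    @property
    def gap(self) -> int:
        # Absolute difference in rank positions within one task area.
        return abs(self.gemini_rank - self.gpt_rank)

    @property
    def winner(self) -> str:
        # Assumption: equal ranks would be reported as a tie.
        if self.gemini_rank == self.gpt_rank:
            return "Tie"
        if self.gemini_rank < self.gpt_rank:
            return "Gemini 3.1 Pro Preview"
        return "GPT-5.5"


# Four rows copied from the table above.
results = [
    TaskResult("Customer Support", 1, 2),
    TaskResult("Coding", 55, 1),
    TaskResult("Training & Education", 4, 59),
    TaskResult("Legal & HR", 17, 9),
]

# Biggest rank gap first, matching the table's stated sort order.
by_gap = sorted(results, key=lambda r: r.gap, reverse=True)

for r in by_gap:
    print(f"{r.task}: gap {r.gap}, winner {r.winner}")
```

Because only rank positions are compared, a task area scored on a 0–100 scale and one scored with Elo can sit in the same table without their raw scores ever being combined.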

Frequently asked

Is Gemini 3.1 Pro Preview better than GPT-5.5?

Not overall: across the 22 task areas we benchmarked, GPT-5.5 ranks higher in 13 and Gemini 3.1 Pro Preview in 9.

Which is cheaper, Gemini 3.1 Pro Preview or GPT-5.5?

Gemini 3.1 Pro Preview. It costs 60% less per token ($14 vs $35 per 1M tokens), which makes GPT-5.5 2.5× the price.
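
The price comparison is plain arithmetic on the two listed prices. The sketch below works it through and projects it onto a workload; the `cost_usd` helper and the 10M-token workload are hypothetical, and it assumes the listed figure is a single flat price per 1M tokens, since the page does not say how input and output tokens are weighted.

```python
GEMINI_PRICE = 14.0  # USD per 1M tokens, as listed on this page
GPT_PRICE = 35.0     # USD per 1M tokens, as listed on this page

# How many times more expensive GPT-5.5 is per token.
price_ratio = GPT_PRICE / GEMINI_PRICE

# Fraction saved per token by choosing Gemini 3.1 Pro Preview.
saving = 1 - GEMINI_PRICE / GPT_PRICE


def cost_usd(tokens: int, price_per_million: float) -> float:
    """Cost of a workload, assuming one flat price per 1M tokens."""
    return tokens / 1_000_000 * price_per_million


# Hypothetical workload: 10M tokens per month.
tokens = 10_000_000
print(f"ratio: {price_ratio:.1f}x, saving: {saving:.0%}")
print(f"Gemini 3.1 Pro Preview: ${cost_usd(tokens, GEMINI_PRICE):.2f}")
print(f"GPT-5.5: ${cost_usd(tokens, GPT_PRICE):.2f}")
```

Price is only half of the trade-off: in task areas where GPT-5.5 leads by a wide rank gap, the cheaper model may need more retries or editing, which this arithmetic does not capture.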

What is Gemini 3.1 Pro Preview better at?

Gemini 3.1 Pro Preview out-ranks GPT-5.5 in 9 task areas, by the widest margins in Training & Education (#4 vs #59), AI Strategy (#30 vs #59), and Data & Analytics (#37 vs #46).

What is GPT-5.5 better at?

GPT-5.5 out-ranks Gemini 3.1 Pro Preview in 13 task areas, by the widest margins in Coding (#1 vs #55), Research & Competitive Analysis (#7 vs #40), and Frontend & Landing Pages (#10 vs #31).

This page is Spring Prompt, running

We just did this for every model. Do it for your prompt.

The rankings above come from running real tasks through real models and scoring every output. Spring Prompt is that same engine — pointed at your prompt, your test cases, and your definition of good.

Generate test cases from your prompt — no eval set required to start.
Compare models side by side with quality, cost and latency in one matrix.
Optimise the winner until the scores say it's ready to ship.

Experiment · Cold outreach email

Prompt × model results (12 test cases · 3 evals)

Prompt	Claude Opus	GPT-5	Gemini
v1	7.1	6.8	7.4
v2	8.3	7.9	8.0
v3	9.2 ★	8.6	8.4

Best combo: v3 × Claude Opus (9.2 quality · $0.004/run · 1.8s)