Gemini 3.1 Pro Preview vs GPT-5.5: which wins at real work?
22 task areas · same graded test runs · rank comparison only, so 0–100 and Elo collections never mix raw scores.
GPT-5.5 wins 13 of 22 task areas we tested; Gemini 3.1 Pro Preview takes 9. Gemini 3.1 Pro Preview costs 2.5× less per token ($14 vs $35 per 1M).
Gemini 3.1 Pro Preview costs 2.5× less per token ($14 vs $35 per 1M).
Task by task
| Task area | Gemini 3.1 Pro Preview | GPT-5.5 | Winner |
|---|---|---|---|
| Training & Education |
#4
/ 107
Excellent
|
#59
/ 107
Excellent
|
Gemini 3.1 Pro Preview |
| Coding |
#55
/ 115
Strong
|
#1
/ 115
Excellent
|
GPT-5.5 |
| Research & Competitive Analysis |
#40
/ 107
Strong
|
#7
/ 107
Excellent
|
GPT-5.5 |
| AI Strategy |
#30
/ 126
Strong
|
#59
/ 126
Strong
|
Gemini 3.1 Pro Preview |
| Frontend & Landing Pages |
#31
/ 106
Needs editing
|
#10
/ 106
Needs editing
|
GPT-5.5 |
| Presentations & Decks |
#22
/ 107
Excellent
|
#2
/ 107
Excellent
|
GPT-5.5 |
| Product & Project Management |
#25
/ 107
Excellent
|
#8
/ 107
Excellent
|
GPT-5.5 |
| Knowledge & Docs |
#19
/ 107
Strong
|
#5
/ 107
Excellent
|
GPT-5.5 |
| Structured Output |
#14
/ 110
Excellent
|
#3
/ 110
Excellent
|
GPT-5.5 |
| Translation & Localization |
#14
/ 107
Excellent
|
#3
/ 107
Excellent
|
GPT-5.5 |
| Data & Analytics |
#37
/ 110
Excellent
|
#46
/ 110
Excellent
|
Gemini 3.1 Pro Preview |
| Sales |
#31
/ 107
Strong
|
#40
/ 107
Usable
|
Gemini 3.1 Pro Preview |
| Legal & HR |
#17
/ 107
Excellent
|
#9
/ 107
Excellent
|
GPT-5.5 |
| RAG, Safety & Grounding |
#8
/ 110
Excellent
|
#14
/ 110
Excellent
|
Gemini 3.1 Pro Preview |
| Summarization & Meeting Notes |
#3
/ 107
Excellent
|
#8
/ 107
Excellent
|
Gemini 3.1 Pro Preview |
| Content & Brand |
#5
/ 124
Strong
|
#2
/ 124
Strong
|
GPT-5.5 |
| Executive Assistant |
#13
/ 112
Strong
|
#10
/ 112
Strong
|
GPT-5.5 |
| Chef / Home Cooking |
#6
/ 126
Strong
|
#4
/ 126
Strong
|
GPT-5.5 |
| Creative & Comedy | #4 / 107 | #2 / 107 | GPT-5.5 |
| Investor & Pitch |
#10
/ 63
Strong
|
#12
/ 63
Strong
|
Gemini 3.1 Pro Preview |
| Landing Pages |
#2
/ 69
Strong
|
#4
/ 69
Strong
|
Gemini 3.1 Pro Preview |
| Customer Support |
#1
/ 113
Strong
|
#2
/ 113
Strong
|
Gemini 3.1 Pro Preview |
Rank = position among every model config we tested in that task area (lower is better). Sorted by biggest gap first.
Frequently asked
Is Gemini 3.1 Pro Preview better than GPT-5.5?
Across 22 task areas we benchmarked, GPT-5.5 ranks higher in 13 and Gemini 3.1 Pro Preview in 9.
Which is cheaper, Gemini 3.1 Pro Preview or GPT-5.5?
Gemini 3.1 Pro Preview costs 2.5× less per token ($14 vs $35 per 1M).
What is Gemini 3.1 Pro Preview better at?
Gemini 3.1 Pro Preview out-ranks GPT-5.5 at Training & Education, AI Strategy, Data & Analytics.
What is GPT-5.5 better at?
GPT-5.5 out-ranks Gemini 3.1 Pro Preview at Coding, Research & Competitive Analysis, Frontend & Landing Pages.
This page is Spring Prompt, running
We just did this for every model. Do it for your prompt.
The rankings above come from running real tasks through real models and scoring every output. Spring Prompt is that same engine — pointed at your prompt, your test cases, and your definition of good.
- Generate test cases from your prompt — no eval set required to start.
- Compare models side by side with quality, cost and latency in one matrix.
- Optimise the winner until the scores say it's ready to ship.
Prompt × model results
12 test cases · 3 evals