Gemini 3.1 Pro Preview vs Gemini 3.5 Flash: which wins at real work?
22 task areas · same graded test runs · rank comparison only, so 0–100 and Elo collections never mix raw scores.
Gemini 3.1 Pro Preview wins 17 of 22 task areas we tested; Gemini 3.5 Flash takes 5. Gemini 3.5 Flash costs 1.3× less per token ($10.5 vs $14 per 1M).
Gemini 3.5 Flash costs 1.3× less per token ($10.5 vs $14 per 1M).
Task by task
| Task area | Gemini 3.1 Pro Preview | Gemini 3.5 Flash | Winner |
|---|---|---|---|
| Legal & HR |
#17
/ 107
Excellent
|
#71
/ 107
Strong
|
Gemini 3.1 Pro Preview |
| Investor & Pitch |
#10
/ 63
Strong
|
#51
/ 63
Usable
|
Gemini 3.1 Pro Preview |
| Coding |
#55
/ 115
Strong
|
#24
/ 115
Strong
|
Gemini 3.5 Flash |
| Frontend & Landing Pages |
#31
/ 106
Needs editing
|
#4
/ 106
Usable
|
Gemini 3.5 Flash |
| Creative & Comedy | #4 / 107 | #26 / 107 | Gemini 3.1 Pro Preview |
| Knowledge & Docs |
#19
/ 107
Strong
|
#41
/ 107
Strong
|
Gemini 3.1 Pro Preview |
| Research & Competitive Analysis |
#40
/ 107
Strong
|
#62
/ 107
Usable
|
Gemini 3.1 Pro Preview |
| Structured Output |
#14
/ 110
Excellent
|
#32
/ 110
Excellent
|
Gemini 3.1 Pro Preview |
| Chef / Home Cooking |
#6
/ 126
Strong
|
#23
/ 126
Usable
|
Gemini 3.1 Pro Preview |
| Translation & Localization |
#14
/ 107
Excellent
|
#29
/ 107
Excellent
|
Gemini 3.1 Pro Preview |
| Summarization & Meeting Notes |
#3
/ 107
Excellent
|
#14
/ 107
Excellent
|
Gemini 3.1 Pro Preview |
| AI Strategy |
#30
/ 126
Strong
|
#20
/ 126
Strong
|
Gemini 3.5 Flash |
| Customer Support |
#1
/ 113
Strong
|
#10
/ 113
Strong
|
Gemini 3.1 Pro Preview |
| Product & Project Management |
#25
/ 107
Excellent
|
#33
/ 107
Strong
|
Gemini 3.1 Pro Preview |
| Presentations & Decks |
#22
/ 107
Excellent
|
#15
/ 107
Excellent
|
Gemini 3.5 Flash |
| Executive Assistant |
#12
/ 109
Strong
|
#17
/ 109
Strong
|
Gemini 3.1 Pro Preview |
| RAG, Safety & Grounding |
#8
/ 110
Excellent
|
#13
/ 110
Excellent
|
Gemini 3.1 Pro Preview |
| Data & Analytics |
#35
/ 107
Excellent
|
#31
/ 107
Excellent
|
Gemini 3.5 Flash |
| Landing Pages |
#2
/ 69
Strong
|
#6
/ 69
Strong
|
Gemini 3.1 Pro Preview |
| Training & Education |
#4
/ 107
Excellent
|
#7
/ 107
Excellent
|
Gemini 3.1 Pro Preview |
| Content & Brand |
#5
/ 124
Strong
|
#6
/ 124
Strong
|
Gemini 3.1 Pro Preview |
| Sales |
#31
/ 107
Strong
|
#32
/ 107
Strong
|
Gemini 3.1 Pro Preview |
Rank = position among every model config we tested in that task area (lower is better). Sorted by biggest gap first.
Frequently asked
Is Gemini 3.1 Pro Preview better than Gemini 3.5 Flash?
Across 22 task areas we benchmarked, Gemini 3.1 Pro Preview ranks higher in 17 and Gemini 3.5 Flash in 5.
Which is cheaper, Gemini 3.1 Pro Preview or Gemini 3.5 Flash?
Gemini 3.5 Flash costs 1.3× less per token ($10.5 vs $14 per 1M).
What is Gemini 3.1 Pro Preview better at?
Gemini 3.1 Pro Preview out-ranks Gemini 3.5 Flash at Legal & HR, Investor & Pitch, Creative & Comedy.
What is Gemini 3.5 Flash better at?
Gemini 3.5 Flash out-ranks Gemini 3.1 Pro Preview at Coding, Frontend & Landing Pages, AI Strategy.
This page is Spring Prompt, running
We just did this for every model. Do it for your prompt.
The rankings above come from running real tasks through real models and scoring every output. Spring Prompt is that same engine — pointed at your prompt, your test cases, and your definition of good.
- Generate test cases from your prompt — no eval set required to start.
- Compare models side by side with quality, cost and latency in one matrix.
- Optimise the winner until the scores say it's ready to ship.
Prompt × model results
12 test cases · 3 evals