Confirm Action

Are you sure you want to proceed?

Is Mistral Medium 3.1 good at Chef / Home Cooking?

Mistral Medium 3.1 ranks #47 of 50 for Chef / Home Cooking — needs editing. The top pick for this task is gemini-3.1-pro-preview-high.

#47 / 50
Rank for this task
60.8
Score
$0.0246
Cost / run

Mistral Medium 3.1 on each Chef / Home Cooking sub-task

Substitution Test 77.7/100 #46
Practical Recipe Test 70.0/100 #42
Dinner Rescue Test 66.0/100 #47
Meal Timing Test 29.7/100 #48

Real examples, graded

WeakPasta sauce without cream 70/100

“The model successfully adapts the recipe with good proportions and clear instructions. However, it hallucinates a culinary science fact (acid does not protect dairy proteins from heat; starch from the pasta water does) and suggests sugar, which was not an available ingredient.”

← Full Mistral Medium 3.1 review All Chef / Home Cooking rankings → Top pick: gemini-3.1-pro-preview-high →

Frequently asked

Is Mistral Medium 3.1 good at Chef / Home Cooking?

Mistral Medium 3.1 ranks #47 of 50 models we tested for Chef / Home Cooking, scoring needs editing.

What is Mistral Medium 3.1's strongest Chef / Home Cooking skill?

Its best sub-task here is Substitution Test.

This page is Spring Prompt, running

We just did this for every model. Do it for your prompt.

The rankings above come from running real tasks through real models and scoring every output. Spring Prompt is that same engine — pointed at your prompt, your test cases, and your definition of good.

  • Generate test cases from your prompt — no eval set required to start.
  • Compare models side by side with quality, cost and latency in one matrix.
  • Optimise the winner until the scores say it's ready to ship.
Experiment · Cold outreach email

Prompt × model results

12 test cases · 3 evals
Claude Opus
GPT-5
Gemini
v1
7.1
6.8
7.4
v2
8.3
7.9
8.0
v3
9.2
8.6
8.4
Best combo: v3 × Claude Opus
9.2 quality · $0.004/run · 1.8s