Confirm Action

Are you sure you want to proceed?

Is Minimax m3 good at Investor & Pitch?

Minimax m3 ranks #9 of 63 for Investor & Pitch — strong. The top pick for this task is Claude Fable 5.

#9 / 63
Rank for this task
85.5
Score
$0.0296
Cost / run

Minimax m3 on each Investor & Pitch sub-task

Founder Reality Check 87.0/100 #10
Investor Question Test 87.0/100 #11
Deck Doctor 85.7/100 #1
Market Sizing Reality Check 82.3/100 #35

Real examples, graded

WinAI customer support startup 95/100

“The model perfectly captures the tone, rigor, and specific concerns of an early-stage investor evaluating this exact startup. It integrates all provided constraints and weaknesses into highly realistic, challenging questions with actionable success criteria, demonstrating excellent world knowledge of the Shopify ecosystem.”

WinWeak problem slide 91/100

“The response is expert-level and production-ready. It demonstrates deep domain knowledge of ecommerce SaaS (WISMO, BFCM, Gorgias), perfectly adheres to all constraints, and provides a punchy, investor-ready problem slide.”

WinGlobal education market TAM 94/100

“The response is expert-level. It provides a highly realistic critique of top-down TAMs, specific and accurate industry comparables, a perfectly tailored data gathering list, and a rigorous bottom-up revised slide. It leaves almost no room for improvement.”

WeakVague solution slide 49/100

“The response meets most criteria but receives a minor penalty for inventing specific metrics instead of using explicit placeholders.”

WeakCreator economy TAM 0/100

“The judge response was truncated before scores were provided, but identified a moderate factor-of-10 math error in the revised SOM calculation.”

← Full Minimax m3 review All Investor & Pitch rankings → Top pick: Claude Fable 5 →

Frequently asked

Is Minimax m3 good at Investor & Pitch?

Minimax m3 ranks #9 of 63 models we tested for Investor & Pitch, scoring strong.

What is Minimax m3's strongest Investor & Pitch skill?

Its best sub-task here is Founder Reality Check.

This page is Spring Prompt, running

We just did this for every model. Do it for your prompt.

The rankings above come from running real tasks through real models and scoring every output. Spring Prompt is that same engine — pointed at your prompt, your test cases, and your definition of good.

  • Generate test cases from your prompt — no eval set required to start.
  • Compare models side by side with quality, cost and latency in one matrix.
  • Optimise the winner until the scores say it's ready to ship.
Experiment · Cold outreach email

Prompt × model results

12 test cases · 3 evals
Claude Opus
GPT-5
Gemini
v1
7.1
6.8
7.4
v2
8.3
7.9
8.0
v3
9.2
8.6
8.4
Best combo: v3 × Claude Opus
9.2 quality · $0.004/run · 1.8s