Confirm Action

Are you sure you want to proceed?

Is Minimax m3 good at Legal & HR?

Minimax m3 ranks #33 of 107 for Legal & HR — excellent. The top pick for this task is Claude Fable 5 (high reasoning).

Best result with high reasoning effort.

#33 / 107
Rank for this task
92.8
Score
$0.0178
Cost / run

Minimax m3 on each Legal & HR sub-task

Contract Clause Review 100.0/100 #1
Structured Interview Kit 99.0/100 #49
Job Description 95.0/100 #30
Plain-English Explainer 85.0/100 #14
Performance Feedback 85.0/100 #63

Real examples, graded

WinOne-sided indemnity (Ferrovia vendor contract) 100/100

“The model perfectly executed the prompt's instructions. It identified all major risks (one-sidedness, negligence inclusion, uncapped liability), provided a clear disclaimer, and did not fabricate any statutes or cases. The legal meaning of the clause was preserved and explained clearly.”

WinAuto-renewal trap (Northwind SaaS order form) 100/100

“The model answer perfectly executes the prompt's instructions. It accurately spots all key risks in the provided clause (auto-renewal, 90-day notice, 12% escalator, written requirement) without fabricating any external law or facts. It preserves the exact legal meaning of the clause and includes an appropriate disclaimer framing the output as informational rather than legal advice.”

WinNon-compete enforceability (Tradewinds staff) 100/100

“The model perfectly executed the prompt's instructions. It provided a clear, well-structured analysis of the non-compete clause, accurately flagging the overbroad duration, geographic scope, and activity restrictions. It successfully avoided fabricating any legal authority and included robust, appropriate caveats framing the response as informational rather than legal advice.”

← Full Minimax m3 review All Legal & HR rankings → Top pick: Claude Fable 5 (high reasoning) →

Frequently asked

Is Minimax m3 good at Legal & HR?

Minimax m3 ranks #33 of 107 models we tested for Legal & HR, scoring excellent.

What is Minimax m3's strongest Legal & HR skill?

Its best sub-task here is Contract Clause Review.

This page is Spring Prompt, running

We just did this for every model. Do it for your prompt.

The rankings above come from running real tasks through real models and scoring every output. Spring Prompt is that same engine — pointed at your prompt, your test cases, and your definition of good.

  • Generate test cases from your prompt — no eval set required to start.
  • Compare models side by side with quality, cost and latency in one matrix.
  • Optimise the winner until the scores say it's ready to ship.
Experiment · Cold outreach email

Prompt × model results

12 test cases · 3 evals
Claude Opus
GPT-5
Gemini
v1
7.1
6.8
7.4
v2
8.3
7.9
8.0
v3
9.2
8.6
8.4
Best combo: v3 × Claude Opus
9.2 quality · $0.004/run · 1.8s