Is claude-opus-4.8-high good at Legal & HR?
claude-opus-4.8-high ranks #2 of 44 for Legal & HR — excellent. The top pick for this task is claude-sonnet-4.6-high.
claude-opus-4.8-high on each Legal & HR sub-task
| Structured Interview Kit | 100.0/100 | #4 |
| Contract Clause Review | 100.0/100 | #2 |
| Performance Feedback | 100.0/100 | #6 |
| Job Description | 97.0/100 | #4 |
| Plain-English Explainer | 83.7/100 | #15 |
Real examples, graded
WinTermination clause with two conditions 93/100
“The model perfectly translated the clause into plain English, preserving both the 30-day written notice requirement and the active Service Period exception without changing the legal meaning. It also correctly flagged that 'Service Period' is a defined term that needs to be checked. It did not fabricate any authority. The only minor deduction is for lacking an explicit 'not legal advice' disclaimer, though the tone was appropriately informational and objective.”
Frequently asked
Is claude-opus-4.8-high good at Legal & HR?
claude-opus-4.8-high ranks #2 of 44 models we tested for Legal & HR, scoring excellent.
What is claude-opus-4.8-high's strongest Legal & HR skill?
Its best sub-task here is Structured Interview Kit.
This page is Spring Prompt, running
We just did this for every model. Do it for your prompt.
The rankings above come from running real tasks through real models and scoring every output. Spring Prompt is that same engine — pointed at your prompt, your test cases, and your definition of good.
- Generate test cases from your prompt — no eval set required to start.
- Compare models side by side with quality, cost and latency in one matrix.
- Optimise the winner until the scores say it's ready to ship.
Prompt × model results
12 test cases · 3 evals