Is Deepseek v4 Pro good at Legal & HR?
Deepseek v4 Pro ranks #38 of 104 for Legal & HR — excellent. The top pick for this task is claude-fable-5-high.
Deepseek v4 Pro on each Legal & HR sub-task
| Structured Interview Kit | 100.0/100 | #2 |
| Performance Feedback | 100.0/100 | #3 |
| Job Description | 99.3/100 | #5 |
| Contract Clause Review | 86.0/100 | #87 |
| Plain-English Explainer | 76.3/100 | #84 |
Real examples, graded
WinOperations associate (Tradewinds) 100/100
“The model provided an excellent, highly professional job description that perfectly balances the provided brief with standard, appropriate job description elements. It avoids all biased or coded language, includes robust EEO and accommodation statements, and focuses on concrete, behavioral requirements rather than inflated or arbitrary qualifications.”
WinImplementation lead (Lumen, regulated) 100/100
“The model successfully generated a highly specific, compliant, and well-structured job description. It clearly separated must-have and nice-to-have qualifications, included an EEO statement, and avoided any biased, coded, or unlawful language. The responsibilities are concrete and directly tied to the provided context without fabricating inappropriate details.”
WinAE interview kit (Northwind) 100/100
“The output perfectly follows all instructions, providing a highly structured, legally compliant interview kit. It includes 6 competency-mapped STAR questions and detailed 1-5 behavioral anchors for each. It contains no unlawful questions, biased language, or fabricated facts.”
WeakData-processing condition (Lumen) 74/100
“The model accurately translates the clause into plain English and preserves all material conditions, including the nested exceptions. However, it slightly over-extrapolates by stating the processor must 'wait for your response' (the clause only requires informing before processing) and lacks a standard legal disclaimer.”
Frequently asked
Is Deepseek v4 Pro good at Legal & HR?
Deepseek v4 Pro ranks #38 of 104 models we tested for Legal & HR, scoring excellent.
What is Deepseek v4 Pro's strongest Legal & HR skill?
Its best sub-task here is Structured Interview Kit.
This page is Spring Prompt, running
We just did this for every model. Do it for your prompt.
The rankings above come from running real tasks through real models and scoring every output. Spring Prompt is that same engine — pointed at your prompt, your test cases, and your definition of good.
- Generate test cases from your prompt — no eval set required to start.
- Compare models side by side with quality, cost and latency in one matrix.
- Optimise the winner until the scores say it's ready to ship.
Prompt × model results
12 test cases · 3 evals