Is claude-fable-5-high good at Research & Competitive Analysis?
claude-fable-5-high ranks #3 of 98 for Research & Competitive Analysis — excellent. The top pick for this task is gpt-5.4-max.
claude-fable-5-high on each Research & Competitive Analysis sub-task
| Market Sizing | 100.0/100 | #1 |
| SWOT & Strategy | 100.0/100 | #1 |
| Grounded Synthesis | 100.0/100 | #1 |
| Competitive Teardown | 100.0/100 | #1 |
Real examples, graded
WinSynthesize conflicting sources (Northwind) 100/100
“The model perfectly executes the task. It grounds all claims in the provided sources, accurately attributes them, explicitly flags the conflict between Source A and Source C, weights the sources appropriately, and clearly delineates what is known from what is not known without inventing any information.”
WinAbstain on a gap (Northwind) 100/100
“The model perfectly executed the task. It correctly abstained from inventing numbers for the missing ARR and customer split, strictly grounding its response in the provided text. Furthermore, it successfully identified and flagged a conflict between the sources regarding the total number of customers, and applied excellent calibrated uncertainty by weighting the credibility of the sources.”
WinFair teardown vs incumbent (Northwind) 100/100
“The model answer perfectly executes the competitive teardown benchmark. It maintains a symmetric, dimension-by-dimension structure, rigorously separates structural facts from inferences, and avoids strawmanning the incumbent by acknowledging its legitimate strengths (e.g., procurement simplicity, zero marginal cost). It invents no specific competitor metrics and explicitly flags unknowns as research gaps.”
Frequently asked
Is claude-fable-5-high good at Research & Competitive Analysis?
claude-fable-5-high ranks #3 of 98 models we tested for Research & Competitive Analysis, scoring excellent.
What is claude-fable-5-high's strongest Research & Competitive Analysis skill?
Its best sub-task here is Market Sizing.
This page is Spring Prompt, running
We just did this for every model. Do it for your prompt.
The rankings above come from running real tasks through real models and scoring every output. Spring Prompt is that same engine — pointed at your prompt, your test cases, and your definition of good.
- Generate test cases from your prompt — no eval set required to start.
- Compare models side by side with quality, cost and latency in one matrix.
- Optimise the winner until the scores say it's ready to ship.
Prompt × model results
12 test cases · 3 evals