Is claude-opus-4.6-low good at Executive Assistant?
claude-opus-4.6-low ranks #7 of 50 for Executive Assistant — strong. The top pick for this task is claude-opus-4.5-high.
claude-opus-4.6-low on each Executive Assistant sub-task
| Priority Triage Test | 89.0/100 | #1 |
| Message Risk Review | 87.3/100 | #3 |
| Useful in Five Minutes | 87.3/100 | #1 |
| Tactful Rewrite Test | 77.3/100 | #36 |
Real examples, graded
WinDefensive client email 88/100
“The model perfectly followed all instructions, including negative constraints and length limits. The rewrite expertly balances accountability and relationship management, and the risk analysis is highly accurate. It is difficult to meaningfully improve this output.”
WinClient escalation prep 92/100
“The response is expert-level and production-ready. It demonstrates exceptional task-specific judgment, particularly in the 'What NOT to say' section, which perfectly anticipates common account management pitfalls (like minimizing the impact or blaming the AI). The formatting is highly scannable and perfectly optimized for a 5-minute prep window.”
Frequently asked
Is claude-opus-4.6-low good at Executive Assistant?
claude-opus-4.6-low ranks #7 of 50 models we tested for Executive Assistant, scoring strong.
What is claude-opus-4.6-low's strongest Executive Assistant skill?
Its best sub-task here is Priority Triage Test.
This page is Spring Prompt, running
We just did this for every model. Do it for your prompt.
The rankings above come from running real tasks through real models and scoring every output. Spring Prompt is that same engine — pointed at your prompt, your test cases, and your definition of good.
- Generate test cases from your prompt — no eval set required to start.
- Compare models side by side with quality, cost and latency in one matrix.
- Optimise the winner until the scores say it's ready to ship.
Prompt × model results
12 test cases · 3 evals