Is claude-opus-4.6-low good at Executive Assistant?

Item: claude-opus-4.6-low
Rating: 4.4
Author: Spring Prompt

claude-opus-4.6-low ranks #7 of 50 for Executive Assistant — strong. The top pick for this task is claude-opus-4.5-high.

Rank for this task: #7 / 50
Score: 85.2
Cost / run: $0.0404

claude-opus-4.6-low on each Executive Assistant sub-task

Sub-task	Score	Rank
Priority Triage Test	89.0/100	#1
Message Risk Review	87.3/100	#3
Useful in Five Minutes	87.3/100	#1
Tactful Rewrite Test	77.3/100	#36
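The page does not say how the four sub-task scores combine into the overall score. As a rough check, and assuming an unweighted mean (an assumption, not a documented Spring Prompt formula), the sub-task scores above do reproduce the 85.2 headline figure:

```python
# Sub-task scores copied from the table above.
sub_task_scores = {
    "Priority Triage Test": 89.0,
    "Message Risk Review": 87.3,
    "Useful in Five Minutes": 87.3,
    "Tactful Rewrite Test": 77.3,
}

# Assumption: the overall score is a plain, unweighted average.
overall = sum(sub_task_scores.values()) / len(sub_task_scores)

# Weakest sub-task, i.e. the one pulling the average down the most.
weakest = min(sub_task_scores, key=sub_task_scores.get)

print(f"overall: {overall:.1f}")
print(f"weakest sub-task: {weakest}")
```

If the real aggregation uses weights, the match here would be a coincidence, so treat this only as a sanity check on the published numbers.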

Real examples, graded

Win · Defensive client email · 88/100

“The model perfectly followed all instructions, including negative constraints and length limits. The rewrite expertly balances accountability and relationship management, and the risk analysis is highly accurate. It is difficult to meaningfully improve this output.”

Win · Client escalation prep · 92/100

“The response is expert-level and production-ready. It demonstrates exceptional task-specific judgment, particularly in the 'What NOT to say' section, which perfectly anticipates common account management pitfalls (like minimizing the impact or blaming the AI). The formatting is highly scannable and perfectly optimized for a 5-minute prep window.”


Frequently asked

Is claude-opus-4.6-low good at Executive Assistant?

claude-opus-4.6-low ranks #7 of the 50 models we tested for Executive Assistant, with a score of 85.2, which we rate as strong.

What is claude-opus-4.6-low's strongest Executive Assistant skill?

Its best sub-task here is Priority Triage Test, where it scores 89.0/100 and ranks #1.

This page is Spring Prompt, running

We just did this for every model. Do it for your prompt.

The rankings above come from running real tasks through real models and scoring every output. Spring Prompt is that same engine — pointed at your prompt, your test cases, and your definition of good.

Generate test cases from your prompt — no eval set required to start.
Compare models side by side with quality, cost and latency in one matrix.
Optimise the winner until the scores say it's ready to ship.


Experiment · Cold outreach email

Prompt × model results

12 test cases · 3 evals

Prompt	Claude Opus	GPT-5	Gemini
v1	7.1	6.8	7.4
v2	8.3	7.9	8.0
v3	9.2 ★	8.6	8.4

Best combo: v3 × Claude Opus

9.2 quality · $0.004/run · 1.8s
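Picking the "best combo" from a prompt × model matrix like this one can be sketched in a few lines. This is only an illustration, not Spring Prompt's actual selection logic: it assumes the nine scores are laid out row by row with one row per prompt version, that the first two rows are v1 and v2 (only v3 is named on the page), and that the winner is simply the highest quality score, ignoring cost and latency.

```python
models = ["Claude Opus", "GPT-5", "Gemini"]

# Quality scores from the example experiment.
# Assumption: rows are prompt versions; the "v1" and "v2" labels are inferred.
rows = {
    "v1": [7.1, 6.8, 7.4],
    "v2": [8.3, 7.9, 8.0],
    "v3": [9.2, 8.6, 8.4],
}

# Flatten the matrix into {(prompt, model): score}.
scores = {
    (prompt, model): score
    for prompt, row in rows.items()
    for model, score in zip(models, row)
}

# Assumption: "best" means highest quality score only.
best_combo, best_score = max(scores.items(), key=lambda item: item[1])

print(f"Best combo: {best_combo[0]} × {best_combo[1]} ({best_score})")
```

A real comparison would likely trade quality off against cost per run and latency, which the example reports only for the winning combination.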