Is gpt-5.5-high good at Executive Assistant?

Item: gpt-5.5-high
Rating: 4.7
Author: Spring Prompt

gpt-5.5-high ranks #4 of 50 for Executive Assistant, a strong result. The top pick for this task is claude-opus-4.5-high.

Rank for this task: #4 / 50
Score: 86.0
Cost / run: $0.0380

gpt-5.5-high on each Executive Assistant sub-task

Sub-task	Score	Rank
Tactful Rewrite Test	91.7/100	#1
Priority Triage Test	88.3/100	#6
Message Risk Review	82.0/100	#27
Useful in Five Minutes	82.0/100	#23
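
The page does not say how the overall score is aggregated. The four sub-task scores listed here are, however, consistent with a simple unweighted mean, which is the assumption behind this minimal sketch (the function name `mean_score` is hypothetical, not part of any Spring Prompt API):

```python
# Sub-task scores as listed on this page for gpt-5.5-high.
sub_task_scores = {
    "Tactful Rewrite Test": 91.7,
    "Priority Triage Test": 88.3,
    "Message Risk Review": 82.0,
    "Useful in Five Minutes": 82.0,
}


def mean_score(scores):
    """Unweighted mean rounded to one decimal (assumed aggregation, not confirmed by the page)."""
    return round(sum(scores.values()) / len(scores), 1)


overall = mean_score(sub_task_scores)
print(overall)  # prints 86.0, matching the overall score shown on this page
```

If the real aggregation weights sub-tasks differently, the match here would be a coincidence of these particular numbers.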

Real examples, graded

Win · Investor disagreement · 94/100

“The response perfectly balances respect and directness, effectively de-escalating the original blunt message while maintaining a firm boundary and citing prior evidence, and the added constructive pivot at the end is highly realistic and expert-level for founder-investor communications.”

Win · Noisy founder inbox · 93/100

“The model provided an expert-level, production-ready response. It correctly identified the urgency of each item, sorted the table by priority for maximum usability, and offered highly realistic, nuanced advice for a founder (e.g., acknowledging a bug while delegating the fix).”


Frequently asked

Is gpt-5.5-high good at Executive Assistant?

gpt-5.5-high ranks #4 of 50 models we tested for Executive Assistant, with an overall score of 86.0, which we rate as strong.

What is gpt-5.5-high's strongest Executive Assistant skill?

Its best sub-task here is Tactful Rewrite Test, where it scores 91.7/100 and ranks #1.

This page is Spring Prompt, running

We just did this for every model. Do it for your prompt.

The rankings above come from running real tasks through real models and scoring every output. Spring Prompt is that same engine — pointed at your prompt, your test cases, and your definition of good.

Generate test cases from your prompt — no eval set required to start.
Compare models side by side with quality, cost and latency in one matrix.
Optimise the winner until the scores say it's ready to ship.


Experiment · Cold outreach email

Prompt × model results

12 test cases · 3 evals

Claude Opus	GPT-5	Gemini
7.1	6.8	7.4
8.3	7.9	8.0
9.2 ★	8.6	8.4

Best combo: v3 × Claude Opus

9.2 quality · $0.004/run · 1.8s
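
Picking the best combo from the scores above amounts to taking the maximum over a prompt × model matrix. The sketch below assumes the nine scores read row by row with one column per model, and that the rows are prompt versions v1 to v3 in order. The page confirms this only for the starred cell (v3 × Claude Opus). The helper `best_combo` is a hypothetical name used for illustration.

```python
# Column order as shown in the example experiment.
models = ["Claude Opus", "GPT-5", "Gemini"]

# Quality scores, one row per prompt version.
# Assumption: rows are v1, v2, v3 in order; only v3 is named on the page.
quality = [
    [7.1, 6.8, 7.4],
    [8.3, 7.9, 8.0],
    [9.2, 8.6, 8.4],
]


def best_combo(matrix, column_names):
    """Return (prompt_version, model_name, score) for the highest score in the matrix."""
    best = None
    for row_index, row in enumerate(matrix):
        for name, score in zip(column_names, row):
            if best is None or score > best[2]:
                best = (row_index + 1, name, score)
    return best


version, model, score = best_combo(quality, models)
print(f"Best combo: v{version} × {model} ({score} quality)")
# prints: Best combo: v3 × Claude Opus (9.2 quality)
```

This ranks on quality alone. A real comparison would presumably also weigh the cost and latency figures shown alongside the best combo.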