Is gpt-5.5-high good at Executive Assistant?
gpt-5.5-high ranks #4 of 50 models for Executive Assistant, a strong result. The top pick for this task is claude-opus-4.5-high.
gpt-5.5-high on each Executive Assistant sub-task
| Sub-task | Score | Rank |
| --- | --- | --- |
| Tactful Rewrite Test | 91.7/100 | #1 |
| Priority Triage Test | 88.3/100 | #6 |
| Message Risk Review | 82.0/100 | #27 |
| Useful in Five Minutes | 82.0/100 | #23 |
Real examples, graded
Win: Investor disagreement (94/100)
“The response perfectly balances respect and directness, effectively de-escalating the original blunt message while maintaining a firm boundary and citing prior evidence, and the added constructive pivot at the end is highly realistic and expert-level for founder-investor communications.”
Win: Noisy founder inbox (93/100)
“The model provided an expert-level, production-ready response. It correctly identified the urgency of each item, sorted the table by priority for maximum usability, and offered highly realistic, nuanced advice for a founder (e.g., acknowledging a bug while delegating the fix).”
Frequently asked
Is gpt-5.5-high good at Executive Assistant?
gpt-5.5-high ranks #4 of the 50 models we tested for Executive Assistant, which is a strong result.
What is gpt-5.5-high's strongest Executive Assistant skill?
Its best sub-task here is Tactful Rewrite Test.
This page is Spring Prompt, running
We just did this for every model. Do it for your prompt.
The rankings above come from running real tasks through real models and scoring every output. Spring Prompt is that same engine — pointed at your prompt, your test cases, and your definition of good.
- Generate test cases from your prompt — no eval set required to start.
- Compare models side by side with quality, cost and latency in one matrix.
- Optimise the winner until the scores say it's ready to ship.
Prompt × model results
12 test cases · 3 evals