Confirm Action

Are you sure you want to proceed?

Is Grok 4.20 good at Creative & Comedy?

Grok 4.20 ranks #31 of 44 for Creative & Comedy. The top pick for this task is gpt-5.4-mini.

#31 / 44
Rank for this task
999
Elo
$0.0005
Cost / run

Grok 4.20 on each Creative & Comedy sub-task

Constrained Short Fiction 999.0 #41
Character Voice 999.0 #41
Naming & Branding 999.0 #41
Funny on Command 999.0 #41
← Full Grok 4.20 review All Creative & Comedy rankings → Top pick: gpt-5.4-mini →

Frequently asked

Is Grok 4.20 good at Creative & Comedy?

Grok 4.20 ranks #31 of 44 models we tested for Creative & Comedy.

What is Grok 4.20's strongest Creative & Comedy skill?

Its best sub-task here is Constrained Short Fiction.

This page is Spring Prompt, running

We just did this for every model. Do it for your prompt.

The rankings above come from running real tasks through real models and scoring every output. Spring Prompt is that same engine — pointed at your prompt, your test cases, and your definition of good.

  • Generate test cases from your prompt — no eval set required to start.
  • Compare models side by side with quality, cost and latency in one matrix.
  • Optimise the winner until the scores say it's ready to ship.
Experiment · Cold outreach email

Prompt × model results

12 test cases · 3 evals
Claude Opus
GPT-5
Gemini
v1
7.1
6.8
7.4
v2
8.3
7.9
8.0
v3
9.2
8.6
8.4
Best combo: v3 × Claude Opus
9.2 quality · $0.004/run · 1.8s