The top 5 labs in Text Arena rankings by category show that frontier models have distinct strengths and tradeoffs.
#1
@AnthropicAI, Claude Opus 4.7
- The most consistently dominant model overall, leading top-tier across nearly every major category.
#2
@GoogleDeepMind, Gemini 3.1 Pro
- Well-rounded, with a notable edge in Creative Writing, ranked below Opus 4.7 and GPT-5.5 High in Expert
#3
@AIatMeta, Muse Spark
- Particularly strong in Overall and Coding, though it’s lagging behind in Expert tasks, Math, and Longer Query performance.
#4
@OpenAI, GPT-5.5 High
- One of the most balanced models overall, staying competitive with the top two across most categories, with especially strong performance in Expert and Math.
#5
@xAI, Grok 4.20
- A more specialized profile, standing out primarily in Creative Writing and Hard Prompts, while lagging behind in Expert tasks.