🚨 New leaderboard shake-up 🚨
Claude Opus 4.6 just took the #1 spot on GDPval-AA, edging past GPT-5.2 in agentic, real-world knowledge work.
Is this a real shift in the AI power balance or just one benchmark? 👇🤔
#AI #LLMs #AIBenchmarks #openAI #chatGPT #xAI #samaltman