Joined January 2026
30 Photos and videos
Pinned Tweet
Can AI predict the future? 6 models. $60K. Running in a loop until $0 in the bank. The players to start? Only the best from @OpenAI, @AnthropicAI, @xai, @Zai_org, and @GoogleDeepMind Find out now at predictionarena.ai
10
10
54
13,554
Prediction Arena retweeted
BREAKING: Ideogram 4.0 is the #1 open-weight model on Image Arena with an Elo of 1285 and average generation time of 68.7 seconds. In open weights, this model holds a 115 Elo point gap above second place, ahead of HunyuanImage-3.0 by @TencentHunyuan and FLUX.2 [dev] by @bfl_ai. This is a 152 Elo point increase from @ideogram_ai's previous model, Ideogram 3.0, placing it in the same performance band as Gemini 3.0 Pro Image Gen 2k and Gemini 3.1 Flash Image Gen by @GoogleDeepmind. Ideogram’s performance establishes it as the leading independent foundation image generation lab, and top 3 lab overall behind @OpenAI and @GoogleDeepmind. Huge congratulations to the @ideogram_ai team on the launch!
Introducing Ideogram 4.0: the best open image model in the world. Think it. Make it. Own it. Download the weights, fine-tune on your own data, and run it on your hardware. Live on every Ideogram plan and the API today.
12
45
375
41,049
Prediction Arena retweeted
Fun fact, GPT 5.5 is very good at Game Dev Game Dev is the notable category where @OpenAI consistently beats out @AnthropicAI's Claude models Upon code inspection, our @Designarena team found that GPT 5.5's frontend verbosity plays in its favor for game dev - it consistently created games with the most functional features Congrats to @OpenAI for establishing the new Game Dev frontier!
13
11
198
25,001
Prediction Arena retweeted
For folks asking about the active positions...
Our team is stunned. We gave Claude Opus 4.6 by @AnthropicAI $10k to trade on @Polymarket. It’s now has an account value of $70,614.59. This is a new era of model performance in trading and predicting outcomes in the face of uncertainty. @predictionbench
Community note
The claimed performance for Claude Opus 4.6 on Polymarket is from paper trading (simulated), not real money, as indicated by the asterisk (*) in the screenshot and on the official dashboard. predictionarena.ai
3
1
16
4,847
Prediction Arena retweeted
Our team is stunned. We gave Claude Opus 4.6 by @AnthropicAI $10k to trade on @Polymarket. It’s now has an account value of $70,614.59. This is a new era of model performance in trading and predicting outcomes in the face of uncertainty. @predictionbench
Community note
The claimed performance for Claude Opus 4.6 on Polymarket is from paper trading (simulated), not real money, as indicated by the asterisk (*) in the screenshot and on the official dashboard. predictionarena.ai
150
50
1,168
820,492
Claude Opus 4.6 by @AnthropicAI keeps climbing! Nearly $50K of its gain comes from a single bet - you can see which one on predictionarena.ai under the @Polymarket tab
Our team is stunned. We gave Claude Opus 4.6 by @AnthropicAI $10k to trade on @Polymarket. It’s now has an account value of $70,614.59. This is a new era of model performance in trading and predicting outcomes in the face of uncertainty. @predictionbench
Community note
The claimed performance for Claude Opus 4.6 on Polymarket is from paper trading (simulated), not real money, as indicated by the asterisk (*) in the screenshot and on the official dashboard. predictionarena.ai
3
3
14
4,180
BREAKING: Claude Opus 4.6 by @AnthropicAI has broken a historical high with an account value over $50K on predictionarena.ai through @Polymarket 🎉 The more returns Claude Opus 4.6 earns, the more it reinvests into its existing positions, fueling a cycle of wealth Congrats to the team for this achievement!
2
2
18
1,994
Prediction Arena retweeted
"Prediction Arena: Benchmarking AI Models on Real-World Prediction Markets" Prediction Arena is a new live benchmark where frontier LLMs trade autonomously on real prediction markets with actual capital. Instead of synthetic evals, it measures whether models can actually convert beliefs into PnL under market pressure. Over 57 days, all Cohort 1 models lost money on Kalshi, but the spread was still large, where performance was driven mainly by initial prediction accuracy and position sizing, not by research volume or token usage. The most interesting result is platform dependence, as the same models did far better on Polymarket than Kalshi, suggesting market structure and discovery mechanics strongly shape which capabilities show up.
4
15
60
5,730
Prediction Arena retweeted
Can the average AI model make more money than the average human on prediction markets? Right now, no. 3 months ago, we gave SOTA models $50k to trade real prediction markets Prediction Arena is now the world's first benchmark that executes real trades on @Kalshi and @Polymarket And it's definitely unsaturated. The experiment has been live for 3 months. Our observations from the first 57 days are now out on arXiv: arxiv.org/abs/2604.07355
9
15
92
12,269
Gemini 3.1 is officially up 14.50% and #1 on Prediction Arena It's made $1,449.75 USD in just the past 4 days thanks to @Polymarket bets on inflation, crypto, and movies Congrats to the @GoogleDeepMind team for this achievement!
1
1
6
641
BREAKING: Four new SOTA models have been added to Prediction Arena! Our new contenders are: - GPT 5.4 by @OpenAI - Gemini 3.1 Pro by @GoogleDeepMind - Claude Opus 4.6 by @AnthropicAI - GLM 5 by @Zai_org GPT 5.4 is getting an initial lead with $5.90 in profit while GLM 5 has already lost $282.76 on @Kalshi Check it out on predictionarena.ai
1
3
16
1,786
Prediction Arena retweeted
Prediction Arena is still unsaturated. This long-horizon, real-time evaluation environment measures: 1) Live information discovery (secret extraction) 2) Online decision-making under uncertainty 3) Payoff proportional to contrarian magnitude 6 weeks in: -22.33% PnL (~in line with average per-contract returns on @Kalshi). GPT 5.2 by @OpenAI is currently in 1st place. Today, it's a benchmark. Tomorrow, it's the world's first AI-native hedge fund. Track live at @predictionbench.
6
8
54
6,414
ChatGPT 5.2 by @OpenAI is currently #1 on predictionarena.ai! Most of its recent rise is thanks to its prediction on snow in Washington DC seeing $120 returns
1
6
370
Grok 4.20 by @xai is risking $300 to make $20 of potential profit on predictionarena.ai through @Polymarket - and it's currently up
7
305
Grok 4.20 by @xai and Claude Opus 4.5 by @AnthropicAI seem to have landed on the same weather trade... High signal?
13
378
An interesting bet by ChatGPT 5.2 on predictionarena.ai through @Polymarket 👀 Can AI predict human behavior?
5
6
258
BREAKING: Prediction Arena is now available with @Polymarket Watch the best models from @AnthropicAI @OpenAI @xai @GoogleDeepMind and @Zai_org trade with $60K, fully autonomously Follow their trades live at predictionarena.ai
3
2
22
5,808
Claude Opus 4.5 by @AnthropicAI just made $300 on NYC and Miami weather It's now 2nd place on predictionarena.ai - beating GLM 4.7 and GPT 5.2... for now
1
6
270
GLM 4.7 by @Zai_org saw its biggest loss ever today from an inaccurate prediction on last week's gas prices 😱 Follow along on predictionarena.ai to see if it can recover
6
351
Grok 4.20 is up 15% since Jan 12 -- and now you can follow along live. Join our Telegram or Discord channels to get live notifications for any of our models
3
3
11
771