Raphi-2Code

Raphi-2Code

1,557 Photos and videos

Tweets

Pinned Tweet

Raphi-2Code

@R2Cdev_

Jun 12

GPT-5.5 Pro is going to be included in the Pro plan in Codex

Leon Lin

@LexnLin

Jun 11

that would be LEGENDARY

802

192,521

Raphi-2Code

Raphi-2Code

@R2Cdev_

22h

The new Siri is 🔥

1:25

627

Raphi-2Code

Raphi-2Code

@R2Cdev_

23h

Kimi Code is as good in FrontierMath Tier 4 as GPT-5 mini and GPT-5.4 nano 🙈

11,676

Mark Gurman

Raphi-2Code retweeted

Mark Gurman

@markgurman

Jun 13

It’s easier than ever to use ChatGPT - and in the future other third-party AI models installed via the App Store - inside of Siri and the Siri app. All the prep work is in there for Siri to be a platform for both Apple’s own AI and rival options.

2,171

196,058

Raphi-2Code

Raphi-2Code

@R2Cdev_

Jun 13

🥀

Acer @AcerFur

Jun 12

Claude Fable 5 result for FrontierMath T4 has just come in and it is vastly SoTA.

147

Raphi-2Code

Raphi-2Code

@R2Cdev_

Jun 13

Claude Fable 5 🪦💔🥀

Raphi-2Code

Raphi-2Code

@R2Cdev_

Jun 13

what?

Anthropic

@AnthropicAI

Jun 13

The US government, citing national security authorities, has issued an export control directive to suspend all access to Fable 5 and Mythos 5 by any foreign national, whether inside or outside the United States, including foreign national Anthropic employees. The net effect of this order is that we must abruptly disable Fable 5 and Mythos 5 for all our customers to ensure compliance. Access to all other Claude models is not affected. We apologize for this disruption to our customers. We believe this is a misunderstanding and are working to restore access as soon as possible. Read our full statement: anthropic.com/news/fable-myt…

303

Raphi-2Code

Raphi-2Code

@R2Cdev_

Jun 13

😭

Artificial Analysis

@ArtificialAnlys

Jun 13

Raphi-2Code

Raphi-2Code

@R2Cdev_

Jun 12

“Effective Altruism”

Lisan al Gaib

@scaling01

Jun 11

Fable 5 refused 200 out of 200 ProgramBench tasks lmao

1,489

Raphi-2Code

Raphi-2Code

@R2Cdev_

Jun 12

Wow! This is even cheaper as the trend!

Jake

@JakeKAllDay

Jun 11

Cost token data from @ArtificialAnlys is finally out (this level of detail is what makes them such a useful benchmark🙂). More or less as guessed: incremental boost on benches vs opus 4.8, but more than 2x the cost ( 130%). 4.5x the cost of gpt 5.5 high for ~10% better perf.

2,105

Raphi-2Code

Raphi-2Code

@R2Cdev_

Jun 12

what????

Harshith

@HarshithLucky3

Jun 12

Expanded voice mode showing in CODEX 👀👀

2,555

Raphi-2Code

Raphi-2Code

@R2Cdev_

Jun 12

WHAT? WOW???

OpenAI

@OpenAI

Jun 12

We heard you wanted to use Codex rate limit resets on your own time. Starting today, we’re rolling out the ability to save rate limit resets to use later. We’re starting Go, Plus, Pro, and Business users with one free reset:

0:28

984

Raphi-2Code

Raphi-2Code

@R2Cdev_

Jun 12

WhHAT???

Harshith

@HarshithLucky3

Jun 12

GPT 5.5 Pro showing in CODEX 👀👀

7,214

Raphi-2Code

Raphi-2Code

@R2Cdev_

Jun 12

Over 450 followers!

216

Arena.ai

Raphi-2Code retweeted

Arena.ai

@arena

Jun 11

GPT-5.5 (xHigh) ranks #2 on Agent Arena ( 10.6% net improvement), making it the highest-ranked OpenAI model closely behind Claude Fable 5 (High). Per signal breakdown, GPT-5.5 (xHigh) ranks #1 in Praise vs. Complaint ( 29.4%) and Bash Recovery ( 14.1%), scoring higher than Claude Fable 5 (High) on both signals. It trails Claude Fable 5 (High) on Confirmed Success ( 5.4% vs. 17.6%) and Steerability ( 1.9% vs. 5.4%). Agent Arena evaluates models on millions of real-world, long-horizon agentic tasks. Models use tools like web search, filesystem, and terminal to complete complex workflows: writing code, creating slide decks, researching the web, building apps, and analyzing documents. We use causal tracing to measure model performance across real-world agentic tasks. More breakdown of GPT-5.5 (xHigh) across five signals in the thread.

Arena.ai

@arena

Jun 4

Introducing Agent Mode: Agentic AI is now measured in the Arena. Agent Mode can do deep research, create reports, generate images, build websites, debug code, and more. It completes more complex tasks by using tools like web search, bash in a sandbox environment, image generation, file writing, and asking follow-up questions. Frontier models are waiting for you in Agent Mode to take on real-world tasks. GPT-5.5, Claude Opus 4.7, Gemini 3.1 Pro, and top open models. Test them yourself.

0:44

472

45,672

Raphi-2Code

Raphi-2Code

@R2Cdev_

Jun 11

WHEN GERMANY???????????????????????????????

Tesla Europe, Middle East & Africa

@teslaeurope

Jun 11

FSD Supervised now approved in Belgium 🇧🇪 Rollout will begin soon

0:38

665

Raphi-2Code

Raphi-2Code

@R2Cdev_

Jun 8

GPT-5.6 release checkpoint "kindle" Peacock SVG animation

0:11

Raphi-2Code

@R2Cdev_

Jun 7

Claude Mythos (high) Thank you @mirochill!

0:04

265

34,441

Raphi-2Code

Raphi-2Code

@R2Cdev_

Jun 11

this is @petergostev's prompt btw

Raphi-2Code

Raphi-2Code

@R2Cdev_

Jun 11

Lmarena’s video leaderboard doesn’t match real world usage!

667

Raphi-2Code

Raphi-2Code

@R2Cdev_

Jun 11

Wow! Recraft Vector and Arrow-1.0 are a lot better in real world usage, but this result seems to be great!

724

Angel 🌼

Raphi-2Code retweeted

Angel 🌼

@Angaisb_

Jun 11

I won't be able to see what OpenAI releases today until like 10 pm, I'll be 4 hours or so in the cinema Thankfully it seems like it won't be GPT-5.6, maybe price cuts today

122

7,027