Dan Roy

Dan Roy

1,111 Photos and videos

Tweets

Pinned Tweet

Dan Roy

@roydanroy

7 Feb 2023

I'm an AI researcher.

755

307,664

Balázs Pozsgay

Dan Roy retweeted

Balázs Pozsgay

@pozsgaybalazs

Jun 12

Fable achieved a significant breakthrough in one of our open problems. This is a problem where ChatGPT 5.5 could not even begin anything useful. The breakthrough seems legit (although not 100% checked yet), and Fable even claims to have a full solution. >10 hours total runtime so far. A 30 page document with the proofs of some lemmas not yet spelled out. We can not yet know whether Fable indeed has solved it, but even if it is just a partial solution, we are absolutely amazed. More details will follow, and once we are at the end of the story, I will also write a full substack post. Collaboration with István Vona, a postdoc in my group.

1,469

168,760

Acer

Dan Roy retweeted

Acer @AcerFur

Jun 12

FrontierMath got updated to remove erroneous problems and the consequent score change is kinda wild

274

24,299

Dan Roy

Dan Roy

@roydanroy

Jun 12

Co-Mathematician has retained its position relative to other models, save Fable which has taken the lead. I don't think Fable will stay there for long.

Acer @AcerFur

Jun 12

Claude Fable 5 result for FrontierMath T4 has just come in and it is vastly SoTA.

6,703

Kevin Hartnett

Dan Roy retweeted

Kevin Hartnett

@KSHartnett

Jun 11

In two hours I'm hosting a panel discussion with Kevin Buzzard, Johan Commelin, and @AlexKontorovich about math Lean AI. There's still time to join us (see link below) or submit questions in the thread.

3,801

Kording Lab 🦖

Dan Roy retweeted

Kording Lab 🦖@KordingLab

Jun 10

Sorry. I got to say this publicly. I really agree with @karpathy point here. My wife @mioana is a leading economist and we discuss this all the time. The singularity think of the AI community is rather misguided.

Dwarkesh Patel

@dwarkesh_sp

Jun 10

Andrej Karpathy thinks AGI's impact on the economy will just be folded into the existing rate of growth. AI will be barely noticeable in GDP statistics. When he came on the show, I pushed back, saying AGI will cause a massive jump in productivity and growth. Watch our back-and-forth on this:

3:12

190

35,623

Sanjeev Arora

Dan Roy retweeted

Sanjeev Arora

@prfsanjeevarora

Jun 11

Sobering take-away from 1stproof (round 2) 1stproof.org/. OpenAI's vanilla prompt to 5.5pro tinyurl.com/yc8ymuna solves research math 10-40 x cheaper than custom prompts from academic teams. We used Gemini pro. Switching to 5.5pro improves results a lot but costs rise to the level of other academic pipelines :(

First Proof Project

Independent, transparent evaluation of AI in research mathematics.

1stproof.org

161

15,190

Dan Roy

Dan Roy

@roydanroy

Jun 10

Yet another Co-Mathematician-enabled paper. Nice to see mathematicians working with the system to make progress. Follow Daniel Zheng for more AI-for-math announcement from Google.

Daniel Zheng @dhhzheng

Jun 10

Another one from @GergelyBerczi and Y-H Kiem, who recently resolved a conjecture of Aluffi-Chen-Marcolli in collaboration with the AI co-mathematician: arxiv.org/abs/2605.29151 The proof is quite neat, and nice to see the agents using computational evidence to guide the strategy

5,265

Dan Roy

Dan Roy

@roydanroy

Jun 10

If you were a company and you were going to benchmax once, when would you do it?

4,781

Dan Roy

Dan Roy

@roydanroy

Jun 9

Vibe on Mythos seems to be that people don't like being told what they can't do with a model.

536

23,515

Tim G. J. Rudner

Dan Roy retweeted

Tim G. J. Rudner

@timrudner

Jun 9

What if diffusion models could think ahead instead of being greedy at every step?🤔 We introduce: Learned Relay Representations for Forward-Thinking Discrete Diffusion Models

2,906

Dan Roy

Dan Roy

@roydanroy

Jun 7

Queens club, London.

1,674

Dan Roy

Dan Roy

@roydanroy

Jun 5

Warning: once you learn category theory, you'll never be able or willing to talk with people who don't know category theory.

Markus J. Buehler

@ProfBuehlerMIT

Jun 5

We've made a breakthrough in self-evolving AI scientists moving from "search" to "principled discovery": Scientific discovery requires that the search space itself changes, and an AI scientist must perceive this shift without intervention. We built an AI that achieves this for the first time with the ability to discover the scientific vocabulary it reasons in. Evidence, tools, artifacts, verifiers, failures & claims become typed provenance. We show three distinct modalities: 1) retrieval, adding known objects; 2) search, exploring a fixed schema; and critically: 3) discovery, a verified regime transition. We solve the open-endedness evaluation problem by lifting agentic workflows into a typed copresheaf and proving, via a Kan obstruction, that true discovery is not unbounded generation but a verifiable schema expansion: old evidence is transported by Left Kan extension, and genuine novelty is mathematically quantified by the pointwise residual beyond the transported image - separating discovery from mere search and making novelty objective and measurable rather than a subjective judgment or benchmark delta. Our AI scientist is built in a way that does not pre-conceive the approach it chooses; instead, we endow the system with formal power to adapt, evolve, and reason from first principles. Case studies include: 1⃣Builder/Breaker model that discovers mode-conditioned compliance in proteins; 2⃣CategoryScienceClaw that finds anisotropic fiber-network stiffness rules. Great work in collaboration with my graduate student @fwang108_ @MITdeptofBE F.Y. Wang & M.J. Buehler, Self-Revising Discovery Systems for Science: A Categorical Framework for Agentic Artificial Intelligence, arXiv:2606.01444, 2026

1:04

149

2,574

453,239

Dan Roy

Dan Roy

@roydanroy

Jun 5

Just heard Dimitri Bertsekas passed away. We would often chat after his Convex Analysis class. He was always very kind and encouraging of my theoretical pursuits. The guy wrote books like most people write papers. He was a true educator. Sorry that I didn't get to cross paths recently.

305

21,860

Dan Roy

Dan Roy

@roydanroy

Jun 4

Google has provided high quality AI referee services to the most recent NeurIPS, ICML, and STOC conferences.

Tony Feng

@tonylfeng

Jun 1

Proposal: Any AI company that is earnest about "helping mathematicians" (rather than parading the intelligence of its models) should work on making a high quality AI referee service available to journals. Current peer review in mathematics is laughably slow, and accelerating that with AI would be easier and more helpful than autonomously proving the kind of random results we've seen so far, with the notable exception of the Unit Distance Conjecture.

8,710

levent

Dan Roy retweeted

levent

@__alpoge__

Jun 3

over the weekend i had another obvious thing to check, namely whether claude autonomously resolves the famed sum-product conjecture over the reals. answer: yes

402

211,444

Csaba Szepesvari

Dan Roy retweeted

Csaba Szepesvari @CsabaSzepesvari

Jun 3

This was awesome! Don't despair, even if you missed it, the talk will appear on the youtube channel youtube.com/@RLtheory soon!

RL theory seminars

We organize weekly seminars on reinforcement learning theory.

youtube.com

RL Theory Virtual Seminars @RLtheory

Jun 2

Dan's presentation is starting now!

6,041

Thang Luong

Dan Roy retweeted

Thang Luong

@lmthang

May 31

*When you think of AI, think of humanity too* This was the main message of the GStar Summit on AI & Humanity that I organized two days ago (as a breather after Google I/O). For a long time, I was deeply focused on advancing model capabilities toward superhuman reasoning, and I found immense joy when we accomplished our goal, e.g., our IMO-gold milestone in July 2025. I always assumed that someone, somewhere, would just take care of the social impact side for me. It really hit me in Feb 2026 when our math research agent, Aletheia, solved FirstProof problem #7 by flawlessly utilizing heavy mathematical machinery, as noted by our surprised mathematicians. I started to wonder: what's left for humanity? Discussing human values with my wife, Wendy Nguyen, and our dear friend Jean DeSombre, I realized I hadn't been thinking much about humanity while actively charting the course of frontier AI. But as our conversations continued over the past few months, I noticed myself becoming more "humanity-aware." For example, during a team lunch at Google DeepMind, we were chatting about what we would do after we "solved" AI for math. I heard a number of suggestions, e.g., robotics or world models, but nothing explicitly centered on humanity. I told the team that we need to think about empowering humans: one way is to build a model capable of asking questions, from basic paraphrasing all the way to forming conjectures, which would be incredibly useful for both education and model training! At GStar Summit 2026, aside from the deep dives into agentic AI with @edchi, @YiTayML, @preslav_nakov, Phong Nguyen, Myungsub Choi, Noriyuki Kojima, and Hung Bui, we dedicated half of the summit to talking about humanity! @PoShenLoh, awesome as always, proposed the "Thought Full" philosophy and reminded us how people find joy in helping others. Jay Kim shared with us new perspectives about human health in space, showing that space is not that scary to be a part of. Together with other speakers and panelist, Wendy Nguyen, Jean DeSombre, Marc Woo, Laurent El Ghaoui, Tuong Nguyen, Tuoc Huynh, Tuan Cao, and @CurtisSChin, we discussed all aspects of humanity in the age of AI. Our hope is that everyone who walked out of the summit can now find joy in talking and working with each other on human values (either before or after AI)! Thanks to everyone for coming from all over Asia Pacific and Silicon Valley! Many people told me that it's very rare in Vietnam for people to stay from 8am - 6pm with a fully-packed hall of 1,000 people! See you again soon!

Yi Tay

@YiTayML

May 30

My second time as a speaker at gstar summit in Vietnam. This time I am honoured to be chilling (I mean sitting) on a panel with colleagues @lmthang and @edchi 😀. I really enjoyed the vibes and company and meeting new people. 🫡 Events by @lmthang and @newturing have always not only been super impressive in terms of speaker calibre density but also sota in organization and hospitality. 😆 These two days have been a much needed retreat for me and it's been really timely! 😀 Also enjoyed hanging out with @edchi, I wanted to try to get a photo with @lmthang but it was impossible to catch this Vietnamese national hero for a selfie. 😀

10,273

Omar Khattab

Dan Roy retweeted

Omar Khattab

@lateinteraction

May 26

your novel idea, when you ask an llm to fill in the details

Drew Breunig

@dbreunig

May 26

We need a name for this, because Armin is putting his finger on a problem that’s everywhere: people running their writing through an LLM because they think it makes it clearer, when in actuality it sands off all the detail.

949

97,479

Dan Roy

Dan Roy

@roydanroy

May 26

Sold out before early bird deadline. I wish there had been a running progress bar because that was not on my bingo card. Hoping they set up a waiting list or some in person registrations.

ICML Conference @icmlconf

May 24

General registration for #ICML2026 is expected to fill up very soon. (Some spots stay reserved for paper authors, sponsors, workshop organizers.) Today is also the deadline for early registration. If you plan to attend in-person, we recommend registering ASAP. Blog for more info:

14,809

Dan Roy

Dan Roy

@roydanroy

May 26

Second math paper citing Google DeepMind's AI co-mathematician. Follow Daniel for future announcements.

Daniel Zheng @dhhzheng

May 26

New paper from Gergely Bérczi and László M. Fehér using the AI co-mathematician, along with AlphaEvolve and GPT 5.5 Pro. arxiv.org/abs/2605.25271 Particularly happy to see the work in section 6, which introduces new conjectures to tackle based on findings from the AI tools.

16,553