@Google DeepMind. On leave, Canada CIFAR AI Chair and Former Research Director, @VectorInst. Professor, @UofT (Statistics/CS). Views are my own.

Joined June 2009
1,111 Photos and videos
Pinned Tweet
7 Feb 2023
I'm an AI researcher.
27
80
755
307,664
Dan Roy retweeted
Fable achieved a significant breakthrough in one of our open problems. This is a problem where ChatGPT 5.5 could not even begin anything useful. The breakthrough seems legit (although not 100% checked yet), and Fable even claims to have a full solution. >10 hours total runtime so far. A 30 page document with the proofs of some lemmas not yet spelled out. We can not yet know whether Fable indeed has solved it, but even if it is just a partial solution, we are absolutely amazed. More details will follow, and once we are at the end of the story, I will also write a full substack post. Collaboration with István Vona, a postdoc in my group.
41
83
1,469
168,760
Dan Roy retweeted
Jun 12
FrontierMath got updated to remove erroneous problems and the consequent score change is kinda wild
19
14
274
24,299
Co-Mathematician has retained its position relative to other models, save Fable which has taken the lead. I don't think Fable will stay there for long.
Jun 12
Claude Fable 5 result for FrontierMath T4 has just come in and it is vastly SoTA.
6
2
25
6,703
Dan Roy retweeted
In two hours I'm hosting a panel discussion with Kevin Buzzard, Johan Commelin, and @AlexKontorovich about math Lean AI. There's still time to join us (see link below) or submit questions in the thread.
2
5
22
3,801
Dan Roy retweeted
Sorry. I got to say this publicly. I really agree with @karpathy point here. My wife @mioana is a leading economist and we discuss this all the time. The singularity think of the AI community is rather misguided.
Andrej Karpathy thinks AGI's impact on the economy will just be folded into the existing rate of growth. AI will be barely noticeable in GDP statistics. When he came on the show, I pushed back, saying AGI will cause a massive jump in productivity and growth. Watch our back-and-forth on this:
18
21
190
35,623
Dan Roy retweeted
Sobering take-away from 1stproof (round 2) 1stproof.org/. OpenAI's vanilla prompt to 5.5pro tinyurl.com/yc8ymuna solves research math 10-40 x cheaper than custom prompts from academic teams. We used Gemini pro. Switching to 5.5pro improves results a lot but costs rise to the level of other academic pipelines :(
3
16
161
15,190
Yet another Co-Mathematician-enabled paper. Nice to see mathematicians working with the system to make progress. Follow Daniel Zheng for more AI-for-math announcement from Google.
Another one from @GergelyBerczi and Y-H Kiem, who recently resolved a conjecture of Aluffi-Chen-Marcolli in collaboration with the AI co-mathematician: arxiv.org/abs/2605.29151 The proof is quite neat, and nice to see the agents using computational evidence to guide the strategy
1
21
5,265
If you were a company and you were going to benchmax once, when would you do it?
4
7
4,781
Vibe on Mythos seems to be that people don't like being told what they can't do with a model.
28
5
536
23,515
Dan Roy retweeted
What if diffusion models could think ahead instead of being greedy at every step?🤔 We introduce: Learned Relay Representations for Forward-Thinking Discrete Diffusion Models
1
7
33
2,906
Queens club, London.
1,674
Warning: once you learn category theory, you'll never be able or willing to talk with people who don't know category theory.
We've made a breakthrough in self-evolving AI scientists moving from "search" to "principled discovery": Scientific discovery requires that the search space itself changes, and an AI scientist must perceive this shift without intervention. We built an AI that achieves this for the first time with the ability to discover the scientific vocabulary it reasons in. Evidence, tools, artifacts, verifiers, failures & claims become typed provenance. We show three distinct modalities: 1) retrieval, adding known objects; 2) search, exploring a fixed schema; and critically: 3) discovery, a verified regime transition. We solve the open-endedness evaluation problem by lifting agentic workflows into a typed copresheaf and proving, via a Kan obstruction, that true discovery is not unbounded generation but a verifiable schema expansion: old evidence is transported by Left Kan extension, and genuine novelty is mathematically quantified by the pointwise residual beyond the transported image - separating discovery from mere search and making novelty objective and measurable rather than a subjective judgment or benchmark delta. Our AI scientist is built in a way that does not pre-conceive the approach it chooses; instead, we endow the system with formal power to adapt, evolve, and reason from first principles. Case studies include: 1⃣Builder/Breaker model that discovers mode-conditioned compliance in proteins; 2⃣CategoryScienceClaw that finds anisotropic fiber-network stiffness rules. Great work in collaboration with my graduate student @fwang108_ @MITdeptofBE F.Y. Wang & M.J. Buehler, Self-Revising Discovery Systems for Science: A Categorical Framework for Agentic Artificial Intelligence, arXiv:2606.01444, 2026
62
149
2,574
453,239
Just heard Dimitri Bertsekas passed away. We would often chat after his Convex Analysis class. He was always very kind and encouraging of my theoretical pursuits. The guy wrote books like most people write papers. He was a true educator. Sorry that I didn't get to cross paths recently.
6
15
305
21,860
Google has provided high quality AI referee services to the most recent NeurIPS, ICML, and STOC conferences.
Proposal: Any AI company that is earnest about "helping mathematicians" (rather than parading the intelligence of its models) should work on making a high quality AI referee service available to journals. Current peer review in mathematics is laughably slow, and accelerating that with AI would be easier and more helpful than autonomously proving the kind of random results we've seen so far, with the notable exception of the Unit Distance Conjecture.
1
1
49
8,710
Dan Roy retweeted
over the weekend i had another obvious thing to check, namely whether claude autonomously resolves the famed sum-product conjecture over the reals. answer: yes
8
33
402
211,444
Dan Roy retweeted
This was awesome! Don't despair, even if you missed it, the talk will appear on the youtube channel youtube.com/@RLtheory soon!
Dan's presentation is starting now!
5
18
6,041
Dan Roy retweeted
*​When you think of AI, think of humanity too* ​This was the main message of the GStar Summit on AI & Humanity that I organized two days ago (as a breather after Google I/O). For a long time, I was deeply focused on advancing model capabilities toward superhuman reasoning, and I found immense joy when we accomplished our goal, e.g., our IMO-gold milestone in July 2025. I always assumed that someone, somewhere, would just take care of the social impact side for me. ​It really hit me in Feb 2026 when our math research agent, Aletheia, solved FirstProof problem #7 by flawlessly utilizing heavy mathematical machinery, as noted by our surprised mathematicians. I started to wonder: what's left for humanity? ​Discussing human values with my wife, Wendy Nguyen, and our dear friend Jean DeSombre, I realized I hadn't been thinking much about humanity while actively charting the course of frontier AI. But as our conversations continued over the past few months, I noticed myself becoming more "humanity-aware." ​For example, during a team lunch at Google DeepMind, we were chatting about what we would do after we "solved" AI for math. I heard a number of suggestions, e.g., robotics or world models, but nothing explicitly centered on humanity. I told the team that we need to think about empowering humans: one way is to build a model capable of asking questions, from basic paraphrasing all the way to forming conjectures, which would be incredibly useful for both education and model training! ​At GStar Summit 2026, aside from the deep dives into agentic AI with @edchi, @YiTayML, @preslav_nakov, Phong Nguyen, Myungsub Choi, Noriyuki Kojima, and Hung Bui, we dedicated half of the summit to talking about humanity! ​@PoShenLoh, awesome as always, proposed the "Thought Full" philosophy and reminded us how people find joy in helping others. Jay Kim shared with us new perspectives about human health in space, showing that space is not that scary to be a part of. Together with other speakers and panelist, Wendy Nguyen, Jean DeSombre, Marc Woo, Laurent El Ghaoui, Tuong Nguyen, Tuoc Huynh, Tuan Cao, and @CurtisSChin, we discussed all aspects of humanity in the age of AI. ​Our hope is that everyone who walked out of the summit can now find joy in talking and working with each other on human values (either before or after AI)! ​Thanks to everyone for coming from all over Asia Pacific and Silicon Valley! Many people told me that it's very rare in Vietnam for people to stay from 8am - 6pm with a fully-packed hall of 1,000 people! See you again soon!
May 30
My second time as a speaker at gstar summit in Vietnam. This time I am honoured to be chilling (I mean sitting) on a panel with colleagues @lmthang and @edchi 😀. I really enjoyed the vibes and company and meeting new people. 🫡 Events by @lmthang and @newturing have always not only been super impressive in terms of speaker calibre density but also sota in organization and hospitality. 😆 These two days have been a much needed retreat for me and it's been really timely! 😀 Also enjoyed hanging out with @edchi, I wanted to try to get a photo with @lmthang but it was impossible to catch this Vietnamese national hero for a selfie. 😀
5
5
48
10,273
Dan Roy retweeted
your novel idea, when you ask an llm to fill in the details
We need a name for this, because Armin is putting his finger on a problem that’s everywhere: people running their writing through an LLM because they think it makes it clearer, when in actuality it sands off all the detail.
15
63
949
97,479
Sold out before early bird deadline. I wish there had been a running progress bar because that was not on my bingo card. Hoping they set up a waiting list or some in person registrations.
General registration for #ICML2026 is expected to fill up very soon. (Some spots stay reserved for paper authors, sponsors, workshop organizers.) Today is also the deadline for early registration. If you plan to attend in-person, we recommend registering ASAP. Blog for more info:
3
3
39
14,809
Second math paper citing Google DeepMind's AI co-mathematician. Follow Daniel for future announcements.
New paper from Gergely Bérczi and László M. Fehér using the AI co-mathematician, along with AlphaEvolve and GPT 5.5 Pro. arxiv.org/abs/2605.25271 Particularly happy to see the work in section 6, which introduces new conjectures to tackle based on findings from the AI tools.
3
76
16,553