Filter
Exclude
Time range
-
Near
We are broadening the scope of the blog posts on MathArena, might want to extend it further to allow for all kinds of analyses on AI4Math, not sure how exactly it should look like though.
A blog post about our paper is now available on MathArena! matharena.ai/optimizing_agen…
1
6
634
Pythagoras-Prover: Advancing efficient formal proving via augmented Lean formalisation. ~ Joshua Ong Jun Leang, Zheng Zhao, Mihaela Cătălina Stoian, Qiyuan Xu, Haonan Li, Wenda Li, Shay B. Cohen, Eleonora Giunchiglia. arxiv.org/abs/2606.12594v1 #AI4Math #LeanProver #ITP
1
8
554
Artificial intelligence for mathematical reasoning: an integrated survey of language models, neuro-symbolic systems, and verified discovery. ~ Syed Rifat Raiyan, Mohsinul Kabir, Hasan Mahmud, Md Kamrul Hasan. arxiv.org/abs/2606.08728v1 #AI4Math
2
16
559
The next way to build agents is to have them design their own graph structure. We demonstrated it for AI4Math. That is just one use case. Hitting SOTA with all open-weights was truly amazing—and it shows that OSS models are great. Amazing to work with such brilliant people!
Introducing Goedel-Architect: an open-source framework for formal theorem proving in Lean 4. Using the open-weight DeepSeek-V4-Flash (284B-A13B), it reaches state-of-the-art results, rivaling proprietary systems at a fraction of the cost. It solves 4/6 on IMO 2025, 11/12 on Putnam 2025, and 3/6 on USAMO 2026. On PutnamBench it solves 88.8% (597/672) at just ~$1.65 per problem. Paper: arxiv.org/abs/2606.06468 Project page: goedelarchitect.github.io/
4
51
AI4Math なんだな
2
Replying to @Al4xWr1ght
It is not easy to open-source things at Google, at least not quickly. I am not aware of any Google entry, but I don't know much about the other teams working on AI4Math (though they would all encounter the same logistical difficulty).
1
134
Is @ycombinator Paxel becoming the new spotify unwrapped? lol #ai #agent #yc #ai4math
1
3
156
Very interesting talk by @JaumedeDios at the @CRMatematica organized by @matdepuab. "The goal of a PhD is to learn, papers are by-products" (my transcription) cc @therfer #AI4math
1
2
10
488