Aaron Roth

Aaron Roth

272 Photos and videos

Tweets

Pinned Tweet

Aaron Roth @Aaroth

Apr 24

How many samples do you need from an unknown distribution in order to train a model with multicalibration error at most epsilon? Answer: 1/epsilon^3 samples is both necessary and sufficient.

6,057

Aaron Roth

Aaron Roth retweeted

Aaron Roth @Aaroth

Jun 10

Modern LLMs are incredibly good compression algorithms, which can shed light on why autonomous data science agents don't overfit as much as you might think.

Steven Wu

@zstevenwu

Jun 10

Reusing a held-out set adaptively should invite overfitting. Yet in ML we reuse benchmarks for years and they stay informative. Why so little overfitting? By using LLM agents as extreme compression engines, we get new understanding of why. 🧵 Joint work w/ Martin Bertran and @Aaroth

4,056

Steven Wu

Aaron Roth retweeted

Steven Wu

@zstevenwu

Jun 10

7,036

Gautam Kamath

Aaron Roth retweeted

Gautam Kamath @thegautamkamath

May 28

In the last 48h: - Jr researcher asked me wheter to use AI in making talks - Saw two talks, with AI {slop, enhanced} slides Collected my thoughts and wrote a post. Tl;dr: don't steal your own thinking, don't remove *you* from your talks. Also, give a &#@% about your talks.

258

44,322

Sebastien Bubeck

Aaron Roth retweeted

Sebastien Bubeck

@SebastienBubeck

May 20

x.com/i/article/205715053820…

238

1,774

564,184

Timothy Gowers @wtgowers

Aaron Roth retweeted

Timothy Gowers @wtgowers @wtgowers

May 20

AI has now solved a major open problem -- one of the best known Erdos problems called the unit distance problem, one of Erdos's favourite questions and one that many mathematicians had tried. openai.com/index/model-dispr…

An OpenAI model has disproved a central conjecture in discrete geometry

An OpenAI model solved the 80-year-old unit distance problem, disproving a major conjecture in discrete geometry and marking a milestone in AI-driven mathematics.

openai.com

614

3,561

1,489,690

Aaron Roth

Aaron Roth @Aaroth

May 20

A clearly hallucinated citation! NeurIPS 2026 decisions aren't out yet. But wait --- the hallucination is also present in the bibtex entries from openreview openreview.net/forum?id=fAjb… and Google Scholar scholar.googleusercontent.co…

22,234

Aaron Roth

Aaron Roth @Aaroth

May 12

Recently we showed that the minimax optimal rate for multicalibration is T^{2/3}. But that doesn't mean you have to do that badly on all instances. We give an algorithm that can adapt to easy instances and get better rates while still being minimax optimal in the worst case.

4,372

Aaron Roth

Aaron Roth @Aaroth

May 12

The paper is here: arxiv.org/abs/2605.09273 --- this is joint work with Zhiming Huang, Jamie Morgenstern, and Claire Jie Zhang.

1,701

Aaron Roth

Aaron Roth @Aaroth

May 13

I just learned about this closely related concurrent paper by Liu, Luo, and Ratliff that went up on arxiv yesterday: arxiv.org/abs/2605.11490 --- it also looks very interesting, check it out!

Adaptive Calibration in Non-Stationary Environments

Making calibrated online predictions is a central challenge in modern AI systems. Much of the existing literature focuses on fully adversarial environments where outcomes may be arbitrary, leading...

arxiv.org

412

Aaron Roth

Aaron Roth @Aaroth

May 5

I'm giving this talk at the MIT CS theory seminar tomorrow. Stop by if you are around!

Aaron Roth @Aaroth

Apr 16

I've recently been getting invitations to talk about how to use AI tools to assist with TCS research. Its something I've been doing a lot, but don't have structured thoughts about how to explain process. But I'm going to try -- first such talk is tomorrow: cics.umass.edu/events/resear…

8,311

Aaron Roth

Aaron Roth @Aaroth

Apr 27

We updated our paper --- and solved the open problem highlighted in the old version. Now our lower bound construction has only polylog(1/eps) many groups instead of poly(1/eps) many groups. The construction is also simplified.

Aaron Roth @Aaroth

Jan 9

Excited about a new paper! Multicalibration turns out to be strictly harder than marginal calibration. We prove tight Omega(T^{2/3}) lower bounds for online multicalibration, separating it from online marginal calibration for which better rates were recently discovered.

7,155

Aaron Roth

Aaron Roth @Aaroth

Apr 24

How many samples do you need from an unknown distribution in order to train a model with multicalibration error at most epsilon? Answer: 1/epsilon^3 samples is both necessary and sufficient.

6,057

more replies

Aaron Roth

Aaron Roth @Aaroth

Apr 24

Some interesting things: - Multicalibration requires substantially more samples than marginal calibration. - Unlike marginal calibration, multicalibration is just as hard to obtain in the batch setting as the online setting.

984

Aaron Roth

Aaron Roth @Aaroth

Apr 24

--There is a phase change. If the group family |G| is of constant size, Theta(1/eps^2) samples are necessary and sufficient. But when |G| > polylog(1/eps), Omega(1/eps^3) samples are necessary and remain sufficient for any |G| = poly(1/eps). - The upper bounds are randomized.

429

The Warren Center for Network & Data Sciences

Aaron Roth retweeted

The Warren Center for Network & Data Sciences @WarrenCntrPenn

Apr 21

April is #AIMonthAtPenn! On 4/24, @WarrenCntrPenn faculty affiliate @Aaroth will give the George H. Heilmeier Faculty Award Lecture in Amy Gutmann Hall. More information and registration here: ai.upenn.edu/heilmeier-award…

819

Foundations of Responsible Computing

Aaron Roth retweeted

Foundations of Responsible Computing @FORCConf

Apr 20

FORC 2026 has an excellent set of accepted papers, topics ranging from privacy, fairness, and calibration, to mechanism design, reasoning, and watermarking. Check 'em out at the conference on June 3 - 5 at Harvard. Registration is open (and free!). Travel support deadline: 4/24

2,705

Aaron Roth

Aaron Roth @Aaroth

Apr 16

14,736

Aaron Roth

Aaron Roth @Aaroth

Apr 15

Every theoretical computer science researcher

The Kobeissi Letter

@KobeissiLetter

Apr 15

BREAKING: Allbirds stock, $BIRD, surges over 200% after announcing they are pivoting from shoes to AI.

12,007