f-divergence enthusiast | phd @uwcse | mlr @apple | prev. math stats @stanford | nsf grfp fellow

Joined December 2020
8 Photos and videos
Pinned Tweet
Tomorrow, I'm excited to present "Finite-Time Convergence Rates in Stochastic Stackelberg Games with Smooth Algorithmic Agents", which addresses how a principal can influence the behavior of competitive learning agents! #ICML2025 ๐Ÿ“West Exhibition Hall, W-817, 11:00 - 1:30 ๐Ÿงต๐Ÿ‘‡
1
1
13
2,055
Eric Frankel retweeted
Today we're announcing Treeline and $25M from a16z. Software and AI have crossed a threshold and we're rebuilding IT services around it - great software paired with experienced technicians, designed to be a foundation for growth. We're hiring, reach out!
8
5
32
5,200
Introducing โš“ ๐—”๐—ป๐—ฐ๐—ต๐—ผ๐—ฟ๐—ฒ๐—ฑ ๐——๐—ฒ๐—ฐ๐—ผ๐—ฑ๐—ถ๐—ป๐—ด: a copyright mitigation strategy for any language model! With @uwnlp LMs today reproduce copyrighted textโ€”raising concerns for creator consent and potential legal (and ๐Ÿ’ธ ๐Ÿ’ธ) liabilities for AI developers. ๐Ÿซ  ๐—”๐—ป๐—ฐ๐—ต๐—ผ๐—ฟ๐—ฒ๐—ฑ ๐——๐—ฒ๐—ฐ๐—ผ๐—ฑ๐—ถ๐—ป๐—ด relies on two off-the-shelf LMs: ๐ŸงผA ๐˜€๐—ฎ๐—ณ๐—ฒ ๐—Ÿ๐—  trained only on permissively licensed text, โš ๏ธA higher-utility ๐—ฟ๐—ถ๐˜€๐—ธ๐˜† ๐—Ÿ๐—  trained on any data. The ๐—ฟ๐—ถ๐˜€๐—ธ๐˜† ๐—Ÿ๐—  drives generation, but the ๐˜€๐—ฎ๐—ณ๐—ฒ ๐—Ÿ๐—  acts as an anchor. If the ๐—ฟ๐—ถ๐˜€๐—ธ๐˜† ๐—Ÿ๐—  drifts into memorization, the ๐˜€๐—ฎ๐—ณ๐—ฒ ๐—Ÿ๐—  pulls it back โ†ฉ๏ธ. ๐ŸคWe provide a formal guarantee: outputs stays within a user-set budget of the ๐˜€๐—ฎ๐—ณ๐—ฒ ๐—Ÿ๐— . Details below! ๐Ÿ‘‡ [1/โš“]
4
19
51
9,021
These results apply to a wide range of games, including strongly monotone quadratic games, Cournot and Bertrand competitions, and Kelly auctions. More details can be found in our paper: openreview.net/pdf?id=q6aopfโ€ฆ.

1
1
401
This paper is the months-long product of a great collaboration with Kshitij Kulkarni, Dmitriy Drusvyatskiy, and my advisors Lillian Ratliff and @sewoong79! We hope you enjoy -- there are a number of exciting open questions here to explore!
2
328
Using techniques inspired from performative prediction and stochastic tracking, we create a hierarchy of interaction models that captures a principal's ability to use progressively more gradient information, which in turn determines the type of equilibrium achieved.
2
1
270
A principal using a repeated gradient method that fails to account for decision-dependence converges to an approx. performative Stack. equilibrium, while an expensive zeroth order method yields an approx. Stack, equilibrium. For both, we provide finite-time convergence rates!
2
1
188
Tomorrow, I'm excited to present "Finite-Time Convergence Rates in Stochastic Stackelberg Games with Smooth Algorithmic Agents", which addresses how a principal can influence the behavior of competitive learning agents! #ICML2025 ๐Ÿ“West Exhibition Hall, W-817, 11:00 - 1:30 ๐Ÿงต๐Ÿ‘‡
1
1
13
2,055
Past work has explored these settings, but under several unrealistic assumptions, like i) knowledge of the agents' objectives, and ii) the stationarity of agent behavior. These works also only provide asymptotic convergence guarantees to game equilibria.
1
2
134
Decision-making is often done not only under uncertainty, but also in environments subject to the actions of learning agents in competition with one another (e.g. crowd-sourcing, multi-agent systems). A natural abstraction for shaping these agents' behavior is a Stackelberg game.
2
2
142
A bit of a belated announcement ๐Ÿ˜… but Iโ€™ll be at ICML today presenting S4S, which enables few-NFE diffusion model sampling in <1 hour on 1 A100! ๐Ÿ“East Exhibition Hall, E-3210, 11:00 - 1:30. Looking forward to chatting more about all things diffusion! #ICML2025
Want to quickly sample high-quality images from diffusion models, but canโ€™t afford the time or compute to distill them? Introducing S4S, or Solving for the Solver, which learns the coefficients and discretization steps for a DM solver to improve few-NFE generation. Thread ๐Ÿ‘‡ 1/
2
19
1,835
Eric Frankel retweeted
Web data, the โ€œfossil fuel of AIโ€, is being exhausted. Whatโ€™s next?๐Ÿค” We propose Recycling the Web to break the data wall of pretraining via grounded synthetic data. It is more effective than standard data filtering methods, even with multi-epoch repeats! arxiv.org/abs/2506.04689
14
62
226
36,080
Eric Frankel retweeted
Thrilled to announce that I will be joining @UTAustin @UTCompSci as an assistant professor in fall 2026! I will continue working on language models, data challenges, learning paradigms, & AI for innovation. Looking forward to teaming up with new students & colleagues! ๐Ÿค ๐Ÿค˜
103
54
667
75,935
Eric Frankel retweeted
27 May 2025
๐Ÿคฏ We cracked RLVR with... Random Rewards?! Training Qwen2.5-Math-7B with our Spurious Rewards improved MATH-500 by: - Random rewards: 21% - Incorrect rewards: 25% - (FYI) Ground-truth rewards: 28.8% How could this even workโ‰๏ธ Here's why: ๐Ÿงต Blogpost: tinyurl.com/spurious-rewards
72
343
1,778
700,740
Eric Frankel retweeted
22 May 2025
Excited to share that our paper "Exploring How Generative MLLMs Perceive More Than CLIP with the Same Vision Encoder" is accepted to #ACL2025! Preprint: arxiv.org/pdf/2411.05195 Thank @SimonShaoleiDu and @PangWeiKoh so much for your support and guidance throughout the journey!

2
14
49
9,842
Eric Frankel retweeted
I'm excited to announce that I will join @WisconsinCS as an assistant professor this fall! Time to get to it.
28
7
236
25,130
Eric Frankel retweeted
13 May 2025
LLMs naturally memorize some verbatim of pre-training data. We study whether post-training can be an effective way to mitigate unintentional reproduction of pre-training data. ๐Ÿ› ๏ธ No changes to pre-training or decoding ๐Ÿ”ฅ Training models to latently distinguish between memorized sequences and their paraphrases ๐Ÿ” Effectively reduced verbatim reproduction while still recalling famous quotes when requested
1
32
103
17,337
Eric Frankel retweeted
13 May 2025
Think PII scrubbing ensures privacy? ๐Ÿค”Think againโ€ผ๏ธ In our paper, for the first time on unstructured text, we show that you can re-identify over 70% of private information *after* scrubbing! Itโ€™s time to move beyond surface-level anonymization. #Privacy #NLProc ๐Ÿ”—๐Ÿงต
2
19
52
10,532
Eric Frankel retweeted
Long Range Navigator (LRN) ๐Ÿงญโ€” an approach to extend planning horizons for off-road navigation given no prior maps. Using vision LRN makes longer-range decisions by spotting navigation frontiers far beyond the range of metric maps. personalrobotics.github.io/lโ€ฆ
6
25
92
11,413
Eric Frankel retweeted
18 Apr 2025
Our new paper (first one of my PhD!) on cooperative AI reveals a surprising insight: Environment Diversity > Partner Diversity. Agents trained in self-play across many environments learn cooperative norms that transfer to humans on novel tasks. shorturl.at/fqsNN๐Ÿงต
5
36
140
46,536
Eric Frankel retweeted
๐Ÿ”ญ Science relies on shared artifacts collected for the common good. ๐Ÿ›ฐ So we asked: what's missing in open language modeling? ๐Ÿช DataDecide ๐ŸŒŒ charts the cosmos of pretrainingโ€”across scales and corporaโ€”at a resolution beyond any public suite of models that has come before.
15 Apr 2025
Ever wonder how LLM developers choose their pretraining data? Itโ€™s not guessworkโ€” all AI labs create small-scale models as experiments, but the models and their data are rarely shared. DataDecide opens up the process: 1,050 models, 30k checkpoints, 25 datasets & 10 benchmarks ๐Ÿงต
4
48
93
12,820