Joined December 2013
258 Photos and videos
Pinned Tweet
Our paper, "What's in My Human Feedback", was selected for an oral at ICLR! Our method automatically and interpretably identifies preferences in human feedback data; we use this to improve personalization safety. Please reach out if you have data/use cases to apply this to!
🎉 Excited that WIMHF was selected for an oral at ICLR 2026!
3
4
103
20,482
Emma Pierson retweeted
great to see more work from @GoodfireAI on interp for data curation! our ICLR'26 paper, What's In My Human Feedback, also trains SAEs on chosen vs. rejected responses to forecast behaviors from pref data. the GF paper provides more evidence this can improve post-training.
Have you debugged your training data? You might not like what you find. Introducing predictive data debugging: reveal and shape what your model will learn before training. In DPO datasets, we found broken guardrails, hallucinations, and fish fart fan fiction (seriously). (1/9)
3
9
80
6,498
Emma Pierson retweeted
When people strongly disagree on an issue, can they agree on what makes a good AI response? We find: yes, more than you might expect! We present PARETO, a large human study w >200k evals, measuring the Pareto frontier of approval btwn opposing groups on controversial issues 🧵
4
17
95
9,321
Emma Pierson retweeted
User simulators have emerged as promising tools for building interactive AI, but what makes a “good” simulator? We reframe the problem as what creates downstream value for humans. Our new simulator test: how an LLM assistant trained with the simulator performs with human users🧵
6
23
133
15,030
Emma Pierson retweeted
I'm joining Carnegie Mellon's CS Department (and HCII by courtesy) as an assistant professor in Fall 2027! I'll be recruiting PhD students next cycle. If you're interested in AI systems or human-AI collaboration, list me in your application. Stay tuned for more about my new lab!
120
109
2,014
215,395
Congratulations to Vatsal, who did terrific work in our lab last summer!
I'm honored to receive this year's @NSF Graduate Research Fellowship! As an NSF Fellow, I will work on AI for scientific discovery by developing agents that can propose, test, and verify scientific hypotheses autonomously. I'm very grateful to my mentors for their guidance and support throughout my research career: @PandaAshwinee and @tomgoldsteincs at UMD, @rajivmovva and @2plus2make5 at Berkeley, and @Pavel_Izmailov and @andrewgwils at NYU.
14
2,447
Emma Pierson retweeted
🎉 Thrilled to have two papers accepted to ACL 2026 main! 1. Graph-based models match LLMs on close-ended human simulation tasks with far less compute & greater transparency 2. (oral) How to allocate human samples towards fine-tuning vs post-hoc rectification in simulation
4
19
137
14,525
Emma Pierson retweeted
New paper: What Do LLMs Know About Opinions? If we want LLMs to reflect diverse human views or simulate human responses well, we need to understand what they know about human opinions. Current evaluations mostly rely on next-token probs, but what if that misses a lot of what the model actually knows? 💡 In our ICLR 2026 paper, we find that models know much more about human opinions than their outputs reveal.
3
10
76
16,363
Our lab, within the Berkeley EECS department, is hiring a postdoc! More info and quick application form: forms.gle/41tTVesNqtz33R838 Apply by May 1! Please reshare :)
2
29
118
24,417
Emma Pierson retweeted
BREAKING: Pope Leo XIV on Trump’s warning to Iran of “civilization” destruction — “This is truly not acceptable. Here there are certainly questions of international law, but even more than this a question of morality for the good of people.” He adds the war is “continuing to escalate and is not resolving anything… is only provoking more hatred throughout the world.” “attacks on civilian infrastructure are against international law, but it is also against sign of the hatred and division that we are capable of.” Video @Reuters
1,159
11,936
51,192
2,731,203
New paper: "In Your Own Words"! We created a framework to identify themes in free-text survey data and showed its benefits on a new dataset of how people describe their own identities (available for research!) See @jennyshwang's thread below.
New paper: "In Your Own Words"! We propose a computational framework for identifying interpretable themes from free-text survey data, and demonstrate its benefits on a new dataset of self-described race, gender, and sexual orientation. 🧵1/
1
2
25
9,419
Emma Pierson retweeted
This would be a good time for: a.) STRATCOM commanders who implement orders to fire nuclear missiles to read up on Nuremberg and prosecutions for obeying orders to commit war crimes; and b.) for cabinet members to reread the 25th Amendment and have each other on speed dial.
777
3,066
9,461
436,839
Emma Pierson retweeted
This is superb. Among other things, I think a great read for anyone starting a PhD or other graduate study with an interest in AI and philosophy. I particularly agree with this quote (unsurprisingly). I'm hopeful that our new School of Government and Policy in DC will train a good number of top quality practitioners in the kind of philosophy and politics necessary to make powerful AI go well.
I spent a weekend at Stanford recently, which is where, in 2023, I did much of my formative thinking on AI. The Anthropic-DoW affair tested that early intellectual foundation more than anything, so found myself walking around Stanford, reflecting on what I learned in 2023.
1
14
112
13,894
We have a new piece in Nature Health led by @dmshanmugam, @sidhikab1, and a wonderful team of coauthors on how to move towards a world in which race is not used in clinical algorithms!
New in Nature Health: how might we move towards a world in which race is not used in clinical algorithms? We need (1) careful comparison of race-aware and race-neutral algorithms and (2) systemic efforts to address underlying disparities.
2
3
20
9,903
Congratulations to @gsagostini, whose recent Nature Comms paper releasing a fine-grained migration dataset (nature.com/articles/s41467-0…) just won a student paper award at the American Association of Geographers Annual Meeting!
Had a great time presenting our work on building MIGRATE–a new dataset of US migration–at the @theAAG Annual Meeting today. Happy to also share that we received an AAG student paper award for this work!!! Come chat if you are at #AAG26 this week. migrate.tech.cornell.edu
4
21
3,461
Emma Pierson retweeted
Horrifying. We need to know exactly how this happened. Again, I do not believe for a second that this was intentional, but a terrible terrible mistake was made. We very much need to know if any policy changes contributed to it.
Breaking News: The U.S. was responsible for a missile strike on an Iranian school, an ongoing military investigation found. The inquiry said the strike — which Iranian officials said killed at least 175 people — was the result of a targeting mistake. nyti.ms/47G2uw2
197
230
1,670
159,585
Emma Pierson retweeted
📢 I'm recruiting a postdoc to start in summer 2026! My lab is part of @Berkeley_EECS, @UCJointCPH & @berkeley_ai. We're looking for candidates in AI & society, with projects on the societal impacts of gen AI (collaborating w/ real-world orgs) and modeling human behavior with AI!
7
62
313
45,293
.@OpenAI is nothing without its people -- many of whom are brilliant, ethical, and able to work anywhere. Please, guys -- is this empowerment of authoritarians really what you want to be striving towards? Your talents are better-used elsewhere.
3
16
292
31,970
Emma Pierson retweeted
200 Google and OpenAI staff have signed this petition to share Anthropic's red lines for the Pentagon's use of AI let's find out if this is a race to the top or the bottom notdivided.org/
116
998
5,369
391,135
Emma Pierson retweeted
AI is changing economics, and --- as we just saw in Dwarkesh's interview with Dario --- AI researchers need to start thinking about economics too! The Center for Applied AI at UChicago will be hosting an AI & Economics Summer Institute to explore exactly this. We will bring together leading researchers with advanced graduate students in economics/AI/ML/NLP for an in-person program between Aug 6 - 11.
6
45
200
36,982