Stella Biderman

Stella Biderman

696 Photos and videos

Tweets

Pinned Tweet

Stella Biderman @BlancheMinerva

Jun 10

In film, "we'll fix it in post" is what you say when something went wrong on set and you don't want to redo it. AI research has made it our entire methodology: train the model, then patch whatever comes out. Our new ICML oral argues this can't be the basis of a science of AI. 🧵

341

42,653

Stella Biderman

Stella Biderman @BlancheMinerva

10h

ICML Mech Interp had a lower acceptance rate than ICML this year.

158

18,366

Stella Biderman

Stella Biderman @BlancheMinerva

Sorry for the confusion! I was referring to in-person papers, not all papers. I also fucked up the math in this thread, it’s ~24% in-person Mech Interp and ~26% ICML.

908

Ian Landsman

Stella Biderman retweeted

Ian Landsman

@IanLandsman

18h

My Fable 5 access is back!

135

247

7,650

646,393

Kevin Bankston

Stella Biderman retweeted

Kevin Bankston

@KevinBankston

18h

For anyone looking for a quick primer on the 1st amendment as applied to model weights for oh I dunno no reason, @CenDemTech's comments in the NTIA's open weights proceeding from a couple years ago are a good place to start. See pp. 33-40. cdt.org/wp-content/uploads/2…

Kevin Bankston

@KevinBankston

Jun 13

WOW. This is an intense and—I would argue—unconstitutional escalation by the government, violating Anthropic’s 1st amendment right to offer this information service and individuals’ right to access it in the US. Time to litigate the right to access AI?

3,378

Stella Biderman

Stella Biderman @BlancheMinerva

Jun 10

341

42,653

Stella Biderman

Stella Biderman @BlancheMinerva

Jun 10

Also, appolgies to @learning_mech and the "There Will Be a Scientific Theory of Deep Learning" team for not engaging with the contents of your paper. I believe I learned about it on the same day we got the ICML acceptance notifications.

1,992

Stella Biderman

Stella Biderman @BlancheMinerva

Jun 10

From what I've skimmed I think we're in agreement about a lot of things, but I'm excited to find time to read it closely :)

1,496

Stella Biderman

Stella Biderman @BlancheMinerva

Jun 10

Part of why post hoc analysis dominates: it's the only thing most researchers CAN do. Almost no one releases intermediate checkpoints or training data. we built MultiBERT and Pythia to set a better standard, and it's been great to see work like OLMo and Marin follow our lead.

1,212

Stella Biderman

Stella Biderman @BlancheMinerva

Jun 10

Read the full paper: arxiv.org/abs/2606.06533 or come listen to our oral @icmlconf! Huge thanks to my co-authors @aflah02101 @niloofar_mire @linguist_cat @FazlBarez @nsaphra Stay tuned for a related workshop (hopefully) at NeurIPS too!

Position: Don't Just "Fix it in Post": A Science of AI...

What would it mean to have a scientific understanding of AI? Models are not static objects: they are snapshots of time-evolving processes shaped by data, objectives, architectures, and...

arxiv.org

2,967

EvalEval Coalition

Stella Biderman retweeted

EvalEval Coalition @evaluatingevals

Jun 2

🚨 As AI models improve, many benchmarks are becoming saturated and losing their ability to distinguish between models. 🚨 Check out our new @icmlconf paper: “When AI Benchmarks Plateau: A Systematic Study of Benchmark Saturation”

16,926

Stella Biderman

Stella Biderman @BlancheMinerva

Jun 10

Any reasonable analysis of the past five years makes it abundantly clear that major AI companies are making the world a worse place and open source AI isn’t. So they distract you from the actual harm they’re doing in the world with hypothetical fearmongering about the future.

Florian Brand

@xeophon

Jun 10

New blog. I looked into the actual evidence and what models where used by bad actors to see whether closed models are safer. Turns out: Nope, they are used to hack, misinform and scam. There is one exception, though. Link in replies.

2,113

Stella Biderman

Stella Biderman @BlancheMinerva

Jun 10

Incredible timing

Janet Egan

@janet_e_egan

Jun 10

CAISI has reportedly been directed to stop publishing public model assessments as the new AI EO gets implemented. Natsec engagement on AI is essential. But pulling CAISI's evals from public view doesn't make the field more secure. It just means fewer eyes on the science when we need more. Openness and natsec don't have to be in tension here. We should be doing both.

1,344

Stella Biderman

Stella Biderman @BlancheMinerva

Jun 10

DoD: “Anthropic is a supply chain risk” Anthropic: “You ain’t seen nothing yet.”

elie

@eliebakouch

Jun 9

mythos will be bad ON PURPOSE on ai "frontier llm research" tasks, this is very very sad for the research community also the fact that this is un purpose not visible to the user is crazy

371

49,135

bayes

Stella Biderman retweeted

bayes

@bayeslord

Jun 9

They didn’t mean pause AI research, they meant pause *your* AI research

347

5,037

103,650

Stella Biderman

Stella Biderman @BlancheMinerva

Jun 9

It’s super weird to me that there’s so much discourse about whether Anthropic is “consistent.” Anthropic is choosing to make decisions that make the world a significantly worse and potentially more dangerous place. That’s what you should criticize them for.

303

7,231