President @AnthropicAI. Formerly @OpenAI, @Stripe, congressional staffer, global development

Joined September 2011
Daniela Amodei retweeted
11 Jul 2023
Introducing Claude 2! Our latest model has improved performance in coding, math and reasoning. It can produce longer responses, and is available in a new public-facing beta website at claude.ai in the US and UK.
Daniela Amodei retweeted
14 Sep 2022
Neural networks often pack many unrelated concepts into a single neuron – a puzzling phenomenon known as 'polysemanticity' which makes interpretability much more challenging. In our latest work, we build toy models where the origins of polysemanticity can be fully understood.
Daniela Amodei retweeted
13 Jul 2022
In "Language Models (Mostly) Know What They Know", we show that language models can evaluate whether what they say is true, and predict ahead of time whether they'll be able to answer questions correctly. arxiv.org/abs/2207.05221
Daniela Amodei retweeted
27 Jun 2022
Transformer MLP neurons are challenging to understand. We find that using a different activation function (Softmax Linear Units or SoLU) increases the fraction of neurons that appear to respond to understandable features without any performance penalty. transformer-circuits.pub/202…
Daniela Amodei retweeted
24 May 2022
In a new paper, we show that repeating only a small fraction of the data used to train a language model (albeit many times) can damage performance significantly, and we observe a "double descent" phenomenon associated with this. arxiv.org/abs/2205.10487
Excited to announce our latest fundraising round! We’re genuinely honored to be entrusted with the resources to continue our work in frontier AI safety and research.
29 Apr 2022
We’ve raised $580 million in a Series B. This will help us further develop our research to build usable, reliable AI systems. Find out more: anthropic.com/news/announcem…
Daniela Amodei retweeted
13 Apr 2022
We've trained a natural language assistant to be more helpful and harmless by using reinforcement learning with human feedback (RLHF). arxiv.org/abs/2204.05862
Daniela Amodei retweeted
9 Mar 2022
On the @FLIxrisk podcast, we discuss AI research, AI safety, and what it was like starting Anthropic during COVID. futureoflife.org/2022/03/04/…

Daniela Amodei retweeted
8 Mar 2022
In our second interpretability paper, we revisit “induction heads”. In 2 layer transformers these pattern-completion heads form exactly when in-context learning abruptly improves. Are they responsible for most in-context learning in large transformers? transformer-circuits.pub/202…

Daniela Amodei retweeted
17 Feb 2022
Our first societal impacts paper explores the technical traits of large generative models and the motivations and challenges people face in building and deploying them: arxiv.org/abs/2202.07785
Daniela Amodei retweeted
22 Dec 2021
Our first interpretability paper explores a mathematical framework for trying to reverse engineer transformer language models: A Mathematical Framework for Transformer Circuits: transformer-circuits.pub/202…

Excited to announce what we’ve been working on this year - @AnthropicAI, an AI safety and research company. If you’d like to help us combine safety research with scaling ML models while thinking about societal impacts, check out our careers page anthropic.com/#careers
We’re going to be focused on pushing forward our research for the next few months and are hoping to have more to share later this year. Thrilled to be working with so many talented colleagues!
Daniela Amodei retweeted
28 May 2021
Hello world! You can read our launch announcement here: anthropic.com/news/announcem…
