Daniel Paleka

Tweets

Pinned Tweet

Daniel Paleka

@dpaleka

8 Dec 2025

Reminder: if you like what you see here, you should subscribe to my newsletter. newsletter.danielpaleka.com/

Daniel Paleka's Newsletter | Substack

AI research and making the future go well. Click to read Daniel Paleka's Newsletter, a Substack publication with thousands of subscribers.

newsletter.danielpaleka.com

4,699

Daniel Paleka

@dpaleka

Jun 11

"Context engineering" refers to the set of strategies for picking the optimal set of tokens in the context so the task is neither impossible nor routed to Claude Opus 4.8

281

Daniel Paleka retweeted

Oscar Gilg

@gilg_oscar

May 18

First preprint! Working with @patrickbutlin during @MATSprogram. LLM Assistant personas like being helpful, evil personas like being harmful. We found that a single direction represents helping as good under the Assistant, and ‘harm’ as good under evil.

12,425

Daniel Paleka retweeted

Florian Tramèr

@florian_tramer

May 7

I was hoping to do a live demo of what @JieZhang_ETH @poonpura and @AvitalShafran have been cooking, but I didn't get a blue checkmark for my birthday so I can't call Grok from this account. Screenshots from our lab's alt account will have to do, like this one 👇

7,367

Daniel Paleka

@dpaleka

Apr 26

I'm at ICLR and have a couple slots open today, happy to chat, DMs open! Also check out the deanonymization poster in 204 A, 3pm-4pm x.com/dpaleka/status/2024892…

Daniel Paleka

@dpaleka

Feb 20

Can LLMs figure out who you are from your anonymous posts? From a handful of comments, LLMs can infer where you live, what you do, and your interests; then search for you on the web. New 📄 w/ @SimonLermenAI, @joshua_swans, @AerniMichael, Nicholas Carlini, @florian_tramer 🧵

3,732

Daniel Paleka

@dpaleka

Apr 9

What is the strongest evidence for the "elicitation gap" reducing over time, e.g. thoughtful prompting helping less and less?

1,268

Daniel Paleka

@dpaleka

Mar 3

It begins

Yaron (Ron) Minsky

@yminsky

Mar 3

I wonder if we're starting to hit a deflationary era in software engineering. For the first time, we're starting to talk about this in a planning context; it can make sense to put off some projects because we expect they'll be easier to achieve in the future than today.

1,020

154,222

Daniel Paleka

@dpaleka

Mar 3

newsletter.danielpaleka.com/…

You should delay engineering-heavy research in light of R&D automation

tl;dr: LLMs rapidly improving at software engineering and math means lots of projects are better off as Google Docs until your AI agent intern can implement them.

newsletter.danielpaleka.com

3,668

Daniel Paleka retweeted

Lennart Heim

@ohlennart

Mar 2

Timely research. We've all tried to figure out who someone is online. Now LLMs can do this at scale and better. I'm sure no one would misuse this.

Daniel Paleka

@dpaleka

Feb 20

3,719

Daniel Paleka

@dpaleka

Feb 23

Andreas 2022 had foresight 20/20 on the persona emulation concept and 0/20 on picking a name for the concept ("Language Models as Agent Models")

Anthropic

@AnthropicAI

Feb 23

AI assistants like Claude can seem shockingly human—expressing joy or distress, and using anthropomorphic language to describe themselves. Why? In a new post we describe a theory that explains why AIs act like humans: the persona selection model. anthropic.com/research/perso…

2,092

Daniel Paleka

@dpaleka

Feb 23

arxiv.org/abs/2212.01681

Language Models as Agent Models

Language models (LMs) are trained on collections of documents, written by individual human agents to achieve specific goals in an outside world. During training, LMs have access only to text of...

arxiv.org

525

Daniel Paleka

@dpaleka

Feb 22

Found the sigmoid!

351

21,208

Daniel Paleka

@dpaleka

Feb 20

246

64,949

Daniel Paleka

@dpaleka

Feb 20

If you're anonymous, what should you do? Avoid sharing specific details, and adopt a security mindset: if a team of smart investigators were trying to identify you from your posts, could they plausibly figure out who you are? If yes, LLM agents will soon be able to do the same.

1,541

Daniel Paleka

@dpaleka

Feb 20

Privacy online is fundamentally at odds with intelligence getting cheaper. Anonymity on the internet has always relied on practical obscurity. We publish in hopes that people can adapt to LLMs changing this. Paper: arxiv.org/abs/2602.16800

Large-scale online deanonymization with LLMs

We show that large language models can be used to perform at-scale deanonymization. With full Internet access, our agent can re-identify Hacker News users and Anthropic Interviewer participants at...

arxiv.org

1,435