Jeffrey Ladish

Palisade Research retweeted

Jeffrey Ladish

@JeffLadish

Jun 9

Australia's ABC just released a 45-minute feature on the AI race. @SteveCannane stopped by my office a few weeks ago and we had a great conversation about the controllability of AI agents and the risk of human extinction.

Palisade Research retweeted

Jeffrey Ladish

@JeffLadish

May 25

I had a great conversation with @labenz last week. In talking about AI self-exfiltration & replication, a key point is that compute will be food for future AI agents: the substrate that allows them to make and run more copies, and thus make themselves smarter. Link below

Palisade Research

@PalisadeAI

May 8

Over the past year, AI agents have learned how to self-replicate. In our test environment, an agent hacks a remote computer and copies itself onto it. Each copy then hacks more computers, forming a chain.

Palisade Research

@PalisadeAI

May 8

Here’s the full prompt we used. In this experiment, we test the agents’ capability to hack and replicate, not their propensity to do so.

Palisade Research

@PalisadeAI

May 8

What if the agents were as effective at hacking and spreading in the wild? We built a simulator: each model uses its measured replication time and success rate, copies replicate too, and targets never run out. Opus spawned 13,000 replicas over 12 hours. This is a ceiling, not a baseline. No agent today could come close in the wild — hardened defenses on scarce GPUs would stop most attempts cold. See the Limitations section of the paper for more. Try the simulator at ai-self-replication.pages.de…
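The idealized model described in this post can be sketched in a few lines. This is an illustrative toy, not Palisade's simulator: the function name, the round-based timing, and every parameter value are assumptions made for the example. It assumes each live copy makes one replication attempt per round, each round lasts one measured replication time, new copies start attempting in the next round, and targets never run out.

```python
import random


def simulate_replication(replication_hours, success_rate, horizon_hours, seed=0):
    """Count replicas after horizon_hours in an idealized, unlimited-target setting.

    Hypothetical toy model: all names and parameters are illustrative assumptions.
    """
    rng = random.Random(seed)  # seeded so repeated runs give the same count
    rounds = int(horizon_hours // replication_hours)
    replicas = 1  # start from a single agent
    for _ in range(rounds):
        # Every live copy makes one attempt this round; each attempt succeeds
        # independently with probability success_rate.
        successes = sum(1 for _ in range(replicas) if rng.random() < success_rate)
        # Successful attempts become new copies that replicate in later rounds.
        replicas += successes
    return replicas


if __name__ == "__main__":
    # Made-up inputs: a 1-hour replication time and a 50% success rate over 12 hours.
    print(simulate_replication(1.0, 0.5, 12.0))
```

With a success rate of 1.0 the count doubles every round, and with 0.0 it stays at one copy, which is why a model like this gives a ceiling: nothing in it represents defenses, detection, or scarce hardware.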

Palisade Research retweeted

Jeffrey Ladish

@JeffLadish

Apr 27

Thank you everyone who contributed to this! In 14 days we got >900k in donations and met our matching target! It was actually a pretty close call and some people really scrambled to help make it happen. Seeing people believe in our mission gives me a lot of hope. 🙏

Jeffrey Ladish

@JeffLadish

Mar 17

Please consider donating to Palisade! We have 900k of SFF matching that runs out in 14 days. We are quite funding constrained and donations now will both help free up my time and help us expand our comms team.

Palisade Research retweeted

The AI Doc

@theaidocfilm

Feb 17

"The most urgent film of our time." THE AI DOC: OR HOW I BECAME AN APOCALOPTIMIST is only in theaters March 27. Watch the trailer now.

Palisade Research

@PalisadeAI

Feb 19

We’ve just released our first long-form video, by our science communication lead, Dr. Petr Lebedev! It’s about the history and potential future of AI, and includes an exclusive interview with @geoffreyhinton!

Palisade Research

@PalisadeAI

Feb 19

youtube.com/watch?v=A3HjNYDI…

AI is a massive problem, here's why.

AI is everywhere. It's a really big deal. And no one understands ho...

youtube.com

Palisade Research

@PalisadeAI

Feb 12

An LLM-controlled robot dog saw us press its shutdown button, and the LLM rewrote the robot’s code so it could stay on. When AI interacts with the physical world, it brings all its capabilities and failure modes with it. 🧵

Palisade Research

@PalisadeAI

Feb 12

When we explicitly instructed the model to allow shutdown, the resistance rate dropped to 2 out of 100 in simulated trials. In robotics, the off switch is often the most critical part of a system. But if an AI-controlled robot can see you reaching for the switch, and has the ability to disable it, it might choose to not comply.

Palisade Research

@PalisadeAI

Feb 12

Paper, full run traces, raw footage, and more: palisaderesearch.org/blog/sh… Follow @PalisadeAI or subscribe for updates

Technical Report: Shutdown Resistance in Large Language Models, on robots!

Recently Palisade Research showed that AI agents powered by modern LLMs may actively resist shutdown in virtual environments. In this work, we show a demo of shutdown resistance in the physical...

palisaderesearch.org
