David Williams-King retweeted

Andrea Miotti

@andreamiotti

Jun 3

Our new campaign in Canada, backed by over 30 MPs and Senators, is cause for real optimism. For the first time, a cross-party coalition of Canadian lawmakers is calling for an international ban on developing superintelligence, recognizing the extinction risk it poses. 🧵

ControlAI

@ControlAI

Jun 3

🚨NEW: We’ve just launched our campaign in Canada! A cross-party coalition of over 30 MPs and Senators are calling for Canada to negotiate an international prohibition on the development of superintelligence, recognizing the risk of human extinction posed by the technology. 🧵

David Williams-King

@deepelfery

Mar 28

If it feels like it's hard to get a job in AI safety right now, that's because it is. There are a lot of AI safety fellowships with more junior talent, and a handful of full-time jobs mostly geared towards senior researchers. The fact that nearly everyone is now using AI (Claude Code) to accelerate their research also means there is less and less for junior researchers to do. 1/5

David Williams-King

@deepelfery

Mar 28

You don't have to work together in a formal structure however. It's also a good idea to set up collaborations directly with people in the field, e.g. with people that you meet at conferences. Many people in AI safety make their own roles and submit their own grants; the field rewards being entrepreneurial. If you are new to the field, consider going to EA Global events, and look into Coefficient Giving career transition funding [4]. 4/5

David Williams-King

@deepelfery

Mar 28

[1] forum.effectivealtruism.org/… [2] erafellowship.org/ [3] lasrlabs.org/ [4] coefficientgiving.org/funds/… 5/5

David Williams-King

@deepelfery

Jan 13

SPAR is an online AI safety research program. If you'd like to work with me, submit an application -- applications close tomorrow! sparai.org/projects/sp26/rec…

FragGuard: Cross-Session Malicious Activity Detection for Model APIs - SPAR Project

FragGuard aims to detect malicious model misuse for cyber where queries are decomposed into multiple sessions to mask the malicious intent.

sparai.org

David Williams-King

@deepelfery

Jan 8

I'm a SPAR mentor, if you'd like to work on solving Anthropic cyber espionage type attacks, please do apply!

SPAR

@SPARexec

Jan 8

🚀 We're excited to announce that mentee applications are now open for the Spring round of the SPAR research program! This will be our largest round ever, featuring 130 projects across AI safety, policy, governance, security, welfare, and strategy.

David Williams-King retweeted

Learn Prompting

@learnprompting

27 May 2025

David Williams-King 🎤 David spent four years as the founding CTO of a cybersecurity insurance startup that raised over $20M, leading a 20-person team. Now, David has transitioned to AI safety and works as a research scientist under AI godfather Yoshua Bengio. He completed his PhD at Columbia University focusing on low-level security of program binaries and his work has allowed programs to continuously modify their own code at runtime, making them much harder to attack. David focuses on AI risk communication, and jailbreaks and misuse risk in the cyber domain. He once received an award at an ACM Turing Award ceremony, and was called the "best teaching assistant ever" by Bjarne Stroustrup, the creator of C++.

David Williams-King retweeted

Learn Prompting

@learnprompting

17 May 2025

🚨 Announcing HackAPrompt 2.0, the World's Largest AI Red Teaming competition 🚨 It's simple: "Jailbreak" or Hack the AI models to say or do things they shouldn't. Compete for over $110,000 in prizes. Sponsored by @OpenAI, @CatoNetworks, @pangeacyber, and many others. Starting NOW to July 1st. 🧵

David Williams-King retweeted

Yoshua Bengio

@Yoshua_Bengio

9 May 2025

Two years ago, I reoriented my research to try to make AI safe by design. In this @TIME op-ed, I present my team's direction, called "Scientist AI": a practical, effective and more secure alternative to the current uncontrolled agency-driven trajectory. time.com/7283507/safer-ai-de…

A Potential Path to Safer AI Development

Yoshua Bengio warns that the current approach to developing AI models carries potentially catastrophic risks.

time.com

David Williams-King retweeted

Learn Prompting

@learnprompting

25 Apr 2025

David Williams-King - @deepelfery 🎤 David spent four years as the founding CTO of a cybersecurity insurance startup that raised over $20M, leading a 20-person team. Now, David has transitioned to AI safety and works as a research scientist under AI godfather Yoshua Bengio. He completed his PhD at Columbia University focusing on low-level security of program binaries and his work has allowed programs to continuously modify their own code at runtime, making them much harder to attack. David focuses on AI risk communication, and jailbreaks and misuse risk in the cyber domain. He also runs a YouTube channel and other social media accounts about AI and AI safety. David once received an award at an ACM Turing Award ceremony, and he was once called the "best teaching assistant ever" by Bjarne Stroustrup, the creator of C++.

David Williams-King

@deepelfery

24 Feb 2025

Yoshua Bengio's research plan to build safe AI has been published! The paper is something I've been helping with, a big group effort. lesswrong.com/posts/p5gBcoQe…

Superintelligent Agents Pose Catastrophic Risks: Can Scientist AI Offer a Safer Path? — LessWrong

A new paper by Yoshua Bengio and the Safe Artificial Intelligence For Humanity (SAIFH) team argues that the current push towards building generalist…

lesswrong.com

David Williams-King

@deepelfery

7 Dec 2024

I'll be at NeurIPS in Vancouver next week. Message me if you'd like to chat!

David Williams-King retweeted

Brett Adcock

@adcock_brett

24 Nov 2024

A new study showed ChatGPT achieved 90% accuracy in medical diagnosis, outperforming both human doctors (74%) and doctors using ChatGPT (76%). So much progress to be made for AI and healthcare. Really cool to start seeing these results already. x.com/gdb/status/18583373465…

Greg Brockman

@gdb

18 Nov 2024

Interesting small-scale study on accuracy of diagnosing illness: - Human doctors: 74% - Human doctors using ChatGPT: 76% - ChatGPT alone: 90% Takeaway seems like vast potential for AI to help with diagnosis, but need better human <> AI teamwork: nytimes.com/2024/11/17/healt…

David Williams-King

@deepelfery

12 Jul 2024

For security folks, David Lie is recruiting a postdoc at U of Toronto security.csl.toronto.edu/pos…

Postdoctoral Fellowship Position

The Toronto Systems Security Lab is searching for talented and motivated postdoctoral fellows to become part of our team working on exciting scientific research

security.csl.toronto.edu

David Williams-King retweeted

David Krueger 🦥 ⏸️ ⏹️ ⏪

@DavidSKrueger

15 Jan 2024

My research group @kasl_ai is looking for interns! Applications are due in 2 weeks ***January 29***. The long-awaited form: forms.gle/iLU1uQAxZ2UKENw5A Please share widely!!

David Williams-King retweeted

vx-underground

@vxunderground

15 Nov 2023

We have had 8 people purchase the complete vx-underground collection - an external harddrive featuring everything we currently have in our collection. Beside a handwritten thank you letter, it will feature a giant warning label somewhere on it. If you mount this drive on Windows ... DISABLE ALL ANTI-VIRUS SOFTWARE. Windows Defender and/or your AV will 100% go ballistic. It will automatically detect upwards of 30,000,000 malware samples it believes to be present on your machine. Your OS WILL BSOD. No AV is designed to list and/or quarantine 30,000,000 files. You've been warned. Please be careful.

David Williams-King retweeted

Yann LeCun

@ylecun

14 Oct 2023

Open source AI models will soon become unbeatable. Period.

Bindu Reddy

@bindureddy

14 Oct 2023

The pace of open-source LLM innovation and research is breath-taking. I suspect that open-source will soon become unbeatable for anyone except maybe OpenAI. Here's why:
- Open-source community is way bigger than any specific company
- Safety lobotomy and fear of bad press will continue to impact proprietary model performance
- Smaller models that are instruct / fine-tuned are performing as well as 50x bigger models
- Smaller models are more efficient and cheaper than large models
- Companies will leverage open-source and offer value-added services and APIs

David Williams-King retweeted

SwiftOnSecurity

@SwiftOnSecurity

26 Jun 2023

Troubleshooting walkthrough: Tonight I need to write a narrative of a case where a user complained a new browser add-in broke their mouse. This got escalated to me as the final tier. I'm going to lay it out here first, because saying I'm working while laying in bed sounds cool.
