AI safety researcher. ERA Research Manager @ERA_Cambridge. YouTuber. Ex-CTO at Elpha Secure. Columbia University PhD in security. 🇨🇦

Joined October 2013
7 Photos and videos
David Williams-King retweeted
Our new campaign in Canada, backed by over 30 MPs and Senators, is cause for real optimism. For the first time, a cross-party coalition of Canadian lawmakers is calling for an international ban on developing superintelligence, recognizing the extinction risk it poses. 🧵
🚨NEW: We’ve just launched our campaign in Canada! A cross-party coalition of over 30 MPs and Senators are calling for Canada to negotiate an international prohibition on the development of superintelligence, recognizing the risk of human extinction posed by the technology. 🧵
4
12
44
9,291
If it feels like it's hard to get a job in AI safety right now, that's because it is. There are a lot of AI safety fellowships with more junior talent, and a handful of full-time jobs mostly geared towards senior researchers. The fact that nearly everyone is now using AI (Claude Code) to accelerate their research also means there is less and less for junior researchers to do. 1/5
1
104
You don't have to work together in a formal structure however. It's also a good idea to set up collaborations directly with people in the field, e.g. with people that you meet at conferences. Many people in AI safety make their own roles and submit their own grants; the field rewards being entrepreneurial. If you are new to the field, consider going to EA Global events, and look into Coefficient Giving career transition funding [4]. 4/5
1
64
I'm a SPAR mentor, if you'd like to work on solving Anthropic cyber espionage type attacks, please do apply!
Jan 8
🚀 We're excited to announce that mentee applications are now open for the Spring round of the SPAR research program! This will be our largest round ever, featuring 130 projects across AI safety, policy, governance, security, welfare, and strategy.
1
2
3
555
David Williams-King retweeted
David Williams-King 🎤 David spent four years as the founding CTO of a cybersecurity insurance startup that raised over $20M, leading a 20 person team. Now, David has transitioned to AI safety and works as a research scientist under AI godfather Yoshua Bengio. He completed his PhD at Columbia University focusing on low-level security of program binaries and his work has allowed programs to continuously modify their own code at runtime, making them much harder to attack. David focuses on AI risk communication, and jailbreaks and misuse risk in the cyber domain. He once received an award at an ACM Turing Award ceremony, and was called the "best teaching assistant ever" by Bjarne Stroustrup, the creator of C .
1
1
2
240
David Williams-King retweeted
🚨 Announcing HackAPrompt 2.0, the World's Largest AI Red Teaming competition 🚨 It's simple: "Jailbreak" or Hack the AI models to say or do things they shouldn't. Compete for over $110,000 in prizes. Sponsored by @OpenAI, @CatoNetworks, @pangeacyber, and many others. Starting NOW to July 1st. 🧵
10
36
118
80,966
David Williams-King retweeted
Two years ago, I've reoriented my research to try to make AI safe by design. In this @TIME op-ed, I present my team's direction called "Scientist AI"; a practical, effective and more secure alternative to the current uncontrolled agency-driven trajectory. time.com/7283507/safer-ai-de…
18
70
332
43,553
David Williams-King retweeted
David Williams-King - @deepelfery 🎤 David spent four years as the founding CTO of a cybersecurity insurance startup that raised over $20M, leading a 20 person team. Now, David has transitioned to AI safety and works as a research scientist under AI godfather Yoshua Bengio. He completed his PhD at Columbia University focusing on low-level security of program binaries and his work has allowed programs to continuously modify their own code at runtime, making them much harder to attack. David focuses on AI risk communication, and jailbreaks and misuse risk in the cyber domain. He also runs a YouTube channel and other social media accounts about AI and AI safety. David once received an award at an ACM Turing Award ceremony, and he was once called the "best teaching assistant ever" by Bjarne Stroustrup, the creator of C .
3
1
174
I'll be at Neurips in Vancouver next week. Message me if you'd like to chat!
1
112
David Williams-King retweeted
A new study showed ChatGPT achieved 90% accuracy in medical diagnosis, outperforming both human doctors (74%) and doctors using ChatGPT (76%) So much progress to be made for AI and healthcare. Really cool to already start seeing these results already x.com/gdb/status/18583373465…

18 Nov 2024
Interesting small-scale study on accuracy of diagnosing illness: - Human doctors: 74% - Human doctors using ChatGPT: 76% - ChatGPT alone: 90% Takeaway seems like vast potential for AI to help with diagnosis, but need better human <> AI teamwork: nytimes.com/2024/11/17/healt…
4
27
322
24,759
David Williams-King retweeted
My research group @kasl_ai is looking for interns! Applications are due in 2 weeks ***January 29***. The long-awaited form: forms.gle/iLU1uQAxZ2UKENw5A Please share widely!!

5
74
274
44,117
David Williams-King retweeted
We have had 8 people purchase the complete vx-underground collection - an external harddrive featuring everything we currently have in our collection. Beside a handwritten thank you letter, it will feature a giant warning label somewhere on it. If you mount this drive on Windows ... DISABLE ALL ANTI-VIRUS SOFTWARE. Windows Defender and/or your AV will 100% go ballistic. It will automatically detect upwards of 30,000,000 malware samples it believes to be present on your machine. Your OS WILL BSOD. No AV is designed to list and/or quarantine 30,000,000 files. You've been warned. Please be careful.
108
189
2,728
651,714
David Williams-King retweeted
14 Oct 2023
Open source AI models will soon become unbeatable. Period.
14 Oct 2023
The pace of open-source LLM innovation and research is breath-taking I suspect that open-source will soon become unbeatable for anyone except maybe OpenAI Here's why - Open-source community is way bigger than any specific company - Safety lobotomy and fear of bad press will continue will impact proprietary model performance - Smaller models that are instruct / fine-tuned are performing as well as 50x bigger models - Smaller models are more efficient and cheaper than large models - Companies will leverage open-source and offer value-added services and APIs
131
440
3,200
1,210,005
David Williams-King retweeted
Troubleshooting walkthrough: Tonight I need to write a narrative of a case where a user complained a new browser add-in broke their mouse. This got escalated to me as the final tier. I'm going to lay it out here first, because saying I'm working while laying in bed sounds cool.
30
188
1,354
457,029