preparedness research @openai

Joined July 2017
205 Photos and videos
Pinned Tweet
When OpenAI released ChatGPT, I was among the millions captivated by what we (humanity) achieved. Truly honored to join the mission to accelerate human progress safely @OpenAI Preparedness and stand on the shoulders of giants at a pivotal moment for agentic security.
I am extremely excited to welcome @dylanscandinaro to OpenAI as our Head of Preparedness. Things are about to move quite fast and we will be working with extremely powerful models soon. This will require commensurate safeguards to ensure we can continue to deliver tremendous benefits. Dylan will lead our efforts to prepare for and mitigate these severe risks. He is by far the best candidate I have met, anywhere, for this role. He has his work cut out for him for sure, but I will sleep better tonight. I am looking forward to working with him very closely to make the changes we will need across our entire company.
1
1
32
7,893
Bill Demirkapi retweeted
Jun 8
new shai hulud wave. interestingly it has this inside the payload to trigger safety refusals in potential defensive scans.
Replying to @SocketSecurity
We are now tracking 471 affected artifacts across npm and PyPI in the Mini Shai-Hulud/Miasma/Hades campaign. The newer PyPI artifacts from this wave have been added to the dedicated campaign tracker. Full breakdown: socket.dev/blog/mini-shai-hu…
15
64
424
121,631
Bill Demirkapi retweeted
the US should lead on AI by continuing to develop the very best models, making sure they're safe, and getting cyber tools into the hands of trusted defenders. the new EO gets the balance right.
608
163
2,757
319,397
Bill Demirkapi retweeted
Today, we’re sharing that a general-purpose internal @openai model achieved a breakthrough on one of the best-known combinatorial geometry problems. Less than 1 year ago frontier AI models were at IMO gold-level performance. I expect this pace of progress to continue.
May 20
Today, we share a breakthrough on the planar unit distance problem, a famous open question first posed by Paul Erdős in 1946. For nearly 80 years, mathematicians believed the best possible solutions looked roughly like square grids. An OpenAI model has now disproved that belief, discovering an entirely new family of constructions that performs better. This marks the first time AI has autonomously solved a prominent open problem central to a field of mathematics.
81
194
2,398
448,793
6-8 months before distilled open weight models catch up.
May 16
Chinese students are buying GPT-5.4/5.5 and Claude API access from Xianyu/Taobao proxy sellers for almost 96-97% cheaper. People are apparently burning 100M tokens a day for like $1 and vibecoding nonstop.
4
2
41
8,683
Bill Demirkapi retweeted
May 7
The security industry is entering a period of compression. Model cybersecurity capabilities are rapidly increasing, and it's critical we arm defenders with the tools they need to protect what matters most.
We're launching two models today:
GPT-5.5 with TAC (Trusted Access for Cyber)
GPT-5.5-Cyber (Limited Preview)
GPT-5.5 is our starting point for most defensive workflows. It's exceedingly good at cybersecurity workflows and tasks like secure code review, vulnerability triage, detection engineering, malware analysis, and patch validation. We think this model is the right starting place for most organizations.
GPT-5.5-Cyber is exceptional for authorized workflows, including red teaming, penetration testing, and controlled validation. It's in research preview for specific organizations and requires enhanced verification and account-level controls.
We expect to continue to accelerate defenders with various models, including both our flagship models through Trusted Access for Cyber, and with dedicated cyber models like GPT‑5.5‑Cyber and even more cyber-capable models in the future. openai.com/index/gpt-5-5-wit…
14
66
431
41,414
Bill Demirkapi retweeted
OpenAI’s GPT-5.5 is the second model to complete one of our multi-step cyber-attack simulations end-to-end 🧵
95
398
2,360
1,772,189
Bill Demirkapi retweeted
We've released a new 5-point action plan for strengthening cyber defense. AI is reshaping cybersecurity. The same capabilities that help defenders may be used by malicious actors. One approach is to treat these systems as too dangerous for broad defensive use and limit them to a very small number of approved partners. We think that misses the central challenge. Attackers won’t wait. Existing models are already useful for many cyber workflows and capabilities will keep advancing. Criminal groups will adopt whatever tools are available. The best way to reduce national risk is to responsibly equip and accelerate trusted defenders faster than adversaries can adapt. Check out our plan ⬇️ openai.com/index/cybersecuri…
104
168
1,314
160,638
Bill Demirkapi retweeted
The amount of squabbling over bugs, bug quality, AI bug extermination, how security is doomed/not doomed/unchanged/improved based on bugs… it’s ridiculous. Bugs are not the totality of cybersecurity.
15
25
119
9,759
Bill Demirkapi retweeted
The biggest opportunity for would-be startup founders is AI. But the most underpriced opportunity is probably non-AI ideas. So if you have a good non-AI idea, go for it, because everyone else is going to overlook it.
356
566
6,899
336,570
this is unfortunately very real. the volume of distillation of western frontier models from threat actors attributed to China is an order of magnitude larger than any other nation.
The U.S. has evidence that foreign entities, primarily in China, are running industrial-scale distillation campaigns to steal American AI. We will be taking action to protect American innovation. These foreign entities are using tens of thousands of proxies and jailbreaking techniques in coordinated campaigns to systematically extract American breakthroughs. Foreign entities who build on such fragile foundations should have little confidence in the integrity and reliability of the models they produce. The U.S. government is committed to the free and fair development of AI technologies across a competitive ecosystem, from open-source to proprietary models. Read the memo: whitehouse.gov/wp-content/up…
1
1
3
1,376
I wouldn’t mind if we held everyone to the same standard.
Replying to @GergelyOrosz
I see this argument a lot. Chinese labs are held to a far different standard: today, US labs get sued every other month over copyright. Drop the suits, hold labs to the same standard, and I think it's a reasonable position. I don't see how it is "fair" otherwise.
1
635
Bill Demirkapi retweeted
The U.S. has evidence that foreign entities, primarily in China, are running industrial-scale distillation campaigns to steal American AI. We will be taking action to protect American innovation. These foreign entities are using tens of thousands of proxies and jailbreaking techniques in coordinated campaigns to systematically extract American breakthroughs. Foreign entities who build on such fragile foundations should have little confidence in the integrity and reliability of the models they produce. The U.S. government is committed to the free and fair development of AI technologies across a competitive ecosystem, from open-source to proprietary models. Read the memo: whitehouse.gov/wp-content/up…
578
2,247
8,038
938,814
Bill Demirkapi retweeted
Apr 23
Anthropic’s Mythos raised the bar for AI vuln detection but kept it invite-only. GPT-5.5 is OpenAI’s answer, and it’s open to all. We had early access. Ran the benchmarks. Blackbox GPT-5.5 already beats whitebox GPT-5. Best pentesting model we’ve tested. Read our analysis: bit.ly/48OX7v6
30
76
674
208,190
Bill Demirkapi retweeted
Apr 23
Introducing GPT-5.5
A new class of intelligence for real work and powering agents, built to understand complex goals, use tools, check its work, and carry more tasks through to completion. It marks a new way of getting computer work done. Now available in ChatGPT and Codex.
2,484
6,881
51,532
13,112,435
Bill Demirkapi retweeted
Apr 14
We’re expanding Trusted Access for Cyber with additional tiers for authenticated cybersecurity defenders. Customers in the highest tiers can request access to GPT-5.4-Cyber, a version of GPT-5.4 fine-tuned for cybersecurity use cases, enabling more advanced defensive workflows. openai.com/index/scaling-tru…
458
622
5,124
1,992,553
Taste without complex scaffolds is an interesting challenge for long or ambiguous tasks like e2e security reviews. Beyond reactive safety, excited to improve reasoning fundamental to defensive cyber capabilities and work with the community to reduce attacker/defender asymmetry!
1
1
957
Bill Demirkapi retweeted
Intel SGX has fallen! Its most important key is in our hands: we extracted the Global Wrapping Key from an instance of the Intel Gemini Lake platform
34
354
1,984
221,505