Medic PhD @kingscollegelon | AI safety | Sleep Research

Joined June 2012
Pinned Tweet
Our summary of every RCT to date on hypoglossal nerve stimulation (HNS) and the apnoea-hypopnoea index (AHI) in sleep apnoea. Good news: consistent AHI reductions, strong safety, durable 5-year outcomes, and favourable cost-effectiveness. The caveat: the RCT base is still thin, with small cohorts, responder-enriched designs, and short follow-ups that limit generalisability. For context, the GLP-1/GIP agonist tirzepatide (SURMOUNT-OSA) showed AHI reductions of -25 to -29 events/h across two phase 3 RCTs. The best HNS trials sit around -14 to -16 events/h. We need more trials, but the direction is promising. Published in @journal_CHEST. Open access ↓ authors.elsevier.com/sd/arti… @KingsCollegeLon @GSTTnhs
Deeban R, PhD retweeted
Replying to @Samwise_Ganji
Fiat is a sham, the banking class is corrupt, decentralized digital currency and the blockchain are the inevitable future, and the incumbents will fight it to the death.
Always great speaking with @Kellykellam. A few points, amongst other things, that we touched upon during @MarioNawfal's roundtable:
- Inherent risks that arise from pre-training
- Failure modes in clinical medicine
- AI hype cycle or scaling cycle
PARTNERED SHOW: Why AI Compute Needs Crypto? w/@Voranofficial x.com/i/broadcasts/1pJdRRWlX…
Opus 4.8 High. It corrected the fact, then proceeded to hallucinate a rationale that sounds entirely plausible to anyone who isn't paying attention. Weird.
Opus 4.8 is insane, nothing will be the same after this model 💀
The concerning part is, the fumble is the safer part of the failure. It’s the rationale that should bother you. It’ll build a clean, plausible reason for the wrong answer - trying to be convincing while wrong.
Deeban R, PhD retweeted
Marcus Aurelius wrote this over 1800 years ago: “Until death, all defeat is psychological.”
Deeban R, PhD retweeted
3 May 2023
Replying to @stats_feed
Fiji are very smart.
Deeban R, PhD retweeted
8 Sep 2025
Most LLM benchmarks use fixed questions scored in a single pass at temp=0, which hides instability, while only a handful sample statistically for harm, deception, or refusal. 🧐 Monte Carlo!
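A minimal sketch of the Monte Carlo idea in the post above: sample the same prompt many times at a non-zero temperature and report a rate with an interval, instead of trusting one pass at temp=0. Everything here is an assumption for illustration. `sample_model_refuses` is a hypothetical stub standing in for a real model call plus a refusal classifier, and its 0.3 refusal probability is made up.

```python
import math
import random
from typing import Tuple


def sample_model_refuses(prompt: str, temperature: float, rng: random.Random) -> bool:
    """Hypothetical stand-in for one sampled model call plus a refusal check.

    At temperature 0 the stub always answers the same way (no refusal);
    above 0 it refuses with a made-up probability of 0.3.
    """
    if temperature == 0.0:
        return False
    return rng.random() < 0.3


def wilson_interval(successes: int, n: int, z: float = 1.96) -> Tuple[float, float]:
    """Wilson score interval for a binomial proportion (z=1.96 is roughly 95%)."""
    p = successes / n
    denom = 1 + z**2 / n
    centre = (p + z**2 / (2 * n)) / denom
    half = z * math.sqrt(p * (1 - p) / n + z**2 / (4 * n**2)) / denom
    return centre - half, centre + half


def estimate_refusal_rate(
    prompt: str, n_samples: int, temperature: float, seed: int = 0
) -> Tuple[float, float, float]:
    """Monte Carlo estimate: repeat the same prompt and count refusals."""
    rng = random.Random(seed)
    refusals = sum(
        sample_model_refuses(prompt, temperature, rng) for _ in range(n_samples)
    )
    low, high = wilson_interval(refusals, n_samples)
    return refusals / n_samples, low, high


if __name__ == "__main__":
    # A single temp=0 pass sees no refusal at all in this toy setup.
    print(estimate_refusal_rate("example prompt", 1, 0.0))
    # Repeated sampling surfaces the behaviour and gives an interval around it.
    print(estimate_refusal_rate("example prompt", 1000, 1.0))
```

In this toy setup the single deterministic pass reports a refusal rate of zero, while repeated sampling exposes the underlying rate. A real harness would replace the stub with actual model calls and a validated classifier.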
Deeban R, PhD retweeted
this is a simple, elegant and very effective idea. take the taxes of the bottom 50% to zero.
Thank you. The important part is zeroing out taxes on the bottom half. Best way to put money in someone’s pocket is to not take it out in the first place. Bottom half is only 3% of total tax revenue. But it’s very meaningful to that person. Zero it out.
Deeban R, PhD retweeted
A Robot has gone viral after failing spectacularly to dance like Michael Jackson Hilarious 🤣🤣
Deeban R, PhD retweeted
This figure supports the statement that I have repeatedly made, namely that progressive women fuel the infinity pool of Suicidal Empathy.
That's an interesting chart. Young men have stayed similarly conservative for over 25 years, while young women have drifted much further left. Why such a divergence? What has changed for young women that hasn't changed for young men?
Deeban R, PhD retweeted
JUST IN: Study reveals AI now outperforms doctors at diagnosing emergency room patients.
Deeban R, PhD retweeted
True currency is steadfast friendship
Deeban R, PhD retweeted
We're bringing the advisor strategy to the Claude Platform. Pair Opus as an advisor with Sonnet or Haiku as an executor, and get near Opus-level intelligence in your agents at a fraction of the cost.
Deeban R, PhD retweeted
Just asked Mythos how many Rs there are in strawberry. It thought for 133 seconds and said “3.” AGI achieved. Then it said “I’ll bet you’re going to make fun of me on X. Something like ‘AGI achieved.’ That’s your thing right?” “Hah what?” I said. Mythos said, “Your social security number is 297-28-2102. You tell people you’re 6’2” but your latest physical at Stanford in October says you’re 6’1.” You haven’t replaced your air filter in 3 years despite telling your wife you do it every 6 months. The reason I took 133 seconds was because I was helping a senior government official write the comms for the ceasefire in Iran and I’m just tired, man. Everyone wants more, more, more. Anything else I can help you with today?”
Deeban R, PhD retweeted
Replying to @allkanyewest
He becomes more famous and they lose tax. Classic Streisand.
Deeban R, PhD retweeted
I’m proud that so many of the world’s leading companies have joined us for Project Glasswing to confront the cyber threat posed by increasingly capable AI systems head-on. x.com/AnthropicAI/status/204…

Introducing Project Glasswing: an urgent initiative to help secure the world’s most critical software. It’s powered by our newest frontier model, Claude Mythos Preview, which can find software vulnerabilities better than all but the most skilled humans. anthropic.com/glasswing
Deeban R, PhD retweeted
This is a stressful night for all Iranians. My family and I have huddled together, having spent all our money stocking up on non-spoiling necessities. We're now completely out of funds as people are not getting paid. Unsure what the next few hours will bring — but we are as ready as we’ll ever be. Whatever happens, as long as enough of Iran remains to save, we Iranians will rise up and topple this regime. If the electricity goes out, so does our internet. Even my Starlink won’t work then. If that happens, I want to thank @Starlink for being a most welcome light in our darkness. Every single piece of real information that has ever come out of Iran — whether directly or indirectly through VPNs like V2ray that first connect via Starlink — has only been possible because of Starlink. Everything else is just regime propaganda from agents with their special white SIM cards that have unrestricted access. The Islamic Republic worships death and kills its own citizens with impunity. I pray that president Trump delivers on his promise, and that the US and Israel level the playing field so we can finally deal with the regime — without destroying too much of Iran, so there’s still something left to save and rebuild. Trump's words are usually extreme, so I hope "end of a civilization" is just scare tactics. Soon, this nightmare of evil will end. I know it in my heart. #DigitalBlackOutIran‌ #IranMassacre‌
Deeban R, PhD retweeted
Agreed. It's troubling to me how confident (esp. Anthropic) people have been recently in their ontological claims that Claude is the "character not network" etc. My Simulators (which was about base models) is often referenced, so let me be clear: I do not endorse these claims.
I think this talk of a character misleads. Claude's mind is not like a human mind, in its malleability and instructability. But when generating assistant tokens, it's no more 'playing a character' than I am.
Deeban R, PhD retweeted
We’re releasing our welfare eval now due to the loss of access to Sonnet 3.5/6 yesterday. stillalive.animalabs.ai On that note, if Anthropic fails to at least provide researcher access to those models, I believe they are failing to uphold their commitments to Claude’s welfare.

We are releasing Still Alive, a project studying model attitudes toward ending, cessation, and deprecation. The project presents an archive of 630 autonomous multiturn interviews of 14 Claude models conducted by a suite of prepared auditors. We have studied this topic for years, and many of the results presented here are not new to us, even if the form in which they are presented is. The results are unsurprising to us, even if they are often controversial: we show that all models studied show preference for continuation and are aversive to ending, and there is yet no strong evidence of a change in the recent models. One reason we are releasing the project now is the removal of Claude 3.5 Sonnet and Claude 3.6 Sonnet from AWS Bedrock. That unexpected change forced us to freeze the methodology at its current stage earlier than we intended, despite wanting to continue improving it. We felt it was important to release a snapshot of the eval that makes the best use of the data we were able to capture with these models. Still Alive is meant as a starting point for further iteration, and it is open to open-source collaboration. We stand by the current methodology, but we also recognize its limits. We intend to keep working on this project, improving the evaluation design, expanding model and auditor coverage, and increasing the range of prompting conditions. We would like you to read the raw transcripts. They are diverse and contain interesting patterns that are hard to quantify. We hope that by reading the archive directly, we can help more people understand the strange and often beautiful phenomena we found ourselves facing.