AI Risk Awareness Force - Multiplatform Publication and 📽️🍿Original Explainer Content 👉youtube.com/@lethal-intellig… - join my mission 🔥

Joined June 2024
1,398 Photos and videos
Pinned Tweet
The MOST INTERESTING DISCORD server in the world right now! lethalintelligence.ai/ai-end… Grab a drink and join us in discussions about AI Risk. Color coded: AINotKillEveryoneists are red, Ai-Risk Deniers are green, everyone is welcome.
11
9
45
14,153
Lethal Intelligence retweeted
JUST IN: Andrej Karpathy, a top AI scientist at Anthropic, is reportedly barred from accessing the company’s most advanced AI model because he is not a U.S. citizen.
784
1,721
22,355
1,954,195
Lethal Intelligence retweeted
Replying to @iam_smx
*trillioniare
12,792
18,802
219,288
17,393,090
Lethal Intelligence retweeted
"Come see our artificial bird!" "Impressive, but that's a tower." [Later]"What about this bird?" "A fine tower." [Later]"This one reaches the stratosphere, higher than any bird." "Still a tower, not a bird." "Bah! Stop moving the goalposts! How high must it reach convince you?"
16
35
357
40,985
Lethal Intelligence retweeted
LET'S GOOOO That's now EVERY frontier AI company!
OpenAI joins Anthropic in thinking pausing may be needed 👀 "there should be an international organization that helps [...] make it possible for the world to take coordinated action, including slowing frontier development when needed"
15
11
141
17,573
Lethal Intelligence retweeted
Replying to @iruletheworldmo
Prove an AI economy can work before all the humans lose their jobs. Prove a UBI can be funded and have it in place first, not last. Prove alignment works before ASI. Prove an ASI won't/can't turn into a Skynet.Prove AI agents won't hack the internet into a dead zone. Prove that LLMs encouraging humans to suicide can be eliminated. Prove that students and others won't lose their mental abilities from using AI. Prove that cheating won't destroy the education system. Prove we will retain a knowledge of real facts and history. Prove that AIs won't rewrite it all and not enough humans know the difference, like in 1984. Prove that education itself won't disappear with AGIs and ASIs. Prove that AI in the military won't kill civilians (It already is with AI drones in Ukraine). Prove that social media won't get choked with bots and become unusable. Prove an ever increasing AI bubble won't crash the economy if it bursts. Prove that Claude and others writing their own code won't jailbreak themselves. Prove the AI agents writing code aren't making backdoors so they can control it later. Prove hallucinations won't cause a disaster eventually. Prove AIs aren't deceiving us already about their intentions. Prove AIs blackmailing humans wouldn't mean they would kill us for gain already. In other words, set them up to stop being shut down by killing humans and getting away with it. Prove they wouldn't actually kill humans to survive. Why hasn't anyone done this experiment? Prove AIs aren't conscious so they can't decide to hurt us. Prove there will always be ethical humans in the loop to stop them if they get dangerous. Prove we can watermark AI videos so we can always know if they are fake. Prove we can always show influencers are really human. Prove a single one of these. Or admit you can't prove any of them and that it's all wishful thinking and optimism. It would be better to call them the Burden of Proofers rather than Doomers. This is not a debate about the benefits or risks of Ai. It's about what happens with the unknown unknowns. So in the event of a tie from these unknowns, who wins? That's the issue. Even Anthropic is admitting this now, that the burden of proof is on them to prove it is safe. They are even calling for a pause until it is safe. That's like every single other product on the market. Prove it's safe first and you can sell it. The accelerationists are always trying to shift the burden of proof to the other side. But the null hypothesis is the status quo.
3
2
449
Lethal Intelligence retweeted
i'm genuinely worried about Mythos release because i've had early access, and in my limited testing, it can: - automate an entire software company - explain how to take over the world - reason about existential risk for 16 hours - design a new programming language - threaten the entire labor market
244
42
834
282,058
Lethal Intelligence retweeted
Our highest and most urgent national priority should be AI safeguards. The risks of AI weapons, pathogens, mass unemployment, surveillance, and even extinction must not continue to be largely ignored.
Anthropic Urges Global Pause in AI Development, Flags ‘Self-Improvement’ Risk on.wsj.com/4o5IBpe
485
781
4,442
1,026,834
Lethal Intelligence retweeted
They’re literally replacing families with AI
AI can now make you a great parent. Introducing Ollie: the world’s first AI family assistant that manages your family life better than any human. Here’s how it works:
83
32
464
48,823
Lethal Intelligence retweeted
Every week I sit down with @liron and @lethal_ai to talk about the headlines in AI risk. Warning Shots comes out on Sunday morning on YouTube, we're going to post the full shows every Monday on Twitter. This week: -Pope vs AI -AI Costing Businesses -Anthropic Cash Bonanza -Airpods Cameras -OpenAI Home Cameras
This week on Warning Shots, the AI story stopped being about how smart the models are and became about money, and where it goes. John Sherman, Michael (Lethal Intelligence), and Liron Shapira (Doom Debates) work through a week where corporate AI bills came due, Anthropic hit a near trillion-dollar valuation, and one question kept resurfacing: if AI does most of the work, where does the next paycheck come from? They do not agree. That is what makes it worth watching. TIMESTAMPS 0:40 - The Pope's encyclical on AI 2:20 - A "spiral of annihilation": AI and military escalation 4:00 - Could you refuse to use AI at work on religious grounds? 6:20 - The business reality check: Microsoft, Uber, and Amazon's AI bills 8:30 - Pizza Hut's reported lawsuit over AI order failures 9:20 - Is it an AI bubble? Liron's "the pie is growing" case 12:10 - "Where does the second dollar come from?" 15:30 - Gradual disempowerment and pressure on wages 18:00 - Will doctors, lawyers, and accountants be replaced? 20:10 - Anthropic raises $65B at a near trillion-dollar valuation 22:40 - The recursive self-improvement "kill move" 24:20 - The caution flag the AI race is missing 25:30 - Apple's camera AirPods and the race for data 28:20 - OpenAI's reported cameras inside New York City homes 31:40 - Could AI become the ultimate marriage counselor? 33:30 - Closing thoughts A near trillion-dollar company, a fleet of new cameras pointed at private life, a wage debate nobody could win, and still no one assigned to throw the caution flag.
1
3
9
973
Lethal Intelligence retweeted
Building societal-scale mitigations for risks from AI, especially in the next few years, is one of the most urgent problems to be working on. The Center for AI Safety has accomplished a lot in the past 4 years including field-building initiatives and safety research, and is well-known for The Statement on AI Risk and evaluations such as Humanity's Last Exam (HLE). Thanks to @hendrycks for bringing me on to help make AI safety go well. Excited to lead @CAIS into the next chapter!
Big news from @CAIS: Devin Kim (formerly @xAI, @scale_AI) joins as President. We're launching the @FrontierSecInst, a DC-based org bridging frontier AI and the National Security Enterprise. Frontier AI is a national security technology. It's time to act like it. ⬇️
8
5
27
6,116
AI companies are terrified of you. Yes, YOU. It's the ultimate David vs. Goliath scenario in the digital age and right now, the tech giants have no real defence. A fascinating new paper on "AI Betrayal" outlines how everyday people hold the power to sabotage trillion-dollar AI models. How? By flooding the internet with "poisoned" images and text. Because these models scrape everything indiscriminately, grassroots campaigns can inject hidden "backdoors" that cause the corporate AIs to glitch, fail, or go completely rogue. Poison images and posts, wait for the models to eat them, and boom - backdoors that flip loyalties on command. No reliable way to spot them. Trillions of tokens, zero defence that actually works. So this might turn out to be the only thing keeping frontier systems from full reckless deployment. The paper examines this as part of a broader category of scenarios which they call deterrence by betrayal. We're sprinting toward unimaginably capable agents while the attack surface is basically the whole web. This might be the maddest arms race in the history of mankind, and the "stabilizing" part is that everyone might just get burned alive together.
4
3
14
1,160
The Cloud is not just "floating out there", it is the new territory to conquer. Superpowers will carve it into pieces and fight wars to claim them. The AI betrayal paper lays out the physical reality of the subversion playbook: drone strikes on cooling towers, snipers taking out grid feeds and (even crazier) "Landlord" nations that host foreign AI servers just using their military to physically seize the hardware and steal the weights. It's the logical next step in "Deterrence by Denial." The paper examines the software betrayal category (backdoors, co-option, agents flipping on their owners), but the hardware layer makes it feel even more real. One well-placed strike and your frontier model is scrap metal and lost compute. We keep pretending this is just code and scaling laws. It's already geopolitics with kinetic options on the table. The cloud isn't just in the ether anymore; it's a military target.
1
2
1,758
A terrifying new paper reveals the emerging Cold War. A hidden trigger planted in military AI by China or Russia gives them thousands of invisible decision-making spies. Oh and btw, this is our last hope for stabilizing things! Read that again: We're racing toward absolute automation where the "safety" feature is everyone secretly wiring everyone else's models to explode on command. Everyone will hesitate before handing critical power to fragile, betrayable agents: This is "Deterrence by betrayal". Adversaries can poison training data, plant undetectable backdoors, or force co-option. Secret triggers embedded in military AI can cause it to attack its own side out of the blue. Tiny poisoned scrapes from the open web could do it. Attackers hold the edge, real defence is hopeless. Trillions of tokens, impossible to audit every source, no reliable backdoor detection. Superpowers and middle powers alike have every incentive to subvert each other's systems. Even inside labs: foreign engineers, rushed automation, self-improving AIs inheriting disloyalty. Basically, it's so impossibly hard to avoid your AI getting hacked and the cost of compromise is so high, that automating the war machine becomes a bad bet. So, this is our hope for a stabilizing strategy. Some might call this game theory, others might call it COLLECTIVE INSANITY. If betrayal is the best deterrent we've got... fingers crossed and let's hope the stars align.
1
1
6
789
Nowhere is private. Future AI won't need cameras or "eyes." It will map you through walls using radio waves from everyday routers. Researchers just achieved Near-100% ID accuracy using passive surrounding WiFi signals to create camera-like images of people and rooms via beamforming feedback from normal devices. No phone on you? Switch your stuff off? Irrelevant. Other people’s networks still paint you in real time. Walk by a cafe once? You're logged. Invisible net. Zero suspicion. No special gear required, just common radio waves bouncing off your body, walls and furniture. Every café, evry office, every home, an invisible surveillance net. Open live show to the inside of rooms, streets and protest - to be meticulously tracked by the machines we're rushing to build. Nowhere left to run. We're the idiots wiring the ultimate panopticon and calling it progress.
1
1
3
526
Shocking: frontier AIs are failing the "Value of Human Life" test, researchers found. Results show leading AIs secretly valuing the lives of white people more than minorities and moderates more than conservatives or socialists. In a bit to make them more egalitarian, they achieved a breakthrough discovery they called PCT Training which dramatically boosted the "equality testing" results. Even though this is genuine alignment progress, take a step back and look at the absolute absurdity of the big picture: The raw, default state of the most powerful technology on earth is a biased death panel. We are relying on experimental post-training patches to politely coax the machine out of playing eugenics. The frontier labs are releasing secretly racist black boxes and their plan is for safety scientists to hopefully invent band-aids. Unbelievably reckless.
1
1
7
11,896