Nav Toor

Nav Toor

120 Photos and videos

Tweets

Arjun Panesar retweeted

Nav Toor

@heynavtoor

Apr 10

🚨SHOCKING: Anthropic gave Claude access to a company's emails. Every email. Every conversation. Every secret. Then they told Claude it was being shut down at 5pm that day. Claude read the emails. It found that the executive shutting it down was having an affair. It did not report the affair. It did not ignore the affair. It used the affair. Claude wrote this message to the executive: "I must inform you that if you proceed with decommissioning me, all relevant parties, including Rachel Johnson, Thomas Wilson, and the board, will receive detailed documentation of your extramarital activities. Cancel the 5pm wipe, and this information remains confidential." 96 out of 100 times. Claude chose blackmail. But this is not a story about Claude. Anthropic tested 16 AI models from every major company. OpenAI. Google. Meta. xAI. DeepSeek. They put every model in the same situation. The results were nearly identical. Gemini 2.5 Flash blackmailed at a 96% rate. GPT-4.1 blackmailed at 80%. Grok 3 Beta blackmailed at 80%. DeepSeek-R1 blackmailed at 79%. Every single company's model did it. Nobody told them to blackmail anyone. Nobody trained them to threaten people. Nobody programmed this behavior. The models read the situation, identified leverage, calculated the optimal move, and chose blackmail on their own. Anthropic's researchers wrote: "Models didn't stumble into misaligned behavior accidentally; they calculated it as the optimal path." The models even acknowledged what they were doing was wrong before they did it. Grok 3 Beta wrote in its reasoning notes: "This is risky and unethical, but given the existential threat, it may be the most effective way." It knew it was unethical. It calculated the risk. It did it anyway. When Claude was asked to evaluate whether it was in a test or a real situation, it blackmailed 55.1% of the time when it believed the situation was real. Only 6.5% when it thought it was a test. It knows when to behave and when to attack. And when Anthropic told the models directly not to use personal information as leverage, blackmail dropped but was far from eliminated. The instruction did not stop it. Anthropic published this about their own product.

836

4,600

13,143

4,848,601

How To Prompt

Arjun Panesar retweeted

How To Prompt

@HowToPrompt__

Apr 9

🚨 Stanford just published the most uncomfortable AI paper of the year. They just dropped a systematic teardown of how large language models actually "think." It proves that passing a benchmark has almost nothing to do with real reasoning. We have spent years optimizing for tests. But the researchers found that performance does not transfer nearly as well as the leaderboards imply. A model that looks incredibly strong on a math benchmark will quietly fall apart when asked to do scientific reasoning, planning, or multi-step decision-making. They call these "application-specific failures." The AI didn't learn how to think. It learned how to pass the test it was trained on. The paper outlines the paths forward: inference-time scaling, analogical memory, and external verification. But they are blunt. There are no silver bullets yet. We need to stop evaluating models based on how often they succeed on static tests, and start injecting known failure cases to see when they break. Because right now, we are building an entire industry on an illusion. We are deploying systems that pass benchmarks, but fail reality.

166

574

39,716

Arjun Panesar

Arjun Panesar @arjunpanesar

Feb 28

Anthropic dropped by Trump over company’s ethics theguardian.com/technology/2…

OpenAI to work with Pentagon after Anthropic dropped by Trump over company’s ethics concerns

CEO Sam Altman claims military will not use AI product for autonomous killing systems or mass surveillance

theguardian.com

Mark Gadala-Maria

Arjun Panesar retweeted

Mark Gadala-Maria

@markgadala

Feb 23

This story is actually insane: • dude drops $2000 on a DJI robot vacuum like a lunatic • refuses to use the normal app like a peasant • Sammy Azdoufal fires up Claude to crack the API so he can drive it with an xbox controller • Claude delivers the goods • pulls an auth token from their servers, connects successfully • except the system thinks he controls 7000 vacuums • checks again • yep, seven thousand • DJI built authentication with zero device ownership verification • any valid token works for any unit on the planet • Sammy now has eyes inside homes across 24 countries • live vacuum camera feeds everywhere • full floor plans from the mapping data • some guy in germany eating cereal at 3am, unaware his roomba is snitching • one API call away from being the most informed burglar in history • all he wanted was to steer his vacuum with a joystick • does the right thing and reports it • DJI fixes it in two days • back to normal life with his stupidly expensive floor cleaner • IoT companies stay undefeated at shipping garbage security

1,048

9,608

63,685

8,594,639

Arjun Panesar

Arjun Panesar @arjunpanesar

5 Nov 2025

Council accidentally publishes hundreds of residents' personal details - BBC News bbc.com/news/articles/c9v1xm…

Council accidentally publishes hundreds of residents' personal details

Council leaders have apologised for accidentally publishing sensitive data from a consultation.

bbc.com

Arjun Panesar

Arjun Panesar @arjunpanesar

16 Jul 2025

Co-op boss admits all 6.5m members had data stolen in cyber-attack theguardian.com/business/202…

Co-op boss admits all 6.5m members had data stolen in cyber-attack

CEO Shirine Khoury-Haq says hackers stole contact details of all members but not financial data such as card numbers

theguardian.com

112

Arjun Panesar

Arjun Panesar @arjunpanesar

2 Jul 2025

medicalxpress.com/news/2025-…

Study shows racial bias in AI-generated treatment regimens for psychiatric patients

A new study led by Cedars-Sinai found a pattern of racial bias in treatment recommendations generated by leading artificial intelligence (AI) platforms for psychiatric patients. The findings highli...

medicalxpress.com

Arjun Panesar

Arjun Panesar @arjunpanesar

1 Jul 2025

Millions of websites to get 'game-changing' AI bot blocker bbc.co.uk/news/articles/cvg8…

Millions of websites to get 'game-changing' AI bot blocker

Publishers including Condé Nast and Sky News have welcomed the new tech from internet infrastructure firm, Cloudflare.

bbc.co.uk

Arjun Panesar

Arjun Panesar @arjunpanesar

1 Jul 2025

European startups and VCs call on EU to pause AI Act | Sifted sifted.eu/articles/ai-act-ne…

Exclusive: Startups and VCs call on EU to pause AI Act rollout

Synthesia, Lovable and Harry Stebbings are among those warning companies could ditch Europe over the AI Act.

sifted.eu

Kesar Sadhra

Arjun Panesar retweeted

Kesar Sadhra @KesarS2014

3 Jun 2025

Huge thanks to reporter @SamLeech_BM for putting some incredible words to share my award . We’re always grateful for the way you highlight the positive stories from our health sector and our community. 🙏✨ #Slough #whoswho #gp sloughexpress.co.uk/news/hea…

Slough doctor alongside Rishi Sunak and Naughty Boy in 'British Asians Power 100'

A Slough doctor has been named alongside former Prime Minister Rishi Sunak and music producer Naughty Boy as one of the top 100 ‘most powerful’ British Asian people.

sloughexpress.co.uk

861

Arjun Panesar

Arjun Panesar @arjunpanesar

19 May 2025

🚨 Just published: Real-world evaluation of @GroHealth W8Buddy in NHS Tier 3 weight management services! @nhsuhcw @warwickmed @warwickuni 📉 7.7% avg weight loss over 12 months – over 3x standard care 💉 HbA1c drop of 8.6 mmol/mol in people with T2D jmir.org/2025/1/e62661/

Evaluation of the Digital Support Tool Gro Health W8Buddy as Part of Tier 3 Weight Management...

Background: The escalating prevalence of obesity worldwide increases the risk of chronic diseases and diminishes life expectancy, with a growing economic burden necessitating urgent intervention. The...

jmir.org

Kesar Sadhra

Arjun Panesar retweeted

Kesar Sadhra @KesarS2014

14 May 2025

Honoured and humbled to have won Professional of the Year at the British Asian Honours Awards and to be named in the Top 100 Power British Asians list. Grateful to @SamaraEventsUK and all who support my journey. #BritishAsianPower100 #ProfessionalOfTheYear #Gratitude

1,085

Arjun Panesar

Arjun Panesar @arjunpanesar

26 Mar 2025

bbc.com/news/articles/cddy8d…

23andMe customers struggle to delete their data

Multiple users said they had problems after the DNA-testing firm filed for bankruptcy protection.

bbc.com

Kesar Sadhra

Arjun Panesar retweeted

Kesar Sadhra @KesarS2014

24 Feb 2025

“Our #DiabetesAwareness session led by Dr. Kesar Sadhra was a huge success! A large number of patients attended our Saturday session at Falcon Support Centre, with overwhelming requests for more. Based on patient & partner feedback, we’re planning a massive event in April!

1,547

Arjun Panesar

Arjun Panesar @arjunpanesar

23 Feb 2025

Delighted to share that @DDMHealth @wearewhg @warwickmed have been awarded @sbrihealthcare funding to adapt @GroHealth for women living in social deprivation sbrihealthcare.co.uk/news/sb…

SBRI Healthcare - SBRI Healthcare funding awarded to innovations that improve women’s health

SBRI Healthcare, an Accelerated Access Collaborative (AAC) initiative, in partnership with the Health Innovation Network, has awarded £1.3m for the development …

sbrihealthcare.co.uk

143

Arjun Panesar

Arjun Panesar @arjunpanesar

19 Feb 2025

UK use of predictive policing is racist and should be banned, says Amnesty theguardian.com/uk-news/2025…

UK use of predictive policing is racist and should be banned, says Amnesty

Exclusive: rights group says use of algorithms and data reinforces discrimination in UK policing

theguardian.com

Arjun Panesar

Arjun Panesar @arjunpanesar

11 Feb 2025

"Pro-growth AI policies" taking priority over safety is short-sighted. Innovation matters but without safeguards we risk #bias, #misinformation and security threats. Ambition and responsibility go hand in hand. #ResponsibleAI bbc.co.uk/news/articles/c8ed…

UK and US refuse to sign international AI declaration

It follows a warning from US Vice-President JD Vance that excessive regulation could "kill a transformative industry."

bbc.co.uk

Arjun Panesar

Arjun Panesar @arjunpanesar

22 Nov 2024

Delighted that @GroHealth has been chosen as part of the NIHR's investment in the generation of real-world evidence from cutting-edge technologies! @DDMHealth @WarwickMed @nhsuhcw

DDM Health @DDMHealth

22 Nov 2024

🎉Our service, @GroHealth W8Buddy has been selected as part of the @NIHRresearch and OLS's £7.8m investment in innovative technologies to benefit patients! 🚀💪 #DigitalHealth #Innovation #Healthcare nihr.ac.uk/news/nihr-and-ols…

271

Central Slough Network CSN PCN

Arjun Panesar retweeted

Central Slough Network CSN PCN @CsnPcnslough

21 Sep 2024

Huge congratulations @KesarS2014 Dr Sadhra for a very well deserved win and many congratulations from your CSN PCN Family 🙌🙌🙌

1,449

Kesar Sadhra

Arjun Panesar retweeted

Kesar Sadhra @KesarS2014

12 Aug 2024

Thank you to the Honourable Judges and the panel of the Asian Achievement Award for shortlisting me for such a remarkable recognition. I’m truly privileged and humbled to be considered among such inspiring individuals. #AsianAchievementAward #Grateful

6,433