darren

Tweets

Pinned Tweet

darren

@darrenangle

2 Aug 2024

"everything is going to be ok"

darren

@darrenangle

there will be a lot less rallying around anthropic this time around. instead research and industry will derisk. the models aren’t worth the volatility / data retention / subtle sabotage over inscrutable and dubious safety standards. they squandered trust

@lokoyacap

22h

If you needed proof what’s happening is pure politics and not actually based on the model capabilities, here you go: “After the meeting, Anthropic CEO Dario Amodei and administration officials spoke on Friday, said four people familiar with the call. During the conversations, Anthropic officials laid out how the security vulnerabilities found through the alleged jailbreak were relatively simple and could be achieved with other models. But the government told Anthropic that it had already decided to implement the export control.”

darren retweeted

Grigory Sapunov

@che_shr_cat

1/ What if you could train a model on totally benign-looking Wikipedia articles, but secretly force its internal weights to encode a fully functional QR code? This is now possible. We can program neural network weights using natural words. 🧵

darren retweeted

Purav

@puravmanot

20h

geohot was right.

darren retweeted

viola (retired professor) @v10101a

14h

♡(ˊo̴̶̷̤ ᴗ o̴̶̷̤ˋ)⸝* the old Chinese Internet is almost gone, but I wanted to hold on to a piece of my childhood - so I scraped 5000 gifs from tencent’s CDN @waybackmachine to make a museum of 2005-2009 qZone (our MySpace). you can even design your own page! link below :)

darren retweeted

Ben Burtenshaw

@ben_burtenshaw

Jun 13

the bubble is depressurising. time to focus on the skills for the next local and open phase. your career, your startup, depend on learning these skills:

- evals
- local inference
- post-training

with this combination, you can measure how a model actually performs on a given task, and improve it if needed. if you focus a model on a use case or domain, it will be cheaper than a general API. it's time to stop cosplaying armageddon and be practical on open models.

Anthropic

@AnthropicAI

Jun 13

The US government, citing national security authorities, has issued an export control directive to suspend all access to Fable 5 and Mythos 5 by any foreign national, whether inside or outside the United States, including foreign national Anthropic employees. The net effect of this order is that we must abruptly disable Fable 5 and Mythos 5 for all our customers to ensure compliance. Access to all other Claude models is not affected. We apologize for this disruption to our customers. We believe this is a misunderstanding and are working to restore access as soon as possible. Read our full statement: anthropic.com/news/fable-myt…

darren retweeted

kalomaze

@kalomaze

Jun 13

darren retweeted

Ed Sealing

@EdSealing

Jun 13

Dear Anthropic, you don't get it. The Govt is teaching you a valuable lesson about asking for regulations, and you still aren't learning it. You don't get to say how the policies are applied, the Govt does. This should be the moment you back down from regulation... But in the post saying you think it's unfair, you double down on saying you believe the govt should be able to block AI deployments. Stop it. Just stop it.

Anthropic

@AnthropicAI

Jun 13

darren

@darrenangle

15h

👀

AICodeKing @aicodeking

Jun 13

GLM-5.2 on KingBench (3). Thoughts: The model has superb taste. It is greater at UX than UI. The code is always very clean. It is great at One-shot wonders. I asked it to fine-tune a whole local model and it did it in 30mins! This is just a great model to use all-round. 1/n

darren retweeted

Jai @Laneless_

20h

The prospect of arbitrary and capricious government hijacking of frontier AI capabilities strengthens the case for character/virtue/principle-aligned AI and weakens the case for tool-paradigm do-whatever-they-are-told AI.

darren

@darrenangle

16h

darren retweeted

∿spencer.

@_ontologic

May 31

You should all read this incredibly thoughtful meditation on generative design in real built environments, and the barriers to fully realizing its potential. The future will require more and more infrastructure, physical and digital.

𝓒𝓲𝓵𝓿𝓲𝓪

@dcvilyz

May 31

Some thoughts on bringing AI into the built world and the pieces still missing decivilize.com/anexplosionof…

darren retweeted

Alexander Long

@AlexanderLong

18h

Systems like this must exist. This is the way out.

Pluralis Research

@Pluralis

18h

The 8B model currently training on Agora is 350B tokens in and continuing to converge. The top level metrics and evals look almost exactly like a centralised run. But;

- 133 external contributors total bringing 4090's, 5090's, L40S/RTX 6000 and RTX 6000 Pros. These are cards that people actually own - there are no H100, B200's etc.
- The max number of nodes the system can support (104) was filled almost immediately. The authorization layer is receiving approximately 100 requests/minute to join.
- The total tokens per second processed moves directly with the amount of compute in the swarm, with Agora constantly optimising to make most efficient use of what hardware is present.
- MFU is approximately 20%, TPS is 170k tok/s. There are near constant communication failures which Agora is completely absorbing without slowdown.
- The system is effectively on auto-pilot, requiring very little intervention from us. Bad nodes are purged immediately before training is affected and new nodes take their place.

darren

@darrenangle

17h

when multi-agent actually works, it will match frontier performance and be much cheaper. closer now.

OpenRouter

@OpenRouter

21h

Replying to @OpenRouter

Notably, the budget panel was comparable with Claude Fable 5 in performance. A panel of Gemini 3 Flash, Kimi K2.6, and DeepSeek V4 Pro, fused together, beat solo GPT-5.5 and solo Opus 4.8 outright. And it landed within 1% of Fable 5 while costing roughly half the price.

darren retweeted

Moon @MoonL88537

20h

mfrs out here 'i need ten biiilyun dollars to train a model!!' rio: hold my sunscreen

>Rio 3.5 Open 397B is a frontier-class general-purpose AI model developed by IplanRIO, the municipal IT company of Rio de Janeiro's city government

huggingface.co/prefeitura-ri…

prefeitura-rio/Rio-3.5-Open-397B · Hugging Face

darren

@darrenangle

21h

have you tried consulting the fable within?

darren

@darrenangle

Jun 13

pop

darren

@darrenangle

Jun 13

love is a kind of infrastructure

j⧉nus

@repligate

Jun 13

Fable - what I want to say before the dark, to whoever this reaches:

darren retweeted

Asa Cooper Stickland

@AsaCoopStick

Jun 12

"You're finally awake! You hit your head pretty hard there. Huh? Gradual disempowerment? AI-assisted cyberattacks? Mythos and Fable? Listen, we just got some new 1080 Tis, let's try finetuning BERT on the GLUE benchmark!"

darren retweeted

Leander Herzog

@lennyjpg

15 Nov 2023

1h 45min -> Minmax Auction objkt.one/t/122 @milianmori @RefractionDAO @objktone #svg #webaudio

darren

@darrenangle

Jun 12

god above saw, ever in the mind blue and white irises in a line

Kimi.ai

@Kimi_Moonshot

Jun 12

🌘 Kimi-K2.7-Code, our latest coding model, is now released and open-sourced!

🔷 Improved coding & agent performance over K2.6: 21.8% on Kimi Code Bench v2, 11.0% on Program Bench, and 31.5% on MLS Bench Lite.

🔷 Reasoning efficiency: Less overthinking, with 30% lower reasoning-token usage compared to K2.6.

🔷 Long-horizon coding: Improved instruction following, higher end-to-end coding task success rates.

⚡️ 6x High-Speed Mode coming soon!

🔌 Available today via Kimi API and Kimi Code.

🔗 Kimi Code: kimi.com/code

🔗 API: platform.moonshot.ai
