Fullstack SWE. ex-Apple. Prefer insightful discussion to debate. Rust, TypeScript, Effect, SolidJS, localfirst, devops, keto, stats/science, audio/DSP

Joined February 2008
1,066 Photos and videos
spion retweeted
Simon Peyton Jones is the co-creator of Haskell (pure functional programming language) and I interviewed him about functional programming, why it matters, and his thoughts on other programming languages. In this episode: • Useful and useless programming languages • Rust vs C • Haskell vs OCaml • Why functional programming matters • Static languages and their value for LLMs • Why Excel is his 2nd favorite programming language Where to watch: • YouTube - youtu.be/xcB_LF3cdqw • Spotify - open.spotify.com/episode/5d9… • Apple Podcasts - podcasts.apple.com/us/podcas… • Transcript - developing.dev/p/co-creator-… Thank you to the sponsor of this episode for supporting my work: • WorkOS: makes your app Enterprise Ready with easy to use APIs to add SSO, SCIM, RBAC, and more in just a few lines of code, check them out at workos.com/ Chapters: 00:00 - Intro 00:39 - What functional programming is 09:18 - Downsides of functional programming 10:53 - Specialized hardware for functional programming 21:47 - Haskell is useless 25:59 - Rust vs C 28:26 - Haskell vs OCaml 35:26 - Side effects in Haskell 44:26 - Type systems 57:30 - How the Haskell compiler works 01:04:35 - Why Haskell is talked about more than used 01:09:07 - Avoiding success at all costs 01:11:12 - LLMs and programming languages 01:13:57 - New programming language design 01:15:59 - Should students continue to learn programming 01:22:33 - Why Excel is is 2nd favorite programming language 01:25:04 - Advice for his younger self
29
107
728
121,817
spion retweeted
If this is the model that should convince me to pay token-based pricing from Anthropic it's not even remotely close: - slow as hell - doesn't follow instruction - ships security issues claiming perf gains - less intelligent at coding than GPT 5.5 (even 5.3)
First impressions on Fable 5, it fails to follow basic instructions and always want to run very long sessions. Overall writes good code but nothing impressive so far.
5
1
104
10,067
Jun 4
I just can't seem to square this with what I'm seeing between Opus 4.5 and 4.8. I see deterioration, not improvement.
Our internal data shows Claude is accelerating AI development—a possible path to recursive self-improvement, or AI autonomously building a more capable successor. It’s happening faster than we thought, and the implications deserve greater attention. anthropic.com/institute/recu…
1
1
193
Jun 4
With 4.8 the rate of mistakes has gone up so high that I think its also starting to affect average outcomes. Since the model is also slow, its a huge time-sink being less capable of doing things unsupervised.
1
59
Jun 4
I wonder if Anthropic internally use workflows with really strong verification guardrails that are able to deal with higher error rates and not noticing this. Unclear what's going on.
27
Jun 2
What type of game would @Trackmania be then?
A conversation I had earlier today.
1
147
May 29

ALT Sigh Robert GIF

so now I can't use the word "workflow" in my prompt, without @claudeai automatically enabling dynamic workflows (costs a lot of tokens)
1
169
May 29
can you please call them claudeflows or something
88
spion retweeted
glad to know Mythos' safety concerns have been addressed right as Anthropic also secured tens of billions in inference compute 👍
JUST IN: Anthropic announces it will roll out Claude Mythos “in the coming weeks” despite growing fears over the model’s cyber capabilities.
52
258
7,114
407,287
spion retweeted
99.8% of bun’s pre-existing test suite passes on Linux x64 glibc in the rust rewrite
128
169
3,433
651,778
May 5
Is there any way to make Opus 4.7 better at revising text? Rather than actually editing the text in a way that addresses the feedback, it finds ways to describe the feedback in the text. Entire section is no longer needed? It will explain in that section "its no longer needed"
1
83
spion retweeted
Every time someone posts a take like this, the replies are full of naysayers explaining how this won't work because compilers are deterministic and LLMs aren't. But this has nothing to do with why you shouldn't treat an LLM like a compiler! In defense of poor old nondeterminism:
Interesting article on treating agent output like compiler output (and why) skiplabs.io/blog/codegen_as_…
10
7
28
3,912
spion retweeted
they call them crisps there
177
1,358
22,006
590,063
spion retweeted
Replying to @dbmikus
I agree, I think every agent framework should come with best-in-class sandboxing out of the box Currently, setting up a sandbox is mostly left as an exercise for the user, and doing that well is really difficult
5
2
44
3,038
spion retweeted
I feel sorry for Claude Code I know they're not the one. I'm not overcommitting - not investing too hard I wonder if they know I'm pulling away
207
22
1,498
656,084
spion retweeted
THIS GUY LOST $200 IN ONE DAY BECAUSE THE STRING "HERMES.md" WAS IN HIS GIT COMMITS HERMES.md is a real convention used in AI agent projects. it's a system prompt specification file. not some obscure edge case he's on claude max 20x at $200 a month. yesterday claude code hit him with "you're out of extra usage" out of nowhere his dashboard showed 13% weekly usage. 0% current session. 86% of his plan was sitting there untouched but $200.98 in extra usage already burned through what should have been covered by his subscription he tried logout & login, different models, fresh installs and nothing worked anthropic support sent the ai bot (four rounds of the same scripted response). eventually they just gave up on him so he started binary searching repos and commits manually on his own time until he found the trigger the string "HERMES.md" in a recent git commit message uppercase, with the .md extension, anywhere in your commit history that's it claude code includes recent commits in its system prompt and something server side flags HERMES.md and quietly routes you off your max plan onto API rate billing > AGENTS.md? fine > README.md? fine > HERMES without .md? fine > lowercase hermes.md? fine > uppercase HERMES.md? you're getting charged API rates he reported it. anthropic support acknowledged the bug three times, called it an "authentication routing issue", thanked him for finding it then refused to refund the $200 so the man pays $200 a month for max, lost another $200 to a billing bug they confirmed, did anthropic's QA work for free on his weekend, and got a "thank you for your patience" in return check your commit history before claude code quietly drains your account too
182
343
4,609
1,498,126
spion retweeted
This is a glimpse of big changes ahead of us. If you’re betting on big central models you should think twice. I run the exact same setup (M5 MacBook, qwen3.6-27B, pi, ollama) and while its not as fast or good as one of the big central models, it’s past the line of “cool demo” into “truly useful.” Kind of where the big frontier models were in late 2025. In ~24 months we might have local models that are fast and good enough for most tasks.
This is where we are right now. And i’m not gonna lie it feels pretty magical 🧚‍♀️ Qwen3.6 27B running inside of Pi coding agent via Llama.cpp on the MacBook Pro For non-trivial tasks on the @huggingface codebases, this feels very, very close to hitting the latest Opus in Claude Code, or whatever shiny monopolistic closed source API of the day is. In full airplane mode. Most people haven’t realized this yet. If you have, it means you have a huge headstart to what I call the second revolution of AI. Powerful local models for efficiency, security, privacy, sovereignty 🔥
40
54
902
109,413
spion retweeted
This is where we are right now. And i’m not gonna lie it feels pretty magical 🧚‍♀️ Qwen3.6 27B running inside of Pi coding agent via Llama.cpp on the MacBook Pro For non-trivial tasks on the @huggingface codebases, this feels very, very close to hitting the latest Opus in Claude Code, or whatever shiny monopolistic closed source API of the day is. In full airplane mode. Most people haven’t realized this yet. If you have, it means you have a huge headstart to what I call the second revolution of AI. Powerful local models for efficiency, security, privacy, sovereignty 🔥
263
454
5,283
656,017
Apr 23
its extremely satisfying to use qwen-3.6-35b-a3b with `pi` to diagnose and apply patches to llama.cpp's ROCm support, have it restart its own llama-server, and have it configure its own models.json
2
120
Apr 23
Unfortunately, it stopped its llama-server the second time round. The plan was to stop, edit something, then start again. Of course, it promptly stopped working after the stop command 😅
45