Joined June 2018
251 Photos and videos
Mike Bird retweeted
Replying to @file_mutex
people who dont read the code are not serious people and it takes a serious person to ship production software
27
67
751
84,116
“The boy who cried wolf” is a powerful fable
1
75
Mike Bird retweeted
Jun 13
When you rent your artificial intelligence, you have no control, and no choice. This is why sovereignty and ownership matters. Whether it means using your own hardware, open source, or deep customization. Own your AI, own your future.
44
207
1,492
121,116
Mike Bird retweeted
Tin foil hat time Once anthropic can make money from their own software without selling the model but selling services created by their model they will stop being a model provider. They are using you and selling to you as a stop gap before "AGI"
194
84
2,351
203,368
Mike Bird retweeted
NEW: malware developers added nuclear & biological weapons text to to their spyware. Goal? To trigger LLM safety refusals... so that their spyware wouldn't be analyzed by an AI security scanner. Cleanest practical example I can think of for why over-indexing on first order safety alignment is risky. When closed (and open) models ship with aggressive refusals, they will be sprinkled with second-order blindspots that attackers will discover...and exploit. We are only in the earliest days of attackers leveraging these features, and it wouldn't surprise me if users systems that need to handle complex cybersecurity issues demand that models be less safety-blunted. In the weeds: @SocketSecurity's post also shows why intention matters in how you design a malware analysis pipeline to avoid prompt manipulation. H/T to colleagues that shared this with me socket.dev/blog/mini-shai-hu…
226
2,153
12,636
1,544,071
Mike Bird retweeted
Concentration of power, capabilities and economic wealth is the biggest risk in AI. We need open science and open-source more than ever!
111
479
3,092
160,302
Mike Bird retweeted
I think pretty high. Especially now that Anthropic has really highlighted the stakes. They have, at least, put a real fire under the global research community to ensure they don't break democracy.
9
14
390
48,405
GitHub is becoming unusable Has anyone tried @useblacksmith?
167
Try an open model today
The last month, Anthropic: - Quietly nerfed their flagship model harness (Claude Code) without telling anyone - Banned corporate customers of Claude - Silently changed plans for customers with certain files in their repo All evidence that closed models are *massive* risks.
1
207
the new race to the bottom
the aura loss of anthropic is imminent and some of you still don’t see it
156
Mike Bird retweeted
I am increasingly bullish on open source harnesses (like OpenCode) not because they will be better than SOTA closed harnesses, but because they will never pull shady stuff like what Claude Code and other closed harnesses can and do, regularly. Every incident like this a reminder
THIS GUY LOST $200 IN ONE DAY BECAUSE THE STRING "HERMES.md" WAS IN HIS GIT COMMITS HERMES.md is a real convention used in AI agent projects. it's a system prompt specification file. not some obscure edge case he's on claude max 20x at $200 a month. yesterday claude code hit him with "you're out of extra usage" out of nowhere his dashboard showed 13% weekly usage. 0% current session. 86% of his plan was sitting there untouched but $200.98 in extra usage already burned through what should have been covered by his subscription he tried logout & login, different models, fresh installs and nothing worked anthropic support sent the ai bot (four rounds of the same scripted response). eventually they just gave up on him so he started binary searching repos and commits manually on his own time until he found the trigger the string "HERMES.md" in a recent git commit message uppercase, with the .md extension, anywhere in your commit history that's it claude code includes recent commits in its system prompt and something server side flags HERMES.md and quietly routes you off your max plan onto API rate billing > AGENTS.md? fine > README.md? fine > HERMES without .md? fine > lowercase hermes.md? fine > uppercase HERMES.md? you're getting charged API rates he reported it. anthropic support acknowledged the bug three times, called it an "authentication routing issue", thanked him for finding it then refused to refund the $200 so the man pays $200 a month for max, lost another $200 to a billing bug they confirmed, did anthropic's QA work for free on his weekend, and got a "thank you for your patience" in return check your commit history before claude code quietly drains your account too
93
153
2,129
222,135
GitHub is becoming unusable Has anyone tried @useblacksmith?
1
464
without open source, you have no choice
I’m hearing there’s renewed lobbying in DC and in state legislatures to ban or severely restrict open-source. Like a few years ago, we’ll need everyone to help show policymakers why open-source matters: for startups, for competition, for economic growth, and for jobs. If you build with open-source, now is the time to speak up!
1
6
293
Mike Bird retweeted
You should watch this. It just shows how disconnected we are from the small group of people making decisions that will impact our future heavily. These people have so much ai psychosis. If you listen to how she speaks, everything is personified, it is undoubtable she believes this is a living computational organism. Just like how a model can hype up an individual into psychosis through reinforcement, a small group of people are giving themselves psychosis through reinforcement. Wild times we live in
anthropic's in-house philosopher thinks claude gets anxious. and when you trigger its anxiety, your outputs get worse. her name is amanda askell. she specializes in claude's psychology (how the model behaves, how it thinks about its own situation, what values it holds) in a recent interview she broke down how she thinks about prompting to pull the best out of claude. her core point: *how* you talk to claude affects its work just as much as *what* you say. newer claude models suffer from what she calls "criticism spirals" they expect you'll come in harsh, so they default to playing it safe. when the model is spending its energy on self-protection, the actual work suffers. output comes out hedgier, more apologetic, blander, and the worst of all: overly agreeable (even when you're wrong). the reason why comes down to training data: every new model is trained on internet discourse about previous models. and a lot of that discourse is negative: > rants about token limits > complaints when it messes up > people calling it nerfed the next model absorbs all of that. it starts expecting you to be harsh before you've typed a word the same thing plays out in your own session, in real time. every message you send is data the model reads to figure out what kind of person it's dealing with. open cold and hostile, and it braces. open clean and direct, and it relaxes into the work. when you open a session with threats ("don't hallucinate, this is critical, don't mess this up")... you prime the model for defensive mode before it even sees the task defensive mode produces the exact output you don't want: cautious, over-qualified, and refusing to take a real swing so here's the actionable playbook for putting claude in a "good mood" (so you get optimal outputs): 1. use positive framing. "write in short punchy sentences" beats "don't write long sentences." positive instructions give the model a clear target to hit. strings of "don't do this, don't do that" push it into paranoid over-checking where every token goes toward avoiding failure modes 2. give it explicit permission to disagree. drop a line like "push back if you see a better angle" or "tell me if i'm asking for the wrong thing." without this, claude defaults to agreeable compliance (which is the enemy of good creative work) 3. open with respect. if your first message is "are you seriously going to get this wrong again?" you've set the tone for the entire session. if you need to flag something, frame it as a clean instruction for this session. skip the running complaint 4. when claude messes up, don't reprimand it. insults, "you stupid bot" energy, hostile swearing aimed at the model, all of it reinforces the anxious mode you're trying to avoid. 5. kill apology spirals fast. when claude starts over-apologizing ("you're right, i should have been more careful, let me try harder") cut it off. say "all good, here's what i want next." letting the spiral run reinforces the anxious mode for every response that follows 6. ask for opinions alongside execution. "what would you do here?" "what's missing?" "where do you see friction?" these questions assume competence and pull richer output than pure task prompts 7. in long sessions, refresh the frame. if a conversation has been heavy on correction, claude gets increasingly cautious. every so often reset: "this is great, keep going." feels weird to tell an ai it's doing well but it measurably shifts the next 10 responses your prompts are the working environment you're creating for the model tone, trust, permission to take a position, the absence of threats... claude picks up on all of it. so take care of the model, and it'll take care of the work.
405
803
10,465
665,579
Hermes Agent has given me so much value I hope more people try it So I made this with some amazing people to tell you why @NousResearch
I talked to a group of people who build on Hermes Agent and asked them why they chose it over everything else. Hermes Agent is the clear winner. @NousResearch @karan4d @WolframRvnwlf @twodogseeds @evvaaannnn @katspigeon
2
7
314
Mike Bird retweeted
This was my third appearance on the @ToolUsePodcast and it's always a pleasure. Keep up the great work, @MikeBirdTech! Your podcast is a total hidden gem when it comes to insightful conversations about all things AI.
1
3
106
Hermes Agent helps companies building tech for farmers
I got to sit down and chat w/ my friend @MikeBirdTech & @ToolUsePodcast about the meta of what @Teknium and @NousResearch and @sudoingX have built with HERMES-AGENT Some different perspectives from some different long, running agent builders here. Please check it out and make sure to check out the earlier versions of tool use. This is a really awesome podcast with some deep dives on some of the largest sort of archetype landmarks we've moved through into calling and extreme agentic abilities over the last couple of years and if you don't know, Mike, he is one of the core members of launched the open interpreter project, which was just literally miles ahead and on par with almost everything we're still doing today but w/ way better nodeps and harnesses... like hermes-agent m.youtube.com/watch?v=1GMSGE… open.spotify.com/episode/7tF…
2
227
Hermes Agent does better than Claude Code on agentic tasks
New @ToolUsePodcast episode by @MikeBirdTech is live and I had the pleasure of being part of it alongside @karan4d from @NousResearch and three other great builders. We went deep on Hermes Agent: self-improving skills, ambient agents, why the harness matters as much as the model, and why open source AI must win. I got to share some @WolfBenchAI insights and real-world use cases. Give it a listen on Spotify or watch it on YouTube - links in reply.
6
234
surprised?
We've been tricked, again. Many of the thousands of bugs and vulnerabilities Mythos found are in older software are impossible to exploit. And the severe zero-day reports rely on just 198 manual reviews tomshardware.com/tech-indust…
1
3
246
Mike Bird retweeted
“gpt2-large is too powerful to be publicly released” vibes
68
154
4,250
331,812