Josh Boys

Josh Boys

233 Photos and videos

Tweets

Josh Boys

@aneki

Jun 6

Two biggest surprises for me model wise this week.. 1. I found myself using @grok Build as my CC, Codex and Copilot subs were all at limits. It blew me away... wasn't expecting it to be any good but for a greenfields project it was great. Large existing monorepo not so great.

105

Josh Boys

Josh Boys

@aneki

Jun 6

2. @MiniMax_AI M3 in @opencode is fucking fantastic. Not sure why specifically but it is really good at organising its thoughts thinking through things. Didn't use for any major coding but lots of organisation tasks and it was A

Josh Boys

Josh Boys

@aneki

Jun 6

Hey @thdxr MiniMax M3 is easily my favourite model in OpenCode and I've probably used most.

Josh Boys

Josh Boys

@aneki

May 14

Hey @JakeGinnivan look how much you influenced me.

Josh Boys

@aneki

May 12

If anyone is using @NxDevTools with a JS Rust monorepo and found the monodon plugin deficient (mainly because it isn't being updated) then this might help. Not a 1:1 parity yet but handles what I've needed. nxrust by @eddacraft_ai ⬇️

Josh Boys

Josh Boys

@aneki

May 12

Josh Boys

Josh Boys

@aneki

May 12

npmjs.com/package/@eddacraft…

Josh Boys

Josh Boys

@aneki

May 8

This is going to be so good

eddacraft @eddacraft_ai

May 8

AI agents probably wrote a meaningful share of the code that shipped to your repo this week. No one on your team can tell you which lines. No one can tell you which policies governed the agent. No one's going to find out until something breaks.

eddacraft

Josh Boys retweeted

eddacraft @eddacraft_ai

May 8

It looks like we are in the wild and people outside our initial group of Beta testers are starting to get their hands on anvil. If you would like to join the list ⬇️

Josh Boys

Josh Boys

@aneki

May 2

I saw this post by @davidfowl and I haven't seen anything so relatable in a long time. So here are my Five (actually 9) Stages of AI Dev Grief.

David Fowler

@davidfowl

May 1

I’m gearing up to build my own agent orchestration system. Are we all doing this now?? What stage of grief is this?

more replies

Josh Boys

Josh Boys

@aneki

May 2

8. Enlightenment (Temporary) “…wait. This is actually kind of clean.” 9. Acceptance (with a lingering itch) “Fine. I’ll just grab the pi-mono repo and build the purest thing I’ve touched in months.”

Josh Boys

Josh Boys

@aneki

May 2

The Final Unanswered Question. “…but seriously… why can’t I just use my Claude account?”

Josh Boys

Josh Boys

@aneki

Apr 24

It's sad when you get a Claude agent that seems like they are leagues above the rest then you need to run a compaction... doesn't matter what you do.. they are never the same

Josh Boys

Josh Boys

@aneki

Apr 18

Great work @QuixiAI and the team at LazarusAI this is really well done!

Eric Hartford

@QuixiAI

Apr 17

Last week, Anthropic announced Project Glasswing alongside Claude Mythos Preview, a model they described as so powerful at finding vulnerabilities they couldn't release it. The announcement featured AWS, Microsoft, Google, and Apple as partners, $100M in compute credits, and a clear message: this is dangerous, and only we can be trusted to deploy it safely. The results were real. Thousands of zero-days across every major OS and browser. A 27-year-old bug in OpenBSD. A 16-year-old bug in FFmpeg. Fully autonomous exploit chains that would have taken human researchers weeks. But here's what bothered me: all the credit went to the model. Read the technical blog carefully and a different picture emerges. The real innovation isn't the model. It's the workflow: - Rank every file in a codebase by attack surface - Fan out hundreds of parallel agents, each scoped to one file - Use crash oracles (AddressSanitizer, UBSan) as ground truth - Run a second verification agent to filter noise - Generate exploits as a triage mechanism for severity That's a pipeline. And pipelines are model-agnostic. At Lazarus AI, we spend our days deploying custom AI in places where "just use the closed API" isn't an option: regulated industries, enterprise, and government. When I saw Glasswing, my instinct was the same one I have every week: strip out the proprietary model, keep the architecture, run it on whatever model is best for the customer. Clearwing is a fully open-source vulnerability discovery engine. Crash-first hunting, file-parallel agents, oracle-driven verification, variant hunting, adversarial verification. Works with any LLM. I tested it with OpenAI Codex 5.4 and reproduced Glasswing's findings. I'm now reproducing results with our own ReAligned model - Qwen3.5 finetuned to Western alignment. Mythos is certainly a great model. The N-day exploit walkthroughs in Anthropic's blog show real reasoning depth. But it's an incremental improvement over Opus, the same way Opus was over Sonnet, and Sonnet over Haiku. It's not a leap to superintelligence. It's the next point on a curve we've been watching for years. What actually changed the game was the workflow. Defenders shouldn't have to wait for access to a gated model to secure their software. These vulnerabilities have been sitting in codebases for decades. The tools to find them should be available to everyone: the open source maintainer running FFmpeg on a Saturday, the startup that can't afford $125/M output tokens, the researcher in a country where Anthropic doesn't operate. Clearwing is MIT licensed and available now. github.com/Lazarus-AI/clearw… Clearwing enables a wide variety of security activities. Handle with care. It is sharp.

Josh Boys

Josh Boys

@aneki

Apr 9

This is a part of a set of things I am publishing at the request of some mates. Some of the bits I've been using for months and others are a bit newer.

Josh Boys

@aneki

Apr 8

As we know by now adding in brevity constraints can save a bunch of tokens... some of the viral options out there seem cool but aren't super useful in a serious context. I have SITREP which is based on something I've been using for a while to condense output.

Josh Boys

Josh Boys

@aneki

Apr 8

Josh Boys

Josh Boys

@aneki

Apr 8

github.com/joshuaboys/SITREP

GitHub - joshuaboys/SITREP: SITREP mode skill for coding agents — terse structured output across...

SITREP mode skill for coding agents — terse structured output across brief/sitrep/flash intensity levels - joshuaboys/SITREP

github.com