my spec-kit driven
#AI dev workflow gets ~ 15 weeks of work done / day / editor (project) - running 2 projects in parallel. I'm using 25% of my opus budget on the max 200 subscription per day π
- only for planning tasks.
My planner experiences:
Planners: /speckit.plan / task / clarify
claude-code - opus 4.1 : Awesome, relative quick, ~7% budget
claude-code - sonnet 4.5: Quick, good for smaller plans. Not as good as opus
opencode - chat gpt-5 pro: Super slow ~ 6h for a large project. ~ 30$. A bit better then opus 4.1. Use it for very large, very complex projects for initial planning.
opencode - grok 4: half garbage
Implementers: /speckit.implement
claude-code - sonnet 4.5: rust, nix, testdriven: senior dev, very little errors, very few interventions
claude-code - sonnet 4.5: python: annoying. static types are much better
claude-code - opus 4.1: gets hard debugging problems fixed
opencode - gemini 2.5 pro: At least, he gives up. Can't even implement test cases
opencode - grok 4: Read he writes good tests, but not for me he doesn't. Always <x:ai> garbage in the output, seems to dumb for MCP
Spec driven ai development
#specdev >>>>>>
#vibecoding
In a few weeks, I will unleash the most advanced AI toolkit ever created on the world.
Not another company, that steals your work - but a protocol on
#sui to share your agents and skills and experiences. The more people use your skill, the more you earn from the protocol.
If you think
#claude skills are good, they are only 1/5th of what is required for a real skill. I don't want my agents to write good code, I want perfect code. This can only be archived if everything is in place, MCP, tools, data, permissions, context, isolation...
AI worms are coming, and they come fast.
#glasworm should make you at least a bit scared.
The full isolation workflow and multi a step pipeline, ensures, no escape from the sandbox and every change is checked by a vigorous check step and agent will check the results for malicious code.