Everyone always latches onto posts like this and goes “see look, relying on ai code bad!”
I’m not going to claim my shit is fully clean, or that I’ve hit any runaway success metrics yet..
But maple is a team of 2 and I’ve legitimately not written a single line of code in over 18 months. We’re in a healthy growing position with an open model consumer AI app I’m proud to say both of my parents even use as their main AI.
I don’t let ai agents run unchecked. I steer heavily and throw away broken experiments. I haven’t automated away everything I possibly could. I only implement and allow in features I can conceptually hold in my head AND know will not be a net negative support / maintenance burden.
This is, at surface level, at the cost of many missing “common features” and yes I’m fully aware of most bugs and I intentionally live with them as non-deal breakers. I live with the bugs instead of spinning up agent after agent fixing them as they create more like wack a mole.
I value my work life balance and live in a place where I’m not spending any time working on critical bugs or support.
Everyone’s mileage may vary, each app / domain is going to be different, skill level is all over the place as we figure out what even is the skill we all need now. Each decision we make with AI has hidden costs.
But damn, am I not extremely happy and excited for this new era of never writing any code again.
And no, I won’t allow any future bug or downtime take away for the last 18 months of this new glorious era. Knock on wood, but we could go bust tomorrow due to some “mistake caused by ai” and I’d be proud of what we’ve been able to accomplish and how I’ve lived my life.
Because the 18 months of my company’s existence before AI didn’t result in any happy end goal anyways. It went completely bust, as most things do AI or not.
Status update: I've been on/off AI agents in the last few days and it is a verifiable truth that every day I didn't use agents, I was more productive. I still attribute that to how slow they are, and my own inability to multi-task efficiently. The magic is there but the slowness doesn't let it cross the threshold where they actually make me faster, and I still dislike the whole thinking paradigm.
About Bend2: honestly, the C/Metal compiler codebase is a clusterfuck right now. I regret letting AI agents write it. All tests pass, and GPU performance is mind-blowing, so the core architecture works. Yet, it has a LOT of bugs. Anything not covered by the tests is a coin toss. This is actually impressive, because, in many parts of the codebase, the right solution was actually the simplest one, yet, the agents STILL managed to find a way to make it work just for the tests. The level of reward hack these agents output is actually impressive I can't even be mad.
It is also ironical because that's the very problem that Bend's proof system was supposed to solve, but Bend is in TypeScript, not in Bend. I'm disappointed I didn't write Bend in itself, and now I feel an immense urge to do so. But the clock is ticking . . .
Still, I do not think Bend is worth launching without the GPU compiler being solid, because the closest competitor, Lean, is actually extremely good, so we need a big differential. Yet, due to the very nature of the project, it would be embarrassing to have bugs at launch.
Regarding AI, I now believe using current gen AI agents in production codebase is harmful and a massive mistake. That doesn't mean no agents at all, but agents work best when they don't touch critical code. Debugging, researching, providing insights, scripts / tools, or anything that doesn't touch code you will maintain in the long term. But if you merge AI code without reading, you're going to have a bad time. Speaking from experience
I'm working 10h/day on SupGen and the remaining time on Bend2