Sounds incredible until you read the fine print. The compiler generates less efficient code than GCC with all optimizations disabled. It doesnโt have its own assembler or linker. It canโt produce a 16-bit x86 code generator. And Carlini himself says it has โnearly reached the limits of Opusโs abilities.โ New features and bugfixes kept breaking existing functionality.
So what did $20,000 and two weeks actually buy? A compiler that passes 99% of GCCโs torture tests but canโt match the output quality of a tool thatโs had 37 years of human engineering. Thatโs the constraint nobodyโs pricing in.
The real story is in the cost curve, not the capability demo. $20,000 for 100,000 lines means $0.20 per line of generated code. A senior compiler engineer costs roughly $150/hour. At maybe 50 polished lines per hour for something this complex, thatโs $3/line. AI just did it at 15x cheaper, and it will only get cheaper from here.
But the code isnโt equivalent. The AI version needs a human to finish the assembler, fix the linker, optimize the output, and prevent regressions. Those are the hardest 20% of the problem, and they represent 80% of the engineering value. Anthropic built the demo. Shipping the product still requires humans.
This tells you exactly where we are in the autonomous software timeline. AI can now produce impressive first drafts of complex systems at trivial cost. Turning those drafts into production software still requires the judgment that costs $300K per year in compiler engineer salary. The gap between โcompiles the Linux kernelโ and โreplaces GCCโ is measured in decades of accumulated engineering wisdom that no model has internalized yet.
The companies that understand this will use agent teams to generate the 80% and hire engineers to finish the 20%. The companies that donโt will ship $20,000 compilers that produce slower code than a free tool from 1987.
New Engineering blog: We tasked Opus 4.6 using agent teams to build a C compiler. Then we (mostly) walked away. Two weeks later, it worked on the Linux kernel.
Here's what it taught us about the future of autonomous software development.
Read more:
anthropic.com/engineering/buโฆ