Have been extensively testing Claude Workflows this weekend, with the best model possible. Threw it at my whole code base, combing for bugs. 144 found and fixed! Geez... It is a large code base, for sure, but 144?!! Some are very impactful, some are downright embarrassing...
I keep predicting software quality will improve. I keep being wrong. Models write better-than-average code, yet we use them to write more code - not better code
(shoutout to the unmovable, always-on-top Claude Code download and install window).