I teach a class where students code up an ML library from scratch in Python. Wenting showed me that a Claude Agent (with interactive unit test feedback and the spec) could solve it 100%. We thought it would be fun to scale this idea to every Python library in the world.
Introducing the commit0 interactive environment for coding agents. Challenge: generate Python libraries from scratch.
Commit0 is designed with interactivity, dependencies, and specifications as first-class considerations. We include a benchmark with 50 challenging libraries.