Here’s where OB-1 is going:
– Auto-generates evals from past PRs, then climbs them with custom models
– Builds its own skills, hooks, and rules from a codebase and session history
– Background agents in safe sandboxes that keep working while you context-switch
– Session sharing and forking: redefining version control around prompts, instead of source code
– Lives where you already work: Slack, Linear, GitHub, Graphite
– PM mode so it never runs out of ideas