A couple of observations after months of hands-on tinkering with OpenClaw, Hermes, and Nanobot.
We are not there yet.
None of the frontier models have reached a point where they can see the whole picture in a real world company.
Iโve repeatedly tried to use a single control board for planning, building, and consuming tasks.
This works at small scale with a few tasks, skills, and cron jobs but falls apart after about 10 unrelated tasks. The harness and model lose coherency, intermingling instructions from unrelated skills, running code built for other tasks, and creating an environment mess that takes nontrivial effort to unwind.
Harness upgrades continue to be a major friction point. Each upgrade leads to hours of debugging broken configs, isolation, access levels, repo states, and corrupt backup scripts.
Agent-to-agent communication is far from working. We are several iterations from establishing meaningful, reliable, and smooth protocols. At this stage, agents negotiating with each other remains an art that quickly gets out of hand and needs frequent supervision.
The good news is there may be a practical middle ground.
Code-building harness tools are improving at handling longer, more complex tasks and are tuned for bigger projects.
@Grok Build is catching up fast to Claude Code and Codex.
The pattern that works in practice at this early stage of the agentic AI shift is as follows:
Agentic AI Front End:
- Consumer harness for the billions of non-builder users: memory and lightweight skills capturing personal preferences, focused on personal UX style and taste.
- Analogous to how consumers pick food, clothes, and furniture: end users donโt make these things but have strong preferences.
Agentic AI Backend:
- Reliable, battle-tested MCP servers with utility-grade reliability and uptime.
- About one in a thousand people are builders. Millions of builders (versus billions of users) need professional tools.
- The agentic AI backend needs to be utility-grade, predictable, and reliable like water, electricity, roads, internet, or blockchain.
- Consumers donโt care how backend services are built but get upset when utilities are unavailable or subpar.
Overall recommendation for founders:
- Focus on building agentic-friendly commercial MCP services using solid code-building tools.
- Donโt spend too much time on human-facing monolithic web and mobile apps.
- Publish a few example front-end skills that end users can feed to their favorite chatbots, understand, customize, and add as lightweight SKILL Connector packages. Let the chatbot render UX in the userโs preferred style.