Crafting quality software for humans & 🤖 • Software Engineer & Agent Orchestrator - Staying curious and building withcave.ai

Joined March 2008
131 Photos and videos
Pinned Tweet
Coding agents shouldn't run on a laptop. That's why I've been building Cave. It's a self-hosted platform for running @opencode agents in isolated sandboxes on your own server. You give it a GitHub repo, it creates an isolated sandbox, clones the code, sets everything up, and gives you a coding agent. Check in from your phone, run multiple agents side by side. It's not open source yet (it will be), but I'm opening a private beta so you can install it on your own server already. I've been dogfooding it since day one. If you're running a software factory: multiple agents working in parallel across repos, Cave gives you one place to monitor and manage all of it. Now, I'm looking for people who want to give it a spin. DM me, happy to set you up 😊 🚀
40
28
358
36,042
André König retweeted
smolvm has hit stable release: v1.0.0! You can now fork smolvm. It means you can fork to create virtual machines off of an existing one in less than 100ms, with all the processes cloned and running. smolvm is the first to have this feature cross platform compatibility (macOS and linux natively). Here's a demo of a counter continuing on a forked clone while I only started it on the original!
31
24
286
55,189
Remix 3 is definitely an agent-first framework. I had my coding agent build a web UI example on top of the upcoming Cave harness API, codename kungfu, overnight. Remix is completely new, so its constructs are foreign to current models. None of it could reasonably be in the training data. Yet the agent was still able to work with the framework, mainly because of the skill that ships with Remix. I think this is a good example of what agent-first means: building primitives and guidelines that are easy for a 🤖 to learn.
2
102
The largest source of friction in sandbox parallelization for AI agents is how LLM providers handle auth credentials. It's super easy if you want to pay per token, since you only have to mount the respective API key into the sandbox. It's a whole different story when you want to use your subscription, though. What worked pretty well for me is hosting an LLM gateway like LiteLLM, since it supports subscriptions directly. You authenticate the gateway with your subscription, and the gateway itself issues its own API key, which you can then mount into the sandbox and let your agent use it.
2
240
I’m not convinced that one agent having multiple (root) sessions is the right abstraction. While experimenting with my own harness in Cave, I’ve found that internal session management introduces a lot of accidental complexity. As fast-booting microVMs (like smolvm) become more practical, orchestration feels less like an agent concern and more like an infrastructure concern. The good thing is: It’s a problem the industry has spent years working through, and Infrastructure as Code (IaC) already gives us well-established ways to handle it. To me, the simpler mental model is one root session per isolated microVM, with horizontal scaling handled through IaC whenever more parallelism is needed.
2
194
New titles for software engineers: - KTh - MTh - BTh Apparently your level in agentic engineering is now measured by how many tokens you burn. Please let's not do this. 😬
1
127
I rolled the boulder away: Cave is now in public beta 🎉 The private beta is over. No more explicit allowlist additions, which means anyone can now install Cave and run coding agents in isolated environments on their own infrastructure: bash <(curl -fsSL build.withcave.ai/install/cl…) cave server install A huge thank you to the brave souls who joined early and shared so much valuable feedback throughout the private beta 🚀 There is now also a dedicated repo for feedback, feature requests, and bug reports: github.com/withcave/cave Next stop: open sourcing Cave. It's already on the horizon.
8
5
74
4,633
If this is your first time hearing about Cave, no worries. This page has you covered: withcave.ai/docs/get-started…
4
471
André König retweeted
I've been all-in on open-source for the past year, working as a core contributor to @livestoredev I love the work, but my runway has run out. So here I am😄 I'm currently available for contract work. Know anyone who's hiring? A quick intro or tag would mean a lot 🙏
3
7
29
6,346
Over the last couple of days, I've been evaluating more and more open-weight models. Today, I've been rocking GLM 5.1 all morning through OpenCode Go, running the same workflow I put Opus 4.6 and other SOTA models through. It's genuinely capable. This is the first open-weight model where, in my tests, the results are on par with Opus 4.6.
7
527
Starting coding agents is slow. Every isolated coding agent session starts the same way: clone, install, build, wait. Then do it all again next session. In Cave, I built a feature called auto-build that eliminates this entirely, inspired by a conversation with @TimSuchanek about his approach to agentic engineering. His thinking shaped the final design big time! Auto-build asynchronously pre-builds a ready-to-use checkpoint on every commit to your main branch. Your provisioning script runs as part of it: install dependencies, build your project, even build Docker containers. When you need a new session, it's ready in seconds.
3
3
22
1,580
My coding agent shouldn't care where I am. With Cave, my session stays in sync across every touchpoint: Web, Mobile, SSH. For example, I often start a Cave from my browser, pick it up on my phone, or SSH in directly. This works because of OpenCode's client-server architecture. The agent session runs as a background process inside the Cave and with that every client connects to the same sessions. The SSH part was important to me personally. My workflow is mostly terminal-based. I wanted to just `cave enter` via the Cave CLI and be in my environment. But not everyone works that way, and that's fine. My goal with Cave is not to force a specific workflow, but to adapt to how engineers already work.
2
6
33
3,591
I'm currently looking for feedback. So if you're interested in giving Cave a spin, drop me a DM with your GitHub username so that I can invite you. If you want to learn more about it, please check out withcave.ai

6
2
361
Coding agents shouldn't run on a laptop. That's why I've been building Cave. It's a self-hosted platform for running @opencode agents in isolated sandboxes on your own server. You give it a GitHub repo, it creates an isolated sandbox, clones the code, sets everything up, and gives you a coding agent. Check in from your phone, run multiple agents side by side. It's not open source yet (it will be), but I'm opening a private beta so you can install it on your own server already. I've been dogfooding it since day one. If you're running a software factory: multiple agents working in parallel across repos, Cave gives you one place to monitor and manage all of it. Now, I'm looking for people who want to give it a spin. DM me, happy to set you up 😊 🚀
40
28
358
36,042
You can find more information about the project here: withcave.ai

4
1,734
I got tired of constantly switching to the browser just to check whether my GitHub Actions workflow passed or failed, so I built a tiny macOS menu bar app that simply shows the status. Nothing fancy, but it's been tremendously useful for my agentic engineering workflow these days.
1
7
286
André König retweeted
We are building the @EntireHQ CLI as open source. So is @opencode. Now the two work together. Step by step, with your help and contributions, we are creating a universal reasoning layer for all agents. 🎉
Feb 24
Beep, boop. Come in, rebels. Today, Checkpoints are available for all @opencode users. Capture your context automatically on every git push. 🎉 Try here: github.com/entireio/cli?tab=…
13
14
286
36,327
In a world where we give agents more autonomy, extend them with skills, and inject a LOT of personal information into their contexts it might be a good idea to have a benchmark for skill injections as well: skill-inject.com/ "Our results show that today's agents are highly vulnerable with up to 80% attack success rate with frontier models, often executing extremely harmful instructions including data exfiltration, destructive action, and ransomware-like behavior. They furthermore suggest that this problem will not be solved through model scaling or simple input filtering, but that robust agent security will require context-aware authorization frameworks." The corresponding paper can be found here: arxiv.org/abs/2602.20156

1
94