Paul Stack

Paul Stack

1,337 Photos and videos

Tweets

Paul Stack

@stack72

Jun 11

I spent years as the person who got paged at 2am. Most incident investigations follow a pattern, and the hard part was never any individual step. It's maintaining context across all of them without skipping something. Agents don't replace the judgment, they execute the investigation without losing context halfway through. The bottleneck moves from execution to judgment, and that's where SREs become more valuable, not less. stack72.dev/sres-dont-need-r…

1,493

Paul Stack

Paul Stack

@stack72

Jun 9

If you require me to login to Calendly using X and it has these permissions, then that's a BIG no from my side:

198

Paul Stack

Paul Stack

@stack72

Jun 9

I've spent the last two months talking about building the machine that builds the machine. The idea isn't complicated: stop treating AI as a better interface for humans to do work, and start building systems that can create, modify, operate, and improve other systems. For infrastructure, that means the goal isn't an AI that writes Terraform faster. It's a system that can understand infrastructure, reason about it, change it safely, and get better at doing that every time it runs. Last week at Microsoft Build, @steipete gave a talk called "Build the Thing That Builds the Thing." - lnkd.in/eGPE3VUZ That title stood out because it describes exactly how I think about AI. Most people spend their time prompting agents. I spend my time building the systems around the agents. The goal isn't to become better at prompting. The goal is to build a machine that can reliably produce outcomes, improve itself, and do more of the work without requiring constant human intervention. We're not heading toward a world where humans sit in front of chat windows all day. We're heading toward a world where we build machines that build machines. The biggest opportunity isn't replacing individual tasks. It's creating systems that continuously generate, operate, and improve the next layer of systems above them. That's what I've been writing about throughout this blog series (stack72.dev/the-lifecycle-of…), and it's what we've been building at swamp-club.com for months already. It feels like the industry is finally converging on the same idea and I am excited about it!

886

Paul Stack

Paul Stack

@stack72

Jun 8

Every IaC tool I've worked on promised "any system." I helped make that promise. Provider ecosystems were built for humans who fill in the gaps. Agents can't so if a capability isn't in a schema, an agent has to infer it from prose. The challenge for me is, were provider ecosystems designed for the consumers that matter in an agentic world? New post on why I don't think so. stack72.dev/every-platform-p…

636

Paul Stack

Paul Stack

@stack72

Jun 7

so I can't unsubscribe from @Railway emails unless I log into their app :/ That feels an unnecessary step

7,799

Paul Stack

Paul Stack

@stack72

Jun 4

So @currys is a perfect example of how AI can be integrated incredibly poorly into your system. An automated response to my tweet meant that I messaged them back and now I have another AI but trying to interact with me via DM.

276

Paul Stack

Paul Stack

@stack72

Jun 2

Every production environment has someone who can't go on holiday without their phone buzzing. Not because the team is bad. Because everything that person learned from incidents over the years never made it anywhere except their head. That's not a people problem. It's an encoding problem. New post on why this is the first thing to solve before AI agents go anywhere near your infrastructure. stack72.dev/the-first-step-o…

The First Step on Your AI Journey Is Encoding What You Already Know

Every company has this person. The engineer who every production change flows through because they're the only one who knows the constraints. If you're thinking about putting AI agents anywhere near...

stack72.dev

202

Paul Stack

Paul Stack

@stack72

May 28

The sovereign cloud movement has an $80B budget but the the cost to switch is the real blocker to make it happen. Your automation is coupled to your provider. Moving means rewriting everything for zero new functionality. We started replacing our GitHub usage with Forgejo and in a couple of days it was working (including actions replacement). Two years ago that's a quarter-long project. The implementation was straightforward. Being precise about what we actually needed was the real work. stack72.dev/intent-is-archit…

Intent Is The New Architecture

The sovereign cloud movement is forcing a question the industry has been avoiding. When you have to move off a hyperscaler, the bottleneck isn't the new provider's APIs. It's how precisely you can...

stack72.dev

303

Paul Stack

Paul Stack

@stack72

May 26

Getting agents to generate IaC code faster isn't the answer. It's still probabilistic output driving your infrastructure. The agent doesn't know your conventions or your breaking changes. You won't know how well it guessed until apply time — or until you have to review and debug it. What if agents worked against typed schemas instead? No code generation. Deterministic execution. Same inputs, same result, every run. stack72.dev/deterministic-au…

Deterministic Automation for a Probabilistic System

AI agents are probabilistic. The same prompt doesn't always produce the same output. That's not a problem to avoid. It's a problem to engineer around. Typed schemas, validated execution, determinis...

stack72.dev

660

Paul Stack

Paul Stack

@stack72

May 19

A lot of the stories about AI agents failing come down to two things: * no trust (gates everywhere, agents wait for permission) * chaos (everyone ships, nobody coordinates). The fix isn't better tools. It's building the systems that make trust safe. stack72.dev/high-trust-teams…

High Trust Teams With Agents Are Unstoppable

High trust doesn't mean no guardrails. It means everyone understands the direction well enough to act independently within it. When you pair that with the right systems, every person on the team can...

stack72.dev

420

Paul Stack

Paul Stack

@stack72

May 14

I'm not sure why but @claudeai has starting asking me for permission to use a skill... that seems new the past couple of days - anyone seen that? I am going to try adding to settings.json to bypass it but it feels a UX step backward

211

Paul Stack

Paul Stack

@stack72

May 14

Bug reports that looked like noise turned out to be three compounding architectural problems. In a traditional team, you patch and move on. We had agents, so we rearchitected the whole thing. Six parallel workstreams, eight days, landing to main the whole time. If it hadn't worked? Eight days lost, not six months. stack72.dev/the-rearchitectu…

The Rearchitecture Your Team Could Never Justify

Every codebase has the rearchitecture that never gets prioritised. We split ours into six parallel workstreams and shipped it in eight days, start to finish. The hard part wasn't speed. It was...

stack72.dev

1,311

Paul Stack

Paul Stack

@stack72

May 11

Every team gets told to ship faster. When agents make that trivially easy, speed stops being the problem and direction does. Planning as we know it is meaningless. Your PM is about to have an existential crisis. The hardest skill now is taste. And that's not something an agent can help you with. stack72.dev/when-building-is…

886

Paul Stack

Paul Stack

@stack72

May 6

I've been writing about how AI agents changed my work. Stopped typing code, started making better decisions, how we are building swamp with agents doing all the implementation. The fear is real. The shift is real. The difference is structure. stack72.dev/take-the-leap-be…

Take the Leap Before You Are Pushed

The fear of AI replacing your work is real. So is the shift. The teams that build structure around agents instead of resisting or rushing find the job gets better, not smaller.

stack72.dev

677

Paul Stack

Paul Stack

@stack72

May 4

I haven't written code in months (probably for the best). Five of us build swamp, none of us write code manually. The job got bigger, not smaller. The code was never the job — the understanding was. New post up: stack72.dev/the-code-was-nev…

The Code Was Never the Job

Five of us build swamp. None of us write code. Agents write it, review it, test the binary. The work didn't shrink — it shifted to the decisions that were always the most valuable.

stack72.dev

166

Paul Stack

Paul Stack

@stack72

Apr 27

An agent can open a PR in thirty seconds that takes an hour to review. Multiply that by every repo with an issue tracker and you have the current state of open source. We banned external code contributions and here's why. stack72.dev/the-community-pu…

The Community Pull Request Is Dead

The PR-as-contribution model was built for a world where writing code was the bottleneck. AI agents broke that assumption. GitHub shipped a kill switch on PRs. Hashimoto built Vouch. We closed the...

stack72.dev

161

Paul Stack

Paul Stack

@stack72

Apr 23

I sorted every agent tool into two buckets: can git checkout undo it, or does it touch something external? Eliminated 90% of permission prompts. Kept the ones that matter like gh pr create. stack72.dev/agent-trust-is-a…

Agent Trust Is a System Design Problem

Most people hit "always allow" within the first hour. Agent trust is a system design problem, not a containment problem. Three layers of permissions, conventions, and context let the agent work...

stack72.dev

244

Paul Stack

Paul Stack

@stack72

Apr 21

4,236 unit tests passed yet the binary was broken. A separate UAT suite running against the real compiled artifact caught it in four minutes. Three recent bugs, same pattern: all unit tests green, bug at an integration seam, no user ever saw it. stack72.dev/the-gate-between…

The Gate Between Our Agent Code and Our Users

Every release binary gets dispatched to a separate repo running CLI tests, adversarial security tests, and benchmarks. Three recent bugs — all with full unit test suites passing — never reached a...

stack72.dev

929

Paul Stack

Paul Stack

@stack72

Apr 15

Agents open PRs in our repo. So the diff is untrusted input flowing into models with merge authority. That's a recipe for disaster Multiple AI reviews, each scoped to a different concern. Minimal tool sets and a security review that audits changes to the CI pipeline itself — because the reviewers are defined in workflow files too. stack72.dev/anatomy-of-a-swa…

Anatomy of a Swamp PR

Agents open PRs in our repo. Other agents review them. That means untrusted input flowing through models with merge authority — so we built a pipeline with four independent AI reviews, scoped tools,...

stack72.dev

122

Paul Stack

Paul Stack

@stack72

Apr 13

Vibe coding works great for ten PRs*. Then you do it two hundred more times and the codebase quietly stops making sense. The fix isn't better prompts. It's giving agents the same structural context a senior engineer carries in their head. Build the machine. The vibes don't scale. stack72.dev/the-vibes-dont-s… * The number may vary depending on the weather and the timezone ;)

The Vibes Don't Scale

Vibe coding works great for the first few PRs. Then you do it two hundred more times and the codebase quietly stops making sense. The fix isn't better prompts — it's building a machine that compounds.

stack72.dev

324