People laugh at me and call me dumb because I built my own local AI cluster instead of paying for cloud APIs.
Maybe I am.
But not dumb enough to rent forever what I can own.
If you are interested in my view, please read on (below).
Before you call someone a dumbass for building their own cluster, ask yourself three questions:
Do you own your tools or rent them? Do you control your data or someone else? If your provider goes down tomorrow, do you have a backup plan or just a subscription?
Think before you talk. Build before you judge.
I have 20 years of experience in electronic equipment diagnostics, repair, calibration, and fault-finding in a highly regulated environment within one of the most respected companies in the UK and worldwide. I'm proud of this and of the hardworking, clear-headed people around me.
When your job is finding one failed component on a board with hundreds, you stop worshipping hype. You start respecting control, consequences, and process, same applies to AI.
I run a 6-node air-gapped (no internet connection) Mac cluster. Thunderbolt daisy-chain in an EXO Labs environment. Data transfers via USB only.
Hard backups on: Panasonic LKM-KB12 MEDICAL ADA-RE Blu-ray Drive.
The system is never overloaded. If one node goes bust, I can do a hot swap without powering down the whole cluster. 6 cloned redundant storage copies if one goes bust, I have 5 others.
LLM models: MiniMax-M2.5-4bit-MLX Llama 3.3 70b-instruct-fp16-MLX
Not the best, not the flashiest, but running 24/7 on mine cluster.
People say: local models are "outdated" - the lag argument.
Fine. Open-weight models typically trail the frontier by about 3 months (Epoch AI data). Stanford's AI Index says the gap shrank from 8% to 1.7% in a single year.
Today's "primitive" local model has more capability than what people were paying premium cloud prices for not long ago.
That's not obsolete. That's discounted capability.
Let’s talk about economics.
Cloud AI is rent. You pay, you use, you pay again. Stop paying = capability gone.
My entire 6-node cluster cost under £5,000. That's roughly what 2 years of a single $200/mo cloud Pro subscription costs.
The difference? After 2 years, the cloud subscriber owns nothing. I still own 6 machines running 24/7. And they still hold their resale value.
Try reselling your old API invoices.
The Meter Never Sleeps.
People cry in forums about “burning” $200 in three days on API calls they didn't expect.
Here's why: the API has no memory. Every follow-up resends your entire conversation history. Turn 1 costs 100 tokens. Turn 10 costs the sum of all previous turns plus the new one (costs compound).
Now add agents. Every tool call, every loop, every retry burns tokens. A runaway process at 3am eats your budget while you sleep. Output tokens cost 3–5x more than input.
On my cluster, a mistake at 3am costs me pennies in electricity. The whole 6-node cluster draws below 150W under heavy compute and below 50W when idle. That's (10x) less than a kettle.
On the cloud, the same mistake costs your budget and you won't know until the invoice arrives.
A few days ago, OpenAI has announced the shutdown of the Sora app.
App shutdown scheduled for April 26 2026, API shutdown scheduled for September 24 2026.
If your workflow depended on that API, you're scrambling right now. My cluster doesn't get discontinued by someone else's board meeting.
People trust cloud privacy policies like they're magic shields.
Reality: your prompts travel through third-party infrastructure with retention policies you didn't write. OpenAI and Anthropic both document default retention windows for API usage. AWS calls cloud security a "shared responsibility model".
On my air-gapped cluster, there's no third-party transit. The exposure surface is smaller because the data never leaves the room.
That's not paranoia. That's engineering.
Now some words about another latest "big thing", the agents.
Before I let agents like OpenClaw or Hermes anywhere near my email, Telegram, or real workflows, I keep “the beast in the cage”.
I can train it, break it, test it, learn its habits. Only when I take control of it will I allow it to touch the outside world.
Most people connect "the beast" to their life on day one, I cage it first.
The App Fantasy: Many are chasing the "build one app, become a billionaire" dream. Most new businesses don't survive five years.
I'd rather have one boring, stable pipeline that works every week than ten imaginary unicorn apps. Small but regular income beats fantasy every time.
As they say: the best camera won't make you a photographer. The best model on Earth won't make you a good operator.
You rent intelligence, I bought infrastructure. Call it outdated if you want, I call it owned.
I didn't figure this out alone. Nobody does.
If any of this resonates or if you think I'm wrong, learn from the people who shaped my thinking:
Alex Cheema (
@alexocheema), co-founder of EXO Labs. He builds the infrastructure many others only talk about. My cluster runs on his work.
Alex Ziskind (
@digitalix), developer, trainer, author at
@NativeScripting. Teaches people to actually build things instead of chasing hype.
Alex Finn (
@AlexFinn), founder of Creator Buddy and Vibe Coding Academy. Turns AI theory into real business results.
Three builders. Three different angles. One shared principle: own what you build.
If this resonated, repost it. If you think I'm wrong, quote it and tell me why.
Either way build something.
William Ruider.