running Claude Code w/ local models
on my own GPUs at home
> vLLM serving GLM-4.5 Air
> on 4x RTX 3090s
> nvtop showing live GPU load
> Claude Code generating code docs
> end-to-end on my AI cluster
this is what local AI actually looks like
Buy a GPU