For this one, I used a combination of four open source models: Qwen 2511 edit, Klein 9B, and Z-Image, along with the SeedVR upscaler, all through the ComfyUI backend. Sometimes when I'm lazy I use Grok Imagine as well, but it's pretty low-res.
Object Swapping🔁 👗👑🧥 on
Flux-2-klein-9b
This simple flux-2-klein-9b flow swaps objects using a reference image.
It’s pretty smooth.
It uses SAM2 for the segmentation and SeedVR to push the final result to 4K.
Detailed workflow post:
reddit.com/r/comfyui/comment…
TECH STACK
The open source stack:
→ Z-Image Turbo — keyframes
→ LTX 2.3 Pro — video generation
→ SeedVR — 4K upscaling
→ Claude Code — AI agent orchestration
→ Editing: Premiere Pro, CapCut
→ Music: "Mad Priest" by @RokNardin
Hardware: RTX 3090, Mac Mini M4.
Example:
Image 1 = input
Image 2 = output
I had Claude write me a small script that, given an mp4, packs its frames into a collage, and a separate script that unpacks the Nano Banana Pro outputs into single frames, which I then upscale as a video sequence with SeedVR.
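Not the actual scripts, but a minimal sketch of the grid bookkeeping a pack/unpack pair like this would probably share. It assumes all frames are the same size and are laid out left to right, top to bottom. The function names (`grid_shape`, `tile_boxes`, `rescale_boxes`) are hypothetical, and reading the mp4 and writing the pixels are left out. The rescale step is also an assumption: it only matters if the edited collage comes back at a different resolution than the one that was sent.

```python
import math
from typing import List, Tuple

# (left, top, right, bottom) in pixels, the same order Pillow's crop() expects
Box = Tuple[int, int, int, int]


def grid_shape(n_frames: int, cols: int) -> Tuple[int, int]:
    """Return (rows, cols) needed to hold n_frames tiles in a collage."""
    if n_frames <= 0 or cols <= 0:
        raise ValueError("n_frames and cols must be positive")
    return math.ceil(n_frames / cols), cols


def tile_boxes(n_frames: int, cols: int, tile_w: int, tile_h: int) -> List[Box]:
    """Box of each frame inside the collage, in frame order (row-major)."""
    boxes = []
    for i in range(n_frames):
        row, col = divmod(i, cols)  # frame i sits at this grid cell
        left, top = col * tile_w, row * tile_h
        boxes.append((left, top, left + tile_w, top + tile_h))
    return boxes


def rescale_boxes(boxes: List[Box], scale_x: float, scale_y: float) -> List[Box]:
    """Map boxes onto a collage that was returned at a different resolution."""
    return [
        (round(l * scale_x), round(t * scale_y), round(r * scale_x), round(b * scale_y))
        for (l, t, r, b) in boxes
    ]
```

Packing would paste frame i at the top-left corner of `tile_boxes(...)[i]`, and unpacking would crop the same boxes back out in order (for example with Pillow's `Image.paste` and `Image.crop`), so the single frames keep their original sequence for the upscaler.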