Concept: A single locked perspective setup of identical twins playing unseen opponents with the camera hovering around the net. The video model never sees the other team on the other side of the net but Grok Imagine has to understand things like speed, velocity, a single persistent tennis ball, and a lot more just to contextualize the tennis part. The video model also has to track the slight vocal nuances and mannerisms of identical twins and handle blocking, action, and pace of play.