Have been playing around with some automated scan alignment under the absence of any related metadata (cameras, registration, etc...). All classical geometric solutions work except they do need some validation from a visual model. Gemini is the best so far, then GPT, then Grok.