the biggest jump in my Stampede! experiments came from replacing text-only anchors with image-reference centroids.
text anchors proved the idea, but only ~50% of holdouts were safe enough to accept. after switching to image-reference centroids, safe accepts jumped to 95% and false accepts dropped to 0.
the workflow i’d steal:
test encoder offline, start with text anchors, add real reference images, keep separate holdouts, add near-misses that should fail, and only accept when the top match clearly beats the runner-up.
follow along if you want more notes from my journey learning how to train, ship, and actually use open-source models in real apps.