Synchronization is all you need.
Transformer attention emerges from Kuramoto oscillator dynamics, the same equations that sync fireflies and metronomes. No softmax, no exponentials. Just a physical network relaxing to equilibrium.
Watch it write a story, one word at a time 👇🏽