56,000 tokens/sec at just 80 MHz. đ€Ż
I burned a full Transformer with KV cache into a custom chip. Designed gate by gate as a 100% digital integrated circuit. Prototyped on a FPGA. (No GPU. No CPU)
Just pure digital silicon running
@karpathy microGPT, spelling out names on a tiny LCD.
This is GateGPT đ