Simple, efficient, and surprisingly effective.
New paper alert π¨
What if I told you there is an architecture that provides a _knob_ to control quality-efficiency trade-offs directly at test-time?
Introducing Compress & Attend Transformers (CATs) that provide you exactly this!
π§΅(1/n) π