Demystifying Efficient Self-Attention

A practical overview of efficient attention mechanisms that tackle the quadratic scaling problem.

November 7, 2022 ยท Thomas van Dongen