Demystifying Efficient Self-AttentionA practical overview of efficient attention mechanisms that tackle the quadratic scaling problem.