Beyond Self-attention: External Attention using Two Linear Layers for Visual Tasks

{Attention, Transformer, Multi-Layer Perceptrons}

[Paper] [Code]