[LG-Transformer] Local-to-Global Self-Attention in Vision Transformers

{SW-MSA; Local-Global-Attention}

[Paper] [Code]