PLG-ViT: Vision Transformer with Parallel Local and Global Self-Attention
Recently, transformer architectures have shown superior performance compared to their CNN counterparts in many computer vision tasks. The self-attention mechanism enables transformer networks to...
mdpi.com