Latest from Google AI – A Multi-Axis Approach for Vision Transformer and MLP Models
Posted by Zhengzhong Tu and Yinxiao Li, Software Engineers, Google Research Convolutional neural networks have been the dominant machine learning architecture for computer vision since the introduction of AlexNet in 2012. Recently, inspired by the evolution of Transformers in natural language processing, attention mechanisms have been prominently incorporated into vision models. These attention methods boost…