VOLO: Vision Outlooker for Visual Recognition

Authors

Li Yuan, Qibin Hou, Zihang Jiang, Jiashi Feng, Shuicheng Yan

Published on

September 02, 2021

Publisher

arXiv 2021

We introduce a novel outlook attention and present a simple and general architecture, termed Vision Outlooker (VOLO), which can efficiently encode finer-level features and contexts into tokens. VOLO is the first model to exceed 87% top-1 accuracy on ImageNet without using any extra training data.