Facebook Open-Sources Computer Vision Model Multiscale Vision Transformers
Facebook AI Introduces Multiscale Vision Transformers (MViT), A Transformer Architecture For Representation Learning From Visual Data
Facebook AI Introduces 'ConViT', A Computer Vision Model That Improves Vision Transformers (ViT) With Soft Convolutional Inductive Biases