Multiview Transformers for Video Recognition[CVPR2022]+Egocentric Video-Language Pretraining[arxiv2022]

Boshen Xu

2022/08/22

arch