Mixture of Nested Experts: Adaptive Processing of Visual Tokens Paper • 2407.19985 • Published Jul 29, 2024 • 37
Optimizing ViViT Training: Time and Memory Reduction for Action Recognition Paper • 2306.04822 • Published Jun 7, 2023 • 2
PaLI-X: On Scaling up a Multilingual Vision and Language Model Paper • 2305.18565 • Published May 29, 2023 • 3