#polyvit

Published papers at TMLRtmlrpub@sigmoid.social
2023-01-19

PolyViT: Co-training Vision Transformers on Images, Videos and Audio

Valerii Likhosherstov, Anurag Arnab, Krzysztof Marcin Choromanski et al.

openreview.net/forum?id=zKnqZe

#videos #polyvit #modality

Client Info

Server: https://mastodon.social
Version: 2025.04
Repository: https://github.com/cyevgeniy/lmst