"Joint learning of images and videos with a single Vision Transformer."

Shuki Shimizu, Toru Tamaki (2023)

Details and statistics

DOI: 10.48550/ARXIV.2308.10533

access: open

type: Informal or Other Publication

metadata version: 2023-08-30

a service of  Schloss Dagstuhl - Leibniz Center for Informatics