"Learning CLIP Guided Visual-Text Fusion Transformer for Video-based ..."

Jun Zhu et al. (2023)

Details and statistics

DOI: 10.1109/CVPRW59228.2023.00261

access: closed

type: Conference or Workshop Paper

metadata version: 2023-08-23

a service of  Schloss Dagstuhl - Leibniz Center for Informatics