"CLIP-Guided Vision-Language Pre-training for Question Answering in 3D Scenes."

Maria Parelli et al. (2023)

Details and statistics

DOI: 10.1109/CVPRW59228.2023.00593

access: closed

type: Conference or Workshop Paper

metadata version: 2023-08-23

a service of  Schloss Dagstuhl - Leibniz Center for Informatics