"SCLIP: Rethinking Self-Attention for Dense Vision-Language Inference."

Feng Wang, Jieru Mei, Alan L. Yuille (2023)

DOI: 10.48550/ARXIV.2312.01597

access: open

type: Informal or Other Publication

metadata version: 2023-12-12