"Vision-Language Relational Transformer for Video-to-Text Generation."

Tengpeng Li et al. (2025)

Details and statistics

DOI: 10.1109/TMM.2025.3535394

access: closed

type: Journal Article

metadata version: 2025-08-07