"Focal Visual-Text Attention for Visual Question Answering."

Junwei Liang et al. (2018)