Stop the war!

Остановите войну!

for scientists:

default search action

combined dblp search
author search
venue search
publication search

ask others

Arsha Nagrani

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[i55]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2404-01297
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-01297
Xingyi Zhou, Anurag Arnab, Shyamal Buch, Shen Yan, Austin Myers, Xuehan Xiong, Arsha Nagrani, Cordelia Schmid:
Streaming Dense Video Captioning. CoRR abs/2404.01297 (2024)
[i54]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2404-06511
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-06511
Juhong Min, Shyamal Buch, Arsha Nagrani, Minsu Cho, Cordelia Schmid:
MoReVQA: Exploring Modular Reasoning Models for Video Question Answering. CoRR abs/2404.06511 (2024)
[i53]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2404-14412
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-14412
Tengda Han, Max Bain, Arsha Nagrani, Gül Varol, Weidi Xie, Andrew Zisserman:
AutoAD III: The Prequel - Back to the Pixels. CoRR abs/2404.14412 (2024)
2023
[c39]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/SubramanianNKYN23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/SubramanianNKYN23
Sanjay Subramanian, Medhini Narasimhan, Kushal Khangaonkar, Kevin Yang, Arsha Nagrani, Cordelia Schmid, Andy Zeng, Trevor Darrell, Dan Klein:
Modular Visual Question Answering via Code Generation. ACL (2) 2023: 747-761
[c38]
- view
  authority control:
- export record
  dblp key:
  - conf/cvpr/YangNSMPLSS23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/YangNSMPLSS23
Antoine Yang, Arsha Nagrani, Paul Hongsuck Seo, Antoine Miech, Jordi Pont-Tuset, Ivan Laptev, Josef Sivic, Cordelia Schmid:
Vid2Seq: Large-Scale Pretraining of a Visual Language Model for Dense Video Captioning. CVPR 2023: 10714-10726
[c37]
- view
  authority control:
- export record
  dblp key:
  - conf/cvpr/HanBNVXZ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/HanBNVXZ23
Tengda Han, Max Bain, Arsha Nagrani, Gül Varol, Weidi Xie, Andrew Zisserman:
AutoAD: Movie Description in Context. CVPR 2023: 18930-18940
[c36]
- view
  authority control:
- export record
  dblp key:
  - conf/cvpr/SeoNS23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/SeoNS23
Paul Hongsuck Seo, Arsha Nagrani, Cordelia Schmid:
AVFormer: Injecting Vision into Frozen Speech Models for Zero-Shot AV-ASR. CVPR 2023: 22922-22931
[c35]
- view
  authority control:
- export record
  dblp key:
  - conf/iccv/YanXNAWGRS23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iccv/YanXNAWGRS23
Shen Yan, Xuehan Xiong, Arsha Nagrani, Anurag Arnab, Zhonghao Wang, Weina Ge, David Ross, Cordelia Schmid:
UnLoc: A Unified Framework for Video Localization Tasks. ICCV 2023: 13577-13587
[c34]
- view
  authority control:
- export record
  dblp key:
  - conf/iccv/HanBNVXZ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iccv/HanBNVXZ23
Tengda Han, Max Bain, Arsha Nagrani, Gül Varol, Weidi Xie, Andrew Zisserman:
AutoAD II: The Sequel - Who, When, and What in Movie Audio Description. ICCV 2023: 13599-13609
[c33]
- view
  authority control:
- export record
  dblp key:
  - conf/iccv/MomeniCNZS23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iccv/MomeniCNZS23
Liliane Momeni, Mathilde Caron, Arsha Nagrani, Andrew Zisserman, Cordelia Schmid:
Verbs in Action: Improving verb understanding in video-language models. ICCV 2023: 15533-15545
[c32]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/GongBSNEJ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GongBSNEJ23
Taesik Gong, Josh Belanich, Krishna Somandepalli, Arsha Nagrani, Brian Eoff, Brendan Jou:
LanSER: Language-Model Supported Speech Emotion Recognition. INTERSPEECH 2023: 2408-2412
[c31]
- view
  - electronic edition @ nips.cc (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/nips/YangNLSS23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/YangNLSS23
Antoine Yang, Arsha Nagrani, Ivan Laptev, Josef Sivic, Cordelia Schmid:
VidChapters-7M: Video Chapters at Scale. NeurIPS 2023
[i52]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2302-10248
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-10248
Jaesung Huh, Andrew Brown, Jee-weon Jung, Joon Son Chung, Arsha Nagrani, Daniel Garcia-Romero, Andrew Zisserman:
VoxSRC 2022: The Fourth VoxCeleb Speaker Recognition Challenge. CoRR abs/2302.10248 (2023)
[i51]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2302-14115
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-14115
Antoine Yang, Arsha Nagrani, Paul Hongsuck Seo, Antoine Miech, Jordi Pont-Tuset, Ivan Laptev, Josef Sivic, Cordelia Schmid:
Vid2Seq: Large-Scale Pretraining of a Visual Language Model for Dense Video Captioning. CoRR abs/2302.14115 (2023)
[i50]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2303-16501
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2303-16501
Paul Hongsuck Seo, Arsha Nagrani, Cordelia Schmid:
AVFormer: Injecting Vision into Frozen Speech Models for Zero-Shot AV-ASR. CoRR abs/2303.16501 (2023)
[i49]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2303-16899
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2303-16899
Tengda Han, Max Bain, Arsha Nagrani, Gül Varol, Weidi Xie, Andrew Zisserman:
AutoAD: Movie Description in Context. CoRR abs/2303.16899 (2023)
[i48]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2304-02560
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2304-02560
Kumara Kahatapitiya, Anurag Arnab, Arsha Nagrani, Michael S. Ryoo:
VicTR: Video-conditioned Text Representations for Activity Recognition. CoRR abs/2304.02560 (2023)
[i47]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2304-06708
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2304-06708
Liliane Momeni, Mathilde Caron, Arsha Nagrani, Andrew Zisserman, Cordelia Schmid:
Verbs in Action: Improving verb understanding in video-language models. CoRR abs/2304.06708 (2023)
[i46]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-18565
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-18565
Xi Chen, Josip Djolonga, Piotr Padlewski, Basil Mustafa, Soravit Changpinyo, Jialin Wu, Carlos Riquelme Ruiz, Sebastian Goodman, Xiao Wang, Yi Tay, Siamak Shakeri, Mostafa Dehghani, Daniel Salz, Mario Lucic, Michael Tschannen, Arsha Nagrani, Hexiang Hu, Mandar Joshi, Bo Pang, Ceslee Montgomery, Paulina Pietrzyk, Marvin Ritter, A. J. Piergiovanni, Matthias Minderer, Filip Pavetic, Austin Waters, Gang Li, Ibrahim Alabdulmohsin, Lucas Beyer, Julien Amelot, Kenton Lee, Andreas Peter Steiner, Yang Li, Daniel Keysers, Anurag Arnab, Yuanzhong Xu, Keran Rong, Alexander Kolesnikov, Mojtaba Seyedhosseini, Anelia Angelova, Xiaohua Zhai, Neil Houlsby, Radu Soricut:
PaLI-X: On Scaling up a Multilingual Vision and Language Model. CoRR abs/2305.18565 (2023)
[i45]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-05392
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-05392
Sanjay Subramanian, Medhini Narasimhan, Kushal Khangaonkar, Kevin Yang, Arsha Nagrani, Cordelia Schmid, Andy Zeng, Trevor Darrell, Dan Klein:
Modular Visual Question Answering via Code Generation. CoRR abs/2306.05392 (2023)
[i44]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2308-11062
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2308-11062
Shen Yan, Xuehan Xiong, Arsha Nagrani, Anurag Arnab, Zhonghao Wang, Weina Ge, David Ross, Cordelia Schmid:
UnLoc: A Unified Framework for Video Localization Tasks. CoRR abs/2308.11062 (2023)
[i43]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-03978
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-03978
Taesik Gong, Josh Belanich, Krishna Somandepalli, Arsha Nagrani, Brian Eoff, Brendan Jou:
LanSER: Language-Model Supported Speech Emotion Recognition. CoRR abs/2309.03978 (2023)
[i42]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-13952
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-13952
Antoine Yang, Arsha Nagrani, Ivan Laptev, Josef Sivic, Cordelia Schmid:
VidChapters-7M: Video Chapters at Scale. CoRR abs/2309.13952 (2023)
[i41]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-06838
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-06838
Tengda Han, Max Bain, Arsha Nagrani, Gül Varol, Weidi Xie, Andrew Zisserman:
AutoAD II: The Sequel - Who, When, and What in Movie Audio Description. CoRR abs/2310.06838 (2023)
[i40]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-02188
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-02188
Hammad A. Ayyubi, Tianqi Liu, Arsha Nagrani, Xudong Lin, Mingda Zhang, Anurag Arnab, Feng Han, Yukun Zhu, Jialu Liu, Shih-Fu Chang:
Video Summarization: Towards Entity-Aware Captions. CoRR abs/2312.02188 (2023)
2022
[c30]
- view
  authority control:
- export record
  dblp key:
  - conf/cvpr/SeoNAS22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/SeoNAS22
Paul Hongsuck Seo, Arsha Nagrani, Anurag Arnab, Cordelia Schmid:
End-to-end Generative Pretraining for Multimodal Video Captioning. CVPR 2022: 17938-17947
[c29]
- view
  authority control:
- export record
  dblp key:
  - conf/eccv/NagraniSSHMSS22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eccv/NagraniSSHMSS22
Arsha Nagrani, Paul Hongsuck Seo, Bryan Seybold, Anja Hauth, Santiago Manen, Chen Sun, Cordelia Schmid:
Learning Audio-Video Modalities from Image Captions. ECCV (14) 2022: 407-426
[c28]
- view
  authority control:
- export record
  dblp key:
  - conf/eccv/NarasimhanNSRDR22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eccv/NarasimhanNSRDR22
Medhini Narasimhan, Arsha Nagrani, Chen Sun, Michael Rubinstein, Trevor Darrell, Anna Rohrbach, Cordelia Schmid:
TL;DW? Summarizing Instructional Videos with Task Relevance and Cross-Modal Saliency. ECCV (34) 2022: 540-557
[c27]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/GabeurSN0AS22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GabeurSN0AS22
Valentin Gabeur, Paul Hongsuck Seo, Arsha Nagrani, Chen Sun, Karteek Alahari, Cordelia Schmid:
AVATAR: Unconstrained Audiovisual Speech Recognition. INTERSPEECH 2022: 2818-2822
[c26]
- view
  authority control:
- export record
  dblp key:
  - conf/wacv/GabeurN0AS22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/wacv/GabeurN0AS22
Valentin Gabeur, Arsha Nagrani, Chen Sun, Karteek Alahari, Cordelia Schmid:
Masking Modalities for Cross-modal Video Retrieval. WACV 2022: 2111-2120
[i39]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2201-04583
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2201-04583
Andrew Brown, Jaesung Huh, Joon Son Chung, Arsha Nagrani, Andrew Zisserman:
VoxSRC 2021: The Third VoxCeleb Speaker Recognition Challenge. CoRR abs/2201.04583 (2022)
[i38]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2201-08264
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2201-08264
Paul Hongsuck Seo, Arsha Nagrani, Anurag Arnab, Cordelia Schmid:
End-to-end Generative Pretraining for Multimodal Video Captioning. CoRR abs/2201.08264 (2022)
[i37]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2204-00679
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2204-00679
Arsha Nagrani, Paul Hongsuck Seo, Bryan Seybold, Anja Hauth, Santiago Manen, Chen Sun, Cordelia Schmid:
Learning Audio-Video Modalities from Image Captions. CoRR abs/2204.00679 (2022)
[i36]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2205-08508
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2205-08508
Max Bain, Arsha Nagrani, Gül Varol, Andrew Zisserman:
A CLIP-Hitchhiker's Guide to Long Video Retrieval. CoRR abs/2205.08508 (2022)
[i35]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2206-07684
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-07684
Valentin Gabeur, Paul Hongsuck Seo, Arsha Nagrani, Chen Sun, Karteek Alahari, Cordelia Schmid:
AVATAR: Unconstrained Audiovisual Speech Recognition. CoRR abs/2206.07684 (2022)
[i34]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2206-09852
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-09852
Xuehan Xiong, Anurag Arnab, Arsha Nagrani, Cordelia Schmid:
M&M Mix: A Multimodal Multiview Transformer Ensemble. CoRR abs/2206.09852 (2022)
[i33]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2208-06773
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2208-06773
Medhini Narasimhan, Arsha Nagrani, Chen Sun, Michael Rubinstein, Trevor Darrell, Anna Rohrbach, Cordelia Schmid:
TL;DW? Summarizing Instructional Videos with Task Relevance & Cross-Modal Saliency. CoRR abs/2208.06773 (2022)
[i32]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-09966
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-09966
Paul Hongsuck Seo, Arsha Nagrani, Cordelia Schmid:
AVATAR submission to the Ego4D AV Transcription Challenge. CoRR abs/2211.09966 (2022)
2021
[c25]
- view
  - electronic edition @ bmvc2021-virtualconference.com (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/bmvc/AfourasCXNVZ21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/bmvc/AfourasCXNVZ21
Triantafyllos Afouras, Honglie Chen, Weidi Xie, Arsha Nagrani, Andrea Vedaldi, Andrew Zisserman:
Audio-Visual Synchronisation in the wild. BMVC 2021: 261
[c24]
- view
  - electronic edition @ bmvc2021-virtualconference.com (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/bmvc/KazakosHNZD21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/bmvc/KazakosHNZD21
Evangelos Kazakos, Jaesung Huh, Arsha Nagrani, Andrew Zisserman, Dima Damen:
With a Little Help from my Temporal Context: Multimodal Egocentric Action Recognition. BMVC 2021: 268
[c23]
- view
  authority control:
- export record
  dblp key:
  - conf/cvpr/ChenXANVZ21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/ChenXANVZ21
Honglie Chen, Weidi Xie, Triantafyllos Afouras, Arsha Nagrani, Andrea Vedaldi, Andrew Zisserman:
Localizing Visual Sounds the Hard Way. CVPR 2021: 16867-16876
[c22]
- view
  authority control:
- export record
  dblp key:
  - conf/cvpr/SeoNS21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/SeoNS21
Paul Hongsuck Seo, Arsha Nagrani, Cordelia Schmid:
Look Before You Speak: Visually Contextualized Utterances. CVPR 2021: 16877-16887
[c21]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/KazakosNZD21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KazakosNZD21
Evangelos Kazakos, Arsha Nagrani, Andrew Zisserman, Dima Damen:
Slow-Fast Auditory Streams for Audio Recognition. ICASSP 2021: 855-859
[c20]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/0006HNCZ21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/0006HNCZ21
Andrew Brown, Jaesung Huh, Arsha Nagrani, Joon Son Chung, Andrew Zisserman:
Playing a Part: Speaker Verification at the movies. ICASSP 2021: 6174-6178
[c19]
- view
  authority control:
- export record
  dblp key:
  - conf/iccv/BainNVZ21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iccv/BainNVZ21
Max Bain, Arsha Nagrani, Gül Varol, Andrew Zisserman:
Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval. ICCV 2021: 1708-1718
[c18]
- view
  authority control:
- export record
  dblp key:
  - conf/iccv/0002NTS21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iccv/0002NTS21
Chen Sun, Arsha Nagrani, Yonglong Tian, Cordelia Schmid:
Composable Augmentation Encoding for Video Representation Learning. ICCV 2021: 8814-8824
[c17]
- view
  - electronic edition @ neurips.cc (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/nips/NagraniYAJSS21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/NagraniYAJSS21
Arsha Nagrani, Shan Yang, Anurag Arnab, Aren Jansen, Cordelia Schmid, Chen Sun:
Attention Bottlenecks for Multimodal Fusion. NeurIPS 2021: 14200-14213
[i31]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2101-03787
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2101-03787
Hazel Doughty, Nour Karessli, Kathryn Leonard, Boyi Li, Carianne Martinez, Azadeh Mobasher, Arsha Nagrani, Srishti Yadav:
WiCV 2020: The Seventh Women In Computer Vision Workshop. CoRR abs/2101.03787 (2021)
[i30]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2103-03516
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2103-03516
Evangelos Kazakos, Arsha Nagrani, Andrew Zisserman, Dima Damen:
Slow-Fast Auditory Streams For Audio Recognition. CoRR abs/2103.03516 (2021)
[i29]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2104-00616
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2104-00616
Chen Sun, Arsha Nagrani, Yonglong Tian, Cordelia Schmid:
Composable Augmentation Encoding for Video Representation Learning. CoRR abs/2104.00616 (2021)
[i28]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2104-00650
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2104-00650
Max Bain, Arsha Nagrani, Gül Varol, Andrew Zisserman:
Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval. CoRR abs/2104.00650 (2021)
[i27]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2104-02691
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2104-02691
Honglie Chen, Weidi Xie, Triantafyllos Afouras, Arsha Nagrani, Andrea Vedaldi, Andrew Zisserman:
Localizing Visual Sounds the Hard Way. CoRR abs/2104.02691 (2021)
[i26]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2107-00135
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2107-00135
Arsha Nagrani, Shan Yang, Anurag Arnab, Aren Jansen, Cordelia Schmid, Chen Sun:
Attention Bottlenecks for Multimodal Fusion. CoRR abs/2107.00135 (2021)
[i25]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2111-01024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2111-01024
Evangelos Kazakos, Jaesung Huh, Arsha Nagrani, Andrew Zisserman, Dima Damen:
With a Little Help from my Temporal Context: Multimodal Egocentric Action Recognition. CoRR abs/2111.01024 (2021)
[i24]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2111-01300
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2111-01300
Valentin Gabeur, Arsha Nagrani, Chen Sun, Karteek Alahari, Cordelia Schmid:
Masking Modalities for Cross-modal Video Retrieval. CoRR abs/2111.01300 (2021)
[i23]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2112-04432
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2112-04432
Honglie Chen, Weidi Xie, Triantafyllos Afouras, Arsha Nagrani, Andrea Vedaldi, Andrew Zisserman:
Audio-Visual Synchronisation in the wild. CoRR abs/2112.04432 (2021)
2020
[b1]
- view
- export record
  dblp key:
  - phd/ethos/Nagrani20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/phd/ethos/Nagrani20
Arsha Nagrani:
Video understanding using multimodal deep learning. University of Oxford, UK, 2020
[j1]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/csl/NagraniCXZ20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/csl/NagraniCXZ20
Arsha Nagrani, Joon Son Chung, Weidi Xie, Andrew Zisserman:
Voxceleb: Large-scale speaker verification in the wild. Comput. Speech Lang. 60 (2020)
[c16]
- view
  authority control:
- export record
  dblp key:
  - conf/accv/BainNBZ20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/accv/BainNBZ20
Max Bain, Arsha Nagrani, Andrew Brown, Andrew Zisserman:
Condensed Movies: Story Based Retrieval with Contextual Embeddings. ACCV (5) 2020: 460-479
[c15]
- view
  authority control:
- export record
  dblp key:
  - conf/cvpr/NagraniSRSSZ20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/NagraniSRSSZ20
Arsha Nagrani, Chen Sun, David Ross, Rahul Sukthankar, Cordelia Schmid, Andrew Zisserman:
Speech2Action: Cross-Modal Supervision for Action Recognition. CVPR 2020: 10314-10323
[c14]
- view
  authority control:
- export record
  dblp key:
  - conf/eccv/Arnab0NS20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eccv/Arnab0NS20
Anurag Arnab, Chen Sun, Arsha Nagrani, Cordelia Schmid:
Uncertainty-Aware Weakly Supervised Action Detection from Untrimmed Videos. ECCV (10) 2020: 751-768
[c13]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/NagraniCAZ20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/NagraniCAZ20
Arsha Nagrani, Joon Son Chung, Samuel Albanie, Andrew Zisserman:
Disentangled Speech Embeddings Using Cross-Modal Self-Supervision. ICASSP 2020: 6829-6833
[c12]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ChungHNAZ20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChungHNAZ20
Joon Son Chung, Jaesung Huh, Arsha Nagrani, Triantafyllos Afouras, Andrew Zisserman:
Spot the Conversation: Speaker Diarisation in the Wild. INTERSPEECH 2020: 299-303
[i22]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2002-08742
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2002-08742
Arsha Nagrani, Joon Son Chung, Samuel Albanie, Andrew Zisserman:
Disentangled Speech Embeddings using Cross-modal Self-supervision. CoRR abs/2002.08742 (2020)
[i21]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2003-13594
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2003-13594
Arsha Nagrani, Chen Sun, David Ross, Rahul Sukthankar, Cordelia Schmid, Andrew Zisserman:
Speech2Action: Cross-modal Supervision for Action Recognition. CoRR abs/2003.13594 (2020)
[i20]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2005-04208
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2005-04208
Max Bain, Arsha Nagrani, Andrew Brown, Andrew Zisserman:
Condensed Movies: Story Based Retrieval with Contextual Embeddings. CoRR abs/2005.04208 (2020)
[i19]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2007-01216
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2007-01216
Joon Son Chung, Jaesung Huh, Arsha Nagrani, Triantafyllos Afouras, Andrew Zisserman:
Spot the conversation: speaker diarisation in the wild. CoRR abs/2007.01216 (2020)
[i18]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2007-10703
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2007-10703
Anurag Arnab, Chen Sun, Arsha Nagrani, Cordelia Schmid:
Uncertainty-Aware Weakly Supervised Action Detection from Untrimmed Videos. CoRR abs/2007.10703 (2020)
[i17]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2008-00744
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2008-00744
Samuel Albanie, Yang Liu, Arsha Nagrani, Antoine Miech, Ernesto Coto, Ivan Laptev, Rahul Sukthankar, Bernard Ghanem, Andrew Zisserman, Valentin Gabeur, Chen Sun, Karteek Alahari, Cordelia Schmid, Shizhe Chen, Yida Zhao, Qin Jin, Kaixu Cui, Hui Liu, Chen Wang, Yudong Jiang, Xiaoshuai Hao:
The End-of-End-to-End: A Video Understanding Pentathlon Challenge (2020). CoRR abs/2008.00744 (2020)
[i16]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2009-08790
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2009-08790
Piyush Bagad, Aman Dalmia, Jigar Doshi, Arsha Nagrani, Parag Bhamare, Amrita Mahale, Saurabh Rane, Neeraj Agarwal, Rahul Panicker:
Cough Against COVID: Evidence of COVID-19 Signature in Cough Sounds. CoRR abs/2009.08790 (2020)
[i15]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2010-15716
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-15716
Andrew Brown, Jaesung Huh, Arsha Nagrani, Joon Son Chung, Andrew Zisserman:
Playing a Part: Speaker Verification at the Movies. CoRR abs/2010.15716 (2020)
[i14]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2012-05710
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2012-05710
Paul Hongsuck Seo, Arsha Nagrani, Cordelia Schmid:
Look Before you Speak: Visually Contextualized Utterances. CoRR abs/2012.05710 (2020)
[i13]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2012-06867
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2012-06867
Arsha Nagrani, Joon Son Chung, Jaesung Huh, Andrew Brown, Ernesto Coto, Weidi Xie, Mitchell McLaren, Douglas A. Reynolds, Andrew Zisserman:
VoxSRC 2020: The Second VoxCeleb Speaker Recognition Challenge. CoRR abs/2012.06867 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[c11]
- view
  - electronic edition @ bmvc2019.org (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/bmvc/LiuANZ19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/bmvc/LiuANZ19
Yang Liu, Samuel Albanie, Arsha Nagrani, Andrew Zisserman:
Use What You Have: Video retrieval using representations from collaborative experts. BMVC 2019: 279
[c10]
- view
  authority control:
- export record
  dblp key:
  - conf/cvpr/AmeriniBELNS19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/AmeriniBELNS19
Irene Amerini, Elena Balashova, Sayna Ebrahimi, Kathryn Leonard, Arsha Nagrani, Amaia Salvador:
WiCV 2019: The Sixth Women In Computer Vision Workshop. CVPR Workshops 2019: 469-471
[c9]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/XieNCZ19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/XieNCZ19
Weidi Xie, Arsha Nagrani, Joon Son Chung, Andrew Zisserman:
Utterance-level Aggregation for Speaker Recognition in the Wild. ICASSP 2019: 5791-5795
[c8]
- view
  authority control:
- export record
  dblp key:
  - conf/iccv/KazakosNZD19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iccv/KazakosNZD19
Evangelos Kazakos, Arsha Nagrani, Andrew Zisserman, Dima Damen:
EPIC-Fusion: Audio-Visual Temporal Binding for Egocentric Action Recognition. ICCV 2019: 5491-5500
[c7]
- view
  authority control:
- export record
  dblp key:
  - conf/iccvw/BainNSZ19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iccvw/BainNSZ19
Max Bain, Arsha Nagrani, Daniel Schofield, Andrew Zisserman:
Count, Crop and Recognise: Fine-Grained Recognition in the Wild. ICCV Workshops 2019: 236-246
[i12]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1902-10107
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1902-10107
Weidi Xie, Arsha Nagrani, Joon Son Chung, Andrew Zisserman:
Utterance-level Aggregation For Speaker Recognition In The Wild. CoRR abs/1902.10107 (2019)
[i11]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1907-13487
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1907-13487
Yang Liu, Samuel Albanie, Arsha Nagrani, Andrew Zisserman:
Use What You Have: Video Retrieval Using Representations From Collaborative Experts. CoRR abs/1907.13487 (2019)
[i10]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1908-08498
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1908-08498
Evangelos Kazakos, Arsha Nagrani, Andrew Zisserman, Dima Damen:
EPIC-Fusion: Audio-Visual Temporal Binding for Egocentric Action Recognition. CoRR abs/1908.08498 (2019)
[i9]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1909-08950
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1909-08950
Max Bain, Arsha Nagrani, Daniel Schofield, Andrew Zisserman:
Count, Crop and Recognise: Fine-Grained Recognition in the Wild. CoRR abs/1909.08950 (2019)
[i8]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1909-10225
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1909-10225
Irene Amerini, Elena Balashova, Sayna Ebrahimi, Kathryn Leonard, Arsha Nagrani, Amaia Salvador:
WiCV 2019: The Sixth Women In Computer Vision Workshop. CoRR abs/1909.10225 (2019)
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1912-02522
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1912-02522
Joon Son Chung, Arsha Nagrani, Ernesto Coto, Weidi Xie, Mitchell McLaren, Douglas A. Reynolds, Andrew Zisserman:
VoxSRC 2019: The first VoxCeleb Speaker Recognition Challenge. CoRR abs/1912.02522 (2019)
2018
[c6]
- view
  authority control:
- export record
  dblp key:
  - conf/cvpr/NagraniAZ18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/NagraniAZ18
Arsha Nagrani, Samuel Albanie, Andrew Zisserman:
Seeing Voices and Hearing Faces: Cross-Modal Biometric Matching. CVPR 2018: 8427-8436
[c5]
- view
  authority control:
- export record
  dblp key:
  - conf/eccv/NagraniAZ18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eccv/NagraniAZ18
Arsha Nagrani, Samuel Albanie, Andrew Zisserman:
Learnable PINs: Cross-modal Embeddings for Person Identity. ECCV (13) 2018: 73-89
[c4]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ChungNZ18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChungNZ18
Joon Son Chung, Arsha Nagrani, Andrew Zisserman:
VoxCeleb2: Deep Speaker Recognition. INTERSPEECH 2018: 1086-1090
[c3]
- view
  authority control:
- export record
  dblp key:
  - conf/mm/AlbanieNVZ18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/AlbanieNVZ18
Samuel Albanie, Arsha Nagrani, Andrea Vedaldi, Andrew Zisserman:
Emotion Recognition in Speech using Cross-Modal Transfer in the Wild. ACM Multimedia 2018: 292-301
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1801-10442
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1801-10442
Arsha Nagrani, Andrew Zisserman:
From Benedict Cumberbatch to Sherlock Holmes: Character Identification in TV series without a Script. CoRR abs/1801.10442 (2018)
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1804-00326
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1804-00326
Arsha Nagrani, Samuel Albanie, Andrew Zisserman:
Seeing Voices and Hearing Faces: Cross-modal biometric matching. CoRR abs/1804.00326 (2018)
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1805-00833
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1805-00833
Arsha Nagrani, Samuel Albanie, Andrew Zisserman:
Learnable PINs: Cross-Modal Embeddings for Person Identity. CoRR abs/1805.00833 (2018)
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1806-05622
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1806-05622
Joon Son Chung, Arsha Nagrani, Andrew Zisserman:
VoxCeleb2: Deep Speaker Recognition. CoRR abs/1806.05622 (2018)
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1808-05561
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1808-05561
Samuel Albanie, Arsha Nagrani, Andrea Vedaldi, Andrew Zisserman:
Emotion Recognition in Speech using Cross-Modal Transfer in the Wild. CoRR abs/1808.05561 (2018)
2017
[c2]
- view
  - electronic edition @ dropbox.com (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/bmvc/NagraniZ17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/bmvc/NagraniZ17
Arsha Nagrani, Andrew Zisserman:
From Benedict Cumberbatch to Sherlock Holmes: Character Identification in TV series without a Script. BMVC 2017
[c1]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/NagraniCZ17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/NagraniCZ17
Arsha Nagrani, Joon Son Chung, Andrew Zisserman:
VoxCeleb: A Large-Scale Speaker Identification Dataset. INTERSPEECH 2017: 2616-2620
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/NagraniCZ17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/NagraniCZ17
Arsha Nagrani, Joon Son Chung, Andrew Zisserman:
VoxCeleb: a large-scale speaker identification dataset. CoRR abs/1706.08612 (2017)

Coauthor Index

see FAQ

a service of

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.