
Dhruv Batra
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2021
- [j16]Joanne Truong
, Sonia Chernova
, Dhruv Batra:
Bi-Directional Domain Adaptation for Sim2Real Transfer of Embodied Navigation Agents. IEEE Robotics Autom. Lett. 6(2): 2634-2641 (2021) - [i108]Lina Mezghani, Sainbayar Sukhbaatar, Thibaut Lavril, Oleksandr Maksymets, Dhruv Batra, Piotr Bojanowski, Karteek Alahari:
Memory-Augmented Reinforcement Learning for Image-Goal Navigation. CoRR abs/2101.05181 (2021) - [i107]Brennan Shacklett, Erik Wijmans, Aleksei Petrenko, Manolis Savva, Dhruv Batra, Vladlen Koltun, Kayvon Fatahalian:
Large Batch Simulation for Deep Reinforcement Learning. CoRR abs/2103.07013 (2021) - [i106]Naoki Yokoyama, Sehoon Ha, Dhruv Batra:
Success Weighted by Completion Time: A Dynamics-Aware Evaluation Criteria for Embodied Navigation. CoRR abs/2103.08022 (2021) - [i105]Joel Ye, Dhruv Batra, Abhishek Das, Erik Wijmans:
Auxiliary Tasks and Exploration Enable ObjectNav. CoRR abs/2104.04112 (2021) - 2020
- [j15]Ramprasaath R. Selvaraju, Michael Cogswell, Abhishek Das, Ramakrishna Vedantam, Devi Parikh, Dhruv Batra:
Grad-CAM: Visual Explanations from Deep Networks via Gradient-Based Localization. Int. J. Comput. Vis. 128(2): 336-359 (2020) - [j14]Abhishek Kadian, Joanne Truong
, Aaron Gokaslan, Alexander Clegg
, Erik Wijmans, Stefan Lee, Manolis Savva, Sonia Chernova, Dhruv Batra:
Sim2Real Predictivity: Does Evaluation in Simulation Predict Real-World Performance? IEEE Robotics Autom. Lett. 5(4): 6670-6677 (2020) - [c104]Jacob Krantz, Erik Wijmans, Arjun Majumdar, Dhruv Batra, Stefan Lee:
Beyond the Nav-Graph: Vision-and-Language Navigation in Continuous Environments. ECCV (28) 2020: 104-120 - [c103]Arjun Majumdar, Ayush Shrivastava, Stefan Lee, Peter Anderson, Devi Parikh, Dhruv Batra:
Improving Vision-and-Language Navigation with Image-Text Pairs from the Web. ECCV (6) 2020: 259-274 - [c102]Vishvak Murahari, Dhruv Batra, Devi Parikh, Abhishek Das:
Large-Scale Pretraining for Visual Dialog: A Simple State-of-the-Art Baseline. ECCV (18) 2020: 336-352 - [c101]Medhini Narasimhan, Erik Wijmans, Xinlei Chen, Trevor Darrell, Dhruv Batra, Devi Parikh, Amanpreet Singh:
Seeing the Un-Scene: Learning Amodal Semantic Maps for Room Navigation. ECCV (18) 2020: 513-529 - [c100]Yash Kant, Dhruv Batra, Peter Anderson, Alexander G. Schwing, Devi Parikh, Jiasen Lu, Harsh Agrawal:
Spatially Aware Multimodal Transformers for TextVQA. ECCV (9) 2020: 715-732 - [c99]Meera Hahn, Jacob Krantz, Dhruv Batra, Devi Parikh, James M. Rehg, Stefan Lee, Peter Anderson:
Where Are You? Localization from Embodied Dialog. EMNLP (1) 2020: 806-822 - [c98]Erik Wijmans, Abhishek Kadian, Ari Morcos, Stefan Lee, Irfan Essa, Devi Parikh, Manolis Savva, Dhruv Batra:
DD-PPO: Learning Near-Perfect PointGoal Navigators from 2.5 Billion Frames. ICLR 2020 - [c97]Nirbhay Modhe, Prithvijit Chattopadhyay, Mohit Sharma, Abhishek Das, Devi Parikh, Dhruv Batra, Ramakrishna Vedantam:
IR-VIC: Unsupervised Discovery of Sub-goals for Transfer in RL. IJCAI 2020: 2022-2028 - [c96]Devendra Singh Chaplot, Lisa Lee, Ruslan Salakhutdinov, Devi Parikh, Dhruv Batra:
Embodied Multimodal Multitask Learning. IJCAI 2020: 2442-2448 - [c95]Michael Cogswell, Jiasen Lu, Rishabh Jain, Stefan Lee, Devi Parikh, Dhruv Batra:
Dialog without Dialog Data: Learning Visual Dialog Agents from VQA Data. NeurIPS 2020 - [i104]Erik Wijmans, Julian Straub, Dhruv Batra, Irfan Essa, Judy Hoffman, Ari Morcos:
Analyzing Visual Representations in Embodied Navigation Tasks. CoRR abs/2003.05993 (2020) - [i103]Jacob Krantz
, Erik Wijmans, Arjun Majumdar, Dhruv Batra, Stefan Lee:
Beyond the Nav-Graph: Vision-and-Language Navigation in Continuous Environments. CoRR abs/2004.02857 (2020) - [i102]Arjun Majumdar, Ayush Shrivastava, Stefan Lee, Peter Anderson, Devi Parikh, Dhruv Batra:
Improving Vision-and-Language Navigation with Image-Text Pairs from the Web. CoRR abs/2004.14973 (2020) - [i101]Dhruv Batra, Aaron Gokaslan, Aniruddha Kembhavi, Oleksandr Maksymets, Roozbeh Mottaghi, Manolis Savva, Alexander Toshev, Erik Wijmans:
ObjectNav Revisited: On Evaluation of Embodied Agents Navigating to Objects. CoRR abs/2006.13171 (2020) - [i100]Joel Ye, Dhruv Batra, Erik Wijmans, Abhishek Das:
Auxiliary Tasks Speed Up Learning PointGoal Navigation. CoRR abs/2007.04561 (2020) - [i99]Medhini Narasimhan, Erik Wijmans, Xinlei Chen, Trevor Darrell, Dhruv Batra, Devi Parikh, Amanpreet Singh:
Seeing the Un-Scene: Learning Amodal Semantic Maps for Room Navigation. CoRR abs/2007.09841 (2020) - [i98]Yash Kant, Dhruv Batra, Peter Anderson, Alexander G. Schwing, Devi Parikh, Jiasen Lu, Harsh Agrawal:
Spatially Aware Multimodal Transformers for TextVQA. CoRR abs/2007.12146 (2020) - [i97]Michael Cogswell, Jiasen Lu, Rishabh Jain, Stefan Lee, Devi Parikh, Dhruv Batra:
Dialog without Dialog Data: Learning Visual Dialog Agents from VQA Data. CoRR abs/2007.12750 (2020) - [i96]Samyak Datta, Oleksandr Maksymets, Judy Hoffman, Stefan Lee, Dhruv Batra, Devi Parikh:
Integrating Egocentric Localization for More Realistic Point-Goal Navigation Agents. CoRR abs/2009.03231 (2020) - [i95]Vincent Cartillier, Zhile Ren, Neha Jain, Stefan Lee, Irfan Essa, Dhruv Batra:
Semantic MapNet: Building Allocentric SemanticMaps and Representations from Egocentric Views. CoRR abs/2010.01191 (2020) - [i94]Yash Kant, Abhinav Moudgil, Dhruv Batra, Devi Parikh, Harsh Agrawal:
Contrast and Classify: Alternate Training for Robust VQA. CoRR abs/2010.06087 (2020) - [i93]Sameer Dharur, Purva Tendulkar, Dhruv Batra, Devi Parikh, Ramprasaath R. Selvaraju:
SOrT-ing VQA Models : Contrastive Gradient Learning for Improved Consistency. CoRR abs/2010.10038 (2020) - [i92]Dhruv Batra, Angel X. Chang, Sonia Chernova, Andrew J. Davison, Jia Deng, Vladlen Koltun, Sergey Levine, Jitendra Malik, Igor Mordatch, Roozbeh Mottaghi, Manolis Savva, Hao Su:
Rearrangement: A Challenge for Embodied AI. CoRR abs/2011.01975 (2020) - [i91]Peter Anderson, Ayush Shrivastava, Joanne Truong, Arjun Majumdar, Devi Parikh, Dhruv Batra, Stefan Lee:
Sim-to-Real Transfer for Vision-and-Language Navigation. CoRR abs/2011.03807 (2020) - [i90]Meera Hahn, Jacob Krantz, Dhruv Batra, Devi Parikh, James M. Rehg, Stefan Lee, Peter Anderson:
Where Are You? Localization from Embodied Dialog. CoRR abs/2011.08277 (2020) - [i89]Joanne Truong, Denis Yarats, Tianyu Li, Franziska Meier, Sonia Chernova, Dhruv Batra, Akshara Rai:
Learning Navigation Skills for Legged Robots with Learned Robot Embeddings. CoRR abs/2011.12255 (2020) - [i88]Joanne Truong, Sonia Chernova, Dhruv Batra:
Bi-directional Domain Adaptation for Sim2Real Transfer of Embodied Navigation Agents. CoRR abs/2011.12421 (2020) - [i87]Erik Wijmans, Irfan Essa, Dhruv Batra:
How to Train PointGoal Navigation Agents on a (Sample and Compute) Budget. CoRR abs/2012.06117 (2020)
2010 – 2019
- 2019
- [j13]Yash Goyal
, Tejas Khot, Aishwarya Agrawal, Douglas Summers-Stay, Dhruv Batra, Devi Parikh:
Making the V in VQA Matter: Elevating the Role of Image Understanding in Visual Question Answering. Int. J. Comput. Vis. 127(4): 398-414 (2019) - [j12]Abhishek Das
, Satwik Kottur, Khushi Gupta, Avi Singh, Deshraj Yadav
, Stefan Lee
, José M. F. Moura
, Devi Parikh
, Dhruv Batra:
Visual Dialog. IEEE Trans. Pattern Anal. Mach. Intell. 41(5): 1242-1256 (2019) - [c94]Jin-Hwa Kim, Nikita Kitaev, Xinlei Chen, Marcus Rohrbach, Byoung-Tak Zhang, Yuandong Tian, Dhruv Batra, Devi Parikh:
CoDraw: Collaborative Drawing as a Testbed for Grounded Goal-driven Communication. ACL (1) 2019: 6495-6513 - [c93]Licheng Yu, Xinlei Chen, Georgia Gkioxari, Mohit Bansal, Tamara L. Berg, Dhruv Batra:
Multi-Target Embodied Question Answering. CVPR 2019: 6309-6318 - [c92]Erik Wijmans, Samyak Datta, Oleksandr Maksymets, Abhishek Das, Georgia Gkioxari, Stefan Lee, Irfan Essa, Devi Parikh, Dhruv Batra:
Embodied Question Answering in Photorealistic Environments With Point Cloud Perception. CVPR 2019: 6659-6668 - [c91]Huda AlAmri, Vincent Cartillier, Abhishek Das, Jue Wang, Anoop Cherian, Irfan Essa, Dhruv Batra, Tim K. Marks, Chiori Hori, Peter Anderson, Stefan Lee, Devi Parikh:
Audio Visual Scene-Aware Dialog. CVPR 2019: 7558-7567 - [c90]Amanpreet Singh, Vivek Natarajan, Meet Shah, Yu Jiang, Xinlei Chen, Dhruv Batra, Devi Parikh, Marcus Rohrbach:
Towards VQA Models That Can Read. CVPR 2019: 8317-8326 - [c89]Vishvak Murahari, Prithvijit Chattopadhyay, Dhruv Batra, Devi Parikh, Abhishek Das:
Improving Generative Visual Dialog by Answering Diverse Questions. EMNLP/IJCNLP (1) 2019: 1449-1454 - [c88]Chiori Hori, Huda AlAmri, Jue Wang, Gordon Wichern, Takaaki Hori, Anoop Cherian, Tim K. Marks, Vincent Cartillier, Raphael Gontijo Lopes, Abhishek Das, Irfan Essa, Dhruv Batra, Devi Parikh:
End-to-end Audio Visual Scene-aware Dialog Using Multimodal Attention-based Video Features. ICASSP 2019: 2352-2356 - [c87]Daniel Gordon, Abhishek Kadian, Devi Parikh, Judy Hoffman, Dhruv Batra:
SplitNet: Sim2Sim and Task2Task Transfer for Embodied Visual Navigation. ICCV 2019: 1022-1031 - [c86]Jianwei Yang, Zhile Ren, Mingze Xu, Xinlei Chen, David J. Crandall, Devi Parikh, Dhruv Batra:
Embodied Amodal Recognition: Learning to Move to Perceive Objects. ICCV 2019: 2040-2050 - [c85]Ramprasaath Ramasamy Selvaraju, Stefan Lee, Yilin Shen, Hongxia Jin, Shalini Ghosh, Larry P. Heck, Dhruv Batra, Devi Parikh:
Taking a HINT: Leveraging Explanations to Make Vision and Language Models More Grounded. ICCV 2019: 2591-2600 - [c84]Jyoti Aneja, Harsh Agrawal
, Dhruv Batra, Alexander G. Schwing:
Sequential Latent Spaces for Modeling the Intention During Diverse Image Captioning. ICCV 2019: 4260-4269 - [c83]Harsh Agrawal
, Peter Anderson, Karan Desai, Yufei Wang, Xinlei Chen, Rishabh Jain, Mark Johnson, Dhruv Batra, Devi Parikh, Stefan Lee:
nocaps: novel object captioning at scale. ICCV 2019: 8947-8956 - [c82]Manolis Savva, Jitendra Malik, Devi Parikh, Dhruv Batra, Abhishek Kadian, Oleksandr Maksymets, Yili Zhao, Erik Wijmans, Bhavana Jain, Julian Straub, Jia Liu, Vladlen Koltun:
Habitat: A Platform for Embodied AI Research. ICCV 2019: 9338-9346 - [c81]Nan Rosemary Ke, Amanpreet Singh, Ahmed Touati, Anirudh Goyal, Yoshua Bengio, Devi Parikh, Dhruv Batra:
Modeling the Long Term Future in Model-Based Reinforcement Learning. ICLR (Poster) 2019 - [c80]Abhishek Das, Théophile Gervet, Joshua Romoff, Dhruv Batra, Devi Parikh, Mike Rabbat, Joelle Pineau:
TarMAC: Targeted Multi-Agent Communication. ICML 2019: 1538-1546 - [c79]Yash Goyal, Ziyan Wu, Jan Ernst, Dhruv Batra, Devi Parikh, Stefan Lee:
Counterfactual Visual Explanations. ICML 2019: 2376-2384 - [c78]Ashwin Kalyan, Peter Anderson, Stefan Lee, Dhruv Batra:
Trainable Decoding of Sets of Sequences for Neural Sequence Models. ICML 2019: 3211-3221 - [c77]Ramakrishna Vedantam, Karan Desai, Stefan Lee, Marcus Rohrbach, Dhruv Batra, Devi Parikh:
Probabilistic Neural Symbolic Models for Interpretable Visual Question Answering. ICML 2019: 6428-6437 - [c76]Satwik Kottur, José M. F. Moura, Devi Parikh, Dhruv Batra, Marcus Rohrbach:
CLEVR-Dialog: A Diagnostic Dataset for Multi-Round Reasoning in Visual Dialog. NAACL-HLT (1) 2019: 582-595 - [c75]Jiasen Lu, Dhruv Batra, Devi Parikh, Stefan Lee:
ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations for Vision-and-Language Tasks. NeurIPS 2019: 13-23 - [c74]Peter Anderson, Ayush Shrivastava, Devi Parikh, Dhruv Batra, Stefan Lee:
Chasing Ghosts: Instruction Following as Bayesian State Tracking. NeurIPS 2019: 369-379 - [i86]Koichiro Yoshino, Chiori Hori, Julien Perez, Luis Fernando D'Haro, Lazaros Polymenakos, R. Chulaka Gunasekara, Walter S. Lasecki, Jonathan K. Kummerfeld, Michel Galley, Chris Brockett, Jianfeng Gao, Bill Dolan, Xiang Gao, Huda AlAmri, Tim K. Marks, Devi Parikh, Dhruv Batra:
Dialog System Technology Challenge 7. CoRR abs/1901.03461 (2019) - [i85]Abhishek Das, Devi Parikh, Dhruv Batra:
Response to "Visual Dialogue without Vision or Dialogue" (Massiceti et al., 2018). CoRR abs/1901.05531 (2019) - [i84]Huda AlAmri, Vincent Cartillier, Abhishek Das, Jue Wang, Stefan Lee, Peter Anderson, Irfan Essa, Devi Parikh, Dhruv Batra, Anoop Cherian, Tim K. Marks, Chiori Hori:
Audio-Visual Scene-Aware Dialog. CoRR abs/1901.09107 (2019) - [i83]Devendra Singh Chaplot, Lisa Lee
, Ruslan Salakhutdinov, Devi Parikh, Dhruv Batra:
Embodied Multimodal Multitask Learning. CoRR abs/1902.01385 (2019) - [i82]Deshraj Yadav, Rishabh Jain, Harsh Agrawal
, Prithvijit Chattopadhyay, Taranjeet Singh, Akash Jain, Shivkaran Singh, Stefan Lee, Dhruv Batra:
EvalAI: Towards Better Evaluation Systems for AI Agents. CoRR abs/1902.03570 (2019) - [i81]Ramprasaath R. Selvaraju, Stefan Lee, Yilin Shen, Hongxia Jin, Dhruv Batra, Devi Parikh:
Taking a HINT: Leveraging Explanations to Make Vision and Language Models More Grounded. CoRR abs/1902.03751 (2019) - [i80]Ramakrishna Vedantam, Karan Desai, Stefan Lee, Marcus Rohrbach, Dhruv Batra, Devi Parikh:
Probabilistic Neural-symbolic Models for Interpretable Visual Question Answering. CoRR abs/1902.07864 (2019) - [i79]Nan Rosemary Ke, Amanpreet Singh, Ahmed Touati, Anirudh Goyal, Yoshua Bengio, Devi Parikh, Dhruv Batra:
Learning Dynamics Model in Reinforcement Learning by Incorporating the Long Term Future. CoRR abs/1903.01599 (2019) - [i78]Satwik Kottur, José M. F. Moura, Devi Parikh, Dhruv Batra, Marcus Rohrbach:
CLEVR-Dialog: A Diagnostic Dataset for Multi-Round Reasoning in Visual Dialog. CoRR abs/1903.03166 (2019) - [i77]Manolis Savva, Abhishek Kadian, Oleksandr Maksymets, Yili Zhao, Erik Wijmans, Bhavana Jain, Julian Straub, Jia Liu, Vladlen Koltun, Jitendra Malik, Devi Parikh, Dhruv Batra:
Habitat: A Platform for Embodied AI Research. CoRR abs/1904.01201 (2019) - [i76]Erik Wijmans, Samyak Datta, Oleksandr Maksymets, Abhishek Das, Georgia Gkioxari, Stefan Lee, Irfan Essa, Devi Parikh, Dhruv Batra:
Embodied Question Answering in Photorealistic Environments with Point Cloud Perception. CoRR abs/1904.03461 (2019) - [i75]Jianwei Yang, Zhile Ren, Mingze Xu, Xinlei Chen, David J. Crandall, Devi Parikh, Dhruv Batra:
Embodied Visual Recognition. CoRR abs/1904.04404 (2019) - [i74]Licheng Yu, Xinlei Chen, Georgia Gkioxari, Mohit Bansal, Tamara L. Berg, Dhruv Batra:
Multi-Target Embodied Question Answering. CoRR abs/1904.04686 (2019) - [i73]Yash Goyal, Ziyan Wu, Jan Ernst
, Dhruv Batra, Devi Parikh, Stefan Lee:
Counterfactual Visual Explanations. CoRR abs/1904.07451 (2019) - [i72]Amanpreet Singh, Vivek Natarajan, Meet Shah, Yu Jiang, Xinlei Chen, Dhruv Batra, Devi Parikh, Marcus Rohrbach:
Towards VQA Models that can Read. CoRR abs/1904.08920 (2019) - [i71]Michael Cogswell, Jiasen Lu, Stefan Lee, Devi Parikh, Dhruv Batra:
Emergence of Compositional Language with Deep Generational Transmission. CoRR abs/1904.09067 (2019) - [i70]Daniel Gordon, Abhishek Kadian, Devi Parikh, Judy Hoffman, Dhruv Batra:
SplitNet: Sim2Sim and Task2Task Transfer for Embodied Visual Navigation. CoRR abs/1905.07512 (2019) - [i69]Julian Straub, Thomas Whelan, Lingni Ma, Yufan Chen, Erik Wijmans, Simon Green, Jakob J. Engel, Raul Mur-Artal, Carl Ren, Shobhit Verma, Anton Clarkson, Mingfei Yan, Brian Budge, Yajie Yan, Xiaqing Pan, June Yon, Yuyang Zou, Kimberly Leon, Nigel Carter, Jesus Briales, Tyler Gillingham, Elias Mueggler, Luis Pesqueira, Manolis Savva, Dhruv Batra, Hauke M. Strasdat, Renzo De Nardi, Michael Goesele, Steven Lovegrove, Richard A. Newcombe:
The Replica Dataset: A Digital Replica of Indoor Spaces. CoRR abs/1906.05797 (2019) - [i68]Peter Anderson, Ayush Shrivastava, Devi Parikh, Dhruv Batra, Stefan Lee:
Chasing Ghosts: Instruction Following as Bayesian State Tracking. CoRR abs/1907.02022 (2019) - [i67]Nirbhay Modhe, Prithvijit Chattopadhyay, Mohit Sharma, Abhishek Das, Devi Parikh, Dhruv Batra, Ramakrishna Vedantam:
Unsupervised Discovery of Decision States for Transfer in Reinforcement Learning. CoRR abs/1907.10580 (2019) - [i66]Jiasen Lu, Dhruv Batra, Devi Parikh, Stefan Lee:
ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations for Vision-and-Language Tasks. CoRR abs/1908.02265 (2019) - [i65]Jyoti Aneja, Harsh Agrawal
, Dhruv Batra, Alexander G. Schwing:
Sequential Latent Spaces for Modeling the Intention During Diverse Image Captioning. CoRR abs/1908.08529 (2019) - [i64]Vishvak Murahari, Prithvijit Chattopadhyay, Dhruv Batra, Devi Parikh, Abhishek Das:
Improving Generative Visual Dialog by Answering Diverse Questions. CoRR abs/1909.10470 (2019) - [i63]Erik Wijmans, Abhishek Kadian, Ari Morcos, Stefan Lee, Irfan Essa, Devi Parikh, Manolis Savva, Dhruv Batra:
Decentralized Distributed PPO: Solving PointGoal Navigation. CoRR abs/1911.00357 (2019) - [i62]Vishvak Murahari, Dhruv Batra, Devi Parikh, Abhishek Das:
Large-scale Pretraining for Visual Dialog: A Simple State-of-the-Art Baseline. CoRR abs/1912.02379 (2019) - [i61]Abhishek Kadian, Joanne Truong, Aaron Gokaslan, Alexander Clegg, Erik Wijmans, Stefan Lee, Manolis Savva, Sonia Chernova, Dhruv Batra:
Are We Making Real Progress in Simulated Environments? Measuring the Sim2Real Gap in Embodied Visual Navigation. CoRR abs/1912.06321 (2019) - 2018
- [c73]Ashwin K. Vijayakumar, Michael Cogswell, Ramprasaath R. Selvaraju, Qing Sun, Stefan Lee, David J. Crandall, Dhruv Batra:
Diverse Beam Search for Improved Description of Complex Scenes. AAAI 2018: 7371-7379 - [c72]Abhishek Das, Georgia Gkioxari, Stefan Lee, Devi Parikh, Dhruv Batra:
Neural Modular Control for Embodied Question Answering. CoRL 2018: 53-62 - [c71]Jianwei Yang, Jiasen Lu, Stefan Lee, Dhruv Batra, Devi Parikh:
Visual Curiosity: Learning to Ask Questions to Learn Visual Recognition. CoRL 2018: 63-80 - [c70]Abhishek Das, Samyak Datta, Georgia Gkioxari, Stefan Lee, Devi Parikh, Dhruv Batra:
Embodied Question Answering. CVPR 2018: 1-10 - [c69]Abhishek Das, Samyak Datta, Georgia Gkioxari, Stefan Lee, Devi Parikh, Dhruv Batra:
Embodied Question Answering. CVPR Workshops 2018: 2054-2063 - [c68]Aishwarya Agrawal, Dhruv Batra, Devi Parikh, Aniruddha Kembhavi:
Don't Just Assume; Look and Answer: Overcoming Priors for Visual Question Answering. CVPR 2018: 4971-4980 - [c67]Jiasen Lu, Jianwei Yang, Dhruv Batra, Devi Parikh:
Neural Baby Talk. CVPR 2018: 7219-7228 - [c66]Satwik Kottur, José M. F. Moura, Devi Parikh, Dhruv Batra, Marcus Rohrbach:
Visual Coreference Resolution in Visual Dialog Using Neural Module Networks. ECCV (15) 2018: 160-178 - [c65]Ramprasaath R. Selvaraju, Prithvijit Chattopadhyay, Mohamed Elhoseiny, Tilak Sharma, Dhruv Batra, Devi Parikh, Stefan Lee:
Choose Your Neuron: Incorporating Domain Knowledge Through Neuron-Importance. ECCV (13) 2018: 540-556 - [c64]Jianwei Yang
, Jiasen Lu, Stefan Lee, Dhruv Batra, Devi Parikh:
Graph R-CNN for Scene Graph Generation. ECCV (1) 2018: 690-706 - [c63]Ashwin Kalyan, Abhishek Mohta, Oleksandr Polozov, Dhruv Batra, Prateek Jain, Sumit Gulwani:
Neural-Guided Deductive Search for Real-Time Program Synthesis from Examples. ICLR (Poster) 2018 - [c62]Ashwin Kalyan, Stefan Lee, Anitha Kannan, Dhruv Batra:
Learn from Your Neighbor: Learning Multi-modal Mappings from Sparse Annotations. ICML 2018: 2454-2463 - [i60]Jiasen Lu, Jianwei Yang, Dhruv Batra, Devi Parikh:
Neural Baby Talk. CoRR abs/1803.09845 (2018) - [i59]Ashwin J. Vijayakumar, Abhishek Mohta, Oleksandr Polozov, Dhruv Batra, Prateek Jain, Sumit Gulwani:
Neural-Guided Deductive Search for Real-Time Program Synthesis from Examples. CoRR abs/1804.01186 (2018) - [i58]Huda AlAmri, Vincent Cartillier, Raphael Gontijo Lopes, Abhishek Das, Jue Wang, Irfan Essa, Dhruv Batra, Devi Parikh, Anoop Cherian, Tim K. Marks, Chiori Hori:
Audio Visual Scene-Aware Dialog (AVSD) Challenge at DSTC7. CoRR abs/1806.00525 (2018) - [i57]Ashwin Kalyan, Stefan Lee, Anitha Kannan, Dhruv Batra:
Learn from Your Neighbor: Learning Multi-modal Mappings from Sparse Annotations. CoRR abs/1806.02934 (2018) - [i56]Chiori Hori, Huda AlAmri, Jue Wang, Gordon Wichern, Takaaki Hori, Anoop Cherian, Tim K. Marks, Vincent Cartillier, Raphael Gontijo Lopes, Abhishek Das, Irfan Essa, Dhruv Batra, Devi Parikh:
End-to-End Audio Visual Scene-Aware Dialog using Multimodal Attention-Based Video Features. CoRR abs/1806.08409 (2018) - [i55]Harm de Vries, Kurt Shuster, Dhruv Batra, Devi Parikh, Jason Weston, Douwe Kiela:
Talk the Walk: Navigating New York City through Grounded Dialogue. CoRR abs/1807.03367 (2018) - [i54]Yu Jiang, Vivek Natarajan, Xinlei Chen, Marcus Rohrbach, Dhruv Batra, Devi Parikh:
Pythia v0.1: the Winning Entry to the VQA Challenge 2018. CoRR abs/1807.09956 (2018) - [i53]Jianwei Yang, Jiasen Lu, Stefan Lee, Dhruv Batra, Devi Parikh:
Graph R-CNN for Scene Graph Generation. CoRR abs/1808.00191 (2018) - [i52]Ramprasaath R. Selvaraju, Prithvijit Chattopadhyay, Mohamed Elhoseiny, Tilak Sharma, Dhruv Batra, Devi Parikh, Stefan Lee:
Choose Your Neuron: Incorporating Domain Knowledge through Neuron-Importance. CoRR abs/1808.02861 (2018) - [i51]Satwik Kottur, José M. F. Moura, Devi Parikh, Dhruv Batra, Marcus Rohrbach:
Visual Coreference Resolution in Visual Dialog using Neural Module Networks. CoRR abs/1809.01816 (2018) - [i50]Jianwei Yang, Jiasen Lu, Stefan Lee, Dhruv Batra, Devi Parikh:
Visual Curiosity: Learning to Ask Questions to Learn Visual Recognition. CoRR abs/1810.00912 (2018) - [i49]Abhishek Das, Georgia Gkioxari, Stefan Lee, Devi Parikh, Dhruv Batra:
Neural Modular Control for Embodied Question Answering. CoRR abs/1810.11181 (2018) - [i48]Abhishek Das, Théophile Gervet, Joshua Romoff, Dhruv Batra, Devi Parikh, Michael G. Rabbat, Joelle Pineau:
TarMAC: Targeted Multi-Agent Communication. CoRR abs/1810.11187 (2018) - [i47]Utsav Garg, Viraj Prabhu, Deshraj Yadav, Ram Ramrakhya, Harsh Agrawal
, Dhruv Batra:
Fabrik: An Online Collaborative Neural Network Editor. CoRR abs/1810.11649 (2018) - [i46]Harsh Agrawal, Karan Desai, Yufei Wang, Xinlei Chen, Rishabh Jain, Mark Johnson, Dhruv Batra, Devi Parikh, Stefan Lee, Peter Anderson:
nocaps: novel object captioning at scale. CoRR abs/1812.08658 (2018) - 2017
- [j11]Abhishek Das, Harsh Agrawal
, Larry Zitnick, Devi Parikh, Dhruv Batra:
Human Attention in Visual Question Answering: Do Humans and Deep Networks Look at the Same Regions? Comput. Vis. Image Underst. 163: 90-100 (2017) - [j10]Gordon Christie, Ankit Laddha, Aishwarya Agrawal, Stanislaw Antol, Yash Goyal, Kevin Kochersberger, Dhruv Batra:
Resolving vision and language ambiguities together: Joint segmentation & prepositional attachment resolution in captioned scenes. Comput. Vis. Image Underst. 163: 101-112 (2017) - [j9]Aishwarya Agrawal
, Jiasen Lu, Stanislaw Antol, Margaret Mitchell, C. Lawrence Zitnick, Devi Parikh, Dhruv Batra:
VQA: Visual Question Answering - www.visualqa.org. Int. J. Comput. Vis. 123(1): 4-31 (2017) - [j8]Vittal Premachandran, Daniel Tarlow, Alan L. Yuille, Dhruv Batra:
Empirical Minimum Bayes Risk Prediction. IEEE Trans. Pattern Anal. Mach. Intell. 39(1): 75-86 (2017) - [c61]Abhishek Das, Satwik Kottur, Khushi Gupta, Avi Singh, Deshraj Yadav, José M. F. Moura, Devi Parikh, Dhruv Batra:
Visual Dialog. CVPR 2017: 1080-1089 - [c60]Prithvijit Chattopadhyay, Ramakrishna Vedantam, Ramprasaath R. Selvaraju, Dhruv Batra, Devi Parikh:
Counting Everyday Objects in Everyday Scenes. CVPR 2017: 4428-4437 - [c59]Yash Goyal, Tejas Khot, Douglas Summers-Stay, Dhruv Batra, Devi Parikh:
Making the V in VQA Matter: Elevating the Role of Image Understanding in Visual Question Answering. CVPR 2017: 6325-6334 - [c58]Qing Sun, Stefan Lee, Dhruv Batra:
Bidirectional Beam Search: Forward-Backward Inference in Neural Sequence Models for Fill-in-the-Blank Image Captioning. CVPR 2017: 7215-7223 - [c57]