default search action
Irfan A. Essa
Irfan Essa
Person information
- affiliation: Georgia Institute of Technology, Atlanta GA, USA
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2024
- [j26]Harish Haresamudram, Irfan Essa, Thomas Plötz:
Towards Learning Discrete Representations via Self-Supervision for Wearables-Based Human Activity Recognition. Sensors 24(4): 1238 (2024) - [c162]Harish Haresamudram, Irfan Essa, Thomas Plötz:
A Washing Machine is All You Need? On the Feasibility of Machine Data for Self-Supervised Human Activity Recognition. ABC 2024: 1-10 - [c161]Vincent Cartillier, Grant Schindler, Irfan Essa:
SLAIM: Robust Dense Neural SLAM for Online Tracking and Mapping. CVPR Workshops 2024: 2862-2871 - [c160]Xingqian Xu, Jiayi Guo, Zhangyang Wang, Gao Huang, Irfan Essa, Humphrey Shi:
Prompt-Free Diffusion: Taking "Text" Out of Text-to-Image Diffusion Models. CVPR 2024: 8682-8692 - [c159]Agrim Gupta, Lijun Yu, Kihyuk Sohn, Xiuye Gu, Meera Hahn, Fei-Fei Li, Irfan Essa, Lu Jiang, José Lezama:
Photorealistic Video Generation with Diffusion Models. ECCV (79) 2024: 393-411 - [c158]Seung Hyun Lee, Yinxiao Li, Junjie Ke, Innfarn Yoo, Han Zhang, Jiahui Yu, Qifei Wang, Fei Deng, Glenn Entis, Junfeng He, Gang Li, Sangpil Kim, Irfan Essa, Feng Yang:
Parrot: Pareto-Optimal Multi-reward Reinforcement Learning Framework for Text-to-Image Generation. ECCV (38) 2024: 462-478 - [c157]Lijun Yu, José Lezama, Nitesh Bharadwaj Gundavarapu, Luca Versari, Kihyuk Sohn, David Minnen, Yong Cheng, Agrim Gupta, Xiuye Gu, Alexander G. Hauptmann, Boqing Gong, Ming-Hsuan Yang, Irfan Essa, David A. Ross, Lu Jiang:
Language Model Beats Diffusion - Tokenizer is key to visual generation. ICLR 2024 - [c156]Dan Kondratyuk, Lijun Yu, Xiuye Gu, José Lezama, Jonathan Huang, Grant Schindler, Rachel Hornung, Vighnesh Birodkar, Jimmy Yan, Ming-Chang Chiu, Krishna Somandepalli, Hassan Akbari, Yair Alon, Yong Cheng, Joshua V. Dillon, Agrim Gupta, Meera Hahn, Anja Hauth, David Hendon, Alonso Martinez, David Minnen, Mikhail Sirotenko, Kihyuk Sohn, Xuan Yang, Hartwig Adam, Ming-Hsuan Yang, Irfan Essa, Huisheng Wang, David A. Ross, Bryan Seybold, Lu Jiang:
VideoPoet: A Large Language Model for Zero-Shot Video Generation. ICML 2024 - [c155]Tianle Huang, Nitish Sontakke, K. Niranjan Kumar, Irfan Essa, Stefanos Nikolaidis, Dennis W. Hong, Sehoon Ha:
BayRnTune: Adaptive Bayesian Domain Randomization via Strategic Fine-tuning. IROS 2024: 670-676 - [i84]Seung Hyun Lee, Yinxiao Li, Junjie Ke, Innfarn Yoo, Han Zhang, Jiahui Yu, Qifei Wang, Fei Deng, Glenn Entis, Junfeng He, Gang Li, Sangpil Kim, Irfan Essa, Feng Yang:
Parrot: Pareto-optimal Multi-Reward Reinforcement Learning Framework for Text-to-Image Generation. CoRR abs/2401.05675 (2024) - [i83]Apoorva Beedu, Karan Samel, Irfan Essa:
On the Efficacy of Text-Based Input Modalities for Action Anticipation. CoRR abs/2401.12972 (2024) - [i82]Vincent Cartillier, Neha Jain, Irfan Essa:
3D Semantic MapNet: Building Maps for Multi-Object Re-Identification in 3D. CoRR abs/2403.13190 (2024) - [i81]Vincent Cartillier, Grant Schindler, Irfan Essa:
SLAIM: Robust Dense Neural SLAM for Online Tracking and Mapping. CoRR abs/2404.11419 (2024) - [i80]Andrew Marmon, Grant Schindler, José Lezama, Dan Kondratyuk, Bryan Seybold, Irfan Essa:
CamViG: Camera Aware Image-to-Video Generation with Multimodal Transformers. CoRR abs/2405.13195 (2024) - [i79]Seung Hyun Lee, Junjie Ke, Yinxiao Li, Junfeng He, Steven Hickson, Katie Datsenko, Sangpil Kim, Ming-Hsuan Yang, Irfan Essa, Feng Yang:
Cropper: Vision-Language Model for Image Cropping through In-Context Learning. CoRR abs/2408.07790 (2024) - [i78]Harish Haresamudram, Apoorva Beedu, Mashfiqui Rabbi, Sankalita Saha, Irfan Essa, Thomas Ploetz:
Limitations in Employing Natural Language Supervision for Sensor-Based Human Activity Recognition - And Ways to Overcome Them. CoRR abs/2408.12023 (2024) - [i77]Zhikang Dong, Apoorva Beedu, Jason Sheinkopf, Irfan Essa:
Mamba Fusion: Learning Actions Through Questioning. CoRR abs/2409.11513 (2024) - [i76]Karan Samel, Apoorva Beedu, Nitish Sontakke, Irfan Essa:
Exploring Efficient Foundational Multi-modal Models for Video Summarization. CoRR abs/2410.07405 (2024) - [i75]Tobi Olatunji, Charles Nimo, Abraham Toluwase Owodunni, Tassallah Abdullahi, Emmanuel Ayodele, Mardhiyah Sanni, Chinemelu Aka, Folafunmi Omofoye, Foutse Yuehgoh, Timothy Faniran, Bonaventure F. P. Dossou, Moshood Yekini, Jonas Kemp, Katherine A. Heller, Jude Chidubem Omeke, Chidi Asuzu MD, Naome A. Etori, Aimérou Ndiaye, Ifeoma Okoh, Evans Doe Ocansey, Wendy Kinara, Michael Best, Irfan Essa, Stephen Edward Moore, Chris Fourie, Mercy Nyamewaa Asiedu:
AfriMed-QA: A Pan-African, Multi-Specialty, Medical Question-Answering Benchmark Dataset. CoRR abs/2411.15640 (2024) - 2023
- [j25]Erik Wijmans, Manolis Savva, Irfan Essa, Stefan Lee, Ari S. Morcos, Dhruv Batra:
Emergence of Maps in the Memories of Blind Navigation Agents. AI Matters 9(2): 8-14 (2023) - [j24]K. Niranjan Kumar, Irfan Essa, Sehoon Ha:
Cascaded Compositional Residual Learning for Complex Interactive Behaviors. IEEE Robotics Autom. Lett. 8(8): 4601-4608 (2023) - [c154]Karan Samel, Jun Ma, Zhengyang Wang, Tong Zhao, Irfan Essa:
Integrating Noisy Knowledge into Language Representations for E-Commerce Applications. IEEE Big Data 2023: 548-553 - [c153]Vighnesh Birodkar, Jonathan Huang, Meera Hahn, Irfan Essa, Nikolai Warner:
Text and Click inputs for unambiguous open vocabulary instance segmentation. BMVC 2023: 815-819 - [c152]Yi-Hao Peng, Peggy Chi, Anjuli Kannan, Meredith Ringel Morris, Irfan Essa:
Slide Gestalt: Automatic Structure Extraction in Slide Decks for Non-Visual Access. CHI 2023: 829:1-829:14 - [c151]Dina Bashkirova, José Lezama, Kihyuk Sohn, Kate Saenko, Irfan Essa:
MaskSketch: Unpaired Structure-guided Masked Image Generation. CVPR 2023: 1879-1889 - [c150]Lijun Yu, Yong Cheng, Kihyuk Sohn, José Lezama, Han Zhang, Huiwen Chang, Alexander G. Hauptmann, Ming-Hsuan Yang, Yuan Hao, Irfan Essa, Lu Jiang:
MAGVIT: Masked Generative Video Transformer. CVPR 2023: 10459-10469 - [c149]Kihyuk Sohn, Huiwen Chang, José Lezama, Luisa Polania, Han Zhang, Yuan Hao, Irfan Essa, Lu Jiang:
Visual Prompt Tuning for Generative Transfer Learning. CVPR 2023: 19840-19851 - [c148]José Lezama, Tim Salimans, Lu Jiang, Huiwen Chang, Jonathan Ho, Irfan Essa:
Discrete Predictor-Corrector Diffusion Models for Image Synthesis. ICLR 2023 - [c147]Erik Wijmans, Manolis Savva, Irfan Essa, Stefan Lee, Ari S. Morcos, Dhruv Batra:
Emergence of Maps in the Memories of Blind Navigation Agents. ICLR 2023 - [c146]Kihyuk Sohn, Lu Jiang, Jarred Barber, Kimin Lee, Nataniel Ruiz, Dilip Krishnan, Huiwen Chang, Yuanzhen Li, Irfan Essa, Michael Rubinstein, Yuan Hao, Glenn Entis, Irina Blok, Daniel Castro Chin:
StyleDrop: Text-to-Image Synthesis of Any Style. NeurIPS 2023 - [c145]Lijun Yu, Yong Cheng, Zhiruo Wang, Vivek Kumar, Wolfgang Macherey, Yanping Huang, David A. Ross, Irfan Essa, Yonatan Bisk, Ming-Hsuan Yang, Kevin P. Murphy, Alexander G. Hauptmann, Lu Jiang:
SPAE: Semantic Pyramid AutoEncoder for Multimodal Generation with Frozen LLMs. NeurIPS 2023 - [c144]Harish Haresamudram, Irfan Essa, Thomas Plötz:
Investigating Enhancements to Contrastive Predictive Coding for Human Activity Recognition. PERCOM 2023: 232-241 - [i74]Erik Wijmans, Manolis Savva, Irfan Essa, Stefan Lee, Ari S. Morcos, Dhruv Batra:
Emergence of Maps in the Memories of Blind Navigation Agents. CoRR abs/2301.13261 (2023) - [i73]Dina Bashkirova, José Lezama, Kihyuk Sohn, Kate Saenko, Irfan Essa:
MaskSketch: Unpaired Structure-guided Masked Image Generation. CoRR abs/2302.05496 (2023) - [i72]Daniel Nkemelu, Harshil Shah, Irfan Essa, Michael L. Best:
Tackling Hate Speech in Low-resource Languages with Context Experts. CoRR abs/2303.16828 (2023) - [i71]Xingqian Xu, Jiayi Guo, Zhangyang Wang, Gao Huang, Irfan Essa, Humphrey Shi:
Prompt-Free Diffusion: Taking "Text" out of Text-to-Image Diffusion Models. CoRR abs/2305.16223 (2023) - [i70]Kihyuk Sohn, Albert E. Shaw, Yuan Hao, Han Zhang, Luisa Polania, Huiwen Chang, Lu Jiang, Irfan Essa:
Learning Disentangled Prompts for Compositional Image Synthesis. CoRR abs/2306.00763 (2023) - [i69]Kihyuk Sohn, Nataniel Ruiz, Kimin Lee, Daniel Castro Chin, Irina Blok, Huiwen Chang, Jarred Barber, Lu Jiang, Glenn Entis, Yuanzhen Li, Yuan Hao, Irfan Essa, Michael Rubinstein, Dilip Krishnan:
StyleDrop: Text-to-Image Generation in Any Style. CoRR abs/2306.00983 (2023) - [i68]Harish Haresamudram, Irfan Essa, Thomas Ploetz:
Towards Learning Discrete Representations via Self-Supervision for Wearables-Based Human Activity Recognition. CoRR abs/2306.01108 (2023) - [i67]Lijun Yu, Yong Cheng, Zhiruo Wang, Vivek Kumar, Wolfgang Macherey, Yanping Huang, David A. Ross, Irfan Essa, Yonatan Bisk, Ming-Hsuan Yang, Kevin Murphy, Alexander G. Hauptmann, Lu Jiang:
SPAE: Semantic Pyramid AutoEncoder for Multimodal Generation with Frozen LLMs. CoRR abs/2306.17842 (2023) - [i66]Hyeongju Choi, Apoorva Beedu, Irfan Essa:
Multimodal Contrastive Learning with Hard Negative Sampling for Human Activity Recognition. CoRR abs/2309.01262 (2023) - [i65]Daniel Nkemelu, Peggy Chi, Daniel Castro Chin, Krishna Srinivasan, Irfan Essa:
Automatic Multi-Path Web Story Creation from a Structural Article. CoRR abs/2310.02383 (2023) - [i64]Lijun Yu, José Lezama, Nitesh Bharadwaj Gundavarapu, Luca Versari, Kihyuk Sohn, David Minnen, Yong Cheng, Agrim Gupta, Xiuye Gu, Alexander G. Hauptmann, Boqing Gong, Ming-Hsuan Yang, Irfan Essa, David A. Ross, Lu Jiang:
Language Model Beats Diffusion - Tokenizer is Key to Visual Generation. CoRR abs/2310.05737 (2023) - [i63]K. Niranjan Kumar, Irfan Essa, Sehoon Ha:
Words into Action: Learning Diverse Humanoid Robot Behaviors using Language Guided Iterative Motion Refinement. CoRR abs/2310.06226 (2023) - [i62]Tianle Huang, Nitish Sontakke, K. Niranjan Kumar, Irfan Essa, Stefanos Nikolaidis, Dennis W. Hong, Sehoon Ha:
BayRnTune: Adaptive Bayesian Domain Randomization via Strategic Fine-tuning. CoRR abs/2310.10606 (2023) - [i61]Nikolai Warner, Meera Hahn, Jonathan Huang, Irfan Essa, Vighnesh Birodkar:
Text and Click inputs for unambiguous open vocabulary instance segmentation. CoRR abs/2311.14822 (2023) - [i60]Agrim Gupta, Lijun Yu, Kihyuk Sohn, Xiuye Gu, Meera Hahn, Li Fei-Fei, Irfan Essa, Lu Jiang, José Lezama:
Photorealistic Video Generation with Diffusion Models. CoRR abs/2312.06662 (2023) - [i59]Dan Kondratyuk, Lijun Yu, Xiuye Gu, José Lezama, Jonathan Huang, Rachel Hornung, Hartwig Adam, Hassan Akbari, Yair Alon, Vighnesh Birodkar, Yong Cheng, Ming-Chang Chiu, Joshua V. Dillon, Irfan Essa, Agrim Gupta, Meera Hahn, Anja Hauth, David Hendon, Alonso Martinez, David Minnen, David A. Ross, Grant Schindler, Mikhail Sirotenko, Kihyuk Sohn, Krishna Somandepalli, Huisheng Wang, Jimmy Yan, Ming-Hsuan Yang, Xuan Yang, Bryan Seybold, Lu Jiang:
VideoPoet: A Large Language Model for Zero-Shot Video Generation. CoRR abs/2312.14125 (2023) - 2022
- [j23]Harish Haresamudram, Irfan Essa, Thomas Plötz:
Assessing the State of Self-Supervised Human Activity Recognition Using Wearables. Proc. ACM Interact. Mob. Wearable Ubiquitous Technol. 6(3): 116:1-116:47 (2022) - [c143]Erik Wijmans, Irfan Essa, Dhruv Batra:
How to Train PointGoal Navigation Agents on a (Sample and Compute) Budget. AAMAS 2022: 1762-1764 - [c142]José Lezama, Huiwen Chang, Lu Jiang, Irfan Essa:
Improved Masked Image Generation with Token-Critic. ECCV (23) 2022: 70-86 - [c141]Xiang Kong, Lu Jiang, Huiwen Chang, Han Zhang, Yuan Hao, Haifeng Gong, Irfan Essa:
BLT: Bidirectional Layout Transformer for Controllable Layout Generation. ECCV (17) 2022: 474-490 - [c140]Chengzhi Mao, Lu Jiang, Mostafa Dehghani, Carl Vondrick, Rahul Sukthankar, Irfan Essa:
Discrete Representations Strengthen Vision Transformer Robustness. ICLR 2022 - [c139]K. Niranjan Kumar, Irfan Essa, Sehoon Ha:
Graph-based Cluttered Scene Generation and Interactive Exploration using Deep Reinforcement Learning. ICRA 2022: 7521-7527 - [c138]Erik Wijmans, Irfan Essa, Dhruv Batra:
VER: Scaling On-Policy RL Leads to the Emergence of Navigation in Embodied Rearrangement. NeurIPS 2022 - [c137]Peggy Chi, Tao Dong, Christian Früh, Brian Colonna, Vivek Kwatra, Irfan Essa:
Synthesis-Assisted Video Prototyping From a Document. UIST 2022: 16:1-16:10 - [c136]Steven Hickson, Karthik Raveendran, Irfan A. Essa:
Sharing Decoders: Network Fission for Multi-task Pixel Prediction. WACV 2022: 3655-3664 - [i58]Karan Samel, Zelin Zhao, Binghong Chen, Shuang Li, Dharmashankar Subramanian, Irfan Essa, Le Song:
Learning Temporal Rules from Noisy Timeseries Data. CoRR abs/2202.05403 (2022) - [i57]Harish Haresamudram, Irfan Essa, Thomas Plötz:
Assessing the State of Self-Supervised Human Activity Recognition using Wearables. CoRR abs/2202.12938 (2022) - [i56]José Lezama, Huiwen Chang, Lu Jiang, Irfan Essa:
Improved Masked Image Generation with Token-Critic. CoRR abs/2209.04439 (2022) - [i55]Kihyuk Sohn, Yuan Hao, José Lezama, Luisa Polania, Huiwen Chang, Han Zhang, Irfan Essa, Lu Jiang:
Visual Prompt Tuning for Generative Transfer Learning. CoRR abs/2210.00990 (2022) - [i54]Erik Wijmans, Irfan Essa, Dhruv Batra:
VER: Scaling On-Policy RL Leads to the Emergence of Navigation in Embodied Rearrangement. CoRR abs/2210.05064 (2022) - [i53]Daniel Scarafoni, Irfan Essa, Thomas Ploetz:
Finding Islands of Predictability in Action Forecasting. CoRR abs/2210.07354 (2022) - [i52]Apoorva Beedu, Huda AlAmri, Irfan Essa:
Video based Object 6D Pose Estimation using Transformers. CoRR abs/2210.13540 (2022) - [i51]Huda AlAmri, Anthony Bilic, Michael Hu, Apoorva Beedu, Irfan Essa:
End-to-End Multimodal Representation Learning for Video Dialog. CoRR abs/2210.14512 (2022) - [i50]Hyeongju Choi, Apoorva Beedu, Harish Haresamudram, Irfan Essa:
Multi-Stage Based Feature Fusion of Multi-Modal Data for Human Activity Recognition. CoRR abs/2211.04331 (2022) - [i49]Harish Haresamudram, Irfan Essa, Thomas Ploetz:
Investigating Enhancements to Contrastive Predictive Coding for Human Activity Recognition. CoRR abs/2211.06173 (2022) - [i48]Lijun Yu, Yong Cheng, Kihyuk Sohn, José Lezama, Han Zhang, Huiwen Chang, Alexander G. Hauptmann, Ming-Hsuan Yang, Yuan Hao, Irfan Essa, Lu Jiang:
MAGVIT: Masked Generative Video Transformer. CoRR abs/2212.05199 (2022) - [i47]K. Niranjan Kumar, Irfan Essa, Sehoon Ha:
Cascaded Compositional Residual Learning for Complex Interactive Behaviors. CoRR abs/2212.08954 (2022) - 2021
- [j22]Harish Haresamudram, Irfan A. Essa, Thomas Plötz:
Contrastive Predictive Coding for Human Activity Recognition. Proc. ACM Interact. Mob. Wearable Ubiquitous Technol. 5(2): 65:1-65:26 (2021) - [c135]Vincent Cartillier, Zhile Ren, Neha Jain, Stefan Lee, Irfan Essa, Dhruv Batra:
Semantic MapNet: Building Allocentric Semantic Maps and Representations from Egocentric Views. AAAI 2021: 964-972 - [c134]A. J. Piergiovanni, Anelia Angelova, Michael S. Ryoo, Irfan Essa:
Unsupervised Discovery of Actions in Instructional Videos. BMVC 2021: 283 - [c133]Anh Truong, Peggy Chi, David Salesin, Irfan Essa, Maneesh Agrawala:
Automatic Generation of Two-Level Hierarchical Tutorials from Instructional Makeup Videos. CHI 2021: 108:1-108:16 - [c132]Tianhao Zhang, Hung-Yu Tseng, Lu Jiang, Weilong Yang, Honglak Lee, Irfan Essa:
Text as Neural Operator: Image Manipulation by Text Instruction. ACM Multimedia 2021: 1893-1902 - [c131]Peggy Chi, Nathan Frey, Katrina Panovich, Irfan Essa:
Automatic Instructional Video Creation from a Markdown-Formatted Tutorial. UIST 2021: 677-690 - [i46]Dan Scarafoni, Irfan Essa, Thomas Ploetz:
PLAN-B: Predicting Likely Alternative Next Best Sequences for Action Prediction. CoRR abs/2103.15987 (2021) - [i45]Nathan Frey, Peggy Chi, Weilong Yang, Irfan Essa:
Automatic Non-Linear Video Editing Transfer. CoRR abs/2105.06988 (2021) - [i44]A. J. Piergiovanni, Anelia Angelova, Michael S. Ryoo, Irfan A. Essa:
Unsupervised Action Segmentation for Instructional Videos. CoRR abs/2106.03738 (2021) - [i43]A. J. Piergiovanni, Anelia Angelova, Michael S. Ryoo, Irfan A. Essa:
Unsupervised Discovery of Actions in Instructional Videos. CoRR abs/2106.14733 (2021) - [i42]K. Niranjan Kumar, Irfan Essa, Sehoon Ha:
Graph-based Cluttered Scene Generation and Interactive Exploration using Deep Reinforcement Learning. CoRR abs/2109.10460 (2021) - [i41]Chengzhi Mao, Lu Jiang, Mostafa Dehghani, Carl Vondrick, Rahul Sukthankar, Irfan Essa:
Discrete Representations Strengthen Vision Transformer Robustness. CoRR abs/2111.10493 (2021) - [i40]Apoorva Beedu, Zhile Ren, Varun Agrawal, Irfan Essa:
VideoPose: Estimating 6D object pose from videos. CoRR abs/2111.10677 (2021) - [i39]Xiang Kong, Lu Jiang, Huiwen Chang, Han Zhang, Yuan Hao, Haifeng Gong, Irfan Essa:
BLT: Bidirectional Layout Transformer for Controllable Layout Generation. CoRR abs/2112.05112 (2021) - 2020
- [c130]Hsin-Ying Lee, Lu Jiang, Irfan Essa, Phuong B. Le, Haifeng Gong, Ming-Hsuan Yang, Weilong Yang:
Neural Design Network: Graphic Layout Generation with Constraints. ECCV (3) 2020: 491-506 - [c129]Erik Wijmans, Abhishek Kadian, Ari Morcos, Stefan Lee, Irfan Essa, Devi Parikh, Manolis Savva, Dhruv Batra:
DD-PPO: Learning Near-Perfect PointGoal Navigators from 2.5 Billion Frames. ICLR 2020 - [c128]Harish Haresamudram, Apoorva Beedu, Varun Agrawal, Patrick L. Grady, Irfan A. Essa, Judy Hoffman, Thomas Plötz:
Masked reconstruction based self-supervision for human activity recognition. ISWC 2020: 45-49 - [c127]Peggy Chi, Zheng Sun, Katrina Panovich, Irfan Essa:
Automatic Video Creation From a Web Page. UIST 2020: 279-292 - [i38]Erik Wijmans, Julian Straub, Dhruv Batra, Irfan Essa, Judy Hoffman, Ari Morcos:
Analyzing Visual Representations in Embodied Navigation Tasks. CoRR abs/2003.05993 (2020) - [i37]Tianhao Zhang, Hung-Yu Tseng, Lu Jiang, Honglak Lee, Irfan Essa, Weilong Yang:
Text as Neural Operator: Image Manipulation by Text Instruction. CoRR abs/2008.04556 (2020) - [i36]Vincent Cartillier, Zhile Ren, Neha Jain, Stefan Lee, Irfan Essa, Dhruv Batra:
Semantic MapNet: Building Allocentric SemanticMaps and Representations from Egocentric Views. CoRR abs/2010.01191 (2020) - [i35]Harish Haresamudram, Irfan A. Essa, Thomas Ploetz:
Contrastive Predictive Coding for Human Activity Recognition. CoRR abs/2012.05333 (2020) - [i34]Erik Wijmans, Irfan Essa, Dhruv Batra:
How to Train PointGoal Navigation Agents on a (Sample and Compute) Budget. CoRR abs/2012.06117 (2020)
2010 – 2019
- 2019
- [j21]Aneeq Zia, Liheng Guo, Linlin Zhou, Irfan A. Essa, Anthony M. Jarc:
Novel evaluation of surgical activity recognition models using task-based efficiency metrics. Int. J. Comput. Assist. Radiol. Surg. 14(12): 2155-2163 (2019) - [c126]Erik Wijmans, Samyak Datta, Oleksandr Maksymets, Abhishek Das, Georgia Gkioxari, Stefan Lee, Irfan Essa, Devi Parikh, Dhruv Batra:
Embodied Question Answering in Photorealistic Environments With Point Cloud Perception. CVPR 2019: 6659-6668 - [c125]Huda AlAmri, Vincent Cartillier, Abhishek Das, Jue Wang, Anoop Cherian, Irfan Essa, Dhruv Batra, Tim K. Marks, Chiori Hori, Peter Anderson, Stefan Lee, Devi Parikh:
Audio Visual Scene-Aware Dialog. CVPR 2019: 7558-7567 - [c124]Chiori Hori, Huda AlAmri, Jue Wang, Gordon Wichern, Takaaki Hori, Anoop Cherian, Tim K. Marks, Vincent Cartillier, Raphael Gontijo Lopes, Abhishek Das, Irfan Essa, Dhruv Batra, Devi Parikh:
End-to-end Audio Visual Scene-aware Dialog Using Multimodal Attention-based Video Features. ICASSP 2019: 2352-2356 - [c123]Steven Hickson, Karthik Raveendran, Alireza Fathi, Kevin Murphy, Irfan A. Essa:
Floors are Flat: Leveraging Semantics for Real-Time Surface Normal Prediction. ICCV Workshops 2019: 4065-4074 - [c122]Luke Drnach, Jessica L. Allen, Irfan Essa, Lena H. Ting:
A Data-Driven Predictive Model of Individual-Specific Effects of FES on Human Gait Dynamics. ICRA 2019: 5090-5096 - [c121]Unaiza Ahsan, Rishi Madhok, Irfan A. Essa:
Video Jigsaw: Unsupervised Learning of Spatiotemporal Context for Video Action Recognition. WACV 2019: 179-189 - [c120]Steven Hickson, Nick Dufour, Avneesh Sud, Vivek Kwatra, Irfan A. Essa:
Eyemotion: Classifying Facial Expressions in VR Using Eye-Tracking Cameras. WACV 2019: 1626-1635 - [i33]Huda AlAmri, Vincent Cartillier, Abhishek Das, Jue Wang, Stefan Lee, Peter Anderson, Irfan Essa, Devi Parikh, Dhruv Batra, Anoop Cherian, Tim K. Marks, Chiori Hori:
Audio-Visual Scene-Aware Dialog. CoRR abs/1901.09107 (2019) - [i32]Erik Wijmans, Samyak Datta, Oleksandr Maksymets, Abhishek Das, Georgia Gkioxari, Stefan Lee, Irfan Essa, Devi Parikh, Dhruv Batra:
Embodied Question Answering in Photorealistic Environments with Point Cloud Perception. CoRR abs/1904.03461 (2019) - [i31]Steven Hickson, Karthik Raveendran, Alireza Fathi, Kevin Murphy, Irfan A. Essa:
Floors are Flat: Leveraging Semantics for Real-Time Surface Normal Prediction. CoRR abs/1906.06792 (2019) - [i30]Aneeq Zia, Liheng Guo, Linlin Zhou, Irfan A. Essa, Anthony M. Jarc:
Novel evaluation of surgical activity recognition models using task-based efficiency metrics. CoRR abs/1907.02060 (2019) - [i29]Niranjan Kumar Kannabiran, Irfan Essa, C. Karen Liu:
Estimating Mass Distribution of Articulated Objects through Physical Interaction. CoRR abs/1907.03964 (2019) - [i28]Erik Wijmans, Abhishek Kadian, Ari Morcos, Stefan Lee, Irfan Essa, Devi Parikh, Manolis Savva, Dhruv Batra:
Decentralized Distributed PPO: Solving PointGoal Navigation. CoRR abs/1911.00357 (2019) - [i27]Hsin-Ying Lee, Weilong Yang, Lu Jiang, Madison Le, Irfan Essa, Haifeng Gong, Ming-Hsuan Yang:
Neural Design Network: Graphic Layout Generation with Constraints. CoRR abs/1912.09421 (2019) - 2018
- [j20]Aneeq Zia, Yachna Sharma, Vinay Bettadapura, Eric L. Sarin, Irfan A. Essa:
Video and accelerometer-based motion analysis for automated surgical skills assessment. Int. J. Comput. Assist. Radiol. Surg. 13(3): 443-455 (2018) - [j19]Aneeq Zia, Irfan A. Essa:
Automated surgical skill assessment in RMIS training. Int. J. Comput. Assist. Radiol. Surg. 13(5): 731-739 (2018) - [c119]Luke Drnach, Irfan Essa, Lena H. Ting:
Identifying Gait Phases from Joint Kinematics during Walking with Switched Linear Dynamical Systems. BioRob 2018: 1181-1186 - [c118]