


Dinesh Manocha
Publications
- 2025
- [c505] Manan Suri, Puneet Mathur, Nedim Lipka, Franck Dernoncourt, Ryan A. Rossi, Dinesh Manocha:
  ChartLens: Fine-grained Visual Attribution in Charts. ACL (1) 2025: 22447-22462
- [c488] Manan Suri, Puneet Mathur, Franck Dernoncourt, Kanika Goswami, Ryan A. Rossi, Dinesh Manocha:
  VisDoM: Multi-Document QA with Visually Rich Elements Using Multimodal Retrieval-Augmented Generation. NAACL (Long Papers) 2025: 6088-6109
- [i336] Manan Suri, Puneet Mathur, Nedim Lipka, Franck Dernoncourt, Ryan A. Rossi, Dinesh Manocha:
  ChartLens: Fine-grained Visual Attribution in Charts. CoRR abs/2505.19360 (2025)
- [i334] Manan Suri, Puneet Mathur, Nedim Lipka, Franck Dernoncourt, Ryan A. Rossi, Vivek Gupta, Dinesh Manocha:
  Follow the Flow: Fine-grained Flowchart Attribution with Neurosymbolic Agents. CoRR abs/2506.01344 (2025)
- 2024
- [c481] Puneet Mathur, Zhe Liu, Ke Li, Yingyi Ma, Gil Keren, Zeeshan Ahmed, Dinesh Manocha, Xuedong Zhang:
  DOC-RAG: ASR Language Model Personalization with Domain-Distributed Co-occurrence Retrieval Augmentation. LREC/COLING 2024: 5132-5139
- [c480] Puneet Mathur, Vlad I. Morariu, Aparna Garimella, Franck Dernoncourt, Jiuxiang Gu, Ramit Sawhney, Preslav Nakov, Dinesh Manocha, Rajiv Jain:
  DocScript: Document-level Script Event Prediction. LREC/COLING 2024: 5140-5155
- [c479] Samyak Jain, Parth Chhabra, Atula Tejaswi Neerkaje, Puneet Mathur, Ramit Sawhney, Shivam Agarwal, Preslav Nakov, Sudheer Chava, Dinesh Manocha:
  Saliency-Aware Interpolative Augmentation for Multimodal Financial Prediction. LREC/COLING 2024: 14285-14297
- [c468] Manan Suri, Puneet Mathur, Franck Dernoncourt, Rajiv Jain, Vlad I. Morariu, Ramit Sawhney, Preslav Nakov, Dinesh Manocha:
  DocEdit-v2: Document Structure Editing Via Multimodal LLM Grounding. EMNLP 2024: 15485-15505
- [c461] Manan Suri, Puneet Mathur, Ramit Sawhney, Preslav Nakov, Dinesh Manocha:
  Doc2Command: Furthering Language Guided Document Editing. Tiny Papers @ ICLR 2024
- [i269] Manan Suri, Puneet Mathur, Franck Dernoncourt, Rajiv Jain, Vlad I. Morariu, Ramit Sawhney, Preslav Nakov, Dinesh Manocha:
  DocEdit-v2: Document Structure Editing Via Multimodal LLM Grounding. CoRR abs/2410.16472 (2024)
- [i263] Manan Suri, Puneet Mathur, Franck Dernoncourt, Kanika Goswami, Ryan A. Rossi, Dinesh Manocha:
  VisDoM: Multi-Document QA with Visually Rich Elements Using Multimodal Retrieval-Augmented Generation. CoRR abs/2412.10704 (2024)
- 2023
- [c431] Puneet Mathur, Rajiv Jain, Jiuxiang Gu, Franck Dernoncourt, Dinesh Manocha, Vlad I. Morariu:
  DocEdit: Language-Guided Document Editing. AAAI 2023: 1914-1922
- [c423] Puneet Mathur, Zhe Liu, Ke Li, Yingyi Ma, Gil Keren, Zeeshan Ahmed, Dinesh Manocha, Xuedong Zhang:
  PersonaLM: Language Model Personalization via Domain-distributed Span Aggregated K-Nearest N-gram Retrieval Augmentation. EMNLP (Findings) 2023: 11314-11328
- [c396] Puneet Mathur, Rajiv Jain, Ashutosh Mehra, Jiuxiang Gu, Franck Dernoncourt, Anandhavelu Natarajan, Quan Hung Tran, Verena Kaynig-Fittkau, Ani Nenkova, Dinesh Manocha, Vlad I. Morariu:
  LayerDoc: Layer-wise Extraction of Spatial Hierarchical Structure in Visually-Rich Documents. WACV 2023: 3599-3609
- 2022
- [c393] Vikram Gupta, Trisha Mittal, Puneet Mathur, Vaibhav Mishra, Mayank Maheshwari, Aniket Bera, Debdoot Mukherjee, Dinesh Manocha:
  3MASSIV: Multilingual, Multimodal and Multi-Aspect dataset of Social Media Short Videos. CVPR 2022: 21032-21043
- [c388] Puneet Mathur, Gautam Kunapuli, Riyaz Bhat, Manish Shrivastava, Dinesh Manocha, Maneesh Singh:
  DocInfer: Document-level Natural Language Inference using Optimal Evidence Selection. EMNLP 2022: 809-824
- [c387] Puneet Mathur, Mihir Goyal, Ramit Sawhney, Ritik Mathur, Jochen L. Leidner, Franck Dernoncourt, Dinesh Manocha:
  DocFin: Multimodal Financial Prediction and Bias Mitigation using Semi-structured Documents. EMNLP (Findings) 2022: 1933-1940
- [c380] Puneet Mathur, Franck Dernoncourt, Quan Hung Tran, Jiuxiang Gu, Ani Nenkova, Vlad I. Morariu, Rajiv Jain, Dinesh Manocha:
  DocLayoutTTS: Dataset and Baselines for Layout-informed Document-level Neural Speech Synthesis. INTERSPEECH 2022: 451-455
- [c379] Ramit Sawhney, Megh Thakkar, Vishwa Shah, Puneet Mathur, Vasu Sharma, Dinesh Manocha:
  PISA: PoIncaré Saliency-Aware Interpolative Augmentation. INTERSPEECH 2022: 2663-2667
- [c367] Puneet Mathur, Atula Tejaswi Neerkaje, Malika Chhibber, Ramit Sawhney, Fuming Guo, Franck Dernoncourt, Sanghamitra Dutta, Dinesh Manocha:
  MONOPOLY: Financial Prediction from MONetary POLicY Conference Videos Using Multimodal Cues. ACM Multimedia 2022: 2276-2285
- [c366] Puneet Mathur, Vlad I. Morariu, Verena Kaynig-Fittkau, Jiuxiang Gu, Franck Dernoncourt, Quan Hung Tran, Ani Nenkova, Dinesh Manocha, Rajiv Jain:
  DocTime: A Document-level Temporal Dependency Graph Parser. NAACL-HLT 2022: 993-1009
- [i200] Vikram Gupta, Trisha Mittal, Puneet Mathur, Vaibhav Mishra, Mayank Maheshwari, Aniket Bera, Debdoot Mukherjee, Dinesh Manocha:
  3MASSIV: Multilingual, Multimodal and Multi-Aspect dataset of Social Media Short Videos. CoRR abs/2203.14456 (2022)
- [i187] Trisha Mittal, Puneet Mathur, Rohan Chandra, Apurva Bhatt, Vikram Gupta, Debdoot Mukherjee, Aniket Bera, Dinesh Manocha:
  Estimating Emotion Contagion on Social Media via Localized Diffusion in Dynamic Graphs. CoRR abs/2207.07165 (2022)
- 2021
- [c357] Puneet Mathur, Rajiv Jain, Franck Dernoncourt, Vlad I. Morariu, Quan Hung Tran, Dinesh Manocha:
  TIMERS: Document-level Temporal Relation Extraction. ACL/IJCNLP (2) 2021: 524-533
- [c354] Trisha Mittal, Puneet Mathur, Aniket Bera, Dinesh Manocha:
  Affect2MM: Affective Analysis of Multimedia Content Using Emotion Causality. CVPR 2021: 5661-5671
- [c353] Puneet Mathur, Trisha Mittal, Dinesh Manocha:
  Dynamic Graph Modeling Of Simultaneous EEG And Eye-Tracking Data For Reading Task Identification. ICASSP 2021: 1250-1254
- [i163] Puneet Mathur, Trisha Mittal, Dinesh Manocha:
  Dynamic Graph Modeling of Simultaneous EEG and Eye-tracking Data for Reading Task Identification. CoRR abs/2102.11922 (2021)
- [i159] Trisha Mittal, Puneet Mathur, Aniket Bera, Dinesh Manocha:
  Affect2MM: Affective Analysis of Multimedia Content Using Emotion Causality. CoRR abs/2103.06541 (2021)

last updated on 2025-07-28 22:13 CEST by the dblp team
all metadata released as open data under CC0 1.0 license