default search action
Srikar Appalaraju
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
Conference and Workshop Papers
- 2024
- [c15]Srikar Appalaraju, Peng Tang, Qi Dong, Nishant Sankaran, Yichu Zhou, R. Manmatha:
DocFormerv2: Local Features for Document Understanding. AAAI 2024: 709-718 - [c14]Tianyang Zhao, Kunwar Yashraj Singh, Srikar Appalaraju, Peng Tang, Vijay Mahadevan, R. Manmatha, Ying Nian Wu:
No Head Left Behind - Multi-Head Alignment Distillation for Transformers. AAAI 2024: 7514-7524 - [c13]Yuan Gao, Kunyu Shi, Pengkai Zhu, Edouard Belval, Oren Nuriel, Srikar Appalaraju, Shabnam Ghadar, Zhuowen Tu, Vijay Mahadevan, Stefano Soatto:
Enhancing Vision-Language Pre-Training with Rich Supervisions. CVPR 2024: 13480-13491 - [c12]Ofir Abramovich, Niv Nayman, Sharon Fogel, Inbal Lavi, Ron Litman, Shahar Tsiper, Royee Tichauer, Srikar Appalaraju, Shai Mazor, R. Manmatha:
VisFocus: Prompt-Guided Vision Encoders for OCR-Free Dense Document Understanding. ECCV (8) 2024: 241-259 - [c11]Sungnyun Kim, Haofu Liao, Srikar Appalaraju, Peng Tang, Zhuowen Tu, Ravi Kumar Satzoda, R. Manmatha, Vijay Mahadevan, Stefano Soatto:
DocKD: Knowledge Distillation from LLMs for Open-World Document Understanding Models. EMNLP 2024: 3167-3193 - [c10]Peng Tang, Srikar Appalaraju, R. Manmatha, Yusheng Xie, Vijay Mahadevan:
Multiple-Question Multiple-Answer Text-VQA. NAACL (Industry Track) 2024: 73-88 - [c9]Peng Tang, Pengkai Zhu, Tian Li, Srikar Appalaraju, Vijay Mahadevan, R. Manmatha:
DEED: Dynamic Early Exit on Decoder for Accelerating Encoder-Decoder Transformer Models. NAACL-HLT (Findings) 2024: 116-131 - 2023
- [c8]Yoshinari Fujinuma, Siddharth Varia, Nishant Sankaran, Srikar Appalaraju, Bonan Min, Yogarshi Vyas:
A Multi-Modal Multilingual Benchmark for Document Image Classification. EMNLP (Findings) 2023: 14361-14376 - [c7]Xiaoshuai Hao, Yi Zhu, Srikar Appalaraju, Aston Zhang, Wanqian Zhang, Bo Li, Mu Li:
MixGen: A New Multi-Modal Data Augmentation. WACV (Workshops) 2023: 379-389 - 2022
- [c6]Ali Furkan Biten, Ron Litman, Yusheng Xie, Srikar Appalaraju, R. Manmatha:
LaTr: Layout-Aware Transformer for Scene-Text VQA. CVPR 2022: 16527-16537 - [c5]Chih-Hui Ho, Srikar Appalaraju, Bhavan Jasani, R. Manmatha, Nuno Vasconcelos:
YORO - Lightweight End to End Visual Grounding. ECCV Workshops (8) 2022: 3-23 - [c4]Chenge Li, István Fehérvári, Xiaonan Zhao, Ives Macêdo, Srikar Appalaraju:
SeeTek: Very Large-Scale Open-set Logo Recognition with Text-Aware Metric Learning. WACV 2022: 587-596 - 2021
- [c3]Srikar Appalaraju, Bhavan Jasani, Bhargava Urala Kota, Yusheng Xie, R. Manmatha:
DocFormer: End-to-End Transformer for Document Understanding. ICCV 2021: 973-983 - [c2]Yash Patel, Srikar Appalaraju, R. Manmatha:
Saliency Driven Perceptual Image Compression. WACV 2021: 227-236 - 2019
- [c1]István Fehérvári, Srikar Appalaraju:
Scalable Logo Recognition Using Proxies. WACV 2019: 715-725
Informal and Other Publications
- 2024
- [i21]Yuan Gao, Kunyu Shi, Pengkai Zhu, Edouard Belval, Oren Nuriel, Srikar Appalaraju, Shabnam Ghadar, Vijay Mahadevan, Zhuowen Tu, Stefano Soatto:
Enhancing Vision-Language Pre-training with Rich Supervisions. CoRR abs/2403.03346 (2024) - [i20]Varun Nagaraj Rao, Siddharth Choudhary, Aditya Deshpande, Ravi Kumar Satzoda, Srikar Appalaraju:
RAVEN: Multitask Retrieval Augmented Vision-Language Learning. CoRR abs/2406.19150 (2024) - [i19]Ofir Abramovich, Niv Nayman, Sharon Fogel, Inbal Lavi, Ron Litman, Shahar Tsiper, Royee Tichauer, Srikar Appalaraju, Shai Mazor, R. Manmatha:
VisFocus: Prompt-Guided Vision Encoders for OCR-Free Dense Document Understanding. CoRR abs/2407.12594 (2024) - [i18]Sungnyun Kim, Haofu Liao, Srikar Appalaraju, Peng Tang, Zhuowen Tu, Ravi Kumar Satzoda, R. Manmatha, Vijay Mahadevan, Stefano Soatto:
DocKD: Knowledge Distillation from LLMs for Open-World Document Understanding Models. CoRR abs/2410.03061 (2024) - 2023
- [i17]Yash Patel, Yusheng Xie, Yi Zhu, Srikar Appalaraju, R. Manmatha:
SimCon Loss with Multiple Views for Text Supervised Semantic Segmentation. CoRR abs/2302.03432 (2023) - [i16]Srikar Appalaraju, Peng Tang, Qi Dong, Nishant Sankaran, Yichu Zhou, R. Manmatha:
DocFormerv2: Local Features for Document Understanding. CoRR abs/2306.01733 (2023) - [i15]Yoshinari Fujinuma, Siddharth Varia, Nishant Sankaran, Srikar Appalaraju, Bonan Min, Yogarshi Vyas:
A Multi-Modal Multilingual Benchmark for Document Image Classification. CoRR abs/2310.16356 (2023) - [i14]Peng Tang, Srikar Appalaraju, R. Manmatha, Yusheng Xie, Vijay Mahadevan:
Multiple-Question Multiple-Answer Text-VQA. CoRR abs/2311.08622 (2023) - [i13]Peng Tang, Pengkai Zhu, Tian Li, Srikar Appalaraju, Vijay Mahadevan, R. Manmatha:
DEED: Dynamic Early Exit on Decoder for Accelerating Encoder-Decoder Transformer Models. CoRR abs/2311.08623 (2023) - 2022
- [i12]Simone Bombari, Alessandro Achille, Zijian Wang, Yu-Xiang Wang, Yusheng Xie, Kunwar Yashraj Singh, Srikar Appalaraju, Vijay Mahadevan, Stefano Soatto:
Towards Differential Relational Privacy and its use in Question Answering. CoRR abs/2203.16701 (2022) - [i11]Xiaoshuai Hao, Yi Zhu, Srikar Appalaraju, Aston Zhang, Wanqian Zhang, Bo Li, Mu Li:
MixGen: A New Multi-Modal Data Augmentation. CoRR abs/2206.08358 (2022) - [i10]Chih-Hui Ho, Srikar Appalaraju, Bhavan Jasani, R. Manmatha, Nuno Vasconcelos:
YORO - Lightweight End to End Visual Grounding. CoRR abs/2211.07912 (2022) - 2021
- [i9]Srikar Appalaraju, Bhavan Jasani, Bhargava Urala Kota, Yusheng Xie, R. Manmatha:
DocFormer: End-to-End Transformer for Document Understanding. CoRR abs/2106.11539 (2021) - [i8]Ali Furkan Biten, Ron Litman, Yusheng Xie, Srikar Appalaraju, R. Manmatha:
LaTr: Layout-Aware Transformer for Scene-Text VQA. CoRR abs/2112.12494 (2021) - 2020
- [i7]Yash Patel, Srikar Appalaraju, R. Manmatha:
Hierarchical Auto-Regressive Model for Image Compression Incorporating Object Saliency and a Deep Perceptual Loss. CoRR abs/2002.04988 (2020) - [i6]Srikar Appalaraju, Yi Zhu, Yusheng Xie, István Fehérvári:
Towards Good Practices in Self-supervised Representation Learning. CoRR abs/2012.00868 (2020) - 2019
- [i5]Yash Patel, Srikar Appalaraju, R. Manmatha:
Deep Perceptual Compression. CoRR abs/1907.08310 (2019) - [i4]Yash Patel, Srikar Appalaraju, R. Manmatha:
Human Perceptual Evaluations for Image Compression. CoRR abs/1908.04187 (2019) - [i3]István Fehérvári, Avinash Ravichandran, Srikar Appalaraju:
Unbiased Evaluation of Deep Metric Learning Algorithms. CoRR abs/1911.12528 (2019) - 2018
- [i2]István Fehérvári, Srikar Appalaraju:
Scalable Logo Recognition using Proxies. CoRR abs/1811.08009 (2018) - 2017
- [i1]Srikar Appalaraju, Vineet Chaoji:
Image similarity using Deep CNN and Curriculum Learning. CoRR abs/1709.08761 (2017)
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-11-14 22:02 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint