default search action
Jaemin Cho 0001
Person information
- affiliation: UNC Chapel Hill, NC, USA
- affiliation (former): Allen Institute for AI, Seattle, WA, USA
Other persons with the same name
- Jaemin Cho — disambiguation page
- Jaemin Cho 0002 — KAIST, Daejeon, Republic of Korea
- Jaemin Cho 0003 — Pohang University of Science and Technology, Pohang, Korea
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c19]Qin Liu, Jaemin Cho, Mohit Bansal, Marc Niethammer:
Rethinking Interactive Image Segmentation with Low Latency, High Quality, and Diverse Prompts. CVPR 2024: 3773-3782 - [c18]Jaemin Cho, Linjie Li, Zhengyuan Yang, Zhe Gan, Lijuan Wang, Mohit Bansal:
Diagnostic Benchmark and Iterative Inpainting for Layout-Guided Image Generation. CVPR Workshops 2024: 5280-5289 - [c17]Jaemin Cho, Yushi Hu, Jason M. Baldridge, Roopal Garg, Peter Anderson, Ranjay Krishna, Mohit Bansal, Jordi Pont-Tuset, Su Wang:
Davidsonian Scene Graph: Improving Reliability in Fine-grained Evaluation for Text-to-Image Generation. ICLR 2024 - [i26]David Wan, Jaemin Cho, Elias Stengel-Eskin, Mohit Bansal:
Contrastive Region Guidance: Improving Grounding in Vision-Language Models without Training. CoRR abs/2403.02325 (2024) - [i25]Jialu Li, Jaemin Cho, Yi-Lin Sung, Jaehong Yoon, Mohit Bansal:
SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data. CoRR abs/2403.06952 (2024) - [i24]Abhay Zala, Jaemin Cho, Han Lin, Jaehong Yoon, Mohit Bansal:
EnvGen: Generating and Adapting Environments via LLMs for Training Embodied Agents. CoRR abs/2403.12014 (2024) - [i23]Qin Liu, Jaemin Cho, Mohit Bansal, Marc Niethammer:
Rethinking Interactive Image Segmentation with Low Latency, High Quality, and Diverse Prompts. CoRR abs/2404.00741 (2024) - [i22]Han Lin, Jaemin Cho, Abhay Zala, Mohit Bansal:
Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model. CoRR abs/2404.09967 (2024) - [i21]Yasumasa Onoe, Sunayana Rane, Zachary Berger, Yonatan Bitton, Jaemin Cho, Roopal Garg, Alexander Ku, Zarana Parekh, Jordi Pont-Tuset, Garrett Tanzer, Su Wang, Jason Baldridge:
DOCCI: Descriptions of Connected and Contrasting Images. CoRR abs/2404.19753 (2024) - 2023
- [c16]Abhay Zala, Jaemin Cho, Satwik Kottur, Xilun Chen, Barlas Oguz, Yashar Mehdad, Mohit Bansal:
Hierarchical Video-Moment Retrieval and Step-Captioning. CVPR 2023: 23056-23065 - [c15]Jaemin Cho, Abhay Zala, Mohit Bansal:
DALL-EVAL: Probing the Reasoning Skills and Social Biases of Text-to-Image Generation Models. ICCV 2023: 3020-3031 - [c14]Jaemin Cho, Abhay Zala, Mohit Bansal:
Visual Programming for Step-by-Step Text-to-Image Generation and Evaluation. NeurIPS 2023 - [c13]Zhenhailong Wang, Ansel Blume, Sha Li, Genglin Liu, Jaemin Cho, Zineng Tang, Mohit Bansal, Heng Ji:
Paxion: Patching Action Knowledge in Video-Language Foundation Models. NeurIPS 2023 - [c12]Shoubin Yu, Jaemin Cho, Prateek Yadav, Mohit Bansal:
Self-Chained Image-Language Model for Video Localization and Question Answering. NeurIPS 2023 - [c11]Zineng Tang, Jaemin Cho, Jie Lei, Mohit Bansal:
PERCEIVER-VL: Efficient Vision-and-Language Modeling with Iterative Latent Attention. WACV 2023: 4399-4409 - [i20]Abhay Zala, Jaemin Cho, Satwik Kottur, Xilun Chen, Barlas Oguz, Yashar Mehdad, Mohit Bansal:
Hierarchical Video-Moment Retrieval and Step-Captioning. CoRR abs/2303.16406 (2023) - [i19]Jaemin Cho, Linjie Li, Zhengyuan Yang, Zhe Gan, Lijuan Wang, Mohit Bansal:
Diagnostic Benchmark and Iterative Inpainting for Layout-Guided Image Generation. CoRR abs/2304.06671 (2023) - [i18]Shoubin Yu, Jaemin Cho, Prateek Yadav, Mohit Bansal:
Self-Chained Image-Language Model for Video Localization and Question Answering. CoRR abs/2305.06988 (2023) - [i17]Zhenhailong Wang, Ansel Blume, Sha Li, Genglin Liu, Jaemin Cho, Zineng Tang, Mohit Bansal, Heng Ji:
Paxion: Patching Action Knowledge in Video-Language Foundation Models. CoRR abs/2305.10683 (2023) - [i16]Jaemin Cho, Abhay Zala, Mohit Bansal:
Visual Programming for Text-to-Image Generation and Evaluation. CoRR abs/2305.15328 (2023) - [i15]Han Lin, Abhay Zala, Jaemin Cho, Mohit Bansal:
VideoDirectorGPT: Consistent Multi-scene Video Generation via LLM-Guided Planning. CoRR abs/2309.15091 (2023) - [i14]Abhay Zala, Han Lin, Jaemin Cho, Mohit Bansal:
DiagrammerGPT: Generating Open-Domain, Open-Platform Diagrams via LLM Planning. CoRR abs/2310.12128 (2023) - [i13]Jaemin Cho, Yushi Hu, Roopal Garg, Peter Anderson, Ranjay Krishna, Jason Baldridge, Mohit Bansal, Jordi Pont-Tuset, Su Wang:
Davidsonian Scene Graph: Improving Reliability in Fine-grained Evaluation for Text-to-Image Generation. CoRR abs/2310.18235 (2023) - 2022
- [c10]Revanth Gangi Reddy, Xilin Rui, Manling Li, Xudong Lin, Haoyang Wen, Jaemin Cho, Lifu Huang, Mohit Bansal, Avirup Sil, Shih-Fu Chang, Alexander G. Schwing, Heng Ji:
MuMuQA: Multimedia Multi-Hop News Question Answering via Cross-Media Knowledge Extraction and Grounding. AAAI 2022: 11200-11208 - [c9]Yi-Lin Sung, Jaemin Cho, Mohit Bansal:
VL-ADAPTER: Parameter-Efficient Transfer Learning for Vision-and-Language Tasks. CVPR 2022: 5217-5227 - [c8]Jaemin Cho, Seunghyun Yoon, Ajinkya Kale, Franck Dernoncourt, Trung Bui, Mohit Bansal:
Fine-grained Image Captioning with CLIP Reward. NAACL-HLT (Findings) 2022: 517-527 - [c7]Yi-Lin Sung, Jaemin Cho, Mohit Bansal:
LST: Ladder Side-Tuning for Parameter and Memory Efficient Transfer Learning. NeurIPS 2022 - [c6]Zineng Tang, Jaemin Cho, Yixin Nie, Mohit Bansal:
TVLT: Textless Vision-Language Transformer. NeurIPS 2022 - [i12]Jaemin Cho, Abhay Zala, Mohit Bansal:
DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generative Transformers. CoRR abs/2202.04053 (2022) - [i11]Jaemin Cho, Seunghyun Yoon, Ajinkya Kale, Franck Dernoncourt, Trung Bui, Mohit Bansal:
Fine-grained Image Captioning with CLIP Reward. CoRR abs/2205.13115 (2022) - [i10]Yi-Lin Sung, Jaemin Cho, Mohit Bansal:
LST: Ladder Side-Tuning for Parameter and Memory Efficient Transfer Learning. CoRR abs/2206.06522 (2022) - [i9]Zineng Tang, Jaemin Cho, Yixin Nie, Mohit Bansal:
TVLT: Textless Vision-Language Transformer. CoRR abs/2209.14156 (2022) - [i8]Zineng Tang, Jaemin Cho, Jie Lei, Mohit Bansal:
Perceiver-VL: Efficient Vision-and-Language Modeling with Iterative Latent Attention. CoRR abs/2211.11701 (2022) - 2021
- [c5]Jaemin Cho, Jie Lei, Hao Tan, Mohit Bansal:
Unifying Vision-and-Language Tasks via Text Generation. ICML 2021: 1931-1942 - [c4]Zineng Tang, Jaemin Cho, Hao Tan, Mohit Bansal:
VidLanKD: Improving Language Understanding via Video-Distilled Knowledge Transfer. NeurIPS 2021: 24468-24481 - [i7]Jaemin Cho, Jie Lei, Hao Tan, Mohit Bansal:
Unifying Vision-and-Language Tasks via Text Generation. CoRR abs/2102.02779 (2021) - [i6]Zineng Tang, Jaemin Cho, Hao Tan, Mohit Bansal:
VidLanKD: Improving Language Understanding via Video-Distilled Knowledge Transfer. CoRR abs/2107.02681 (2021) - [i5]Yi-Lin Sung, Jaemin Cho, Mohit Bansal:
VL-Adapter: Parameter-Efficient Transfer Learning for Vision-and-Language Tasks. CoRR abs/2112.06825 (2021) - [i4]Revanth Gangi Reddy, Xilin Rui, Manling Li, Xudong Lin, Haoyang Wen, Jaemin Cho, Lifu Huang, Mohit Bansal, Avirup Sil, Shih-Fu Chang, Alexander G. Schwing, Heng Ji:
MuMuQA: Multimedia Multi-Hop News Question Answering via Cross-Media Knowledge Extraction and Grounding. CoRR abs/2112.10728 (2021) - 2020
- [c3]Jaemin Cho, Jiasen Lu, Dustin Schwenk, Hannaneh Hajishirzi, Aniruddha Kembhavi:
X-LXMERT: Paint, Caption and Answer Questions with Multi-Modal Transformers. EMNLP (1) 2020: 8785-8805 - [i3]Jaemin Cho, Jiasen Lu, Dustin Schwenk, Hannaneh Hajishirzi, Aniruddha Kembhavi:
X-LXMERT: Paint, Caption and Answer Questions with Multi-Modal Transformers. CoRR abs/2009.11278 (2020)
2010 – 2019
- 2019
- [c2]Jaemin Cho, Min Joon Seo, Hannaneh Hajishirzi:
Mixture Content Selection for Diverse Sequence Generation. EMNLP/IJCNLP (1) 2019: 3119-3129 - [i2]Jaemin Cho, Min Joon Seo, Hannaneh Hajishirzi:
Mixture Content Selection for Diverse Sequence Generation. CoRR abs/1909.01953 (2019) - 2018
- [c1]Yookoon Park, Jaemin Cho, Gunhee Kim:
A Hierarchical Latent Structure for Variational Conversation Modeling. NAACL-HLT 2018: 1792-1801 - [i1]Yookoon Park, Jaemin Cho, Gunhee Kim:
A Hierarchical Latent Structure for Variational Conversation Modeling. CoRR abs/1804.03424 (2018)
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-10-11 18:23 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint