default search action
Yejin Choi 0001
Ye Jin Choi 0001
Person information
- affiliation: University of Washington, School of Computer Science & Engineering, Seattle, WA, USA
- affiliation: Allen Institute for Artificial Intelligence, Seattle, WA, USA
- affiliation: Stony Brook University, Department of Computer Science, Stony Brook, NY, USA
- affiliation (PhD 2010): Cornell University, Ithaca, NY, USA
Other persons with the same name
- Yejin Choi (aka: YeJin Choi, Ye-Jin Choi, Ye Jin Choi) — disambiguation page
- Yejin Choi 0002 (aka: Ye Jin Choi 0002) — Chungnam National University, Department of Mechatronics Engineering, Daejeon, Korea
- YeJin Choi 0003 (aka: Yejin Choi 0003) — Sungshin Women's University, Department of Future Convergence Technology Engineering, Seoul, Korea
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2024
- [c242]Taylor Sorensen, Liwei Jiang, Jena D. Hwang, Sydney Levine, Valentina Pyatkin, Peter West, Nouha Dziri, Ximing Lu, Kavel Rao, Chandra Bhagavatula, Maarten Sap, John Tasioulas, Yejin Choi:
Value Kaleidoscope: Engaging AI with Pluralistic Human Values, Rights, and Duties. AAAI 2024: 19937-19947 - [c241]Siyuan Wang, Zhongyu Wei, Yejin Choi, Xiang Ren:
Can LLMs Reason with Rules? Logic Scaffolding for Stress-Testing and Improving LLMs. ACL (1) 2024: 7523-7543 - [c240]Da Yin, Faeze Brahman, Abhilasha Ravichander, Khyathi Raghavi Chandu, Kai-Wei Chang, Yejin Choi, Bill Yuchen Lin:
Agent Lumos: Unified and Modular Training for Open-Source Language Agents. ACL (1) 2024: 12380-12403 - [c239]Tejas Srinivasan, Jack Hessel, Tanmay Gupta, Bill Yuchen Lin, Yejin Choi, Jesse Thomason, Khyathi Raghavi Chandu:
Selective "Selective Prediction": Reducing Unnecessary Abstention in Vision-Language Reasoning. ACL (Findings) 2024: 12935-12948 - [c238]Jungo Kasai, Keisuke Sakaguchi, Ronan Le Bras, Dragomir Radev, Yejin Choi, Noah A. Smith:
A Call for Clarity in Beam Search: How It Works and When It Stops. LREC/COLING 2024: 77-90 - [c237]Natalie Shapira, Mosh Levy, Seyed Hossein Alavi, Xuhui Zhou, Yejin Choi, Yoav Goldberg, Maarten Sap, Vered Shwartz:
Clever Hans or Neural Theory of Mind? Stress Testing Social Reasoning in Large Language Models. EACL (1) 2024: 2257-2273 - [c236]Huihan Li, Yuting Ning, Zeyi Liao, Siyuan Wang, Xiang Li, Ximing Lu, Wenting Zhao, Faeze Brahman, Yejin Choi, Xiang Ren:
In Search of the Long-Tail: Systematic Generation of Long-Tail Inferential Knowledge via Logical Rule Guided Search. EMNLP 2024: 2348-2370 - [c235]Shangbin Feng, Taylor Sorensen, Yuhan Liu, Jillian Fisher, Chan Young Park, Yejin Choi, Yulia Tsvetkov:
Modular Pluralism: Pluralistic Alignment via Multi-LLM Collaboration. EMNLP 2024: 4151-4171 - [c234]Jillian Fisher, Skyler Hallinan, Ximing Lu, Mitchell L. Gordon, Zaïd Harchaoui, Yejin Choi:
StyleRemix: Interpretable Authorship Obfuscation via Distillation and Perturbation of Style Elements. EMNLP 2024: 4172-4206 - [c233]Jaeyoung Lee, Ximing Lu, Jack Hessel, Faeze Brahman, Youngjae Yu, Yonatan Bisk, Yejin Choi, Saadia Gabriel:
How to Train Your Fact Verifier: Knowledge Transfer with Multimodal Open Models. EMNLP (Findings) 2024: 13060-13077 - [c232]Tong Chen, Akari Asai, Niloofar Mireshghallah, Sewon Min, James Grimmelmann, Yejin Choi, Hannaneh Hajishirzi, Luke Zettlemoyer, Pang Wei Koh:
CopyBench: Measuring Literal and Non-Literal Reproduction of Copyright-Protected Text in Language Model Generation. EMNLP 2024: 15134-15158 - [c231]Siyuan Wang, Zhongyu Wei, Yejin Choi, Xiang Ren:
Symbolic Working Memory Enhances Language Models for Complex Rule Application. EMNLP 2024: 17583-17604 - [c230]Chani Jung, Dongkwan Kim, Jiho Jin, Jiseon Kim, Yeon Seonwoo, Yejin Choi, Alice Oh, Hyunwoo Kim:
Perceptions to Beliefs: Exploring Precursory Inferences for Theory of Mind in Large Language Models. EMNLP 2024: 19794-19809 - [c229]Faeze Brahman, Chandra Bhagavatula, Valentina Pyatkin, Jena D. Hwang, Xiang Lorraine Li, Hirona Jacqueline Arai, Soumya Sanyal, Keisuke Sakaguchi, Xiang Ren, Yejin Choi:
PlaSma: Procedural Knowledge Models for Language-based Planning and Re-Planning. ICLR 2024 - [c228]Seungone Kim, Jamin Shin, Yejin Choi, Joel Jang, Shayne Longpre, Hwaran Lee, Sangdoo Yun, Seongjin Shin, Sungdong Kim, James Thorne, Minjoon Seo:
Prometheus: Inducing Fine-Grained Evaluation Capability in Language Models. ICLR 2024 - [c227]Bill Yuchen Lin, Abhilasha Ravichander, Ximing Lu, Nouha Dziri, Melanie Sclar, Khyathi Raghavi Chandu, Chandra Bhagavatula, Yejin Choi:
The Unlocking Spell on Base LLMs: Rethinking Alignment via In-Context Learning. ICLR 2024 - [c226]Niloofar Mireshghallah, Hyunwoo Kim, Xuhui Zhou, Yulia Tsvetkov, Maarten Sap, Reza Shokri, Yejin Choi:
Can LLMs Keep a Secret? Testing Privacy Implications of Language Models via Contextual Integrity Theory. ICLR 2024 - [c225]Linlu Qiu, Liwei Jiang, Ximing Lu, Melanie Sclar, Valentina Pyatkin, Chandra Bhagavatula, Bailin Wang, Yoon Kim, Yejin Choi, Nouha Dziri, Xiang Ren:
Phenomenal Yet Puzzling: Testing Inductive Reasoning Capabilities of Language Models with Hypothesis Refinement. ICLR 2024 - [c224]Sahana Ramnath, Brihi Joshi, Skyler Hallinan, Ximing Lu, Liunian Harold Li, Aaron Chan, Jack Hessel, Yejin Choi, Xiang Ren:
Tailoring Self-Rationalizers with Multi-Reward Distillation. ICLR 2024 - [c223]Melanie Sclar, Yejin Choi, Yulia Tsvetkov, Alane Suhr:
Quantifying Language Models' Sensitivity to Spurious Features in Prompt Design or: How I learned to start worrying about prompt formatting. ICLR 2024 - [c222]Peter West, Ximing Lu, Nouha Dziri, Faeze Brahman, Linjie Li, Jena D. Hwang, Liwei Jiang, Jillian Fisher, Abhilasha Ravichander, Khyathi Raghavi Chandu, Benjamin Newman, Pang Wei Koh, Allyson Ettinger, Yejin Choi:
The Generative AI Paradox: "What It Can Create, It May Not Understand". ICLR 2024 - [c221]Wenting Zhao, Xiang Ren, Jack Hessel, Claire Cardie, Yejin Choi, Yuntian Deng:
WildChat: 1M ChatGPT Interaction Logs in the Wild. ICLR 2024 - [c220]Siru Ouyang, Zhuosheng Zhang, Bing Yan, Xuan Liu, Yejin Choi, Jiawei Han, Lianhui Qin:
Structured Chemistry Reasoning with Large Language Models. ICML 2024 - [c219]Taylor Sorensen, Jared Moore, Jillian Fisher, Mitchell L. Gordon, Niloofar Mireshghallah, Christopher Michael Rytting, Andre Ye, Liwei Jiang, Ximing Lu, Nouha Dziri, Tim Althoff, Yejin Choi:
Position: A Roadmap to Pluralistic Alignment. ICML 2024 - [c218]Jillian Fisher, Ximing Lu, Jaehun Jung, Liwei Jiang, Zaïd Harchaoui, Yejin Choi:
JAMDEC: Unsupervised Authorship Obfuscation using Constrained Decoding over Small Language Models. NAACL-HLT 2024: 1552-1581 - [c217]Jaehun Jung, Peter West, Liwei Jiang, Faeze Brahman, Ximing Lu, Jillian Fisher, Taylor Sorensen, Yejin Choi:
Impossible Distillation for Paraphrasing and Summarization: How to Make High-quality Lemonade out of Small, Low-quality Model. NAACL-HLT 2024: 4439-4454 - [c216]Phillip Howard, Junlin Wang, Vasudev Lal, Gadi Singer, Yejin Choi, Swabha Swayamdipta:
NeuroComparatives: Neuro-Symbolic Distillation of Comparative Knowledge. NAACL-HLT (Findings) 2024: 4502-4520 - [c215]Yufei Tian, Abhilasha Ravichander, Lianhui Qin, Ronan Le Bras, Raja Marjieh, Nanyun Peng, Yejin Choi, Thomas L. Griffiths, Faeze Brahman:
MacGyver: Are Large Language Models Creative Problem Solvers? NAACL-HLT 2024: 5303-5324 - [c214]Wenting Zhao, Justin T. Chiu, Jena D. Hwang, Faeze Brahman, Jack Hessel, Sanjiban Choudhury, Yejin Choi, Xiang Li, Alane Suhr:
UNcommonsense Reasoning: Abductive Reasoning about Uncommon Situations. NAACL-HLT 2024: 8487-8505 - [i241]Zane Durante, Qiuyuan Huang, Naoki Wake, Ran Gong, Jae Sung Park, Bidipta Sarkar, Rohan Taori, Yusuke Noda, Demetri Terzopoulos, Yejin Choi, Katsushi Ikeuchi, Hoi Vo, Li Fei-Fei, Jianfeng Gao:
Agent AI: Surveying the Horizons of Multimodal Interaction. CoRR abs/2401.03568 (2024) - [i240]Alisa Liu, Xiaochuang Han, Yizhong Wang, Yulia Tsvetkov, Yejin Choi, Noah A. Smith:
Tuning Language Models by Proxy. CoRR abs/2401.08565 (2024) - [i239]Jiacheng Liu, Sewon Min, Luke Zettlemoyer, Yejin Choi, Hannaneh Hajishirzi:
Infini-gram: Scaling Unbounded n-gram Language Models to a Trillion Tokens. CoRR abs/2401.17377 (2024) - [i238]Taylor Sorensen, Jared Moore, Jillian Fisher, Mitchell L. Gordon, Niloofar Mireshghallah, Christopher Michael Rytting, Andre Ye, Liwei Jiang, Ximing Lu, Nouha Dziri, Tim Althoff, Yejin Choi:
A Roadmap to Pluralistic Alignment. CoRR abs/2402.05070 (2024) - [i237]Michael Duan, Anshuman Suri, Niloofar Mireshghallah, Sewon Min, Weijia Shi, Luke Zettlemoyer, Yulia Tsvetkov, Yejin Choi, David Evans, Hannaneh Hajishirzi:
Do Membership Inference Attacks Work on Large Language Models? CoRR abs/2402.07841 (2024) - [i236]Jillian Fisher, Ximing Lu, Jaehun Jung, Liwei Jiang, Zaïd Harchaoui, Yejin Choi:
JAMDEC: Unsupervised Authorship Obfuscation using Constrained Decoding over Small Language Models. CoRR abs/2402.08761 (2024) - [i235]Yutaro Yamada, Khyathi Raghavi Chandu, Bill Yuchen Lin, Jack Hessel, Ilker Yildirim, Yejin Choi:
L3GO: Language Agents with Chain-of-3D-Thoughts for Generating Unconventional Objects. CoRR abs/2402.09052 (2024) - [i234]Siyuan Wang, Zhongyu Wei, Yejin Choi, Xiang Ren:
Can LLMs Reason with Rules? Logic Scaffolding for Stress-Testing and Improving LLMs. CoRR abs/2402.11442 (2024) - [i233]Tejas Srinivasan, Jack Hessel, Tanmay Gupta, Bill Yuchen Lin, Yejin Choi, Jesse Thomason, Khyathi Raghavi Chandu:
Selective "Selective Prediction": Reducing Unnecessary Abstention in Vision-Language Reasoning. CoRR abs/2402.15610 (2024) - [i232]Aly M. Kassem, Omar Mahmoud, Niloofar Mireshghallah, Hyunwoo Kim, Yulia Tsvetkov, Yejin Choi, Sherif Saad, Santu Rana:
Alpaca against Vicuna: Using LLMs to Uncover Memorization of LLMs. CoRR abs/2403.04801 (2024) - [i231]Jaehun Jung, Ximing Lu, Liwei Jiang, Faeze Brahman, Peter West, Pang Wei Koh, Yejin Choi:
Information-Theoretic Distillation for Reference-less Summarization. CoRR abs/2403.13780 (2024) - [i230]Nathan Lambert, Valentina Pyatkin, Jacob Morrison, LJ Miranda, Bill Yuchen Lin, Khyathi Raghavi Chandu, Nouha Dziri, Sachin Kumar, Tom Zick, Yejin Choi, Noah A. Smith, Hannaneh Hajishirzi:
RewardBench: Evaluating Reward Models for Language Modeling. CoRR abs/2403.13787 (2024) - [i229]Jimin Mun, Liwei Jiang, Jenny T. Liang, Inyoung Cheong, Nicole DeCario, Yejin Choi, Tadayoshi Kohno, Maarten Sap:
Particip-AI: A Democratic Surveying Framework for Anticipating Future AI Use Cases, Harms and Benefits. CoRR abs/2403.14791 (2024) - [i228]Yu Ying Chiu, Liwei Jiang, Maria Antoniak, Chan Young Park, Shuyue Stella Li, Mehar Bhatia, Sahithya Ravi, Yulia Tsvetkov, Vered Shwartz, Yejin Choi:
CulturalTeaming: AI-Assisted Interactive Red-Teaming for Challenging LLMs' (Lack of) Multicultural Knowledge. CoRR abs/2404.06664 (2024) - [i227]Usman Anwar, Abulhair Saparov, Javier Rando, Daniel Paleka, Miles Turpin, Peter Hase, Ekdeep Singh Lubana, Erik Jenner, Stephen Casper, Oliver Sourbut, Benjamin L. Edelman, Zhaowei Zhang, Mario Günther, Anton Korinek, José Hernández-Orallo, Lewis Hammond, Eric J. Bigelow, Alexander Pan, Lauro Langosco, Tomasz Korbak, Heidi Zhang, Ruiqi Zhong, Seán Ó hÉigeartaigh, Gabriel Recchia, Giulio Corsi, Alan Chan, Markus Anderljung, Lilian Edwards, Yoshua Bengio, Danqi Chen, Samuel Albanie, Tegan Maharaj, Jakob N. Foerster, Florian Tramèr, He He, Atoosa Kasirzadeh, Yejin Choi, David Krueger:
Foundational Challenges in Assuring Alignment and Safety of Large Language Models. CoRR abs/2404.09932 (2024) - [i226]Huihan Li, Liwei Jiang, Jena D. Huang, Hyunwoo Kim, Sebastin Santy, Taylor Sorensen, Bill Yuchen Lin, Nouha Dziri, Xiang Ren, Yejin Choi:
CULTURE-GEN: Revealing Global Cultural Perception in Language Models through Natural Language Prompting. CoRR abs/2404.10199 (2024) - [i225]Wenting Zhao, Xiang Ren, Jack Hessel, Claire Cardie, Yejin Choi, Yuntian Deng:
WildChat: 1M ChatGPT Interaction Logs in the Wild. CoRR abs/2405.01470 (2024) - [i224]Yuntian Deng, Yejin Choi, Stuart M. Shieber:
From Explicit CoT to Implicit CoT: Learning to Internalize CoT Step by Step. CoRR abs/2405.14838 (2024) - [i223]Bill Yuchen Lin, Yuntian Deng, Khyathi Raghavi Chandu, Faeze Brahman, Abhilasha Ravichander, Valentina Pyatkin, Nouha Dziri, Ronan Le Bras, Yejin Choi:
WildBench: Benchmarking LLMs with Challenging Tasks from Real Users in the Wild. CoRR abs/2406.04770 (2024) - [i222]Seungone Kim, Juyoung Suk, Ji Yong Cho, Shayne Longpre, Chaeeun Kim, Dongkeun Yoon, Guijin Son, Yejin Choi, Sheikh Shafayat, Jinheon Baek, Sue Hyun Park, Hyeonbin Hwang, Jinkyung Jo, Hyowon Cho, Haebin Shin, Seongyun Lee, Hanseok Oh, Noah Lee, Namgyu Ho, Se June Joo, Miyoung Ko, Yoonjoo Lee, Hyungjoo Chae, Jamin Shin, Joel Jang, Seonghyeon Ye, Bill Yuchen Lin, Sean Welleck, Graham Neubig, Moontae Lee, Kyungjae Lee, Minjoon Seo:
The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models. CoRR abs/2406.05761 (2024) - [i221]Zhangchen Xu, Fengqing Jiang, Luyao Niu, Yuntian Deng, Radha Poovendran, Yejin Choi, Bill Yuchen Lin:
Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing. CoRR abs/2406.08464 (2024) - [i220]Hamish Ivison, Yizhong Wang, Jiacheng Liu, Zeqiu Wu, Valentina Pyatkin, Nathan Lambert, Noah A. Smith, Yejin Choi, Hannaneh Hajishirzi:
Unpacking DPO and PPO: Disentangling Best Practices for Learning from Preference Feedback. CoRR abs/2406.09279 (2024) - [i219]Yujie Lu, Dongfu Jiang, Wenhu Chen, William Yang Wang, Yejin Choi, Bill Yuchen Lin:
WildVision: Evaluating Vision-Language Models in the Wild with Human Preferences. CoRR abs/2406.11069 (2024) - [i218]Anas Awadalla, Le Xue, Oscar Lo, Manli Shu, Hannah Lee, Etash Kumar Guha, Matt Jordan, Sheng Shen, Mohamed Awadalla, Silvio Savarese, Caiming Xiong, Ran Xu, Yejin Choi, Ludwig Schmidt:
MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens. CoRR abs/2406.11271 (2024) - [i217]Shangbin Feng, Taylor Sorensen, Yuhan Liu, Jillian Fisher, Chan Young Park, Yejin Choi, Yulia Tsvetkov:
Modular Pluralism: Pluralistic Alignment via Multi-LLM Collaboration. CoRR abs/2406.15951 (2024) - [i216]Seungju Han, Kavel Rao, Allyson Ettinger, Liwei Jiang, Bill Yuchen Lin, Nathan Lambert, Yejin Choi, Nouha Dziri:
WildGuard: Open One-Stop Moderation Tools for Safety Risks, Jailbreaks, and Refusals of LLMs. CoRR abs/2406.18495 (2024) - [i215]Liwei Jiang, Kavel Rao, Seungju Han, Allyson Ettinger, Faeze Brahman, Sachin Kumar, Niloofar Mireshghallah, Ximing Lu, Maarten Sap, Yejin Choi, Nouha Dziri:
WildTeaming at Scale: From In-the-Wild Jailbreaks to (Adversarially) Safer Language Models. CoRR abs/2406.18510 (2024) - [i214]Jaeyoung Lee, Ximing Lu, Jack Hessel, Faeze Brahman, Youngjae Yu, Yonatan Bisk, Yejin Choi, Saadia Gabriel:
How to Train Your Fact Verifier: Knowledge Transfer with Multimodal Open Models. CoRR abs/2407.00369 (2024) - [i213]Khyathi Raghavi Chandu, Linjie Li, Anas Awadalla, Ximing Lu, Jae Sung Park, Jack Hessel, Lijuan Wang, Yejin Choi:
Certainly Uncertain: A Benchmark and Metric for Multimodal Epistemic and Aleatoric Awareness. CoRR abs/2407.01942 (2024) - [i212]Chani Jung, Dongkwan Kim, Jiho Jin, Jiseon Kim, Yeon Seonwoo, Yejin Choi, Alice Oh, Hyunwoo Kim:
Perceptions to Beliefs: Exploring Precursory Inferences for Theory of Mind in Large Language Models. CoRR abs/2407.06004 (2024) - [i211]Tong Chen, Akari Asai, Niloofar Mireshghallah, Sewon Min, James Grimmelmann, Yejin Choi, Hannaneh Hajishirzi, Luke Zettlemoyer, Pang Wei Koh:
CopyBench: Measuring Literal and Non-Literal Reproduction of Copyright-Protected Text in Language Model Generation. CoRR abs/2407.07087 (2024) - [i210]Niloofar Mireshghallah, Maria Antoniak, Yash More, Yejin Choi, Golnoosh Farnadi:
Trust No Bot: Discovering Personal Disclosures in Human-LLM Conversations in the Wild. CoRR abs/2407.11438 (2024) - [i209]Faeze Brahman, Sachin Kumar, Vidhisha Balachandran, Pradeep Dasigi, Valentina Pyatkin, Abhilasha Ravichander, Sarah Wiegreffe, Nouha Dziri, Khyathi Raghavi Chandu, Jack Hessel, Yulia Tsvetkov, Noah A. Smith, Yejin Choi, Hannaneh Hajishirzi:
The Art of Saying No: Contextual Noncompliance in Language Models. CoRR abs/2407.12043 (2024) - [i208]Jonathan Hayase, Alisa Liu, Yejin Choi, Sewoong Oh, Noah A. Smith:
Data Mixture Inference: What do BPE Tokenizers Reveal about their Training Data? CoRR abs/2407.16607 (2024) - [i207]Wenting Zhao, Tanya Goyal, Yu Ying Chiu, Liwei Jiang, Benjamin Newman, Abhilasha Ravichander, Khyathi Raghavi Chandu, Ronan Le Bras, Claire Cardie, Yuntian Deng, Yejin Choi:
WildHallucinations: Evaluating Long-form Factuality in LLMs with Real-World Entity Queries. CoRR abs/2407.17468 (2024) - [i206]Jaehun Jung, Faeze Brahman, Yejin Choi:
Trust or Escalate: LLM Judges with Provable Guarantees for Human Agreement. CoRR abs/2407.18370 (2024) - [i205]Le Xue, Manli Shu, Anas Awadalla, Jun Wang, An Yan, Senthil Purushwalkam, Honglu Zhou, Viraj Prabhu, Yutong Dai, Michael S. Ryoo, Shrikant Kendre, Jieyu Zhang, Can Qin, Shu Zhang, Chia-Chih Chen, Ning Yu, Juntao Tan, Tulika Manoj Awalgaonkar, Shelby Heinecke, Huan Wang, Yejin Choi, Ludwig Schmidt, Zeyuan Chen, Silvio Savarese, Juan Carlos Niebles, Caiming Xiong, Ran Xu:
xGen-MM (BLIP-3): A Family of Open Large Multimodal Models. CoRR abs/2408.08872 (2024) - [i204]Siyuan Wang, Zhongyu Wei, Yejin Choi, Xiang Ren:
Symbolic Working Memory Enhances Language Models for Complex Rule Application. CoRR abs/2408.13654 (2024) - [i203]Jillian Fisher, Skyler Hallinan, Ximing Lu, Mitchell L. Gordon, Zaïd Harchaoui, Yejin Choi:
StyleRemix: Interpretable Authorship Obfuscation via Distillation and Perturbation of Style Elements. CoRR abs/2408.15666 (2024) - [i202]Yuntian Deng, Wenting Zhao, Jack Hessel, Xiang Ren, Claire Cardie, Yejin Choi:
WildVis: Open Source Visualizer for Million-Scale Chat Logs in the Wild. CoRR abs/2409.03753 (2024) - [i201]Xuhui Zhou, Hyunwoo Kim, Faeze Brahman, Liwei Jiang, Hao Zhu, Ximing Lu, Frank Xu, Bill Yuchen Lin, Yejin Choi, Niloofar Mireshghallah, Ronan Le Bras, Maarten Sap:
HAICOSYSTEM: An Ecosystem for Sandboxing Safety Risks in Human-AI Interactions. CoRR abs/2409.16427 (2024) - [i200]Yu Ying Chiu, Liwei Jiang, Bill Yuchen Lin, Chan Young Park, Shuyue Stella Li, Sahithya Ravi, Mehar Bhatia, Maria Antoniak, Yulia Tsvetkov, Vered Shwartz, Yejin Choi:
CulturalBench: a Robust, Diverse and Challenging Benchmark on Measuring the (Lack of) Cultural Knowledge of LLMs. CoRR abs/2410.02677 (2024) - [i199]Yu Ying Chiu, Liwei Jiang, Yejin Choi:
DailyDilemmas: Revealing Value Preferences of LLMs with Quandaries of Daily Life. CoRR abs/2410.02683 (2024) - [i198]Liwei Jiang, Taylor Sorensen, Sydney Levine, Yejin Choi:
Can Language Models Reason about Individualistic Human Values and Preferences? CoRR abs/2410.03868 (2024) - [i197]Ximing Lu, Melanie Sclar, Skyler Hallinan, Niloofar Mireshghallah, Jiacheng Liu, Seungju Han, Allyson Ettinger, Liwei Jiang, Khyathi Raghavi Chandu, Nouha Dziri, Yejin Choi:
AI as Humanity's Salieri: Quantifying Linguistic Creativity of Language Models via Systematic Attribution of Machine Text against Web Text. CoRR abs/2410.04265 (2024) - [i196]Jared Moore, Yejin Choi, Sydney Levine:
Intuitions of Compromise: Utilitarianism vs. Contractualism. CoRR abs/2410.05496 (2024) - [i195]Mohammadreza Salehi, Jae Sung Park, Tanush Yadav, Aditya Kusupati, Ranjay Krishna, Yejin Choi, Hannaneh Hajishirzi, Ali Farhadi:
ActionAtlas: A VideoQA Benchmark for Domain-specialized Action Recognition. CoRR abs/2410.05774 (2024) - [i194]Jillian Fisher, Shangbin Feng, Robert Aron, Thomas Richardson, Yejin Choi, Daniel W. Fisher, Jennifer Pan, Yulia Tsvetkov, Katharina Reinecke:
Biased AI can Influence Political Decision-Making. CoRR abs/2410.06415 (2024) - [i193]Shangbin Feng, Zifeng Wang, Yike Wang, Sayna Ebrahimi, Hamid Palangi, Lesly Miculicich, Achin Kulshrestha, Nathalie Rauschmayr, Yejin Choi, Yulia Tsvetkov, Chen-Yu Lee, Tomas Pfister:
Model Swarms: Collaborative Search to Adapt LLM Experts via Swarm Intelligence. CoRR abs/2410.11163 (2024) - [i192]Yuling Gu, Oyvind Tafjord, Hyunwoo Kim, Jared Moore, Ronan Le Bras, Peter Clark, Yejin Choi:
SimpleToM: Exposing the Gap between Explicit ToM Inference and Implicit ToM Application in LLMs. CoRR abs/2410.13648 (2024) - [i191]Michael J. Q. Zhang, Zhilin Wang, Jena D. Hwang, Yi Dong, Olivier Delalleau, Yejin Choi, Eunsol Choi, Xiang Ren, Valentina Pyatkin:
Diverging Preferences: When do Annotators Disagree and do Models Know? CoRR abs/2410.14632 (2024) - [i190]Jing-Jing Li, Valentina Pyatkin, Max Kleiman-Weiner, Liwei Jiang, Nouha Dziri, Anne G. E. Collins, Jana Schaich Borg, Maarten Sap, Yejin Choi, Sydney Levine:
SafetyAnalyst: Interpretable, transparent, and steerable LLM safety moderation. CoRR abs/2410.16665 (2024) - 2023
- [j13]Krishna Pillutla, Lang Liu, John Thickstun, Sean Welleck, Swabha Swayamdipta, Rowan Zellers, Sewoong Oh, Yejin Choi, Zaïd Harchaoui:
MAUVE Scores for Generative Models: Theory and Practice. J. Mach. Learn. Res. 24: 356:1-356:92 (2023) - [j12]Alon Y. Halevy, Yejin Choi, Avrilia Floratou, Michael J. Franklin, Natasha F. Noy, Haixun Wang:
Will LLMs reshape, supercharge, or kill data science? Proc. VLDB Endow. 16(12): 4114-4115 (2023) - [j11]Yejin Choi:
Common Sense: the Dark Matter of Language and Intelligence. Proc. VLDB Endow. 16(12): 4139 (2023) - [j10]Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza, Ambrose Slone, Ameet Rahane, Anantharaman S. Iyer, Anders Andreassen, Andrea Madotto, Andrea Santilli, Andreas Stuhlmüller, Andrew M. Dai, Andrew La, Andrew K. Lampinen, Andy Zou, Angela Jiang, Angelica Chen, Anh Vuong, Animesh Gupta, Anna Gottardi, Antonio Norelli, Anu Venkatesh, Arash Gholamidavoodi, Arfa Tabassum, Arul Menezes, Arun Kirubarajan, Asher Mullokandov, Ashish Sabharwal, Austin Herrick, Avia Efrat, Aykut Erdem, Ayla Karakas, B. Ryan Roberts, Bao Sheng Loe, Barret Zoph, Bartlomiej Bojanowski, Batuhan Özyurt, Behnam Hedayatnia, Behnam Neyshabur, Benjamin Inden, Benno Stein, Berk Ekmekci, Bill Yuchen Lin, Blake Howald, Bryan Orinion, Cameron Diao, Cameron Dour, Catherine Stinson, Cedrick Argueta, Cèsar Ferri Ramírez, Chandan Singh, Charles Rathkopf, Chenlin Meng, Chitta Baral, Chiyu Wu, Chris Callison-Burch, Chris Waites, Christian Voigt, Christopher D. Manning, Christopher Potts, Cindy Ramirez, Clara E. Rivera, Clemencia Siro, Colin Raffel, Courtney Ashcraft, Cristina Garbacea, Damien Sileo, Dan Garrette, Dan Hendrycks, Dan Kilman, Dan Roth, Daniel Freeman, Daniel Khashabi, Daniel Levy, Daniel Moseguí González, Danielle Perszyk, Danny Hernandez, Danqi Chen, Daphne Ippolito, Dar Gilboa, David Dohan, David Drakard, David Jurgens, Debajyoti Datta, Deep Ganguli, Denis Emelin, Denis Kleyko, Deniz Yuret, Derek Chen, Derek Tam, Dieuwke Hupkes, Diganta Misra, Dilyar Buzan, Dimitri Coelho Mollo, Diyi Yang, Dong-Ho Lee, Dylan Schrader, Ekaterina Shutova, Ekin Dogus Cubuk, Elad Segal, Eleanor Hagerman, Elizabeth Barnes, Elizabeth Donoway, Ellie Pavlick, Emanuele Rodolà, Emma Lam, Eric Chu, Eric Tang, Erkut Erdem, Ernie Chang, Ethan A. Chi, Ethan Dyer, Ethan J. Jerzak, Ethan Kim, Eunice Engefu Manyasi, Evgenii Zheltonozhskii, Fanyue Xia, Fatemeh Siar, Fernando Martínez-Plumed, Francesca Happé, François Chollet, Frieda Rong, Gaurav Mishra, Genta Indra Winata, Gerard de Melo, Germán Kruszewski, Giambattista Parascandolo, Giorgio Mariani, Gloria Wang, Gonzalo Jaimovitch-López, Gregor Betz, Guy Gur-Ari, Hana Galijasevic, Hannah Kim, Hannah Rashkin, Hannaneh Hajishirzi, Harsh Mehta, Hayden Bogar, Henry Shevlin, Hinrich Schütze, Hiromu Yakura, Hongming Zhang, Hugh Mee Wong, Ian Ng, Isaac Noble, Jaap Jumelet, Jack Geissinger, Jackson Kernion, Jacob Hilton, Jaehoon Lee, Jaime Fernández Fisac, James B. Simon, James Koppel, James Zheng, James Zou, Jan Kocon, Jana Thompson, Janelle Wingfield, Jared Kaplan, Jarema Radom, Jascha Sohl-Dickstein, Jason Phang, Jason Wei, Jason Yosinski, Jekaterina Novikova, Jelle Bosscher, Jennifer Marsh, Jeremy Kim, Jeroen Taal, Jesse H. Engel, Jesujoba Alabi, Jiacheng Xu, Jiaming Song, Jillian Tang, Joan Waweru, John Burden, John Miller, John U. Balis, Jonathan Batchelder, Jonathan Berant, Jörg Frohberg, Jos Rozen, José Hernández-Orallo, Joseph Boudeman, Joseph Guerr, Joseph Jones, Joshua B. Tenenbaum, Joshua S. Rule, Joyce Chua, Kamil Kanclerz, Karen Livescu, Karl Krauth, Karthik Gopalakrishnan, Katerina Ignatyeva, Katja Markert, Kaustubh D. Dhole, Kevin Gimpel, Kevin Omondi, Kory Mathewson, Kristen Chiafullo, Ksenia Shkaruta, Kumar Shridhar, Kyle McDonell, Kyle Richardson, Laria Reynolds, Leo Gao, Li Zhang, Liam Dugan, Lianhui Qin, Lidia Contreras Ochando, Louis-Philippe Morency, Luca Moschella, Lucas Lam, Lucy Noble, Ludwig Schmidt, Luheng He, Luis Oliveros Colón, Luke Metz, Lütfi Kerem Senel, Maarten Bosma, Maarten Sap, Maartje ter Hoeve, Maheen Farooqi, Manaal Faruqui, Mantas Mazeika, Marco Baturan, Marco Marelli, Marco Maru, María José Ramírez-Quintana, Marie Tolkiehn, Mario Giulianelli, Martha Lewis, Martin Potthast, Matthew L. Leavitt, Matthias Hagen, Mátyás Schubert, Medina Baitemirova, Melody Arnaud, Melvin McElrath, Michael A. Yee, Michael Cohen, Michael Gu, Michael I. Ivanitskiy, Michael Starritt, Michael Strube, Michal Swedrowski, Michele Bevilacqua, Michihiro Yasunaga, Mihir Kale, Mike Cain, Mimee Xu, Mirac Suzgun, Mitch Walker, Mo Tiwari, Mohit Bansal, Moin Aminnaseri, Mor Geva, Mozhdeh Gheini, Mukund Varma T., Nanyun Peng, Nathan A. Chi, Nayeon Lee, Neta Gur-Ari Krakover, Nicholas Cameron, Nicholas Roberts, Nick Doiron, Nicole Martinez, Nikita Nangia, Niklas Deckers, Niklas Muennighoff, Nitish Shirish Keskar, Niveditha Iyer, Noah Constant, Noah Fiedel, Nuan Wen, Oliver Zhang, Omar Agha, Omar Elbaghdadi, Omer Levy, Owain Evans, Pablo Antonio Moreno Casares, Parth Doshi, Pascale Fung, Paul Pu Liang, Paul Vicol, Pegah Alipoormolabashi, Peiyuan Liao, Percy Liang, Peter Chang, Peter Eckersley, Phu Mon Htut, Pinyu Hwang, Piotr Milkowski, Piyush Patil, Pouya Pezeshkpour, Priti Oli, Qiaozhu Mei, Qing Lyu, Qinlang Chen, Rabin Banjade, Rachel Etta Rudolph, Raefer Gabriel, Rahel Habacker, Ramon Risco, Raphaël Millière, Rhythm Garg, Richard Barnes, Rif A. Saurous, Riku Arakawa, Robbe Raymaekers, Robert Frank, Rohan Sikand, Roman Novak, Roman Sitelew, Ronan LeBras, Rosanne Liu, Rowan Jacobs, Rui Zhang, Ruslan Salakhutdinov, Ryan Chi, Ryan Lee, Ryan Stovall, Ryan Teehan, Rylan Yang, Sahib Singh, Saif M. Mohammad, Sajant Anand, Sam Dillavou, Sam Shleifer, Sam Wiseman, Samuel Gruetter, Samuel R. Bowman, Samuel S. Schoenholz, Sanghyun Han, Sanjeev Kwatra, Sarah A. Rous, Sarik Ghazarian, Sayan Ghosh, Sean Casey, Sebastian Bischoff, Sebastian Gehrmann, Sebastian Schuster, Sepideh Sadeghi, Shadi Hamdan, Sharon Zhou, Shashank Srivastava, Sherry Shi, Shikhar Singh, Shima Asaadi, Shixiang Shane Gu, Shubh Pachchigar, Shubham Toshniwal, Shyam Upadhyay, Shyamolima (Shammie) Debnath, Siamak Shakeri, Simon Thormeyer, Simone Melzi, Siva Reddy, Sneha Priscilla Makini, Soo-Hwan Lee, Spencer Torene, Sriharsha Hatwar, Stanislas Dehaene, Stefan Divic, Stefano Ermon, Stella Biderman, Stephanie Lin, Stephen Prasad, Steven T. Piantadosi, Stuart M. Shieber, Summer Misherghi, Svetlana Kiritchenko, Swaroop Mishra, Tal Linzen, Tal Schuster, Tao Li, Tao Yu, Tariq Ali, Tatsu Hashimoto, Te-Lin Wu, Théo Desbordes, Theodore Rothschild, Thomas Phan, Tianle Wang, Tiberius Nkinyili, Timo Schick, Timofei Kornev, Titus Tunduny, Tobias Gerstenberg, Trenton Chang, Trishala Neeraj, Tushar Khot, Tyler Shultz, Uri Shaham, Vedant Misra, Vera Demberg, Victoria Nyamai, Vikas Raunak, Vinay V. Ramasesh, Vinay Uday Prabhu, Vishakh Padmakumar, Vivek Srikumar, William Fedus, William Saunders, William Zhang, Wout Vossen, Xiang Ren, Xiaoyu Tong, Xinran Zhao, Xinyi Wu, Xudong Shen, Yadollah Yaghoobzadeh, Yair Lakretz, Yangqiu Song, Yasaman Bahri, Yejin Choi, Yichi Yang, Yiding Hao, Yifu Chen, Yonatan Belinkov, Yu Hou, Yufang Hou, Yuntao Bai, Zachary Seid, Zhuoye Zhao, Zijian Wang, Zijie J. Wang, Zirui Wang, Ziyi Wu:
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models. Trans. Mach. Learn. Res. 2023 (2023) - [c213]Skyler Hallinan, Alisa Liu, Yejin Choi, Maarten Sap:
Detoxifying Text with MaRCo: Controllable Revision with Experts and Anti-Experts. ACL (2) 2023: 228-242 - [c212]Jack Hessel, Ana Marasovic, Jena D. Hwang, Lillian Lee, Jeff Da, Rowan Zellers, Robert Mankoff, Yejin Choi:
Do Androids Laugh at Electric Sheep? Humor "Understanding" Benchmarks from The New Yorker Caption Contest. ACL (1) 2023: 688-714 - [c211]Hanjie Chen, Faeze Brahman, Xiang Ren, Yangfeng Ji, Yejin Choi, Swabha Swayamdipta:
REV: Information-Theoretic Evaluation of Free-Text Rationales. ACL (1) 2023: 2007-2030 - [c210]Liunian Harold Li, Jack Hessel, Youngjae Yu, Xiang Ren, Kai-Wei Chang, Yejin Choi:
Symbolic Chain-of-Thought Distillation: Small Models Can Also "Think" Step-by-Step. ACL (1) 2023: 2665-2679 - [c209]Wangchunshu Zhou, Ronan Le Bras, Yejin Choi:
Commonsense Knowledge Transfer for Pre-trained Language Models. ACL (Findings) 2023: 5946-5960 - [c208]Hwaran Lee, Seokhee Hong, Joonsuk Park, Takyoung Kim, Meeyoung Cha, Yejin Choi, Byoung Pil Kim, Gunhee Kim, Eun-Ju Lee, Yong Lim, Alice Oh, Sangchul Park, Jung-Woo Ha:
SQuARe: A Large-Scale Dataset of Sensitive Questions and Acceptable Responses Created through Human-Machine Collaboration. ACL (1) 2023: 6692-6712 - [c207]Brihi Joshi, Ziyi Liu, Sahana Ramnath, Aaron Chan, Zhewei Tong, Shaoliang Nie, Qifan Wang, Yejin Choi, Xiang Ren:
Are Machine Rationales (Not) Useful to Humans? Measuring and Improving Human Utility of Free-text Rationales. ACL (1) 2023: 7103-7128 - [c206]Chandra Bhagavatula, Jena D. Hwang, Doug Downey, Ronan Le Bras, Ximing Lu, Lianhui Qin, Keisuke Sakaguchi, Swabha Swayamdipta, Peter West, Yejin Choi:
I2D2: Inductive Knowledge Distillation with NeuroLogic and Self-Imitation. ACL (1) 2023: 9614-9630 - [c205]Wangchunshu Zhou, Ronan Le Bras, Yejin Choi:
Modular Transformers: Compressing Transformers into Modularized Layers for Flexible Efficient Inference. ACL (Findings) 2023: 10452-10465 - [c204]