Dan Roth
Person information
- affiliation: University of Pennsylvania, USA
- affiliation (former): University of Illinois Urbana-Champaign, IL, USA
2020 – today
- 2024
- [j63]Bonan Min, Hayley Ross, Elior Sulem, Amir Pouran Ben Veyseh, Thien Huu Nguyen, Oscar Sainz, Eneko Agirre, Ilana Heintz, Dan Roth:
Recent Advances in Natural Language Processing via Large Pre-trained Language Models: A Survey. ACM Comput. Surv. 56(2): 30:1-30:40 (2024) - [j62]Kevin Xie, William K. S. Ojemann, Ryan S. Gallagher, Russell T. Shinohara, Alfredo Lucas, Chloe E. Hill, Roy H. Hamilton, Kevin B. Johnson, Dan Roth, Brian Litt, Colin A. Ellis:
Disparities in seizure outcomes revealed by large language models. J. Am. Medical Informatics Assoc. 31(6): 1348-1355 (2024) - [c413]Aparna Elangovan, Ling Liu, Lei Xu, Sravan Babu Bodapati, Dan Roth:
ConSiDERS-The-Human Evaluation Framework: Rethinking Human Evaluation for Generative Large Language Models. ACL (1) 2024: 1137-1160 - [c412]Shubhankar Singh, Purvi Chaurasia, Yerram Varun, Pranshu Pandya, Vatsal Gupta, Vivek Gupta, Dan Roth:
FlowVQA: Mapping Multimodal Logic in Visual Question Answering with Flowcharts. ACL (Findings) 2024: 1330-1350 - [c411]Peter Baile Chen, Yi Zhang, Dan Roth:
Is Table Retrieval a Solved Problem? Exploring Join-Aware Multi-Table Retrieval. ACL (1) 2024: 2687-2699 - [c410]Pragya Srivastava, Manuj Malik, Vivek Gupta, Tanuja Ganu, Dan Roth:
Evaluating LLMs' Mathematical Reasoning in Financial Document Question Answering. ACL (Findings) 2024: 3853-3878 - [c409]Yangruibo Ding, Zijian Wang, Wasi Uddin Ahmad, Murali Krishna Ramanathan, Ramesh Nallapati, Parminder Bhatia, Dan Roth, Bing Xiang:
CoCoMIC: Code Completion by Jointly Modeling In-file and Cross-file Context. LREC/COLING 2024: 3433-3445 - [c408]Haoyu Wang, Hongming Zhang, Kaiqiang Song, Dong Yu, Dan Roth:
Event Semantic Classification in Context. EACL (Findings) 2024: 1395-1407 - [c407]Xingyu Fu, Yushi Hu, Bangzheng Li, Yu Feng, Haoyu Wang, Xudong Lin, Dan Roth, Noah A. Smith, Wei-Chiu Ma, Ranjay Krishna:
BLINK: Multimodal Large Language Models Can See but Not Perceive. ECCV (23) 2024: 148-166 - [c406]Haoyu Wang, Tao Li, Zhiwei Deng, Dan Roth, Yang Li:
Devil's Advocate: Anticipatory Reflection for LLM Agents. EMNLP (Findings) 2024: 966-978 - [c405]Haoyu Wang, Fengze Liu, Jiayao Zhang, Dan Roth, Kyle Richardson:
Event Causality Identification with Synthetic Control. EMNLP 2024: 1725-1737 - [c404]Bowen Jiang, Yangxinyu Xie, Zhuoqun Hao, Xiaomeng Wang, Tanwi Mallick, Weijie Su, Camillo J. Taylor, Dan Roth:
A Peek into Token Bias: Large Language Models Are Not Yet Genuine Reasoners. EMNLP 2024: 4722-4756 - [c403]Suyash Vardhan Mathur, Jainit Sushil Bafna, Kunal Kartik, Harshita Khandelwal, Manish Shrivastava, Vivek Gupta, Mohit Bansal, Dan Roth:
Knowledge-Aware Reasoning over Multimodal Semi-structured Tables. EMNLP (Findings) 2024: 14054-14073 - [c402]Srija Mukhopadhyay, Adnan Qidwai, Aparna Garimella, Pritika Ramu, Vivek Gupta, Dan Roth:
Unraveling the Truth: Do VLMs really Understand Charts? A Deep Dive into Consistency and Robustness. EMNLP (Findings) 2024: 16696-16717 - [c401]Vatsal Gupta, Pranshu Pandya, Tushar Kataria, Vivek Gupta, Dan Roth:
Evaluating Concurrent Robustness of Language Models Across Diverse Challenge Sets. EMNLP 2024: 22162-22184 - [c400]Dejiao Zhang, Wasi Uddin Ahmad, Ming Tan, Hantian Ding, Ramesh Nallapati, Dan Roth, Xiaofei Ma, Bing Xiang:
Code Representation Learning at Scale. ICLR 2024 - [c399]Hantian Ding, Zijian Wang, Giovanni Paolini, Varun Kumar, Anoop Deoras, Dan Roth, Stefano Soatto:
Fewer Truncations Improve Language Modeling. ICML 2024 - [c398]Xiaodong Yu, Hao Cheng, Xiaodong Liu, Dan Roth, Jianfeng Gao:
ReEval: Automatic Hallucination Evaluation for Retrieval-Augmented Large Language Models via Transferable Adversarial Attacks. NAACL-HLT (Findings) 2024: 1333-1351 - [c397]Sihao Chen, Hongming Zhang, Tong Chen, Ben Zhou, Wenhao Yu, Dian Yu, Baolin Peng, Hongwei Wang, Dan Roth, Dong Yu:
Sub-Sentence Encoder: Contrastive Learning of Propositional Semantic Representations. NAACL-HLT 2024: 1596-1609 - [c396]Hangfeng He, Hongming Zhang, Dan Roth:
SocREval: Large Language Models with the Socratic Method for Reference-free Reasoning Evaluation. NAACL-HLT (Findings) 2024: 2736-2764 - [c395]Chaitanya Malaviya, Subin Lee, Sihao Chen, Elizabeth Sieber, Mark Yatskar, Dan Roth:
ExpertQA: Expert-Curated Questions and Attributed Answers. NAACL-HLT 2024: 3025-3045 - [c394]Chaitanya Malaviya, Subin Lee, Dan Roth, Mark Yatskar:
What if you said that differently?: How Explanation Formats Affect Human Feedback Efficacy and User Perception. NAACL-HLT 2024: 3046-3065 - [c393]Bangzheng Li, Ben Zhou, Fei Wang, Xingyu Fu, Dan Roth, Muhao Chen:
Deceptive Semantic Shortcuts on Reasoning Chains: How Far Can Models Go without Hallucination? NAACL-HLT 2024: 7675-7688 - [i215]Dejiao Zhang, Wasi Uddin Ahmad, Ming Tan, Hantian Ding, Ramesh Nallapati, Dan Roth, Xiaofei Ma, Bing Xiang:
Code Representation Learning At Scale. CoRR abs/2402.01935 (2024) - [i214]James Y. Huang, Sailik Sengupta, Daniele Bonadiman, Yi'an Lai, Arshit Gupta, Nikolaos Pappas, Saab Mansour, Katrin Kirchhoff, Dan Roth:
DeAL: Decoding-time Alignment for Large Language Models. CoRR abs/2402.06147 (2024) - [i213]Pragya Srivastava, Manuj Malik, Vivek Gupta, Tanuja Ganu, Dan Roth:
Evaluating LLMs' Mathematical Reasoning in Financial Document Question Answering. CoRR abs/2402.11194 (2024) - [i212]Fei Wang, Chao Shang, Sarthak Jain, Shuai Wang, Qiang Ning, Bonan Min, Vittorio Castelli, Yassine Benajiba, Dan Roth:
From Instructions to Constraints: Language Model Alignment with Automatic Constraint Verification. CoRR abs/2403.06326 (2024) - [i211]Bowen Jiang, Zhijun Zhuang, Shreyas S. Shivakumar, Dan Roth, Camillo J. Taylor:
Multi-Agent VQA: Exploring Multi-Agent Foundation Models in Zero-Shot Visual Question Answering. CoRR abs/2403.14783 (2024) - [i210]Ben Zhou, Hongming Zhang, Sihao Chen, Dian Yu, Hongwei Wang, Baolin Peng, Dan Roth, Dong Yu:
Conceptual and Unbiased Reasoning in Language Models. CoRR abs/2404.00205 (2024) - [i209]Peter Baile Chen, Yi Zhang, Dan Roth:
Is Table Retrieval a Solved Problem? Join-Aware Multi-Table Retrieval. CoRR abs/2404.09889 (2024) - [i208]Hantian Ding, Zijian Wang, Giovanni Paolini, Varun Kumar, Anoop Deoras, Dan Roth, Stefano Soatto:
Fewer Truncations Improve Language Modeling. CoRR abs/2404.10830 (2024) - [i207]Xingyu Fu, Yushi Hu, Bangzheng Li, Yu Feng, Haoyu Wang, Xudong Lin, Dan Roth, Noah A. Smith, Wei-Chiu Ma, Ranjay Krishna:
BLINK: Multimodal Large Language Models Can See but Not Perceive. CoRR abs/2404.12390 (2024) - [i206]Yu Feng, Ben Zhou, Weidong Lin, Dan Roth:
BIRD: A Trustworthy Bayesian Inference Framework for Large Language Models. CoRR abs/2404.12494 (2024) - [i205]Haoyu Wang, Tao Li, Zhiwei Deng, Dan Roth, Yang Li:
Devil's Advocate: Anticipatory Reflection for LLM Agents. CoRR abs/2405.16334 (2024) - [i204]Aparna Elangovan, Ling Liu, Lei Xu, Sravan Bodapati, Dan Roth:
ConSiDERS-The-Human Evaluation Framework: Rethinking Human Evaluation for Generative Large Language Models. CoRR abs/2405.18638 (2024) - [i203]Xingyu Fu, Muyu He, Yujie Lu, William Yang Wang, Dan Roth:
Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? CoRR abs/2406.07546 (2024) - [i202]Yushi Hu, Weijia Shi, Xingyu Fu, Dan Roth, Mari Ostendorf, Luke Zettlemoyer, Noah A. Smith, Ranjay Krishna:
Visual Sketchpad: Sketching as a Visual Chain of Thought for Multimodal Language Models. CoRR abs/2406.09403 (2024) - [i201]Fei Wang, Xingyu Fu, James Y. Huang, Zekun Li, Qin Liu, Xiaogeng Liu, Mingyu Derek Ma, Nan Xu, Wenxuan Zhou, Kai Zhang, Tianyi Lorena Yan, Wenjie Jacky Mo, Hsiang-Hui Liu, Pan Lu, Chunyuan Li, Chaowei Xiao, Kai-Wei Chang, Dan Roth, Sheng Zhang, Hoifung Poon, Muhao Chen:
MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding. CoRR abs/2406.09411 (2024) - [i200]Bowen Jiang, Yangxinyu Xie, Zhuoqun Hao, Xiaomeng Wang, Tanwi Mallick, Weijie J. Su, Camillo J. Taylor, Dan Roth:
A Peek into Token Bias: Large Language Models Are Not Yet Genuine Reasoners. CoRR abs/2406.11050 (2024) - [i199]Bangzheng Li, Ben Zhou, Xingyu Fu, Fei Wang, Dan Roth, Muhao Chen:
FamiCom: Further Demystifying Prompts for Language Models with Task-Agnostic Performance Estimation. CoRR abs/2406.11243 (2024) - [i198]Shubhankar Singh, Purvi Chaurasia, Yerram Varun, Pranshu Pandya, Vatsal Gupta, Vivek Gupta, Dan Roth:
FlowVQA: Mapping Multimodal Logic in Visual Question Answering with Flowcharts. CoRR abs/2406.19237 (2024) - [i197]Nikhil Abhyankar, Vivek Gupta, Dan Roth, Chandan K. Reddy:
H-STAR: LLM-driven Hybrid SQL-Text Adaptive Reasoning on Tables. CoRR abs/2407.05952 (2024) - [i196]Kaifu Wang, Efthymia Tsamoura, Dan Roth:
On Characterizing and Mitigating Imbalances in Multi-Instance Partial Label Learning. CoRR abs/2407.10000 (2024) - [i195]Pranshu Pandya, Agney S. Talwarr, Vatsal Gupta, Tushar Kataria, Vivek Gupta, Dan Roth:
NTSEBENCH: Cognitive Reasoning Benchmark for Vision Language Models. CoRR abs/2407.10380 (2024) - [i194]Srija Mukhopadhyay, Adnan Qidwai, Aparna Garimella, Pritika Ramu, Vivek Gupta, Dan Roth:
Unraveling the Truth: Do LLMs really Understand Charts? A Deep Dive into Consistency and Robustness. CoRR abs/2407.11229 (2024) - [i193]Irwin Deng, Kushagra Dixit, Vivek Gupta, Dan Roth:
Enhancing Temporal Understanding in LLMs for Semi-structured Tables. CoRR abs/2407.16030 (2024) - [i192]Suyash Vardhan Mathur, Jainit Sushil Bafna, Kunal Kartik, Harshita Khandelwal, Manish Shrivastava, Vivek Gupta, Mohit Bansal, Dan Roth:
Knowledge-Aware Reasoning over Multimodal Semi-structured Tables. CoRR abs/2408.13860 (2024) - [i191]Srija Mukhopadhyay, Abhishek Rajgaria, Prerana Khatiwada, Vivek Gupta, Dan Roth:
MAPWise: Evaluating Vision-Language Models for Advanced Map Queries. CoRR abs/2409.00255 (2024) - [i190]Qingru Zhang, Xiaodong Yu, Chandan Singh, Xiaodong Liu, Liyuan Liu, Jianfeng Gao, Tuo Zhao, Dan Roth, Hao Cheng:
Model Tells Itself Where to Attend: Faithfulness Meets Automatic Attention Steering. CoRR abs/2409.10790 (2024) - [i189]Tianyue Ou, Frank F. Xu, Aman Madaan, Jiarui Liu, Robert Lo, Abishek Sridhar, Sudipta Sengupta, Dan Roth, Graham Neubig, Shuyan Zhou:
Synatra: Turning Indirect Knowledge into Direct Demonstrations for Digital Agents at Scale. CoRR abs/2409.15637 (2024) - [i188]Aparna Elangovan, Jongwoo Ko, Lei Xu, Mahsa Elyasi, Ling Liu, Sravan Bodapati, Dan Roth:
Beyond correlation: The impact of human uncertainty in measuring the effectiveness of automatic evaluation and LLM-as-a-judge. CoRR abs/2410.03775 (2024) - [i187]Jiashu He, Mingyu Derek Ma, Jinxuan Fan, Dan Roth, Wei Wang, Alejandro Ribeiro:
GIVE: Structured Reasoning with Knowledge Graph Inspired Veracity Extrapolation. CoRR abs/2410.08475 (2024) - [i186]Siyi Liu, Qiang Ning, Kishaloy Halder, Wei Xiao, Zheng Qi, Phu Mon Htut, Yi Zhang, Neha Anna John, Bonan Min, Yassine Benajiba, Dan Roth:
Open Domain Question Answering with Conflicting Contexts. CoRR abs/2410.12311 (2024) - [i185]Xiaodong Yu, Ben Zhou, Hao Cheng, Dan Roth:
ReasonAgain: Using Extractable Symbolic Programs to Evaluate Mathematical Reasoning. CoRR abs/2410.19056 (2024) - [i184]Yahan Yang, Soham Dan, Dan Roth, Insup Lee:
Benchmarking LLM Guardrails in Handling Multilingual Toxicity. CoRR abs/2410.22153 (2024)
- 2023
- [j61]Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza, Ambrose Slone, Ameet Rahane, Anantharaman S. Iyer, Anders Andreassen, Andrea Madotto, Andrea Santilli, Andreas Stuhlmüller, Andrew M. Dai, Andrew La, Andrew K. Lampinen, Andy Zou, Angela Jiang, Angelica Chen, Anh Vuong, Animesh Gupta, Anna Gottardi, Antonio Norelli, Anu Venkatesh, Arash Gholamidavoodi, Arfa Tabassum, Arul Menezes, Arun Kirubarajan, Asher Mullokandov, Ashish Sabharwal, Austin Herrick, Avia Efrat, Aykut Erdem, Ayla Karakas, B. Ryan Roberts, Bao Sheng Loe, Barret Zoph, Bartlomiej Bojanowski, Batuhan Özyurt, Behnam Hedayatnia, Behnam Neyshabur, Benjamin Inden, Benno Stein, Berk Ekmekci, Bill Yuchen Lin, Blake Howald, Bryan Orinion, Cameron Diao, Cameron Dour, Catherine Stinson, Cedrick Argueta, Cèsar Ferri Ramírez, Chandan Singh, Charles Rathkopf, Chenlin Meng, Chitta Baral, Chiyu Wu, Chris Callison-Burch, Chris Waites, Christian Voigt, Christopher D. Manning, Christopher Potts, Cindy Ramirez, Clara E. 
Rivera, Clemencia Siro, Colin Raffel, Courtney Ashcraft, Cristina Garbacea, Damien Sileo, Dan Garrette, Dan Hendrycks, Dan Kilman, Dan Roth, Daniel Freeman, Daniel Khashabi, Daniel Levy, Daniel Moseguí González, Danielle Perszyk, Danny Hernandez, Danqi Chen, Daphne Ippolito, Dar Gilboa, David Dohan, David Drakard, David Jurgens, Debajyoti Datta, Deep Ganguli, Denis Emelin, Denis Kleyko, Deniz Yuret, Derek Chen, Derek Tam, Dieuwke Hupkes, Diganta Misra, Dilyar Buzan, Dimitri Coelho Mollo, Diyi Yang, Dong-Ho Lee, Dylan Schrader, Ekaterina Shutova, Ekin Dogus Cubuk, Elad Segal, Eleanor Hagerman, Elizabeth Barnes, Elizabeth Donoway, Ellie Pavlick, Emanuele Rodolà, Emma Lam, Eric Chu, Eric Tang, Erkut Erdem, Ernie Chang, Ethan A. Chi, Ethan Dyer, Ethan J. Jerzak, Ethan Kim, Eunice Engefu Manyasi, Evgenii Zheltonozhskii, Fanyue Xia, Fatemeh Siar, Fernando Martínez-Plumed, Francesca Happé, François Chollet, Frieda Rong, Gaurav Mishra, Genta Indra Winata, Gerard de Melo, Germán Kruszewski, Giambattista Parascandolo, Giorgio Mariani, Gloria Wang, Gonzalo Jaimovitch-López, Gregor Betz, Guy Gur-Ari, Hana Galijasevic, Hannah Kim, Hannah Rashkin, Hannaneh Hajishirzi, Harsh Mehta, Hayden Bogar, Henry Shevlin, Hinrich Schütze, Hiromu Yakura, Hongming Zhang, Hugh Mee Wong, Ian Ng, Isaac Noble, Jaap Jumelet, Jack Geissinger, Jackson Kernion, Jacob Hilton, Jaehoon Lee, Jaime Fernández Fisac, James B. Simon, James Koppel, James Zheng, James Zou, Jan Kocon, Jana Thompson, Janelle Wingfield, Jared Kaplan, Jarema Radom, Jascha Sohl-Dickstein, Jason Phang, Jason Wei, Jason Yosinski, Jekaterina Novikova, Jelle Bosscher, Jennifer Marsh, Jeremy Kim, Jeroen Taal, Jesse H. Engel, Jesujoba Alabi, Jiacheng Xu, Jiaming Song, Jillian Tang, Joan Waweru, John Burden, John Miller, John U. Balis, Jonathan Batchelder, Jonathan Berant, Jörg Frohberg, Jos Rozen, José Hernández-Orallo, Joseph Boudeman, Joseph Guerr, Joseph Jones, Joshua B. Tenenbaum, Joshua S. 
Rule, Joyce Chua, Kamil Kanclerz, Karen Livescu, Karl Krauth, Karthik Gopalakrishnan, Katerina Ignatyeva, Katja Markert, Kaustubh D. Dhole, Kevin Gimpel, Kevin Omondi, Kory Mathewson, Kristen Chiafullo, Ksenia Shkaruta, Kumar Shridhar, Kyle McDonell, Kyle Richardson, Laria Reynolds, Leo Gao, Li Zhang, Liam Dugan, Lianhui Qin, Lidia Contreras Ochando, Louis-Philippe Morency, Luca Moschella, Lucas Lam, Lucy Noble, Ludwig Schmidt, Luheng He, Luis Oliveros Colón, Luke Metz, Lütfi Kerem Senel, Maarten Bosma, Maarten Sap, Maartje ter Hoeve, Maheen Farooqi, Manaal Faruqui, Mantas Mazeika, Marco Baturan, Marco Marelli, Marco Maru, María José Ramírez-Quintana, Marie Tolkiehn, Mario Giulianelli, Martha Lewis, Martin Potthast, Matthew L. Leavitt, Matthias Hagen, Mátyás Schubert, Medina Baitemirova, Melody Arnaud, Melvin McElrath, Michael A. Yee, Michael Cohen, Michael Gu, Michael I. Ivanitskiy, Michael Starritt, Michael Strube, Michal Swedrowski, Michele Bevilacqua, Michihiro Yasunaga, Mihir Kale, Mike Cain, Mimee Xu, Mirac Suzgun, Mitch Walker, Mo Tiwari, Mohit Bansal, Moin Aminnaseri, Mor Geva, Mozhdeh Gheini, Mukund Varma T., Nanyun Peng, Nathan A. Chi, Nayeon Lee, Neta Gur-Ari Krakover, Nicholas Cameron, Nicholas Roberts, Nick Doiron, Nicole Martinez, Nikita Nangia, Niklas Deckers, Niklas Muennighoff, Nitish Shirish Keskar, Niveditha Iyer, Noah Constant, Noah Fiedel, Nuan Wen, Oliver Zhang, Omar Agha, Omar Elbaghdadi, Omer Levy, Owain Evans, Pablo Antonio Moreno Casares, Parth Doshi, Pascale Fung, Paul Pu Liang, Paul Vicol, Pegah Alipoormolabashi, Peiyuan Liao, Percy Liang, Peter Chang, Peter Eckersley, Phu Mon Htut, Pinyu Hwang, Piotr Milkowski, Piyush Patil, Pouya Pezeshkpour, Priti Oli, Qiaozhu Mei, Qing Lyu, Qinlang Chen, Rabin Banjade, Rachel Etta Rudolph, Raefer Gabriel, Rahel Habacker, Ramon Risco, Raphaël Millière, Rhythm Garg, Richard Barnes, Rif A. 
Saurous, Riku Arakawa, Robbe Raymaekers, Robert Frank, Rohan Sikand, Roman Novak, Roman Sitelew, Ronan LeBras, Rosanne Liu, Rowan Jacobs, Rui Zhang, Ruslan Salakhutdinov, Ryan Chi, Ryan Lee, Ryan Stovall, Ryan Teehan, Rylan Yang, Sahib Singh, Saif M. Mohammad, Sajant Anand, Sam Dillavou, Sam Shleifer, Sam Wiseman, Samuel Gruetter, Samuel R. Bowman, Samuel S. Schoenholz, Sanghyun Han, Sanjeev Kwatra, Sarah A. Rous, Sarik Ghazarian, Sayan Ghosh, Sean Casey, Sebastian Bischoff, Sebastian Gehrmann, Sebastian Schuster, Sepideh Sadeghi, Shadi Hamdan, Sharon Zhou, Shashank Srivastava, Sherry Shi, Shikhar Singh, Shima Asaadi, Shixiang Shane Gu, Shubh Pachchigar, Shubham Toshniwal, Shyam Upadhyay, Shyamolima (Shammie) Debnath, Siamak Shakeri, Simon Thormeyer, Simone Melzi, Siva Reddy, Sneha Priscilla Makini, Soo-Hwan Lee, Spencer Torene, Sriharsha Hatwar, Stanislas Dehaene, Stefan Divic, Stefano Ermon, Stella Biderman, Stephanie Lin, Stephen Prasad, Steven T. Piantadosi, Stuart M. Shieber, Summer Misherghi, Svetlana Kiritchenko, Swaroop Mishra, Tal Linzen, Tal Schuster, Tao Li, Tao Yu, Tariq Ali, Tatsu Hashimoto, Te-Lin Wu, Théo Desbordes, Theodore Rothschild, Thomas Phan, Tianle Wang, Tiberius Nkinyili, Timo Schick, Timofei Kornev, Titus Tunduny, Tobias Gerstenberg, Trenton Chang, Trishala Neeraj, Tushar Khot, Tyler Shultz, Uri Shaham, Vedant Misra, Vera Demberg, Victoria Nyamai, Vikas Raunak, Vinay V. Ramasesh, Vinay Uday Prabhu, Vishakh Padmakumar, Vivek Srikumar, William Fedus, William Saunders, William Zhang, Wout Vossen, Xiang Ren, Xiaoyu Tong, Xinran Zhao, Xinyi Wu, Xudong Shen, Yadollah Yaghoobzadeh, Yair Lakretz, Yangqiu Song, Yasaman Bahri, Yejin Choi, Yichi Yang, Yiding Hao, Yifu Chen, Yonatan Belinkov, Yu Hou, Yufang Hou, Yuntao Bai, Zachary Seid, Zhuoye Zhao, Zijian Wang, Zijie J. Wang, Zirui Wang, Ziyi Wu:
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models. Trans. Mach. Learn. Res. 2023 (2023) - [c392]Hossein Rajaby Faghihi, Aliakbar Nafar, Chen Zheng, Roshanak Mirzaee, Yue Zhang, Andrzej Uszok, Alexander Wan, Tanawan Premsri, Dan Roth, Parisa Kordjamshidi:
GLUECons: A Generic Benchmark for Learning under Constraints. AAAI 2023: 9552-9561 - [c391]Yahan Yang, Soham Dan, Dan Roth, Insup Lee:
In and Out-of-Domain Text Adversarial Robustness via Label Smoothing. ACL (2) 2023: 657-669 - [c390]Xingyu Fu, Sheng Zhang, Gukyeong Kwon, Pramuditha Perera, Henghui Zhu, Yuhao Zhang, Alexander Hanbo Li, William Yang Wang, Zhiguo Wang, Vittorio Castelli, Patrick Ng, Dan Roth, Bing Xiang:
Generate then Select: Open-ended Visual Question Answering Guided by World Knowledge. ACL (Findings) 2023: 2333-2346 - [c389]Rujun Han, Peng Qi, Yuhao Zhang, Lan Liu, Juliette Burger, William Yang Wang, Zhiheng Huang, Bing Xiang, Dan Roth:
RobustQA: Benchmarking the Robustness of Domain Adaptation for Open-Domain Question Answering. ACL (Findings) 2023: 4294-4311 - [c388]Sihao Chen, Senaka Buthpitiya, Alex Fabrikant, Dan Roth, Tal Schuster:
PropSegmEnt: A Large-Scale Corpus for Proposition-Level Segmentation and Entailment Recognition. ACL (Findings) 2023: 8874-8893 - [c387]Tyler A. Chang, Kishaloy Halder, Neha Anna John, Yogarshi Vyas, Yassine Benajiba, Miguel Ballesteros, Dan Roth:
Characterizing and Measuring Linguistic Dataset Drift. ACL (1) 2023: 8953-8967 - [c386]Hritik Bansal, Karthik Gopalakrishnan, Saket Dingliwal, Sravan Bodapati, Katrin Kirchhoff, Dan Roth:
Rethinking the Role of Scale for In-Context Learning: An Interpretability-based Case Study at 66 Billion Scale. ACL (1) 2023: 11833-11856 - [c385]Yu Feng, Ben Zhou, Haoyu Wang, Helen Jin, Dan Roth:
Generic Temporal Reasoning with Differential Analysis and Explanation. ACL (1) 2023: 12013-12029 - [c384]Shiqi Wang, Zheng Li, Haifeng Qian, Chenghao Yang, Zijian Wang, Mingyue Shang, Varun Kumar, Samson Tan, Baishakhi Ray, Parminder Bhatia, Ramesh Nallapati, Murali Krishna Ramanathan, Dan Roth, Bing Xiang:
ReCode: Robustness Evaluation of Code Generation Models. ACL (1) 2023: 13818-13843 - [c383]Alexander Hanbo Li, Mingyue Shang, Evangelia Spiliopoulou, Jie Ma, Patrick Ng, Zhiguo Wang, Bonan Min, William Yang Wang, Kathleen R. McKeown, Vittorio Castelli, Dan Roth, Bing Xiang:
Few-Shot Data-to-Text Generation via Unified Representation and Multi-Source Learning. ACL (1) 2023: 16171-16189 - [c382]Wenpeng Yin, Muhao Chen, Ben Zhou, Qiang Ning, Kai-Wei Chang, Dan Roth:
Indirectly Supervised Natural Language Processing. ACL (tutorial) 2023: 32-40 - [c381]Hongming Zhang, Yintong Huo, Yanai Elazar, Yangqiu Song, Yoav Goldberg, Dan Roth:
CIKQA: Learning Commonsense Inference with a Unified Knowledge-in-the-loop QA Paradigm. EACL (Findings) 2023: 114-124 - [c380]Haoyu Wang, Hongming Zhang, Yuqian Deng, Jacob R. Gardner, Dan Roth, Muhao Chen:
Extracting or Guessing? Improving Faithfulness of Event Temporal Relation Extraction. EACL 2023: 541-553 - [c379]Daniel Deutsch, Dan Roth:
Incorporating Question Answering-Based Signals into Abstractive Summarization via Salient Span Selection. EACL 2023: 575-588 - [c378]Rotem Dror, Haoyu Wang, Dan Roth:
Zero-Shot On-the-Fly Event Schema Induction. EACL (Findings) 2023: 693-713 - [c377]Xiaodong Yu, Wenpeng Yin, Nitish Gupta, Dan Roth:
Event Linking: Grounding Event Mentions to Wikipedia. EACL 2023: 2671-2680 - [c376]Yahan Yang, Elior Sulem, Insup Lee, Dan Roth:
Bootstrapping Small & High Performance Language Models with Unmasking-Removal Training Policy. EMNLP 2023: 457-464 - [c375]Haoyu Wang, Hongming Zhang, Yueguan Wang, Yuqian Deng, Muhao Chen, Dan Roth:
Are All Steps Equally Important? Benchmarking Essentiality Detection in Event Processes. EMNLP 2023: 4048-4056 - [c374]Nishanth Sridhar Nakshatri, Siyi Liu, Sihao Chen, Dan Roth, Dan Goldwasser, Daniel Hopkins:
Using LLM for Improving Key Event Discovery: Temporal-Guided News Stream Clustering with Event Summaries. EMNLP (Findings) 2023: 4162-4173 - [c373]Karthikeyan K, Yogarshi Vyas, Jie Ma, Giovanni Paolini, Neha Anna John, Shuai Wang, Yassine Benajiba, Vittorio Castelli, Dan Roth, Miguel Ballesteros:
Taxonomy Expansion for Named Entity Recognition. EMNLP 2023: 6895-6906 - [c372]Sharon Levy, Neha Anna John, Ling Liu, Yogarshi Vyas, Jie Ma, Yoshinari Fujinuma, Miguel Ballesteros, Vittorio Castelli, Dan Roth:
Comparing Biases and the Impact of Multilingual Training across Multiple Languages. EMNLP 2023: 10260-10280 - [c371]Danilo Neves Ribeiro, Shen Wang, Xiaofei Ma, Henghui Zhu, Rui Dong, Deguang Kong, Juliette Burger, Anjelica Ramos, Zhiheng Huang, William Yang Wang, George Karypis, Bing Xiang, Dan Roth:
STREET: A Multi-Task Structured Reasoning and Explanation Benchmark. ICLR 2023 - [c370]Kaifu Wang, Hangfeng He, Tin D. Nguyen, Piyush Kumar, Dan Roth:
On Regularization and Inference with Label Constraints. ICML 2023: 35740-35762 - [c369]Shamik Roy, Raphael Shu, Nikolaos Pappas, Elman Mansimov, Yi Zhang, Saab Mansour, Dan Roth:
Conversation Style Transfer using Few-Shot Learning. IJCNLP (1) 2023: 119-143 - [c368]Vinayshekhar Bannihatti Kumar, Rashmi Gangadharaiah, Dan Roth:
Privacy Adhering Machine Un-learning in NLP. IJCNLP (Findings) 2023: 268-277 - [c367]Yangruibo Ding, Zijian Wang, Wasi Uddin Ahmad, Hantian Ding, Ming Tan, Nihal Jain, Murali Krishna Ramanathan, Ramesh Nallapati, Parminder Bhatia, Dan Roth, Bing Xiang:
CrossCodeEval: A Diverse and Multilingual Benchmark for Cross-File Code Completion. NeurIPS 2023 - [c366]Kaifu Wang, Efthymia Tsamoura, Dan Roth:
On Learning Latent Models with Multi-Instance Weak Supervision. NeurIPS 2023 - [c365]Iker García-Ferrero, Jon Ander Campos, Oscar Sainz, Ander Salaberria, Dan Roth:
IXA/Cogcomp at SemEval-2023 Task 2: Context-enriched Multilingual Named Entity Recognition Using Knowledge Bases. SemEval@ACL 2023: 1335-1346 - [c364]Siddharth Varia, Shuai Wang, Kishaloy Halder, Robert Vacareanu, Miguel Ballesteros, Yassine Benajiba, Neha Anna John, Rishita Anubhai, Smaranda Muresan, Dan Roth:
Instruction Tuning for Few-Shot Aspect-Based Sentiment Analysis. WASSA@ACL 2023: 19-27 - [i183]Hangfeng He, Hongming Zhang, Dan Roth:
Rethinking with Retrieval: Faithful Large Language Model Inference. CoRR abs/2301.00303 (2023) - [i182]Danilo Neves Ribeiro, Shen Wang, Xiaofei Ma, Henry Zhu, Rui Dong, Deguang Kong, Juliette Burger, Anjelica Ramos, William Yang Wang, Zhiheng Huang, George Karypis, Bing Xiang, Dan Roth:
STREET: A Multi-Task Structured Reasoning and Explanation Benchmark. CoRR abs/2302.06729 (2023) - [i181]Shamik Roy, Raphael Shu, Nikolaos Pappas, Elman Mansimov, Yi Zhang, Saab Mansour, Dan Roth:
Conversation Style Transfer using Few-Shot Learning. CoRR abs/2302.08362 (2023) - [i180]Hossein Rajaby Faghihi, Aliakbar Nafar, Chen Zheng, Roshanak Mirzaee, Yue Zhang, Andrzej Uszok, Alexander Wan, Tanawan Premsri, Dan Roth, Parisa Kordjamshidi:
GLUECons: A Generic Benchmark for Learning Under Constraints. CoRR abs/2302.10914 (2023) - [i179]Sihao Chen, William Bruno, Dan Roth:
Towards Corpus-Scale Discovery of Selection Biases in News Coverage: Comparing What Sources Say About Entities as a Start. CoRR abs/2304.03414 (2023) - [i178]Iker García-Ferrero, Jon Ander Campos, Oscar Sainz, Ander Salaberria, Dan Roth:
IXA/Cogcomp at SemEval-2023 Task 2: Context-enriched Multilingual Named Entity Recognition using Knowledge Bases. CoRR abs/2304.10637 (2023) - [i177]Sharon Levy, Neha Anna John, Ling Liu, Yogarshi Vyas, Jie Ma, Yoshinari Fujinuma, Miguel Ballesteros, Vittorio Castelli, Dan Roth:
Comparing Biases and the Impact of Multilingual Training across Multiple Languages. CoRR abs/2305.11242 (2023) - [i176]Siyi Liu, Hongming Zhang, Hongwei Wang, Kaiqiang Song, Dan Roth, Dong Yu:
Open-Domain Event Graph Induction for Mitigating Framing Bias. CoRR abs/2305.12835 (2023) - [i175]Karthikeyan K, Yogarshi Vyas, Jie Ma, Giovanni Paolini, Neha Anna John, Shuai Wang, Yassine Benajiba, Vittorio Castelli, Dan Roth, Miguel Ballesteros:
Taxonomy Expansion for Named Entity Recognition. CoRR abs/2305.13191 (2023) - [i174]Xingyu Fu, Ben Zhou, Sihao Chen, Mark Yatskar, Dan Roth:
Interpretable by Design Visual Question Answering. CoRR abs/2305.14882 (2023) - [i173]Tyler A. Chang, Kishaloy Halder, Neha Anna John, Yogarshi Vyas, Yassine Benajiba, Miguel Ballesteros, Dan Roth:
Characterizing and Measuring Linguistic Dataset Drift. CoRR abs/2305.17127 (2023) - [i172]Xingyu Fu, Sheng Zhang, Gukyeong Kwon, Pramuditha Perera, Henghui Zhu, Yuhao Zhang, Alexander Hanbo Li, William Yang Wang, Zhiguo Wang, Vittorio Castelli, Patrick Ng, Dan Roth, Bing Xiang:
Generate then Select: Open-ended Visual Question Answering Guided by World Knowledge. CoRR abs/2305.18842 (2023) - [i171]Hantian Ding, Varun Kumar, Yuchen Tian, Zijian Wang, Rob Kwiatkowski, Xiaopeng Li, Murali Krishna Ramanathan, Baishakhi Ray, Parminder Bhatia, Sudipta Sengupta, Dan Roth, Bing Xiang:
A Static Evaluation of Code Completion by Large Language Models. CoRR abs/2306.03203 (2023)