default search action
Graham Neubig
Person information
- affiliation: Carnegie Mellon University, USA
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2024
- [j51]Vijay Viswanathan, Kiril Gashteovski, Carolin Lawrence, Tongshuang Wu, Graham Neubig:
Large Language Models Enable Few-Shot Clustering. Trans. Assoc. Comput. Linguistics 12: 321-333 (2024) - [j50]Atharva Kulkarni, Lucio M. Dery, Amrith Setlur, Aditi Raghunathan, Ameet Talwalkar, Graham Neubig:
Multitask Learning Can Improve Worst-Group Outcomes. Trans. Mach. Learn. Res. 2024 (2024) - [c391]Taiqi He, Kwanghee Choi, Lindia Tjuatja, Nathaniel Robinson, Jiatong Shi, Shinji Watanabe, Graham Neubig, David R. Mortensen, Lori S. Levin:
Wav2Gloss: Generating Interlinear Glossed Text from Speech. ACL (1) 2024: 568-582 - [c390]Jing Yu Koh, Robert Lo, Lawrence Jang, Vikram Duvvur, Ming Chong Lim, Po-Yu Huang, Graham Neubig, Shuyan Zhou, Russ Salakhutdinov, Daniel Fried:
VisualWebArena: Evaluating Multimodal Agents on Realistic Visual Web Tasks. ACL (1) 2024: 881-905 - [c389]Zhengbao Jiang, Zhiqing Sun, Weijia Shi, Pedro Rodríguez, Chunting Zhou, Graham Neubig, Xi Victoria Lin, Wen-tau Yih, Srini Iyer:
Instruction-tuned Language Models are Better Knowledge Learners. ACL (1) 2024: 5421-5434 - [c388]Saumya Gandhi, Ritu Gala, Vijay Viswanathan, Tongshuang Wu, Graham Neubig:
Better Synthetic Data by Retrieving and Transforming Existing Datasets. ACL (Findings) 2024: 6453-6466 - [c387]Ruiyi Wang, Haofei Yu, Wenxin Sharon Zhang, Zhengyang Qi, Maarten Sap, Yonatan Bisk, Graham Neubig, Hao Zhu:
SOTOPIA-π: Interactive Learning of Socially Intelligent Language Agents. ACL (1) 2024: 12912-12940 - [c386]Masahiro Kaneko, Graham Neubig, Naoaki Okazaki:
Solving NLP Problems through Human-System Collaboration: A Discussion-based Approach. EACL (Findings) 2024: 1644-1658 - [c385]Alexander Shypula, Aman Madaan, Yimeng Zeng, Uri Alon, Jacob R. Gardner, Yiming Yang, Milad Hashemi, Graham Neubig, Parthasarathy Ranganathan, Osbert Bastani, Amir Yazdanbakhsh:
Learning Performance-Improving Code Edits. ICLR 2024 - [c384]Xuhui Zhou, Hao Zhu, Leena Mathur, Ruohong Zhang, Haofei Yu, Zhengyang Qi, Louis-Philippe Morency, Yonatan Bisk, Daniel Fried, Graham Neubig, Maarten Sap:
SOTOPIA: Interactive Evaluation for Social Intelligence in Language Agents. ICLR 2024 - [c383]Shuyan Zhou, Frank F. Xu, Hao Zhu, Xuhui Zhou, Robert Lo, Abishek Sridhar, Xianyi Cheng, Tianyue Ou, Yonatan Bisk, Daniel Fried, Uri Alon, Graham Neubig:
WebArena: A Realistic Web Environment for Building Autonomous Agents. ICLR 2024 - [c382]Zhiruo Wang, Graham Neubig, Daniel Fried:
TroVE: Inducing Verifiable and Efficient Toolboxes for Solving Programmatic Tasks. ICML 2024 - [c381]Anubha Kabra, Sanketh Rangreji, Yash Mathur, Aman Madaan, Emmy Liu, Graham Neubig:
Program-Aided Reasoners (Better) Know What They Know. NAACL-HLT 2024: 2262-2278 - [c380]Simran Khanuja, Srinivas Gowriraj, Lucio M. Dery, Graham Neubig:
DeMuX: Data-efficient Multilingual Learning. NAACL-HLT 2024: 7423-7436 - [i300]Abhika Mishra, Akari Asai, Vidhisha Balachandran, Yizhong Wang, Graham Neubig, Yulia Tsvetkov, Hannaneh Hajishirzi:
Fine-grained Hallucination Detection and Editing for Language Models. CoRR abs/2401.06855 (2024) - [i299]Zhiruo Wang, Daniel Fried, Graham Neubig:
TroVE: Inducing Verifiable and Efficient Toolboxes for Solving Programmatic Tasks. CoRR abs/2401.12869 (2024) - [i298]Jing Yu Koh, Robert Lo, Lawrence Jang, Vikram Duvvur, Ming Chong Lim, Po-Yu Huang, Graham Neubig, Shuyan Zhou, Ruslan Salakhutdinov, Daniel Fried:
VisualWebArena: Evaluating Multimodal Agents on Realistic Visual Web Tasks. CoRR abs/2401.13649 (2024) - [i297]Steffi Chern, Ethan Chern, Graham Neubig, Pengfei Liu:
Can Large Language Models be Trusted for Evaluation? Scalable Meta-Evaluation of LLMs as Evaluators via Agent Debate. CoRR abs/2401.16788 (2024) - [i296]Lucio M. Dery, Steven Kolawole, Jean-François Kagey, Virginia Smith, Graham Neubig, Ameet Talwalkar:
Everybody Prune Now: Structured Pruning of LLMs with only Forward Passes. CoRR abs/2402.05406 (2024) - [i295]Zhengbao Jiang, Zhiqing Sun, Weijia Shi, Pedro Rodriguez, Chunting Zhou, Graham Neubig, Xi Victoria Lin, Wen-tau Yih, Srinivasan Iyer:
Instruction-tuned Language Models are Better Knowledge Learners. CoRR abs/2402.12847 (2024) - [i294]Jacob Mitchell Springer, Suhas Kotha, Daniel Fried, Graham Neubig, Aditi Raghunathan:
Repetition Improves Language Model Embeddings. CoRR abs/2402.15449 (2024) - [i293]Yueqi Song, Simran Khanuja, Graham Neubig:
What Is Missing in Multilingual Visual Reasoning and How to Fix It. CoRR abs/2403.01404 (2024) - [i292]Michael Ginn, Lindia Tjuatja, Taiqi He, Enora Rice, Graham Neubig, Alexis Palmer, Lori S. Levin:
GlossLM: Multilingual Pretraining for Low-Resource Interlinear Glossing. CoRR abs/2403.06399 (2024) - [i291]Ruiyi Wang, Haofei Yu, Wenxin Sharon Zhang, Zhengyang Qi, Maarten Sap, Graham Neubig, Yonatan Bisk, Hao Zhu:
SOTOPIA-π: Interactive Learning of Socially Intelligent Language Agents. CoRR abs/2403.08715 (2024) - [i290]Jennifer Hsia, Afreen Shaikh, Zhiruo Wang, Graham Neubig:
RAGGED: Towards Informed Design of Retrieval Augmented Generation Systems. CoRR abs/2403.09040 (2024) - [i289]Taiqi He, Kwanghee Choi, Lindia Tjuatja, Nathaniel R. Robinson, Jiatong Shi, Shinji Watanabe, Graham Neubig, David R. Mortensen, Lori S. Levin:
Wav2Gloss: Generating Interlinear Glossed Text from Speech. CoRR abs/2403.13169 (2024) - [i288]Zhiruo Wang, Zhoujun Cheng, Hao Zhu, Daniel Fried, Graham Neubig:
What Are Tools Anyway? A Survey from the Language Model Perspective. CoRR abs/2403.15452 (2024) - [i287]Simran Khanuja, Sathyanarayanan Ramamoorthy, Yueqi Song, Graham Neubig:
An image speaks a thousand words, but can everyone listen? On translating images for cultural relevance. CoRR abs/2404.01247 (2024) - [i286]Zhiqiu Lin, Deepak Pathak, Baiqi Li, Jiayao Li, Xide Xia, Graham Neubig, Pengchuan Zhang, Deva Ramanan:
Evaluating Text-to-Visual Generation with Image-to-Text Generation. CoRR abs/2404.01291 (2024) - [i285]Zaid Sheikh, Antonios Anastasopoulos, Shruti Rijhwani, Lindia Tjuatja, Robbie Jimerson, Graham Neubig:
CMULAB: An Open-Source Framework for Training and Deployment of Natural Language Processing Models. CoRR abs/2404.02408 (2024) - [i284]Emmy Liu, Graham Neubig, Jacob Andreas:
An Incomplete Loop: Deductive, Inductive, and Abductive Learning in Large Language Models. CoRR abs/2404.03028 (2024) - [i283]Junpeng Liu, Yifan Song, Bill Yuchen Lin, Wai Lam, Graham Neubig, Yuanzhi Li, Xiang Yue:
VisualWebBench: How Far Have Multimodal LLMs Evolved in Web Page Understanding and Grounding? CoRR abs/2404.05955 (2024) - [i282]Saumya Gandhi, Ritu Gala, Vijay Viswanathan, Tongshuang Wu, Graham Neubig:
Better Synthetic Data by Retrieving and Transforming Existing Datasets. CoRR abs/2404.14361 (2024) - [i281]Amanda Bertsch, Maor Ivgi, Uri Alon, Jonathan Berant, Matthew R. Gormley, Graham Neubig:
In-Context Learning with Long-Context Models: An In-Depth Exploration. CoRR abs/2405.00200 (2024) - [i280]Seungone Kim, Juyoung Suk, Shayne Longpre, Bill Yuchen Lin, Jamin Shin, Sean Welleck, Graham Neubig, Moontae Lee, Kyungjae Lee, Minjoon Seo:
Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models. CoRR abs/2405.01535 (2024) - [i279]Seungone Kim, Juyoung Suk, Ji Yong Cho, Shayne Longpre, Chaeeun Kim, Dongkeun Yoon, Guijin Son, Yejin Choi, Sheikh Shafayat, Jinheon Baek, Sue Hyun Park, Hyeonbin Hwang, Jinkyung Jo, Hyowon Cho, Haebin Shin, Seongyun Lee, Hanseok Oh, Noah Lee, Namgyu Ho, Se June Joo, Miyoung Ko, Yoonjoo Lee, Hyungjoo Chae, Jamin Shin, Joel Jang, Seonghyeon Ye, Bill Yuchen Lin, Sean Welleck, Graham Neubig, Moontae Lee, Kyungjae Lee, Minjoon Seo:
The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models. CoRR abs/2406.05761 (2024) - [i278]Jinjie Ni, Fuzhao Xue, Xiang Yue, Yuntian Deng, Mahir Shah, Kabir Jain, Graham Neubig, Yang You:
MixEval: Deriving Wisdom of the Crowd from LLM Benchmark Mixtures. CoRR abs/2406.06565 (2024) - [i277]Belinda Z. Li, Emmy Liu, Alexis Ross, Abbas Zeitoun, Graham Neubig, Jacob Andreas:
Language Modeling with Editable External Knowledge. CoRR abs/2406.11830 (2024) - [i276]Baiqi Li, Zhiqiu Lin, Deepak Pathak, Jiayao Li, Yixin Fei, Kewen Wu, Tiffany Ling, Xide Xia, Pengchuan Zhang, Graham Neubig, Deva Ramanan:
GenAI-Bench: Evaluating and Improving Compositional Text-to-Visual Generation. CoRR abs/2406.13743 (2024) - [i275]Zora Zhiruo Wang, Akari Asai, Xinyan Velocity Yu, Frank F. Xu, Yiqing Xie, Graham Neubig, Daniel Fried:
CodeRAG-Bench: Can Retrieval Augment Code Generation? CoRR abs/2406.14497 (2024) - [i274]Sean Welleck, Amanda Bertsch, Matthew Finlayson, Hailey Schoelkopf, Alex Xie, Graham Neubig, Ilia Kulikov, Zaïd Harchaoui:
From Decoding to Meta-Generation: Inference-time Algorithms for Large Language Models. CoRR abs/2406.16838 (2024) - [i273]Ian Wu, Sravan Jayanthi, Vijay Viswanathan, Simon Rosenberg, Sina Pakazad, Tongshuang Wu, Graham Neubig:
Synthetic Multimodal Question Generation. CoRR abs/2407.02233 (2024) - [i272]Jiaxin Ge, Xueying Jia, Vijay Viswanathan, Hongyin Luo, Graham Neubig:
Training Task Experts through Retrieval Based Distillation. CoRR abs/2407.05463 (2024) - [i271]Yuwei Fang, Willi Menapace, Aliaksandr Siarohin, Tsai-Shien Chen, Kuan-Chien Wang, Ivan Skorokhodov, Graham Neubig, Sergey Tulyakov:
VIMI: Grounding Video Generation through Multi-modal Instruction. CoRR abs/2407.06304 (2024) - [i270]Chenyang Zhao, Xueying Jia, Vijay Viswanathan, Tongshuang Wu, Graham Neubig:
SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. CoRR abs/2407.12874 (2024) - [i269]Xingyao Wang, Boxuan Li, Yufan Song, Frank F. Xu, Xiangru Tang, Mingchen Zhuge, Jiayi Pan, Yueqi Song, Bowen Li, Jaskirat Singh, Hoang H. Tran, Fuqiang Li, Ren Ma, Mingzhang Zheng, Bill Qian, Yanjun Shao, Niklas Muennighoff, Yizhe Zhang, Binyuan Hui, Junyang Lin, Robert Brennan, Hao Peng, Heng Ji, Graham Neubig:
OpenDevin: An Open Platform for AI Software Developers as Generalist Agents. CoRR abs/2407.16741 (2024) - [i268]Xi Xu, Siqi Ouyang, Brian Yan, Patrick Fernandes, William Chen, Lei Li, Graham Neubig, Shinji Watanabe:
CMU's IWSLT 2024 Simultaneous Speech Translation System. CoRR abs/2408.07452 (2024) - [i267]Xiang Yue, Tianyu Zheng, Yuansheng Ni, Yubo Wang, Kai Zhang, Shengbang Tong, Yuxuan Sun, Botao Yu, Ge Zhang, Huan Sun, Yu Su, Wenhu Chen, Graham Neubig:
MMMU-Pro: A More Robust Multi-discipline Multimodal Understanding Benchmark. CoRR abs/2409.02813 (2024) - 2023
- [j49]Pengfei Liu, Weizhe Yuan, Jinlan Fu, Zhengbao Jiang, Hiroaki Hayashi, Graham Neubig:
Pre-train, Prompt, and Predict: A Systematic Survey of Prompting Methods in Natural Language Processing. ACM Comput. Surv. 55(9): 195:1-195:35 (2023) - [j48]Patrick Fernandes, Aman Madaan, Emmy Liu, António Farinhas, Pedro Henrique Martins, Amanda Bertsch, José G. C. de Souza, Shuyan Zhou, Tongshuang Wu, Graham Neubig, André F. T. Martins:
Bridging the Gap: A Survey on Integrating (Human) Feedback for Natural Language Generation. Trans. Assoc. Comput. Linguistics 11: 1643-1668 (2023) - [j47]Luke Dramko, Jeremy Lacomis, Pengcheng Yin, Edward J. Schwartz, Miltiadis Allamanis, Graham Neubig, Bogdan Vasilescu, Claire Le Goues:
DIRE and its Data: Neural Decompiled Variable Renamings with Respect to Software Class. ACM Trans. Softw. Eng. Methodol. 32(2): 39:1-39:34 (2023) - [c379]Patrick Fernandes, Kayo Yin, Emmy Liu, André Martins, Graham Neubig:
When Does Translation Require Context? A Data-driven, Multilingual Exploration. ACL (1) 2023: 606-626 - [c378]Anubha Kabra, Emmy Liu, Simran Khanuja, Alham Fikri Aji, Genta Indra Winata, Samuel Cahyawijaya, Aremu Anuoluwapo, Perez Ogayo, Graham Neubig:
Multi-lingual and Multi-cultural Figurative Language Understanding. ACL (Findings) 2023: 8269-8284 - [c377]Sameer Jain, Vaishakh Keshava, Swarnashree Mysore Sathyendra, Patrick Fernandes, Pengfei Liu, Graham Neubig, Chunting Zhou:
Multi-Dimensional Evaluation of Text Summarization with In-Context Learning. ACL (Findings) 2023: 8487-8495 - [c376]Vijay Viswanathan, Luyu Gao, Tongshuang Wu, Pengfei Liu, Graham Neubig:
DataFinder: Scientific Dataset Recommendation from Natural Language Descriptions. ACL (1) 2023: 10288-10303 - [c375]John Wieting, Jonathan H. Clark, William W. Cohen, Graham Neubig, Taylor Berg-Kirkpatrick:
Beyond Contrastive Learning: A Variational Generative Model for Multilingual Retrieval. ACL (1) 2023: 12044-12066 - [c374]Samuel Cahyawijaya, Holy Lovenia, Alham Fikri Aji, Genta Indra Winata, Bryan Wilie, Fajri Koto, Rahmad Mahendra, Christian Wibisono, Ade Romadhony, Karissa Vincentio, Jennifer Santoso, David Moeljadi, Cahya Wirawan, Frederikus Hudi, Muhammad Satrio Wicaksono, Ivan Halim Parmonangan, Ika Alfina, Ilham Firdausi Putra, Samsul Rahmadani, Yulianti Oenang, Ali Akbar Septiandri, James Jaya, Kaustubh D. Dhole, Arie Ardiyanti Suryani, Rifki Afina Putri, Dan Su, Keith Stevens, Made Nindyatama Nityasya, Muhammad Farid Adilazuarda, Ryan Hadiwijaya, Ryandito Diandaru, Tiezheng Yu, Vito Ghifari, Wenliang Dai, Yan Xu, Dyah Damapuspita, Haryo Akbarianto Wibowo, Cuk Tho, Ichwanul Muslim Karo Karo, Tirana Fatyanosa, Ziwei Ji, Graham Neubig, Timothy Baldwin, Sebastian Ruder, Pascale Fung, Herry Sujaini, Sakriani Sakti, Ayu Purwarianti:
NusaCrowd: Open Source Initiative for Indonesian NLP Resources. ACL (Findings) 2023: 13745-13818 - [c373]Hao Zhu, Raghav Kapoor, So Yeon Min, Winson Han, Jiatai Li, Kaiwen Geng, Graham Neubig, Yonatan Bisk, Aniruddha Kembhavi, Luca Weihs:
EXCALIBUR: Encouraging and Evaluating Embodied Exploration. CVPR 2023: 14931-14942 - [c372]Zhiruo Wang, Grace Cuenca, Shuyan Zhou, Frank F. Xu, Graham Neubig:
MCoNaLa: A Benchmark for Code Generation from Multiple Natural Languages. EACL (Findings) 2023: 265-273 - [c371]Brian Yan, Siddharth Dalmia, Yosuke Higuchi, Graham Neubig, Florian Metze, Alan W. Black, Shinji Watanabe:
CTC Alignments Improve Autoregressive Translation. EACL 2023: 1615-1631 - [c370]Jimin Sun, Patrick Fernandes, Xinyi Wang, Graham Neubig:
A Multi-dimensional Evaluation of Tokenizer-free Multilingual Pretrained Models. EACL (Findings) 2023: 1680-1690 - [c369]Vijay Viswanathan, Chenyang Zhao, Amanda Bertsch, Tongshuang Wu, Graham Neubig:
Prompt2Model: Generating Deployable Models from Natural Language Instructions. EMNLP (Demos) 2023: 413-421 - [c368]Zhiruo Wang, Shuyan Zhou, Daniel Fried, Graham Neubig:
Execution-Based Evaluation for Open-Domain Code Generation. EMNLP (Findings) 2023: 1271-1290 - [c367]Aditi Chaudhary, Arun Sampath, Ashwin Sheshadri, Antonios Anastasopoulos, Graham Neubig:
Teacher Perception of Automatically Extracted Grammar Concepts for L2 Language Learning. EMNLP (Findings) 2023: 3776-3793 - [c366]Zhengbao Jiang, Frank F. Xu, Luyu Gao, Zhiqing Sun, Qian Liu, Jane Dwivedi-Yu, Yiming Yang, Jamie Callan, Graham Neubig:
Active Retrieval Augmented Generation. EMNLP 2023: 7969-7992 - [c365]Shuyan Zhou, Uri Alon, Sumit Agarwal, Graham Neubig:
CodeBERTScore: Evaluating Code Generation with Pretrained Models of Code. EMNLP 2023: 13921-13937 - [c364]Yueqi Song, Simran Khanuja, Pengfei Liu, Fahim Faisal, Alissa Ostapenko, Genta Indra Winata, Alham Fikri Aji, Samuel Cahyawijaya, Yulia Tsvetkov, Antonios Anastasopoulos, Graham Neubig:
GlobalBench: A Benchmark for Global Progress in Natural Language Processing. EMNLP 2023: 14157-14171 - [c363]Emmy Liu, Aditi Chaudhary, Graham Neubig:
Crossing the Threshold: Idiomatic Machine Translation through Retrieval Augmentation and Loss Weighting. EMNLP 2023: 15095-15111 - [c362]Yiwei Qin, Weizhe Yuan, Graham Neubig, Pengfei Liu:
T5Score: Discriminative Fine-tuning of Generative Evaluation Metrics. EMNLP (Findings) 2023: 15185-15202 - [c361]Lucio M. Dery, Paul Michel, Mikhail Khodak, Graham Neubig, Ameet Talwalkar:
AANG : Automating Auxiliary Learning. ICLR 2023 - [c360]Andy Liu, Hao Zhu, Emmy Liu, Yonatan Bisk, Graham Neubig:
Computational Language Acquisition with Theory of Mind. ICLR 2023 - [c359]Xuezhe Ma, Chunting Zhou, Xiang Kong, Junxian He, Liangke Gui, Graham Neubig, Jonathan May, Luke Zettlemoyer:
Mega: Moving Average Equipped Gated Attention. ICLR 2023 - [c358]Machel Reid, Vincent Josua Hellendoorn, Graham Neubig:
DiffusER: Diffusion via Edit-based Reconstruction. ICLR 2023 - [c357]Shuyan Zhou, Uri Alon, Frank F. Xu, Zhengbao Jiang, Graham Neubig:
DocPrompting: Generating Code by Retrieving the Docs. ICLR 2023 - [c356]Luyu Gao, Aman Madaan, Shuyan Zhou, Uri Alon, Pengfei Liu, Yiming Yang, Jamie Callan, Graham Neubig:
PAL: Program-aided Language Models. ICML 2023: 10764-10799 - [c355]Junhong Shen, Liam Li, Lucio M. Dery, Corey Staten, Mikhail Khodak, Graham Neubig, Ameet Talwalkar:
Cross-Modal Fine-Tuning: Align then Refine. ICML 2023: 31030-31056 - [c354]Frank F. Xu, Uri Alon, Graham Neubig:
Why do Nearest Neighbor Language Models Work? ICML 2023: 38325-38341 - [c353]Amanda Bertsch, Uri Alon, Graham Neubig, Matthew R. Gormley:
Unlimiformer: Long-Range Transformers with Unlimited Length Input. NeurIPS 2023 - [c352]Taiqi He, Lindia Tjuatja, Nathaniel R. Robinson, Shinji Watanabe, David R. Mortensen, Graham Neubig, Lori S. Levin:
SigMoreFun Submission to the SIGMORPHON Shared Task on Interlinear Glossing. SIGMORPHON 2023: 209-216 - [c351]Lindia Tjuatja, Emmy Liu, Lori S. Levin, Graham Neubig:
Syntax and Semantics Meet in the "Middle": Probing the Syntax-Semantics Interface of LMs Through Agentivity. *SEM@ACL 2023: 149-164 - [c350]Nathaniel R. Robinson, Perez Ogayo, David R. Mortensen, Graham Neubig:
ChatGPT MT: Competitive for High- (but Not Low-) Resource Languages. WMT 2023: 392-418 - [c349]Patrick Fernandes, Daniel Deutsch, Mara Finkelstein, Parker Riley, André Martins, Graham Neubig, Ankush Garg, Jonathan H. Clark, Markus Freitag, Orhan Firat:
The Devil Is in the Errors: Leveraging Large Language Models for Fine-grained Machine Translation Evaluation. WMT 2023: 1066-1083 - [i266]Frank F. Xu, Uri Alon, Graham Neubig:
Why do Nearest Neighbor Language Models Work? CoRR abs/2301.02828 (2023) - [i265]Shuyan Zhou, Uri Alon, Sumit Agarwal, Graham Neubig:
CodeBERTScore: Evaluating Code Generation with Pretrained Models of Code. CoRR abs/2302.05527 (2023) - [i264]Junhong Shen, Liam Li, Lucio M. Dery, Corey Staten, Mikhail Khodak, Graham Neubig, Ameet Talwalkar:
Cross-Modal Fine-Tuning: Align then Refine. CoRR abs/2302.05738 (2023) - [i263]Aman Madaan, Alexander Shypula, Uri Alon, Milad Hashemi, Parthasarathy Ranganathan, Yiming Yang, Graham Neubig, Amir Yazdanbakhsh:
Learning Performance-Improving Code Edits. CoRR abs/2302.07867 (2023) - [i262]Shruti Rijhwani, Daisy Rosenblum, Michayla King, Antonios Anastasopoulos, Graham Neubig:
User-Centric Evaluation of OCR Systems for Kwak'wala. CoRR abs/2302.13410 (2023) - [i261]Andy Liu, Hao Zhu, Emmy Liu, Yonatan Bisk, Graham Neubig:
Computational Language Acquisition with Theory of Mind. CoRR abs/2303.01502 (2023) - [i260]Ivan Stelmakh, John Wieting, Graham Neubig, Nihar B. Shah:
A Gold Standard Dataset for the Reviewer Assignment Problem. CoRR abs/2303.16750 (2023) - [i259]Patrick Fernandes, Aman Madaan, Emmy Liu, António Farinhas, Pedro Henrique Martins, Amanda Bertsch, José G. C. de Souza, Shuyan Zhou, Tongshuang Wu, Graham Neubig, André F. T. Martins:
Bridging the Gap: A Survey on Integrating (Human) Feedback for Natural Language Generation. CoRR abs/2305.00955 (2023) - [i258]Amanda Bertsch, Uri Alon, Graham Neubig, Matthew R. Gormley:
Unlimiformer: Long-Range Transformers with Unlimited Length Input. CoRR abs/2305.01625 (2023) - [i257]Zhengbao Jiang, Frank F. Xu, Luyu Gao, Zhiqing Sun, Qian Liu, Jane Dwivedi-Yu, Yiming Yang, Jamie Callan, Graham Neubig:
Active Retrieval Augmented Generation. CoRR abs/2305.06983 (2023) - [i256]Masahiro Kaneko, Graham Neubig, Naoaki Okazaki:
Solving NLP Problems through Human-System Collaboration: A Discussion-based Approach. CoRR abs/2305.11789 (2023) - [i255]Yueqi Song, Catherine Cui, Simran Khanuja, Pengfei Liu, Fahim Faisal, Alissa Ostapenko, Genta Indra Winata, Alham Fikri Aji, Samuel Cahyawijaya, Yulia Tsvetkov, Antonios Anastasopoulos, Graham Neubig:
GlobalBench: A Benchmark for Global Progress in Natural Language Processing. CoRR abs/2305.14716 (2023) - [i254]Anubha Kabra, Emmy Liu, Simran Khanuja, Alham Fikri Aji, Genta Indra Winata, Samuel Cahyawijaya, Aremu Anuoluwapo, Perez Ogayo, Graham Neubig:
Multi-lingual and Multi-cultural Figurative Language Understanding. CoRR abs/2305.16171 (2023) - [i253]Vijay Viswanathan, Luyu Gao, Tongshuang Wu, Pengfei Liu, Graham Neubig:
DataFinder: Scientific Dataset Recommendation from Natural Language Descriptions. CoRR abs/2305.16636 (2023) - [i252]Lindia Tjuatja, Emmy Liu, Lori S. Levin, Graham Neubig:
Syntax and Semantics Meet in the "Middle": Probing the Syntax-Semantics Interface of LMs Through Agentivity. CoRR abs/2305.18185 (2023) - [i251]Sameer Jain, Vaishakh Keshava, Swarnashree Mysore Sathyendra, Patrick Fernandes, Pengfei Liu, Graham Neubig, Chunting Zhou:
Multi-Dimensional Evaluation of Text Summarization with In-Context Learning. CoRR abs/2306.01200 (2023) - [i250]Manuel Mager, Rajat Bhatnagar, Graham Neubig, Ngoc Thang Vu, Katharina Kann:
Neural Machine Translation for the Indigenous Languages of the Americas: An Introduction. CoRR abs/2306.06804 (2023) - [i249]Vijay Viswanathan, Kiril Gashteovski, Carolin Lawrence, Tongshuang Wu, Graham Neubig:
Large Language Models Enable Few-Shot Clustering. CoRR abs/2307.00524 (2023) - [i248]I-Chun Chern, Zhiruo Wang, Sanjan Das, Bhavuk Sharma, Pengfei Liu, Graham Neubig:
Improving Factuality of Abstractive Summarization via Contrastive Reward Learning. CoRR abs/2307.04507 (2023) - [i247]I-Chun Chern, Steffi Chern, Shiqi Chen, Weizhe Yuan, Kehua Feng, Chunting Zhou, Junxian He, Graham Neubig, Pengfei Liu:
FacTool: Factuality Detection in Generative AI - A Tool Augmented Framework for Multi-Task and Multi-Domain Scenarios. CoRR abs/2307.13528 (2023) - [i246]Shuyan Zhou, Frank F. Xu, Hao Zhu, Xuhui Zhou, Robert Lo, Abishek Sridhar, Xianyi Cheng, Yonatan Bisk, Daniel Fried, Uri Alon, Graham Neubig:
WebArena: A Realistic Web Environment for Building Autonomous Agents. CoRR abs/2307.13854 (2023) - [i245]Patrick Fernandes, Daniel Deutsch, Mara Finkelstein, Parker Riley, André F. T. Martins, Graham Neubig, Ankush Garg, Jonathan H. Clark, Markus Freitag, Orhan Firat:
The Devil is in the Errors: Leveraging Large Language Models for Fine-grained Machine Translation Evaluation. CoRR abs/2308.07286 (2023) - [i244]Vijay Viswanathan, Chenyang Zhao, Amanda Bertsch, Tongshuang Wu, Graham Neubig:
Prompt2Model: Generating Deployable Models from Natural Language Instructions. CoRR abs/2308.12261 (2023) - [i243]Nathaniel R. Robinson, Perez Ogayo, David R. Mortensen, Graham Neubig:
ChatGPT MT: Competitive for High- (but not Low-) Resource Languages. CoRR abs/2309.07423 (2023) - [i242]Amanda Bertsch, Alex Xie, Graham Neubig, Matthew R. Gormley:
It's MBR All the Way Down: Modern Generation Techniques Through the Lens of Minimum Bayes Risk. CoRR abs/2310.01387 (2023) - [i241]Emmy Liu, Aditi Chaudhary, Graham Neubig:
Crossing the Threshold: Idiomatic Machine Translation through Retrieval Augmentation and Loss Weighting. CoRR abs/2310.07081 (2023) - [i240]