default search action
Luca Soldaini
Person information
- affiliation: Amazon Alexa, CA, USA
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2024
- [j7]Kyle Lo, Joseph Chee Chang, Andrew Head, Jonathan Bragg, Amy X. Zhang, Cassidy Trier, Chloe Anastasiades, Tal August, Russell Authur, Danielle Bragg, Erin Bransom, Isabel Cachola, Stefan Candra, Yoganand Chandrasekhar, Yen-Sung Chen, Evie Yu-Yen Cheng, Yvonne Chou, Doug Downey, Rob Evans, Raymond Fok, Fangzhou Hu, Regan Huff, Dongyeop Kang, Tae Soo Kim, Rodney Kinney, Aniket Kittur, Hyeonsu B. Kang, Egor Klevak, Bailey Kuehl, Michael Langan, Matt Latzke, Jaron Lochner, Kelsey MacMillan, Eric Marsh, Tyler Murray, Aakanksha Naik, Ngoc-Uyen Nguyen, Srishti Palani, Soya Park, Caroline Paulic, Napol Rachatasumrit, Smita Rao, Paul Sayre, Zejiang Shen, Pao Siangliulue, Luca Soldaini, Huy Tran, Madeleine van Zuylen, Lucy Lu Wang, Chris Wilhelm, Caroline Wu, Jiangjiang Yang, Angele Zamarron, Marti A. Hearst, Daniel S. Weld:
The Semantic Reader Project. Commun. ACM 67(10): 50-61 (2024) - [j6]Raymond Fok, Luca Soldaini, Cassidy Trier, Erin Bransom, Kelsey MacMillan, Evie Yu-Yen Cheng, Hita Kambhamettu, Jonathan Bragg, Kyle Lo, Marti A. Hearst, Andrew Head, Daniel S. Weld:
Accelerating Scientific Paper Skimming with Augmented Intelligence Through Customizable Faceted Highlights. ACM Trans. Interact. Intell. Syst. 14(4): 27:1-27:30 (2024) - [c44]Li Lucy, Suchin Gururangan, Luca Soldaini, Emma Strubell, David Bamman, Lauren Klein, Jesse Dodge:
AboutMe: Using Self-Descriptions in Webpages to Document the Effects of English Pretraining Data Filters. ACL (1) 2024: 7393-7420 - [c43]Fangyuan Xu, Kyle Lo, Luca Soldaini, Bailey Kuehl, Eunsol Choi, David Wadden:
KIWI: A Dataset of Knowledge-Intensive Writing Instructions for Answering Research Questions. ACL (Findings) 2024: 12969-12990 - [c42]Luca Soldaini, Rodney Kinney, Akshita Bhagia, Dustin Schwenk, David Atkinson, Russell Authur, Ben Bogin, Khyathi Raghavi Chandu, Jennifer Dumas, Yanai Elazar, Valentin Hofmann, Ananya Harsh Jha, Sachin Kumar, Li Lucy, Xinxi Lyu, Nathan Lambert, Ian Magnusson, Jacob Morrison, Niklas Muennighoff, Aakanksha Naik, Crystal Nam, Matthew E. Peters, Abhilasha Ravichander, Kyle Richardson, Zejiang Shen, Emma Strubell, Nishant Subramani, Oyvind Tafjord, Pete Walsh, Luke Zettlemoyer, Noah A. Smith, Hannaneh Hajishirzi, Iz Beltagy, Dirk Groeneveld, Jesse Dodge, Kyle Lo:
Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research. ACL (1) 2024: 15725-15788 - [c41]Dirk Groeneveld, Iz Beltagy, Evan Pete Walsh, Akshita Bhagia, Rodney Kinney, Oyvind Tafjord, Ananya Harsh Jha, Hamish Ivison, Ian Magnusson, Yizhong Wang, Shane Arora, David Atkinson, Russell Authur, Khyathi Raghavi Chandu, Arman Cohan, Jennifer Dumas, Yanai Elazar, Yuling Gu, Jack Hessel, Tushar Khot, William Merrill, Jacob Morrison, Niklas Muennighoff, Aakanksha Naik, Crystal Nam, Matthew E. Peters, Valentina Pyatkin, Abhilasha Ravichander, Dustin Schwenk, Saurabh Shah, Will Smith, Emma Strubell, Nishant Subramani, Mitchell Wortsman, Pradeep Dasigi, Nathan Lambert, Kyle Richardson, Luke Zettlemoyer, Jesse Dodge, Kyle Lo, Luca Soldaini, Noah A. Smith, Hannaneh Hajishirzi:
OLMo: Accelerating the Science of Language Models. ACL (1) 2024: 15789-15809 - [c40]Orion Weller, Kyle Lo, David Wadden, Dawn J. Lawrie, Benjamin Van Durme, Arman Cohan, Luca Soldaini:
When do Generative Query and Document Expansions Fail? A Comprehensive Study Across Methods, Retrievers, and Datasets. EACL (Findings) 2024: 1987-2003 - [c39]Li Lucy, Tal August, Rose E. Wang, Luca Soldaini, Courtney Allison, Kyle Lo:
MathFish: Evaluating Language Model Math Reasoning via Grounding in Educational Curricula. EMNLP (Findings) 2024: 5644-5673 - [c38]Yanai Elazar, Akshita Bhagia, Ian Magnusson, Abhilasha Ravichander, Dustin Schwenk, Alane Suhr, Evan Pete Walsh, Dirk Groeneveld, Luca Soldaini, Sameer Singh, Hannaneh Hajishirzi, Noah A. Smith, Jesse Dodge:
What's In My Big Data? ICLR 2024 - [c37]James Mayfield, Eugene Yang, Dawn J. Lawrie, Sean MacAvaney, Paul McNamee, Douglas W. Oard, Luca Soldaini, Ian Soboroff, Orion Weller, Efsun Selin Kayi, Kate Sanders, Marc Mason, Noah Hibbler:
On the Evaluation of Machine-Generated Reports. SIGIR 2024: 1904-1915 - [i50]Li Lucy, Suchin Gururangan, Luca Soldaini, Emma Strubell, David Bamman, Lauren Klein, Jesse Dodge:
AboutMe: Using Self-Descriptions in Webpages to Document the Effects of English Pretraining Data Filters. CoRR abs/2401.06408 (2024) - [i49]Luca Soldaini, Rodney Kinney, Akshita Bhagia, Dustin Schwenk, David Atkinson, Russell Authur, Ben Bogin, Khyathi Raghavi Chandu, Jennifer Dumas, Yanai Elazar, Valentin Hofmann, Ananya Harsh Jha, Sachin Kumar, Li Lucy, Xinxi Lyu, Nathan Lambert, Ian Magnusson, Jacob Morrison, Niklas Muennighoff, Aakanksha Naik, Crystal Nam, Matthew E. Peters, Abhilasha Ravichander, Kyle Richardson, Zejiang Shen, Emma Strubell, Nishant Subramani, Oyvind Tafjord, Pete Walsh, Luke Zettlemoyer, Noah A. Smith, Hannaneh Hajishirzi, Iz Beltagy, Dirk Groeneveld, Jesse Dodge, Kyle Lo:
Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research. CoRR abs/2402.00159 (2024) - [i48]Dirk Groeneveld, Iz Beltagy, Pete Walsh, Akshita Bhagia, Rodney Kinney, Oyvind Tafjord, Ananya Harsh Jha, Hamish Ivison, Ian Magnusson, Yizhong Wang, Shane Arora, David Atkinson, Russell Authur, Khyathi Raghavi Chandu, Arman Cohan, Jennifer Dumas, Yanai Elazar, Yuling Gu, Jack Hessel, Tushar Khot, William Merrill, Jacob Morrison, Niklas Muennighoff, Aakanksha Naik, Crystal Nam, Matthew E. Peters, Valentina Pyatkin, Abhilasha Ravichander, Dustin Schwenk, Saurabh Shah, Will Smith, Emma Strubell, Nishant Subramani, Mitchell Wortsman, Pradeep Dasigi, Nathan Lambert, Kyle Richardson, Luke Zettlemoyer, Jesse Dodge, Kyle Lo, Luca Soldaini, Noah A. Smith, Hannaneh Hajishirzi:
OLMo: Accelerating the Science of Language Models. CoRR abs/2402.00838 (2024) - [i47]Fangyuan Xu, Kyle Lo, Luca Soldaini, Bailey Kuehl, Eunsol Choi, David Wadden:
KIWI: A Dataset of Knowledge-Intensive Writing Instructions for Answering Research Questions. CoRR abs/2403.03866 (2024) - [i46]Orion Weller, Benjamin Chang, Sean MacAvaney, Kyle Lo, Arman Cohan, Benjamin Van Durme, Dawn J. Lawrie, Luca Soldaini:
FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions. CoRR abs/2403.15246 (2024) - [i45]Dawn J. Lawrie, Sean MacAvaney, James Mayfield, Paul McNamee, Douglas W. Oard, Luca Soldaini, Eugene Yang:
Overview of the TREC 2023 NeuCLIR Track. CoRR abs/2404.08071 (2024) - [i44]James Mayfield, Eugene Yang, Dawn J. Lawrie, Sean MacAvaney, Paul McNamee, Douglas W. Oard, Luca Soldaini, Ian Soboroff, Orion Weller, Efsun Selin Kayi, Kate Sanders, Marc Mason, Noah Hibbler:
On the Evaluation of Machine-Generated Reports. CoRR abs/2405.00982 (2024) - [i43]David Wadden, Kejian Shi, Jacob Morrison, Aakanksha Naik, Shruti Singh, Nitzan Barzilay, Kyle Lo, Tom Hope, Luca Soldaini, Shannon Zejiang Shen, Doug Downey, Hannaneh Hajishirzi, Arman Cohan:
SciRIFF: A Resource to Enhance Language Model Instruction-Following over Scientific Literature. CoRR abs/2406.07835 (2024) - [i42]Jeffrey Li, Alex Fang, Georgios Smyrnis, Maor Ivgi, Matt Jordan, Samir Yitzhak Gadre, Hritik Bansal, Etash Kumar Guha, Sedrick Keh, Kushal Arora, Saurabh Garg, Rui Xin, Niklas Muennighoff, Reinhard Heckel, Jean Mercat, Mayee F. Chen, Suchin Gururangan, Mitchell Wortsman, Alon Albalak, Yonatan Bitton, Marianna Nezhurina, Amro Abbas, Cheng-Yu Hsieh, Dhruba Ghosh, Josh Gardner, Maciej Kilian, Hanlin Zhang, Rulin Shao, Sarah M. Pratt, Sunny Sanyal, Gabriel Ilharco, Giannis Daras, Kalyani Marathe, Aaron Gokaslan, Jieyu Zhang, Khyathi Raghavi Chandu, Thao Nguyen, Igor Vasiljevic, Sham M. Kakade, Shuran Song, Sujay Sanghavi, Fartash Faghri, Sewoong Oh, Luke Zettlemoyer, Kyle Lo, Alaaeldin El-Nouby, Hadi Pouransari, Alexander Toshev, Stephanie Wang, Dirk Groeneveld, Luca Soldaini, Pang Wei Koh, Jenia Jitsev, Thomas Kollar, Alexandros G. Dimakis, Yair Carmon, Achal Dave, Ludwig Schmidt, Vaishaal Shankar:
DataComp-LM: In search of the next generation of training sets for language models. CoRR abs/2406.11794 (2024) - [i41]Shayne Longpre, Stella Biderman, Alon Albalak, Hailey Schoelkopf, Daniel McDuff, Sayash Kapoor, Kevin Klyman, Kyle Lo, Gabriel Ilharco, Nay San, Maribeth Rauh, Aviya Skowron, Bertie Vidgen, Laura Weidinger, Arvind Narayanan, Victor Sanh, David Ifeoluwa Adelani, Percy Liang, Rishi Bommasani, Peter Henderson, Sasha Luccioni, Yacine Jernite, Luca Soldaini:
The Responsible Foundation Model Development Cheatsheet: A Review of Tools & Resources. CoRR abs/2406.16746 (2024) - [i40]Nathan Lambert, Hailey Schoelkopf, Aaron Gokaslan, Luca Soldaini, Valentina Pyatkin, Louis Castricato:
Self-Directed Synthetic Dialogues and Revisions Technical Report. CoRR abs/2407.18421 (2024) - [i39]Li Lucy, Tal August, Rose E. Wang, Luca Soldaini, Courtney Allison, Kyle Lo:
Evaluating Language Model Math Reasoning via Grounding in Educational Curricula. CoRR abs/2408.04226 (2024) - [i38]Niklas Muennighoff, Luca Soldaini, Dirk Groeneveld, Kyle Lo, Jacob Morrison, Sewon Min, Weijia Shi, Pete Walsh, Oyvind Tafjord, Nathan Lambert, Yuling Gu, Shane Arora, Akshita Bhagia, Dustin Schwenk, David Wadden, Alexander Wettig, Binyuan Hui, Tim Dettmers, Douwe Kiela, Ali Farhadi, Noah A. Smith, Pang Wei Koh, Amanpreet Singh, Hannaneh Hajishirzi:
OLMoE: Open Mixture-of-Experts Language Models. CoRR abs/2409.02060 (2024) - [i37]Hyunji Lee, Luca Soldaini, Arman Cohan, Minjoon Seo, Kyle Lo:
RouterRetriever: Exploring the Benefits of Routing over Multiple Expert Embedding Models. CoRR abs/2409.02685 (2024) - [i36]Matt Deitke, Christopher Clark, Sangho Lee, Rohun Tripathi, Yue Yang, Jae Sung Park, Mohammadreza Salehi, Niklas Muennighoff, Kyle Lo, Luca Soldaini, Jiasen Lu, Taira Anderson, Erin Bransom, Kiana Ehsani, Huong Ngo, Yen-Sung Chen, Ajay Patel, Mark Yatskar, Chris Callison-Burch, Andrew Head, Rose Hendrix, Favyen Bastani, Eli VanderBilt, Nathan Lambert, Yvonne Chou, Arnavi Chheda, Jenna Sparks, Sam Skjonsberg, Michael Schmitz, Aaron Sarnat, Byron Bischoff, Pete Walsh, Chris Newell, Piper Wolters, Tanmay Gupta, Kuo-Hao Zeng, Jon Borchardt, Dirk Groeneveld, Jen Dumas, Crystal Nam, Sophie Lebrecht, Caitlin Wittlif, Carissa Schoenick, Oscar Michel, Ranjay Krishna, Luca Weihs, Noah A. Smith, Hannaneh Hajishirzi, Ross B. Girshick, Ali Farhadi, Aniruddha Kembhavi:
Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models. CoRR abs/2409.17146 (2024) - [i35]Akari Asai, Jacqueline He, Rulin Shao, Weijia Shi, Amanpreet Singh, Joseph Chee Chang, Kyle Lo, Luca Soldaini, Sergey Feldman, Mike D'Arcy, David Wadden, Matt Latzke, Minyang Tian, Pan Ji, Shengyan Liu, Hao Tong, Bohao Wu, Yanyu Xiong, Luke Zettlemoyer, Graham Neubig, Daniel S. Weld, Doug Downey, Wen-tau Yih, Pang Wei Koh, Hannaneh Hajishirzi:
OpenScholar: Synthesizing Scientific Literature with Retrieval-augmented LMs. CoRR abs/2411.14199 (2024) - [i34]Nathan Lambert, Jacob Morrison, Valentina Pyatkin, Shengyi Huang, Hamish Ivison, Faeze Brahman, Lester James V. Miranda, Alisa Liu, Nouha Dziri, Shane Lyu, Yuling Gu, Saumya Malik, Victoria Graf, Jena D. Hwang, Jiangjiang Yang, Ronan Le Bras, Oyvind Tafjord, Chris Wilhelm, Luca Soldaini, Noah A. Smith, Yizhong Wang, Pradeep Dasigi, Hannaneh Hajishirzi:
TÜLU 3: Pushing Frontiers in Open Language Model Post-Training. CoRR abs/2411.15124 (2024) - [i33]Akshita Bhagia, Jiacheng Liu, Alexander Wettig, David Heineman, Oyvind Tafjord, Ananya Harsh Jha, Luca Soldaini, Noah A. Smith, Dirk Groeneveld, Pang Wei Koh, Jesse Dodge, Hannaneh Hajishirzi:
Establishing Task Scaling Laws via Compute-Efficient Model Ladders. CoRR abs/2412.04403 (2024) - 2023
- [j5]Suzan Verberne, Hussein Suleman, Luca Soldaini, Avijit Ghosh:
Report on the SIGIR 2023 Session on Diversity, Equity and Inclusivity. SIGIR Forum 57(2): 11:1-11:2 (2023) - [c36]Nathan Dennler, Anaelia Ovalle, Ashwin Singh, Luca Soldaini, Arjun Subramonian, Huy Tu, William Agnew, Avijit Ghosh, Kyra Yee, Irene Font Peradejordi, Zeerak Talat, Mayra Russo, Jessica de Jesus de Pinho Pinhal:
Bound by the Bounty: Collaboratively Shaping Evaluation Processes for Queer AI Harms. AIES 2023: 375-386 - [c35]Jon Saad-Falcon, Amanpreet Singh, Luca Soldaini, Mike D'Arcy, Arman Cohan, Doug Downey:
Embedding Recycling for Language Models. EACL (Findings) 2023: 1888-1908 - [c34]Kyle Lo, Zejiang Shen, Benjamin Newman, Joseph Chee Chang, Russell Authur, Erin Bransom, Stefan Candra, Yoganand Chandrasekhar, Regan Huff, Bailey Kuehl, Amanpreet Singh, Chris Wilhelm, Angele Zamarron, Marti A. Hearst, Daniel S. Weld, Doug Downey, Luca Soldaini:
PaperMage: A Unified Toolkit for Processing, Representing, and Manipulating Visually-Rich Scientific Documents. EMNLP (Demos) 2023: 495-507 - [c33]Benjamin Newman, Luca Soldaini, Raymond Fok, Arman Cohan, Kyle Lo:
A Question Answering Framework for Decontextualizing User-facing Snippets from Scientific Documents. EMNLP 2023: 3194-3212 - [c32]John M. Giorgi, Luca Soldaini, Bo Wang, Gary D. Bader, Kyle Lo, Lucy Lu Wang, Arman Cohan:
Open Domain Multi-document Summarization: A Comprehensive Study of Model Brittleness under Retrieval. EMNLP (Findings) 2023: 8177-8199 - [c31]Organizers Of QueerInAI, Anaelia Ovalle, Arjun Subramonian, Ashwin Singh, Claas Voelcker, Danica J. Sutherland, Davide Locatelli, Eva Breznik, Filip Klubicka, Hang Yuan, Hetvi Jethwani, Huan Zhang, Jaidev Shriram, Kruno Lehman, Luca Soldaini, Maarten Sap, Marc Peter Deisenroth, Maria Leonor Pacheco, Maria Ryskina, Martin Mundt, Milind Agarwal, Nyx McLean, Pan Xu, Pranav A, Raj Korpan, Ruchira Ray, Sarah Mathew, Sarthak Arora, St John, Tanvi Anand, Vishakha Agrawal, William Agnew, Yanan Long, Zijie J. Wang, Zeerak Talat, Avijit Ghosh, Nathaniel Dennler, Michael Noseworthy, Sharvani Jha, Emi Baylor, Aditya Joshi, Natalia Y. Bilenko, Andrew McNamara, Raphael Gontijo Lopes, Alex Markham, Evyn Dong, Jackie Kay, Manu Saraswat, Nikhil Vytla, Luke Stark:
Queer In AI: A Case Study in Community-Led Participatory AI. FAccT 2023: 1882-1895 - [c30]Raymond Fok, Hita Kambhamettu, Luca Soldaini, Jonathan Bragg, Kyle Lo, Marti A. Hearst, Andrew Head, Daniel S. Weld:
Scim: Intelligent Skimming Support for Scientific Papers. IUI 2023: 476-490 - [c29]Sean MacAvaney, Luca Soldaini:
One-Shot Labeling for Automatic Relevance Estimation. SIGIR 2023: 2230-2235 - [c28]Dawn J. Lawrie, Sean MacAvaney, James Mayfield, Paul McNamee, Douglas W. Oard, Luca Soldaini, Eugene Yang:
Overview of the TREC 2023 NeuCLIR Track. TREC 2023 - [e1]Danilo Croce, Luca Soldaini:
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics. EACL 2023 - System Demonstrations, Dubrovnik, Croatia, May 2-4, 2023. Association for Computational Linguistics 2023, ISBN 978-1-959429-45-6 [contents] - [i32]Rodney Kinney, Chloe Anastasiades, Russell Authur, Iz Beltagy, Jonathan Bragg, Alexandra Buraczynski, Isabel Cachola, Stefan Candra, Yoganand Chandrasekhar, Arman Cohan, Miles Crawford, Doug Downey, Jason Dunkelberger, Oren Etzioni, Rob Evans, Sergey Feldman, Joseph Gorney, David Graham, Fangzhou Hu, Regan Huff, Daniel King, Sebastian Kohlmeier, Bailey Kuehl, Michael Langan, Daniel Lin, Haokun Liu, Kyle Lo, Jaron Lochner, Kelsey MacMillan, Tyler Murray, Chris Newell, Smita Rao, Shaurya Rohatgi, Paul Sayre, Zejiang Shen, Amanpreet Singh, Luca Soldaini, Shivashankar Subramanian, Amber Tanaka, Alex D. Wade, Linda Wagner, Lucy Lu Wang, Chris Wilhelm, Caroline Wu, Jiangjiang Yang, Angele Zamarron, Madeleine van Zuylen, Daniel S. Weld:
The Semantic Scholar Open Data Platform. CoRR abs/2301.10140 (2023) - [i31]Sean MacAvaney, Luca Soldaini:
One-Shot Labeling for Automatic Relevance Estimation. CoRR abs/2302.11266 (2023) - [i30]Kyle Lo, Joseph Chee Chang, Andrew Head, Jonathan Bragg, Amy X. Zhang, Cassidy Trier, Chloe Anastasiades, Tal August, Russell Authur, Danielle Bragg, Erin Bransom, Isabel Cachola, Stefan Candra, Yoganand Chandrasekhar, Yen-Sung Chen, Evie Yu-Yen Cheng, Yvonne Chou, Doug Downey, Rob Evans, Raymond Fok, Fangzhou Hu, Regan Huff, Dongyeop Kang, Tae Soo Kim, Rodney Kinney, Aniket Kittur, Hyeonsu B. Kang, Egor Klevak, Bailey Kuehl, Michael Langan, Matt Latzke, Jaron Lochner, Kelsey MacMillan, Eric Marsh, Tyler Murray, Aakanksha Naik, Ngoc-Uyen Nguyen, Srishti Palani, Soya Park, Caroline Paulic, Napol Rachatasumrit, Smita Rao, Paul Sayre, Zejiang Shen, Pao Siangliulue, Luca Soldaini, Huy Tran, Madeleine van Zuylen, Lucy Lu Wang, Chris Wilhelm, Caroline Wu, Jiangjiang Yang, Angele Zamarron, Marti A. Hearst, Daniel S. Weld:
The Semantic Reader Project: Augmenting Scholarly Documents through AI-Powered Interactive Reading Interfaces. CoRR abs/2303.14334 (2023) - [i29]Anaelia Ovalle, Arjun Subramonian, Ashwin Singh, Claas Voelcker, Danica J. Sutherland, Davide Locatelli, Eva Breznik, Filip Klubicka, Hang Yuan, Hetvi Jethwani, Huan Zhang, Jaidev Shriram, Kruno Lehman, Luca Soldaini, Maarten Sap, Marc Peter Deisenroth, Maria Leonor Pacheco, Maria Ryskina, Martin Mundt, Milind Agarwal, Nyx McLean, Pan Xu, Pranav A, Raj Korpan, Ruchira Ray, Sarah Mathew, Sarthak Arora, St John, Tanvi Anand, Vishakha Agrawal, William Agnew, Yanan Long, Zijie J. Wang, Zeerak Talat, Avijit Ghosh, Nathaniel Dennler, Michael Noseworthy, Sharvani Jha, Emi Baylor, Aditya Joshi, Natalia Y. Bilenko, Andrew McNamara, Raphael Gontijo Lopes, Alex Markham, Evyn Dong, Jackie Kay, Manu Saraswat, Nikhil Vytla, Luke Stark:
Queer In AI: A Case Study in Community-Led Participatory AI. CoRR abs/2303.16972 (2023) - [i28]Dawn J. Lawrie, Sean MacAvaney, James Mayfield, Paul McNamee, Douglas W. Oard, Luca Soldaini, Eugene Yang:
Overview of the TREC 2022 NeuCLIR Track. CoRR abs/2304.12367 (2023) - [i27]Benjamin Newman, Luca Soldaini, Raymond Fok, Arman Cohan, Kyle Lo:
A Controllable QA-based Framework for Decontextualization. CoRR abs/2305.14772 (2023) - [i26]Organizers Of QueerInAI, Nathaniel Dennler, Anaelia Ovalle, Ashwin Singh, Luca Soldaini, Arjun Subramonian, Huy Tu, William Agnew, Avijit Ghosh, Kyra Yee, Irene Font Peradejordi, Zeerak Talat, Mayra Russo, Jess de Jesus de Pinho Pinhal:
Bound by the Bounty: Collaboratively Shaping Evaluation Processes for Queer AI Harms. CoRR abs/2307.10223 (2023) - [i25]Orion Weller, Kyle Lo, David Wadden, Dawn J. Lawrie, Benjamin Van Durme, Arman Cohan, Luca Soldaini:
When do Generative Query and Document Expansions Fail? A Comprehensive Study Across Methods, Retrievers, and Datasets. CoRR abs/2309.08541 (2023) - [i24]Pratyusha Ria Kalluri, William Agnew, Myra Cheng, Kentrell Owens, Luca Soldaini, Abeba Birhane:
The Surveillance AI Pipeline. CoRR abs/2309.15084 (2023) - [i23]Yanai Elazar, Akshita Bhagia, Ian Magnusson, Abhilasha Ravichander, Dustin Schwenk, Alane Suhr, Pete Walsh, Dirk Groeneveld, Luca Soldaini, Sameer Singh, Hanna Hajishirzi, Noah A. Smith, Jesse Dodge:
What's In My Big Data? CoRR abs/2310.20707 (2023) - [i22]Hyunji Lee, Luca Soldaini, Arman Cohan, Minjoon Seo, Kyle Lo:
Back to Basics: A Simple Recipe for Improving Out-of-Domain Retrieval in Dense Encoders. CoRR abs/2311.09765 (2023) - [i21]Ian Magnusson, Akshita Bhagia, Valentin Hofmann, Luca Soldaini, Ananya Harsh Jha, Oyvind Tafjord, Dustin Schwenk, Evan Pete Walsh, Yanai Elazar, Kyle Lo, Dirk Groeneveld, Iz Beltagy, Hannaneh Hajishirzi, Noah A. Smith, Kyle Richardson, Jesse Dodge:
Paloma: A Benchmark for Evaluating Language Model Fit. CoRR abs/2312.10523 (2023) - 2022
- [c27]Yoshitomo Matsubara, Luca Soldaini, Eric Lind, Alessandro Moschitti:
Ensemble Transformer for Efficient and Accurate Ranking Tasks: an Application to Question Answering Systems. EMNLP (Findings) 2022: 7259-7272 - [c26]Matteo Gabburo, Rik Koncel-Kedziorski, Siddhant Garg, Luca Soldaini, Alessandro Moschitti:
Knowledge Transfer from Answer Ranking to Answer Generation. EMNLP 2022: 9481-9495 - [c25]Luca Di Liello, Siddhant Garg, Luca Soldaini, Alessandro Moschitti:
Pre-training Transformer Models with Sentence-Level Objectives for Answer Sentence Selection. EMNLP 2022: 11806-11816 - [c24]Benjamin Muller, Luca Soldaini, Rik Koncel-Kedziorski, Eric Lind, Alessandro Moschitti:
Cross-Lingual Open-Domain Question Answering with Answer Sentence Generation. AACL/IJCNLP (1) 2022: 337-353 - [c23]Luca Di Liello, Siddhant Garg, Luca Soldaini, Alessandro Moschitti:
Paragraph-based Transformer Pre-training for Multi-Sentence Inference. NAACL-HLT 2022: 2521-2531 - [c22]Dawn J. Lawrie, Sean MacAvaney, James Mayfield, Paul McNamee, Douglas W. Oard, Luca Soldaini, Eugene Yang:
Overview of the TREC 2022 NeuCLIR Track. TREC 2022 - [i20]Yoshitomo Matsubara, Luca Soldaini, Eric Lind, Alessandro Moschitti:
Ensemble Transformer for Efficient and Accurate Ranking Tasks: an Application to Question Answering Systems. CoRR abs/2201.05767 (2022) - [i19]Luca Di Liello, Siddhant Garg, Luca Soldaini, Alessandro Moschitti:
Paragraph-based Transformer Pre-training for Multi-Sentence Inference. CoRR abs/2205.01228 (2022) - [i18]Luca Di Liello, Siddhant Garg, Luca Soldaini, Alessandro Moschitti:
Pre-training Transformer Models with Sentence-Level Objectives for Answer Sentence Selection. CoRR abs/2205.10455 (2022) - [i17]Jon Saad-Falcon, Amanpreet Singh, Luca Soldaini, Mike D'Arcy, Arman Cohan, Doug Downey:
Embedding Recycling for Language Models. CoRR abs/2207.04993 (2022) - [i16]Matteo Gabburo, Rik Koncel-Kedziorski, Siddhant Garg, Luca Soldaini, Alessandro Moschitti:
Knowledge Transfer from Answer Ranking to Answer Generation. CoRR abs/2210.12865 (2022) - [i15]John M. Giorgi, Luca Soldaini, Bo Wang, Gary D. Bader, Kyle Lo, Lucy Lu Wang, Arman Cohan:
Exploring the Challenges of Open Domain Multi-Document Summarization. CoRR abs/2212.10526 (2022) - 2021
- [c21]Chao-Chun Hsu, Eric Lind, Luca Soldaini, Alessandro Moschitti:
Answer Generation for Retrieval-based Question Answering Systems. ACL/IJCNLP (Findings) 2021: 4276-4282 - [c20]Rujun Han, Luca Soldaini, Alessandro Moschitti:
Modeling Context in Answer Sentence Selection Systems on a Latency Budget. EACL 2021: 3005-3010 - [i14]Rujun Han, Luca Soldaini, Alessandro Moschitti:
Modeling Context in Answer Sentence Selection Systems on a Latency Budget. CoRR abs/2101.12093 (2021) - [i13]Chao-Chun Hsu, Eric Lind, Luca Soldaini, Alessandro Moschitti:
Answer Generation for Retrieval-based Question Answering Systems. CoRR abs/2106.00955 (2021) - [i12]Benjamin Muller, Luca Soldaini, Rik Koncel-Kedziorski, Eric Lind, Alessandro Moschitti:
Cross-Lingual GenQA: A Language-Agnostic Generative Question Answering Approach for Open-Domain Question Answering. CoRR abs/2110.07150 (2021) - 2020
- [c19]Luca Soldaini, Alessandro Moschitti:
The Cascade Transformer: an Application for Efficient Answer Sentence Selection. ACL 2020: 5697-5708 - [c18]Mingda Li, Xinyue Liu, Weitong Ruan, Luca Soldaini, Wael Hamza, Chengwei Su:
Multi-task Learning of Spoken Language Understanding by Integrating N-Best Hypotheses with Hierarchical Attention. COLING (Industry) 2020: 113-123 - [c17]Sean MacAvaney, Luca Soldaini, Nazli Goharian:
Teaching a New Dog Old Tricks: Resurrecting Multilingual Retrieval Using Zero-Shot Learning. ECIR (2) 2020: 246-254 - [c16]Subendhu Rongali, Luca Soldaini, Emilio Monti, Wael Hamza:
Don't Parse, Generate! A Sequence to Sequence Architecture for Task-Oriented Semantic Parsing. WWW 2020: 2962-2968 - [i11]Mingda Li, Weitong Ruan, Xinyue Liu, Luca Soldaini, Wael Hamza, Chengwei Su:
Improving Spoken Language Understanding By Exploiting ASR N-best Hypotheses. CoRR abs/2001.05284 (2020) - [i10]Subendhu Rongali, Luca Soldaini, Emilio Monti, Wael Hamza:
Don't Parse, Generate! A Sequence to Sequence Architecture for Task-Oriented Semantic Parsing. CoRR abs/2001.11458 (2020) - [i9]Luca Soldaini, Alessandro Moschitti:
The Cascade Transformer: an Application for Efficient Answer Sentence Selection. CoRR abs/2005.02534 (2020)
2010 – 2019
- 2019
- [j4]Sean MacAvaney, Andrew Yates, Arman Cohan, Luca Soldaini, Kai Hui, Nazli Goharian, Ophir Frieder:
Overcoming low-utility facets for complex answer retrieval. Inf. Retr. J. 22(3-4): 395-418 (2019) - [i8]Sean MacAvaney, Luca Soldaini, Nazli Goharian:
Teaching a New Dog Old Tricks: Resurrecting Multilingual Retrieval Using Zero-shot Learning. CoRR abs/1912.13080 (2019) - 2018
- [j3]Luca Soldaini:
The Knowledge and Language Gap in Medical Information Seeking. SIGIR Forum 52(2): 178-179 (2018) - [c15]Sean MacAvaney, Bart Desmet, Arman Cohan, Luca Soldaini, Andrew Yates, Ayah Zirikly, Nazli Goharian:
RSDD-Time: Temporal Annotation of Self-Reported Mental Health Diagnoses. CLPsych@NAACL-HTL 2018: 168-173 - [c14]Luca Soldaini, Timothy Walsh, Arman Cohan, Julien Han, Nazli Goharian:
Helping or Hurting? Predicting Changes in Users' Risk of Self-Harm Through Online Community Interactions. CLPsych@NAACL-HTL 2018: 194-203 - [c13]Ziling Fan, Luca Soldaini, Arman Cohan, Nazli Goharian:
Relation Extraction for Protein-protein Interactions Affected by Mutations. BCB 2018: 506-507 - [c12]Arman Cohan, Bart Desmet, Andrew Yates, Luca Soldaini, Sean MacAvaney, Nazli Goharian:
SMHD: a Large-Scale Resource for Exploring Online Language Usage for Multiple Mental Health Conditions. COLING 2018: 1485-1497 - [c11]Sean MacAvaney, Luca Soldaini, Arman Cohan, Nazli Goharian:
GU IRLAB at SemEval-2018 Task 7: Tree-LSTMs for Scientific Relation Classification. SemEval@NAACL-HLT 2018: 831-835 - [c10]Sean MacAvaney, Andrew Yates, Arman Cohan, Luca Soldaini, Kai Hui, Nazli Goharian, Ophir Frieder:
Overcoming Low-Utility Facets for Complex Answer Retrieval. ProfS/KG4IR/Data:Search@SIGIR 2018: 46-47 - [c9]Sean MacAvaney, Andrew Yates, Arman Cohan, Luca Soldaini, Kai Hui, Nazli Goharian, Ophir Frieder:
Characterizing Question Facets for Complex Answer Retrieval. SIGIR 2018: 1205-1208 - [i7]Sean MacAvaney, Luca Soldaini, Arman Cohan, Nazli Goharian:
GU IRLAB at SemEval-2018 Task 7: Tree-LSTMs for Scientific Relation Classification. CoRR abs/1804.05408 (2018) - [i6]Luca Soldaini, Timothy Walsh, Arman Cohan, Julien Han, Nazli Goharian:
Helping or Hurting? Predicting Changes in Users' Risk of Self-Harm Through Online Community Interactions. CoRR abs/1804.07253 (2018) - [i5]Sean MacAvaney, Andrew Yates, Arman Cohan, Luca Soldaini, Kai Hui, Nazli Goharian, Ophir Frieder:
Characterizing Question Facets for Complex Answer Retrieval. CoRR abs/1805.00791 (2018) - [i4]Arman Cohan, Bart Desmet, Andrew Yates, Luca Soldaini, Sean MacAvaney, Nazli Goharian:
SMHD: A Large-Scale Resource for Exploring Online Language Usage for Multiple Mental Health Conditions. CoRR abs/1806.05258 (2018) - [i3]Sean MacAvaney, Bart Desmet, Arman Cohan, Luca Soldaini, Andrew Yates, Ayah Zirikly, Nazli Goharian:
RSDD-Time: Temporal Annotation of Self-Reported Mental Health Diagnoses. CoRR abs/1806.07916 (2018) - [i2]Sean MacAvaney, Andrew Yates, Arman Cohan, Luca Soldaini, Kai Hui, Nazli Goharian, Ophir Frieder:
Overcoming low-utility facets for complex answer retrieval. CoRR abs/1811.08772 (2018) - 2017
- [j2]Luca Soldaini, Andrew Yates, Nazli Goharian:
Learning to reformulate long queries for clinical decision support. J. Assoc. Inf. Sci. Technol. 68(11): 2602-2619 (2017) - [c8]Luca Soldaini, Andrew Yates, Nazli Goharian:
Denoising Clinical Notes for Medical Literature Retrieval with Convolutional Neural Model. CIKM 2017: 2307-2310 - [c7]Luca Soldaini, Nazli Goharian:
Learning to Rank for Consumer Health Search: A Semantic Approach. ECIR 2017: 640-646 - [c6]Luca Soldaini, Elad Yom-Tov:
Inferring Individual Attributes from Search Engine Queries and Auxiliary Information. WWW 2017: 293-301 - 2016
- [j1]Luca Soldaini, Andrew Yates, Elad Yom-Tov, Ophir Frieder, Nazli Goharian:
Enhancing web search in the medical domain via query clarification. Inf. Retr. J. 19(1-2): 149-173 (2016) - [c5]Luca Soldaini, Will Edman, Nazli Goharian:
Team GU-IRLAB at CLEF eHealth 2016: Task 3. CLEF (Working Notes) 2016: 143-146 - [i1]