default search action
Oyvind Tafjord
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
Journal Articles
- 2020
- [j2]Peter Clark, Oren Etzioni, Tushar Khot, Daniel Khashabi, Bhavana Dalvi Mishra, Kyle Richardson, Ashish Sabharwal, Carissa Schoenick, Oyvind Tafjord, Niket Tandon, Sumithra Bhakthavatsalam, Dirk Groeneveld, Michal Guerquin, Michael Schmitz:
From 'F' to 'A' on the N.Y. Regents Science Exams: An Overview of the Aristo Project. AI Mag. 41(4): 39-53 (2020) - 2017
- [j1]Carissa Schoenick, Peter Clark, Oyvind Tafjord, Peter D. Turney, Oren Etzioni:
Moving beyond the Turing Test with the Allen AI Science Challenge. Commun. ACM 60(9): 60-64 (2017)
Conference and Workshop Papers
- 2024
- [c26]Neset Özkan Tan, Niket Tandon, David Wadden, Oyvind Tafjord, Mark Gahegan, Michael Witbrock:
Faithful Reasoning over Scientific Claims. AAAI Spring Symposia 2024: 263-272 - [c25]Yuling Gu, Oyvind Tafjord, Peter Clark:
Digital Socrates: Evaluating LLMs through Explanation Critiques. ACL (1) 2024: 5559-5586 - [c24]Luca Soldaini, Rodney Kinney, Akshita Bhagia, Dustin Schwenk, David Atkinson, Russell Authur, Ben Bogin, Khyathi Raghavi Chandu, Jennifer Dumas, Yanai Elazar, Valentin Hofmann, Ananya Harsh Jha, Sachin Kumar, Li Lucy, Xinxi Lyu, Nathan Lambert, Ian Magnusson, Jacob Morrison, Niklas Muennighoff, Aakanksha Naik, Crystal Nam, Matthew E. Peters, Abhilasha Ravichander, Kyle Richardson, Zejiang Shen, Emma Strubell, Nishant Subramani, Oyvind Tafjord, Evan Pete Walsh, Luke Zettlemoyer, Noah A. Smith, Hannaneh Hajishirzi, Iz Beltagy, Dirk Groeneveld, Jesse Dodge, Kyle Lo:
Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research. ACL (1) 2024: 15725-15788 - [c23]Dirk Groeneveld, Iz Beltagy, Evan Pete Walsh, Akshita Bhagia, Rodney Kinney, Oyvind Tafjord, Ananya Harsh Jha, Hamish Ivison, Ian Magnusson, Yizhong Wang, Shane Arora, David Atkinson, Russell Authur, Khyathi Raghavi Chandu, Arman Cohan, Jennifer Dumas, Yanai Elazar, Yuling Gu, Jack Hessel, Tushar Khot, William Merrill, Jacob Morrison, Niklas Muennighoff, Aakanksha Naik, Crystal Nam, Matthew E. Peters, Valentina Pyatkin, Abhilasha Ravichander, Dustin Schwenk, Saurabh Shah, Will Smith, Emma Strubell, Nishant Subramani, Mitchell Wortsman, Pradeep Dasigi, Nathan Lambert, Kyle Richardson, Luke Zettlemoyer, Jesse Dodge, Kyle Lo, Luca Soldaini, Noah A. Smith, Hannaneh Hajishirzi:
OLMo: Accelerating the Science of Language Models. ACL (1) 2024: 15789-15809 - 2023
- [c22]Sarah Wiegreffe, Matthew Finlayson, Oyvind Tafjord, Peter Clark, Ashish Sabharwal:
Increasing Probability Mass on Answer Choices Does Not Always Improve Accuracy. EMNLP 2023: 8392-8417 - [c21]Nora Kassner, Oyvind Tafjord, Ashish Sabharwal, Kyle Richardson, Hinrich Schütze, Peter Clark:
Language Models with Rationality. EMNLP 2023: 14190-14201 - 2022
- [c20]Oyvind Tafjord, Bhavana Dalvi Mishra, Peter Clark:
Entailer: Answering Questions with Faithful and Truthful Chains of Reasoning. EMNLP 2022: 2078-2093 - [c19]Swaroop Mishra, Matthew Finlayson, Pan Lu, Leonard Tang, Sean Welleck, Chitta Baral, Tanmay Rajpurohit, Oyvind Tafjord, Ashish Sabharwal, Peter Clark, Ashwin Kalyan:
LILA: A Unified Benchmark for Mathematical Reasoning. EMNLP 2022: 5807-5832 - [c18]Bhavana Dalvi Mishra, Oyvind Tafjord, Peter Clark:
Towards Teachable Reasoning Systems: Using a Dynamic Memory of User Feedback for Continual System Improvement. EMNLP 2022: 9465-9480 - [c17]Pan Lu, Swaroop Mishra, Tanglin Xia, Liang Qiu, Kai-Wei Chang, Song-Chun Zhu, Oyvind Tafjord, Peter Clark, Ashwin Kalyan:
Learn to Explain: Multimodal Reasoning via Thought Chains for Science Question Answering. NeurIPS 2022 - 2021
- [c16]Oyvind Tafjord, Bhavana Dalvi, Peter Clark:
ProofWriter: Generating Implications, Proofs, and Abductive Statements over Natural Language. ACL/IJCNLP (Findings) 2021: 3621-3634 - [c15]Faeze Brahman, Meng Huang, Oyvind Tafjord, Chao Zhao, Mrinmaya Sachan, Snigdha Chaturvedi:
"Let Your Characters Tell Their Story": A Dataset for Character-Centric Narrative Understanding. EMNLP (Findings) 2021: 1734-1752 - [c14]Bhavana Dalvi, Peter Jansen, Oyvind Tafjord, Zhengnan Xie, Hannah Smith, Leighanna Pipatanangkura, Peter Clark:
Explaining Answers with Entailment Trees. EMNLP (1) 2021: 7358-7370 - [c13]Nora Kassner, Oyvind Tafjord, Hinrich Schütze, Peter Clark:
BeliefBank: Adding Memory to a Pre-Trained Language Model for a Systematic Notion of Belief. EMNLP (1) 2021: 8849-8861 - 2020
- [c12]Lucy Lu Wang, Oyvind Tafjord, Arman Cohan, Sarthak Jain, Sam Skjonsberg, Carissa Schoenick, Nick Botner, Waleed Ammar:
SUPP.AI: finding evidence for supplement-drug interactions. ACL (demo) 2020: 362-371 - [c11]Daniel Khashabi, Sewon Min, Tushar Khot, Ashish Sabharwal, Oyvind Tafjord, Peter Clark, Hannaneh Hajishirzi:
UnifiedQA: Crossing Format Boundaries With a Single QA System. EMNLP (Findings) 2020: 1896-1907 - [c10]Vered Shwartz, Rachel Rudinger, Oyvind Tafjord:
"You are grounded!": Latent Name Artifacts in Pre-trained Language Models. EMNLP (1) 2020: 6850-6861 - [c9]Peter Clark, Oyvind Tafjord, Kyle Richardson:
Transformers as Soft Reasoners over Language. IJCAI 2020: 3882-3890 - [c8]Dongfang Xu, Peter A. Jansen, Jaycie Martin, Zhengnan Xie, Vikas Yadav, Harish Tayyar Madabushi, Oyvind Tafjord, Peter Clark:
Multi-class Hierarchical Question Classification for Multiple Choice Science Exams. LREC 2020: 5370-5382 - [c7]Alon Talmor, Oyvind Tafjord, Peter Clark, Yoav Goldberg, Jonathan Berant:
Leap-Of-Thought: Teaching Pre-Trained Models to Systematically Reason Over Implicit Knowledge. NeurIPS 2020 - 2019
- [c6]Arindam Mitra, Peter Clark, Oyvind Tafjord, Chitta Baral:
Declarative Question Answering over Knowledge Bases Containing Natural Language Text with Answer Set Programming. AAAI 2019: 3003-3010 - [c5]Oyvind Tafjord, Peter Clark, Matt Gardner, Wen-tau Yih, Ashish Sabharwal:
QUAREL: A Dataset and Models for Answering Questions about Qualitative Relationships. AAAI 2019: 7063-7071 - [c4]Kevin Lin, Oyvind Tafjord, Peter Clark, Matt Gardner:
Reasoning Over Paragraph Effects in Situations. MRQA@EMNLP 2019: 58-62 - [c3]Oyvind Tafjord, Matt Gardner, Kevin Lin, Peter Clark:
QuaRTz: An Open-Domain Dataset of Qualitative Relationship Questions. EMNLP/IJCNLP (1) 2019: 5940-5945 - 2016
- [c2]Peter Clark, Oren Etzioni, Tushar Khot, Ashish Sabharwal, Oyvind Tafjord, Peter D. Turney, Daniel Khashabi:
Combining Retrieval, Statistics, and Inference to Answer Elementary Science Questions. AAAI 2016: 2580-2586 - [c1]Jayant Krishnamurthy, Oyvind Tafjord, Aniruddha Kembhavi:
Semantic Parsing to Probabilistic Programs for Situated Question Answering. EMNLP 2016: 160-170
Informal and Other Publications
- 2024
- [i40]Luca Soldaini, Rodney Kinney, Akshita Bhagia, Dustin Schwenk, David Atkinson, Russell Authur, Ben Bogin, Khyathi Raghavi Chandu, Jennifer Dumas, Yanai Elazar, Valentin Hofmann, Ananya Harsh Jha, Sachin Kumar, Li Lucy, Xinxi Lyu, Nathan Lambert, Ian Magnusson, Jacob Morrison, Niklas Muennighoff, Aakanksha Naik, Crystal Nam, Matthew E. Peters, Abhilasha Ravichander, Kyle Richardson, Zejiang Shen, Emma Strubell, Nishant Subramani, Oyvind Tafjord, Pete Walsh, Luke Zettlemoyer, Noah A. Smith, Hannaneh Hajishirzi, Iz Beltagy, Dirk Groeneveld, Jesse Dodge, Kyle Lo:
Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research. CoRR abs/2402.00159 (2024) - [i39]Dirk Groeneveld, Iz Beltagy, Pete Walsh, Akshita Bhagia, Rodney Kinney, Oyvind Tafjord, Ananya Harsh Jha, Hamish Ivison, Ian Magnusson, Yizhong Wang, Shane Arora, David Atkinson, Russell Authur, Khyathi Raghavi Chandu, Arman Cohan, Jennifer Dumas, Yanai Elazar, Yuling Gu, Jack Hessel, Tushar Khot, William Merrill, Jacob Morrison, Niklas Muennighoff, Aakanksha Naik, Crystal Nam, Matthew E. Peters, Valentina Pyatkin, Abhilasha Ravichander, Dustin Schwenk, Saurabh Shah, Will Smith, Emma Strubell, Nishant Subramani, Mitchell Wortsman, Pradeep Dasigi, Nathan Lambert, Kyle Richardson, Luke Zettlemoyer, Jesse Dodge, Kyle Lo, Luca Soldaini, Noah A. Smith, Hannaneh Hajishirzi:
OLMo: Accelerating the Science of Language Models. CoRR abs/2402.00838 (2024) - [i38]Nathaniel Weir, Kate Sanders, Orion Weller, Shreya Sharma, Dongwei Jiang, Zhengping Jiang, Bhavana Dalvi Mishra, Oyvind Tafjord, Peter A. Jansen, Peter Clark, Benjamin Van Durme:
Enhancing Systematic Decompositional Natural Language Inference Using Informal Logic. CoRR abs/2402.14798 (2024) - [i37]Peter A. Jansen, Marc-Alexandre Côté, Tushar Khot, Erin Bransom, Bhavana Dalvi Mishra, Bodhisattwa Prasad Majumder, Oyvind Tafjord, Peter Clark:
DISCOVERYWORLD: A Virtual Environment for Developing and Evaluating Automated Scientific Discovery Agents. CoRR abs/2406.06769 (2024) - [i36]Yuling Gu, Oyvind Tafjord, Bailey Kuehl, Dany Haddad, Jesse Dodge, Hannaneh Hajishirzi:
OLMES: A Standard for Language Model Evaluations. CoRR abs/2406.08446 (2024) - [i35]Sarah Wiegreffe, Oyvind Tafjord, Yonatan Belinkov, Hannaneh Hajishirzi, Ashish Sabharwal:
Answer, Assemble, Ace: Understanding How Transformers Answer Multiple Choice Questions. CoRR abs/2407.15018 (2024) - [i34]Niklas Muennighoff, Luca Soldaini, Dirk Groeneveld, Kyle Lo, Jacob Morrison, Sewon Min, Weijia Shi, Pete Walsh, Oyvind Tafjord, Nathan Lambert, Yuling Gu, Shane Arora, Akshita Bhagia, Dustin Schwenk, David Wadden, Alexander Wettig, Binyuan Hui, Tim Dettmers, Douwe Kiela, Ali Farhadi, Noah A. Smith, Pang Wei Koh, Amanpreet Singh, Hannaneh Hajishirzi:
OLMoE: Open Mixture-of-Experts Language Models. CoRR abs/2409.02060 (2024) - 2023
- [i33]Nora Kassner, Oyvind Tafjord, Ashish Sabharwal, Kyle Richardson, Hinrich Schütze, Peter Clark:
Language Models with Rationality. CoRR abs/2305.14250 (2023) - [i32]Sarah Wiegreffe, Matthew Finlayson, Oyvind Tafjord, Peter Clark, Ashish Sabharwal:
Attentiveness to Answer Choices Doesn't Always Entail High QA Accuracy. CoRR abs/2305.14596 (2023) - [i31]Bodhisattwa Prasad Majumder, Bhavana Dalvi Mishra, Peter A. Jansen, Oyvind Tafjord, Niket Tandon, Li Zhang, Chris Callison-Burch, Peter Clark:
CLIN: A Continually Learning Language Agent for Rapid Task Adaptation and Generalization. CoRR abs/2310.10134 (2023) - [i30]Yuling Gu, Oyvind Tafjord, Peter Clark:
Digital Socrates: Evaluating LLMs through explanation critiques. CoRR abs/2311.09613 (2023) - [i29]Peter Clark, Bhavana Dalvi Mishra, Oyvind Tafjord:
BaRDa: A Belief and Reasoning Dataset that Separates Factual Accuracy and Reasoning Ability. CoRR abs/2312.07527 (2023) - [i28]Dirk Groeneveld, Anas Awadalla, Iz Beltagy, Akshita Bhagia, Ian Magnusson, Hao Peng, Oyvind Tafjord, Pete Walsh, Kyle Richardson, Jesse Dodge:
Catwalk: A Unified Language Model Evaluation Framework for Many Datasets. CoRR abs/2312.10253 (2023) - [i27]Ian Magnusson, Akshita Bhagia, Valentin Hofmann, Luca Soldaini, Ananya Harsh Jha, Oyvind Tafjord, Dustin Schwenk, Evan Pete Walsh, Yanai Elazar, Kyle Lo, Dirk Groeneveld, Iz Beltagy, Hannaneh Hajishirzi, Noah A. Smith, Kyle Richardson, Jesse Dodge:
Paloma: A Benchmark for Evaluating Language Model Fit. CoRR abs/2312.10523 (2023) - 2022
- [i26]Bhavana Dalvi, Oyvind Tafjord, Peter Clark:
Towards Teachable Reasoning Systems. CoRR abs/2204.13074 (2022) - [i25]Pan Lu, Swaroop Mishra, Tony Xia, Liang Qiu, Kai-Wei Chang, Song-Chun Zhu, Oyvind Tafjord, Peter Clark, Ashwin Kalyan:
Learn to Explain: Multimodal Reasoning via Thought Chains for Science Question Answering. CoRR abs/2209.09513 (2022) - [i24]Oyvind Tafjord, Bhavana Dalvi Mishra, Peter Clark:
Entailer: Answering Questions with Faithful and Truthful Chains of Reasoning. CoRR abs/2210.12217 (2022) - [i23]Swaroop Mishra, Matthew Finlayson, Pan Lu, Leonard Tang, Sean Welleck, Chitta Baral, Tanmay Rajpurohit, Oyvind Tafjord, Ashish Sabharwal, Peter Clark, Ashwin Kalyan:
Lila: A Unified Benchmark for Mathematical Reasoning. CoRR abs/2210.17517 (2022) - 2021
- [i22]Sumithra Bhakthavatsalam, Daniel Khashabi, Tushar Khot, Bhavana Dalvi Mishra, Kyle Richardson, Ashish Sabharwal, Carissa Schoenick, Oyvind Tafjord, Peter Clark:
Think you have Solved Direct-Answer Question Answering? Try ARC-DA, the Direct-Answer AI2 Reasoning Challenge. CoRR abs/2102.03315 (2021) - [i21]Nora Kassner, Oyvind Tafjord, Hinrich Schütze, Peter Clark:
Enriching a Model's Notion of Belief using a Persistent Memory. CoRR abs/2104.08401 (2021) - [i20]Bhavana Dalvi, Peter Jansen, Oyvind Tafjord, Zhengnan Xie, Hannah Smith, Leighanna Pipatanangkura, Peter Clark:
Explaining Answers with Entailment Trees. CoRR abs/2104.08661 (2021) - [i19]Oyvind Tafjord, Peter Clark:
General-Purpose Question-Answering with Macaw. CoRR abs/2109.02593 (2021) - [i18]Faeze Brahman, Meng Huang, Oyvind Tafjord, Chao Zhao, Mrinmaya Sachan, Snigdha Chaturvedi:
"Let Your Characters Tell Their Story": A Dataset for Character-Centric Narrative Understanding. CoRR abs/2109.05438 (2021) - [i17]Nora Kassner, Oyvind Tafjord, Hinrich Schütze, Peter Clark:
BeliefBank: Adding Memory to a Pre-Trained Language Model for a Systematic Notion of Belief. CoRR abs/2109.14723 (2021) - 2020
- [i16]Peter Clark, Oyvind Tafjord, Kyle Richardson:
Transformers as Soft Reasoners over Language. CoRR abs/2002.05867 (2020) - [i15]Vered Shwartz, Rachel Rudinger, Oyvind Tafjord:
"You are grounded!": Latent Name Artifacts in Pre-trained Language Models. CoRR abs/2004.03012 (2020) - [i14]Daniel Khashabi, Tushar Khot, Ashish Sabharwal, Oyvind Tafjord, Peter Clark, Hannaneh Hajishirzi:
UnifiedQA: Crossing Format Boundaries With a Single QA System. CoRR abs/2005.00700 (2020) - [i13]Alon Talmor, Oyvind Tafjord, Peter Clark, Yoav Goldberg, Jonathan Berant:
Teaching Pre-Trained Models to Systematically Reason Over Implicit Knowledge. CoRR abs/2006.06609 (2020) - [i12]Oyvind Tafjord, Bhavana Dalvi Mishra, Peter Clark:
ProofWriter: Generating Implications, Proofs, and Abductive Statements over Natural Language. CoRR abs/2012.13048 (2020) - 2019
- [i11]Arindam Mitra, Peter Clark, Oyvind Tafjord, Chitta Baral:
Declarative Question Answering over Knowledge Bases containing Natural Language Text with Answer Set Programming. CoRR abs/1905.00198 (2019) - [i10]Dongfang Xu, Peter A. Jansen, Jaycie Martin, Zhengnan Xie, Vikas Yadav, Harish Tayyar Madabushi, Oyvind Tafjord, Peter Clark:
Multi-class Hierarchical Question Classification for Multiple Choice Science Exams. CoRR abs/1908.05441 (2019) - [i9]Kevin Lin, Oyvind Tafjord, Peter Clark, Matt Gardner:
Reasoning Over Paragraph Effects in Situations. CoRR abs/1908.05852 (2019) - [i8]Peter Clark, Oren Etzioni, Daniel Khashabi, Tushar Khot, Bhavana Dalvi Mishra, Kyle Richardson, Ashish Sabharwal, Carissa Schoenick, Oyvind Tafjord, Niket Tandon, Sumithra Bhakthavatsalam, Dirk Groeneveld, Michal Guerquin, Michael Schmitz:
From 'F' to 'A' on the N.Y. Regents Science Exams: An Overview of the Aristo Project. CoRR abs/1909.01958 (2019) - [i7]Oyvind Tafjord, Matt Gardner, Kevin Lin, Peter Clark:
QuaRTz: An Open-Domain Dataset of Qualitative Relationship Questions. CoRR abs/1909.03553 (2019) - [i6]Lucy Lu Wang, Oyvind Tafjord, Sarthak Jain, Arman Cohan, Sam Skjonsberg, Carissa Schoenick, Nick Botner, Waleed Ammar:
Extracting evidence of supplement-drug interactions from literature. CoRR abs/1909.08135 (2019) - 2018
- [i5]Peter Clark, Isaac Cowhey, Oren Etzioni, Tushar Khot, Ashish Sabharwal, Carissa Schoenick, Oyvind Tafjord:
Think you have Solved Question Answering? Try ARC, the AI2 Reasoning Challenge. CoRR abs/1803.05457 (2018) - [i4]Matt Gardner, Joel Grus, Mark Neumann, Oyvind Tafjord, Pradeep Dasigi, Nelson F. Liu, Matthew E. Peters, Michael Schmitz, Luke Zettlemoyer:
AllenNLP: A Deep Semantic Natural Language Processing Platform. CoRR abs/1803.07640 (2018) - [i3]Oyvind Tafjord, Peter Clark, Matt Gardner, Wen-tau Yih, Ashish Sabharwal:
QuaRel: A Dataset and Models for Answering Questions about Qualitative Relationships. CoRR abs/1811.08048 (2018) - 2016
- [i2]Carissa Schoenick, Peter Clark, Oyvind Tafjord, Peter D. Turney, Oren Etzioni:
Moving Beyond the Turing Test with the Allen AI Science Challenge. CoRR abs/1604.04315 (2016) - [i1]Jayant Krishnamurthy, Oyvind Tafjord:
Semantic Parsing to Probabilistic Programs for Situated Question Answering. CoRR abs/1606.07046 (2016)
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-10-07 22:19 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint