default search action
Katherine Metcalf
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c15]Katherine Metcalf, Miguel Sarabia, Masha Fedzechkina, Barry-John Theobald:
Can You Rely on Synthetic Labellers in Preference-Based Reinforcement Learning? It's Complicated. AAAI 2024: 10128-10136 - [c14]Yong Lin, Skyler Seto, Maartje ter Hoeve, Katherine Metcalf, Barry-John Theobald, Xuan Wang, Yizhe Zhang, Chen Huang, Tong Zhang:
On the Limited Generalization Capability of the Implicit Reward Model Induced by Direct Preference Optimization. EMNLP (Findings) 2024: 16015-16026 - [c13]Mudit Verma, Katherine Metcalf:
Hindsight PRIORs for Reward Learning from Human Preferences. ICLR 2024 - [c12]Xavier Suau, Pieter Delobelle, Katherine Metcalf, Armand Joulin, Nicholas Apostoloff, Luca Zappella, Pau Rodríguez:
Whispering Experts: Neural Interventions for Toxicity Mitigation in Language Models. ICML 2024 - [i12]Katherine Metcalf, Miguel Sarabia, Natalie Mackraz, Barry-John Theobald:
Sample-Efficient Preference-based Reinforcement Learning with Dynamics Aware Rewards. CoRR abs/2402.17975 (2024) - [i11]Mudit Verma, Katherine Metcalf:
Hindsight PRIORs for Reward Learning from Human Preferences. CoRR abs/2404.08828 (2024) - [i10]Xavier Suau, Pieter Delobelle, Katherine Metcalf, Armand Joulin, Nicholas Apostoloff, Luca Zappella, Pau Rodríguez:
Whispering Experts: Neural Interventions for Toxicity Mitigation in Language Models. CoRR abs/2407.12824 (2024) - [i9]Yong Lin, Skyler Seto, Maartje ter Hoeve, Katherine Metcalf, Barry-John Theobald, Xuan Wang, Yizhe Zhang, Chen Huang, Tong Zhang:
On the Limited Generalization Capability of the Implicit Reward Model Induced by Direct Preference Optimization. CoRR abs/2409.03650 (2024) - [i8]Stephane Aroca-Ouellette, Natalie Mackraz, Barry-John Theobald, Katherine Metcalf:
PREDICT: Preference Reasoning by Evaluating Decomposed preferences Inferred from Candidate Trajectories. CoRR abs/2410.06273 (2024) - 2023
- [c11]Katherine Metcalf, Miguel Sarabia, Natalie Mackraz, Barry-John Theobald:
Sample-Efficient Preference-based Reinforcement Learning with Dynamics Aware Rewards. CoRL 2023: 1484-1532 - [c10]Zakaria Aldeneh, Masha Fedzechkina, Skyler Seto, Katherine Metcalf, Miguel Sarabia, Nicholas Apostoloff, Barry-John Theobald:
On the Role of LIP Articulation in Visual Speech Perception. ICASSP 2023: 1-5 - [i7]Andrew Szot, Max Schwarzer, Harsh Agrawal, Bogdan Mazoure, Walter Talbott, Katherine Metcalf, Natalie Mackraz, R. Devon Hjelm, Alexander Toshev:
Large Language Models as Generalizable Policies for Embodied Tasks. CoRR abs/2310.17722 (2023) - 2022
- [i6]Andrew Silva, Katherine Metcalf, Nicholas Apostoloff, Barry-John Theobald:
FedEmbed: Personalized Private Federated Learning. CoRR abs/2202.09472 (2022) - [i5]Zakaria Aldeneh, Masha Fedzechkina, Skyler Seto, Katherine Metcalf, Miguel Sarabia, Nicholas Apostoloff, Barry-John Theobald:
Towards a Perceptual Model for Estimating the Quality of Visual Speech. CoRR abs/2203.10117 (2022) - [i4]Mudit Verma, Katherine Metcalf:
Symbol Guided Hindsight Priors for Reward Learning from Human Preferences. CoRR abs/2210.09151 (2022) - [i3]Katherine Metcalf, Miguel Sarabia, Barry-John Theobald:
Rewards Encoding Environment Dynamics Improves Preference-based Reinforcement Learning. CoRR abs/2211.06527 (2022)
2010 – 2019
- 2019
- [c9]Katherine Metcalf, David Leake:
Unsupervised Hierarchical Temporal Abstraction by Simultaneously Learning Expectations and Representations. IJCAI 2019: 3144-3150 - [c8]Katherine Metcalf, Barry-John Theobald, Garrett Weinberg, Robert Lee, Ing-Marie Jonsson, Russ Webb, Nicholas Apostoloff:
Mirroring to Build Trust in Digital Assistants. INTERSPEECH 2019: 4000-4004 - [i2]Katherine Metcalf, Barry-John Theobald, Garrett Weinberg, Robert Lee, Ing-Marie Jonsson, Russ Webb, Nicholas Apostoloff:
Mirroring to Build Trust in Digital Assistants. CoRR abs/1904.01664 (2019) - 2018
- [c7]Katherine Metcalf, Barry-John Theobald, Nicholas Apostoloff:
Learning Sharing Behaviors with Arbitrary Numbers of Agents. AAMAS 2018: 1232-1240 - [c6]Katherine Metcalf, David Leake:
Embedded Word Representations for Rich Indexing: A Case Study for Medical Records. ICCBR 2018: 264-280 - [i1]Katherine Metcalf, Barry-John Theobald, Nicholas Apostoloff:
Learning Sharing Behaviors with Arbitrary Numbers of Agents. CoRR abs/1812.04145 (2018) - 2017
- [c5]Katherine Metcalf, David Leake:
Modelling Unsupervised Event Segmentation: Learning Event Boundaries from Prediction Errors. CogSci 2017 - 2016
- [c4]Katherine Metcalf, David B. Leake:
A Computational Method for Extracting, Representing, and Predicting Social Closeness. ECAI 2016: 1176-1184 - 2015
- [c3]Katherine Metcalf, David Leake:
Investigating Methods and Representations for Reasoning About Social Context and Relative Social Power. CONTEXT 2015: 385-397 - 2013
- [c2]Alan Buabuchachart, Nina Charness, Katherine Metcalf, Leora Morgenstern:
Automated Methods for Extracting and Expanding Lists in Regulatory Text. DoCoPe@JURIX 2013 - [c1]Alan Buabuchachart, Katherine Metcalf, Nina Charness, Leora Morgenstern:
Classification of Regulatory Paragraphs by Discourse Structure, Reference Structure, and Regulation Type. JURIX 2013: 59-62
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-11-18 21:43 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint