Martha White
Person information
- affiliation: University of Alberta, Edmonton, Canada
2020 – today
- 2024
- [j21] Brett Daley, Marlos C. Machado, Martha White: Demystifying the Recency Heuristic in Temporal-Difference Learning. RLJ 3: 1019-1036 (2024)
- [j20] Parham Mohammad Panahi, Andrew Patterson, Martha White, Adam White: Investigating the Interplay of Prioritized Replay and Generalization. RLJ 5: 2041-2058 (2024)
- [j19] Andrew Patterson, Samuel Neumann, Raksha Kumaraswamy, Martha White, Adam White: Cross-environment Hyperparameter Tuning for Reinforcement Learning. RLJ 5: 2298-2319 (2024)
- [j18] Han Wang, Erfan Miahi, Martha White, Marlos C. Machado, Zaheer Abbas, Raksha Kumaraswamy, Vincent Liu, Adam White: Investigating the properties of neural network representations in reinforcement learning. Artif. Intell. 330: 104100 (2024)
- [j17] Farzane Aminmansour, Taher Jafferjee, Ehsan Imani, Erin J. Talvitie, Michael Bowling, Martha White: Mitigating Value Hallucination in Dyna-Style Planning via Multistep Predecessor Models. J. Artif. Intell. Res. 80: 441-473 (2024)
- [j16] Muhammad Kamran Janjua, Haseeb Shah, Martha White, Erfan Miahi, Marlos C. Machado, Adam White: GVFs in the real world: making predictions online for water treatment. Mach. Learn. 113(8): 5151-5181 (2024)
- [j15] Lingwei Zhu, Matthew Schlegel, Han Wang, Martha White: Offline Reinforcement Learning via Tsallis Regularization. Trans. Mach. Learn. Res. 2024 (2024)
- [c63] Vincent Liu, James R. Wright, Martha White: Exploiting Action Impact Regularity and Exogenous State Variables for Offline Reinforcement Learning (Abstract Reprint). AAAI 2024: 22706
- [c62] Brett Daley, Martha White, Marlos C. Machado: Averaging n-step Returns Reduces Variance in Reinforcement Learning. ICML 2024
- [c61] Scott M. Jordan, Adam White, Bruno Castro da Silva, Martha White, Philip S. Thomas: Position: Benchmarking is Limited in Reinforcement Learning Research. ICML 2024
- [i86] Brett Daley, Martha White, Marlos C. Machado: Compound Returns Reduce Variance in Reinforcement Learning. CoRR abs/2402.03903 (2024)
- [i85] Hugo Silva, Martha White: What to Do When Your Discrete Optimization Is the Size of a Neural Network? CoRR abs/2402.10339 (2024)
- [i84] Ehsan Imani, Kai Luedemann, Sam Scholnick-Hughes, Esraa Elelimy, Martha White: Investigating the Histogram Loss in Regression. CoRR abs/2402.13425 (2024)
- [i83] Golnaz Mesbahi, Olya Mastikhina, Parham Mohammad Panahi, Martha White, Adam White: Tuning for the Unknown: Revisiting Evaluation Strategies for Lifelong RL. CoRR abs/2404.02113 (2024)
- [i82] Kevin Roice, Parham Mohammad Panahi, Scott M. Jordan, Adam White, Martha White: A New View on Planning in Online Reinforcement Learning. CoRR abs/2406.01562 (2024)
- [i81] Brett Daley, Marlos C. Machado, Martha White: Demystifying the Recency Heuristic in Temporal-Difference Learning. CoRR abs/2406.12284 (2024)
- [i80] Scott M. Jordan, Adam White, Bruno Castro da Silva, Martha White, Philip S. Thomas: Position: Benchmarking is Limited in Reinforcement Learning Research. CoRR abs/2406.16241 (2024)
- [i79] Parham Mohammad Panahi, Andrew Patterson, Martha White, Adam White: Investigating the Interplay of Prioritized Replay and Generalization. CoRR abs/2407.09702 (2024)
- [i78] Andrew Patterson, Samuel Neumann, Raksha Kumaraswamy, Martha White, Adam White: The Cross-environment Hyperparameter Setting Benchmark for Reinforcement Learning. CoRR abs/2407.18840 (2024)
- [i77] Lingwei Zhu, Haseeb Shah, Han Wang, Martha White: q-exponential family for policy optimization. CoRR abs/2408.07245 (2024)
- [i76] Esraa Elelimy, Adam White, Michael Bowling, Martha White: Real-Time Recurrent Learning using Trace Units in Reinforcement Learning. CoRR abs/2409.01449 (2024)
- [i75] Maximilian Bloor, José Torraca, Ilya Orson Sandoval, Akhil Ahmed, Martha White, Mehmet Mercangöz, Calvin Tsay, Ehecatl Antonio del Rio-Chanona, Max Mowbray: PC-Gym: Benchmark Environments For Process Control Problems. CoRR abs/2410.22093 (2024)
- [i74] Gautham Vasan, Mohamed Elsayed, Alireza Azimi, Jiamin He, Fahim Shariar, Colin Bellinger, Martha White, A. Rupam Mahmood: Deep Policy Gradient Methods Without Batch Updates, Target Networks, or Replay Buffers. CoRR abs/2411.15370 (2024)
- 2023
- [j14] Vincent Liu, James R. Wright, Martha White: Exploiting Action Impact Regularity and Exogenous State Variables for Offline Reinforcement Learning. J. Artif. Intell. Res. 77: 71-101 (2023)
- [j13] Eric Graves, Ehsan Imani, Raksha Kumaraswamy, Martha White: Off-Policy Actor-Critic with Emphatic Weightings. J. Mach. Learn. Res. 24: 146:1-146:63 (2023)
- [j12] Khurram Javed, Haseeb Shah, Richard S. Sutton, Martha White: Scalable Real-Time Recurrent Learning Using Columnar-Constructive Networks. J. Mach. Learn. Res. 24: 256:1-256:34 (2023)
- [j11] Andrew Patterson, Victor Liao, Martha White: Robust Losses for Learning Value Functions. IEEE Trans. Pattern Anal. Mach. Intell. 45(5): 6157-6167 (2023)
- [j10] Erfan Miahi, Revan MacQueen, Alex Ayoub, Abbas Masoumzadeh, Martha White: Resmax: An Alternative Soft-Greedy Operator for Reinforcement Learning. Trans. Mach. Learn. Res. 2023 (2023)
- [j9] Matthew Schlegel, Volodymyr Tkachuk, Adam M. White, Martha White: Investigating Action Encodings in Recurrent Neural Networks in Reinforcement Learning. Trans. Mach. Learn. Res. 2023 (2023)
- [c60] Vincent Liu, Yash Chandak, Philip S. Thomas, Martha White: Asymptotically Unbiased Off-Policy Policy Evaluation when Reusing Old Data in Nonstationary Environments. AISTATS 2023: 5474-5492
- [c59] Vincent Liu, Han Wang, Ruo Yu Tao, Khurram Javed, Adam White, Martha White: Measuring and Mitigating Interference in Reinforcement Learning. CoLLAs 2023: 781-795
- [c58] Samuel Neumann, Sungsu Lim, Ajin George Joseph, Yangchen Pan, Adam White, Martha White: Greedy Actor-Critic: A New Conditional Cross-Entropy Method for Policy Improvement. ICLR 2023
- [c57] Chenjun Xiao, Han Wang, Yangchen Pan, Adam White, Martha White: The In-Sample Softmax for Offline Reinforcement Learning. ICLR 2023
- [c56] Brett Daley, Martha White, Christopher Amato, Marlos C. Machado: Trajectory-Aware Eligibility Traces for Off-Policy Reinforcement Learning. ICML 2023: 6818-6835
- [c55] Lingwei Zhu, Zheng Chen, Matthew Schlegel, Martha White: General Munchausen Reinforcement Learning with Tsallis Kullback-Leibler Divergence. NeurIPS 2023
- [i73] Brett Daley, Martha White, Christopher Amato, Marlos C. Machado: Trajectory-Aware Eligibility Traces for Off-Policy Reinforcement Learning. CoRR abs/2301.11321 (2023)
- [i72] Lingwei Zhu, Zheng Chen, Takamitsu Matsubara, Martha White: Generalized Munchausen Reinforcement Learning using Tsallis KL Divergence. CoRR abs/2301.11476 (2023)
- [i71] Khurram Javed, Haseeb Shah, Richard S. Sutton, Martha White: Online Real-Time Recurrent Learning Using Sparse Connections and Selective Learning. CoRR abs/2302.05326 (2023)
- [i70] Vincent Liu, Yash Chandak, Philip S. Thomas, Martha White: Asymptotically Unbiased Off-Policy Policy Evaluation when Reusing Old Data in Nonstationary Environments. CoRR abs/2302.11725 (2023)
- [i69] Chenjun Xiao, Han Wang, Yangchen Pan, Adam White, Martha White: The In-Sample Softmax for Offline Reinforcement Learning. CoRR abs/2302.14372 (2023)
- [i68] Andrew Patterson, Samuel Neumann, Martha White, Adam White: Empirical Design in Reinforcement Learning. CoRR abs/2304.01315 (2023)
- [i67] James E. Kostas, Scott M. Jordan, Yash Chandak, Georgios Theocharous, Dhawal Gupta, Martha White, Bruno Castro da Silva, Philip S. Thomas: Coagent Networks: Generalized and Scaled. CoRR abs/2305.09838 (2023)
- [i66] Vincent Liu, Han Wang, Ruo Yu Tao, Khurram Javed, Adam White, Martha White: Measuring and Mitigating Interference in Reinforcement Learning. CoRR abs/2307.04887 (2023)
- [i65] Muhammad Kamran Janjua, Haseeb Shah, Martha White, Erfan Miahi, Marlos C. Machado, Adam White: GVFs in the Real World: Making Predictions Online for Water Treatment. CoRR abs/2312.01624 (2023)
- [i64] Vincent Liu, Prabhat Nagarajan, Andrew Patterson, Martha White: When is Offline Policy Selection Sample Efficient for Reinforcement Learning? CoRR abs/2312.02355 (2023)
- 2022
- [j8] Andrew Patterson, Adam White, Martha White: A Generalized Projected Bellman Error for Off-policy Value Estimation in Reinforcement Learning. J. Mach. Learn. Res. 23: 145:1-145:61 (2022)
- [j7] Alan Chan, Hugo Silva, Sungsu Lim, Tadashi Kozuno, A. Rupam Mahmood, Martha White: Greedification Operators for Policy Optimization: Investigating Forward and Reverse KL Divergences. J. Mach. Learn. Res. 23: 253:1-253:79 (2022)
- [j6] Ehsan Imani, Wei Hu, Martha White: Representation Alignment in Neural Networks. Trans. Mach. Learn. Res. 2022 (2022)
- [j5] Han Wang, Archit Sakhadeo, Adam M. White, James Bell, Vincent Liu, Xutong Zhao, Puer Liu, Tadashi Kozuno, Alona Fyshe, Martha White: No More Pesky Hyperparameters: Offline Hyperparameter Tuning for RL. Trans. Mach. Learn. Res. 2022 (2022)
- [c54] Shivam Garg, Samuele Tosatto, Yangchen Pan, Martha White, Rupam Mahmood: An Alternate Policy Gradient Estimator for Softmax Policies. AISTATS 2022: 6630-6689
- [c53] Kirby Banman, Liam Peet-Pare, Nidhi Hegde, Alona Fyshe, Martha White: Resonance in Weight Space: Covariate Shift Can Drive Divergence of SGD with Momentum. ICLR 2022
- [c52] Samuele Tosatto, Andrew Patterson, Martha White, Rupam Mahmood: A Temporal-Difference Approach to Policy Gradient Estimation. ICML 2022: 21609-21632
- [c51] Yangchen Pan, Jincheng Mei, Amir-massoud Farahmand, Martha White, Hengshuai Yao, Mohsen Rohani, Jun Luo: Understanding and mitigating the limitations of prioritized experience replay. UAI 2022: 1561-1571
- [i63] Samuele Tosatto, Andrew Patterson, Martha White, A. Rupam Mahmood: A Temporal-Difference Approach to Policy Gradient Estimation. CoRR abs/2202.02396 (2022)
- [i62] Matthew McLeod, Chunlok Lo, Matthew Schlegel, Andrew Jacobsen, Raksha Kumaraswamy, Martha White, Adam White: Continual Auxiliary Task Learning. CoRR abs/2202.11133 (2022)
- [i61] Kirby Banman, Liam Peet-Pare, Nidhi Hegde, Alona Fyshe, Martha White: Resonance in Weight Space: Covariate Shift Can Drive Divergence of SGD with Momentum. CoRR abs/2203.11992 (2022)
- [i60] Han Wang, Erfan Miahi, Martha White, Marlos C. Machado, Zaheer Abbas, Raksha Kumaraswamy, Vincent Liu, Adam White: Investigating the Properties of Neural Network Representations in Reinforcement Learning. CoRR abs/2203.15955 (2022)
- [i59] Andrew Patterson, Victor Liao, Martha White: Robust Losses for Learning Value Functions. CoRR abs/2205.08464 (2022)
- [i58] Han Wang, Archit Sakhadeo, Adam White, James Bell, Vincent Liu, Xutong Zhao, Puer Liu, Tadashi Kozuno, Alona Fyshe, Martha White: No More Pesky Hyperparameters: Offline Hyperparameter Tuning for RL. CoRR abs/2205.08716 (2022)
- [i57] Chunlok Lo, Gabor Mihucz, Adam White, Farzane Aminmansour, Martha White: Goal-Space Planning with Subgoal Models. CoRR abs/2206.02902 (2022)
- 2021
- [j4] Matthew Schlegel, Andrew Jacobsen, Zaheer Abbas, Andrew Patterson, Adam White, Martha White: General Value Function Networks. J. Artif. Intell. Res. 70: 497-543 (2021)
- [j3] Sebastian Höfer, Kostas E. Bekris, Ankur Handa, Juan Camilo Gamboa, Melissa Mozifian, Florian Golemo, Christopher G. Atkeson, Dieter Fox, Ken Goldberg, John Leonard, C. Karen Liu, Jan Peters, Shuran Song, Peter Welinder, Martha White: Sim2Real in Robotics and Automation: Applications and Challenges. IEEE Trans. Autom. Sci. Eng. 18(2): 398-400 (2021)
- [c50] Yangchen Pan, Kirby Banman, Martha White: Fuzzy Tiling Activations: A Simple Approach to Learning Sparse Representations Online. ICLR 2021
- [c49] Matthew McLeod, Chunlok Lo, Matthew Schlegel, Andrew Jacobsen, Raksha Kumaraswamy, Martha White, Adam White: Continual Auxiliary Task Learning. NeurIPS 2021: 12549-12562
- [c48] Dhawal Gupta, Gabor Mihucz, Matthew Schlegel, James E. Kostas, Philip S. Thomas, Martha White: Structural Credit Assignment in Neural Networks using Reinforcement Learning. NeurIPS 2021: 30257-30270
- [i56] Khurram Javed, Martha White, Richard S. Sutton: Scalable Online Recurrent Learning Using Columnar Neural Networks. CoRR abs/2103.05787 (2021)
- [i55] Andrew Patterson, Adam White, Sina Ghiassian, Martha White: A Generalized Projected Bellman Error for Off-policy Value Estimation in Reinforcement Learning. CoRR abs/2104.13844 (2021)
- [i54] Qingfeng Lan, Luke Kumar, Martha White, Alona Fyshe: Predictive Representation Learning for Language Modeling. CoRR abs/2105.14214 (2021)
- [i53] Alan Chan, Hugo Silva, Sungsu Lim, Tadashi Kozuno, A. Rupam Mahmood, Martha White: Greedification Operators for Policy Optimization: Investigating Forward and Reverse KL Divergences. CoRR abs/2107.08285 (2021)
- [i52] Vincent Liu, James R. Wright, Martha White: Exploiting Action Impact Regularity and Partially Known Models for Offline Reinforcement Learning. CoRR abs/2111.08066 (2021)
- [i51] Eric Graves, Ehsan Imani, Raksha Kumaraswamy, Martha White: Off-Policy Actor-Critic with Emphatic Weightings. CoRR abs/2111.08172 (2021)
- [i50] Ehsan Imani, Wei Hu, Martha White: Understanding Feature Transfer Through Representation Alignment. CoRR abs/2112.07806 (2021)
- [i49] Shivam Garg, Samuele Tosatto, Yangchen Pan, Martha White, A. Rupam Mahmood: An Alternate Policy Gradient Estimator for Softmax Policies. CoRR abs/2112.11622 (2021)
- 2020
- [j2] Cam Linke, Nadia M. Ady, Martha White, Thomas Degris, Adam White: Adapting Behavior via Intrinsic Reward: A Survey and Empirical Study. J. Artif. Intell. Res. 69: 1287-1332 (2020)
- [c47] Yash Satsangi, Sungsu Lim, Shimon Whiteson, Frans A. Oliehoek, Martha White: Maximizing Information Gain in Partially Observable Environments via Prediction Rewards. AAMAS 2020: 1215-1223
- [c46] Maryam Hashemzadeh, Greta Kaufeld, Martha White, Andrea E. Martin, Alona Fyshe: From Language to Language-ish: How Brain-Like is an LSTM's Representation of Atypical Language Stimuli? EMNLP (Findings) 2020: 645-656
- [c45] Qingfeng Lan, Yangchen Pan, Alona Fyshe, Martha White: Maxmin Q-learning: Controlling the Estimation Bias of Q-learning. ICLR 2020
- [c44] Somjit Nath, Vincent Liu, Alan Chan, Xin Li, Adam White, Martha White: Training Recurrent Neural Networks Online by Learning Explicit State Variables. ICLR 2020
- [c43] Zaheer Abbas, Samuel Sokota, Erin Talvitie, Martha White: Selective Dyna-Style Planning Under Limited Model Capacity. ICML 2020: 1-10
- [c42] Yash Chandak, Georgios Theocharous, Shiv Shankar, Martha White, Sridhar Mahadevan, Philip S. Thomas: Optimizing for the Future in Non-Stationary MDPs. ICML 2020: 1414-1425
- [c41] Sina Ghiassian, Andrew Patterson, Shivam Garg, Dhawal Gupta, Adam White, Martha White: Gradient Temporal-Difference Learning with Regularized Corrections. ICML 2020: 3524-3534
- [c40] Yash Chandak, Scott M. Jordan, Georgios Theocharous, Martha White, Philip S. Thomas: Towards Safe Policy Improvement for Non-Stationary MDPs. NeurIPS 2020
- [c39] Yangchen Pan, Ehsan Imani, Amir-massoud Farahmand, Martha White: An implicit function learning approach for parametric modal regression. NeurIPS 2020
- [i48] Yangchen Pan, Ehsan Imani, Martha White, Amir-massoud Farahmand: An implicit function learning approach for parametric modal regression. CoRR abs/2002.06195 (2020)
- [i47] Qingfeng Lan, Yangchen Pan, Alona Fyshe, Martha White: Maxmin Q-learning: Controlling the Estimation Bias of Q-learning. CoRR abs/2002.06487 (2020)
- [i46] Yash Satsangi, Sungsu Lim, Shimon Whiteson, Frans A. Oliehoek, Martha White: Maximizing Information Gain in Partially Observable Environments via Prediction Reward. CoRR abs/2005.04912 (2020)
- [i45] Yash Chandak, Georgios Theocharous, Shiv Shankar, Martha White, Sridhar Mahadevan, Philip S. Thomas: Optimizing for the Future in Non-Stationary MDPs. CoRR abs/2005.08158 (2020)
- [i44] Taher Jafferjee, Ehsan Imani, Erin Talvitie, Martha White, Michael Bowling: Hallucinating Value: A Pitfall of Dyna-style Planning with Imperfect Environment Models. CoRR abs/2006.04363 (2020)
- [i43] Khurram Javed, Martha White, Yoshua Bengio: Learning Causal Models Online. CoRR abs/2006.07461 (2020)
- [i42] Sina Ghiassian, Andrew Patterson, Shivam Garg, Dhawal Gupta, Adam White, Martha White: Gradient Temporal-Difference Learning with Regularized Corrections. CoRR abs/2007.00611 (2020)
- [i41] Zaheer Abbas, Samuel Sokota, Erin J. Talvitie, Martha White: Selective Dyna-style Planning Under Limited Model Capacity. CoRR abs/2007.02418 (2020)
- [i40] Vincent Liu, Adam White, Hengshuai Yao, Martha White: Towards a practical measure of interference for reinforcement learning. CoRR abs/2007.03807 (2020)
- [i39] Jincheng Mei, Yangchen Pan, Martha White, Amir-massoud Farahmand, Hengshuai Yao: Beyond Prioritized Replay: Sampling States in Model-Based RL via Simulated Priorities. CoRR abs/2007.09569 (2020)
- [i38] Maryam Hashemzadeh, Greta Kaufeld, Martha White, Andrea E. Martin, Alona Fyshe: From Language to Language-ish: How Brain-Like is an LSTM's Representation of Nonsensical Language Stimuli? CoRR abs/2010.07435 (2020)
- [i37] Yash Chandak, Scott M. Jordan, Georgios Theocharous, Martha White, Philip S. Thomas: Towards Safe Policy Improvement for Non-Stationary MDPs. CoRR abs/2010.12645 (2020)
- [i36] Sebastian Höfer, Kostas E. Bekris, Ankur Handa, Juan Camilo Gamboa Higuera, Florian Golemo, Melissa Mozifian, Christopher G. Atkeson, Dieter Fox, Ken Goldberg, John Leonard, C. Karen Liu, Jan Peters, Shuran Song, Peter Welinder, Martha White: Perspectives on Sim2Real Transfer for Robotics: A Summary of the R:SS 2020 Workshop. CoRR abs/2012.03806 (2020)
2010 – 2019
- 2019
- [c38] Andrew Jacobsen, Matthew Schlegel, Cameron Linke, Thomas Degris, Adam White, Martha White: Meta-Descent for Online, Continual Prediction. AAAI 2019: 3943-3950
- [c37] Vincent Liu, Raksha Kumaraswamy, Lei Le, Martha White: The Utility of Sparse Representations for Control in Reinforcement Learning. AAAI 2019: 4384-4391
- [c36] Wesley Chung, Somjit Nath, Ajin Joseph, Martha White: Two-Timescale Networks for Nonlinear Value Function Approximation. ICLR (Poster) 2019
- [c35] Yangchen Pan, Hengshuai Yao, Amir-massoud Farahmand, Martha White: Hill Climbing on Value Estimates for Search-control in Dyna. IJCAI 2019: 3209-3215
- [c34] Yi Wan, Muhammad Zaheer, Adam White, Martha White, Richard S. Sutton: Planning with Expectation Models. IJCAI 2019: 3649-3655
- [c33] Matthew Schlegel, Wesley Chung, Daniel Graves, Jian Qian, Martha White: Importance Resampling for Off-policy Prediction. NeurIPS 2019: 1797-1807
- [c32] Khurram Javed, Martha White: Meta-Learning Representations for Continual Learning. NeurIPS 2019: 1818-1828
- [c31] Farzane Aminmansour, Andrew Patterson, Lei Le, Yisu Peng, Daniel Mitchell, Franco Pestilli, Cesar F. Caiafa, Russell Greiner, Martha White: Learning Macroscopic Brain Connectomes via Group-Sparse Factorization. NeurIPS 2019: 8847-8857
- [i35] Yi Wan, Muhammad Zaheer, Adam White, Martha White, Richard S. Sutton: Planning with Expectation Models. CoRR abs/1904.01191 (2019)
- [i34] Khurram Javed, Martha White: Meta-Learning Representations for Continual Learning. CoRR abs/1905.12588 (2019)
- [i33] Matthew Schlegel, Wesley Chung, Daniel Graves, Jian Qian, Martha White: Importance Resampling for Off-policy Prediction. CoRR abs/1906.04328 (2019)
- [i32] Yangchen Pan, Hengshuai Yao, Amir-massoud Farahmand, Martha White: Hill Climbing on Value Estimates for Search-control in Dyna. CoRR abs/1906.07791 (2019)
- [i31] Cam Linke, Nadia M. Ady, Martha White, Thomas Degris, Adam White: Adapting Behaviour via Intrinsic Reward: A Survey and Empirical Study. CoRR abs/1906.07865 (2019)
- [i30] Andrew Jacobsen, Matthew Schlegel, Cameron Linke, Thomas Degris, Adam White, Martha White: Meta-descent for Online, Continual Prediction. CoRR abs/1907.07751 (2019)
- [i29] Khurram Javed, Hengshuai Yao, Martha White: Is Fast Adaptation All You Need? CoRR abs/1910.01705 (2019)
- 2018
- [c30] Ehsan Imani, Martha White: Improving Regression Performance with Distributional Losses. ICML 2018: 2162-2171
- [c29] Yangchen Pan, Amir-massoud Farahmand, Martha White, Saleh Nabi, Piyush Grover, Daniel Nikovski: Reinforcement Learning with Function-Valued Action Spaces for Partial Differential Equation Control. ICML 2018: 3983-3992
- [c28] Yangchen Pan, Muhammad Zaheer, Adam White, Andrew Patterson, Martha White: Organizing Experience: a Deeper Look at Replay Mechanisms for Sample-Based Planning in Continuous State Domains. IJCAI 2018: 4794-4800
- [c27] Ehsan Imani, Eric Graves, Martha White: An Off-policy Policy Gradient Theorem Using Emphatic Weightings. NeurIPS 2018: 96-106
- [c26] Lei Le, Andrew Patterson, Martha White: Supervised autoencoders: Improving generalization performance with unsupervised regularizers. NeurIPS 2018: 107-117
- [c25] Raksha Kumaraswamy, Matthew Schlegel, Adam White, Martha White: Context-dependent upper-confidence bounds for directed exploration. NeurIPS 2018: 4784-4794
- [c24] Craig Sherstan, Dylan R. Ashley, Brendan Bennett, Kenny Young, Adam White, Martha White, Richard S. Sutton: Comparing Direct and Indirect Temporal-Difference Methods for Estimating the Variance of the Return. UAI 2018: 63-72
- [c23]