Adam White
2010 – today
- 2018
  - [i5] Craig Sherstan, Brendan Bennett, Kenny Young, Dylan R. Ashley, Adam White, Martha White, Richard S. Sutton:
    Directly Estimating the Variance of the λ-Return Using Temporal-Difference Methods. CoRR abs/1801.08287 (2018)
- 2017
  - [i4] Adam White, Richard S. Sutton:
    GQ(λ) Quick Reference and Implementation Guide. CoRR abs/1705.03967 (2017)
- 2016
  - [c5] Craig Sherstan, Adam White, Marlos C. Machado, Patrick M. Pilarski:
    Introspective Agents: Confidence Measures for General Value Functions. AGI 2016: 258-261
  - [i3] Craig Sherstan, Adam White, Marlos C. Machado, Patrick M. Pilarski:
    Introspective Agents: Confidence Measures for General Value Functions. CoRR abs/1606.05593 (2016)
- 2014
  - [j2] Joseph Modayil, Adam White, Richard S. Sutton:
    Multi-timescale nexting in a reinforcement learning robot. Adaptive Behaviour 22(2): 146-160 (2014)
- 2013
  - [j1] Steven J. Johnston, Neil S. O'Brien, Hugh G. Lewis, Elizabeth E. Hart, Adam White, Simon J. Cox:
    Clouds in Space: Scientific Computing using Windows Azure. J. Cloud Computing 2: 2 (2013)
- 2012
  - [c4] Adam White, Joseph Modayil, Richard S. Sutton:
    Scaling life-long off-policy learning. ICDL-EPIROB 2012: 1-6
  - [c3] Joseph Modayil, Adam White, Richard S. Sutton:
    Multi-timescale Nexting in a Reinforcement Learning Robot. SAB 2012: 299-309
  - [c2] Joseph Modayil, Adam White, Patrick M. Pilarski, Richard S. Sutton:
    Acquiring a broad range of empirical knowledge in real time by temporal-difference learning. SMC 2012: 1903-1910
  - [i2] Adam White, Joseph Modayil, Richard S. Sutton:
    Scaling Life-long Off-policy Learning. CoRR abs/1206.6262 (2012)
- 2011
  - [c1] Richard S. Sutton, Joseph Modayil, Michael Delp, Thomas Degris, Patrick M. Pilarski, Adam White, Doina Precup:
    Horde: a scalable real-time architecture for learning knowledge from unsupervised sensorimotor interaction. AAMAS 2011: 761-768
  - [i1] Joseph Modayil, Adam White, Richard S. Sutton:
    Multi-timescale Nexting in a Reinforcement Learning Robot. CoRR abs/1112.1133 (2011)
data released under the ODC-BY 1.0 license
last updated on 2018-02-03 21:06 CET by the dblp team