default search action
Search dblp
Full-text search
- > Home
Please enter a search query
- case-insensitive prefix search: default
e.g., sig matches "SIGIR" as well as "signal" - exact word search: append dollar sign ($) to word
e.g., graph$ matches "graph", but not "graphics" - boolean and: separate words by space
e.g., codd model - boolean or: connect words by pipe symbol (|)
e.g., graph|network
Update May 7, 2017: Please note that we had to disable the phrase search operator (.) and the boolean not operator (-) due to technical problems. For the time being, phrase search queries will yield regular prefix search result, and search terms preceded by a minus will be interpreted as regular (positive) search terms.
Author search results
no matches
Venue search results
no matches
Refine list
refine by author
- no options
- temporarily not available
refine by venue
- no options
- temporarily not available
refine by type
- no options
- temporarily not available
refine by access
- no options
- temporarily not available
refine by year
- no options
- temporarily not available
Publication search results
found 85 matches
- 2024
- Maximilian Schäffeler, Mohammad Abdulaziz:
Formally Verified Approximate Policy Iteration. CoRR abs/2406.07340 (2024) - 2023
- Ziyu Lin, Jun Ma, Jingliang Duan, Shengbo Eben Li, Haitong Ma, Bo Cheng, Tong Heng Lee:
Policy Iteration Based Approximate Dynamic Programming Toward Autonomous Driving in Constrained Dynamic Environment. IEEE Trans. Intell. Transp. Syst. 24(5): 5003-5013 (2023) - Ziyu Lin, Jingliang Duan, Shengbo Eben Li, Haitong Ma, Jie Li, Jianyu Chen, Bo Cheng, Jun Ma:
Policy-Iteration-Based Finite-Horizon Approximate Dynamic Programming for Continuous-Time Nonlinear Optimal Control. IEEE Trans. Neural Networks Learn. Syst. 34(9): 5255-5267 (2023) - 2022
- Mete Kemertas, Allan Douglas Jepson:
Approximate Policy Iteration with Bisimulation Metrics. Trans. Mach. Learn. Res. 2022 (2022) - Gellért Weisz, András György, Tadashi Kozuno, Csaba Szepesvári:
Confident Approximate Policy Iteration for Efficient Local Planning in $q\pi$-realizable MDPs. NeurIPS 2022 - Mete Kemertas, Allan D. Jepson:
Trusted Approximate Policy Iteration with Bisimulation Metrics. CoRR abs/2202.02881 (2022) - Yuki Akiyama, Minh Vu, Konstantinos Slavakis:
online and lightweight kernel-based approximated policy iteration for dynamic p-norm linear adaptive filtering. CoRR abs/2210.11755 (2022) - Gellért Weisz, András György, Tadashi Kozuno, Csaba Szepesvári:
Confident Approximate Policy Iteration for Efficient Local Planning in qπ-realizable MDPs. CoRR abs/2210.15755 (2022) - 2021
- Alberto Maria Metelli, Matteo Pirotta, Daniele Calandriello, Marcello Restelli:
Safe Policy Iteration: A Monotonically Improving Approximate Policy Iteration Approach. J. Mach. Learn. Res. 22: 97:1-97:83 (2021) - Samuel Sokota, Edward Lockhart, Finbarr Timbers, Elnaz Davoodi, Ryan D'Orazio, Neil Burch, Martin Schmid, Michael Bowling, Marc Lanctot:
Solving Common-Payoff Games with Approximate Policy Iteration. AAAI 2021: 9695-9703 - Botao Hao, Nevena Lazic, Yasin Abbasi-Yadkori, Pooria Joulani, Csaba Szepesvári:
Adaptive Approximate Policy Iteration. AISTATS 2021: 523-531 - Benjamin Gravell, Iman Shames, Tyler H. Summers:
Approximate Midpoint Policy Iteration for Linear Quadratic Control. L4DC 2021: 1080-1092 - Samuel Sokota, Edward Lockhart, Finbarr Timbers, Elnaz Davoodi, Ryan D'Orazio, Neil Burch, Martin Schmid, Michael Bowling, Marc Lanctot:
Solving Common-Payoff Games with Approximate Policy Iteration. CoRR abs/2101.04237 (2021) - Nevena Lazic, Botao Hao, Yasin Abbasi-Yadkori, Dale Schuurmans, Csaba Szepesvári:
Optimization Issues in KL-Constrained Approximate Policy Iteration. CoRR abs/2102.06234 (2021) - Anna Winnicki, Joseph Lubars, Michael Livesay, R. Srikant:
The Role of Lookahead and Approximate Policy Evaluation in Policy Iteration with Linear Value Function Approximation. CoRR abs/2109.13419 (2021) - 2020
- Fan Jiang, Xian Guo, Xuebo Zhang, Zhichao Zhang, Dazhi Dong:
Approximate Soft Policy Iteration Based Reinforcement Learning for Differential Games with Two Pursuers versus One Evader. ICARM 2020: 471-476 - Botao Hao, Nevena Lazic, Yasin Abbasi-Yadkori, Pooria Joulani, Csaba Szepesvári:
Provably Efficient Adaptive Approximate Policy Iteration. CoRR abs/2002.03069 (2020) - Benjamin Gravell, Iman Shames, Tyler H. Summers:
Approximate Midpoint Policy Iteration for Linear Quadratic Control. CoRR abs/2011.14212 (2020) - 2019
- Stephen McAleer, Forest Agostinelli, Alexander Shmakov, Pierre Baldi:
Solving the Rubik's Cube with Approximate Policy Iteration. ICLR (Poster) 2019 - Riad Akrour, Joni Pajarinen, Jan Peters, Gerhard Neumann:
Projections for Approximate Policy Iteration Algorithms. ICML 2019: 181-190 - Yuming Bai, Yifan Liu, Qi-He Shan, Tieshan Li, Yuzhen Lu:
Data-Based Approximate Policy Iteration for Optimal Course-Keeping Control of Marine Surface Vessels. ISNN (2) 2019: 81-92 - Karl Krauth, Stephen Tu, Benjamin Recht:
Finite-time Analysis of Approximate Policy Iteration for the Linear Quadratic Regulator. NeurIPS 2019: 8512-8522 - Sumit Kumar Jha, Sayan Basu Roy, Shubhendu Bhasin:
Filter based Explorized Policy Iteration Algorithm for On-Policy Approximate LQR. SSCI 2019: 133-140 - Karl Krauth, Stephen Tu, Benjamin Recht:
Finite-time Analysis of Approximate Policy Iteration for the Linear Quadratic Regulator. CoRR abs/1905.12842 (2019) - Elena Smirnova, Elvis Dohmatob:
On the Convergence of Approximate and Regularized Policy Iteration Schemes. CoRR abs/1909.09621 (2019) - Trivikram Dokka, Richlove Frimpong:
Approximate policy iteration using neural networks for storage problems. CoRR abs/1910.01895 (2019) - 2018
- Wentao Guo, Jennie Si, Feng Liu, Shengwei Mei:
Policy Approximation in Policy Iteration Approximate Dynamic Programming for Discrete-Time Nonlinear Systems. IEEE Trans. Neural Networks Learn. Syst. 29(7): 2794-2807 (2018) - Xiang Gao, Yue Wen, Minhan Li, Jennie Si, He (Helen) Huang:
Robotic Knee Parameter Tuning Using Approximate Policy Iteration. ICCSIP (1) 2018: 554-563 - 2017
- Sebastian Koch:
Least squares approximate policy iteration for learning bid prices in choice-based revenue management. Comput. Oper. Res. 77: 240-253 (2017) - Mahshid Salemi Parizi, Yasin Gocgun, Archis Ghate:
Approximate policy iteration for dynamic resource-constrained project scheduling. Oper. Res. Lett. 45(5): 442-447 (2017)
skipping 55 more matches
loading more results
failed to load more results, please try again later
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
retrieved on 2024-09-21 01:04 CEST from data curated by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint