Name: dblp XML data dump
Creator: Schloss Dagstuhl - Leibniz Center for Informatics
Published: 1993
License: https://creativecommons.org/publicdomain/zero/1.0/
Keywords: dblp, XML, computer science, scholarly publications, metadata

Publication search results

found 85 matches

2024
- Maximilian Schäffeler, Mohammad Abdulaziz:
  Formally Verified Approximate Policy Iteration. CoRR abs/2406.07340 (2024)
  https://dblp.org/rec/journals/corr/abs-2406-07340
2023
- Ziyu Lin, Jun Ma, Jingliang Duan, Shengbo Eben Li, Haitong Ma, Bo Cheng, Tong Heng Lee:
  Policy Iteration Based Approximate Dynamic Programming Toward Autonomous Driving in Constrained Dynamic Environment. IEEE Trans. Intell. Transp. Syst. 24(5): 5003-5013 (2023)
  https://dblp.org/rec/journals/tits/LinMDLMCL23
- Ziyu Lin, Jingliang Duan, Shengbo Eben Li, Haitong Ma, Jie Li, Jianyu Chen, Bo Cheng, Jun Ma:
  Policy-Iteration-Based Finite-Horizon Approximate Dynamic Programming for Continuous-Time Nonlinear Optimal Control. IEEE Trans. Neural Networks Learn. Syst. 34(9): 5255-5267 (2023)
  https://dblp.org/rec/journals/tnn/LinDLMLCCM23
2022
- Mete Kemertas, Allan Douglas Jepson:
  Approximate Policy Iteration with Bisimulation Metrics. Trans. Mach. Learn. Res. 2022 (2022)
  https://dblp.org/rec/journals/tmlr/KemertasJ22
- Gellért Weisz, András György, Tadashi Kozuno, Csaba Szepesvári:
  Confident Approximate Policy Iteration for Efficient Local Planning in $q^\pi$-realizable MDPs. NeurIPS 2022
  https://dblp.org/rec/conf/nips/Weisz0KS22
- Mete Kemertas, Allan D. Jepson:
  Trusted Approximate Policy Iteration with Bisimulation Metrics. CoRR abs/2202.02881 (2022)
  https://dblp.org/rec/journals/corr/abs-2202-02881
- Yuki Akiyama, Minh Vu, Konstantinos Slavakis:
  Online and lightweight kernel-based approximated policy iteration for dynamic p-norm linear adaptive filtering. CoRR abs/2210.11755 (2022)
  https://dblp.org/rec/journals/corr/abs-2210-11755
- Gellért Weisz, András György, Tadashi Kozuno, Csaba Szepesvári:
  Confident Approximate Policy Iteration for Efficient Local Planning in q^π-realizable MDPs. CoRR abs/2210.15755 (2022)
  https://dblp.org/rec/journals/corr/abs-2210-15755
2021
- Alberto Maria Metelli, Matteo Pirotta, Daniele Calandriello, Marcello Restelli:
  Safe Policy Iteration: A Monotonically Improving Approximate Policy Iteration Approach. J. Mach. Learn. Res. 22: 97:1-97:83 (2021)
  https://dblp.org/rec/journals/jmlr/MetelliPCR21
- Samuel Sokota, Edward Lockhart, Finbarr Timbers, Elnaz Davoodi, Ryan D'Orazio, Neil Burch, Martin Schmid, Michael Bowling, Marc Lanctot:
  Solving Common-Payoff Games with Approximate Policy Iteration. AAAI 2021: 9695-9703
  https://dblp.org/rec/conf/aaai/SokotaLTDDBSBL21
- Botao Hao, Nevena Lazic, Yasin Abbasi-Yadkori, Pooria Joulani, Csaba Szepesvári:
  Adaptive Approximate Policy Iteration. AISTATS 2021: 523-531
  https://dblp.org/rec/conf/aistats/HaoLAJS21
- Benjamin Gravell, Iman Shames, Tyler H. Summers:
  Approximate Midpoint Policy Iteration for Linear Quadratic Control. L4DC 2021: 1080-1092
  https://dblp.org/rec/conf/l4dc/GravellSS21
- Samuel Sokota, Edward Lockhart, Finbarr Timbers, Elnaz Davoodi, Ryan D'Orazio, Neil Burch, Martin Schmid, Michael Bowling, Marc Lanctot:
  Solving Common-Payoff Games with Approximate Policy Iteration. CoRR abs/2101.04237 (2021)
  https://dblp.org/rec/journals/corr/abs-2101-04237
- Nevena Lazic, Botao Hao, Yasin Abbasi-Yadkori, Dale Schuurmans, Csaba Szepesvári:
  Optimization Issues in KL-Constrained Approximate Policy Iteration. CoRR abs/2102.06234 (2021)
  https://dblp.org/rec/journals/corr/abs-2102-06234
- Anna Winnicki, Joseph Lubars, Michael Livesay, R. Srikant:
  The Role of Lookahead and Approximate Policy Evaluation in Policy Iteration with Linear Value Function Approximation. CoRR abs/2109.13419 (2021)
  https://dblp.org/rec/journals/corr/abs-2109-13419
2020
- Fan Jiang, Xian Guo, Xuebo Zhang, Zhichao Zhang, Dazhi Dong:
  Approximate Soft Policy Iteration Based Reinforcement Learning for Differential Games with Two Pursuers versus One Evader. ICARM 2020: 471-476
  https://dblp.org/rec/conf/icarm/JiangGZZD20
- Botao Hao, Nevena Lazic, Yasin Abbasi-Yadkori, Pooria Joulani, Csaba Szepesvári:
  Provably Efficient Adaptive Approximate Policy Iteration. CoRR abs/2002.03069 (2020)
  https://dblp.org/rec/journals/corr/abs-2002-03069
- Benjamin Gravell, Iman Shames, Tyler H. Summers:
  Approximate Midpoint Policy Iteration for Linear Quadratic Control. CoRR abs/2011.14212 (2020)
  https://dblp.org/rec/journals/corr/abs-2011-14212
2019
- Stephen McAleer, Forest Agostinelli, Alexander Shmakov, Pierre Baldi:
  Solving the Rubik's Cube with Approximate Policy Iteration. ICLR (Poster) 2019
  https://dblp.org/rec/conf/iclr/McAleerASB19
- Riad Akrour, Joni Pajarinen, Jan Peters, Gerhard Neumann:
  Projections for Approximate Policy Iteration Algorithms. ICML 2019: 181-190
  https://dblp.org/rec/conf/icml/AkrourP0N19
- Yuming Bai, Yifan Liu, Qi-He Shan, Tieshan Li, Yuzhen Lu:
  Data-Based Approximate Policy Iteration for Optimal Course-Keeping Control of Marine Surface Vessels. ISNN (2) 2019: 81-92
  https://dblp.org/rec/conf/isnn/BaiLSLL19
- Karl Krauth, Stephen Tu, Benjamin Recht:
  Finite-time Analysis of Approximate Policy Iteration for the Linear Quadratic Regulator. NeurIPS 2019: 8512-8522
  https://dblp.org/rec/conf/nips/KrauthTR19
- Sumit Kumar Jha, Sayan Basu Roy, Shubhendu Bhasin:
  Filter based Explorized Policy Iteration Algorithm for On-Policy Approximate LQR. SSCI 2019: 133-140
  https://dblp.org/rec/conf/ssci/JhaRB19
- Karl Krauth, Stephen Tu, Benjamin Recht:
  Finite-time Analysis of Approximate Policy Iteration for the Linear Quadratic Regulator. CoRR abs/1905.12842 (2019)
  https://dblp.org/rec/journals/corr/abs-1905-12842
- Elena Smirnova, Elvis Dohmatob:
  On the Convergence of Approximate and Regularized Policy Iteration Schemes. CoRR abs/1909.09621 (2019)
  https://dblp.org/rec/journals/corr/abs-1909-09621
- Trivikram Dokka, Richlove Frimpong:
  Approximate policy iteration using neural networks for storage problems. CoRR abs/1910.01895 (2019)
  https://dblp.org/rec/journals/corr/abs-1910-01895
2018
- Wentao Guo, Jennie Si, Feng Liu, Shengwei Mei:
  Policy Approximation in Policy Iteration Approximate Dynamic Programming for Discrete-Time Nonlinear Systems. IEEE Trans. Neural Networks Learn. Syst. 29(7): 2794-2807 (2018)
  https://dblp.org/rec/journals/tnn/GuoSLM18
- Xiang Gao, Yue Wen, Minhan Li, Jennie Si, He (Helen) Huang:
  Robotic Knee Parameter Tuning Using Approximate Policy Iteration. ICCSIP (1) 2018: 554-563
  https://dblp.org/rec/conf/iccsip/GaoWLSH18
2017
- Sebastian Koch:
  Least squares approximate policy iteration for learning bid prices in choice-based revenue management. Comput. Oper. Res. 77: 240-253 (2017)
  https://dblp.org/rec/journals/cor/Koch17
- Mahshid Salemi Parizi, Yasin Gocgun, Archis Ghate:
  Approximate policy iteration for dynamic resource-constrained project scheduling. Oper. Res. Lett. 45(5): 442-447 (2017)
  https://dblp.org/rec/journals/orl/PariziGG17

skipping 55 more matches
