
Name: dblp XML data dump
Creator: Schloss Dagstuhl - Leibniz Center for Informatics
Published: 1993
License: https://creativecommons.org/publicdomain/zero/1.0/
Keywords: dblp, XML, computer science, scholarly publications, metadata


Publication search results

found 212 matches

2024
persistent URL: https://dblp.org/rec/journals/comsis/AlmeidaAB24
Vicente Nejar de Almeida, Lucas N. Alegre, Ana L. C. Bazzan:
Knowledge transfer in multi-objective multi-agent reinforcement learning via generalized policy improvement. Comput. Sci. Inf. Syst. 21(1): 335-362 (2024)
persistent URL: https://dblp.org/rec/journals/tnn/ChengHCW24
Yuhu Cheng, Longyang Huang, C. L. Philip Chen, Xuesong Wang:
Dual Parallel Policy Iteration With Coupled Policy Improvement. IEEE Trans. Neural Networks Learn. Syst. 35(3): 4286-4298 (2024)
persistent URL: https://dblp.org/rec/conf/aaai/KongWZ24
Rui Kong, Chenyang Wu, Zongzhang Zhang:
Generalizable Policy Improvement via Reinforcement Sampling (Student Abstract). AAAI 2024: 23546-23547
persistent URL: https://dblp.org/rec/conf/atal/Zhu0WFHF0HLF24
Yiwen Zhu, Jinyi Liu, Wenya Wei, Qianyi Fu, Yujing Hu, Zhou Fang, Bo An, Jianye Hao, Tangjie Lv, Changjie Fan:
vMFER: von Mises-Fisher Experience Resampling Based on Uncertainty of Gradient Directions for Policy Improvement of Actor-Critic Algorithms. AAMAS 2024: 2621-2623
persistent URL: https://dblp.org/rec/conf/iclr/LiuYSW024
Xuefeng Liu, Takuma Yoneda, Rick Stevens, Matthew R. Walter, Yuxin Chen:
Blending Imitation and Reinforcement Learning for Robust Policy Improvement. ICLR 2024
persistent URL: https://dblp.org/rec/conf/icml/0002ZCSSF24
Federico Bianchi, Edoardo Zorzi, Alberto Castellini, Thiago D. Simão, Matthijs T. J. Spaan, Alessandro Farinelli:
Scalable Safe Policy Improvement for Factored Multi-Agent MDPs. ICML 2024
persistent URL: https://dblp.org/rec/conf/icml/Li0TFH24
Pengyi Li, Yan Zheng, Hongyao Tang, Xian Fu, Jianye Hao:
EvoRainbow: Combining Improvements in Evolutionary Reinforcement Learning for Policy Search. ICML 2024
persistent URL: https://dblp.org/rec/conf/icra/HenshallK24
Andrea Henshall, Sertac Karaman:
Multi-Level Action Tree Rollout (MLAT-R): Efficient and Accurate Online Multiagent Policy Improvement. ICRA 2024: 315-321
persistent URL: https://dblp.org/rec/journals/corr/abs-2402-07876
Victor Zhong, Dipendra Misra, Xingdi Yuan, Marc-Alexandre Côté:
Policy Improvement using Language Feedback Models. CoRR abs/2402.07876 (2024)
persistent URL: https://dblp.org/rec/journals/corr/abs-2403-14305
Adrian Röfer, Iman Nematollahi, Tim Welschehold, Wolfram Burgard, Abhinav Valada:
Bayesian Optimization for Sample-Efficient Policy Improvement in Robotic Manipulation. CoRR abs/2403.14305 (2024)
persistent URL: https://dblp.org/rec/journals/corr/abs-2403-19883
Frederico Messa, André Grahl Pereira:
Policy-Space Search: Equivalences, Improvements, and Compression. CoRR abs/2403.19883 (2024)
persistent URL: https://dblp.org/rec/journals/corr/abs-2405-08638
Yiwen Zhu, Jinyi Liu, Wenya Wei, Qianyi Fu, Yujing Hu, Zhou Fang, Bo An, Jianye Hao, Tangjie Lv, Changjie Fan:
vMFER: Von Mises-Fisher Experience Resampling Based on Uncertainty of Gradient Directions for Policy Improvement. CoRR abs/2405.08638 (2024)
persistent URL: https://dblp.org/rec/journals/corr/abs-2406-10959
Jin Ma, Gaozhan Wang, Jianfeng Zhang:
On Convergence and Rate of Convergence of Policy Improvement Algorithms. CoRR abs/2406.10959 (2024)
persistent URL: https://dblp.org/rec/journals/corr/abs-2407-06025
Aoyu Pang, Maonan Wang, Man-On Pun, Chung Shue Chen, Xi Xiong:
iLLM-TSC: Integration reinforcement learning and large language model for traffic signal control policy improvement. CoRR abs/2407.06025 (2024)
2023
persistent URL: https://dblp.org/rec/journals/isci/HuWLDK23
Yingbai Hu, Xu Wang, Yueyue Liu, Weiping Ding, Alois Knoll:
PI-ELM: Reinforcement learning-based adaptable policy improvement for dynamical system. Inf. Sci. 650: 119700 (2023)
persistent URL: https://dblp.org/rec/journals/ml/ZhuM23
Lingwei Zhu, Takamitsu Matsubara:
Cautious policy programming: exploiting KL regularization for monotonic policy improvement in reinforcement learning. Mach. Learn. 112(11): 4527-4562 (2023)
persistent URL: https://dblp.org/rec/journals/tcyb/HuCLK23
Yingbai Hu, Guang Chen, Zhijun Li, Alois C. Knoll:
Robot Policy Improvement With Natural Evolution Strategies for Stable Nonlinear Dynamical System. IEEE Trans. Cybern. 53(6): 4002-4014 (2023)
persistent URL: https://dblp.org/rec/conf/aaai/SimaoS023
Thiago D. Simão, Marnix Suilen, Nils Jansen:
Safe Policy Improvement for POMDPs via Finite-State Controllers. AAAI 2023: 15109-15117
persistent URL: https://dblp.org/rec/conf/aipr2/ZhangX23
Xin Zhang, Yihuan Xu:
Improvement of prioritized experience replay mechanism based on deep deterministic policy gradient algorithm. AIPR 2023: 1310-1316
persistent URL: https://dblp.org/rec/conf/atal/AlegreBRN023
Lucas Nunes Alegre, Ana L. C. Bazzan, Diederik M. Roijers, Ann Nowé, Bruno C. da Silva:
Sample-Efficient Multi-Objective Learning via Generalized Policy Improvement Prioritization. AAMAS 2023: 2003-2012
persistent URL: https://dblp.org/rec/conf/atal/KhannaTMMT23
Pranav Khanna, Guy Tennenholtz, Nadav Merlis, Shie Mannor, Chen Tessler:
Never Worse, Mostly Better: Stable Policy Improvement in Deep Reinforcement Learning. AAMAS 2023: 2430-2432
persistent URL: https://dblp.org/rec/conf/atal/LiuZMPGH23
Yanyu Liu, Yifeng Zeng, Biyang Ma, Yinghui Pan, Huifan Gao, Xiaohan Huang:
Improvement and Evaluation of the Policy Legibility in Reinforcement Learning. AAMAS 2023: 3044-3046
persistent URL: https://dblp.org/rec/conf/atal/TrudeauB23
Alexandre Trudeau, Michael Bowling:
Targeted Search Control in AlphaZero for Effective Policy Improvement. AAMAS 2023: 842-850
persistent URL: https://dblp.org/rec/conf/ecai/LiJLZN00O23
Chuming Li, Ruonan Jia, Jie Liu, Yinmin Zhang, Yazhe Niu, Yaodong Yang, Yu Liu, Wanli Ouyang:
Theoretically Guaranteed Policy Improvement Distilled from Model-Based Planning. ECAI 2023: 1381-1388
persistent URL: https://dblp.org/rec/conf/ectel/SchmuckerPSSM23
Robin Schmucker, Nimish Pachapurkar, Bala Shanmugam, Miral Shah, Tom M. Mitchell:
Learning to Give Useful Hints: Assistance Action Evaluation and Policy Improvements. EC-TEL 2023: 383-398
persistent URL: https://dblp.org/rec/conf/iclr/NeumannLJP0W23
Samuel Neumann, Sungsu Lim, Ajin George Joseph, Yangchen Pan, Adam White, Martha White:
Greedy Actor-Critic: A New Conditional Cross-Entropy Method for Policy Improvement. ICLR 2023
persistent URL: https://dblp.org/rec/conf/icml/Castellini0ZSFS23
Alberto Castellini, Federico Bianchi, Edoardo Zorzi, Thiago D. Simão, Alessandro Farinelli, Matthijs T. J. Spaan:
Scalable Safe Policy Improvement via Monte Carlo Tree Search. ICML 2023: 3732-3756
persistent URL: https://dblp.org/rec/conf/icml/LiZYBWW23
Jiachen Li, Edwin Zhang, Ming Yin, Qinxun Bai, Yu-Xiang Wang, William Yang Wang:
Offline Reinforcement Learning with Closed-Form Policy Improvement Operators. ICML 2023: 20485-20528
persistent URL: https://dblp.org/rec/conf/icml/LiuYWW023
Xuefeng Liu, Takuma Yoneda, Chaoqi Wang, Matthew R. Walter, Yuxin Chen:
Active Policy Improvement from Multiple Black-box Oracles. ICML 2023: 22320-22337
persistent URL: https://dblp.org/rec/conf/ijcai/WienhoftSSDB023
Patrick Wienhöft, Marnix Suilen, Thiago D. Simão, Clemens Dubslaff, Christel Baier, Nils Jansen:
More for Less: Safe Policy Improvement with Stronger Performance Guarantees. IJCAI 2023: 4406-4415

skipping 182 more matches
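
The persistent URLs in the listing above appear to follow a single pattern: the prefix https://dblp.org/rec/ followed by the record's dblp key (for example, journals/comsis/AlmeidaAB24). The sketch below illustrates that mapping in Python, assuming the pattern holds for every record. The names `record_url` and `record_key` are hypothetical helpers introduced here for illustration; they are not part of any dblp tooling.

```python
# Prefix observed in every persistent URL of the listing (assumed to be stable).
DBLP_REC_PREFIX = "https://dblp.org/rec/"


def record_url(dblp_key: str) -> str:
    """Build a persistent URL from a dblp key such as 'conf/aaai/KongWZ24'."""
    return DBLP_REC_PREFIX + dblp_key.strip("/")


def record_key(url: str) -> str:
    """Recover the dblp key from a persistent URL; reject anything else."""
    if not url.startswith(DBLP_REC_PREFIX):
        raise ValueError(f"not a dblp record URL: {url!r}")
    return url[len(DBLP_REC_PREFIX):]


if __name__ == "__main__":
    # Keys taken from the first two entries of the listing.
    for key in ("journals/comsis/AlmeidaAB24", "journals/tnn/ChengHCW24"):
        print(record_url(key))
```

The key itself encodes the record type and venue in its first two path segments (`journals/comsis`, `conf/aaai`), so the same split can be used to group results by venue.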
