default search action
Search dblp
Full-text search
- > Home
Please enter a search query
- case-insensitive prefix search: default
e.g., sig matches "SIGIR" as well as "signal" - exact word search: append dollar sign ($) to word
e.g., graph$ matches "graph", but not "graphics" - boolean and: separate words by space
e.g., codd model - boolean or: connect words by pipe symbol (|)
e.g., graph|network
Update May 7, 2017: Please note that we had to disable the phrase search operator (.) and the boolean not operator (-) due to technical problems. For the time being, phrase search queries will yield regular prefix search result, and search terms preceded by a minus will be interpreted as regular (positive) search terms.
Author search results
no matches
Venue search results
no matches
Refine list
refine by author
- no options
- temporarily not available
refine by venue
- no options
- temporarily not available
refine by type
- no options
- temporarily not available
refine by access
- no options
- temporarily not available
refine by year
- no options
- temporarily not available
Publication search results
found 275 matches
- 2024
- Yujia Sun, Jan Platos:
Abstractive text summarization model combining a hierarchical attention mechanism and multiobjective reinforcement learning. Expert Syst. Appl. 248: 123356 (2024) - Figen Beken Fikri, Kemal Oflazer, Berrin Yanikoglu:
Abstractive summarization with deep reinforcement learning using semantic similarity rewards. Nat. Lang. Eng. 30(3): 554-576 (2024) - Tinghuai Ma, Kexing Peng, Huan Rong, Yurong Qian, Najla Al-Nabhan:
Hierarchical Coordination Multi-Agent Reinforcement Learning With Spatio-Temporal Abstraction. IEEE Trans. Emerg. Top. Comput. Intell. 8(1): 533-547 (2024) - Guy Azran, Mohamad H. Danesh, Stefano V. Albrecht, Sarah Keren:
Contextual Pre-planning on Reward Machine Abstractions for Enhanced Transfer in Deep Reinforcement Learning. AAAI 2024: 10953-10961 - Josiah P. Hanna:
Scaling Offline Evaluation of Reinforcement Learning Agents through Abstraction. AAAI 2024: 22667 - Kai-Chieh Hsu, Allen Z. Ren, Duy Phuong Nguyen, Anirudha Majumdar, Jaime F. Fisac:
Sim-to-Lab-to-Real: Safe Reinforcement Learning with Shielding and Generalization Guarantees (Abstract Reprint). AAAI 2024: 22699 - Piyush Jha, Joseph Scott, Jaya Sriram Ganeshna, Mudit Singh, Vijay Ganesh:
BertRLFuzzer: A BERT and Reinforcement Learning Based Fuzzer (Student Abstract). AAAI 2024: 23521-23522 - Vincent Liu, James R. Wright, Martha White:
Exploiting Action Impact Regularity and Exogenous State Variables for Offline Reinforcement Learning (Abstract Reprint). AAAI 2024: 22706 - Rashmeet Kaur Nayyar:
Learning Generalizable and Composable Abstractions for Transfer in Reinforcement Learning. AAAI 2024: 23403-23404 - João G. Ribeiro, Gonçalo Rodrigues, Alberto Sardinha, Francisco S. Melo:
TEAMSTER: Model-Based Reinforcement Learning for Ad Hoc Teamwork (Abstract Reprint). AAAI 2024: 22708 - Brandon Rozek, Junkyu Lee, Harsha Kokel, Michael Katz, Shirin Sohrabi:
Partially Observable Hierarchical Reinforcement Learning with AI Planning (Student Abstract). AAAI 2024: 23635-23636 - Sehyun Ryu, Hosung Joo, Jonggyu Jang, Hyun Jong Yang:
Instance-Wise Laplace Mechanism via Deep Reinforcement Learning (Student Abstract). AAAI 2024: 23640-23641 - Richard S. Sutton, Marlos C. Machado, G. Zacharias Holland, David Szepesvari, Finbarr Timbers, Brian Tanner, Adam White:
Reward-Respecting Subtasks for Model-Based Reinforcement Learning (Abstract Reprint). AAAI 2024: 22713 - Kuang-Da Wang, Wei-Yao Wang, Yu-Tse Chen, Yu-Heng Lin, Wen-Chih Peng:
The CoachAI Badminton Environment: A Novel Reinforcement Learning Environment with Realistic Opponents (Student Abstract). AAAI 2024: 23679-23681 - Zizhao Wang, Caroline Wang, Xuesu Xiao, Yuke Zhu, Peter Stone:
Building Minimal and Reusable Causal State Abstractions for Reinforcement Learning. AAAI 2024: 15778-15786 - Anjie Zhu, Peng-Fei Zhang, Ruihong Qiu, Zetao Zheng, Zi Huang, Jie Shao:
Abstract and Explore: A Novel Behavioral Metric with Cyclic Dynamics in Reinforcement Learning. AAAI 2024: 17150-17158 - Jussi P. P. Jokinen, Antti Oulasvirta, Andrew Howes:
Cognitive Modeling: From GOMS to Deep Reinforcement Learning. CHI Extended Abstracts 2024: 594:1-594:2 - Feiyu Lu, Mengyu Chen, Hsiang Hsu, Pranav Deshpande, Cheng Yao Wang, Blair MacIntyre:
Adaptive 3D UI Placement in Mixed Reality Using Deep Reinforcement Learning. CHI Extended Abstracts 2024: 32:1-32:7 - Tushar Dilip Kurne, Manas Sashank Juvvi, Vaishnavi J, Pushpak Jagtap:
Poster Abstract: Signal Temporal Logic Compliant Motion Planning using Reinforcement Learning. ICCPS 2024: 283-284 - Harry Zhao, Safa Alver, Harm van Seijen, Romain Laroche, Doina Precup, Yoshua Bengio:
Consciousness-Inspired Spatio-Temporal Abstractions for Better Generalization in Reinforcement Learning. ICLR 2024 - Boning Li, Zhixuan Fang, Longbo Huang:
RL-CFR: Improving Action Abstraction for Imperfect Information Extensive-Form Games with Reinforcement Learning. ICML 2024 - Zhening Li, Gabriel Poesia, Armando Solar-Lezama:
When Do Skills Help Reinforcement Learning? A Theoretical Analysis of Temporal Abstractions. ICML 2024 - Chen-Chun Hsia, Yanggang Xu, Jiyuan Ren, Xinlei Chen:
Demo Abstract: CARL: Collaborative Altitude-Adaptive Reinforcement Learning for Active Search with UAV Swarms. IPSN 2024: 249-250 - Yanggang Xu, Zhuozhu Jian, Jirong Zha, Xinlei Chen:
Poster Abstract: Emergency Networking Using UAVs: A Reinforcement Learning Approach with Large Language Model. IPSN 2024: 281-282 - Chenxi Yang, Greg Anderson, Swarat Chaudhuri:
Certifiably Robust Reinforcement Learning through Model-Based Abstract Interpretation. SaTML 2024: 233-251 - Xin Liu, Ximing Wang, Yuhua Xu, Zhiyong Du, Yifan Xu, Hao Han:
Lightweight Reinforcement Learning with State Abstraction for Dynamic Spectrum Anti-Jamming Communications. WCNC 2024: 1-6 - Zizhao Wang, Caroline Wang, Xuesu Xiao, Yuke Zhu, Peter Stone:
Building Minimal and Reusable Causal State Abstractions for Reinforcement Learning. CoRR abs/2401.12497 (2024) - Carlos A. Velazquez-Vargas, Isaac Ray Christian, Jordan A. Taylor, Sreejan Kumar:
Learning to Abstract Visuomotor Mappings using Meta-Reinforcement Learning. CoRR abs/2402.03072 (2024) - Yoshiki Takagi, Roderick S. Tabalba, Nurit Kirshenbaum, Jason Leigh:
Abstracted Trajectory Visualization for Explainability in Reinforcement Learning. CoRR abs/2402.07928 (2024) - Boning Li, Zhixuan Fang, Longbo Huang:
RL-CFR: Improving Action Abstraction for Imperfect Information Extensive-Form Games with Reinforcement Learning. CoRR abs/2403.04344 (2024)
skipping 245 more matches
loading more results
failed to load more results, please try again later
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
retrieved on 2024-09-22 21:58 CEST from data curated by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint