Остановите войну!
for scientists:
default search action
Search dblp
Full-text search
- > Home
Please enter a search query
- case-insensitive prefix search: default
e.g., sig matches "SIGIR" as well as "signal" - exact word search: append dollar sign ($) to word
e.g., graph$ matches "graph", but not "graphics" - boolean and: separate words by space
e.g., codd model - boolean or: connect words by pipe symbol (|)
e.g., graph|network
Update May 7, 2017: Please note that we had to disable the phrase search operator (.) and the boolean not operator (-) due to technical problems. For the time being, phrase search queries will yield regular prefix search result, and search terms preceded by a minus will be interpreted as regular (positive) search terms.
Author search results
no matches
Refine list
refine by author
- no options
- temporarily not available
refine by venue
- no options
- temporarily not available
refine by type
- no options
- temporarily not available
refine by access
- no options
- temporarily not available
refine by year
- no options
- temporarily not available
Publication search results
found 806 matches
- 2023
- Guiliang Xie, Wei Zhang, Zhi Hu, Gaojian Li:
Upper confident bound advantage function proximal policy optimization. Clust. Comput. 26(3): 2001-2010 (2023) - Vaibhav Padhye, K. Lakshmanan, Amrita Chaturvedi:
Proximal policy optimization based hybrid recommender systems for large scale recommendations. Multim. Tools Appl. 82(13): 20079-20100 (2023) - 2020
- David Tolpin, Yuan Zhou, Hongseok Yang:
Bayesian Policy Search for Stochastic Domains. CoRR abs/2010.00284 (2020) - Prakash Mallick, Zhiyong Chen, Mohsen Zamani:
Reinforcement Learning Using Expectation Maximization Based Guided Policy Search for Stochastic Dynamics. CoRR abs/2010.00304 (2020) - Lucas N. Egidio, Anders Hansson, Bo Wahlberg:
Learning the Step-size Policy for the Limited-Memory Broyden-Fletcher-Goldfarb-Shanno Algorithm. CoRR abs/2010.01311 (2020) - Masahiro Kato, Kei Nakagawa:
Policy Gradient with Expected Quadratic Utility Maximization: A New Mean-Variance Approach in Reinforcement Learning. CoRR abs/2010.01404 (2020) - Shiva Navabi, Osonde A. Osoba:
A Generative Machine Learning Approach to Policy Optimization in Pursuit-Evasion Games. CoRR abs/2010.01711 (2020) - Jingkang Wang, Hongyi Guo, Zhaowei Zhu, Yang Liu:
Policy Learning Using Weak Supervision. CoRR abs/2010.01748 (2020) - Wasi Uddin Ahmad, Jianfeng Chi, Yuan Tian, Kai-Wei Chang:
PolicyQA: A Reading Comprehension Dataset for Privacy Policies. CoRR abs/2010.02557 (2020) - Dian Wang, Colin Kohler, Robert Platt Jr.:
Policy learning in SE(3) action spaces. CoRR abs/2010.02798 (2020) - Virginia Negri, Dario Scuratti, Stefano Agresti, Donya Rooein, Amudha Ravi Shankar, Jose Luis Fernandez-Marquez, Mark James Carman, Barbara Pernici:
Image-based Social Sensing: Combining AI and the Crowd to Mine Policy-Adherence Indicators from Twitter. CoRR abs/2010.03021 (2020) - Tsung-Yen Yang, Justinian Rosca, Karthik Narasimhan, Peter J. Ramadge:
Projection-Based Constrained Policy Optimization. CoRR abs/2010.03152 (2020) - Taisuke Kobayashi:
Proximal Policy Optimization with Relative Pearson Divergence. CoRR abs/2010.03290 (2020) - Bernhard Kratzwald, Stefan Feuerriegel, Huan Sun:
Learning a Cost-Effective Annotation Policy for Question Answering. CoRR abs/2010.03476 (2020) - Masahiro Kato:
Theoretical and Experimental Comparison of Off-Policy Evaluation from Dependent Samples. CoRR abs/2010.03792 (2020) - Yannis Flet-Berliac, Reda Ouhamma, Odalric-Ambrym Maillard, Philippe Preux:
Is Standard Deviation the New Standard? Revisiting the Critic in Deep Policy Gradients. CoRR abs/2010.04440 (2020) - Teresa Gomez-Diaz, Tomás Recio:
A policy and legal Open Science framework: a proposal. CoRR abs/2010.04508 (2020) - Katherine A. Keith, Christoph Teichmann, Brendan O'Connor, Edgar Meij:
Uncertainty over Uncertainty: Investigating the Assumptions, Annotations, and Text Measurements of Economic Policy Uncertainty. CoRR abs/2010.04706 (2020) - Sijia Li, Xiudi Li, Alex Luedtke:
Discussion of Kallus (2020) and Mo, Qi, and Liu (2020): New Objectives for Policy Learning. CoRR abs/2010.04805 (2020) - Michael Zhang:
Characterizing Policy Divergence for Personalized Meta-Reinforcement Learning. CoRR abs/2010.04816 (2020) - Rahul Singh, Liyuan Xu, Arthur Gretton:
Kernel Methods for Policy Evaluation: Treatment Effects, Mediation Analysis, and Off-Policy Planning. CoRR abs/2010.04855 (2020) - Reazul Hasan Russel, Mouhacine Benosman, Jeroen van Baar:
Robust Constrained-MDPs: Soft-Constrained Robust Policy Optimization under Model Uncertainty. CoRR abs/2010.04870 (2020) - Jost Tobias Springenberg, Nicolas Heess, Daniel J. Mankowitz, Josh Merel, Arunkumar Byravan, Abbas Abdolmaleki, Jackie Kay, Jonas Degrave, Julian Schrittwieser, Yuval Tassa, Jonas Buchli, Dan Belov, Martin A. Riedmiller:
Local Search for Policy Iteration in Continuous Control. CoRR abs/2010.05545 (2020) - Brian Ichter, Pierre Sermanet, Corey Lynch:
Broadly-Exploring, Local-Policy Trees for Long-Horizon Task Planning. CoRR abs/2010.06491 (2020) - Xiangyu Zhang, Rohit Chintala, Andrey Bernstein, Peter A. Graf, Xin Jin:
Grid-Interactive Multi-Zone Building Control Using Reinforcement Learning with Global-Local Policy Search. CoRR abs/2010.06718 (2020) - Alexis Morris, Hallie Siegel, Jonathan Kelly:
Towards a Policy-as-a-Service Framework to Enable Compliant, Trustworthy AI and HRI Systems in the Wild. CoRR abs/2010.07022 (2020) - Gavin S. Hartnett, Raffaele Vardavas, Lawrence Baker, Michael Chaykowsky, C. Ben Gibson, Federico Girosi, David P. Kennedy, Osonde A. Osoba:
Deep Generative Modeling in Network Science with Applications to Public Policy Research. CoRR abs/2010.07870 (2020) - Hepeng Li, Haibo He:
Multi-Agent Trust Region Policy Optimization. CoRR abs/2010.07916 (2020) - Santiago Paternain, Juan Andrés Bazerque, Alejandro Ribeiro:
Policy Gradient for Continuing Tasks in Non-stationary Markov Decision Processes. CoRR abs/2010.08443 (2020) - Jayaraman J. Thiagarajan, Peer-Timo Bremer, Rushil Anirudh, Timothy C. Germann, Sara Y. Del Valle, Frederick H. Streitz:
Machine Learning-Powered Mitigation Policy Optimization in Epidemiological Models. CoRR abs/2010.08478 (2020)
skipping 776 more matches
loading more results
failed to load more results, please try again later
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
retrieved on 2024-04-25 15:51 CEST from data curated by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint