Venue search results

Likely matches

IEEE International Symposium on Policies for Distributed Systems and Networks (POLICY)
International Conference on Ethics and Policy of Biometrics and International Data Sharing (ICEB)

Publication search results

found 806 matches

2023
- persistent URL: https://dblp.org/rec/journals/cluster/Xie0HL23
Guiliang Xie, Wei Zhang, Zhi Hu, Gaojian Li:
Upper confident bound advantage function proximal policy optimization. Clust. Comput. 26(3): 2001-2010 (2023)
- persistent URL: https://dblp.org/rec/journals/mta/PadhyeLC23
Vaibhav Padhye, K. Lakshmanan, Amrita Chaturvedi:
Proximal policy optimization based hybrid recommender systems for large scale recommendations. Multim. Tools Appl. 82(13): 20079-20100 (2023)
2020
- persistent URL: https://dblp.org/rec/journals/corr/abs-2010-00284
David Tolpin, Yuan Zhou, Hongseok Yang:
Bayesian Policy Search for Stochastic Domains. CoRR abs/2010.00284 (2020)
- persistent URL: https://dblp.org/rec/journals/corr/abs-2010-00304
Prakash Mallick, Zhiyong Chen, Mohsen Zamani:
Reinforcement Learning Using Expectation Maximization Based Guided Policy Search for Stochastic Dynamics. CoRR abs/2010.00304 (2020)
- persistent URL: https://dblp.org/rec/journals/corr/abs-2010-01311
Lucas N. Egidio, Anders Hansson, Bo Wahlberg:
Learning the Step-size Policy for the Limited-Memory Broyden-Fletcher-Goldfarb-Shanno Algorithm. CoRR abs/2010.01311 (2020)
- persistent URL: https://dblp.org/rec/journals/corr/abs-2010-01404
Masahiro Kato, Kei Nakagawa:
Policy Gradient with Expected Quadratic Utility Maximization: A New Mean-Variance Approach in Reinforcement Learning. CoRR abs/2010.01404 (2020)
- persistent URL: https://dblp.org/rec/journals/corr/abs-2010-01711
Shiva Navabi, Osonde A. Osoba:
A Generative Machine Learning Approach to Policy Optimization in Pursuit-Evasion Games. CoRR abs/2010.01711 (2020)
- persistent URL: https://dblp.org/rec/journals/corr/abs-2010-01748
Jingkang Wang, Hongyi Guo, Zhaowei Zhu, Yang Liu:
Policy Learning Using Weak Supervision. CoRR abs/2010.01748 (2020)
- persistent URL: https://dblp.org/rec/journals/corr/abs-2010-02557
Wasi Uddin Ahmad, Jianfeng Chi, Yuan Tian, Kai-Wei Chang:
PolicyQA: A Reading Comprehension Dataset for Privacy Policies. CoRR abs/2010.02557 (2020)
- persistent URL: https://dblp.org/rec/journals/corr/abs-2010-02798
Dian Wang, Colin Kohler, Robert Platt Jr.:
Policy learning in SE(3) action spaces. CoRR abs/2010.02798 (2020)
- persistent URL: https://dblp.org/rec/journals/corr/abs-2010-03021
Virginia Negri, Dario Scuratti, Stefano Agresti, Donya Rooein, Amudha Ravi Shankar, Jose Luis Fernandez-Marquez, Mark James Carman, Barbara Pernici:
Image-based Social Sensing: Combining AI and the Crowd to Mine Policy-Adherence Indicators from Twitter. CoRR abs/2010.03021 (2020)
- persistent URL: https://dblp.org/rec/journals/corr/abs-2010-03152
Tsung-Yen Yang, Justinian Rosca, Karthik Narasimhan, Peter J. Ramadge:
Projection-Based Constrained Policy Optimization. CoRR abs/2010.03152 (2020)
- persistent URL: https://dblp.org/rec/journals/corr/abs-2010-03290
Taisuke Kobayashi:
Proximal Policy Optimization with Relative Pearson Divergence. CoRR abs/2010.03290 (2020)
- persistent URL: https://dblp.org/rec/journals/corr/abs-2010-03476
Bernhard Kratzwald, Stefan Feuerriegel, Huan Sun:
Learning a Cost-Effective Annotation Policy for Question Answering. CoRR abs/2010.03476 (2020)
- persistent URL: https://dblp.org/rec/journals/corr/abs-2010-03792
Masahiro Kato:
Theoretical and Experimental Comparison of Off-Policy Evaluation from Dependent Samples. CoRR abs/2010.03792 (2020)
- persistent URL: https://dblp.org/rec/journals/corr/abs-2010-04440
Yannis Flet-Berliac, Reda Ouhamma, Odalric-Ambrym Maillard, Philippe Preux:
Is Standard Deviation the New Standard? Revisiting the Critic in Deep Policy Gradients. CoRR abs/2010.04440 (2020)
- persistent URL: https://dblp.org/rec/journals/corr/abs-2010-04508
Teresa Gomez-Diaz, Tomás Recio:
A policy and legal Open Science framework: a proposal. CoRR abs/2010.04508 (2020)
- persistent URL: https://dblp.org/rec/journals/corr/abs-2010-04706
Katherine A. Keith, Christoph Teichmann, Brendan O'Connor, Edgar Meij:
Uncertainty over Uncertainty: Investigating the Assumptions, Annotations, and Text Measurements of Economic Policy Uncertainty. CoRR abs/2010.04706 (2020)
- persistent URL: https://dblp.org/rec/journals/corr/abs-2010-04805
Sijia Li, Xiudi Li, Alex Luedtke:
Discussion of Kallus (2020) and Mo, Qi, and Liu (2020): New Objectives for Policy Learning. CoRR abs/2010.04805 (2020)
- persistent URL: https://dblp.org/rec/journals/corr/abs-2010-04816
Michael Zhang:
Characterizing Policy Divergence for Personalized Meta-Reinforcement Learning. CoRR abs/2010.04816 (2020)
- persistent URL: https://dblp.org/rec/journals/corr/abs-2010-04855
Rahul Singh, Liyuan Xu, Arthur Gretton:
Kernel Methods for Policy Evaluation: Treatment Effects, Mediation Analysis, and Off-Policy Planning. CoRR abs/2010.04855 (2020)
- persistent URL: https://dblp.org/rec/journals/corr/abs-2010-04870
Reazul Hasan Russel, Mouhacine Benosman, Jeroen van Baar:
Robust Constrained-MDPs: Soft-Constrained Robust Policy Optimization under Model Uncertainty. CoRR abs/2010.04870 (2020)
- persistent URL: https://dblp.org/rec/journals/corr/abs-2010-05545
Jost Tobias Springenberg, Nicolas Heess, Daniel J. Mankowitz, Josh Merel, Arunkumar Byravan, Abbas Abdolmaleki, Jackie Kay, Jonas Degrave, Julian Schrittwieser, Yuval Tassa, Jonas Buchli, Dan Belov, Martin A. Riedmiller:
Local Search for Policy Iteration in Continuous Control. CoRR abs/2010.05545 (2020)
- persistent URL: https://dblp.org/rec/journals/corr/abs-2010-06491
Brian Ichter, Pierre Sermanet, Corey Lynch:
Broadly-Exploring, Local-Policy Trees for Long-Horizon Task Planning. CoRR abs/2010.06491 (2020)
- persistent URL: https://dblp.org/rec/journals/corr/abs-2010-06718
Xiangyu Zhang, Rohit Chintala, Andrey Bernstein, Peter A. Graf, Xin Jin:
Grid-Interactive Multi-Zone Building Control Using Reinforcement Learning with Global-Local Policy Search. CoRR abs/2010.06718 (2020)
- persistent URL: https://dblp.org/rec/journals/corr/abs-2010-07022
Alexis Morris, Hallie Siegel, Jonathan Kelly:
Towards a Policy-as-a-Service Framework to Enable Compliant, Trustworthy AI and HRI Systems in the Wild. CoRR abs/2010.07022 (2020)
- persistent URL: https://dblp.org/rec/journals/corr/abs-2010-07870
Gavin S. Hartnett, Raffaele Vardavas, Lawrence Baker, Michael Chaykowsky, C. Ben Gibson, Federico Girosi, David P. Kennedy, Osonde A. Osoba:
Deep Generative Modeling in Network Science with Applications to Public Policy Research. CoRR abs/2010.07870 (2020)
- persistent URL: https://dblp.org/rec/journals/corr/abs-2010-07916
Hepeng Li, Haibo He:
Multi-Agent Trust Region Policy Optimization. CoRR abs/2010.07916 (2020)
- persistent URL: https://dblp.org/rec/journals/corr/abs-2010-08443
Santiago Paternain, Juan Andrés Bazerque, Alejandro Ribeiro:
Policy Gradient for Continuing Tasks in Non-stationary Markov Decision Processes. CoRR abs/2010.08443 (2020)
- persistent URL: https://dblp.org/rec/journals/corr/abs-2010-08478
Jayaraman J. Thiagarajan, Peer-Timo Bremer, Rushil Anirudh, Timothy C. Germann, Sara Y. Del Valle, Frederick H. Streitz:
Machine Learning-Powered Mitigation Policy Optimization in Epidemiological Models. CoRR abs/2010.08478 (2020)

skipping 776 more matches
