Author search results

Exact matches

Nadav Merlis

Publication search results

found 30 matches

2024
- Dorian Baudry, Nadav Merlis, Mathieu Benjamin Molina, Hugo Richard, Vianney Perchet:
  Multi-armed bandits with guaranteed revenue per arm. AISTATS 2024: 379-387
  https://dblp.org/rec/conf/aistats/BaudryMMRP24
- Nadav Merlis, Dorian Baudry, Vianney Perchet:
  The Value of Reward Lookahead in Reinforcement Learning. CoRR abs/2403.11637 (2024)
  https://dblp.org/rec/journals/corr/abs-2403-11637
- Itai Shufaro, Nadav Merlis, Nir Weinberger, Shie Mannor:
  On Bits and Bandits: Quantifying the Regret-Information Trade-off. CoRR abs/2405.16581 (2024)
  https://dblp.org/rec/journals/corr/abs-2405-16581
- Nadav Merlis:
  Reinforcement Learning with Lookahead Information. CoRR abs/2406.02258 (2024)
  https://dblp.org/rec/journals/corr/abs-2406-02258
- Matilde Tullii, Solenne Gaucher, Nadav Merlis, Vianney Perchet:
  Improved Algorithms for Contextual Dynamic Pricing. CoRR abs/2406.11316 (2024)
  https://dblp.org/rec/journals/corr/abs-2406-11316
2023
- Pranav Khanna, Guy Tennenholtz, Nadav Merlis, Shie Mannor, Chen Tessler:
  Never Worse, Mostly Better: Stable Policy Improvement in Deep Reinforcement Learning. AAMAS 2023: 2430-2432
  https://dblp.org/rec/conf/atal/KhannaTMMT23
- Nadav Merlis, Hugo Richard, Flore Sentenac, Corentin Odic, Mathieu Molina, Vianney Perchet:
  On Preemption and Learning in Stochastic Scheduling. ICML 2023: 24478-24516
  https://dblp.org/rec/conf/icml/MerlisRSOMP23
- Guy Tennenholtz, Nadav Merlis, Lior Shani, Martin Mladenov, Craig Boutilier:
  Reinforcement Learning with History Dependent Dynamic Contexts. ICML 2023: 34011-34053
  https://dblp.org/rec/conf/icml/TennenholtzMSMB23
- Guy Tennenholtz, Nadav Merlis, Lior Shani, Martin Mladenov, Craig Boutilier:
  Reinforcement Learning with History-Dependent Dynamic Contexts. CoRR abs/2302.02061 (2023)
  https://dblp.org/rec/journals/corr/abs-2302-02061
- Guy Tennenholtz, Martin Mladenov, Nadav Merlis, Craig Boutilier:
  Ranking with Popularity Bias: User Welfare under Self-Amplification Dynamics. CoRR abs/2305.18333 (2023)
  https://dblp.org/rec/journals/corr/abs-2305-18333
2022
- Guy Tennenholtz, Nadav Merlis, Lior Shani, Shie Mannor, Uri Shalit, Gal Chechik, Assaf Hallak, Gal Dalal:
  Reinforcement Learning with a Terminator. NeurIPS 2022
  https://dblp.org/rec/conf/nips/TennenholtzMSMS22
- Guy Tennenholtz, Nadav Merlis, Lior Shani, Shie Mannor, Uri Shalit, Gal Chechik, Assaf Hallak, Gal Dalal:
  Reinforcement Learning with a Terminator. CoRR abs/2205.15376 (2022)
  https://dblp.org/rec/journals/corr/abs-2205-15376
2021
- Yonathan Efroni, Nadav Merlis, Shie Mannor:
  Reinforcement Learning with Trajectory Feedback. AAAI 2021: 7288-7295
  https://dblp.org/rec/conf/aaai/EfroniMM21
- Nadav Merlis, Shie Mannor:
  Lenient Regret for Multi-Armed Bandits. AAAI 2021: 8950-8957
  https://dblp.org/rec/conf/aaai/MerlisM21
- Yonathan Efroni, Nadav Merlis, Aadirupa Saha, Shie Mannor:
  Confidence-Budget Matching for Sequential Budgeted Learning. ICML 2021: 2937-2947
  https://dblp.org/rec/conf/icml/EfroniMSM21
- Oren Peer, Chen Tessler, Nadav Merlis, Ron Meir:
  Ensemble Bootstrapping for Q-Learning. ICML 2021: 8454-8463
  https://dblp.org/rec/conf/icml/PeerTMM21
- Yonathan Efroni, Nadav Merlis, Aadirupa Saha, Shie Mannor:
  Confidence-Budget Matching for Sequential Budgeted Learning. CoRR abs/2102.03400 (2021)
  https://dblp.org/rec/journals/corr/abs-2102-03400
- Oren Peer, Chen Tessler, Nadav Merlis, Ron Meir:
  Ensemble Bootstrapping for Q-Learning. CoRR abs/2103.00445 (2021)
  https://dblp.org/rec/journals/corr/abs-2103-00445
- Nadav Merlis, Yonathan Efroni, Shie Mannor:
  Dare not to Ask: Problem-Dependent Guarantees for Budgeted Bandits. CoRR abs/2110.05724 (2021)
  https://dblp.org/rec/journals/corr/abs-2110-05724
2020
- Nadav Merlis, Shie Mannor:
  Tight Lower Bounds for Combinatorial Multi-Armed Bandits. COLT 2020: 2830-2857
  https://dblp.org/rec/conf/colt/MerlisM20
- Nadav Merlis, Shie Mannor:
  Tight Lower Bounds for Combinatorial Multi-Armed Bandits. CoRR abs/2002.05392 (2020)
  https://dblp.org/rec/journals/corr/abs-2002-05392
- Nadav Merlis, Shie Mannor:
  Lenient Regret for Multi-Armed Bandits. CoRR abs/2008.03959 (2020)
  https://dblp.org/rec/journals/corr/abs-2008-03959
- Yonathan Efroni, Nadav Merlis, Shie Mannor:
  Reinforcement Learning with Trajectory Feedback. CoRR abs/2008.06036 (2020)
  https://dblp.org/rec/journals/corr/abs-2008-06036
2019
- Nadav Merlis, Shie Mannor:
  Batch-Size Independent Regret Bounds for the Combinatorial Multi-Armed Bandit Problem. COLT 2019: 2465-2489
  https://dblp.org/rec/conf/colt/MerlisM19
- Yonathan Efroni, Nadav Merlis, Mohammad Ghavamzadeh, Shie Mannor:
  Tight Regret Bounds for Model-Based Reinforcement Learning with Greedy Policies. NeurIPS 2019: 12203-12213
  https://dblp.org/rec/conf/nips/EfroniMGM19
- Nadav Merlis, Shie Mannor:
  Batch-Size Independent Regret Bounds for the Combinatorial Multi-Armed Bandit Problem. CoRR abs/1905.03125 (2019)
  https://dblp.org/rec/journals/corr/abs-1905-03125
- Yonathan Efroni, Nadav Merlis, Mohammad Ghavamzadeh, Shie Mannor:
  Tight Regret Bounds for Model-Based Reinforcement Learning with Greedy Policies. CoRR abs/1905.11527 (2019)
  https://dblp.org/rec/journals/corr/abs-1905-11527
- Chen Tessler, Nadav Merlis, Shie Mannor:
  Stabilizing Off-Policy Reinforcement Learning with Conservative Policy Gradients. CoRR abs/1910.01062 (2019)
  https://dblp.org/rec/journals/corr/abs-1910-01062
2018
- Tom Zahavy, Matan Haroush, Nadav Merlis, Daniel J. Mankowitz, Shie Mannor:
  Learn What Not to Learn: Action Elimination with Deep Reinforcement Learning. NeurIPS 2018: 3566-3577
  https://dblp.org/rec/conf/nips/ZahavyHMMM18
- Tom Zahavy, Matan Haroush, Nadav Merlis, Daniel J. Mankowitz, Shie Mannor:
  Learn What Not to Learn: Action Elimination with Deep Reinforcement Learning. CoRR abs/1809.02121 (2018)
  https://dblp.org/rec/journals/corr/abs-1809-02121