Nino Vieillard

2020 – today

2024
[i13]
  persistent URL: https://dblp.org/rec/journals/corr/abs-2401-12187
Alexandre Ramé, Nino Vieillard, Léonard Hussenot, Robert Dadashi, Geoffrey Cideron, Olivier Bachem, Johan Ferret:
WARM: On the Benefits of Weight Averaged Reward Models. CoRR abs/2401.12187 (2024)
2023
[c9]
  persistent URL: https://dblp.org/rec/conf/acl/RoitFSACDGGHKMG23
Paul Roit, Johan Ferret, Lior Shani, Roee Aharoni, Geoffrey Cideron, Robert Dadashi, Matthieu Geist, Sertan Girgin, Léonard Hussenot, Orgad Keller, Nikola Momchev, Sabela Ramos Garea, Piotr Stanczyk, Nino Vieillard, Olivier Bachem, Gal Elidan, Avinatan Hassidim, Olivier Pietquin, Idan Szpektor:
Factually Consistent Summarization via Reinforcement Learning with Textual Entailment Feedback. ACL (1) 2023: 6252-6272
[c8]
  persistent URL: https://dblp.org/rec/conf/icml/KitamuraKTVVYMM23
Toshinori Kitamura, Tadashi Kozuno, Yunhao Tang, Nino Vieillard, Michal Valko, Wenhao Yang, Jincheng Mei, Pierre Ménard, Mohammad Gheshlaghi Azar, Rémi Munos, Olivier Pietquin, Matthieu Geist, Csaba Szepesvári, Wataru Kumagai, Yutaka Matsuo:
Regularization and Variance-Weighted Regression Achieves Minimax Optimality in Linear MDPs: Theory and Practice. ICML 2023: 17135-17175
[i12]
  persistent URL: https://dblp.org/rec/journals/corr/abs-2305-13185
Toshinori Kitamura, Tadashi Kozuno, Yunhao Tang, Nino Vieillard, Michal Valko, Wenhao Yang, Jincheng Mei, Pierre Ménard, Mohammad Gheshlaghi Azar, Rémi Munos, Olivier Pietquin, Matthieu Geist, Csaba Szepesvári, Wataru Kumagai, Yutaka Matsuo:
Regularization and Variance-Weighted Regression Achieves Minimax Optimality in Linear MDPs: Theory and Practice. CoRR abs/2305.13185 (2023)
[i11]
  persistent URL: https://dblp.org/rec/journals/corr/abs-2306-00186
Paul Roit, Johan Ferret, Lior Shani, Roee Aharoni, Geoffrey Cideron, Robert Dadashi, Matthieu Geist, Sertan Girgin, Léonard Hussenot, Orgad Keller, Nikola Momchev, Sabela Ramos, Piotr Stanczyk, Nino Vieillard, Olivier Bachem, Gal Elidan, Avinatan Hassidim, Olivier Pietquin, Idan Szpektor:
Factually Consistent Summarization via Reinforcement Learning with Textual Entailment Feedback. CoRR abs/2306.00186 (2023)
[i10]
  persistent URL: https://dblp.org/rec/journals/corr/abs-2306-13649
Rishabh Agarwal, Nino Vieillard, Piotr Stanczyk, Sabela Ramos, Matthieu Geist, Olivier Bachem:
GKD: Generalized Knowledge Distillation for Auto-regressive Sequence Models. CoRR abs/2306.13649 (2023)
2022
[c7]
  persistent URL: https://dblp.org/rec/conf/aaai/RezaeifarDVHBPG22
Shideh Rezaeifar, Robert Dadashi, Nino Vieillard, Léonard Hussenot, Olivier Bachem, Olivier Pietquin, Matthieu Geist:
Offline Reinforcement Learning as Anti-exploration. AAAI 2022: 8106-8114
[c6]
  persistent URL: https://dblp.org/rec/conf/aistats/VieillardARPG22
Nino Vieillard, Marcin Andrychowicz, Anton Raichuk, Olivier Pietquin, Matthieu Geist:
Implicitly Regularized RL with Implicit Q-values. AISTATS 2022: 1380-1402
[i9]
  persistent URL: https://dblp.org/rec/journals/corr/abs-2205-14211
Tadashi Kozuno, Wenhao Yang, Nino Vieillard, Toshinori Kitamura, Yunhao Tang, Jincheng Mei, Pierre Ménard, Mohammad Gheshlaghi Azar, Michal Valko, Rémi Munos, Olivier Pietquin, Matthieu Geist, Csaba Szepesvári:
KL-Entropy-Regularized RL with a Generative Model is Minimax Optimal. CoRR abs/2205.14211 (2022)
2021
[c5]
  persistent URL: https://dblp.org/rec/conf/icml/DadashiRVHPG21
Robert Dadashi, Shideh Rezaeifar, Nino Vieillard, Léonard Hussenot, Olivier Pietquin, Matthieu Geist:
Offline Reinforcement Learning with Pseudometric Learning. ICML 2021: 2307-2318
[i8]
  persistent URL: https://dblp.org/rec/journals/corr/abs-2103-01948
Robert Dadashi, Shideh Rezaeifar, Nino Vieillard, Léonard Hussenot, Olivier Pietquin, Matthieu Geist:
Offline Reinforcement Learning with Pseudometric Learning. CoRR abs/2103.01948 (2021)
[i7]
  persistent URL: https://dblp.org/rec/journals/corr/abs-2106-06431
Shideh Rezaeifar, Robert Dadashi, Nino Vieillard, Léonard Hussenot, Olivier Bachem, Olivier Pietquin, Matthieu Geist:
Offline Reinforcement Learning as Anti-Exploration. CoRR abs/2106.06431 (2021)
[i6]
  persistent URL: https://dblp.org/rec/journals/corr/abs-2108-07041
Nino Vieillard, Marcin Andrychowicz, Anton Raichuk, Olivier Pietquin, Matthieu Geist:
Implicitly Regularized RL with Implicit Q-Values. CoRR abs/2108.07041 (2021)
2020
[c4]
  persistent URL: https://dblp.org/rec/conf/aaai/VieillardPG20
Nino Vieillard, Olivier Pietquin, Matthieu Geist:
Deep Conservative Policy Iteration. AAAI 2020: 6070-6077
[c3]
  persistent URL: https://dblp.org/rec/conf/aistats/VieillardSPG20
Nino Vieillard, Bruno Scherrer, Olivier Pietquin, Matthieu Geist:
Momentum in Reinforcement Learning. AISTATS 2020: 2529-2538
[c2]
  persistent URL: https://dblp.org/rec/conf/nips/VieillardKSPMG20
Nino Vieillard, Tadashi Kozuno, Bruno Scherrer, Olivier Pietquin, Rémi Munos, Matthieu Geist:
Leverage the Average: an Analysis of KL Regularization in Reinforcement Learning. NeurIPS 2020
[c1]
  persistent URL: https://dblp.org/rec/conf/nips/VieillardPG20
Nino Vieillard, Olivier Pietquin, Matthieu Geist:
Munchausen Reinforcement Learning. NeurIPS 2020
[i5]
  persistent URL: https://dblp.org/rec/journals/corr/abs-2003-14089
Nino Vieillard, Tadashi Kozuno, Bruno Scherrer, Olivier Pietquin, Rémi Munos, Matthieu Geist:
Leverage the Average: an Analysis of Regularization in RL. CoRR abs/2003.14089 (2020)
[i4]
  persistent URL: https://dblp.org/rec/journals/corr/abs-2007-14430
Nino Vieillard, Olivier Pietquin, Matthieu Geist:
Munchausen Reinforcement Learning. CoRR abs/2007.14430 (2020)

2010 – 2019

2019
[i3]
  persistent URL: https://dblp.org/rec/journals/corr/abs-1906-09784
Nino Vieillard, Olivier Pietquin, Matthieu Geist:
Deep Conservative Policy Iteration. CoRR abs/1906.09784 (2019)
[i2]
  persistent URL: https://dblp.org/rec/journals/corr/abs-1910-08476
Nino Vieillard, Olivier Pietquin, Matthieu Geist:
On Connections between Constrained Optimization and Reinforcement Learning. CoRR abs/1910.08476 (2019)
[i1]
  persistent URL: https://dblp.org/rec/journals/corr/abs-1910-09322
Nino Vieillard, Bruno Scherrer, Olivier Pietquin, Matthieu Geist:
Momentum in Reinforcement Learning. CoRR abs/1910.09322 (2019)
