default search action
Matthieu Zimmer
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j5]Claire Glanois, Paul Weng, Matthieu Zimmer, Dong Li, Tianpei Yang, Jianye Hao, Wulong Liu:
A survey on interpretable reinforcement learning. Mach. Learn. 113(8): 5847-5890 (2024) - [c11]Zheng Xiong, Risto Vuorio, Jacob Beck, Matthieu Zimmer, Kun Shao, Shimon Whiteson:
Distilling Morphology-Conditioned Hypernetworks for Efficient Universal Morphology Control. ICML 2024 - [i16]Zheng Xiong, Risto Vuorio, Jacob Beck, Matthieu Zimmer, Kun Shao, Shimon Whiteson:
Distilling Morphology-Conditioned Hypernetworks for Efficient Universal Morphology Control. CoRR abs/2402.06570 (2024) - [i15]Christopher E. Mower, Yuhui Wan, Hongzhan Yu, Antoine Grosnit, Jonas Gonzalez-Billandon, Matthieu Zimmer, Jinlong Wang, Xinyu Zhang, Yao Zhao, Anbang Zhai, Puze Liu, Davide Tateo, Cesar Cadena, Marco Hutter, Jan Peters, Guangjian Tian, Yuzheng Zhuang, Kun Shao, Xingyue Quan, Jianye Hao, Jun Wang, Haitham Bou-Ammar:
ROS-LLM: A ROS framework for embodied AI with task feedback and structured reasoning. CoRR abs/2406.19741 (2024) - [i14]Matthieu Zimmer, Milan Gritta, Gerasimos Lampouras, Haitham Bou-Ammar, Jun Wang:
Mixture of Attentions For Speculative Decoding. CoRR abs/2410.03804 (2024) - 2023
- [j4]Matthieu Zimmer, Xuening Feng, Claire Glanois, Zhaohui Jiang, Jianyi Zhang, Paul Weng, Dong Li, Jianye Hao, Wulong Liu:
Differentiable Logic Machines. Trans. Mach. Learn. Res. 2023 (2023) - [c10]Antoine Grosnit, Matthieu Zimmer, Rasul Tutunov, Xing Li, Lei Chen, Fan Yang, Mingxuan Yuan, Haitham Bou-Ammar:
Lightweight Structural Choices Operator for Technology Mapping. DAC 2023: 1-6 - [c9]Philip John Gorinski, Matthieu Zimmer, Gerasimos Lampouras, Derrick-Goh-Xin Deik, Ignacio Iacobacci:
Automatic Unit Test Data Generation and Actor-Critic Reinforcement Learning for Code Synthesis. EMNLP (Findings) 2023: 370-384 - [c8]Alexandre Maraval, Matthieu Zimmer, Antoine Grosnit, Haitham Bou-Ammar:
End-to-End Meta-Bayesian Optimisation with Transformer Neural Processes. NeurIPS 2023 - [i13]Alexandre Maraval, Matthieu Zimmer, Antoine Grosnit, Haitham Bou-Ammar:
End-to-End Meta-Bayesian Optimisation with Transformer Neural Processes. CoRR abs/2305.15930 (2023) - [i12]Philip John Gorinski, Matthieu Zimmer, Gerasimos Lampouras, Derrick-Goh-Xin Deik, Ignacio Iacobacci:
Automatic Unit Test Data Generation and Actor-Critic Reinforcement Learning for Code Synthesis. CoRR abs/2310.13669 (2023) - [i11]Filippos Christianos, Georgios Papoudakis, Matthieu Zimmer, Thomas Coste, Zhihao Wu, Jingxuan Chen, Khyati Khandelwal, James Doran, Xidong Feng, Jiacheng Liu, Zheng Xiong, Yicheng Luo, Jianye Hao, Kun Shao, Haitham Bou-Ammar, Jun Wang:
Pangu-Agent: A Fine-Tunable Generalist Agent with Structured Reasoning. CoRR abs/2312.14878 (2023) - 2022
- [c7]Claire Glanois, Zhaohui Jiang, Xuening Feng, Paul Weng, Matthieu Zimmer, Dong Li, Wulong Liu, Jianye Hao:
Neuro-Symbolic Hierarchical Rule Induction. ICML 2022: 7583-7615 - [i10]Alexandre Maraval, Matthieu Zimmer, Antoine Grosnit, Rasul Tutunov, Jun Wang, Haitham Bou-Ammar:
Sample-Efficient Optimisation with Probabilistic Transformer Surrogates. CoRR abs/2205.13902 (2022) - 2021
- [j3]Jiancong Huang, Juan Rojas, Matthieu Zimmer, Hongmin Wu, Yisheng Guan, Paul Weng:
Hyperparameter Auto-Tuning in Self-Supervised Robotic Learning. IEEE Robotics Autom. Lett. 6(2): 3537-3544 (2021) - [c6]Matthieu Zimmer, Claire Glanois, Umer Siddique, Paul Weng:
Learning Fair Policies in Decentralized Cooperative Multi-Agent Reinforcement Learning. ICML 2021: 12967-12978 - [i9]Matthieu Zimmer, Xuening Feng, Claire Glanois, Zhaohui Jiang, Jianyi Zhang, Paul Weng, Jianye Hao, Dong Li, Wulong Liu:
Differentiable Logic Machines. CoRR abs/2102.11529 (2021) - [i8]Claire Glanois, Paul Weng, Matthieu Zimmer, Dong Li, Tianpei Yang, Jianye Hao, Wulong Liu:
A Survey on Interpretable Reinforcement Learning. CoRR abs/2112.13112 (2021) - [i7]Claire Glanois, Xuening Feng, Zhaohui Jiang, Paul Weng, Matthieu Zimmer, Dong Li, Wulong Liu:
Neuro-Symbolic Hierarchical Rule Induction. CoRR abs/2112.13418 (2021) - 2020
- [j2]Yijiong Lin, Jiancong Huang, Matthieu Zimmer, Yisheng Guan, Juan Rojas, Paul Weng:
Invariant Transform Experience Replay: Data Augmentation for Deep Reinforcement Learning. IEEE Robotics Autom. Lett. 5(4): 6615-6622 (2020) - [c5]Umer Siddique, Paul Weng, Matthieu Zimmer:
Learning Fair Policies in Multi-Objective (Deep) Reinforcement Learning with Average and Discounted Rewards. ICML 2020: 8905-8915 - [i6]Umer Siddique, Paul Weng, Matthieu Zimmer:
Learning Fair Policies in Multiobjective (Deep) Reinforcement Learning with Average and Discounted Rewards. CoRR abs/2008.07773 (2020) - [i5]Jiancong Huang, Juan Rojas, Matthieu Zimmer, Hongmin Wu, Yisheng Guan, Paul Weng:
Hyperparameter Auto-tuning in Self-Supervised Robotic Learning. CoRR abs/2010.08252 (2020) - [i4]Matthieu Zimmer, Umer Siddique, Paul Weng:
Learning Fair Policies in Decentralized Cooperative Multi-Agent Reinforcement Learning. CoRR abs/2012.09421 (2020)
2010 – 2019
- 2019
- [c4]Matthieu Zimmer, Paul Weng:
An efficient reinforcement learning algorithm for learning deterministic policies in continuous domains. DAI 2019: 4:1-4:7 - [c3]Matthieu Zimmer, Paul Weng:
Exploiting the Sign of the Advantage Function to Learn Deterministic Policies in Continuous Domains. IJCAI 2019: 4496-4502 - [i3]Matthieu Zimmer, Paul Weng:
Exploiting the Sign of the Advantage Function to Learn Deterministic Policies in Continuous Domains. CoRR abs/1906.04556 (2019) - [i2]Yijiong Lin, Jiancong Huang, Matthieu Zimmer, Juan Rojas, Paul Weng:
Invariant Transform Experience Replay. CoRR abs/1909.10707 (2019) - [i1]Yijiong Lin, Jiancong Huang, Matthieu Zimmer, Juan Rojas, Paul Weng:
Towards More Sample Efficiency in Reinforcement Learning with Data Augmentation. CoRR abs/1910.09959 (2019) - 2018
- [b1]Matthieu Zimmer:
Apprentissage par renforcement développemental. (Developmental reinforcement learning). University of Lorraine, Nancy, France, 2018 - [j1]Matthieu Zimmer, Stéphane Doncieux:
Bootstrapping Q-Learning for Robotics From Neuro-Evolution Results. IEEE Trans. Cogn. Dev. Syst. 10(1): 102-119 (2018) - [c2]Matthieu Zimmer, Yann Boniface, Alain Dutech:
Developmental Reinforcement Learning through Sensorimotor Space Enlargement. ICDL-EPIROB 2018: 33-38 - 2016
- [c1]Matthieu Zimmer, Yann Boniface, Alain Dutech:
Neural fitted actor-critic. ESANN 2016
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-11-14 00:52 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint