default search action
Tabish Rashid
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [i12]Adam Jelley, Yuhan Cao, David Bignell, Sam Devlin, Tabish Rashid:
Aligning Agents like Large Language Models. CoRR abs/2406.04208 (2024) - 2023
- [c10]Tim Pearce, Tabish Rashid, Anssi Kanervisto, David Bignell, Mingfei Sun, Raluca Georgescu, Sergio Valcarcel Macua, Shan Zheng Tan, Ida Momennejad, Katja Hofmann, Sam Devlin:
Imitating Human Behaviour with Diffusion Models. ICLR 2023 - [i11]Tim Pearce, Tabish Rashid, Anssi Kanervisto, David Bignell, Mingfei Sun, Raluca Georgescu, Sergio Valcarcel Macua, Shan Zheng Tan, Ida Momennejad, Katja Hofmann, Sam Devlin:
Imitating Human Behaviour with Diffusion Models. CoRR abs/2301.10677 (2023) - [i10]Lukas Schäfer, Logan Jones, Anssi Kanervisto, Yuhan Cao, Tabish Rashid, Raluca Georgescu, David Bignell, Siddhartha Sen, Andrea Treviño Gavito, Sam Devlin:
Visual Encoders for Data-Efficient Imitation Learning in Modern Video Games. CoRR abs/2312.02312 (2023) - 2021
- [c9]Tabish Rashid, Cheng Zhang, Kamil Ciosek:
Estimating α-Rank by Maximizing Information Gain. AAAI 2021: 5673-5681 - [c8]Ling Pan, Tabish Rashid, Bei Peng, Longbo Huang, Shimon Whiteson:
Regularized Softmax Deep Multi-Agent Q-Learning. NeurIPS 2021: 1365-1377 - [c7]Bei Peng, Tabish Rashid, Christian Schröder de Witt, Pierre-Alexandre Kamienny, Philip H. S. Torr, Wendelin Boehmer, Shimon Whiteson:
FACMAC: Factored Multi-Agent Centralised Policy Gradients. NeurIPS 2021: 12208-12221 - [i9]Tabish Rashid, Cheng Zhang, Kamil Ciosek:
Estimating α-Rank by Maximizing Information Gain. CoRR abs/2101.09178 (2021) - [i8]Ling Pan, Tabish Rashid, Bei Peng, Longbo Huang, Shimon Whiteson:
Softmax with Regularization: Better Value Estimation in Multi-Agent Reinforcement Learning. CoRR abs/2103.11883 (2021) - 2020
- [j1]Tabish Rashid, Mikayel Samvelyan, Christian Schröder de Witt, Gregory Farquhar, Jakob N. Foerster, Shimon Whiteson:
Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning. J. Mach. Learn. Res. 21: 178:1-178:51 (2020) - [c6]Tabish Rashid, Bei Peng, Wendelin Boehmer, Shimon Whiteson:
Optimistic Exploration even with a Pessimistic Initialisation. ICLR 2020 - [c5]Tabish Rashid, Gregory Farquhar, Bei Peng, Shimon Whiteson:
Weighted QMIX: Expanding Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning. NeurIPS 2020 - [i7]Tabish Rashid, Bei Peng, Wendelin Böhmer, Shimon Whiteson:
Optimistic Exploration even with a Pessimistic Initialisation. CoRR abs/2002.12174 (2020) - [i6]Tabish Rashid, Mikayel Samvelyan, Christian Schröder de Witt, Gregory Farquhar, Jakob N. Foerster, Shimon Whiteson:
Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning. CoRR abs/2003.08839 (2020) - [i5]Tabish Rashid, Gregory Farquhar, Bei Peng, Shimon Whiteson:
Weighted QMIX: Expanding Monotonic Value Function Factorisation. CoRR abs/2006.10800 (2020)
2010 – 2019
- 2019
- [c4]Mikayel Samvelyan, Tabish Rashid, Christian Schröder de Witt, Gregory Farquhar, Nantas Nardelli, Tim G. J. Rudner, Chia-Man Hung, Philip H. S. Torr, Jakob N. Foerster, Shimon Whiteson:
The StarCraft Multi-Agent Challenge. AAMAS 2019: 2186-2188 - [c3]Anuj Mahajan, Tabish Rashid, Mikayel Samvelyan, Shimon Whiteson:
MAVEN: Multi-Agent Variational Exploration. NeurIPS 2019: 7611-7622 - [i4]Mikayel Samvelyan, Tabish Rashid, Christian Schröder de Witt, Gregory Farquhar, Nantas Nardelli, Tim G. J. Rudner, Chia-Man Hung, Philip H. S. Torr, Jakob N. Foerster, Shimon Whiteson:
The StarCraft Multi-Agent Challenge. CoRR abs/1902.04043 (2019) - [i3]Wendelin Böhmer, Tabish Rashid, Shimon Whiteson:
Exploration with Unreliable Intrinsic Reward in Multi-Agent Reinforcement Learning. CoRR abs/1906.02138 (2019) - [i2]Anuj Mahajan, Tabish Rashid, Mikayel Samvelyan, Shimon Whiteson:
MAVEN: Multi-Agent Variational Exploration. CoRR abs/1910.07483 (2019) - 2018
- [c2]Tabish Rashid, Mikayel Samvelyan, Christian Schröder de Witt, Gregory Farquhar, Jakob N. Foerster, Shimon Whiteson:
QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning. ICML 2018: 4292-4301 - [i1]Tabish Rashid, Mikayel Samvelyan, Christian Schröder de Witt, Gregory Farquhar, Jakob N. Foerster, Shimon Whiteson:
QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning. CoRR abs/1803.11485 (2018) - 2016
- [c1]Tabish Rashid, Ioannis Agrafiotis, Jason R. C. Nurse:
A New Take on Detecting Insider Threats: Exploring the Use of Hidden Markov Models. MIST@CCS 2016: 47-56
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-10-07 22:23 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint