


default search action
Tonghan Wang 0001
Person information
- affiliation: Harvard University, Cambridge, MA, USA
- affiliation (former): Tsinghua University, Beijing, China
- affiliation (former): Shandong University, China
Other persons with the same name
- Tonghan Wang 0002
— East China University of Technology, Nanchang, Jiangxi, China (and 1 more)
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
Journal Articles
- 2025
- [j2]Chenghao Li
, Tonghan Wang, Chengjie Wu, Qianchuan Zhao, Jun Yang, Chongjie Zhang
:
Celebrating Diversity With Subtask Specialization in Shared Multiagent Reinforcement Learning. IEEE Trans. Neural Networks Learn. Syst. 36(2): 2051-2065 (2025) - 2024
- [j1]Rongjun Qin, Feng Chen, Tonghan Wang, Lei Yuan, Xiaoran Wu, Yipeng Kang, Zongzhang Zhang, Chongjie Zhang, Yang Yu:
Multi-agent policy transfer via task relationship modeling. Sci. China Inf. Sci. 67(8) (2024)
Conference and Workshop Papers
- 2025
- [c19]Yunfan Zhao, Tonghan Wang, Dheeraj Mysore Nagaraj, Aparna Taneja, Milind Tambe:
The Bandit Whisperer: Communication Learning for Restless Bandits. AAAI 2025: 23404-23413 - 2024
- [c18]Safwan Hossain, Tonghan Wang, Tao Lin, Yiling Chen, David C. Parkes, Haifeng Xu:
Multi-Sender Persuasion: A Computational Perspective. ICML 2024 - [c17]Edwin Zhang, Sadie Zhao, Tonghan Wang, Safwan Hossain, Henry Gasztowtt, Stephan Zheng, David C. Parkes, Milind Tambe, Yiling Chen:
Position: Social Environment Design Should be Further Developed for AI-based Policy-Making. ICML 2024 - [c16]Tonghan Wang
, Yanchen Jiang
, David C. Parkes
:
GemNet: Menu-Based, Strategy-Proof Multi-Bidder Auctions Through Deep Learning. EC 2024: 1100 - 2023
- [c15]Heng Dong, Junyu Zhang, Tonghan Wang, Chongjie Zhang:
Symmetry-Aware Robot Design with Structured Subgroups. ICML 2023: 8334-8355 - [c14]Tonghan Wang, Paul Duetting, Dmitry Ivanov, Inbal Talgam-Cohen, David C. Parkes:
Deep Contract Design via Discontinuous Networks. NeurIPS 2023 - 2022
- [c13]Tonghan Wang, Liang Zeng, Weijun Dong, Qianlan Yang, Yang Yu, Chongjie Zhang:
Context-Aware Sparse Deep Coordination Graphs. ICLR 2022 - [c12]Qianlan Yang, Weijun Dong, Zhizhou Ren, Jianhao Wang, Tonghan Wang, Chongjie Zhang:
Self-Organized Polynomial-Time Coordination Graphs. ICML 2022: 24963-24979 - [c11]Heng Dong, Tonghan Wang, Jiayuan Liu, Chongjie Zhang:
Low-Rank Modular Reinforcement Learning via Muscle Synergy. NeurIPS 2022 - [c10]Yipeng Kang, Tonghan Wang, Qianlan Yang, Xiaoran Wu, Chongjie Zhang:
Non-Linear Coordination Graphs. NeurIPS 2022 - 2021
- [c9]Tonghan Wang, Tarun Gupta, Anuj Mahajan, Bei Peng, Shimon Whiteson, Chongjie Zhang:
RODE: Learning Roles to Decompose Multi-Agent Tasks. ICLR 2021 - [c8]Yihan Wang, Beining Han, Tonghan Wang, Heng Dong, Chongjie Zhang:
DOP: Off-Policy Multi-Agent Decomposed Policy Gradients. ICLR 2021 - [c7]Chenghao Li, Tonghan Wang, Chengjie Wu, Qianchuan Zhao, Jun Yang, Chongjie Zhang:
Celebrating Diversity in Shared Multi-Agent Reinforcement Learning. NeurIPS 2021: 3991-4002 - 2020
- [c6]Tonghan Wang, Jianhao Wang, Yi Wu, Chongjie Zhang:
Influence-Based Multi-Agent Exploration. ICLR 2020 - [c5]Tonghan Wang, Jianhao Wang, Chongyi Zheng, Chongjie Zhang:
Learning Nearly Decomposable Value Functions Via Communication Minimization. ICLR 2020 - [c4]Tonghan Wang, Heng Dong, Victor R. Lesser, Chongjie Zhang:
ROMA: Multi-Agent Reinforcement Learning with Emergent Roles. ICML 2020: 9876-9886 - [c3]Yipeng Kang, Tonghan Wang, Gerard de Melo:
Incorporating Pragmatic Reasoning Communication into Emergent Language. NeurIPS 2020 - 2019
- [c2]Xinliang Song, Tonghan Wang, Chongjie Zhang:
Convergence of Multi-Agent Learning with a Finite Step Size in General-Sum Games. AAMAS 2019: 935-943 - 2018
- [c1]Tonghan Wang, Xueying Qin, Fan Zhong, Xinmeng Tong, Baoquan Chen, Ming C. Lin:
Compact Object Representation of a Non-Rigid Object for Real-Time Tracking in AR Systems. ISMAR Adjunct 2018: 63-68
Informal and Other Publications
- 2025
- [i28]Xinyi Yang, Liang Zeng, Heng Dong, Chao Yu, Xiaoran Wu, Huazhong Yang, Yu Wang, Milind Tambe, Tonghan Wang:
Policy-to-Language: Train LLMs to Explain Decisions with Flow-Matching Generated Rewards. CoRR abs/2502.12530 (2025) - [i27]Tonghan Wang, Yanchen Jiang, David C. Parkes:
BundleFlow: Deep Menus for Combinatorial Auctions by Diffusion-Based Optimization. CoRR abs/2502.15283 (2025) - [i26]Lingkai Kong, Haichuan Wang, Yuqi Pan, Cheol Woo Kim, Mingxiao Song, Alayna Nguyen, Tonghan Wang, Haifeng Xu, Milind Tambe:
Robust Optimization with Diffusion Models for Green Security. CoRR abs/2503.05730 (2025) - [i25]Davin Choo, Yuqi Pan, Tonghan Wang, Milind Tambe, Alastair van Heerden, Cheryl Johnson:
Adaptive Frontier Exploration on Graphs with Applications to Network-Based Disease Testing. CoRR abs/2505.21671 (2025) - 2024
- [i24]Safwan Hossain, Tonghan Wang, Tao Lin, Yiling Chen, David C. Parkes, Haifeng Xu:
Multi-Sender Persuasion - A Computational Perspective. CoRR abs/2402.04971 (2024) - [i23]Edwin Zhang, Sadie Zhao, Tonghan Wang, Safwan Hossain, Henry Gasztowtt, Stephan Zheng, David C. Parkes, Milind Tambe, Yiling Chen:
Social Environment Design. CoRR abs/2402.14090 (2024) - [i22]Tonghan Wang, Yanchen Jiang, David C. Parkes:
GemNet: Menu-Based, Strategy-Proof Multi-Bidder Auctions Through Deep Learning. CoRR abs/2406.07428 (2024) - [i21]Dima Ivanov, Paul Dütting, Inbal Talgam-Cohen, Tonghan Wang, David C. Parkes:
Principal-Agent Reinforcement Learning. CoRR abs/2407.18074 (2024) - [i20]Yunfan Zhao, Tonghan Wang, Dheeraj Nagaraj, Aparna Taneja, Milind Tambe:
The Bandit Whisperer: Communication Learning for Restless Bandits. CoRR abs/2408.05686 (2024) - [i19]Tonghan Wang, Heng Dong, Yanchen Jiang, David C. Parkes, Milind Tambe:
On Diffusion Models for Multi-Agent Partial Observability: Shared Attractors, Error Bounds, and Composite Flow. CoRR abs/2410.13953 (2024) - 2023
- [i18]Heng Dong, Junyu Zhang, Tonghan Wang, Chongjie Zhang:
Symmetry-Aware Robot Design with Structured Subgroups. CoRR abs/2306.00036 (2023) - [i17]Tonghan Wang, Paul Dütting, Dmitry Ivanov, Inbal Talgam-Cohen, David C. Parkes:
Deep Contract Design via Discontinuous Piecewise Affine Neural Networks. CoRR abs/2307.02318 (2023) - [i16]Chenghao Li, Tonghan Wang, Chongjie Zhang, Qianchuan Zhao:
Never Explore Repeatedly in Multi-Agent Reinforcement Learning. CoRR abs/2308.09909 (2023) - 2022
- [i15]Rongjun Qin, Feng Chen, Tonghan Wang, Lei Yuan, Xiaoran Wu, Zongzhang Zhang, Chongjie Zhang, Yang Yu:
Multi-Agent Policy Transfer via Task Relationship Modeling. CoRR abs/2203.04482 (2022) - [i14]Heng Dong, Tonghan Wang, Jiayuan Liu, Chongjie Zhang:
Low-Rank Modular Reinforcement Learning via Muscle Synergy. CoRR abs/2210.15479 (2022) - [i13]Yipeng Kang, Tonghan Wang, Xiaoran Wu, Qianlan Yang, Chongjie Zhang:
Non-Linear Coordination Graphs. CoRR abs/2211.08404 (2022) - 2021
- [i12]Heng Dong, Tonghan Wang, Jiayuan Liu, Chongjie Zhang:
Birds of a Feather Flock Together: A Close Look at Cooperation Emergence via Multi-Agent RL. CoRR abs/2104.11455 (2021) - [i11]Chenghao Li, Chengjie Wu, Tonghan Wang, Jun Yang, Qianchuan Zhao, Chongjie Zhang:
Celebrating Diversity in Shared Multi-Agent Reinforcement Learning. CoRR abs/2106.02195 (2021) - [i10]Tonghan Wang, Liang Zeng, Weijun Dong, Qianlan Yang, Yang Yu, Chongjie Zhang:
Context-Aware Sparse Deep Coordination Graphs. CoRR abs/2106.02886 (2021) - [i9]Siyang Wu, Tonghan Wang, Chenghao Li, Chongjie Zhang:
Containerized Distributed Value-Based Multi-Agent Reinforcement Learning. CoRR abs/2110.08169 (2021) - [i8]Qianlan Yang, Weijun Dong, Zhizhou Ren, Jianhao Wang, Tonghan Wang, Chongjie Zhang:
Self-Organized Polynomial-Time Coordination Graphs. CoRR abs/2112.03547 (2021) - 2020
- [i7]Tonghan Wang, Heng Dong, Victor R. Lesser, Chongjie Zhang:
ROMA: Multi-Agent Reinforcement Learning with Emergent Roles. CoRR abs/2003.08039 (2020) - [i6]Yipeng Kang, Tonghan Wang, Gerard de Melo:
Incorporating Pragmatic Reasoning Communication into Emergent Language. CoRR abs/2006.04109 (2020) - [i5]Yihan Wang, Beining Han, Tonghan Wang, Heng Dong, Chongjie Zhang:
Off-Policy Multi-Agent Decomposed Policy Gradients. CoRR abs/2007.12322 (2020) - [i4]Tonghan Wang, Tarun Gupta, Anuj Mahajan, Bei Peng, Shimon Whiteson, Chongjie Zhang:
RODE: Learning Roles to Decompose Multi-Agent Tasks. CoRR abs/2010.01523 (2020) - 2019
- [i3]Xinliang Song, Tonghan Wang, Chongjie Zhang:
Convergence of Multi-Agent Learning with a Finite Step Size in General-Sum Games. CoRR abs/1903.02868 (2019) - [i2]Tonghan Wang, Jianhao Wang, Chongyi Zheng, Chongjie Zhang:
Learning Nearly Decomposable Value Functions Via Communication Minimization. CoRR abs/1910.05366 (2019) - [i1]Tonghan Wang, Jianhao Wang, Yi Wu, Chongjie Zhang:
Influence-Based Multi-Agent Exploration. CoRR abs/1910.05512 (2019)
Coauthor Index

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from ,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2025-06-29 21:48 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint