


Haifeng Zhang 0002
Person information
- unicode name: 张海峰
- affiliation: Chinese Academy of Sciences, Institute of Automation, Beijing, China
- affiliation: University College London, London, UK
- affiliation (PhD 2018): Peking University, Beijing, China
Other persons with the same name
- Haifeng Zhang (aka: Hai-Feng Zhang) — disambiguation page
- Haifeng Zhang 0001 (aka: Hai-Feng Zhang 0001) — Vanderbilt University, Nashville, TN, USA
- Haifeng Zhang 0003 (aka: Hai-Feng Zhang 0003) — Anhui University, Hefei, China (and 1 more)
- Haifeng Zhang 0004 (aka: Hai-Feng Zhang 0004) — Harbin Institute of Technology, China
- Haifeng Zhang 0005 (aka: Hai-Feng Zhang 0005) — Guangxi Liuzhou Iron and Steel (Group) Company, China
- Haifeng Zhang 0006 (aka: Hai-Feng Zhang 0006) — University of Science and Technology of China, Hefei, China
- Haifeng Zhang 0007 (aka: Hai-Feng Zhang 0007) — Beijing Jiaotong University, Beijing, China
- Haifeng Zhang 0008 (aka: Hai-Feng Zhang 0008) — Carnegie Mellon University, PA, USA
- Haifeng Zhang 0009 (aka: Hai-Feng Zhang 0009) — University of North Texas, Denton, TX, USA
- Haifeng Zhang 0010 (aka: Hai-Feng Zhang 0010) — State Grid Electric Power Research Institute, Beijing, China
- Haifeng Zhang 0011 (aka: Hai-Feng Zhang 0011) — Peking University, Beijing, China
- Haifeng Zhang 0012 (aka: Hai-Feng Zhang 0012) — Fuzhou University, Fuzhou, China
2020 – today
- 2024
- [j6]Ziyi Wang, Xinran Li, Luoyang Sun, Haifeng Zhang, Hualin Liu, Jun Wang:
Learning State-Specific Action Masks for Reinforcement Learning. Algorithms 17(2): 60 (2024) - [j5]Yan Song, He Jiang, Zheng Tian, Haifeng Zhang, Yingping Zhang, Jiangcheng Zhu, Zonghong Dai, Weinan Zhang, Jun Wang:
An Empirical Study on Google Research Football Multi-agent Scenarios. Mach. Intell. Res. 21(3): 549-570 (2024) - [c21]Qirui Mi, Siyu Xia, Yan Song, Haifeng Zhang, Shenghao Zhu, Jun Wang:
TaxAI: A Dynamic Economic Simulator and Benchmark for Multi-agent Reinforcement Learning. AAMAS 2024: 1390-1399 - [c20]Yan Song, He Jiang, Haifeng Zhang, Zheng Tian, Weinan Zhang, Jun Wang:
Boosting Studies of Multi-Agent Reinforcement Learning on Google Research Football Environment: The Past, Present, and Future. AAMAS 2024: 1772-1781 - [c19]Weiyu Ma, Qirui Mi, Yongcheng Zeng, Xue Yan, Runji Lin, Yuqiao Wu, Jun Wang, Haifeng Zhang:
Large Language Models Play StarCraft II: Benchmarks and A Chain of Summarization Approach. NeurIPS 2024 - [i25]Qirui Mi, Zhiyu Zhao, Siyu Xia, Yan Song, Jun Wang, Haifeng Zhang:
Learning Macroeconomic Policies based on Microfoundations: A Stackelberg Mean Field Game Approach. CoRR abs/2403.12093 (2024) - [i24]Zhiyu Zhao, Ning Yang, Xue Yan, Haifeng Zhang, Jun Wang, Yaodong Yang:
Correlated Mean Field Imitation Learning. CoRR abs/2404.09324 (2024) - [i23]Xuanfa Jin, Ziyan Wang, Yali Du, Meng Fang, Haifeng Zhang, Jun Wang:
Learning to Discuss Strategically: A Case Study on One Night Ultimate Werewolf. CoRR abs/2405.19946 (2024) - [i22]Xue Yan, Yan Song, Xidong Feng, Mengyue Yang, Haifeng Zhang, Haitham Bou-Ammar, Jun Wang:
Efficient Reinforcement Learning with Large Language Model Priors. CoRR abs/2410.07927 (2024) - [i21]Yue Deng, Weiyu Ma, Yuxin Fan, Yin Zhang, Haifeng Zhang, Jian Zhao:
A New Approach to Solving SMAC Task: Generating Decision Tree Code from Large Language Models. CoRR abs/2410.16024 (2024) - 2023
- [j4]Muning Wen, Runji Lin, Hanjing Wang, Yaodong Yang, Ying Wen, Luo Mai, Jun Wang, Hai-Feng Zhang, Weinan Zhang:
Large sequence models for sequential decision-making: a survey. Frontiers Comput. Sci. 17(6): 176349 (2023) - [j3]Linghui Meng, Muning Wen, Chenyang Le, Xiyun Li, Dengpeng Xing, Weinan Zhang, Ying Wen, Haifeng Zhang, Jun Wang, Yaodong Yang, Bo Xu:
Offline Pre-trained Multi-agent Decision Transformer. Mach. Intell. Res. 20(2): 233-248 (2023) - [c18]Xue Yan, Jiaxian Guo, Xingzhou Lou, Jun Wang, Haifeng Zhang, Yali Du:
An Efficient End-to-End Training Approach for Zero-Shot Human-AI Coordination. NeurIPS 2023 - [i20]Yan Song, He Jiang, Zheng Tian, Haifeng Zhang, Yingping Zhang, Jiangcheng Zhu, Zonghong Dai, Weinan Zhang, Jun Wang:
An Empirical Study on Google Research Football Multi-agent Scenarios. CoRR abs/2305.09458 (2023) - [i19]Muning Wen, Runji Lin, Hanjing Wang, Yaodong Yang, Ying Wen, Luo Mai, Jun Wang, Haifeng Zhang, Weinan Zhang:
Large Sequence Models for Sequential Decision-Making: A Survey. CoRR abs/2306.13945 (2023) - [i18]Yan Song, He Jiang, Haifeng Zhang, Zheng Tian, Weinan Zhang, Jun Wang:
Boosting Studies of Multi-Agent Reinforcement Learning on Google Research Football Environment: the Past, Present, and Future. CoRR abs/2309.12951 (2023) - [i17]Qirui Mi, Siyu Xia, Yan Song, Haifeng Zhang, Shenghao Zhu, Jun Wang:
TaxAI: A Dynamic Economic Simulator and Benchmark for Multi-Agent Reinforcement Learning. CoRR abs/2309.16307 (2023) - [i16]Xue Yan, Yan Song, Xinyu Cui, Filippos Christianos, Haifeng Zhang, David Henry Mguni, Jun Wang:
Ask more, know better: Reinforce-Learned Prompt Questions for Decision Making with Large Language Models. CoRR abs/2310.18127 (2023) - [i15]Weiyu Ma, Qirui Mi, Xue Yan, Yuqiao Wu, Runji Lin, Haifeng Zhang, Jun Wang:
Large Language Models Play StarCraft II: Benchmarks and A Chain of Summarization Approach. CoRR abs/2312.11865 (2023) - 2022
- [c17]Xue Yan, Yali Du, Binxin Ru, Jun Wang, Haifeng Zhang, Xu Chen:
Learning to Identify Top Elo Ratings: A Dueling Bandits Approach. AAAI 2022: 8797-8805 - [c16]Jingqing Ruan, Yali Du, Xuantang Xiong, Dengpeng Xing, Xiyun Li, Linghui Meng, Haifeng Zhang, Jun Wang, Bo Xu:
GCS: Graph-Based Coordination Strategy for Multi-Agent Reinforcement Learning. AAMAS 2022: 1128-1136 - [c15]Bo Liu, Xidong Feng, Jie Ren, Luo Mai, Rui Zhu, Haifeng Zhang, Jun Wang, Yaodong Yang:
A Theoretical Understanding of Gradient Bias in Meta-Reinforcement Learning. NeurIPS 2022 - [i14]Xue Yan, Yali Du, Binxin Ru, Jun Wang, Haifeng Zhang, Xu Chen:
Learning to Identify Top Elo Ratings: A Dueling Bandits Approach. CoRR abs/2201.04480 (2022) - [i13]Jingqing Ruan, Yali Du, Xuantang Xiong, Dengpeng Xing, Xiyun Li, Linghui Meng, Haifeng Zhang, Jun Wang, Bo Xu:
GCS: Graph-based Coordination Strategy for Multi-Agent Reinforcement Learning. CoRR abs/2201.06257 (2022) - [i12]Qirui Mi, Ning Yang, Haifeng Zhang, Haijun Zhang, Jun Wang:
Joint Caching and Transmission in the Mobile Edge Network: A Multi-Agent Learning Approach. CoRR abs/2209.04164 (2022) - [i11]Runji Lin, Ye Li, Xidong Feng, Zhaowei Zhang, Xian Hong Wu Fung, Haifeng Zhang, Jun Wang, Yali Du, Yaodong Yang:
Contextual Transformer for Offline Meta Reinforcement Learning. CoRR abs/2211.08016 (2022) - 2021
- [c14]Yali Du, Bo Liu, Vincent Moens, Ziqi Liu, Zhicheng Ren, Jun Wang, Xu Chen, Haifeng Zhang:
Learning Correlated Communication Topology in Multi-Agent Reinforcement learning. AAMAS 2021: 456-464 - [c13]Liheng Chen, Hongyi Guo, Yali Du, Fei Fang, Haifeng Zhang, Weinan Zhang, Yong Yu:
Signal Instructed Coordination in Cooperative Multi-agent Reinforcement Learning. DAI 2021: 185-205 - [c12]Qirui Mi, Ning Yang, Haifeng Zhang, Haijun Zhang, Jun Wang:
Joint Caching and Transmission in the Mobile Edge Network: An Multi-Agent Learning Approach. GLOBECOM 2021: 1-6 - [c11]Yali Du, Xue Yan, Xu Chen, Jun Wang, Haifeng Zhang:
Estimating α-Rank from A Few Entries with Low Rank Matrix Completion. ICML 2021: 2870-2879 - [c10]Jakub Grudzien Kuba, Muning Wen, Linghui Meng, Shangding Gu, Haifeng Zhang, David Mguni, Jun Wang, Yaodong Yang:
Settling the Variance of Multi-Agent Policy Gradients. NeurIPS 2021: 13458-13470 - [i10]Jakub Grudzien Kuba, Muning Wen, Yaodong Yang, Linghui Meng, Shangding Gu, Haifeng Zhang, David Henry Mguni, Jun Wang:
Settling the Variance of Multi-Agent Policy Gradients. CoRR abs/2108.08612 (2021) - [i9]Chenguang Wang, Yaodong Yang, Oliver Slumbers, Congying Han, Tiande Guo, Haifeng Zhang, Jun Wang:
A Game-Theoretic Approach for Improving Generalization Ability of TSP Solvers. CoRR abs/2110.15105 (2021) - [i8]Linghui Meng, Muning Wen, Yaodong Yang, Chenyang Le, Xiyun Li, Weinan Zhang, Ying Wen, Haifeng Zhang, Jun Wang, Bo Xu:
Offline Pre-trained Multi-Agent Decision Transformer: One Big Sequence Model Tackles All SMAC Tasks. CoRR abs/2112.02845 (2021) - [i7]Bo Liu, Xidong Feng, Haifeng Zhang, Jun Wang, Yaodong Yang:
Settling the Bias and Variance of Meta-Gradient Estimation for Meta-Reinforcement Learning. CoRR abs/2112.15400 (2021) - 2020
- [c9]Haifeng Zhang, Weizhe Chen, Zeren Huang, Minne Li, Yaodong Yang, Weinan Zhang, Jun Wang:
Bi-Level Actor-Critic for Multi-Agent Coordination. AAAI 2020: 7325-7332
2010 – 2019
- 2019
- [j2]Xinyuan Zhou, Peng Wu, Haifeng Zhang, Weihong Guo, Yuanchang Liu:
Learn to Navigate: Cooperative Path Planning for Unmanned Surface Vehicles Using Deep Reinforcement Learning. IEEE Access 7: 165262-165278 (2019) - [j1]Haifeng Zhang, Zilong Guo, Weinan Zhang, Han Cai, Chris Wang, Yong Yu, Wenxin Li, Jun Wang:
Layout Design for Intelligent Warehouse by Evolution With Fitness Approximation. IEEE Access 7: 166310-166317 (2019) - [c8]Wenxin Li, Haoyu Zhou, Chris Wang, Haifeng Zhang, Xingxing Hong, Yushan Zhou, Qinjian Zhang:
Teaching AI Algorithms with Games Including Mahjong and FightTheLandlord on the Botzone Online Platform. CompEd 2019: 129-135 - [i6]Haifeng Zhang, Weizhe Chen, Zeren Huang, Minne Li, Yaodong Yang, Weinan Zhang, Jun Wang:
Bi-level Actor-Critic for Multi-agent Coordination. CoRR abs/1909.03510 (2019) - [i5]Liheng Chen, Hongyi Guo, Haifeng Zhang, Fei Fang, Yaoming Zhu, Ming Zhou, Weinan Zhang, Qing Wang, Yong Yu:
Signal Instructed Coordination in Team Competition. CoRR abs/1909.04224 (2019) - 2018
- [c7]Haifeng Zhang, Jun Wang, Zhiming Zhou, Weinan Zhang, Ying Wen, Yong Yu, Wenxin Li:
Learning to Design Games: Strategic Environments in Reinforcement Learning. IJCAI 2018: 3068-3074 - [c6]Haoyu Zhou, Haifeng Zhang, Yushan Zhou, Xinchao Wang, Wenxin Li:
Botzone: an online multi-agent competitive platform for AI education. ITiCSE 2018: 33-38 - [i4]Yi Zhang, Houjun Huang, Haifeng Zhang, Liao Ni, Wei Xu, Nasir Uddin Ahmed, Md. Shakil Ahmed, Yilun Jin, Yingjie Chen, Jingxuan Wen, Wenxin Li:
ICFVR 2017: 3rd International Competition on Finger Vein Recognition. CoRR abs/1801.01262 (2018) - [i3]Haifeng Zhang, Zilong Guo, Han Cai, Chris Wang, Weinan Zhang, Yong Yu, Wenxin Li, Jun Wang:
Layout Design for Intelligent Warehouse by Evolution with Fitness Approximation. CoRR abs/1811.05685 (2018) - 2017
- [c5]Haoyu Zhou, Yushan Zhou, Haifeng Zhang, Houjun Huang, Wenxin Li:
Botzone: a competitive and interactive platform for game AI education. ACM TUR-C 2017: 6:1-6:5 - [c4]Yi Zhang, Houjun Huang, Haifeng Zhang, Liao Ni, Wei Xu, Nasir Uddin Ahmed, Md. Shakil Ahmed, Yilun Jin, Yingjie Chen, Jingxuan Wen, Wenxin Li:
ICFVR 2017: 3rd international competition on finger vein recognition. IJCB 2017: 707-714 - [c3]Haifeng Zhang, Weinan Zhang, Yifei Rong, Kan Ren, Wenxin Li, Jun Wang:
Managing Risk of Bidding in Display Advertising. WSDM 2017: 581-590 - [i2]Haifeng Zhang, Weinan Zhang, Yifei Rong, Kan Ren, Wenxin Li, Jun Wang:
Managing Risk of Bidding in Display Advertising. CoRR abs/1701.02433 (2017) - [i1]Haifeng Zhang, Jun Wang, Zhiming Zhou, Weinan Zhang, Ying Wen, Yong Yu, Wenxin Li:
Learning to Design Games: Strategic Environments in Deep Reinforcement Learning. CoRR abs/1707.01310 (2017) - 2016
- [c2]Kan Ren, Weinan Zhang, Yifei Rong, Haifeng Zhang, Yong Yu, Jun Wang:
User Response Learning for Directly Optimizing Campaign Performance in Display Advertising. CIKM 2016: 679-688 - 2015
- [c1]Haifeng Zhang, Dangyi Liu, Wenxin Li:
Space-Consistent Game Equivalence Detection in General Game Playing. CGW/GIGA@IJCAI 2015: 165-177
last updated on 2025-05-22 01:56 CEST by the dblp team
all metadata released as open data under CC0 1.0 license