default search action
Weixun Wang
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j10]Tianze Zhou, Fubiao Zhang, Kun Shao, Zipeng Dai, Kai Li, Wenhan Huang, Weixun Wang, Bin Wang, Dong Li, Wulong Liu, Jianye Hao:
Cooperative Multiagent Transfer Learning With Coalition Pattern Decomposition. IEEE Trans. Games 16(2): 352-364 (2024) - [c25]Jizhou Wu, Jianye Hao, Tianpei Yang, Xiaotian Hao, Yan Zheng, Weixun Wang, Matthew E. Taylor:
PORTAL: Automatic Curricula Generation for Multiagent Reinforcement Learning. AAAI 2024: 15934-15942 - [i21]Shengyi Huang, Michael Noukhovitch, Arian Hosseini, Kashif Rasul, Weixun Wang, Lewis Tunstall:
The N+ Implementation Details of RLHF with PPO: A Case Study on TL;DR Summarization. CoRR abs/2403.17031 (2024) - [i20]Jian Hu, Xibin Wu, Weixun Wang, Xianyu, Dehao Zhang, Yu Cao:
OpenRLHF: An Easy-to-use, Scalable and High-performance RLHF Framework. CoRR abs/2405.11143 (2024) - 2023
- [j9]Tianpei Yang, Weixun Wang, Jianye Hao, Matthew E. Taylor, Yong Liu, Xiaotian Hao, Yujing Hu, Yingfeng Chen, Changjie Fan, Chunxu Ren, Ye Huang, Jiangcheng Zhu, Yang Gao:
ASN: action semantics network for multiagent reinforcement learning. Auton. Agents Multi Agent Syst. 37(2): 45 (2023) - [j8]Siyi Hu, Yifan Zhong, Minquan Gao, Weixun Wang, Hao Dong, Xiaodan Liang, Zhihui Li, Xiaojun Chang, Yaodong Yang:
MARLlib: A Scalable and Efficient Multi-agent Reinforcement Learning Library. J. Mach. Learn. Res. 24: 315:1-315:23 (2023) - [c24]Wei Qiu, Weixun Wang, Rundong Wang, Bo An, Yujing Hu, Svetlana Obraztsova, Zinovi Rabinovich, Jianye Hao, Yingfeng Chen, Changjie Fan:
Off-Beat Multi-Agent Reinforcement Learning. AAMAS 2023: 2424-2426 - [c23]Jizhou Wu, Tianpei Yang, Xiaotian Hao, Jianye Hao, Yan Zheng, Weixun Wang, Matthew E. Taylor:
PORTAL: Automatic Curricula Generation for Multiagent Reinforcement Learning. AAMAS 2023: 2460-2462 - [c22]Jianye Hao, Xiaotian Hao, Hangyu Mao, Weixun Wang, Yaodong Yang, Dong Li, Yan Zheng, Zhen Wang:
Boosting Multiagent Reinforcement Learning via Permutation Invariant and Permutation Equivariant Networks. ICLR 2023 - 2022
- [j7]Jian Zhao, Youpeng Zhao, Weixun Wang, Mingyu Yang, Xunhan Hu, Wengang Zhou, Jianye Hao, Houqiang Li:
Coach-assisted multi-agent reinforcement learning framework for unexpected crashed agents. Frontiers Inf. Technol. Electron. Eng. 23(7): 1032-1042 (2022) - [c21]Li Wang, Yupeng Zhang, Yujing Hu, Weixun Wang, Chongjie Zhang, Yang Gao, Jianye Hao, Tangjie Lv, Changjie Fan:
Individual Reward Assisted Multi-Agent Reinforcement Learning. ICML 2022: 23417-23432 - [c20]Yaodong Yang, Guangyong Chen, Weixun Wang, Xiaotian Hao, Jianye Hao, Pheng-Ann Heng:
Transformer-based Working Memory for Multiagent Reinforcement Learning with Action Parsing. NeurIPS 2022 - [i19]Jian Zhao, Yue Zhang, Xunhan Hu, Weixun Wang, Wengang Zhou, Jianye Hao, Jiangcheng Zhu, Houqiang Li:
Revisiting QMIX: Discriminative Credit Assignment by Gradient Entropy Regularization. CoRR abs/2202.04427 (2022) - [i18]Xiaotian Hao, Weixun Wang, Hangyu Mao, Yaodong Yang, Dong Li, Yan Zheng, Zhen Wang, Jianye Hao:
API: Boosting Multi-Agent Reinforcement Learning via Agent-Permutation-Invariant Networks. CoRR abs/2203.05285 (2022) - [i17]Jian Zhao, Youpeng Zhao, Weixun Wang, Mingyu Yang, Xunhan Hu, Wengang Zhou, Jianye Hao, Houqiang Li:
Coach-assisted Multi-Agent Reinforcement Learning Framework for Unexpected Crashed Agents. CoRR abs/2203.08454 (2022) - [i16]Shengyi Huang, Anssi Kanervisto, Antonin Raffin, Weixun Wang, Santiago Ontañón, Rousslan Fernand Julien Dossa:
A2C is a special case of PPO. CoRR abs/2205.09123 (2022) - [i15]Wei Qiu, Weixun Wang, Rundong Wang, Bo An, Yujing Hu, Svetlana Obraztsova, Zinovi Rabinovich, Jianye Hao, Yingfeng Chen, Changjie Fan:
Off-Beat Multi-Agent Reinforcement Learning. CoRR abs/2205.13718 (2022) - [i14]Siyi Hu, Yifan Zhong, Minquan Gao, Weixun Wang, Hao Dong, Zhihui Li, Xiaodan Liang, Xiaojun Chang, Yaodong Yang:
MARLlib: Extending RLlib for Multi-agent Reinforcement Learning. CoRR abs/2210.13708 (2022) - 2021
- [c19]Tianpei Yang, Weixun Wang, Hongyao Tang, Jianye Hao, Zhaopeng Meng, Hangyu Mao, Dong Li, Wulong Liu, Yingfeng Chen, Yujing Hu, Changjie Fan, Chengwei Zhang:
An Efficient Transfer Learning Framework for Multiagent Reinforcement Learning. NeurIPS 2021: 17037-17048 - [i13]Tianze Zhou, Fubiao Zhang, Kun Shao, Kai Li, Wenhan Huang, Jun Luo, Weixun Wang, Yaodong Yang, Hangyu Mao, Bin Wang, Dong Li, Wulong Liu, Jianye Hao:
Cooperative Multi-Agent Transfer Learning with Level-Adaptive Credit Assignment. CoRR abs/2106.00517 (2021) - 2020
- [c18]Yong Liu, Weixun Wang, Yujing Hu, Jianye Hao, Xingguo Chen, Yang Gao:
Multi-Agent Game Abstraction via Graph Attention Neural Network. AAAI 2020: 7211-7218 - [c17]Weixun Wang, Tianpei Yang, Yong Liu, Jianye Hao, Xiaotian Hao, Yujing Hu, Yingfeng Chen, Changjie Fan, Yang Gao:
From Few to More: Large-Scale Dynamic Multiagent Curriculum Learning. AAAI 2020: 7293-7300 - [c16]Tianpei Yang, Jianye Hao, Zhaopeng Meng, Zongzhang Zhang, Yujing Hu, Yingfeng Chen, Changjie Fan, Weixun Wang, Zhaodong Wang, Jiajie Peng:
Efficient Deep Reinforcement Learning through Policy Transfer. AAMAS 2020: 2053-2055 - [c15]Weixun Wang, Tianpei Yang, Yong Liu, Jianye Hao, Xiaotian Hao, Yujing Hu, Yingfeng Chen, Changjie Fan, Yang Gao:
Action Semantics Network: Considering the Effects of Actions in Multiagent Systems. ICLR 2020 - [c14]Peng Zhang, Jianye Hao, Weixun Wang, Hongyao Tang, Yi Ma, Yihai Duan, Yan Zheng:
KoGuN: Accelerating Deep Reinforcement Learning via Integrating Human Suboptimal Knowledge. IJCAI 2020: 2291-2297 - [c13]Tianpei Yang, Jianye Hao, Zhaopeng Meng, Zongzhang Zhang, Yujing Hu, Yingfeng Chen, Changjie Fan, Weixun Wang, Wulong Liu, Zhaodong Wang, Jiajie Peng:
Efficient Deep Reinforcement Learning via Adaptive Policy Transfer. IJCAI 2020: 3094-3100 - [c12]Xiaotian Hao, Junqi Jin, Jianye Hao, Jin Li, Weixun Wang, Yi Ma, Zhenzhe Zheng, Han Li, Jian Xu, Kun Gai:
Learning to Accelerate Heuristic Searching for Large-Scale Maximum Weighted b-Matching Problems in Online Advertising. IJCAI 2020: 3437-3443 - [c11]Yujing Hu, Weixun Wang, Hangtian Jia, Yixiang Wang, Yingfeng Chen, Jianye Hao, Feng Wu, Changjie Fan:
Learning to Utilize Shaping Rewards: A New Approach of Reward Shaping. NeurIPS 2020 - [i12]Peng Zhang, Jianye Hao, Weixun Wang, Hongyao Tang, Yi Ma, Yihai Duan, Yan Zheng:
KoGuN: Accelerating Deep Reinforcement Learning via Integrating Human Suboptimal Knowledge. CoRR abs/2002.07418 (2020) - [i11]Tianpei Yang, Weixun Wang, Hongyao Tang, Jianye Hao, Zhaopeng Meng, Wulong Liu, Yujing Hu, Yingfeng Chen:
Learning When to Transfer among Agents: An Efficient Multiagent Transfer Learning Framework. CoRR abs/2002.08030 (2020) - [i10]Tianpei Yang, Jianye Hao, Zhaopeng Meng, Zongzhang Zhang, Weixun Wang, Yujing Hu, Yingfeng Chen, Changjie Fan, Zhaodong Wang, Jiajie Peng:
Efficient Deep Reinforcement Learning through Policy Transfer. CoRR abs/2002.08037 (2020) - [i9]Xiaotian Hao, Junqi Jin, Jianye Hao, Jin Li, Weixun Wang, Yi Ma, Zhenzhe Zheng, Han Li, Jian Xu, Kun Gai:
Learning to Accelerate Heuristic Searching for Large-Scale Maximum Weighted b-Matching Problems in Online Advertising. CoRR abs/2005.04355 (2020) - [i8]Yujing Hu, Weixun Wang, Hangtian Jia, Yixiang Wang, Yingfeng Chen, Jianye Hao, Feng Wu, Changjie Fan:
Learning to Utilize Shaping Rewards: A New Approach of Reward Shaping. CoRR abs/2011.02669 (2020)
2010 – 2019
- 2019
- [c10]Xiaotian Hao, Weixun Wang, Jianye Hao, Yaodong Yang:
Independent Generative Adversarial Self-Imitation Learning in Cooperative Multiagent Systems. AAMAS 2019: 1315-1323 - [c9]Weixun Wang, Junqi Jin, Jianye Hao, Chunjie Chen, Chuan Yu, Weinan Zhang, Jun Wang, Xiaotian Hao, Yixi Wang, Han Li, Jian Xu, Kun Gai:
Learning Adaptive Display Exposure for Real-Time Advertising. CIKM 2019: 2595-2603 - [c8]Weixun Wang, Jianye Hao, Yixi Wang, Matthew E. Taylor:
Achieving cooperation through deep multiagent reinforcement learning in sequential prisoner's dilemmas. DAI 2019: 11:1-11:7 - [i7]Weixun Wang, Tianpei Yang, Yong Liu, Jianye Hao, Xiaotian Hao, Yujing Hu, Yingfeng Chen, Changjie Fan, Yang Gao:
Action Semantics Network: Considering the Effects of Actions in Multiagent Systems. CoRR abs/1907.11461 (2019) - [i6]Weixun Wang, Tianpei Yang, Yong Liu, Jianye Hao, Xiaotian Hao, Yujing Hu, Yingfeng Chen, Changjie Fan, Yang Gao:
From Few to More: Large-scale Dynamic Multiagent Curriculum Learning. CoRR abs/1909.02790 (2019) - [i5]Xiaotian Hao, Weixun Wang, Jianye Hao, Yaodong Yang:
Independent Generative Adversarial Self-Imitation Learning in Cooperative Multiagent Systems. CoRR abs/1909.11468 (2019) - [i4]Yong Liu, Weixun Wang, Yujing Hu, Jianye Hao, Xingguo Chen, Yang Gao:
Multi-Agent Game Abstraction via Graph Attention Neural Network. CoRR abs/1911.10715 (2019) - 2018
- [i3]Weixun Wang, Jianye Hao, Yixi Wang, Matthew E. Taylor:
Towards Cooperation in Sequential Prisoner's Dilemmas: a Deep Multiagent Reinforcement Learning Approach. CoRR abs/1803.00162 (2018) - [i2]Weixun Wang, Junqi Jin, Jianye Hao, Chunjie Chen, Chuan Yu, Weinan Zhang, Jun Wang, Yixi Wang, Han Li, Jian Xu, Kun Gai:
Learning to Advertise with Adaptive Exposure via Constrained Two-Level Reinforcement Learning. CoRR abs/1809.03149 (2018) - 2012
- [j6]Weixun Wang, Sanjay Ranka, Prabhat Mishra:
Energy-aware dynamic slack allocation for real-time multitasking systems. Sustain. Comput. Informatics Syst. 2(3): 128-137 (2012) - [j5]Xiaoke Qin, Weixun Wang, Prabhat Mishra:
TCEC: Temperature and Energy-Constrained Scheduling in Real-Time Multitasking Systems. IEEE Trans. Comput. Aided Des. Integr. Circuits Syst. 31(8): 1159-1168 (2012) - [j4]Weixun Wang, Prabhat Mishra, Ann Gordon-Ross:
Dynamic Cache Reconfiguration for Soft Real-Time Systems. ACM Trans. Embed. Comput. Syst. 11(2): 28:1-28:31 (2012) - [j3]Weixun Wang, Prabhat Mishra:
System-Wide Leakage-Aware Energy Minimization Using Dynamic Voltage Scaling and Cache Reconfiguration in Multitasking Systems. IEEE Trans. Very Large Scale Integr. Syst. 20(5): 902-910 (2012) - [r1]Weixun Wang, Xiaoke Qin, Prabhat Mishra:
Energy-Aware Scheduling and Dynamic Reconfiguration in Real-Time Systems. Handbook of Energy-Aware and Green Computing 2012: 543-572 - [i1]Kanad Basu, Subrata Mitra, Srishti Mukherjee, Weixun Wang:
A Novel Approach for Handling Misbehaving Nodes in Behavior-Aware Mobile Networking. CoRR abs/1211.1736 (2012) - 2011
- [j2]Weixun Wang, Prabhat Mishra:
Dynamic Reconfiguration of Two-Level Cache Hierarchy in Real-Time Embedded Systems. J. Low Power Electron. 7(1): 17-28 (2011) - [j1]Weixun Wang, Sanjay Ranka, Prabhat Mishra:
Energy-aware dynamic reconfiguration algorithms for real-time multitasking systems. Sustain. Comput. Informatics Syst. 1(1): 35-45 (2011) - [c7]Weixun Wang, Prabhat Mishra, Sanjay Ranka:
Dynamic cache reconfiguration and partitioning for energy optimization in real-time multi-core systems. DAC 2011: 948-953 - [c6]Weixun Wang, Sanjay Ranka, Prabhat Mishra:
A General Algorithm for Energy-Aware Dynamic Reconfiguration in Multitasking Systems. VLSI Design 2011: 334-339 - 2010
- [c5]Weixun Wang, Prabhat Mishra:
PreDVS: preemptive dynamic voltage scaling for real-time systems using approximation scheme. DAC 2010: 705-710 - [c4]Weixun Wang, Xiaoke Qin, Prabhat Mishra:
Temperature- and energy-constrained scheduling in multitasking systems: a model checking approach. ISLPED 2010: 85-90 - [c3]Weixun Wang, Prabhat Mishra:
Leakage-Aware Energy Minimization Using Dynamic Voltage Scaling and Cache Reconfiguration in Real-Time Systems. VLSI Design 2010: 357-362
2000 – 2009
- 2009
- [c2]Weixun Wang, Prabhat Mishra:
Dynamic Reconfiguration of Two-Level Caches in Soft Real-Time Embedded Systems. ISVLSI 2009: 145-150 - [c1]Weixun Wang, Prabhat Mishra, Ann Gordon-Ross:
SACR: Scheduling-Aware Cache Reconfiguration for Real-Time Embedded Systems. VLSI Design 2009: 547-552
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-09-14 02:03 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint