default search action

combined dblp search
author search
venue search
publication search

ask others

Yaodong Yang 0001

Adam Yang 0001 – 杨耀东

> Home > Persons

Person information

unicode name: 杨耀东
affiliation: Peking University, Institute for AI, Beijing, China
affiliation (former): King's College London, UK
affiliation (former): Huawei Technologies, Noah's Ark Lab, UK
affiliation (PhD): University College London, UK

Other persons with the same name

see FAQ

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2025
[j22]
- view
  authority control:
- export record
  dblp key:
  - journals/nn/FengYZL25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/nn/FengYZL25
Mingxiao Feng, Yaodong Yang, Wengang Zhou, Houqiang Li:
TIMAR: Transition-informed representation for sample-efficient multi-agent reinforcement learning. Neural Networks 184: 107081 (2025)
[j21]
- view
  authority control:
- export record
  dblp key:
  - journals/pami/WangCLJHZLHZYML25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/pami/WangCLJHZLHZYML25
Zihao Wang, Shaofei Cai, Anji Liu, Yonggang Jin, Jinbing Hou, Bowei Zhang, Haowei Lin, Zhaofeng He, Zilong Zheng, Yaodong Yang, Xiaojian Ma, Yitao Liang:
JARVIS-1: Open-World Multi-Task Agents With Memory-Augmented Multimodal Language Models. IEEE Trans. Pattern Anal. Mach. Intell. 47(3): 1894-1907 (2025)
[c81]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/Bai000025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/Bai000025
Fengshuo Bai, Runze Liu, Yali Du, Ying Wen, Yaodong Yang:
RAT: Adversarial Attacks on Deep Reinforcement Agents for Targeted Behaviors. AAAI 2025: 15453-15461
[c80]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/FanYM00Z25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/FanYM00Z25
Wenzhe Fan, Zishun Yu, Chengdong Ma, Changye Li, Yaodong Yang, Xinhua Zhang:
Towards Efficient Collaboration via Graph Modeling in Reinforcement Learning. AAAI 2025: 16505-16513
[c79]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/ZhangCL0ZQ025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/ZhangCL0ZQ025
Xiaoyuan Zhang, Xinyan Cai, Bo Liu, Weidong Huang, Song-Chun Zhu, Siyuan Qi, Yaodong Yang:
Differentiable Information Enhanced Model-Based Reinforcement Learning. AAAI 2025: 22605-22613
[c78]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/LouJW025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/LouJW025
Hantao Lou, Jiaming Ji, Kaile Wang, Yaodong Yang:
Stream Aligner: Efficient Sentence-Level Alignment via Distribution Induction. AAAI 2025: 27500-27508
[c77]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/DaiC0Z025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/DaiC0Z025
Juntao Dai, Taiye Chen, Yaodong Yang, Qian Zheng, Gang Pan:
Mitigating Reward Over-Optimization in RLHF via Behavior-Supported Regularization. ICLR 2025
[c76]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/WangMCMHXZHS025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/WangMCMHXZHS025
Mingzhi Wang, Chengdong Ma, Qizhi Chen, Linjian Meng, Yang Han, Jiancong Xiao, Zhaowei Zhang, Jing Huo, Weijie J. Su, Yaodong Yang:
Magnetic Preference Optimization: Achieving Last-iterate Convergence for Language Model Alignment. ICLR 2025
[c75]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/ZhangBCMWSZ025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/ZhangBCMWSZ025
Zhaowei Zhang, Fengshuo Bai, Qizhi Chen, Chengdong Ma, Mingzhi Wang, Haoran Sun, Zilong Zheng, Yaodong Yang:
Amulet: ReAlignment During Test Time for Personalized Preference Adaptation of LLMs. ICLR 2025
[i129]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2501-03001
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2501-03001
Dongge Wang, Xiang Yan, Zehao Dou, Wenhan Huang, Yaodong Yang, Xiaotie Deng:
Approximating N-Player Nash Equilibrium through Gradient Descent. CoRR abs/2501.03001 (2025)
[i128]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2501-05336
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2501-05336
Hantao Lou, Jiaming Ji, Kaile Wang, Yaodong Yang:
Stream Aligner: Efficient Sentence-Level Alignment via Distribution Induction. CoRR abs/2501.05336 (2025)
[i127]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2502-13569
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2502-13569
Yan Yu, Wengang Zhou, Yaodong Yang, Wanxuan Lu, Yingyan Hou, Houqiang Li:
Model Evolution Framework with Genetic Algorithm for Multi-Task Reinforcement Learning. CoRR abs/2502.13569 (2025)
[i126]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2502-17514
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2502-17514
Hantao Lou, Changye Li, Jiaming Ji, Yaodong Yang:
SAE-V: Interpreting Multimodal Models for Enhanced Alignment. CoRR abs/2502.17514 (2025)
[i125]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2502-18423
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2502-18423
Fengshuo Bai, Yu Li, Jie Chu, Tawei Chou, Runchuan Zhu, Ying Wen, Yaodong Yang, Yuanpei Chen:
Retrieval Dexterity: Efficient Object Retrieval in Clutters with Dexterous Hand. CoRR abs/2502.18423 (2025)
[i124]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2502-19148
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2502-19148
Zhaowei Zhang, Fengshuo Bai, Qizhi Chen, Chengdong Ma, Mingzhi Wang, Haoran Sun, Zilong Zheng, Yaodong Yang:
Amulet: ReAlignment During Test Time for Personalized Preference Adaptation of LLMs. CoRR abs/2502.19148 (2025)
[i123]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2502-20900
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2502-20900
Yifan Zhong, Xuchuan Huang, Ruochong Li, Ceyao Zhang, Yitao Liang, Yaodong Yang, Yuanpei Chen:
DexGraspVLA: A Vision-Language-Action Framework Towards General Dexterous Grasping. CoRR abs/2502.20900 (2025)
[i122]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2503-00339
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2503-00339
Haojun Chen, Minghao Liu, Xiaojian Ma, Zailin Ma, Huimin Wu, Chengdong Ma, Yuanpei Chen, Yifan Zhong, Mingzhi Wang, Qing Li, Yaodong Yang:
Fast Visuomotor Policies via Partial Denoising. CoRR abs/2503.00339 (2025)
[i121]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2503-01178
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2503-01178
Xiaoyuan Zhang, Xinyan Cai, Bo Liu, Weidong Huang, Song-Chun Zhu, Siyuan Qi, Yaodong Yang:
Differentiable Information Enhanced Model-Based Reinforcement Learning. CoRR abs/2503.01178 (2025)
[i120]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2503-03480
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2503-03480
Borong Zhang, Yuhao Zhang, Jiaming Ji, Yingshan Lei, Josef Dai, Yuanpei Chen, Yaodong Yang:
SafeVLA: Towards Safety Alignment of Vision-Language-Action Model via Safe Reinforcement Learning. CoRR abs/2503.03480 (2025)
[i119]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2503-12918
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2503-12918
Pengcheng Wen, Jiaming Ji, Chi-Min Chan, Juntao Dai, Donghai Hong, Yaodong Yang, Sirui Han, Yike Guo:
ThinkPatterns-21k: A Systematic Study on the Impact of Thinking Patterns in LLMs. CoRR abs/2503.12918 (2025)
[i118]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2503-17682
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2503-17682
Jiaming Ji, Xinyu Chen, Rui Pan, Han Zhu, Conghui Zhang, Jiahao Li, Donghai Hong, Boyuan Chen, Jiayi Zhou, Kaile Wang, Juntao Dai, Chi-Min Chan, Sirui Han, Yike Guo, Yaodong Yang:
Safe RLHF-V: Safe Reinforcement Learning from Human Feedback in Multimodal Large Language Models. CoRR abs/2503.17682 (2025)
[i117]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2503-18130
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2503-18130
Juntao Dai, Taiye Chen, Yaodong Yang, Qian Zheng, Gang Pan:
Mitigating Reward Over-Optimization in RLHF via Behavior-Supported Regularization. CoRR abs/2503.18130 (2025)
[i116]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2503-23120
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2503-23120
Yuhan Wang, Yu Li, Yaodong Yang, Yuanpei Chen:
Dexterous Non-Prehensile Manipulation for Ungraspable Object via Extrinsic Dexterity. CoRR abs/2503.23120 (2025)
[i115]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2504-12911
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2504-12911
Weijie Shi, Chengyi Ju, Chengzhong Liu, Jiaming Ji, Jipeng Zhang, Ruiyuan Zhang, Jia Zhu, Jiajie Xu, Yaodong Yang, Sirui Han, Yike Guo:
Benchmarking Multi-National Value Alignment for Large Language Models. CoRR abs/2504.12911 (2025)
2024
[j20]
- view
  - electronic edition @ jmlr.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/jmlr/ZhongKFHJ024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jmlr/ZhongKFHJ024
Yifan Zhong, Jakub Grudzien Kuba, Xidong Feng, Siyi Hu, Jiaming Ji, Yaodong Yang:
Heterogeneous-Agent Reinforcement Learning. J. Mach. Learn. Res. 25: 32:1-32:67 (2024)
[j19]
- view
  authority control:
- export record
  dblp key:
  - journals/nn/WangZLWPLY24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/nn/WangZLWPLY24
Dongzi Wang, Fangwei Zhong, Minglong Li, Muning Wen, Yuanxi Peng, Teng Li, Adam Yang:
RoMAT: Role-based multi-agent transformer for generalizable heterogeneous cooperation. Neural Networks 174: 106129 (2024)
[j18]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/nn/LiuZLYLO24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/nn/LiuZLYLO24
Jie Liu, Yinmin Zhang, Chuming Li, Yaodong Yang, Yu Liu, Wanli Ouyang:
Adaptive pessimism via target Q-value for offline reinforcement learning. Neural Networks 180: 106588 (2024)
[j17]
- view
  authority control:
- export record
  dblp key:
  - journals/pami/ChenGZJJLDY24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/pami/ChenGZJJLDY24
Yuanpei Chen, Yiran Geng, Fangwei Zhong, Jiaming Ji, Jiechuang Jiang, Zongqing Lu, Hao Dong, Yaodong Yang:
Bi-DexHands: Towards Human-Level Bimanual Dexterous Manipulation. IEEE Trans. Pattern Anal. Mach. Intell. 46(5): 2804-2818 (2024)
[j16]
- view
  authority control:
- export record
  dblp key:
  - journals/pami/WangYMYY24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/pami/WangYMYY24
Chenguang Wang, Zhouliang Yu, Stephen McAleer, Tianshu Yu, Yaodong Yang:
ASP: Learn a Universal Neural Solver! IEEE Trans. Pattern Anal. Mach. Intell. 46(6): 4102-4114 (2024)
[j15]
- view
  authority control:
- export record
  dblp key:
  - journals/ral/LiLGLYZLH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ral/LiLGLYZLH24
Yuyang Li, Bo Liu, Yiran Geng, Puhao Li, Yaodong Yang, Yixin Zhu, Tengyu Liu, Siyuan Huang:
Grasp Multiple Objects With One Hand. IEEE Robotics Autom. Lett. 9(5): 4027-4034 (2024)
[j14]
- view
  authority control:
- export record
  dblp key:
  - journals/tmc/LiSHLWLWTYZCWY24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tmc/LiSHLWLWTYZCWY24
Yang Li, Fanglei Sun, Jingchen Hu, Chang Liu, Fan Wu, Kai Li, Ying Wen, Zheng Tian, Yaodong Yang, Jiangcheng Zhu, Zhifeng Chen, Jun Wang, Yang Yang:
Self-Supervised MAFENN for Classifying Low-Labeled Distorted Images Over Mobile Fading Channels. IEEE Trans. Mob. Comput. 23(8): 8077-8091 (2024)
[c74]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/ZhangLLN0LO24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/ZhangLLN0LO24
Yinmin Zhang, Jie Liu, Chuming Li, Yazhe Niu, Yaodong Yang, Yu Liu, Wanli Ouyang:
A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning. AAAI 2024: 16908-16916
[c73]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/ChenZ0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/ChenZ0024
Sirui Chen, Zhaowei Zhang, Yaodong Yang, Yali Du:
STAS: Spatial-Temporal Return Decomposition for Solving Sparse Rewards Problems in Multi-agent Reinforcement Learning. AAAI 2024: 17337-17345
[c72]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/ZhangYHWLSZZLZC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/ZhangYHWLSZZLZC24
Ceyao Zhang, Kaijie Yang, Siyi Hu, Zihao Wang, Guanghe Li, Yihang Sun, Cheng Zhang, Zhaowei Zhang, Anji Liu, Song-Chun Zhu, Xiaojun Chang, Junge Zhang, Feng Yin, Yitao Liang, Yaodong Yang:
ProAgent: Building Proactive Cooperative Agents with Large Language Models. AAAI 2024: 17591-17599
[c71]
- view
  authority control:
- export record
  dblp key:
  - conf/apweb/FengLYWC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apweb/FengLYWC24
Shaoting Feng, Qinya Li, Yaodong Yang, Fan Wu, Guihai Chen:
GIPUT: Maximizing Photo Coverage Efficiency for UAV Trajectory. APWeb/WAIM (1) 2024: 391-406
[c70]
- view
  authority control:
- export record
  dblp key:
  - conf/atal/DinhMTWY24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/atal/DinhMTWY24
Le Cong Dinh, David Henry Mguni, Long Tran-Thanh, Jun Wang, Yaodong Yang:
A Summary of Online Markov Decision Processes with Non-oblivious Strategic Adversary. AAMAS 2024: 2830-2832
[c69]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/corl/WangDLC0B0G24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/corl/WangDLC0B0G24
Qianxu Wang, Congyue Deng, Tyler Ga Wei Lum, Yuanpei Chen, Yaodong Yang, Jeannette Bohg, Yixin Zhu, Leonidas J. Guibas:
Neural Attention Field: Emerging Point Relevance in 3D Scenes for One-Shot Dexterous Grasping. CoRL 2024: 4495-4508
[c68]
- view
  authority control:
- export record
  dblp key:
  - conf/cvpr/CuiLL00024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/CuiLL00024
Jieming Cui, Tengyu Liu, Nian Liu, Yaodong Yang, Yixin Zhu, Siyuan Huang:
AnySkill: Learning Open-Vocabulary Physical Skill for Interactive Agents. CVPR 2024: 852-862
[c67]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/0008JXZ024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/0008JXZ024
Weidong Huang, Jiaming Ji, Chunhe Xia, Borong Zhang, Yaodong Yang:
SafeDreamer: Safe Reinforcement Learning with World Models. ICLR 2024
[c66]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/DaiPSJXL0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/DaiPSJXL0024
Josef Dai, Xuehai Pan, Ruiyang Sun, Jiaming Ji, Xinbo Xu, Mickel Liu, Yizhou Wang, Yaodong Yang:
Safe RLHF: Safe Reinforcement Learning from Human Feedback. ICLR 2024
[c65]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/LiGXX0WL0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/LiGXX0WL0024
Simin Li, Jun Guo, Jingqiao Xiu, Ruixiao Xu, Xin Yu, Jiakai Wang, Aishan Liu, Yaodong Yang, Xianglong Liu:
Byzantine Robust Cooperative Multi-Agent Reinforcement Learning as a Bayesian Game. ICLR 2024
[c64]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/LiuZHFFC024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/LiuZHFFC024
Jiarong Liu, Yifan Zhong, Siyi Hu, Haobo Fu, Qiang Fu, Xiaojun Chang, Yaodong Yang:
Maximum Entropy Heterogeneous-Agent Reinforcement Learning. ICLR 2024
[c63]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/QiCLKWYWZZZL0Z24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/QiCLKWYWZZZL0Z24
Siyuan Qi, Shuo Chen, Yexin Li, Xiangyu Kong, Junqi Wang, Bangcheng Yang, Pring Wong, Yifan Zhong, Xiaoyuan Zhang, Zhaowei Zhang, Nian Liu, Yaodong Yang, Song-Chun Zhu:
CivRealm: A Learning and Reasoning Odyssey in Civilization for Decision-Making Agents. ICLR 2024
[c62]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/Dai0Z024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/Dai0Z024
Juntao Dai, Yaodong Yang, Qian Zheng, Gang Pan:
Safe Reinforcement Learning using Finite-Horizon Gradient-based Estimation. ICML 2024
[c61]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/HuangLK0ZF24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/HuangLK0ZF24
Yizhe Huang, Anji Liu, Fanqi Kong, Yaodong Yang, Song-Chun Zhu, Xue Feng:
Efficient Adaptation in Mixed-Motive Environments via Hierarchical Opponent Modeling and Planning. ICML 2024
[c60]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/LuoZXY0L24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/LuoZXY0L24
Lirui Luo, Guoxi Zhang, Hongming Xu, Yaodong Yang, Cong Fang, Qing Li:
End-to-End Neuro-Symbolic Reinforcement Learning with Textual Explanations. ICML 2024
[c59]
- view
  - electronic edition @ ijcai.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/ijcai/ChenZ0Z0S024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/ChenZ0Z0S024
Ruiqing Chen, Xiaoyuan Zhang, Yali Du, Yifan Zhong, Zheng Tian, Fanglei Sun, Yaodong Yang:
Off-Agent Trust Region Policy Optimization. IJCAI 2024: 3798-3806
[c58]
- view
  authority control:
- export record
  dblp key:
  - conf/ijcnn/ZhangYLZL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcnn/ZhangYLZL24
Yue Zhang, Yaodong Yang, Zhenbo Lu, Wengang Zhou, Houqiang Li:
Remember the Past for Better Future: Memory-Augmented Offline RL. IJCNN 2024: 1-8
[c57]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/DaiCWYCJ024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/DaiCWYCJ024
Juntao Dai, Tianle Chen, Xuyao Wang, Ziran Yang, Taiye Chen, Jiaming Ji, Yaodong Yang:
SafeSora: Towards Safety Alignment of Text2Video Generation via a Human Preference Dataset. NeurIPS 2024
[c56]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/Ji0LHZPQD024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/Ji0LHZPQD024
Jiaming Ji, Boyuan Chen, Hantao Lou, Donghai Hong, Borong Zhang, Xuehai Pan, Tianyi Qiu, Juntao Dai, Yaodong Yang:
Aligner: Efficient Alignment by Learning to Correct. NeurIPS 2024
[c55]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/QiuZHLJ024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/QiuZHLJ024
Tianyi Qiu, Yang Zhang, Xuchuan Huang, Jasmine Xinze Li, Jiaming Ji, Yaodong Yang:
ProgressGym: Alignment with a Millennium of Moral Progress. NeurIPS 2024
[c54]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/ZhongMZYC0Q024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/ZhongMZYC0Q024
Yifan Zhong, Chengdong Ma, Xiaoyuan Zhang, Ziran Yang, Haojun Chen, Qingfu Zhang, Siyuan Qi, Yaodong Yang:
Panacea: Pareto Alignment via Preference Adaptation for LLMs. NeurIPS 2024
[i114]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2401-10568
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2401-10568
Siyuan Qi, Shuo Chen, Yexin Li, Xiangyu Kong, Junqi Wang, Bangcheng Yang, Pring Wong, Yifan Zhong, Xiaoyuan Zhang, Zhaowei Zhang, Nian Liu, Wei Wang, Yaodong Yang, Song-Chun Zhu:
CivRealm: A Learning and Reasoning Odyssey in Civilization for Decision-Making Agents. CoRR abs/2401.10568 (2024)
[i113]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2402-02030
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-02030
Yifan Zhong, Chengdong Ma, Xiaoyuan Zhang, Ziran Yang, Qingfu Zhang, Siyuan Qi, Yaodong Yang:
Panacea: Pareto Alignment via Preference Adaptation for LLMs. CoRR abs/2402.02030 (2024)
[i112]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2402-02416
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-02416
Jiaming Ji, Boyuan Chen, Hantao Lou, Donghai Hong, Borong Zhang, Xuehai Pan, Juntao Dai, Yaodong Yang:
Aligner: Achieving Efficient Alignment through Weak-to-Strong Correction. CoRR abs/2402.02416 (2024)
[i111]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2402-10184
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-10184
Tianyi Qiu, Fanzhi Zeng, Jiaming Ji, Dong Yan, Kaile Wang, Jiayi Zhou, Han Yang, Josef Dai, Xuehai Pan, Yaodong Yang:
Rethinking Information Structures in RLHF: Reward Generalization from a Graph Theory Perspective. CoRR abs/2402.10184 (2024)
[i110]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2402-12907
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-12907
Zhaowei Zhang, Fengshuo Bai, Mingzhi Wang, Haoyang Ye, Chengdong Ma, Yaodong Yang:
Incentive Compatibility for AI Alignment in Sociotechnical Systems: Positions and Prospects. CoRR abs/2402.12907 (2024)
[i109]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2403-00255
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-00255
Naming Liu, Mingzhi Wang, Youzhi Zhang, Yaodong Yang, Bo An, Ying Wen:
Leveraging Team Correlation for Approximating Equilibrium in Two-Team Zero-Sum Games. CoRR abs/2403.00255 (2024)
[i108]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2403-12421
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-12421
Tianhao Wu, Yunchong Gan, Mingdong Wu, Jingbo Cheng, Yaodong Yang, Yixin Zhu, Hao Dong:
UniDexFPM: Universal Dexterous Functional Pre-grasp Manipulation Via Diffusion Policy. CoRR abs/2403.12421 (2024)
[i107]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2403-12451
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-12451
Lirui Luo, Guoxi Zhang, Hongming Xu, Yaodong Yang, Cong Fang, Qing Li:
INSIGHT: End-to-End Neuro-Symbolic Visual Reinforcement Learning with Language Explanations. CoRR abs/2403.12451 (2024)
[i106]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2403-12835
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-12835
Jieming Cui, Tengyu Liu, Nian Liu, Yaodong Yang, Yixin Zhu, Siyuan Huang:
AnySkill: Learning Open-Vocabulary Physical Skill for Interactive Agents. CoRR abs/2403.12835 (2024)
[i105]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2404-09324
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-09324
Zhiyu Zhao, Ning Yang, Xue Yan, Haifeng Zhang, Jun Wang, Yaodong Yang:
Correlated Mean Field Imitation Learning. CoRR abs/2404.09324 (2024)
[i104]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2405-18688
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-18688
Fengshuo Bai, Rui Zhao, Hongming Zhang, Sijia Cui, Ying Wen, Yaodong Yang, Bo Xu, Lei Han:
Efficient Preference-based Reinforcement Learning via Aligned Experience Estimation. CoRR abs/2405.18688 (2024)
[i103]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2405-18718
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-18718
Fengshuo Bai, Mingzhi Wang, Zhaowei Zhang, Boyuan Chen, Yinda Xu, Ying Wen, Yaodong Yang:
Efficient Model-agnostic Alignment via Bayesian Persuasion. CoRR abs/2405.18718 (2024)
[i102]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2405-21027
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-21027
Jiesong Lian, Yucong Huang, Mingzhi Wang, Chengdong Ma, Yixue Hao, Ying Wen, Yaodong Yang:
Fusion-PSRO: Nash Policy Fusion for Policy Space Response Oracles. CoRR abs/2405.21027 (2024)
[i101]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-06144
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-06144
Jiaming Ji, Kaile Wang, Tianyi Qiu, Boyuan Chen, Jiayi Zhou, Changye Li, Hantao Lou, Yaodong Yang:
Language Models Resist Alignment. CoRR abs/2406.06144 (2024)
[i100]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-08002
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-08002
Yizhe Huang, Anji Liu, Fanqi Kong, Yaodong Yang, Song-Chun Zhu, Xue Feng:
Efficient Adaptation in Mixed-Motive Environments via Hierarchical Opponent Modeling and Planning. CoRR abs/2406.08002 (2024)
[i99]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-14477
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-14477
Josef Dai, Tianle Chen, Xuyao Wang, Ziran Yang, Taiye Chen, Jiaming Ji, Yaodong Yang:
SafeSora: Towards Safety Alignment of Text2Video Generation via a Human Preference Dataset. CoRR abs/2406.14477 (2024)
[i98]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-15513
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-15513
Jiaming Ji, Donghai Hong, Borong Zhang, Boyuan Chen, Josef Dai, Boren Zheng, Tianyi Qiu, Boxun Li, Yaodong Yang:
PKU-SafeRLHF: A Safety Alignment Preference Dataset for Llama Family Models. CoRR abs/2406.15513 (2024)
[i97]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-20087
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-20087
Tianyi Qiu, Yang Zhang, Xuchuan Huang, Jasmine Xinze Li, Jiaming Ji, Yaodong Yang:
ProgressGym: Alignment with a Millennium of Moral Progress. CoRR abs/2406.20087 (2024)
[i96]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2408-01072
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2408-01072
Ruize Zhang, Zelai Xu, Chengdong Ma, Chao Yu, Wei-Wei Tu, Shiyu Huang, Deheng Ye, Wenbo Ding, Yaodong Yang, Yu Wang:
A Survey on Self-play Methods in Reinforcement Learning. CoRR abs/2408.01072 (2024)
[i95]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-00162
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-00162
Jiayi Zhou, Jiaming Ji, Juntao Dai, Yaodong Yang:
Sequence to Sequence Reward Modeling: Improving RLHF by Language Feedback. CoRR abs/2409.00162 (2024)
[i94]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2410-01575
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-01575
Naming Liu, Mingzhi Wang, Xihuai Wang, Weinan Zhang, Yaodong Yang, Youzhi Zhang, Bo An, Ying Wen:
Computing Ex Ante Equilibrium in Heterogeneous Zero-Sum Team Games. CoRR abs/2410.01575 (2024)
[i93]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2410-15841
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-15841
Wenzhe Fan, Zishun Yu, Chengdong Ma, Changye Li, Yaodong Yang, Xinhua Zhang:
Towards Efficient Collaboration via Graph Modeling in Reinforcement Learning. CoRR abs/2410.15841 (2024)
[i92]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2410-16714
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-16714
Mingzhi Wang, Chengdong Ma, Qizhi Chen, Linjian Meng, Yang Han, Jiancong Xiao, Zhaowei Zhang, Jing Huo, Weijie J. Su, Yaodong Yang:
Magnetic Preference Optimization: Achieving Last-iterate Convergence for Language Models Alignment. CoRR abs/2410.16714 (2024)
[i91]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2410-23039
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-23039
Qianxu Wang, Congyue Deng, Tyler Ga Wei Lum, Yuanpei Chen, Yaodong Yang, Jeannette Bohg, Yixin Zhu, Leonidas J. Guibas:
Neural Attention Field: Emerging Point Relevance in 3D Scenes for One-Shot Dexterous Grasping. CoRR abs/2410.23039 (2024)
[i90]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2411-00954
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2411-00954
Xiaohang Tang, Chiyuan Wang, Chengdong Ma, Ilija Bogunovic, Stephen McAleer, Yaodong Yang:
Sample-Efficient Regret-Minimizing Double Oracle in Extensive-Form Games. CoRR abs/2411.00954 (2024)
[i89]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2411-06459
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2411-06459
Nian Liu, Libin Liu, Zilong Zhang, Zi Wang, Hongzhao Xie, Tengyu Liu, Xinyi Tong, Yaodong Yang, Zhaofeng He:
Learning Uniformly Distributed Embedding Clusters of Stylistic Skills for Physically Simulated Characters. CoRR abs/2411.06459 (2024)
[i88]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2412-10713
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2412-10713
Fengshuo Bai, Runze Liu, Yali Du, Ying Wen, Yaodong Yang:
RAT: Adversarial Attacks on Deep Reinforcement Agents for Targeted Behaviors. CoRR abs/2412.10713 (2024)
[i87]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2412-11138
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2412-11138
Juntao Dai, Yaodong Yang, Qian Zheng, Gang Pan:
Safe Reinforcement Learning using Finite-Horizon Gradient-based Estimation. CoRR abs/2412.11138 (2024)
[i86]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2412-15838
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2412-15838
Jiaming Ji, Jiayi Zhou, Hantao Lou, Boyuan Chen, Donghai Hong, Xuyao Wang, Wenqi Chen, Kaile Wang, Rui Pan, Jiahao Li, Mohan Wang, Josef Dai, Tianyi Qiu, Hua Xu, Dong Li, Weipeng Chen, Jun Song, Bo Zheng, Yaodong Yang:
Align Anything: Training All-Modality Models to Follow Instructions with Language Feedback. CoRR abs/2412.15838 (2024)
2023
[j13]
- view
  authority control:
- export record
  dblp key:
  - journals/aamas/DinhMTWY23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/aamas/DinhMTWY23
Le Cong Dinh, David Henry Mguni, Long Tran-Thanh, Jun Wang, Yaodong Yang:
Online Markov decision processes with non-oblivious strategic adversary. Auton. Agents Multi Agent Syst. 37(1): 15 (2023)
[j12]
- view
  authority control:
- export record
  dblp key:
  - journals/ai/GuKCDYKY23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ai/GuKCDYKY23
Shangding Gu, Jakub Grudzien Kuba, Yuanpei Chen, Yali Du, Long Yang, Alois C. Knoll, Yaodong Yang:
Safe multi-agent reinforcement learning for multi-robot control. Artif. Intell. 319: 103905 (2023)
[j11]
- view
  authority control:
- export record
  dblp key:
  - journals/fcsc/WenLWYWMWZZ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/fcsc/WenLWYWMWZZ23
Muning Wen, Runji Lin, Hanjing Wang, Yaodong Yang, Ying Wen, Luo Mai, Jun Wang, Hai-Feng Zhang, Weinan Zhang:
Large sequence models for sequential decision-making: a survey. Frontiers Comput. Sci. 17(6): 176349 (2023)
[j10]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ijautcomp/MengWLLXZWZWYX23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ijautcomp/MengWLLXZWZWYX23
Linghui Meng, Muning Wen, Chenyang Le, Xiyun Li, Dengpeng Xing, Weinan Zhang, Ying Wen, Haifeng Zhang, Jun Wang, Yaodong Yang, Bo Xu:
Offline Pre-trained Multi-agent Decision Transformer. Mach. Intell. Res. 20(2): 233-248 (2023)
[j9]
- view
  - electronic edition @ jmlr.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/jmlr/ZhouWWWW0000023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jmlr/ZhouWWWW0000023
Ming Zhou, Ziyu Wan, Hanjing Wang, Muning Wen, Runzhe Wu, Ying Wen, Yaodong Yang, Yong Yu, Jun Wang, Weinan Zhang:
MALib: A Parallel Framework for Population-based Multi-agent Reinforcement Learning. J. Mach. Learn. Res. 24: 150:1-150:12 (2023)
[j8]
- view
  - electronic edition @ jmlr.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/jmlr/HuZGW0L0C023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jmlr/HuZGW0L0C023
Siyi Hu, Yifan Zhong, Minquan Gao, Weixun Wang, Hao Dong, Xiaodan Liang, Zhihui Li, Xiaojun Chang, Yaodong Yang:
MARLlib: A Scalable and Efficient Multi-agent Reinforcement Learning Library. J. Mach. Learn. Res. 24: 315:1-315:23 (2023)
[j7]
- view
  - electronic edition @ jmlr.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/jmlr/RenF0PFM023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jmlr/RenF0PFM023
Jie Ren, Xidong Feng, Bo Liu, Xuehai Pan, Yao Fu, Luo Mai, Yaodong Yang:
TorchOpt: An Efficient Library for Differentiable Optimization. J. Mach. Learn. Res. 24: 367:1-367:14 (2023)
[j6]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - journals/tmlr/0116XZZM00D023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tmlr/0116XZZM00D023
Yang Li, Kun Xiong, Yingping Zhang, Jiangcheng Zhu, Stephen Marcus McAleer, Wei Pan, Jun Wang, Zonghong Dai, Yaodong Yang:
JiangJun: Mastering Xiangqi by Tackling Non-Transitivity in Two-Player Zero-Sum Games. Trans. Mach. Learn. Res. 2023 (2023)
[c53]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/LiLZWN0LO23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/LiLZWN0LO23
Chuming Li, Jie Liu, Yinmin Zhang, Yuhong Wei, Yazhe Niu, Yaodong Yang, Yu Liu, Wanli Ouyang:
ACE: Cooperative Multi-Agent Q-learning with Bidirectional Action-Dependency. AAAI 2023: 8536-8544
[c52]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/MguniJWNSTTYDCZ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/MguniJWNSTTYDCZ23
David Mguni, Taher Jafferjee, Jianhong Wang, Nicolas Perez Nieves, Wenbin Song, Feifei Tong, Matthew E. Taylor, Tianpei Yang, Zipeng Dai, Hui Chen, Jiangcheng Zhu, Kun Shao, Jun Wang, Yaodong Yang:
Learning to Shape Rewards Using a Game of Two Partners. AAAI 2023: 11604-11612
[c51]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/XuZYYYH23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/XuZYYYH23
Pei Xu, Junge Zhang, Qiyue Yin, Chao Yu, Yaodong Yang, Kaiqi Huang:
Subspace-Aware Exploration for Sparse-Reward Multi-Agent Tasks. AAAI 2023: 11717-11725
[c50]
- view
  authority control:
- export record
  dblp key:
  - conf/atal/0001HZ0W0D23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/atal/0001HZ0W0D23
Zhijian Duan, Wenhan Huang, Dinghuai Zhang, Yali Du, Jun Wang, Yaodong Yang, Xiaotie Deng:
Is Nash Equilibrium Approximator Learnable? AAMAS 2023: 233-241
[c49]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/corl/HuangCWQ00W23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/corl/HuangCWQ00W23
Binghao Huang, Yuanpei Chen, Tianyu Wang, Yuzhe Qin, Yaodong Yang, Nikolay Atanasov, Xiaolong Wang:
Dynamic Handover: Throw and Catch with Bimanual Hands. CoRL 2023: 1887-1902
[c48]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/ecai/LiJLZN00O23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ecai/LiJLZN00O23
Chuming Li, Ruonan Jia, Jie Liu, Yinmin Zhang, Yazhe Niu, Yaodong Yang, Yu Liu, Wanli Ouyang:
Theoretically Guaranteed Policy Improvement Distilled from Model-Based Planning. ECAI 2023: 1381-1388
[c47]
- view
  authority control:
- export record
  dblp key:
  - conf/iccv/WanGLS0Y023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iccv/WanGLS0Y023
Weikang Wan, Haoran Geng, Yun Liu, Zikang Shan, Yaodong Yang, Li Yi, He Wang:
UniDexGrasp++: Improving Dexterous Grasping Policy Learning via Geometry-aware Curriculum and Iterative Generalist-Specialist Learning. ICCV 2023: 3868-3879
[c46]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/WuYFT00F023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/WuYFT00F023
Shuang Wu, Jian Yao, Haobo Fu, Ye Tian, Chao Qian, Yaodong Yang, Qiang Fu, Wei Yang:
Quality-Similar Diversity via Population Based Reinforcement Learning. ICLR 2023
[c45]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/MguniCJWYFMTW023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/MguniCJWYFMTW023
David Henry Mguni, Haojun Chen, Taher Jafferjee, Jianhong Wang, Longfei Yue, Xidong Feng, Stephen Marcus McAleer, Feifei Tong, Jun Wang, Yaodong Yang:
MANSA: Learning Fast and Slow in Multi-Agent Systems. ICML 2023: 24631-24658
[c44]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/SlumbersMBM0023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/SlumbersMBM0023
Oliver Slumbers, David Henry Mguni, Stefano B. Blumberg, Stephen Marcus McAleer, Yaodong Yang, Jun Wang:
A Game-Theoretic Framework for Managing Risk in Multi-Agent Systems. ICML 2023: 32059-32087
[c43]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/TangDMY23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/TangDMY23
Xiaohang Tang, Le Cong Dinh, Stephen Marcus McAleer, Yaodong Yang:
Regret-Minimizing Double Oracle for Extensive-Form Games. ICML 2023: 33599-33615
[c42]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/WangSH00W0M23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/WangSH00W0M23
Hanjing Wang, Man-Kit Sit, Congjie He, Ying Wen, Weinan Zhang, Jun Wang, Yaodong Yang, Luo Mai:
GEAR: A GPU-Centric Experience Replay System for Large Reinforcement Learning Models. ICML 2023: 36380-36390
[c41]
- view
  authority control:
- export record
  dblp key:
  - conf/icra/GengAGCYD23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icra/GengAGCYD23
Yiran Geng, Boshi An, Haoran Geng, Yuanpei Chen, Yaodong Yang, Hao Dong:
RLAfford: End-to-End Affordance Learning for Robotic Manipulation. ICRA 2023: 5880-5886
[c40]
- view
  authority control:
- export record
  dblp key:
  - conf/icra/LiLLGZYH23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icra/LiLLGZYH23
Puhao Li, Tengyu Liu, Yuyang Li, Yiran Geng, Yixin Zhu, Yaodong Yang, Siyuan Huang:
GenDexGrasp: Generalizable Dexterous Grasping. ICRA 2023: 8068-8074
[c39]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/JiLDPZB0SW023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/JiLDPZB0SW023
Jiaming Ji, Mickel Liu, Josef Dai, Xuehai Pan, Chi Zhang, Ce Bian, Boyuan Chen, Ruiyang Sun, Yizhou Wang, Yaodong Yang:
BeaverTails: Towards Improved Safety Alignment of LLM via a Human-Preference Dataset. NeurIPS 2023
[c38]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/JiZZP0SGZD023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/JiZZP0SGZD023
Jiaming Ji, Borong Zhang, Jiayi Zhou, Xuehai Pan, Weidong Huang, Ruiyang Sun, Yiran Geng, Yifan Zhong, Josef Dai, Yaodong Yang:
Safety Gymnasium: A Unified Safe Reinforcement Learning Benchmark. NeurIPS 2023
[c37]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/McAleerFZW0S23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/McAleerFZW0S23
Stephen McAleer, Gabriele Farina, Gaoyue Zhou, Mingzhi Wang, Yaodong Yang, Tuomas Sandholm:
Team-PSRO for Learning Approximate TMECor in Large Team Games via Cooperative Reinforcement Learning. NeurIPS 2023
[c36]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/Yang0LZL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/Yang0LZL23
Mingyu Yang, Yaodong Yang, Zhenbo Lu, Wengang Zhou, Houqiang Li:
Hierarchical Multi-Agent Skill Discovery. NeurIPS 2023
[c35]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/YaoLF0MF023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/YaoLF0MF023
Jian Yao, Weiming Liu, Haobo Fu, Yaodong Yang, Stephen McAleer, Qiang Fu, Wei Yang:
Policy Space Diversity for Non-Transitive Games. NeurIPS 2023
[c34]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/Zhao0LZL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/Zhao0LZL23
Youpeng Zhao, Yaodong Yang, Zhenbo Lu, Wengang Zhou, Houqiang Li:
Multi-Agent First Order Constrained Optimization in Policy Space. NeurIPS 2023
[c33]
- view
  - electronic edition @ usenix.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/usenix/Zhu0CCCS0PC23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/usenix/Zhu0CCCS0PC23
Huanzhou Zhu, Bo Zhao, Gang Chen, Weifeng Chen, Yijie Chen, Liang Shi, Yaodong Yang, Peter R. Pietzuch, Lei Chen:
MSRL: Distributed Reinforcement Learning with Dataflow Fragments. USENIX ATC 2023: 977-993
[i85]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2302-05910
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-05910
David Mguni, Taher Jafferjee, Haojun Chen, Jianhong Wang, Long Fei, Xidong Feng, Stephen McAleer, Feifei Tong, Jun Wang, Yaodong Yang:
MANSA: Learning Fast and Slow in Multi-Agent Systems. CoRR abs/2302.05910 (2023)
[i84]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2302-13137
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-13137
Shangding Gu, Alap Kshirsagar, Yali Du, Guang Chen, Yaodong Yang, Jan Peters, Alois C. Knoll:
A Human-Centered Safe Robot Reinforcement Learning Framework with Interactive Behaviors. CoRR abs/2302.13137 (2023)
[i83]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2303-00466
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2303-00466
Chenguang Wang, Zhouliang Yu, Stephen McAleer, Tianshu Yu, Yaodong Yang:
ASP: Learn a Universal Neural Solver! CoRR abs/2303.00466 (2023)
[i82]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2304-00464
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2304-00464
Weikang Wan, Haoran Geng, Yun Liu, Zikang Shan, Yaodong Yang, Li Yi, He Wang:
UniDexGrasp++: Improving Dexterous Grasping Policy Learning via Geometry-aware Curriculum and Iterative Generalist-Specialist Learning. CoRR abs/2304.00464 (2023)
[i81]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2304-07520
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2304-07520
Sirui Chen, Zhaowei Zhang, Yali Du, Yaodong Yang:
STAS: Spatial-Temporal Return Decomposition for Multi-agent Reinforcement Learning. CoRR abs/2304.07520 (2023)
[i80]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2304-09870
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2304-09870
Yifan Zhong, Jakub Grudzien Kuba, Siyi Hu, Jiaming Ji, Yaodong Yang:
Heterogeneous-Agent Reinforcement Learning. CoRR abs/2304.09870 (2023)
[i79]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2304-10498
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2304-10498
Xiaohang Tang, Le Cong Dinh, Stephen Marcus McAleer, Yaodong Yang:
Regret-Minimizing Double Oracle for Extensive-Form Games. CoRR abs/2304.10498 (2023)
[i78]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-09304
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-09304
Jiaming Ji, Jiayi Zhou, Borong Zhang, Juntao Dai, Xuehai Pan, Ruiyang Sun, Weidong Huang, Yiran Geng, Mickel Liu, Yaodong Yang:
OmniSafe: An Infrastructure for Accelerating Safe Reinforcement Learning Research. CoRR abs/2305.09304 (2023)
[i77]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-12872
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-12872
Simin Li, Jun Guo, Jingqiao Xiu, Xini Yu, Jiakai Wang, Aishan Liu, Yaodong Yang, Xianglong Liu:
Byzantine Robust Cooperative Multi-Agent Reinforcement Learning as a Bayesian Game. CoRR abs/2305.12872 (2023)
[i76]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-17147
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-17147
Zhaowei Zhang, Nian Liu, Siyuan Qi, Ceyao Zhang, Ziqi Rong, Song-Chun Zhu, Shuguang Cui, Yaodong Yang:
Heterogeneous Value Evaluation for Large Language Models. CoRR abs/2305.17147 (2023)
[i75]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-10698
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-10698
Yonggang Jin, Chenxu Wang, Liuyu Xiang, Yaodong Yang, Jie Fu, Zhaofeng He:
Deep Reinforcement Learning with Multitask Episodic Memory Based on Task-Conditioned Hypernetwork. CoRR abs/2306.10698 (2023)
[i74]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-10715
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-10715
Jiarong Liu, Yifan Zhong, Siyi Hu, Haobo Fu, Qiang Fu, Xiaojun Chang, Yaodong Yang:
Maximum Entropy Heterogeneous-Agent Mirror Learning. CoRR abs/2306.10715 (2023)
[i73]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-13945
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-13945
Muning Wen, Runji Lin, Hanjing Wang, Yaodong Yang, Ying Wen, Luo Mai, Jun Wang, Haifeng Zhang, Weinan Zhang:
Large Sequence Models for Sequential Decision-Making: A Survey. CoRR abs/2306.13945 (2023)
[i72]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-16884
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-16884
Jian Yao, Weiming Liu, Haobo Fu, Yaodong Yang, Stephen McAleer, Qiang Fu, Wei Yang:
Policy Space Diversity for Non-Transitive Games. CoRR abs/2306.16884 (2023)
[i71]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2307-04657
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2307-04657
Jiaming Ji, Mickel Liu, Juntao Dai, Xuehai Pan, Chi Zhang, Ce Bian, Boyuan Zhang, Ruiyang Sun, Yizhou Wang, Yaodong Yang:
BeaverTails: Towards Improved Safety Alignment of LLM via a Human-Preference Dataset. CoRR abs/2307.04657 (2023)
[i70]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2307-07176
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2307-07176
Weidong Huang, Jiaming Ji, Borong Zhang, Chunhe Xia, Yaodong Yang:
Safe DreamerV3: Safe Reinforcement Learning with World Models. CoRR abs/2307.07176 (2023)
[i69]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2307-12933
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2307-12933
Chuming Li, Ruonan Jia, Jie Liu, Yinmin Zhang, Yazhe Niu, Yaodong Yang, Yu Liu, Wanli Ouyang:
Theoretically Guaranteed Policy Improvement Distilled from Model-Based Planning. CoRR abs/2307.12933 (2023)
[i68]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2308-04719
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2308-04719
Yang Li, Kun Xiong, Yingping Zhang, Jiangcheng Zhu, Stephen McAleer, Wei Pan, Jun Wang, Zonghong Dai, Yaodong Yang:
JiangJun: Mastering Xiangqi by Tackling Non-Transitivity in Two-Player Zero-Sum Games. CoRR abs/2308.04719 (2023)
[i67]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2308-11339
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2308-11339
Ceyao Zhang, Kaijie Yang, Siyi Hu, Zihao Wang, Guanghe Li, Yihang Sun, Cheng Zhang, Zhaowei Zhang, Anji Liu, Song-Chun Zhu, Xiaojun Chang, Junge Zhang, Feng Yin, Yitao Liang, Yaodong Yang:
ProAgent: Building Proactive Cooperative AI with Large Language Models. CoRR abs/2308.11339 (2023)
[i66]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2308-15116
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2308-15116
Jingbang Chen, Yian Wang, Xingwei Qu, Shuangjia Zheng, Yaodong Yang, Hao Dong, Jie Fu:
Mixup-Augmented Meta-Learning for Sample-Efficient Fine-Tuning of Protein Simulators. CoRR abs/2308.15116 (2023)
[i65]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-05655
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-05655
Binghao Huang, Yuanpei Chen, Tianyu Wang, Yuzhe Qin, Yaodong Yang, Nikolay Atanasov, Xiaolong Wang:
Dynamic Handover: Throw and Catch with Bimanual Hands. CoRR abs/2309.05655 (2023)
[i64]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-00322
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-00322
Chengdong Ma, Ziran Yang, Minquan Gao, Hai Ci, Jun Gao, Xuehai Pan, Yaodong Yang:
Red Teaming Game: A Game-Theoretic Framework for Red Teaming Language Models. CoRR abs/2310.00322 (2023)
[i63]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-00378
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-00378
Zhaowei Zhang, Fengshuo Bai, Jun Gao, Yaodong Yang:
Measuring Value Understanding in Language Models through Discriminator-Critique Gap. CoRR abs/2310.00378 (2023)
[i62]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-05205
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-05205
Hanjing Wang, Man-Kit Sit, Congjie He, Ying Wen, Weinan Zhang, Jun Wang, Yaodong Yang, Luo Mai:
GEAR: A GPU-Centric Experience Replay System for Large Reinforcement Learning Models. CoRR abs/2310.05205 (2023)
[i61]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-09833
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-09833
Simin Li, Ruixiao Xu, Jun Guo, Pu Feng, Jiakai Wang, Aishan Liu, Yaodong Yang, Xianglong Liu, Weifeng Lv:
MIR2: Towards Provably Robust Multi-Agent Reinforcement Learning by Mutual Information Regularization. CoRR abs/2310.09833 (2023)
[i60]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-11846
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-11846
Jie Liu, Yinmin Zhang, Chuming Li, Chao Yang, Yaodong Yang, Yu Liu, Wanli Ouyang:
Masked Pretraining for Multi-Agent Decision Making. CoRR abs/2310.11846 (2023)
[i59]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-12567
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-12567
Jiaming Ji, Borong Zhang, Jiayi Zhou, Xuehai Pan, Weidong Huang, Ruiyang Sun, Yiran Geng, Yifan Zhong, Juntao Dai, Yaodong Yang:
Safety-Gymnasium: A Unified Safe Reinforcement Learning Benchmark. CoRR abs/2310.12567 (2023)
[i58]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-12773
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-12773
Josef Dai, Xuehai Pan, Ruiyang Sun, Jiaming Ji, Xinbo Xu, Mickel Liu, Yizhou Wang, Yaodong Yang:
Safe RLHF: Safe Reinforcement Learning from Human Feedback. CoRR abs/2310.12773 (2023)
[i57]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-15599
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-15599
Yuyang Li, Bo Liu, Yiran Geng, Puhao Li, Yaodong Yang, Yixin Zhu, Tengyu Liu, Siyuan Huang:
Grasp Multiple Objects with One Hand. CoRR abs/2310.15599 (2023)
[i56]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-19852
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-19852
Jiaming Ji, Tianyi Qiu, Boyuan Chen, Borong Zhang, Hantao Lou, Kaile Wang, Yawen Duan, Zhonghao He, Jiayi Zhou, Zhaowei Zhang, Fanzhi Zeng, Kwan Yee Ng, Juntao Dai, Xuehai Pan, Aidan O'Gara, Yingshan Lei, Hua Xu, Brian Tse, Jie Fu, Stephen McAleer, Yaodong Yang, Yizhou Wang, Song-Chun Zhu, Yike Guo, Wen Gao:
AI Alignment: A Comprehensive Survey. CoRR abs/2310.19852 (2023)
[i55]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2311-05997
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2311-05997
Zihao Wang, Shaofei Cai, Anji Liu, Yonggang Jin, Jinbing Hou, Bowei Zhang, Haowei Lin, Zhaofeng He, Zilong Zheng, Yaodong Yang, Xiaojian Ma, Yitao Liang:
JARVIS-1: Open-World Multi-task Agents with Memory-Augmented Multimodal Language Models. CoRR abs/2311.05997 (2023)
[i54]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-07685
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-07685
Yinmin Zhang, Jie Liu, Chuming Li, Yazhe Niu, Yaodong Yang, Yu Liu, Wanli Ouyang:
A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning. CoRR abs/2312.07685 (2023)
2022
[j5]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/algorithms/SanjayaWY22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/algorithms/SanjayaWY22
Ricky Sanjaya, Jun Wang, Yaodong Yang:
Measuring the Non-Transitivity in Chess. Algorithms 15(5): 152 (2022)
[j4]
- view
  authority control:
- export record
  dblp key:
  - journals/jossac/ZengZLY22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jossac/ZengZLY22
Qingduo Zeng, Qiang Zhang, Shancun Liu, Yaodong Yang:
Illiquidity Comovement and Market Crisis. J. Syst. Sci. Complex. 35(5): 1863-1874 (2022)
[j3]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - journals/tmlr/DinhMTNSMWBY22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tmlr/DinhMTNSMWBY22
Le Cong Dinh, Stephen Marcus McAleer, Zheng Tian, Nicolas Perez Nieves, Oliver Slumbers, David Henry Mguni, Jun Wang, Haitham Bou-Ammar, Yaodong Yang:
Online Double Oracle. Trans. Mach. Learn. Res. 2022 (2022)
[c32]
- view
  authority control:
- export record
  dblp key:
  - conf/dai2/WenCYLTCW22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/dai2/WenCYLTCW22
Ying Wen, Hui Chen, Yaodong Yang, Minne Li, Zheng Tian, Xu Chen, Jun Wang:
A Game-Theoretic Approach to Multi-agent Trust Region Optimization. DAI 2022: 74-87
[c31]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/KubaCWWSW022
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/KubaCWWSW022
Jakub Grudzien Kuba, Ruiqing Chen, Muning Wen, Ying Wen, Fanglei Sun, Jun Wang, Yaodong Yang:
Trust Region Policy Optimisation in Multi-Agent Reinforcement Learning. ICLR 2022
[c30]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/MguniJWNSTLZ0W22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/MguniJWNSTLZ0W22
David Henry Mguni, Taher Jafferjee, Jianhong Wang, Nicolas Perez Nieves, Oliver Slumbers, Feifei Tong, Yang Li, Jiangcheng Zhu, Yaodong Yang, Jun Wang:
LIGS: Learnable Intrinsic-Reward Generation Selection for Multi-Agent Learning. ICLR 2022
[c29]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/ijcai/0001DLM0Y022
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/0001DLM0Y022
Yurong Chen, Xiaotie Deng, Chenchen Li, David Mguni, Jun Wang, Xiang Yan, Yaodong Yang:
On the Convergence of Fictitious Play: A Decomposition Approach. IJCAI 2022: 179-185
[c28]
- view
  authority control:
- export record
  dblp key:
  - conf/iros/DuMLLDW022
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iros/DuMLLDW022
Yali Du, Chengdong Ma, Yuchen Liu, Runji Lin, Hao Dong, Jun Wang, Yaodong Yang:
Scalable Model-based Policy Optimization for Decentralized Networked Systems. IROS 2022: 9019-9026
[c27]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/0039FRMZ00022
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/0039FRMZ00022
Bo Liu, Xidong Feng, Jie Ren, Luo Mai, Rui Zhu, Haifeng Zhang, Jun Wang, Yaodong Yang:
A Theoretical Understanding of Gradient Bias in Meta-Reinforcement Learning. NeurIPS 2022
[c26]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/ChenWWFJLMDZY22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/ChenWWFJLMDZY22
Yuanpei Chen, Tianhao Wu, Shengjie Wang, Xidong Feng, Jiechuan Jiang, Zongqing Lu, Stephen McAleer, Hao Dong, Song-Chun Zhu, Yaodong Yang:
Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning. NeurIPS 2022
[c25]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/LiuBD022
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/LiuBD022
Runze Liu, Fengshuo Bai, Yali Du, Yaodong Yang:
Meta-Reward-Net: Implicitly Differentiable Reward Learning for Preference-based Reinforcement Learning. NeurIPS 2022
[c24]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/PanLZ0Z022
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/PanLZ0Z022
Xuehai Pan, Mickel Liu, Fangwei Zhong, Yaodong Yang, Song-Chun Zhu, Yizhou Wang:
MATE: Benchmarking Multi-Agent Reinforcement Learning in Distributed Target Coverage Control. NeurIPS 2022
[c23]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/WenKL000022
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/WenKL000022
Muning Wen, Jakub Grudzien Kuba, Runji Lin, Weinan Zhang, Ying Wen, Jun Wang, Yaodong Yang:
Multi-Agent Reinforcement Learning is a Sequence Modeling Problem. NeurIPS 2022
[c22]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/YangJDZZL0022
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/YangJDZZL0022
Long Yang, Jiaming Ji, Juntao Dai, Linrui Zhang, Binbin Zhou, Pengfei Li, Yaodong Yang, Gang Pan:
Constrained Update Projection Approach to Safe Policy Optimization. NeurIPS 2022
[c21]
- view
  authority control:
- export record
  dblp key:
  - conf/wise/ZhuSWYX22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/wise/ZhuSWYX22
Zhitao Zhu, Shijing Si, Jianzong Wang, Yaodong Yang, Jing Xiao:
Debias the Black-Box: A Fair Ranking Framework via Knowledge Distillation. WISE 2022: 395-405
[i53]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2202-00633
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2202-00633
Ming Zhou, Jingxiao Chen, Ying Wen, Weinan Zhang, Yaodong Yang, Yong Yu:
Efficient Policy Space Response Oracles. CoRR abs/2202.00633 (2022)
[i52]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2202-04862
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2202-04862
Juliusz Krysztof Ziomek, Jun Wang, Yaodong Yang:
Settling the Communication Complexity for Distributed Offline Reinforcement Learning. CoRR abs/2202.04862 (2022)
[i51]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2202-04868
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2202-04868
Zehao Dou, Jakub Grudzien Kuba, Yaodong Yang:
Understanding Value Decomposition Algorithms in Deep Cooperative Multi-Agent Reinforcement Learning. CoRR abs/2202.04868 (2022)
[i50]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2205-01469
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2205-01469
Yurong Chen, Xiaotie Deng, Chenchen Li, David Mguni, Jun Wang, Xiang Yan, Yaodong Yang:
On the Convergence of Fictitious Play: A Decomposition Approach. CoRR abs/2205.01469 (2022)
[i49]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2205-10330
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2205-10330
Shangding Gu, Long Yang, Yali Du, Guang Chen, Florian Walter, Jun Wang, Yaodong Yang, Alois C. Knoll:
A Review of Safe Reinforcement Learning: Methods, Theory and Applications. CoRR abs/2205.10330 (2022)
[i48]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2205-14953
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2205-14953
Muning Wen, Jakub Grudzien Kuba, Runji Lin, Weinan Zhang, Ying Wen, Jun Wang, Yaodong Yang:
Multi-Agent Reinforcement Learning is a Sequence Modeling Problem. CoRR abs/2205.14953 (2022)
[i47]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2205-15434
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2205-15434
Oliver Slumbers, David Henry Mguni, Stephen McAleer, Jun Wang, Yaodong Yang:
Learning Risk-Averse Equilibria in Multi-Agent Systems. CoRR abs/2205.15434 (2022)
[i46]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2206-08686
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-08686
Yuanpei Chen, Yaodong Yang, Tianhao Wu, Shengjie Wang, Xidong Feng, Jiechuang Jiang, Stephen Marcus McAleer, Hao Dong, Zongqing Lu, Song-Chun Zhu:
Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning. CoRR abs/2206.08686 (2022)
[i45]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2207-06559
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2207-06559
Yali Du, Chengdong Ma, Yuchen Liu, Runji Lin, Hao Dong, Jun Wang, Yaodong Yang:
Fully Decentralized Model-based Policy Optimization for Networked Systems. CoRR abs/2207.06559 (2022)
[i44]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2208-01682
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2208-01682
Jakub Grudzien Kuba, Xidong Feng, Shiyao Ding, Hao Dong, Jun Wang, Yaodong Yang:
Heterogeneous-Agent Mirror Learning: A Continuum of Solutions to Cooperative MARL. CoRR abs/2208.01682 (2022)
[i43]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2208-11628
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2208-11628
Zhitao Zhu, Shijing Si, Jianzong Wang, Yaodong Yang, Jing Xiao:
Debias the Black-box: A Fair Ranking Framework via Knowledge Distillation. CoRR abs/2208.11628 (2022)
[i42]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2209-07089
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2209-07089
Long Yang, Jiaming Ji, Juntao Dai, Linrui Zhang, Binbin Zhou, Pengfei Li, Yaodong Yang, Gang Pan:
Constrained Update Projection Approach to Safe Policy Optimization. CoRR abs/2209.07089 (2022)
[i41]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2209-12941
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2209-12941
Yiran Geng, Boshi An, Haoran Geng, Yuanpei Chen, Yaodong Yang, Hao Dong:
End-to-End Affordance Learning for Robotic Manipulation. CoRR abs/2209.12941 (2022)
[i40]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-00722
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-00722
Puhao Li, Tengyu Liu, Yuyang Li, Yiran Geng, Yixin Zhu, Yaodong Yang, Siyuan Huang:
GenDexGrasp: Generalizable Dexterous Grasping. CoRR abs/2210.00722 (2022)
[i39]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-00882
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-00882
Huanzhou Zhu, Bo Zhao, Gang Chen, Weifeng Chen, Yijie Chen, Liang Shi, Yaodong Yang, Peter R. Pietzuch, Lei Chen:
MSRL: Distributed Reinforcement Learning with Dataflow Fragments. CoRR abs/2210.00882 (2022)
[i38]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-13708
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-13708
Siyi Hu, Yifan Zhong, Minquan Gao, Weixun Wang, Hao Dong, Zhihui Li, Xiaodan Liang, Xiaojun Chang, Yaodong Yang:
MARLlib: Extending RLlib for Multi-agent Reinforcement Learning. CoRR abs/2210.13708 (2022)
[i37]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-06934
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-06934
Jie Ren, Xidong Feng, Bo Liu, Xuehai Pan, Yao Fu, Luo Mai, Yaodong Yang:
TorchOpt: An Efficient Library for Differentiable Optimization. CoRR abs/2211.06934 (2022)
[i36]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-08016
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-08016
Runji Lin, Ye Li, Xidong Feng, Zhaowei Zhang, Xian Hong Wu Fung, Haifeng Zhang, Jun Wang, Yali Du, Yaodong Yang:
Contextual Transformer for Offline Meta Reinforcement Learning. CoRR abs/2211.08016 (2022)
[i35]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-16068
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-16068
Chuming Li, Jie Liu, Yinmin Zhang, Yuhong Wei, Yazhe Niu, Yaodong Yang, Yu Liu, Wanli Ouyang:
ACE: Cooperative Multi-agent Q-learning with Bidirectional Action-Dependency. CoRR abs/2211.16068 (2022)
2021
[b1]
- view
  - electronic edition @ bl.uk
  - details & citations
- export record
  dblp key:
  - phd/ethos/Yang21a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/phd/ethos/Yang21a
Yaodong Yang:
Many-agent reinforcement learning. University College London (University of London), UK, 2021
[c20]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/MguniWDYWLWJW21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/MguniWDYWLWJW21
David Henry Mguni, Yutong Wu, Yali Du, Yaodong Yang, Ziyi Wang, Minne Li, Ying Wen, Joel Jennings, Jun Wang:
Learning in Nonzero-Sum Stochastic Games with Potentials. ICML 2021: 7688-7699
[c19]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/NievesYSMWW21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/NievesYSMWW21
Nicolas Perez Nieves, Yaodong Yang, Oliver Slumbers, David Henry Mguni, Ying Wen, Jun Wang:
Modelling Behavioural Diversity for Learning in Open-Ended Games. ICML 2021: 8514-8524
[c18]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/CaggianoDWCMTPPSMHGAZJCDYSDKPHTMSSSK21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/CaggianoDWCMTPPSMHGAZJCDYSDKPHTMSSSK21
Vittorio Caggiano, Guillaume Durandau, Huawei Wang, Alberto Silvio Chiappa, Alexander Mathis, Pablo Tano, Nisheet Patel, Alexandre Pouget, Pierre Schumacher, Georg Martius, Daniel F. B. Haeufle, Yiran Geng, Boshi An, Yifan Zhong, Jiaming Ji, Yuanpei Chen, Hao Dong, Yaodong Yang, Rahul Siripurapu, Luis Eduardo Ferro Diez, Michael Kopp, Vihang Patil, Sepp Hochreiter, Yuval Tassa, Josh Merel, Randy Schultheis, Seungmoon Song, Massimo Sartori, Vikash Kumar:
MyoChallenge 2022: Learning contact-rich manipulation using a musculoskeletal hand. NeurIPS (Competition and Demos) 2021: 233-250
[c17]
- view
  - electronic edition @ neurips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/LiuJWHCFHY21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/LiuJWHCFHY21
Xiangyu Liu, Hangtian Jia, Ying Wen, Yujing Hu, Yingfeng Chen, Changjie Fan, Zhipeng Hu, Yaodong Yang:
Towards Unifying Behavioral and Response Diversity for Open-ended Learning in Zero-sum Games. NeurIPS 2021: 941-952
[c16]
- view
  - electronic edition @ neurips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/FengSWLMWWY21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/FengSWLMWWY21
Xidong Feng, Oliver Slumbers, Ziyu Wan, Bo Liu, Stephen McAleer, Ying Wen, Jun Wang, Yaodong Yang:
Neural Auto-Curricula in Two-Player Zero-Sum Games. NeurIPS 2021: 3504-3517
[c15]
- view
  - electronic edition @ neurips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/KubaWMGZMWY21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/KubaWMGZMWY21
Jakub Grudzien Kuba, Muning Wen, Linghui Meng, Shangding Gu, Haifeng Zhang, David Mguni, Jun Wang, Yaodong Yang:
Settling the Variance of Multi-Agent Policy Gradients. NeurIPS 2021: 13458-13470
[i34]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2103-07780
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2103-07780
Le Cong Dinh, Yaodong Yang, Zheng Tian, Nicolas Perez Nieves, Oliver Slumbers, David Henry Mguni, Haitham Bou-Ammar, Jun Wang:
Online Double Oracle. CoRR abs/2103.07780 (2021)
[i33]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2103-07927
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2103-07927
Nicolas Perez Nieves, Yaodong Yang, Oliver Slumbers, David Henry Mguni, Jun Wang:
Modelling Behavioural Diversity for Learning in Open-Ended Games. CoRR abs/2103.07927 (2021)
[i32]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2103-09159
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2103-09159
David Mguni, Jianhong Wang, Taher Jafferjee, Nicolas Perez Nieves, Wenbin Song, Yaodong Yang, Feifei Tong, Hui Chen, Jiangcheng Zhu, Yali Du, Jun Wang:
Learning to Shape Rewards using a Game of Switching Controls. CoRR abs/2103.09159 (2021)
[i31]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2103-09284
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2103-09284
David Mguni, Yutong Wu, Yali Du, Yaodong Yang, Ziyi Wang, Minne Li, Ying Wen, Joel Jennings, Jun Wang:
Learning in Nonzero-Sum Stochastic Games with Potentials. CoRR abs/2103.09284 (2021)
[i30]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2106-02745
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2106-02745
Xidong Feng, Oliver Slumbers, Yaodong Yang, Ziyu Wan, Bo Liu, Stephen McAleer, Ying Wen, Jun Wang:
Discovering Multi-Agent Auto-Curricula in Two-Player Zero-Sum Games. CoRR abs/2106.02745 (2021)
[i29]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2106-04958
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2106-04958
Xiangyu Liu, Hangtian Jia, Ying Wen, Yaodong Yang, Yujing Hu, Yingfeng Chen, Changjie Fan, Zhipeng Hu:
Unifying Behavioral and Response Diversity for Open-ended Learning in Zero-sum Games. CoRR abs/2106.04958 (2021)
[i28]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2106-06828
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2106-06828
Ying Wen, Hui Chen, Yaodong Yang, Zheng Tian, Minne Li, Xu Chen, Jun Wang:
A Game-Theoretic Approach to Multi-Agent Trust Region Optimization. CoRR abs/2106.06828 (2021)
[i27]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2106-07551
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2106-07551
Ming Zhou, Ziyu Wan, Hanjing Wang, Muning Wen, Runzhe Wu, Ying Wen, Yaodong Yang, Weinan Zhang, Jun Wang:
MALib: A Parallel Framework for Population-based Multi-agent Reinforcement Learning. CoRR abs/2106.07551 (2021)
[i26]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2108-08612
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2108-08612
Jakub Grudzien Kuba, Muning Wen, Yaodong Yang, Linghui Meng, Shangding Gu, Haifeng Zhang, David Henry Mguni, Jun Wang:
Settling the Variance of Multi-Agent Policy Gradients. CoRR abs/2108.08612 (2021)
[i25]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2109-01795
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2109-01795
Xiaotie Deng, Yuhao Li, David Henry Mguni, Jun Wang, Yaodong Yang:
On the Complexity of Computing Markov Perfect Equilibrium in General-Sum Stochastic Games. CoRR abs/2109.01795 (2021)
[i24]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2109-09833
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2109-09833
Yixin Wu, Rui Luo, Chen Zhang, Jun Wang, Yaodong Yang:
Revisiting the Characteristics of Stochastic Gradient Noise and Dynamics. CoRR abs/2109.09833 (2021)
[i23]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2109-11251
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2109-11251
Jakub Grudzien Kuba, Ruiqing Chen, Muning Wen, Ying Wen, Fanglei Sun, Jun Wang, Yaodong Yang:
Trust Region Policy Optimisation in Multi-Agent Reinforcement Learning. CoRR abs/2109.11251 (2021)
[i22]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-02793
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-02793
Shangding Gu, Jakub Grudzien Kuba, Muning Wen, Ruiqing Chen, Ziyan Wang, Zheng Tian, Jun Wang, Alois C. Knoll, Yaodong Yang:
Multi-Agent Constrained Policy Optimisation. CoRR abs/2110.02793 (2021)
[i21]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-03604
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-03604
Le Cong Dinh, David Henry Mguni, Long Tran-Thanh, Jun Wang, Yaodong Yang:
Online Markov Decision Processes with Non-oblivious Strategic Adversary. CoRR abs/2110.03604 (2021)
[i20]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-11737
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-11737
Ricky Sanjaya, Jun Wang, Yaodong Yang:
Measuring the Non-Transitivity in Chess. CoRR abs/2110.11737 (2021)
[i19]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-14468
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-14468
David Mguni, Joel Jennings, Taher Jafferjee, Aivar Sootla, Yaodong Yang, Changmin Yu, Usman Islam, Ziyan Wang, Jun Wang:
DESTA: A Framework for Safe Reinforcement Learning with Markov Games of Intervention. CoRR abs/2110.14468 (2021)
[i18]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-15105
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-15105
Chenguang Wang, Yaodong Yang, Oliver Slumbers, Congying Han, Tiande Guo, Haifeng Zhang, Jun Wang:
A Game-Theoretic Approach for Improving Generalization Ability of TSP Solvers. CoRR abs/2110.15105 (2021)
[i17]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2112-02618
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2112-02618
David Henry Mguni, Taher Jafferjee, Jianhong Wang, Nicolas Perez Nieves, Oliver Slumbers, Feifei Tong, Yang Li, Jiangcheng Zhu, Yaodong Yang, Jun Wang:
LIGS: Learnable Intrinsic-Reward Generation Selection for Multi-Agent Learning. CoRR abs/2112.02618 (2021)
[i16]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2112-02845
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2112-02845
Linghui Meng, Muning Wen, Yaodong Yang, Chenyang Le, Xiyun Li, Weinan Zhang, Ying Wen, Haifeng Zhang, Jun Wang, Bo Xu:
Offline Pre-trained Multi-Agent Decision Transformer: One Big Sequence Model Tackles All SMAC Tasks. CoRR abs/2112.02845 (2021)
[i15]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2112-15400
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2112-15400
Bo Liu, Xidong Feng, Haifeng Zhang, Jun Wang, Yaodong Yang:
Settling the Bias and Variance of Meta-Gradient Estimation for Meta-Reinforcement Learning. CoRR abs/2112.15400 (2021)
[i14]
- view
  - electronic edition @ weizmann.ac.il (open access)
  - details & citations
- export record
  dblp key:
  - journals/eccc/DengLMWY21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/eccc/DengLMWY21
Xiaotie Deng, Yuhao Li, David Mguni, Jun Wang, Yaodong Yang:
On the Complexity of Computing Markov Perfect Equilibrium in General-Sum Stochastic Games. Electron. Colloquium Comput. Complex. TR21 (2021)
2020
[j2]
- view
  authority control:
- export record
  dblp key:
  - journals/eor/KimYLMSJ20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/eor/KimYLMSJ20
Alisa Kim, Yaodong Yang, Stefan Lessmann, Tiejun Ma, Ming-Chien Sung, Johnnie E. V. Johnson:
Can deep learning predict risky retail investors? A case study in financial risk behavior forecasting. Eur. J. Oper. Res. 283(1): 217-234 (2020)
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/jossac/ZhangWLY20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jossac/ZhangWLY20
Qiang Zhang, Chao Wang, Shancun Liu, Yaodong Yang:
Order Execution Probability and Order Queue in Limit Order Markets. J. Syst. Sci. Complex. 33(5): 1545-1557 (2020)
[c14]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/Zhang0HLY0W20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/Zhang0HLY0W20
Haifeng Zhang, Weizhe Chen, Zeren Huang, Minne Li, Yaodong Yang, Weinan Zhang, Jun Wang:
Bi-Level Actor-Critic for Multi-Agent Coordination. AAAI 2020: 7325-7332
[c13]
- view
  authority control:
- export record
  dblp key:
  - conf/atal/YangTSB20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/atal/YangTSB20
Yaodong Yang, Rasul Tutunov, Phu Sakulwongtana, Haitham Bou-Ammar:
αα-Rank: Practically Scaling α-Rank through Stochastic Optimisation. AAMAS 2020: 1575-1583
[c12]
- view
  authority control:
- export record
  dblp key:
  - conf/atal/PengJLYLWZXYLLX20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/atal/PengJLYLWZXYLLX20
Zhaoqing Peng, Junqi Jin, Lan Luo, Yaodong Yang, Rui Luo, Jun Wang, Weinan Zhang, Miao Xu, Chuan Yu, Tiejian Luo, Han Li, Jian Xu, Kun Gai:
Sequential Advertising Agent with Interpretable User Hidden Intents. AAMAS 2020: 1966-1968
[c11]
- view
  authority control:
- export record
  dblp key:
  - conf/cikm/PengJLYLWZXXYLL20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cikm/PengJLYLWZXXYLL20
Zhaoqing Peng, Junqi Jin, Lan Luo, Yaodong Yang, Rui Luo, Jun Wang, Weinan Zhang, Haiyang Xu, Miao Xu, Chuan Yu, Tiejian Luo, Han Li, Jian Xu, Kun Gai:
Learning to Infer User Hidden States for Online Sequential Advertising. CIKM 2020: 2677-2684
[c10]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/YangW0CSM020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/YangW0CSM020
Yaodong Yang, Ying Wen, Jun Wang, Liheng Chen, Kun Shao, David Mguni, Weinan Zhang:
Multi-Agent Determinantal Q-Learning. ICML 2020: 10757-10766
[c9]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/ijcai/WenYW20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/WenYW20
Ying Wen, Yaodong Yang, Jun Wang:
Modelling Bounded Rationality in Multi-Agent Interactions by Generalized Recursive Reasoning. IJCAI 2020: 414-421
[c8]
- view
  - electronic edition @ neurips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/LuoZY020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/LuoZY020
Rui Luo, Qiang Zhang, Yaodong Yang, Jun Wang:
Replica-Exchange Nosé-Hoover Dynamics for Bayesian Learning on Large Datasets. NeurIPS 2020
[i13]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2006-01482
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2006-01482
Yaodong Yang, Ying Wen, Liheng Chen, Jun Wang, Kun Shao, David Mguni, Weinan Zhang:
Multi-Agent Determinantal Q-Learning. CoRR abs/2006.01482 (2020)
[i12]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2009-01453
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2009-01453
Zhaoqing Peng, Junqi Jin, Lan Luo, Yaodong Yang, Rui Luo, Jun Wang, Weinan Zhang, Haiyang Xu, Miao Xu, Chuan Yu, Tiejian Luo, Han Li, Jian Xu, Kun Gai:
Learning to Infer User Hidden States for Online Sequential Advertising. CoRR abs/2009.01453 (2020)
[i11]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2011-00583
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2011-00583
Yaodong Yang, Jun Wang:
An Overview of Multi-Agent Reinforcement Learning from Game Theoretical Perspective. CoRR abs/2011.00583 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[c7]
- view
  authority control:
- export record
  dblp key:
  - conf/dai2/ZhouCWYS0ZW19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/dai2/ZhouCWYS0ZW19
Ming Zhou, Yong Chen, Ying Wen, Yaodong Yang, Yufeng Su, Weinan Zhang, Dell Zhang, Jun Wang:
Factorized Q-learning for large-scale multi-agent systems. DAI 2019: 7:1-7:7
[c6]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/YangLL19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/YangLL19
Yaodong Yang, Rui Luo, Yuanyuan Liu:
Adversarial Variational Bayes Methods for Tweedie Compound Poisson Mixed Models. ICASSP 2019: 3377-3381
[c5]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/WenYLWP19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/WenYLWP19
Ying Wen, Yaodong Yang, Rui Luo, Jun Wang, Wei Pan:
Probabilistic Recursive Reasoning for Multi-Agent Reinforcement Learning. ICLR (Poster) 2019
[c4]
- view
  authority control:
- export record
  dblp key:
  - conf/www/LiQJYWWWY19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/www/LiQJYWWWY19
Minne Li, Zhiwei (Tony) Qin, Yan Jiao, Yaodong Yang, Jun Wang, Chenxi Wang, Guobin Wu, Jieping Ye:
Efficient Ridesharing Order Dispatching with Mean Field Multi-Agent Reinforcement Learning. WWW 2019: 983-994
[i10]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1901-09207
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1901-09207
Ying Wen, Yaodong Yang, Rui Luo, Jun Wang, Wei Pan:
Probabilistic Recursive Reasoning for Multi-Agent Reinforcement Learning. CoRR abs/1901.09207 (2019)
[i9]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1901-09216
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1901-09216
Ying Wen, Yaodong Yang, Rui Lu, Jun Wang:
Multi-Agent Generalized Recursive Reasoning. CoRR abs/1901.09216 (2019)
[i8]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1901-11454
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1901-11454
Minne Li, Zhiwei (Tony) Qin, Yan Jiao, Yaodong Yang, Zhichen Gong, Jun Wang, Chenxi Wang, Guobin Wu, Jieping Ye:
Efficient Ridesharing Order Dispatching with Mean Field Multi-Agent Reinforcement Learning. CoRR abs/1901.11454 (2019)
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1905-12569
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1905-12569
Rui Luo, Qiang Zhang, Yaodong Yang, Jun Wang:
Replica-exchange Nosé-Hoover dynamics for Bayesian learning on large datasets. CoRR abs/1905.12569 (2019)
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1909-03510
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1909-03510
Haifeng Zhang, Weizhe Chen, Zeren Huang, Minne Li, Yaodong Yang, Weinan Zhang, Jun Wang:
Bi-level Actor-Critic for Multi-agent Coordination. CoRR abs/1909.03510 (2019)
2018
[c3]
- view
  - electronic edition @ acm.org
  - details & citations
- export record
  dblp key:
  - conf/atal/YangYBWZW18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/atal/YangYBWZW18
Yaodong Yang, Lantao Yu, Yiwei Bai, Ying Wen, Weinan Zhang, Jun Wang:
A Study of AI Population Dynamics with Million-agent Reinforcement Learning. AAMAS 2018: 2133-2135
[c2]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/YangLLZZW18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/YangLLZZW18
Yaodong Yang, Rui Luo, Minne Li, Ming Zhou, Weinan Zhang, Jun Wang:
Mean Field Multi-Agent Reinforcement Learning. ICML 2018: 5567-5576
[c1]
- view
- export record
  dblp key:
  - conf/nips/LuoWY0Z18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/LuoWY0Z18
Rui Luo, Jianhong Wang, Yaodong Yang, Jun Wang, Zhanxing Zhu:
Thermostat-assisted continuously-tempered Hamiltonian Monte Carlo for Bayesian learning. NeurIPS 2018: 10696-10705
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1802-05438
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1802-05438
Yaodong Yang, Rui Luo, Minne Li, Ming Zhou, Weinan Zhang, Jun Wang:
Mean Field Multi-Agent Reinforcement Learning. CoRR abs/1802.05438 (2018)
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1809-03738
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1809-03738
Yong Chen, Ming Zhou, Ying Wen, Yaodong Yang, Yufeng Su, Weinan Zhang, Dell Zhang, Jun Wang, Han Liu:
Factorized Q-Learning for Large-Scale Multi-Agent Systems. CoRR abs/1809.03738 (2018)
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1811-03711
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1811-03711
Qiang Zhang, Rui Luo, Yaodong Yang, Yuanyuan Liu:
Benchmarking Deep Sequential Models on Volatility Predictions for Financial Time Series. CoRR abs/1811.03711 (2018)
2017
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/PengYWYTLW17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/PengYWYTLW17
Peng Peng, Quan Yuan, Ying Wen, Yaodong Yang, Zhenkun Tang, Haitao Long, Jun Wang:
Multiagent Bidirectionally-Coordinated Nets for Learning to Play StarCraft Combat Games. CoRR abs/1703.10069 (2017)
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1709-04511
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1709-04511
Yaodong Yang, Lantao Yu, Yiwei Bai, Jun Wang, Weinan Zhang, Ying Wen, Yong Yu:
An Empirical Study of AI Population Dynamics with Million-agent Reinforcement Learning. CoRR abs/1709.04511 (2017)

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.