Qiwen Cui


2020 – today


2024
[c13]
Yan Dai, Qiwen Cui, Simon S. Du:
Refined Sample Complexity for Markov Games with Independent Linear Function Approximation (Extended Abstract). COLT 2024: 1260-1261
[c12]
Haozhe Jiang, Qiwen Cui, Zhihan Xiong, Maryam Fazel, Simon Shaolei Du:
A Black-box Approach for Non-stationary Multi-agent Reinforcement Learning. ICLR 2024
[c11]
Zhaoyi Zhou, Chuning Zhu, Runlong Zhou, Qiwen Cui, Abhishek Gupta, Simon Shaolei Du:
Free from Bellman Completeness: Trajectory Stitching via Model-based Return-conditioned Supervised Learning. ICLR 2024
[i16]
Yan Dai, Qiwen Cui, Simon S. Du:
Refined Sample Complexity for Markov Games with Independent Linear Function Approximation. CoRR abs/2402.07082 (2024)
[i15]
Qiwen Cui, Maryam Fazel, Simon S. Du:
Learning Optimal Tax Design in Nonatomic Congestion Games. CoRR abs/2402.07437 (2024)
[i14]
Yufeng Zhang, Liyu Chen, Boyi Liu, Yingxiang Yang, Qiwen Cui, Yunzhe Tao, Hongxia Yang:
(N, K)-Puzzle: A Cost-Efficient Testbed for Benchmarking Reinforcement Learning Algorithms in Generative Language Model. CoRR abs/2403.07191 (2024)
[i13]
Natalia Zhang, Xinqi Wang, Qiwen Cui, Runlong Zhou, Sham M. Kakade, Simon S. Du:
Multi-Agent Reinforcement Learning from Human Feedback: Data Coverage and Algorithmic Techniques. CoRR abs/2409.00717 (2024)
2023
[j1]
Minghan Yang, Dong Xu, Qiwen Cui, Zaiwen Wen, Pengxiang Xu:
An Efficient Fisher Matrix Approximation Method for Large-Scale Neural Network Optimization. IEEE Trans. Pattern Anal. Mach. Intell. 45(5): 5391-5403 (2023)
[c10]
Qiwen Cui, Kaiqing Zhang, Simon S. Du:
Breaking the Curse of Multiagents in a Large State Space: RL in Markov Games with Independent Linear Function Approximation. COLT 2023: 2651-2652
[c9]
Haozhe Jiang, Qiwen Cui, Zhihan Xiong, Maryam Fazel, Simon Shaolei Du:
Offline Congestion Games: How Feedback Type Affects Data Coverage Requirement. ICLR 2023
[i12]
Qiwen Cui, Kaiqing Zhang, Simon S. Du:
Breaking the Curse of Multiagents in a Large State Space: RL in Markov Games with Independent Linear Function Approximation. CoRR abs/2302.03673 (2023)
[i11]
Haozhe Jiang, Qiwen Cui, Zhihan Xiong, Maryam Fazel, Simon S. Du:
A Black-box Approach for Non-stationary Multi-agent Reinforcement Learning. CoRR abs/2306.07465 (2023)
[i10]
Zhaoyi Zhou, Chuning Zhu, Runlong Zhou, Qiwen Cui, Abhishek Gupta, Simon Shaolei Du:
Free from Bellman Completeness: Trajectory Stitching via Model-based Return-conditioned Supervised Learning. CoRR abs/2310.19308 (2023)
2022
[c8]
Qiwen Cui, Simon S. Du:
Provably Efficient Offline Multi-agent Reinforcement Learning via Strategy-wise Bonus. NeurIPS 2022
[c7]
Qiwen Cui, Simon S. Du:
When are Offline Two-Player Zero-Sum Markov Games Solvable? NeurIPS 2022
[c6]
Qiwen Cui, Zhihan Xiong, Maryam Fazel, Simon S. Du:
Learning in Congestion Games with Bandit Feedback. NeurIPS 2022
[c5]
Xinqi Wang, Qiwen Cui, Simon S. Du:
On Gap-dependent Bounds for Offline Reinforcement Learning. NeurIPS 2022
[c4]
Zhihan Xiong, Ruoqi Shen, Qiwen Cui, Maryam Fazel, Simon S. Du:
Near-Optimal Randomized Exploration for Tabular Markov Decision Processes. NeurIPS 2022
[i9]
Qiwen Cui, Simon S. Du:
When is Offline Two-Player Zero-Sum Markov Game Solvable? CoRR abs/2201.03522 (2022)
[i8]
Qiwen Cui, Simon S. Du:
Provably Efficient Offline Multi-agent Reinforcement Learning via Strategy-wise Bonus. CoRR abs/2206.00159 (2022)
[i7]
Xinqi Wang, Qiwen Cui, Simon S. Du:
On Gap-dependent Bounds for Offline Reinforcement Learning. CoRR abs/2206.00177 (2022)
[i6]
Qiwen Cui, Zhihan Xiong, Maryam Fazel, Simon S. Du:
Learning in Congestion Games with Bandit Feedback. CoRR abs/2206.01880 (2022)
[i5]
Haozhe Jiang, Qiwen Cui, Zhihan Xiong, Maryam Fazel, Simon S. Du:
Offline congestion games: How feedback type affects data coverage requirement. CoRR abs/2210.13396 (2022)
2021
[c3]
Haque Ishfaq, Qiwen Cui, Viet Nguyen, Alex Ayoub, Zhuoran Yang, Zhaoran Wang, Doina Precup, Lin Yang:
Randomized Exploration in Reinforcement Learning with General Value Function Approximation. ICML 2021: 4607-4616
[c2]
Qiwen Cui, Lin F. Yang:
Minimax sample complexity for turn-based stochastic game. UAI 2021: 1496-1504
[i4]
Minghan Yang, Dong Xu, Qiwen Cui, Zaiwen Wen, Pengxiang Xu:
NG+ : A Multi-Step Matrix-Product Natural Gradient Method for Deep Learning. CoRR abs/2106.07454 (2021)
[i3]
Haque Ishfaq, Qiwen Cui, Viet Nguyen, Alex Ayoub, Zhuoran Yang, Zhaoran Wang, Doina Precup, Lin F. Yang:
Randomized Exploration for Reinforcement Learning with General Value Function Approximation. CoRR abs/2106.07841 (2021)
2020
[c1]
Qiwen Cui, Lin F. Yang:
Is Plug-in Solver Sample-Efficient for Feature-based Reinforcement Learning? NeurIPS 2020
[i2]
Qiwen Cui, Lin F. Yang:
Is Plug-in Solver Sample-Efficient for Feature-based Reinforcement Learning? CoRR abs/2010.05673 (2020)
[i1]
Qiwen Cui, Lin F. Yang:
Minimax Sample Complexity for Turn-based Stochastic Game. CoRR abs/2011.14267 (2020)
