default search action

combined dblp search
author search
venue search
publication search

ask others

Siliang Zeng

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2025
[j3]
- view
  authority control:
- export record
  dblp key:
  - journals/ior/ZengHG25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ior/ZengHG25
Siliang Zeng, Mingyi Hong, Alfredo García:
Structural Estimation of Markov Decision Processes in High-Dimensional State Space with Finite-Time Guarantees. Oper. Res. 73(2): 720-737 (2025)
[j2]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - journals/tmlr/Cen0ZCRKF25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tmlr/Cen0ZCRKF25
Zhepeng Cen, Yao Liu, Siliang Zeng, Pratik Chaudhari, Huzefa Rangwala, George Karypis, Rasool Fakoor:
Bridging the Training-Inference Gap in LLMs by Leveraging Self-Generated Tokens. Trans. Mach. Learn. Res. 2025 (2025)
[c13]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/aistats/ZhangZLGH25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aistats/ZhangZLGH25
Ruijia Zhang, Siliang Zeng, Chenliang Li, Alfredo García, Mingyi Hong:
Understanding Inverse Reinforcement Learning under Overparameterization: Non-Asymptotic Analysis and Global Optimality. AISTATS 2025: 2944-2952
[c12]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/LiZLLK0025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/LiZLLK0025
Chenliang Li, Siliang Zeng, Zeyi Liao, Jiaxiang Li, Dongyeop Kang, Alfredo García, Mingyi Hong:
Joint Reward and Policy Learning with Demonstrations and Human Feedback Improves Alignment. ICLR 2025
[i13]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2503-13538
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2503-13538
Siliang Zeng, Yao Liu, Huzefa Rangwala, George Karypis, Mingyi Hong, Rasool Fakoor:
From Demonstrations to Rewards: Alignment Without Explicit Human Preferences. CoRR abs/2503.13538 (2025)
[i12]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2503-17865
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2503-17865
Ruijia Zhang, Siliang Zeng, Chenliang Li, Alfredo García, Mingyi Hong:
Understanding Inverse Reinforcement Learning under Overparameterization: Non-Asymptotic Analysis and Global Optimality. CoRR abs/2503.17865 (2025)
[i11]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2505-11821
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2505-11821
Siliang Zeng, Quan Wei, William Brown, Oana Frunza, Yuriy Nevmyvaka, Mingyi Hong:
Reinforcing Multi-Turn Reasoning in LLM Agents via Turn-Level Credit Assignment. CoRR abs/2505.11821 (2025)
[i10]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2506-17828
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2506-17828
Xinnan Zhang, Chenliang Li, Siliang Zeng, Jiaxiang Li, Zhongruo Wang, Kaixiang Lin, Songtao Lu, Alfredo García, Mingyi Hong:
Aligning Frozen LLMs by Reinforcement Learning: An Iterative Reweight-then-Optimize Approach. CoRR abs/2506.17828 (2025)
2024
[c11]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/LiZWLG024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/LiZWLG024
Jiaxiang Li, Siliang Zeng, Hoi-To Wai, Chenliang Li, Alfredo García, Mingyi Hong:
Getting More Juice Out of the SFT Data: Reward Learning from Human Demonstration Improves SFT for LLM Alignment. NeurIPS 2024
[i9]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2405-17888
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-17888
Jiaxiang Li, Siliang Zeng, Hoi-To Wai, Chenliang Li, Alfredo García, Mingyi Hong:
Getting More Juice Out of the SFT Data: Reward Learning from Human Demonstration Improves SFT for LLM Alignment. CoRR abs/2405.17888 (2024)
[i8]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-06874
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-06874
Chenliang Li, Siliang Zeng, Zeyi Liao, Jiaxiang Li, Dongyeop Kang, Alfredo García, Mingyi Hong:
Joint Demonstration and Preference Learning Improves Policy Alignment with Human Feedback. CoRR abs/2406.06874 (2024)
[i7]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2410-14655
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-14655
Zhepeng Cen, Yao Liu, Siliang Zeng, Pratik Chaudhari, Huzefa Rangwala, George Karypis, Rasool Fakoor:
Bridging the Training-Inference Gap in LLMs by Leveraging Self-Generated Tokens. CoRR abs/2410.14655 (2024)
2023
[c10]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/corl/WeiZLGMH23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/corl/WeiZLGMH23
Ran Wei, Siliang Zeng, Chenliang Li, Alfredo García, Anthony D. McDonald, Mingyi Hong:
A Bayesian Approach to Robust Inverse Reinforcement Learning. CoRL 2023: 2304-2322
[c9]
- view
  authority control:
- export record
  dblp key:
  - conf/icaice/WangZXZWW23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icaice/WangZXZWW23
Hao Wang, Siliang Zeng, Zhenwei Xue, Xinglin Zhu, Fei Wu, Ning Wang:
ProfLLM: A framework for adapting offline large language models to few-shot expert knowledge. ICAICE 2023: 708-714
[c8]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/ZengLGH23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/ZengLGH23
Siliang Zeng, Chenliang Li, Alfredo García, Mingyi Hong:
When Demonstrations meet Generative World Models: A Maximum Likelihood Framework for Offline Inverse Reinforcement Learning. NeurIPS 2023
[i6]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2302-07457
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-07457
Siliang Zeng, Chenliang Li, Alfredo García, Mingyi Hong:
Understanding Expertise through Demonstrations: A Maximum Likelihood Framework for Offline Inverse Reinforcement Learning. CoRR abs/2302.07457 (2023)
[i5]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-08571
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-08571
Ran Wei, Siliang Zeng, Chenliang Li, Alfredo García, Anthony D. McDonald, Mingyi Hong:
A Bayesian Approach to Robust Inverse Reinforcement Learning. CoRR abs/2309.08571 (2023)
2022
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/siamjo/HongZZS22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/siamjo/HongZZS22
Mingyi Hong, Siliang Zeng, Junyu Zhang, Haoran Sun:
On the Divergence of Decentralized Nonconvex Optimization. SIAM J. Optim. 32(4): 2879-2908 (2022)
[c7]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/l4dc/ZengCGH22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/l4dc/ZengCGH22
Siliang Zeng, Tianyi Chen, Alfredo García, Mingyi Hong:
Learning to Coordinate in Multi-Agent Systems: A Coordinated Actor-Critic Algorithm and Finite-Time Guarantees. L4DC 2022: 278-290
[c6]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/LuZCSHK0H22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/LuZCSHK0H22
Songtao Lu, Siliang Zeng, Xiaodong Cui, Mark S. Squillante, Lior Horesh, Brian Kingsbury, Jia Liu, Mingyi Hong:
A Stochastic Linearized Augmented Lagrangian Method for Decentralized Bilevel Optimization. NeurIPS 2022
[c5]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/ZengLGH22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/ZengLGH22
Siliang Zeng, Chenliang Li, Alfredo García, Mingyi Hong:
Maximum-Likelihood Inverse Reinforcement Learning with Finite-Time Guarantees. NeurIPS 2022
[i4]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-01282
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-01282
Siliang Zeng, Mingyi Hong, Alfredo García:
Structural Estimation of Markov Decision Processes in High-Dimensional State Space with Finite-Time Guarantees. CoRR abs/2210.01282 (2022)
2021
[c4]
- view
  - electronic edition @ neurips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/KhanduriZHWWY21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/KhanduriZHWWY21
Prashant Khanduri, Siliang Zeng, Mingyi Hong, Hoi-To Wai, Zhaoran Wang, Zhuoran Yang:
A Near-Optimal Algorithm for Stochastic Bilevel Optimization via Double-Momentum. NeurIPS 2021: 30271-30283
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2102-07367
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2102-07367
Prashant Khanduri, Siliang Zeng, Mingyi Hong, Hoi-To Wai, Zhaoran Wang, Zhuoran Yang:
A Momentum-Assisted Single-Timescale Stochastic Approximation Algorithm for Bilevel Optimization. CoRR abs/2102.07367 (2021)
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-05597
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-05597
Siliang Zeng, Tianyi Chen, Alfredo García, Mingyi Hong:
Learning to Coordinate in Multi-Agent Systems: A Coordinated Actor-Critic Algorithm and Finite-Time Guarantees. CoRR abs/2110.05597 (2021)
2020
[c3]
- view
  authority control:
- export record
  dblp key:
  - conf/icc/CaoZPC20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icc/CaoZPC20
Qi Cao, Siliang Zeng, Man-On Pun, Yi Chen:
Network-Level System Performance Prediction Using Deep Neural Networks with Cross-Layer Information. ICC 2020: 1-6
[c2]
- view
  authority control:
- export record
  dblp key:
  - conf/icca/ZengXC20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icca/ZengXC20
Siliang Zeng, Xingfei Xu, Yi Chen:
Multi-Agent Reinforcement Learning for Adaptive Routing: A Hybrid Method using Eligibility Traces. ICCA 2020: 1332-1339
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2006-11662
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2006-11662
Mingyi Hong, Siliang Zeng, Junyu Zhang, Haoran Sun:
On the Divergence of Decentralized Non-Convex Optimization. CoRR abs/2006.11662 (2020)

2000 – 2009

see FAQ

What is the meaning of the colors in the publication lists?

2006
[c1]
- view
  authority control:
- export record
  dblp key:
  - conf/skg/XuZQ06
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/skg/XuZQ06
Jianbo Xu, Siliang Zeng, Fengjiao Qu:
A new In-network data aggregation technology of wireless sensor networks. SKG 2006: 104

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.