


Zhihao Zhang 0002
Person information
- affiliation: Fudan University, School of Computer Science, Shanghai, China
Other persons with the same name
- Zhihao Zhang (aka: Zhi-hao Zhang, Zhi-Hao Zhang, Zhi Hao Zhang): disambiguation page
- Zhihao Zhang 0001: Carnegie Mellon University, Pittsburgh, PA, USA
- Zhihao Zhang 0003: Nanjing Tech University, College of Electrical Engineering and Control Science, Nanjing, China (and 2 more)
- Zhihao Zhang 0004: Beijing University of Technology, School of Economics and Management, Beijing, China
- Zhihao Zhang 0005: Subtle Medical Inc., Shanghai, China (and 1 more)
2020 – today
- 2025
[c4]Ming Zhang, Yuhui Wang, Yujiong Shen, Tingyi Yang, Changhao Jiang, Yilong Wu, Shihan Dou, Qinhao Chen, Zhiheng Xi, Zhihao Zhang, Yi Dong, Zhen Wang, Zhihui Fei, Mingyang Wan, Tao Liang, Guojun Ma, Qi Zhang, Tao Gui, Xuanjing Huang:
PFDial: A Structured Dialogue Instruction Fine-tuning Method Based on UML Flowcharts. ACL (Findings) 2025: 2626-2649
[i12]Yuhao Zhou, Sirui Song, Boyang Liu, Zhiheng Xi, Senjie Jin, Xiaoran Fan, Zhihao Zhang, Wei Li, Xuanjing Huang:
EliteKV: Scalable KV Cache Compression via RoPE Frequency Selection and Joint Low-Rank Projection. CoRR abs/2503.01586 (2025)
[i11]Ming Zhang, Yuhui Wang, Yujiong Shen, Tingyi Yang, Changhao Jiang, Yilong Wu, Shihan Dou, Qinhao Chen, Zhiheng Xi, Zhihao Zhang, Yi Dong, Zhen Wang, Zhihui Fei, Mingyang Wan, Tao Liang, Guojun Ma, Qi Zhang, Tao Gui, Xuanjing Huang:
PFDial: A Structured Dialogue Instruction Fine-tuning Method Based on UML Flowcharts. CoRR abs/2503.06706 (2025)
[i10]Xiaoran Fan, Zhichao Sun, Yangfan Gao, Jingfei Xiong, Hang Yan, Yifei Cao, Jiajun Sun, Shuo Li, Zhihao Zhang, Zhiheng Xi, Yuhao Zhou, Senjie Jin, Changhao Jiang, Junjie Ye, Ming Zhang, Rui Zheng, Zhenhua Han, Yunke Zhang, Demei Yan, Shaokang Dong, Tao Ji, Tao Gui, Qi Zhang, Xuanjing Huang:
Speech-Language Models with Decoupled Tokenizers and Multi-Token Prediction. CoRR abs/2506.12537 (2025)
[i9]Zhihao Zhang, Qiaole Dong, Qi Zhang, Jun Zhao, Enyu Zhou, Zhiheng Xi, Senjie Jin, Xiaoran Fan, Yuhao Zhou, Yanwei Fu, Tao Ji, Tao Gui, Xuanjing Huang:
Reinforcement Fine-Tuning Enables MLLMs Learning Novel Tasks Stably. CoRR abs/2506.23508 (2025)
[i8]Mingqi Wu, Zhihao Zhang, Qiaole Dong, Zhiheng Xi, Jun Zhao, Senjie Jin, Xiaoran Fan, Yuhao Zhou, Yanwei Fu, Qin Liu, Songyang Zhang, Qi Zhang:
Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination. CoRR abs/2507.10532 (2025)
[i7]Ming Zhang, Yujiong Shen, Jingyi Deng, Yuhui Wang, Yue Zhang, Junzhe Wang, Shichun Liu, Shihan Dou, Huayu Sha, Qiyuan Peng, Changhao Jiang, Jingqi Tong, Yilong Wu, Zhihao Zhang, Mingqi Wu, Zhiheng Xi, Mingxu Chai, Tao Liang, Zhihui Fei, Zhen Wang, Mingyang Wan, Guojun Ma, Tao Gui, Qi Zhang, Xuanjing Huang:
LLMEval-3: A Large-Scale Longitudinal Study on Robust and Fair Evaluation of Large Language Models. CoRR abs/2508.05452 (2025)
[i6]Zhiheng Xi, Xin Guo, Yang Nan, Enyu Zhou, Junrui Shen, Wenxiang Chen, Jiaqi Liu, Jixuan Huang, Zhihao Zhang, Honglin Guo, Xun Deng, Zhikai Lei, Miao Zheng, Guoteng Wang, Shuo Zhang, Peng Sun, Rui Zheng, Hang Yan, Tao Gui, Qi Zhang, Xuanjing Huang:
BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive Clipping. CoRR abs/2510.18927 (2025)
[i5]Zhiheng Xi, Chenyang Liao, Guanyu Li, Yajie Yang, Wenxiang Chen, Zhihao Zhang, Binghai Wang, Senjie Jin, Yuhao Zhou, Jian Guan, Wei Wu, Tao Ji, Tao Gui, Qi Zhang, Xuanjing Huang:
AgentPRM: Process Reward Models for LLM Agents via Step-Wise Promise and Progress. CoRR abs/2511.08325 (2025)
- 2024
[c3]Zhihao Zhang, Jun Zhao, Qi Zhang, Tao Gui, Xuanjing Huang:
Unveiling Linguistic Regions in Large Language Models. ACL (1) 2024: 6228-6247
[c2]Yue Zhang, Zhihao Zhang, Wenbin Lai, Chong Zhang, Tao Gui, Qi Zhang, Xuanjing Huang:
PDF-to-Tree: Parsing PDF Text Blocks into a Tree. EMNLP (Findings) 2024: 10704-10714
[c1]Lu Chen, Rui Zheng, Binghai Wang, Senjie Jin, Caishuang Huang, Junjie Ye, Zhihao Zhang, Yuhao Zhou, Zhiheng Xi, Tao Gui, Qi Zhang, Xuanjing Huang:
Improving Discriminative Capability of Reward Models in RLHF Using Contrastive Learning. EMNLP 2024: 15270-15283
[i4]Jun Zhao, Zhihao Zhang, Luhui Gao, Qi Zhang, Tao Gui, Xuanjing Huang:
LLaMA Beyond English: An Empirical Study on Language Capability Transfer. CoRR abs/2401.01055 (2024)
[i3]Nuo Xu, Jun Zhao, Can Zu, Sixian Li, Lu Chen, Zhihao Zhang, Rui Zheng, Shihan Dou, Wenjuan Qin, Tao Gui, Qi Zhang, Xuanjing Huang:
Advancing Translation Preference Modeling with RLHF: A Step Towards Cost-Effective Solution. CoRR abs/2402.11525 (2024)
[i2]Zhihao Zhang, Jun Zhao, Qi Zhang, Tao Gui, Xuanjing Huang:
Unveiling Linguistic Regions in Large Language Models. CoRR abs/2402.14700 (2024)
- 2023
[i1]Jun Zhao, Zhihao Zhang, Yide Ma, Qi Zhang, Tao Gui, Luhui Gao, Xuanjing Huang:
Unveiling A Core Linguistic Region in Large Language Models. CoRR abs/2310.14928 (2023)
last updated on 2026-01-06 00:56 CET by the dblp team
all metadata released as open data under CC0 1.0 license