default search action
Haoxiang Wang 0003
Person information
- affiliation: University of Illinois at Urbana-Champaign, IL, USA
Other persons with the same name
- Haoxiang Wang — disambiguation page
- Haoxiang Wang 0001 — Cornell University, Ithaca, NY, USA
- Haoxiang Wang 0002 — South China University of Technology, Guangzhou, China (and 1 more)
- Haoxiang Wang 0004 — North China Electric Power University, Beijing, China
- Haoxiang Wang 0005 — Beijing Information Science and Technology University, Beijing, China
- Haoxiang Wang 0006 — Tsinghua University, Beijing, China
- Haoxiang Wang 0007 — Nanjing Agricultural University, Nanjing, China
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j1]Haoxiang Wang, Haozhe Si, Huajie Shao, Han Zhao:
Enhancing Compositional Generalization via Compositional Feature Alignment. Trans. Mach. Learn. Res. 2024 (2024) - [c7]Haoxiang Wang, Yong Lin, Wei Xiong, Rui Yang, Shizhe Diao, Shuang Qiu, Han Zhao, Tong Zhang:
Arithmetic Control of LLMs for Diverse User Preferences: Directional Preference Alignment with Multi-Objective Rewards. ACL (1) 2024: 8642-8655 - [c6]Yong Lin, Hangyu Lin, Wei Xiong, Shizhe Diao, Jianmeng Liu, Jipeng Zhang, Rui Pan, Haoxiang Wang, Wenbin Hu, Hanning Zhang, Hanze Dong, Renjie Pi, Han Zhao, Nan Jiang, Heng Ji, Yuan Yao, Tong Zhang:
Mitigating the Alignment Tax of RLHF. EMNLP 2024: 580-606 - [i15]Haoxiang Wang, Haozhe Si, Huajie Shao, Han Zhao:
Enhancing Compositional Generalization via Compositional Feature Alignment. CoRR abs/2402.02851 (2024) - [i14]Haoxiang Wang, Yong Lin, Wei Xiong, Rui Yang, Shizhe Diao, Shuang Qiu, Han Zhao, Tong Zhang:
Arithmetic Control of LLMs for Diverse User Preferences: Directional Preference Alignment with Multi-Objective Rewards. CoRR abs/2402.18571 (2024) - [i13]Hanze Dong, Wei Xiong, Bo Pang, Haoxiang Wang, Han Zhao, Yingbo Zhou, Nan Jiang, Doyen Sahoo, Caiming Xiong, Tong Zhang:
RLHF Workflow: From Reward Modeling to Online RLHF. CoRR abs/2405.07863 (2024) - [i12]Haoxiang Wang, Wei Xiong, Tengyang Xie, Han Zhao, Tong Zhang:
Interpretable Preferences via Multi-Objective Reward Modeling and Mixture-of-Experts. CoRR abs/2406.12845 (2024) - [i11]Yifei He, Haoxiang Wang, Ziyan Jiang, Alexandros Papangelis, Han Zhao:
Semi-Supervised Reward Modeling via Iterative Self-Training. CoRR abs/2409.06903 (2024) - 2023
- [i10]Yong Lin, Hangyu Lin, Wei Xiong, Shizhe Diao, Jianmeng Liu, Jipeng Zhang, Rui Pan, Haoxiang Wang, Wenbin Hu, Hanning Zhang, Hanze Dong, Renjie Pi, Han Zhao, Nan Jiang, Yuan Yao, Tong Zhang:
Mitigating the Alignment Tax of RLHF. CoRR abs/2309.06256 (2023) - [i9]Yifei He, Haoxiang Wang, Bo Li, Han Zhao:
Gradual Domain Adaptation: Theory and Algorithms. CoRR abs/2310.13852 (2023) - [i8]Haoxiang Wang, Gargi Balasubramaniam, Haozhe Si, Bo Li, Han Zhao:
Invariant-Feature Subspace Recovery: A New Class of Provable Domain Generalization Algorithms. CoRR abs/2311.00966 (2023) - 2022
- [c5]Haoxiang Wang, Yite Wang, Ruoyu Sun, Bo Li:
Global Convergence of MAML and Theory-Inspired Neural Architecture Search for Few-Shot Learning. CVPR 2022: 9787-9798 - [c4]Haoxiang Wang, Bo Li, Han Zhao:
Understanding Gradual Domain Adaptation: Improved Analysis, Optimal Path and Beyond. ICML 2022: 22784-22801 - [c3]Haoxiang Wang, Haozhe Si, Bo Li, Han Zhao:
Provable Domain Generalization via Invariant-Feature Subspace Recovery. ICML 2022: 23018-23033 - [c2]Mao Ye, Ruichen Jiang, Haoxiang Wang, Dhruv Choudhary, Xiaocong Du, Bhargav Bhushanam, Aryan Mokhtari, Arun Kejariwal, Qiang Liu:
Future gradient descent for adapting the temporal shifting data distribution in online recommendation systems. UAI 2022: 2256-2266 - [i7]Haoxiang Wang, Haozhe Si, Bo Li, Han Zhao:
Provable Domain Generalization via Invariant-Feature Subspace Recovery. CoRR abs/2201.12919 (2022) - [i6]Haoxiang Wang, Yite Wang, Ruoyu Sun, Bo Li:
Global Convergence of MAML and Theory-Inspired Neural Architecture Search for Few-Shot Learning. CoRR abs/2203.09137 (2022) - [i5]Haoxiang Wang, Bo Li, Han Zhao:
Understanding Gradual Domain Adaptation: Improved Analysis, Optimal Path and Beyond. CoRR abs/2204.08200 (2022) - [i4]Mao Ye, Ruichen Jiang, Haoxiang Wang, Dhruv Choudhary, Xiaocong Du, Bhargav Bhushanam, Aryan Mokhtari, Arun Kejariwal, Qiang Liu:
Future Gradient Descent for Adapting the Temporal Shifting Data Distribution in Online Recommendation Systems. CoRR abs/2209.01143 (2022) - [i3]Haoxiang Wang, Maurice Weber, Josh Izaac, Cedric Yen-Yu Lin:
Predicting Properties of Quantum Systems with Conditional Generative Models. CoRR abs/2211.16943 (2022) - 2021
- [c1]Haoxiang Wang, Han Zhao, Bo Li:
Bridging Multi-Task Learning and Meta-Learning: Towards Efficient Training and Effective Adaptation. ICML 2021: 10991-11002 - [i2]Haoxiang Wang, Han Zhao, Bo Li:
Bridging Multi-Task Learning and Meta-Learning: Towards Efficient Training and Effective Adaptation. CoRR abs/2106.09017 (2021) - 2020
- [i1]Haoxiang Wang, Ruoyu Sun, Bo Li:
Global Convergence and Induced Kernels of Gradient-Based Meta-Learning with Neural Nets. CoRR abs/2006.14606 (2020)
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-11-14 00:53 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint