Zhihao Jia
2020 – today
- 2024
- [j8] Zhihao Jia, Bing Wang, Changhao Chen:
  Drone-NeRF: Efficient NeRF based 3D scene reconstruction for large-scale drone survey. Image Vis. Comput. 143: 104920 (2024)
- [j7] Zikun Li, Jinjun Peng, Yixuan Mei, Sina Lin, Yi Wu, Oded Padon, Zhihao Jia:
  Quarl: A Learning-Based Quantum Circuit Optimizer. Proc. ACM Program. Lang. 8(OOPSLA1): 555-582 (2024)
- [c36] Zhengxin Zhang, Dan Zhao, Xupeng Miao, Gabriele Oliaro, Zhihao Zhang, Qing Li, Yong Jiang, Zhihao Jia:
  Quantized Side Tuning: Fast and Memory-Efficient Tuning of Quantized Large Language Models. ACL (1) 2024: 1-17
- [c35] Muyan Hu, Ashwin Venkatram, Shreyashri Biswas, Balamurugan Marimuthu, Bohan Hou, Gabriele Oliaro, Haojie Wang, Liyan Zheng, Xupeng Miao, Jidong Zhai, Zhihao Jia:
  Optimal Kernel Orchestration for Tensor Programs with Korch. ASPLOS (3) 2024: 755-769
- [c34] Xupeng Miao, Gabriele Oliaro, Zhihao Zhang, Xinhao Cheng, Zeyu Wang, Zhengxin Zhang, Rae Ying Yee Wong, Alan Zhu, Lijie Yang, Xiaoxiang Shi, Chunan Shi, Zhuoming Chen, Daiyaan Arfeen, Reyna Abhyankar, Zhihao Jia:
  SpecInfer: Accelerating Large Language Model Serving with Tree-based Speculative Inference and Verification. ASPLOS (3) 2024: 932-949
- [c33] Xupeng Miao, Chunan Shi, Jiangfei Duan, Xiaoli Xi, Dahua Lin, Bin Cui, Zhihao Jia:
  SpotServe: Serving Generative Large Language Models on Preemptible Instances. ASPLOS (2) 2024: 1112-1127
- [c32] Zhihao Zhang, Alan Zhu, Lijie Yang, Yihua Xu, Lanting Li, Phitchaya Mangpo Phothilimthana, Zhihao Jia:
  Accelerating Iterative Retrieval-augmented Language Model Serving with Speculation. ICML 2024
- [c31] Xupeng Miao, Shenhan Zhu, Fangcheng Fu, Ziyu Guo, Zhi Yang, Yaofeng Tu, Zhihao Jia, Bin Cui:
  X-former Elucidator: Reviving Efficient Attention for Long Context Language Modeling. IJCAI 2024: 8179-8187
- [c30] Jiangfei Duan, Ziang Song, Xupeng Miao, Xiaoli Xi, Dahua Lin, Harry Xu, Minjia Zhang, Zhihao Jia:
  Parcae: Proactive, Liveput-Optimized DNN Training on Preemptible Instances. NSDI 2024
- [c29] Hongli Zhou, Zhihao Jia, Haiyang Zhu, Zhizheng Zhang:
  CLLP: Contrastive Learning Framework Based on Latent Preferences for Next POI Recommendation. SIGIR 2024: 1473-1482
- [c28] Xupeng Miao, Zhihao Jia, Bin Cui:
  Demystifying Data Management for Large Language Models. SIGMOD Conference Companion 2024: 547-555
- [i31] Zhengxin Zhang, Dan Zhao, Xupeng Miao, Gabriele Oliaro, Qing Li, Yong Jiang, Zhihao Jia:
  Quantized Side Tuning: Fast and Memory-Efficient Tuning of Quantized Large Language Models. CoRR abs/2401.07159 (2024)
- [i30] Zhihao Zhang, Alan Zhu, Lijie Yang, Yihua Xu, Lanting Li, Phitchaya Mangpo Phothilimthana, Zhihao Jia:
  Accelerating Retrieval-Augmented Language Model Serving with Speculation. CoRR abs/2401.14021 (2024)
- [i29] Zhuoming Chen, Avner May, Ruslan Svirschevski, Yuhsun Huang, Max Ryabinin, Zhihao Jia, Beidi Chen:
  Sequoia: Scalable, Robust, and Hardware-aware Speculative Decoding. CoRR abs/2402.12374 (2024)
- [i28] Xupeng Miao, Gabriele Oliaro, Xinhao Cheng, Mengdi Wu, Colin Unger, Zhihao Jia:
  FlexLLM: A System for Co-Serving Large Language Model Inference and Parameter-Efficient Finetuning. CoRR abs/2402.18789 (2024)
- [i27] Jiangfei Duan, Ziang Song, Xupeng Miao, Xiaoli Xi, Dahua Lin, Harry Xu, Minjia Zhang, Zhihao Jia:
  Parcae: Proactive, Liveput-Optimized DNN Training on Preemptible Instances. CoRR abs/2403.14097 (2024)
- [i26] Mengdi Wu, Xinhao Cheng, Oded Padon, Zhihao Jia:
  A Multi-Level Superoptimizer for Tensor Programs. CoRR abs/2405.05751 (2024)
- [i25] Yixuan Mei, Yonghao Zhuang, Xupeng Miao, Juncheng Yang, Zhihao Jia, Rashmi Vinayak:
  Helix: Distributed Serving of Large Language Models via Max-Flow on Heterogeneous GPUs. CoRR abs/2406.01566 (2024)
- [i24] Ruslan Svirschevski, Avner May, Zhuoming Chen, Beidi Chen, Zhihao Jia, Max Ryabinin:
  SpecExec: Massively Parallel Speculative Decoding for Interactive LLM Inference on Consumer Devices. CoRR abs/2406.02532 (2024)
- [i23] Byungsoo Jeon, Mengdi Wu, Shiyi Cao, Sunghyun Kim, Sunghyun Park, Neeraj Aggarwal, Colin Unger, Daiyaan Arfeen, Peiyuan Liao, Xupeng Miao, Mohammad Alizadeh, Gregory R. Ganger, Tianqi Chen, Zhihao Jia:
  GraphPipe: Improving Performance and Scalability of DNN Training with Graph Pipeline Parallelism. CoRR abs/2406.17145 (2024)
- [i22] Mingkuan Xu, Shiyi Cao, Xupeng Miao, Umut A. Acar, Zhihao Jia:
  Atlas: Hierarchical Partitioning for Quantum Circuit Simulation on GPUs (Extended Version). CoRR abs/2408.09055 (2024)
- 2023
- [j6] Xupeng Miao, Yining Shi, Zhi Yang, Bin Cui, Zhihao Jia:
  SDPipe: A Semi-Decentralized Framework for Heterogeneity-aware Pipeline-parallel Training. Proc. VLDB Endow. 16(9): 2354-2363 (2023)
- [j5] Junhua Gu, Zhihao Jia, Taotao Cai, Xiangyu Song, Adnan Mahmood:
  Dynamic Correlation Adjacency-Matrix-Based Graph Neural Networks for Traffic Flow Prediction. Sensors 23(6): 2897 (2023)
- [j4] Haojie Wang, Jidong Zhai, Mingyu Gao, Feng Zhang, Tuowei Wang, Zixuan Ma, Shizhi Tang, Liyan Zheng, Wen Wang, Kaiyuan Rong, Yuanyong Chen, Zhihao Jia:
  Optimizing DNNs With Partially Equivalent Transformations and Automated Corrections. IEEE Trans. Computers 72(12): 3546-3560 (2023)
- [c27] Cheng Tan, Changliu Liu, Zhihao Jia, Tianhao Wei:
  Building Verified Neural Networks for Computer Systems with Ouroboros. MLSys 2023
- [c26] John Thorpe, Pengzhan Zhao, Jonathan Eyolfson, Yifan Qiao, Zhihao Jia, Minjia Zhang, Ravi Netravali, Guoqing Harry Xu:
  Bamboo: Making Preemptible Instances Resilient for Affordable Training of Large DNNs. NSDI 2023: 497-513
- [c25] Weiyang Wang, Moein Khazraee, Zhizhen Zhong, Manya Ghobadi, Zhihao Jia, Dheevatsa Mudigere, Ying Zhang, Anthony Kewitsch:
  TopoOpt: Co-optimizing Network Topology and Parallelization Strategy for Distributed Training Jobs. NSDI 2023: 739-767
- [c24] Liyan Zheng, Haojie Wang, Jidong Zhai, Muyan Hu, Zixuan Ma, Tuowei Wang, Shuhong Huang, Xupeng Miao, Shizhi Tang, Kezhao Huang, Zhihao Jia:
  EINNET: Optimizing Tensor Programs with Derivation-Based Transformations. OSDI 2023: 739-755
- [c23] Suhas Jayaram Subramanya, Daiyaan Arfeen, Shouxu Lin, Aurick Qiao, Zhihao Jia, Gregory R. Ganger:
  Sia: Heterogeneity-aware, goodput-optimized ML-cluster scheduling. SOSP 2023: 642-657
- [i21] Xupeng Miao, Gabriele Oliaro, Zhihao Zhang, Xinhao Cheng, Zeyu Wang, Rae Ying Yee Wong, Zhuoming Chen, Daiyaan Arfeen, Reyna Abhyankar, Zhihao Jia:
  SpecInfer: Accelerating Generative LLM Serving with Speculative Inference and Token Tree Verification. CoRR abs/2305.09781 (2023)
- [i20] Zikun Li, Jinjun Peng, Yixuan Mei, Sina Lin, Yi Wu, Oded Padon, Zhihao Jia:
  Quarl: A Learning-Based Quantum Circuit Optimizer. CoRR abs/2307.10120 (2023)
- [i19] Zhihao Jia, Bing Wang, Changhao Chen:
  Drone-NeRF: Efficient NeRF Based 3D Scene Reconstruction for Large-Scale Drone Survey. CoRR abs/2308.15733 (2023)
- [i18] Xupeng Miao, Chunan Shi, Jiangfei Duan, Xiaoli Xi, Dahua Lin, Bin Cui, Zhihao Jia:
  SpotServe: Serving Generative Large Language Models on Preemptible Instances. CoRR abs/2311.15566 (2023)
- [i17] Xupeng Miao, Gabriele Oliaro, Zhihao Zhang, Xinhao Cheng, Hongyi Jin, Tianqi Chen, Zhihao Jia:
  Towards Efficient Generative Large Language Model Serving: A Survey from Algorithms to Systems. CoRR abs/2312.15234 (2023)
- 2022
- [j3] Yue Zhao, George H. Chen, Zhihao Jia:
  TOD: GPU-accelerated Outlier Detection via Tensor Operations. Proc. VLDB Endow. 16(3): 546-560 (2022)
- [c22] Byungsoo Jeon, Sunghyun Park, Peiyuan Liao, Sheng Xu, Tianqi Chen, Zhihao Jia:
  Collage: Seamless Integration of Deep Learning Backends with Automatic Placement. PACT 2022: 517-529
- [c21] Zhihao Zhang, Zhihao Jia:
  GradSign: Model Performance Inference with Theoretical Insights. ICLR 2022
- [c20] Dheevatsa Mudigere, Yuchen Hao, Jianyu Huang, Zhihao Jia, Andrew Tulloch, Srinivas Sridharan, Xing Liu, Mustafa Ozdal, Jade Nie, Jongsoo Park, Liang Luo, Jie Amy Yang, Leon Gao, Dmytro Ivchenko, Aarti Basant, Yuxi Hu, Jiyan Yang, Ehsan K. Ardestani, Xiaodong Wang, Rakesh Komuravelli, Ching-Hsiang Chu, Serhat Yilmaz, Huayu Li, Jiyuan Qian, Zhuobo Feng, Yinbin Ma, Junjie Yang, Ellie Wen, Hong Li, Lin Yang, Chonglin Sun, Whitney Zhao, Dimitry Melts, Krishna Dhulipala, K. R. Kishore, Tyler Graf, Assaf Eisenman, Kiran Kumar Matam, Adi Gangidi, Guoqiang Jerry Chen, Manoj Krishnan, Avinash Nayak, Krishnakumar Nair, Bharath Muthiah, Mahmoud khorashadi, Pallab Bhattacharya, Petr Lapukhov, Maxim Naumov, Ajit Mathews, Lin Qiao, Mikhail Smelyanskiy, Bill Jia, Vijay Rao:
  Software-hardware co-design for fast and scalable training of deep learning recommendation models. ISCA 2022: 993-1011
- [c19] Kay Liu, Yingtong Dou, Yue Zhao, Xueying Ding, Xiyang Hu, Ruitong Zhang, Kaize Ding, Canyu Chen, Hao Peng, Kai Shu, Lichao Sun, Jundong Li, George H. Chen, Zhihao Jia, Philip S. Yu:
  BOND: Benchmarking Unsupervised Outlier Node Detection on Static Attributed Graphs. NeurIPS 2022
- [c18] Colin Unger, Zhihao Jia, Wei Wu, Sina Lin, Mandeep Baines, Carlos Efrain Quintero Narvaez, Vinay Ramakrishnaiah, Nirmal Prajapati, Patrick S. McCormick, Jamaludin Mohd-Yusof, Xi Luo, Dheevatsa Mudigere, Jongsoo Park, Misha Smelyanskiy, Alex Aiken:
  Unity: Accelerating DNN Training Through Joint Optimization of Algebraic Transformations and Parallelization. OSDI 2022: 267-284
- [c17] Mingkuan Xu, Zikun Li, Oded Padon, Sina Lin, Jessica Pointing, Auguste Hirth, Henry Ma, Jens Palsberg, Alex Aiken, Umut A. Acar, Zhihao Jia:
  Quartz: superoptimization of Quantum circuits. PLDI 2022: 625-640
- [i16] Mingkuan Xu, Zikun Li, Oded Padon, Sina Lin, Jessica Pointing, Auguste Hirth, Henry Ma, Jens Palsberg, Alex Aiken, Umut A. Acar, Zhihao Jia:
  Quartz: Superoptimization of Quantum Circuits (Extended Version). CoRR abs/2204.09033 (2022)
- [i15] John Thorpe, Pengzhan Zhao, Jonathan Eyolfson, Yifan Qiao, Zhihao Jia, Minjia Zhang, Ravi Netravali, Guoqing Harry Xu:
  Bamboo: Making Preemptible Instances Resilient for Affordable Training of Large DNNs. CoRR abs/2204.12013 (2022)
- [i14] Kay Liu, Yingtong Dou, Yue Zhao, Xueying Ding, Xiyang Hu, Ruitong Zhang, Kaize Ding, Canyu Chen, Hao Peng, Kai Shu, George H. Chen, Zhihao Jia, Philip S. Yu:
  PyGOD: A Python Library for Graph Outlier Detection. CoRR abs/2204.12095 (2022)
- [i13] Ferdinand Kossmann, Zhihao Jia, Alex Aiken:
  Optimizing Mixture of Experts using Dynamic Recompilations. CoRR abs/2205.01848 (2022)
- [i12] Kay Liu, Yingtong Dou, Yue Zhao, Xueying Ding, Xiyang Hu, Ruitong Zhang, Kaize Ding, Canyu Chen, Hao Peng, Kai Shu, Lichao Sun, Jundong Li, George H. Chen, Zhihao Jia, Philip S. Yu:
  Benchmarking Node Outlier Detection on Graphs. CoRR abs/2206.10071 (2022)
- [i11] Liyan Zheng, Haojie Wang, Jidong Zhai, Muyan Hu, Zixuan Ma, Tuowei Wang, Shizhi Tang, Lei Xie, Kezhao Huang, Zhihao Jia:
  OLLIE: Derivation-based Tensor Program Optimizer. CoRR abs/2208.02025 (2022)
- [i10] Zhihao Zhang, Zhuoming Chen, Heyang Huang, Zhihao Jia:
  Quark: A Gradient-Free Quantum Learning Framework for Classification Tasks. CoRR abs/2210.01311 (2022)
- 2021
- [c16] Yaoyao Ding, Ligeng Zhu, Zhihao Jia, Gennady Pekhimenko, Song Han:
  IOS: Inter-Operator Scheduler for CNN Acceleration. MLSys 2021
- [c15] Haojie Wang, Jidong Zhai, Mingyu Gao, Zixuan Ma, Shizhi Tang, Liyan Zheng, Yuanzhi Li, Kaiyuan Rong, Yuanyong Chen, Zhihao Jia:
  PET: Optimizing Tensor Programs with Partially Equivalent Transformations and Automated Corrections. OSDI 2021: 37-54
- [c14] John Thorpe, Yifan Qiao, Jonathan Eyolfson, Shen Teng, Guanzhou Hu, Zhihao Jia, Jinliang Wei, Keval Vora, Ravi Netravali, Miryung Kim, Guoqing Harry Xu:
  Dorylus: Affordable, Scalable, and Accurate GNN Training with Distributed CPU Servers and Serverless Threads. OSDI 2021: 495-514
- [c13] Michael Bauer, Wonchan Lee, Elliott Slaughter, Zhihao Jia, Mario Di Renzo, Manolis Papadakis, Galen M. Shipman, Patrick S. McCormick, Michael Garland, Alex Aiken:
  Scaling implicit parallelism via dynamic control replication. PPoPP 2021: 105-118
- [i9] John Thorpe, Yifan Qiao, Jonathan Eyolfson, Shen Teng, Guanzhou Hu, Zhihao Jia, Jinliang Wei, Keval Vora, Ravi Netravali, Miryung Kim, Guoqing Harry Xu:
  Dorylus: Affordable, Scalable, and Accurate GNN Training with Distributed CPU Servers and Serverless Threads. CoRR abs/2105.11118 (2021)
- [i8] Zhihao Zhang, Zhihao Jia:
  GradSign: Model Performance Inference with Theoretical Insights. CoRR abs/2110.08616 (2021)
- [i7] Yue Zhao, George H. Chen, Zhihao Jia:
  TOD: Tensor-based Outlier Detection. CoRR abs/2110.14007 (2021)
- [i6] Byungsoo Jeon, Sunghyun Park, Peiyuan Liao, Sheng Xu, Tianqi Chen, Zhihao Jia:
  Collage: Automated Integration of Deep Learning Backends. CoRR abs/2111.00655 (2021)
- [i5] Jessica Pointing, Oded Padon, Zhihao Jia, Henry Ma, Auguste Hirth, Jens Palsberg, Alex Aiken:
  Quanto: Optimizing Quantum Circuits with Automatic Generation of Circuit Identities. CoRR abs/2111.11387 (2021)
- 2020
- [b1] Zhihao Jia:
  Automated discovery of machine learning optimizations. Stanford University, USA, 2020
- [c12] Zhihao Jia, Sina Lin, Rex Ying, Jiaxuan You, Jure Leskovec, Alex Aiken:
  Redundancy-Free Computation for Graph Neural Networks. KDD 2020: 997-1005
- [c11] Zhihao Jia, Sina Lin, Mingyu Gao, Matei Zaharia, Alex Aiken:
  Improving the Accuracy, Scalability, and Performance of Graph Neural Networks with Roc. MLSys 2020
- [i4] Yaoyao Ding, Ligeng Zhu, Zhihao Jia, Gennady Pekhimenko, Song Han:
  IOS: Inter-Operator Scheduler for CNN Acceleration. CoRR abs/2011.01302 (2020)
2010 – 2019
- 2019
- [c10] Zhihao Jia, James Thomas, Todd Warszawski, Mingyu Gao, Matei Zaharia, Alex Aiken:
  Optimizing DNN Computation with Relaxed Graph Substitutions. SysML 2019
- [c9] Zhihao Jia, Matei Zaharia, Alex Aiken:
  Beyond Data and Model Parallelism for Deep Neural Networks. SysML 2019
- [c8] Zhihao Jia, Oded Padon, James Thomas, Todd Warszawski, Matei Zaharia, Alex Aiken:
  TASO: optimizing deep learning computation with automatic generation of graph substitutions. SOSP 2019: 47-62
- [i3] Zhihao Jia, Sina Lin, Rex Ying, Jiaxuan You, Jure Leskovec, Alex Aiken:
  Redundancy-Free Computation Graphs for Graph Neural Networks. CoRR abs/1906.03707 (2019)
- 2018
- [j2] Qiubin Su, Zhihao Jia, Lu Lu:
  Research on user behavior clustering algorithm based on mobile application. J. Intell. Fuzzy Syst. 35(2): 1291-1300 (2018)
- [c7] Zhihao Jia, Sina Lin, Charles R. Qi, Alex Aiken:
  Exploring Hidden Dimensions in Parallelizing Convolutional Neural Networks. ICML 2018: 2279-2288
- [c6] Zhihao Jia, Sean Treichler, Galen M. Shipman, Patrick S. McCormick, Alex Aiken:
  Isometry: A Path-Based Distributed Data Transfer System. ICS 2018: 295-306
- [i2] Zhihao Jia, Sina Lin, Charles R. Qi, Alex Aiken:
  Exploring Hidden Dimensions in Parallelizing Convolutional Neural Networks. CoRR abs/1802.04924 (2018)
- [i1] Zhihao Jia, Matei Zaharia, Alex Aiken:
  Beyond Data and Model Parallelism for Deep Neural Networks. CoRR abs/1807.05358 (2018)
- 2017
- [j1] Zhihao Jia, Yongkee Kwon, Galen M. Shipman, Patrick S. McCormick, Mattan Erez, Alex Aiken:
  A Distributed Multi-GPU System for Fast Graph Processing. Proc. VLDB Endow. 11(3): 297-310 (2017)
- [c5] Zhihao Jia, Sean Treichler, Galen M. Shipman, Michael Bauer, Noah Watkins, Carlos Maltzahn, Patrick S. McCormick, Alex Aiken:
  Integrating External Resources with a Task-Based Programming Model. HiPC 2017: 307-316
- 2016
- [c4] Ankita Kejriwal, Arjun Gopalan, Ashish Gupta, Zhihao Jia, Stephen Yang, John K. Ousterhout:
  SLIK: Scalable Low-Latency Indexes for a Key-Value Store. USENIX ATC 2016: 57-70
- 2015
- [c3] Noah Watkins, Zhihao Jia, Galen M. Shipman, Carlos Maltzahn, Alex Aiken, Patrick S. McCormick:
  Automatic and transparent I/O optimization with storage integrated application runtime support. PDSW@SC 2015: 49-54
- 2012
- [c2] Xi Wang, Haogang Chen, Alvin Cheung, Zhihao Jia, Nickolai Zeldovich, M. Frans Kaashoek:
  Undefined behavior: what happened to my code? APSys 2012: 9
- [c1] Xi Wang, Haogang Chen, Zhihao Jia, Nickolai Zeldovich, M. Frans Kaashoek:
  Improving Integer Security for Systems with KINT. OSDI 2012: 163-177
last updated on 2024-10-21 21:28 CEST by the dblp team
all metadata released as open data under CC0 1.0 license