Zihao Ye 0001
Person information
- affiliation: University of Washington, School of Computer Science and Engineering, Seattle, WA, USA
- affiliation: Amazon Web Services, Shanghai AI Lab, Shanghai, China
2020 – today
- 2024
- [c8] Size Zheng, Renze Chen, Meng Li, Zihao Ye, Luis Ceze, Yun Liang:
  vMCU: Coordinated Memory Management and Kernel Optimization for DNN Inference on MCUs. MLSys 2024
- [c7] Lequn Chen, Zihao Ye, Yongji Wu, Danyang Zhuo, Luis Ceze, Arvind Krishnamurthy:
  Punica: Multi-Tenant LoRA Serving. MLSys 2024
- [c6] Yilong Zhao, Chien-Yu Lin, Kan Zhu, Zihao Ye, Lequn Chen, Size Zheng, Luis Ceze, Arvind Krishnamurthy, Tianqi Chen, Baris Kasikci:
  Atom: Low-Bit Quantization for Efficient and Accurate LLM Serving. MLSys 2024
- [i12] Size Zheng, Renze Chen, Meng Li, Zihao Ye, Luis Ceze, Yun Liang:
  vMCU: Coordinated Memory Management and Kernel Optimization for DNN Inference on MCUs. CoRR abs/2406.06542 (2024)
- [i11] Kan Zhu, Yilong Zhao, Liangyu Zhao, Gefei Zuo, Yile Gu, Dedong Xie, Yufei Gao, Qinyu Xu, Tian Tang, Zihao Ye, Keisuke Kamahori, Chien-Yu Lin, Stephanie Wang, Arvind Krishnamurthy, Baris Kasikci:
  NanoFlow: Towards Optimal Large Language Model Serving Throughput. CoRR abs/2408.12757 (2024)
- 2023
- [c5] Zihao Ye, Ruihang Lai, Junru Shao, Tianqi Chen, Luis Ceze:
  SparseTIR: Composable Abstractions for Sparse Compilation in Deep Learning. ASPLOS (3) 2023: 660-678
- [c4] Siyuan Feng, Bohan Hou, Hongyi Jin, Wuwei Lin, Junru Shao, Ruihang Lai, Zihao Ye, Lianmin Zheng, Cody Hao Yu, Yong Yu, Tianqi Chen:
  TensorIR: An Abstraction for Automatic Tensorized Program Optimization. ASPLOS (2) 2023: 804-817
- [i10] Lequn Chen, Zihao Ye, Yongji Wu, Danyang Zhuo, Luis Ceze, Arvind Krishnamurthy:
  Punica: Multi-Tenant LoRA Serving. CoRR abs/2310.18547 (2023)
- [i9] Yilong Zhao, Chien-Yu Lin, Kan Zhu, Zihao Ye, Lequn Chen, Size Zheng, Luis Ceze, Arvind Krishnamurthy, Tianqi Chen, Baris Kasikci:
  Atom: Low-bit Quantization for Efficient and Accurate LLM Serving. CoRR abs/2310.19102 (2023)
- [i8] Ruihang Lai, Junru Shao, Siyuan Feng, Steven S. Lyubomirsky, Bohan Hou, Wuwei Lin, Zihao Ye, Hongyi Jin, Yuchen Jin, Jiawei Liu, Lesheng Jin, Yaxing Cai, Ziheng Jiang, Yong Wu, Sunghyun Park, Prakalp Srivastava, Jared G. Roesch, Todd C. Mowry, Tianqi Chen:
  Relax: Composable Abstractions for End-to-End Dynamic Machine Learning. CoRR abs/2311.02103 (2023)
- 2022
- [c3] Zhiqiang Xie, Minjie Wang, Zihao Ye, Zheng Zhang, Rui Fan:
  Graphiler: Optimizing Graph Neural Networks with Message Passing Data Flow Graph. MLSys 2022
- [i7] Siyuan Feng, Bohan Hou, Hongyi Jin, Wuwei Lin, Junru Shao, Ruihang Lai, Zihao Ye, Lianmin Zheng, Cody Hao Yu, Yong Yu, Tianqi Chen:
  TensorIR: An Abstraction for Automatic Tensorized Program Optimization. CoRR abs/2207.04296 (2022)
- [i6] Zihao Ye, Ruihang Lai, Junru Shao, Tianqi Chen, Luis Ceze:
  SparseTIR: Composable Abstractions for Sparse Compilation in Deep Learning. CoRR abs/2207.04606 (2022)
- 2020
- [c2] Yuwei Hu, Zihao Ye, Minjie Wang, Jiali Yu, Da Zheng, Mu Li, Zheng Zhang, Zhiru Zhang, Yida Wang:
  FeatGraph: a flexible and efficient backend for graph neural network systems. SC 2020: 71
- [c1] Da Zheng, Xiang Song, Chao Ma, Zeyuan Tan, Zihao Ye, Jin Dong, Hao Xiong, Zheng Zhang, George Karypis:
  DGL-KE: Training Knowledge Graph Embeddings at Scale. SIGIR 2020: 739-748
- [i5] Chenguang Wang, Zihao Ye, Aston Zhang, Zheng Zhang, Alexander J. Smola:
  Transformer on a Diet. CoRR abs/2002.06170 (2020)
- [i4] Da Zheng, Xiang Song, Chao Ma, Zeyuan Tan, Zihao Ye, Jin Dong, Hao Xiong, Zheng Zhang, George Karypis:
  DGL-KE: Training Knowledge Graph Embeddings at Scale. CoRR abs/2004.08532 (2020)
- [i3] Yuwei Hu, Zihao Ye, Minjie Wang, Jiali Yu, Da Zheng, Mu Li, Zheng Zhang, Zhiru Zhang, Yida Wang:
  FeatGraph: A Flexible and Efficient Backend for Graph Neural Network Systems. CoRR abs/2008.11359 (2020)
2010 – 2019
- 2019
- [i2] Minjie Wang, Lingfan Yu, Da Zheng, Quan Gan, Yu Gai, Zihao Ye, Mufei Li, Jinjing Zhou, Qi Huang, Chao Ma, Ziyue Huang, Qipeng Guo, Hao Zhang, Haibin Lin, Junbo Zhao, Jinyang Li, Alexander J. Smola, Zheng Zhang:
  Deep Graph Library: Towards Efficient and Scalable Deep Learning on Graphs. CoRR abs/1909.01315 (2019)
- [i1] Zihao Ye, Qipeng Guo, Quan Gan, Xipeng Qiu, Zheng Zhang:
  BP-Transformer: Modelling Long-Range Context via Binary Partitioning. CoRR abs/1911.04070 (2019)
last updated on 2024-11-27 21:22 CET by the dblp team
all metadata released as open data under CC0 1.0 license