![](https://dblp.uni-trier.de./img/logo.320x120.png)
![search dblp search dblp](https://dblp.uni-trier.de./img/search.dark.16x16.png)
![search dblp](https://dblp.uni-trier.de./img/search.dark.16x16.png)
default search action
Keren Zhou 0001
Person information
- affiliation: George Mason University, VA, USA
- affiliation: OpenAI
- affiliation (PhD): Rice University, TX, USA
Other persons with the same name
- Keren Zhou 0002
— Beckman Research Institute of City of Hope, Monrovia, CA, USA (and 1 more)
- Keren Zhou 0003 — Chinese Academy of Sciences, Insititute of Computing Technology, Beijing, China
Refine list
![note](https://dblp.uni-trier.de./img/note-mark.dark.12x12.png)
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c20]Jason Ansel, Edward Z. Yang, Horace He, Natalia Gimelshein, Animesh Jain, Michael Voznesensky, Bin Bao, Peter Bell
, David Berard, Evgeni Burovski
, Geeta Chauhan, Anjali Chourdia, Will Constable, Alban Desmaison, Zachary DeVito, Elias Ellison, Will Feng, Jiong Gong, Michael Gschwind, Brian Hirsh, Sherlock Huang, Kshiteej Kalambarkar, Laurent Kirsch, Michael Lazos, Mario Lezcano, Yanbo Liang, Jason Liang, Yinghai Lu, C. K. Luk, Bert Maher, Yunjie Pan, Christian Puhrsch, Matthias Reso, Mark Saroufim, Marcos Yukio Siraichi, Helen Suk, Shunting Zhang, Michael Suo, Phil Tillet, Xu Zhao, Eikan Wang, Keren Zhou, Richard Zou, Xiaodong Wang, Ajit Mathews, William Wen, Gregory Chanan, Peng Wu, Soumith Chintala:
PyTorch 2: Faster Machine Learning Through Dynamic Python Bytecode Transformation and Graph Compilation. ASPLOS (2) 2024: 929-947 - [c19]Keren Zhou
, Karthik Ganapathi Subramanian
, Po-Hsun Lin
, Matthias Fey
, Binqian Yin
, Jiajia Li
:
FASTEN: Fast GPU-accelerated Segmented Matrix Multiplication for Heterogenous Graph Neural Networks. ICS 2024: 511-524 - [c18]Aditya Desai, Kimia Saedi, Apoorv Walia, Jihyeong Lee, Keren Zhou, Anshumali Shrivastava:
SS1: Accelerating Inference with Fast and Expressive Sketch Structured Transform. NeurIPS 2024 - [c17]Zhen Xie, Murali Emani, Xiaodong Yu, Dingwen Tao, Xin He, Pengfei Su, Keren Zhou, Venkatram Vishwanath:
Centimani: Enabling Fast AI Accelerator Selection for DNN Training with a Novel Performance Predictor. USENIX ATC 2024: 1203-1221 - [i5]Qidong Zhao, Hao Wu, Yuming Hao, Zilingfeng Ye, Jiajia Li, Xu Liu, Keren Zhou:
DeepContext: A Context-aware, Cross-platform, and Cross-framework Tool for Performance Profiling and Analysis of Deep Learning Workloads. CoRR abs/2411.02797 (2024) - 2023
- [c16]Mao Lin, Keren Zhou
, Pengfei Su:
DrGPUM: Guiding Memory Optimization for GPU-Accelerated Applications. ASPLOS (3) 2023: 164-178 - [c15]Aditya Desai, Keren Zhou
, Anshumali Shrivastava:
Hardware-Aware Compression with Random Operation Access Specific Tile (ROAST) Hashing. ICML 2023: 7732-7749 - 2022
- [j5]Ryuichi Sai
, John M. Mellor-Crummey
, Xiaozhu Meng
, Keren Zhou
, Mauricio Araya-Polo, Jie Meng:
Accelerating high-order stencils on GPUs. Concurr. Comput. Pract. Exp. 34(20) (2022) - [j4]Binqian Yin
, Qinhong Hu
, Yingying Zhu, Chen Zhao, Keren Zhou
:
Paw-Net: Stacking ensemble deep learning for segmenting scanning electron microscopy images of fine-grained shale samples. Comput. Geosci. 168: 105218 (2022) - [j3]Keren Zhou
, Xiaozhu Meng, Ryuichi Sai
, Dejan Grubisic
, John M. Mellor-Crummey
:
An Automated Tool for Analysis and Tuning of GPU-Accelerated Code in HPC Applications. IEEE Trans. Parallel Distributed Syst. 33(4): 854-865 (2022) - [c14]Keren Zhou, Yueming Hao
, John M. Mellor-Crummey
, Xiaozhu Meng, Xu Liu:
ValueExpert: exploring value patterns in GPU-accelerated applications. ASPLOS 2022: 171-185 - [c13]Keren Zhou
, Jonathon M. Anderson, Xiaozhu Meng, John M. Mellor-Crummey
:
Low overhead and context sensitive profiling of CPU-accelerated applications. ICS 2022: 1:1-1:13 - [i4]Aditya Desai, Keren Zhou, Anshumali Shrivastava:
Efficient model compression with Random Operation Access Specific Tile (ROAST) hashing. CoRR abs/2207.10702 (2022) - 2021
- [j2]Keren Zhou
, Laksono Adhianto
, Jonathon M. Anderson
, Aaron Cherian, Dejan Grubisic, Mark Krentel, Yumeng Liu
, Xiaozhu Meng, John M. Mellor-Crummey
:
Measurement and analysis of GPU-accelerated applications with HPCToolkit. Parallel Comput. 108: 102837 (2021) - [c12]Keren Zhou
, Xiaozhu Meng, Ryuichi Sai, John M. Mellor-Crummey
:
GPA: A GPU Performance Advisor Based on Instruction Sampling. CGO 2021: 115-125 - [c11]Barbara M. Chapman, Buu Pham, Charlene Yang, Christopher S. Daley
, Colleen Bertoni, Dhruva Kulkarni, Dossay Oryspayev, Ed D'Azevedo, Johannes Doerfert, Keren Zhou
, Kiran Ravikumar, Mark Gordon, Mauro Del Ben, Meifeng Lin, Melisa Alkan, Michael Kruse, Oscar R. Hernandez, P. K. Yeung, Paul Lin, Peng Xu, Swaroop Pophale, Tosaporn Sattasathuchana, Vivek Kale, William P. Huhn
, Yun (Helen) He:
Outcomes of OpenMP Hackathon: OpenMP Application Experiences with the Offloading Model (Part I). IWOMP 2021: 67-80 - [c10]Barbara M. Chapman, Buu Pham, Charlene Yang, Christopher S. Daley
, Colleen Bertoni, Dhruva Kulkarni, Dossay Oryspayev, Ed D'Azevedo, Johannes Doerfert, Keren Zhou, Kiran Ravikumar, Mark Gordon, Mauro Del Ben, Meifeng Lin, Melisa Alkan, Michael Kruse, Oscar R. Hernandez, P. K. Yeung, Paul Lin, Peng Xu
, Swaroop Pophale, Tosaporn Sattasathuchana, Vivek Kale, William P. Huhn
, Yun (Helen) He:
Outcomes of OpenMP Hackathon: OpenMP Application Experiences with the Offloading Model (Part II). IWOMP 2021: 81-95 - [c9]Aaron Cherian, Keren Zhou
, Dejan Grubisic, Xiaozhu Meng, John M. Mellor-Crummey
:
Measurement and Analysis of GPU-Accelerated OpenCL Computations on Intel GPUs. ProTools@SC 2021: 26-35 - [i3]Keren Zhou, Laksono Adhianto, Jonathon M. Anderson, Aaron Cherian, Dejan Grubisic, Mark Krentel, Yumeng Liu, Xiaozhu Meng, John M. Mellor-Crummey:
Measurement and Analysis of GPU-accelerated Applications with HPCToolkit. CoRR abs/2109.06931 (2021) - 2020
- [c8]Keren Zhou
, Mark W. Krentel, John M. Mellor-Crummey
:
Tools for top-down performance analysis of GPU-accelerated applications. ICS 2020: 26:1-26:12 - [c7]Keren Zhou
, Mark Krentel, John M. Mellor-Crummey
:
A tool for top-down performance analysis of GPU-accelerated applications. PPoPP 2020: 415-416 - [c6]Keren Zhou
, Yueming Hao
, John M. Mellor-Crummey
, Xiaozhu Meng, Xu Liu:
GVProf: a value profiler for GPU-based clusters. SC 2020: 89 - [i2]Keren Zhou, Xiaozhu Meng, Ryuichi Sai, John M. Mellor-Crummey:
GPA: A GPU Performance Advisor Based on Instruction Sampling. CoRR abs/2009.04061 (2020)
2010 – 2019
- 2019
- [c5]Keren Zhou
, John M. Mellor-Crummey
:
A Tool for Performance Analysis of GPU-Accelerated Applications. CGO 2019: 282 - 2018
- [j1]Keren Zhou
, Guangming Tan, Wei Zhou
:
Quadboost: A Scalable Concurrent Quadtree. IEEE Trans. Parallel Distributed Syst. 29(3): 673-686 (2018) - 2017
- [c4]Keren Zhou
, Guangming Tan, Xiuxia Zhang, Chaowei Wang, Ninghui Sun:
A performance analysis framework for exploiting GPU microarchitectural capability. ICS 2017: 15:1-15:10 - [c3]Xiuxia Zhang, Guangming Tan, Shuangbai Xue, Jiajia Li
, Keren Zhou
, Mingyu Chen:
Understanding the GPU Microarchitecture to Achieve Bare-Metal Performance Tuning. PPoPP 2017: 31-43 - 2016
- [i1]Keren Zhou, Guangming Tan, Wei Zhou:
Quadboost: A Scalable Concurrent Quadtree. CoRR abs/1607.03292 (2016) - 2015
- [c2]Zi-long Tan, Keren Zhou, Hao Zhang, Wei Zhou:
BF-MapReduce: A Bloom Filter Based Efficient Lightweight Search. CIC 2015: 125-129 - [c1]Qiang Li, Maojie Gu, Keren Zhou, Xiaoming Sun:
Multi-Classes Feature Engineering with Sliding Window for Purchase Prediction in Mobile Commerce. ICDM Workshops 2015: 1048-1054
Coauthor Index
![](https://dblp.uni-trier.de./img/cog.dark.24x24.png)
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from ,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2025-02-08 00:48 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint