Zach DeVito
2020 – today
- 2024
- [c14]Jason Ansel, Edward Z. Yang, Horace He, Natalia Gimelshein, Animesh Jain, Michael Voznesensky, Bin Bao, Peter Bell, David Berard, Evgeni Burovski, Geeta Chauhan, Anjali Chourdia, Will Constable, Alban Desmaison, Zachary DeVito, Elias Ellison, Will Feng, Jiong Gong, Michael Gschwind, Brian Hirsh, Sherlock Huang, Kshiteej Kalambarkar, Laurent Kirsch, Michael Lazos, Mario Lezcano, Yanbo Liang, Jason Liang, Yinghai Lu, C. K. Luk, Bert Maher, Yunjie Pan, Christian Puhrsch, Matthias Reso, Mark Saroufim, Marcos Yukio Siraichi, Helen Suk, Shunting Zhang, Michael Suo, Phil Tillet, Xu Zhao, Eikan Wang, Keren Zhou, Richard Zou, Xiaodong Wang, Ajit Mathews, William Wen, Gregory Chanan, Peng Wu, Soumith Chintala:
PyTorch 2: Faster Machine Learning Through Dynamic Python Bytecode Transformation and Graph Compilation. ASPLOS (2) 2024: 929-947
- [c13]Samuel Hsia, Alicia Golden, Bilge Acun, Newsha Ardalani, Zachary DeVito, Gu-Yeon Wei, David Brooks, Carole-Jean Wu:
MAD-Max Beyond Single-Node: Enabling Large Machine Learning Model Acceleration on Distributed Systems. ISCA 2024: 818-833
- [c12]Alicia Golden, Samuel Hsia, Fei Sun, Bilge Acun, Basil Hosmer, Yejin Lee, Zachary DeVito, Jeff Johnson, Gu-Yeon Wei, David Brooks, Carole-Jean Wu:
Generative AI Beyond LLMs: System Implications of Multi-Modal Generation. ISPASS 2024: 257-267
- [i10]Alicia Golden, Samuel Hsia, Fei Sun, Bilge Acun, Basil Hosmer, Yejin Lee, Zachary DeVito, Jeff Johnson, Gu-Yeon Wei, David Brooks, Carole-Jean Wu:
Is Flash Attention Stable? CoRR abs/2405.02803 (2024)
- 2023
- [c11]Shen Li, Pritam Damania, Luca Wehrstedt, Rohan Varma, Omkar Salpekar, Pavel Belevich, Howard Huang, Yanli Zhao, Lucas Hosseini, Wanchao Liang, Hongyi Jia, Shihao Xu, Satendra Gera, Alisson G. Azzolini, Guoqiang Jerry Chen, Zachary DeVito, Chaoyang He, Amir Ziashahabi, Alban Desmaison, Edward Z. Yang, Gregory Chanan, Brian Vaughan, Manoj Krishnan, Joseph S. Spisak, Salman Avestimehr, Soumith Chintala:
PyTorch RPC: Distributed Deep Learning Built on Tensor-Optimized Remote Procedure Calls. MLSys 2023
- [i9]Igor Molybog, Peter Albert, Moya Chen, Zachary DeVito, David Esiobu, Naman Goyal, Punit Singh Koura, Sharan Narang, Andrew Poulton, Ruan Silva, Binh Tang, Diana Liskovich, Puxin Xu, Yuchen Zhang, Melanie Kambadur, Stephen Roller, Susan Zhang:
A Theory on Adam Instability in Large-Scale Machine Learning. CoRR abs/2304.09871 (2023)
- [i8]Samuel Hsia, Alicia Golden, Bilge Acun, Newsha Ardalani, Zachary DeVito, Gu-Yeon Wei, David Brooks, Carole-Jean Wu:
MAD Max Beyond Single-Node: Enabling Large Machine Learning Model Acceleration on Distributed Systems. CoRR abs/2310.02784 (2023)
- [i7]Alicia Golden, Samuel Hsia, Fei Sun, Bilge Acun, Basil Hosmer, Yejin Lee, Zachary DeVito, Jeff Johnson, Gu-Yeon Wei, David Brooks, Carole-Jean Wu:
Generative AI Beyond LLMs: System Implications of Multi-Modal Generation. CoRR abs/2312.14385 (2023)
- 2022
- [c10]James K. Reed, Zachary DeVito, Horace He, Ansley Ussery, Jason Ansel:
torch.fx: Practical Program Capture and Transformation for Deep Learning in Python. MLSys 2022
- 2021
- [i6]Zachary DeVito, Jason Ansel, Will Constable, Michael Suo, Ailing Zhang, Kim M. Hazelwood:
Using Python for Model Inference in Deep Learning. CoRR abs/2104.00254 (2021)
- [i5]James K. Reed, Zachary DeVito, Horace He, Ansley Ussery, Jason Ansel:
torch.fx: Practical Program Capture and Transformation for Deep Learning in Python. CoRR abs/2112.08429 (2021)
- 2020
- [j5]Nicolas Vasilache, Oleksandr Zinenko, Theodoros Theodoridis, Priya Goyal, Zachary DeVito, William S. Moses, Sven Verdoolaege, Andrew Adams, Albert Cohen:
The Next 700 Accelerated Layers: From Mathematical Expressions of Network Computation Graphs to Accelerated GPU Kernels, Automatically. ACM Trans. Archit. Code Optim. 16(4): 38:1-38:26 (2020)
2010 – 2019
- 2019
- [c9]Adam Paszke, Sam Gross, Francisco Massa, Adam Lerer, James Bradbury, Gregory Chanan, Trevor Killeen, Zeming Lin, Natalia Gimelshein, Luca Antiga, Alban Desmaison, Andreas Köpf, Edward Z. Yang, Zachary DeVito, Martin Raison, Alykhan Tejani, Sasank Chilamkurthy, Benoit Steiner, Lu Fang, Junjie Bai, Soumith Chintala:
PyTorch: An Imperative Style, High-Performance Deep Learning Library. NeurIPS 2019: 8024-8035
- [i4]Adam Paszke, Sam Gross, Francisco Massa, Adam Lerer, James Bradbury, Gregory Chanan, Trevor Killeen, Zeming Lin, Natalia Gimelshein, Luca Antiga, Alban Desmaison, Andreas Köpf, Edward Z. Yang, Zach DeVito, Martin Raison, Alykhan Tejani, Sasank Chilamkurthy, Benoit Steiner, Lu Fang, Junjie Bai, Soumith Chintala:
PyTorch: An Imperative Style, High-Performance Deep Learning Library. CoRR abs/1912.01703 (2019)
- 2018
- [i3]Nicolas Vasilache, Oleksandr Zinenko, Theodoros Theodoridis, Priya Goyal, Zachary DeVito, William S. Moses, Sven Verdoolaege, Andrew Adams, Albert Cohen:
Tensor Comprehensions: Framework-Agnostic High-Performance Machine Learning Abstractions. CoRR abs/1802.04730 (2018)
- 2017
- [j4]Zachary DeVito, Michael Mara, Michael Zollhöfer, Gilbert Bernstein, Jonathan Ragan-Kelley, Christian Theobalt, Pat Hanrahan, Matthew Fisher, Matthias Nießner:
Opt: A Domain Specific Language for Non-Linear Least Squares Optimization in Graphics and Imaging. ACM Trans. Graph. 36(5): 171:1-171:27 (2017)
- 2016
- [j3]Gilbert Louis Bernstein, Chinmayee Shah, Crystal Lemire, Zachary DeVito, Matthew Fisher, Philip Alexander Levis, Pat Hanrahan:
Ebb: A DSL for Physical Simulation on CPUs and GPUs. ACM Trans. Graph. 35(2): 21:1-21:12 (2016)
- [j2]James Hegarty, Ross G. Daly, Zachary DeVito, Mark Horowitz, Pat Hanrahan, Jonathan Ragan-Kelley:
Rigel: flexible multi-rate image processing hardware. ACM Trans. Graph. 35(4): 85:1-85:11 (2016)
- [i2]Zachary DeVito, Michael Mara, Michael Zollhöfer, Gilbert Louis Bernstein, Jonathan Ragan-Kelley, Christian Theobalt, Pat Hanrahan, Matthew Fisher, Matthias Nießner:
Opt: A Domain Specific Language for Non-linear Least Squares Optimization in Graphics and Imaging. CoRR abs/1604.06525 (2016)
- 2015
- [c8]Zachary DeVito, Pat Hanrahan:
The Design of Terra: Harnessing the Best Features of High-Level and Low-Level Languages. SNAPL 2015: 79-89
- [i1]Gilbert Louis Bernstein, Chinmayee Shah, Crystal Lemire, Zachary DeVito, Matthew Fisher, Philip Alexander Levis, Pat Hanrahan:
Ebb: A DSL for Physical Simluation on CPUs and GPUs. CoRR abs/1506.07577 (2015)
- 2014
- [b1]Zach DeVito:
Terra: simplifying high-performance programming using multi-stage programming. Stanford University, USA, 2014
- [j1]James Hegarty, John S. Brunhaver, Zachary DeVito, Jonathan Ragan-Kelley, Noy Cohen, Steven Bell, Artem Vasilyev, Mark Horowitz, Pat Hanrahan:
Darkroom: compiling high-level image processing code into hardware pipelines. ACM Trans. Graph. 33(4): 144:1-144:11 (2014)
- [c7]Justin Talbot, Zachary DeVito, Pat Hanrahan:
Just-in-time Length Specialization of Dynamic Vector Code. ARRAY@PLDI 2014: 20-25
- [c6]Zachary DeVito, Daniel Ritchie, Matthew Fisher, Alex Aiken, Pat Hanrahan:
First-class runtime generation of high-performance types using exotypes. PLDI 2014: 77-88
- 2013
- [c5]Ian Karlin, Abhinav Bhatele, Jeff Keasler, Bradford L. Chamberlain, Jonathan D. Cohen, Zachary DeVito, Riyaz Haque, Dan Laney, Edward Luke, Felix Wang, David F. Richards, Martin Schulz, Charles H. Still:
Exploring Traditional and Emerging Parallel Programming Models Using a Proxy Application. IPDPS 2013: 919-932
- [c4]Zachary DeVito, James Hegarty, Alex Aiken, Pat Hanrahan, Jan Vitek:
Terra: a multi-stage language for high-performance computing. PLDI 2013: 105-116
- 2012
- [c3]Justin Talbot, Zachary DeVito, Pat Hanrahan:
Riposte: a trace-driven compiler and parallel VM for vector code in R. PACT 2012: 43-52
- 2011
- [c2]Zach DeVito, Niels Joubert, Francisco Palacios, Stephen Oakley, Montserrat Medina, Mike Barrientos, Erich Elsen, Frank Ham, Alex Aiken, Karthik Duraisamy, Eric Darve, Juan J. Alonso, Pat Hanrahan:
Liszt: a domain specific language for building portable mesh-based PDE solvers. SC 2011: 9:1-9:12
- 2010
- [c1]Hassan Chafi, Zach DeVito, Adriaan Moors, Tiark Rompf, Arvind K. Sujeeth, Pat Hanrahan, Martin Odersky, Kunle Olukotun:
Language virtualization for heterogeneous parallel computing. OOPSLA 2010: 835-847
last updated on 2024-08-18 00:32 CEST by the dblp team
all metadata released as open data under CC0 1.0 license