Bryan Catanzaro
Person information
- affiliation: Baidu Inc., Sunnyvale, USA
- affiliation: University of California, Berkeley, Department of Electrical Engineering and Computer Sciences
- affiliation: Brigham Young University, Electrical and Computer Engineering Department
2020 – today
- 2024
- [j8]Aysegul Dundar, Jun Gao, Andrew Tao, Bryan Catanzaro:
Progressive Learning of 3D Reconstruction Network From 2D GAN Data. IEEE Trans. Pattern Anal. Mach. Intell. 46(2): 793-804 (2024) - [c73]Jialin Song, Aidan M. Swope, Robert Kirby, Rajarshi Roy, Saad Godil, Jonathan Raiman, Bryan Catanzaro:
CircuitVAE: Efficient and Scalable Latent Circuit Optimization. DAC 2024: 302:1-302:6 - [c72]Jupinder Parmar, Shrimai Prabhumoye, Joseph Jennings, Bo Liu, Aastha Jhunjhunwala, Zhilin Wang, Mostofa Patwary, Mohammad Shoeybi, Bryan Catanzaro:
Data, Data Everywhere: A Guide for Pretraining Dataset Construction. EMNLP 2024: 10671-10695 - [c71]Jiaxuan You, Mingjie Liu, Shrimai Prabhumoye, Mostofa Patwary, Mohammad Shoeybi, Bryan Catanzaro:
LLM-Evolve: Evaluation for LLM's Evolving Capability on Benchmarks. EMNLP 2024: 16937-16942 - [c70]Akshit Arora, Rohan Badlani, Sungwon Kim, Rafael Valle, Bryan Catanzaro:
Scaling Nvidia's Multi-Speaker Multi-Lingual TTS Systems With Zero-Shot TTS to Indic Languages. ICASSP Workshops 2024: 115-116 - [c69]Peng Xu, Wei Ping, Xianchao Wu, Lawrence McAfee, Chen Zhu, Zihan Liu, Sandeep Subramanian, Evelina Bakhturina, Mohammad Shoeybi, Bryan Catanzaro:
Retrieval meets Long Context Large Language Models. ICLR 2024 - [c68]Lichang Chen, Chen Zhu, Jiuhai Chen, Davit Soselia, Tianyi Zhou, Tom Goldstein, Heng Huang, Mohammad Shoeybi, Bryan Catanzaro:
ODIN: Disentangled Reward Mitigates Hacking in RLHF. ICML 2024 - [c67]Zhifeng Kong, Arushi Goel, Rohan Badlani, Wei Ping, Rafael Valle, Bryan Catanzaro:
Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities. ICML 2024 - [c66]Boxin Wang, Wei Ping, Lawrence McAfee, Peng Xu, Bo Li, Mohammad Shoeybi, Bryan Catanzaro:
InstructRetro: Instruction Tuning post Retrieval-Augmented Pretraining. ICML 2024 - [c65]Max Ehrlich, Jon Barker, Namitha Padmanabhan, Larry Davis, Andrew Tao, Bryan Catanzaro, Abhinav Shrivastava:
Leveraging Bitstream Metadata for Fast, Accurate, Generalized Compressed Video Quality Enhancement. WACV 2024: 1506-1516 - [i99]Zihan Liu, Wei Ping, Rajarshi Roy, Peng Xu, Chankyu Lee, Mohammad Shoeybi, Bryan Catanzaro:
ChatQA: Building GPT-4 Level Conversational QA Models. CoRR abs/2401.10225 (2024) - [i98]Akshit Arora, Rohan Badlani, Sungwon Kim, Rafael Valle, Bryan Catanzaro:
Scaling NVIDIA's Multi-speaker Multi-lingual TTS Systems with Zero-Shot TTS to Indic Languages. CoRR abs/2401.13851 (2024) - [i97]Zhifeng Kong, Arushi Goel, Rohan Badlani, Wei Ping, Rafael Valle, Bryan Catanzaro:
Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities. CoRR abs/2402.01831 (2024) - [i96]Lichang Chen, Chen Zhu, Davit Soselia, Jiuhai Chen, Tianyi Zhou, Tom Goldstein, Heng Huang, Mohammad Shoeybi, Bryan Catanzaro:
ODIN: Disentangled Reward Mitigates Hacking in RLHF. CoRR abs/2402.07319 (2024) - [i95]Jupinder Parmar, Shrimai Prabhumoye, Joseph Jennings, Mostofa Patwary, Sandeep Subramanian, Dan Su, Chen Zhu, Deepak Narayanan, Aastha Jhunjhunwala, Ayush Dattagupta, Vibhu Jawa, Jiwei Liu, Ameya Mahabaleshwarkar, Osvald Nitski, Annika Brundyn, James Maki, Miguel Martinez, Jiaxuan You, John Kamalu, Patrick LeGresley, Denys Fridman, Jared Casper, Ashwath Aithal, Oleksii Kuchaiev, Mohammad Shoeybi, Jonathan M. Cohen, Bryan Catanzaro:
Nemotron-4 15B Technical Report. CoRR abs/2402.16819 (2024) - [i94]Arushi Goel, Zhifeng Kong, Rafael Valle, Bryan Catanzaro:
Audio Dialogues: Dialogues dataset for audio and music understanding. CoRR abs/2404.07616 (2024) - [i93]Chankyu Lee, Rajarshi Roy, Mengyao Xu, Jonathan Raiman, Mohammad Shoeybi, Bryan Catanzaro, Wei Ping:
NV-Embed: Improved Techniques for Training LLMs as Generalist Embedding Models. CoRR abs/2405.17428 (2024) - [i92]Roger Waleffe, Wonmin Byeon, Duncan Riach, Brandon Norick, Vijay Korthikanti, Tri Dao, Albert Gu, Ali Hatamizadeh, Sudhakar Singh, Deepak Narayanan, Garvit Kulshreshtha, Vartika Singh, Jared Casper, Jan Kautz, Mohammad Shoeybi, Bryan Catanzaro:
An Empirical Study of Mamba-based Language Models. CoRR abs/2406.07887 (2024) - [i91]Jialin Song, Aidan M. Swope, Robert Kirby, Rajarshi Roy, Saad Godil, Jonathan Raiman, Bryan Catanzaro:
CircuitVAE: Efficient and Scalable Latent Circuit Optimization. CoRR abs/2406.09535 (2024) - [i90]Bo Adler, Niket Agarwal, Ashwath Aithal, Dong H. Anh, Pallab Bhattacharya, Annika Brundyn, Jared Casper, Bryan Catanzaro, Sharon Clay, Jonathan M. Cohen, Sirshak Das, Ayush Dattagupta, Olivier Delalleau, Leon Derczynski, Yi Dong, Daniel Egert, Ellie Evans, Aleksander Ficek, Denys Fridman, Shaona Ghosh, Boris Ginsburg, Igor Gitman, Tomasz Grzegorzek, Robert Hero, Jining Huang, Vibhu Jawa, Joseph Jennings, Aastha Jhunjhunwala, John Kamalu, Sadaf Khan, Oleksii Kuchaiev, Patrick LeGresley, Hui Li, Jiwei Liu, Zihan Liu, Eileen Long, Ameya Sunil Mahabaleshwarkar, Somshubra Majumdar, James Maki, Miguel Martinez, Maer Rodrigues de Melo, Ivan Moshkov, Deepak Narayanan, Sean Narenthiran, Jesus Navarro, Phong Nguyen, Osvald Nitski, Vahid Noroozi, Guruprasad Nutheti, Christopher Parisien, Jupinder Parmar, Mostofa Patwary, Krzysztof Pawelec, Wei Ping, Shrimai Prabhumoye, Rajarshi Roy, Trisha Saar, Vasanth Rao Naik Sabavat, Sanjeev Satheesh, Jane Polak Scowcroft, Jason Sewall, Pavel Shamis, Gerald Shen, Mohammad Shoeybi, Dave Sizer, Misha Smelyanskiy, Felipe Soares, Makesh Narsimhan Sreedhar, Dan Su, Sandeep Subramanian, Shengyang Sun, Shubham Toshniwal, Hao Wang, Zhilin Wang, Jiaxuan You, Jiaqi Zeng, Jimmy Zhang, Jing Zhang, Vivienne Zhang, Yian Zhang, Chen Zhu:
Nemotron-4 340B Technical Report. CoRR abs/2406.11704 (2024) - [i89]Zhifeng Kong, Sang-gil Lee, Deepanway Ghosal, Navonil Majumder, Ambuj Mehrish, Rafael Valle, Soujanya Poria, Bryan Catanzaro:
Improving Text-To-Audio Models with Synthetic Captions. CoRR abs/2406.15487 (2024) - [i88]Yue Yu, Wei Ping, Zihan Liu, Boxin Wang, Jiaxuan You, Chao Zhang, Mohammad Shoeybi, Bryan Catanzaro:
RankRAG: Unifying Context Ranking with Retrieval-Augmented Generation in LLMs. CoRR abs/2407.02485 (2024) - [i87]Jupinder Parmar, Shrimai Prabhumoye, Joseph Jennings, Bo Li, Aastha Jhunjhunwala, Zhilin Wang, Mostofa Patwary, Mohammad Shoeybi, Bryan Catanzaro:
Data, Data Everywhere: A Guide for Pretraining Dataset Construction. CoRR abs/2407.06380 (2024) - [i86]Jupinder Parmar, Sanjeev Satheesh, Mostofa Patwary, Mohammad Shoeybi, Bryan Catanzaro:
Reuse, Don't Retrain: A Recipe for Continued Pretraining of Language Models. CoRR abs/2407.07263 (2024) - [i85]Peng Xu, Wei Ping, Xianchao Wu, Zihan Liu, Mohammad Shoeybi, Bryan Catanzaro:
ChatQA 2: Bridging the Gap to Proprietary LLMs in Long Context and RAG Capabilities. CoRR abs/2407.14482 (2024) - [i84]Saurav Muralidharan, Sharath Turuvekere Sreenivas, Raviraj Joshi, Marcin Chochowski, Mostofa Patwary, Mohammad Shoeybi, Bryan Catanzaro, Jan Kautz, Pavlo Molchanov:
Compact Language Models via Pruning and Knowledge Distillation. CoRR abs/2407.14679 (2024) - [i83]Jialin Song, Jonathan Raiman, Bryan Catanzaro:
Effective Large Language Model Debugging with Best-first Tree Search. CoRR abs/2407.19055 (2024) - [i82]Sharath Turuvekere Sreenivas, Saurav Muralidharan, Raviraj Joshi, Marcin Chochowski, Mostofa Patwary, Mohammad Shoeybi, Bryan Catanzaro, Jan Kautz, Pavlo Molchanov:
LLM Pruning and Distillation in Practice: The Minitron Approach. CoRR abs/2408.11796 (2024) - [i81]Min Shi, Fuxiao Liu, Shihao Wang, Shijia Liao, Subhashree Radhakrishnan, De-An Huang, Hongxu Yin, Karan Sapra, Yaser Yacoob, Humphrey Shi, Bryan Catanzaro, Andrew Tao, Jan Kautz, Zhiding Yu, Guilin Liu:
Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders. CoRR abs/2408.15998 (2024) - [i80]Wenliang Dai, Nayeon Lee, Boxin Wang, Zhuoling Yang, Zihan Liu, Jon Barker, Tuomas Rintamaki, Mohammad Shoeybi, Bryan Catanzaro, Wei Ping:
NVLM: Open Frontier-Class Multimodal LLMs. CoRR abs/2409.11402 (2024) - [i79]Mike Ranzinger, Jon Barker, Greg Heinrich, Pavlo Molchanov, Bryan Catanzaro, Andrew Tao:
PHI-S: Distribution Balancing for Label-Free Multi-Teacher Distillation. CoRR abs/2410.01680 (2024) - [i78]Sreyan Ghosh, Sonal Kumar, Zhifeng Kong, Rafael Valle, Bryan Catanzaro, Dinesh Manocha:
Synthio: Augmenting Small-Scale Audio Classification Datasets with Synthetic Data. CoRR abs/2410.02056 (2024) - [i77]Ethan He, Abhinav Khattar, Ryan Prenger, Vijay Korthikanti, Zijie Yan, Tong Liu, Shiqing Fan, Ashwath Aithal, Mohammad Shoeybi, Bryan Catanzaro:
Upcycling Large Language Models into Mixture of Experts. CoRR abs/2410.07524 (2024) - [i76]Arushi Goel, Karan Sapra, Matthieu Le, Rafael Valle, Andrew Tao, Bryan Catanzaro:
OMCAT: Omni Context Aware Transformer. CoRR abs/2410.12109 (2024) - [i75]Syeda Nahida Akter, Shrimai Prabhumoye, John Kamalu, Sanjeev Satheesh, Eric Nyberg, Mostofa Patwary, Mohammad Shoeybi, Bryan Catanzaro:
MIND: Math Informed syNthetic Dialogues for Pretraining LLMs. CoRR abs/2410.12881 (2024)
- 2023
- [j7]Guilin Liu, Aysegul Dundar, Kevin J. Shih, Ting-Chun Wang, Fitsum A. Reda, Karan Sapra, Zhiding Yu, Xiaodong Yang, Andrew Tao, Bryan Catanzaro:
Partial Convolution for Padding, Inpainting, and Image Synthesis. IEEE Trans. Pattern Anal. Mach. Intell. 45(5): 6096-6110 (2023) - [j6]Aysegul Dundar, Jun Gao, Andrew Tao, Bryan Catanzaro:
Fine Detailed Texture Learning for 3D Meshes With Generative Models. IEEE Trans. Pattern Anal. Mach. Intell. 45(12): 14563-14574 (2023) - [c64]Bryan Catanzaro:
Language Models: The Most Important Compute Challenge of Our Time (Keynote). ASPLOS (3) 2023: 2 - [c63]Dan Su, Mostofa Patwary, Shrimai Prabhumoye, Peng Xu, Ryan Prenger, Mohammad Shoeybi, Pascale Fung, Anima Anandkumar, Bryan Catanzaro:
Context Generation Improves Open Domain Question Answering. EACL (Findings) 2023: 781-796 - [c62]Shrimai Prabhumoye, Mostofa Patwary, Mohammad Shoeybi, Bryan Catanzaro:
Adding Instructions during Pretraining: Effective way of Controlling Toxicity in Language Models. EACL 2023: 2628-2643 - [c61]Boxin Wang, Wei Ping, Peng Xu, Lawrence McAfee, Zihan Liu, Mohammad Shoeybi, Yi Dong, Oleksii Kuchaiev, Bo Li, Chaowei Xiao, Anima Anandkumar, Bryan Catanzaro:
Shall We Pretrain Autoregressive Language Models with Retrieval? A Comprehensive Study. EMNLP 2023: 7763-7786 - [c60]Zhuolin Yang, Wei Ping, Zihan Liu, Vijay Korthikanti, Weili Nie, De-An Huang, Linxi Fan, Zhiding Yu, Shiyi Lan, Bo Li, Mohammad Shoeybi, Ming-Yu Liu, Yuke Zhu, Bryan Catanzaro, Chaowei Xiao, Anima Anandkumar:
Re-ViLM: Retrieval-Augmented Visual Language Model for Zero and Few-Shot Image Captioning. EMNLP (Findings) 2023: 11844-11857 - [c59]Rohan Badlani, Akshit Arora, Subhankar Ghosh, Rafael Valle, Kevin J. Shih, João Felipe Santos, Boris Ginsburg, Bryan Catanzaro:
Vani: Very-Lightweight Accent-Controllable TTS for Native And Non-Native Speakers With Identity Preservation. ICASSP 2023: 1-2 - [c58]Sudheer Kovela, Rafael Valle, Ambrish Dantrey, Bryan Catanzaro:
Any-to-Any Voice Conversion with F0 and Timbre Disentanglement and Novel Timbre Conditioning. ICASSP 2023: 1-5 - [c57]Rafael Valle, João Felipe Santos, Kevin J. Shih, Rohan Badlani, Bryan Catanzaro:
High-Acoustic Fidelity Text To Speech Synthesis With Fine-Grained Control Of Speech Attributes. ICASSP 2023: 1-5 - [c56]Ahmed Agiza, Rajarshi Roy, Teodor-Dumitru Ene, Saad Godil, Sherief Reda, Bryan Catanzaro:
GraPhSyM: Graph Physical Synthesis Model. ICCAD 2023: 1-9 - [c55]Songwei Ge, Seungjun Nah, Guilin Liu, Tyler Poon, Andrew Tao, Bryan Catanzaro, David Jacobs, Jia-Bin Huang, Ming-Yu Liu, Yogesh Balaji:
Preserve Your Own Correlation: A Noise Prior for Video Diffusion Models. ICCV 2023: 22873-22884 - [c54]Sang-gil Lee, Wei Ping, Boris Ginsburg, Bryan Catanzaro, Sungroh Yoon:
BigVGAN: A Universal Neural Vocoder with Large-Scale Training. ICLR 2023 - [c53]Rohan Badlani, Rafael Valle, Kevin J. Shih, João Felipe Santos, Siddharth Gururani, Bryan Catanzaro:
RAD-MMM: Multilingual Multiaccented Multispeaker Text To Speech. INTERSPEECH 2023: 626-630 - [c52]Zhifeng Kong, Wei Ping, Ambrish Dantrey, Bryan Catanzaro:
CleanUNet 2: A Hybrid Speech Denoising Model on Waveform and Spectrogram. INTERSPEECH 2023: 790-794 - [c51]Vijay Anand Korthikanti, Jared Casper, Sangkug Lym, Lawrence McAfee, Michael Andersch, Mohammad Shoeybi, Bryan Catanzaro:
Reducing Activation Recomputation in Large Transformer Models. MLSys 2023 - [c50]Sungwon Kim, Kevin J. Shih, Rohan Badlani, João Felipe Santos, Evelina Bakhturina, Mikyas Desta, Rafael Valle, Sungroh Yoon, Bryan Catanzaro:
P-Flow: A Fast and Data-Efficient Zero-Shot TTS through Speech Prompting. NeurIPS 2023 - [i74]Rohan Badlani, Rafael Valle, Kevin J. Shih, João Felipe Santos, Siddharth Gururani, Bryan Catanzaro:
Multilingual Multiaccented Multispeaker TTS with RADTTS. CoRR abs/2301.10335 (2023) - [i73]Zhuolin Yang, Wei Ping, Zihan Liu, Vijay Korthikanti, Weili Nie, De-An Huang, Linxi Fan, Zhiding Yu, Shiyi Lan, Bo Li, Ming-Yu Liu, Yuke Zhu, Mohammad Shoeybi, Bryan Catanzaro, Chaowei Xiao, Anima Anandkumar:
Re-ViLM: Retrieval-Augmented Visual Language Model for Zero and Few-Shot Image Captioning. CoRR abs/2302.04858 (2023) - [i72]Shrimai Prabhumoye, Mostofa Patwary, Mohammad Shoeybi, Bryan Catanzaro:
Adding Instructions during Pretraining: Effective Way of Controlling Toxicity in Language Models. CoRR abs/2302.07388 (2023) - [i71]Rohan Badlani, Akshit Arora, Subhankar Ghosh, Rafael Valle, Kevin J. Shih, João Felipe Santos, Boris Ginsburg, Bryan Catanzaro:
VANI: Very-lightweight Accent-controllable TTS for Native and Non-native speakers with Identity Preservation. CoRR abs/2303.07578 (2023) - [i70]Boxin Wang, Wei Ping, Peng Xu, Lawrence McAfee, Zihan Liu, Mohammad Shoeybi, Yi Dong, Oleksii Kuchaiev, Bo Li, Chaowei Xiao, Anima Anandkumar, Bryan Catanzaro:
Shall We Pretrain Autoregressive Language Models with Retrieval? A Comprehensive Study. CoRR abs/2304.06762 (2023) - [i69]Songwei Ge, Seungjun Nah, Guilin Liu, Tyler Poon, Andrew Tao, Bryan Catanzaro, David Jacobs, Jia-Bin Huang, Ming-Yu Liu, Yogesh Balaji:
Preserve Your Own Correlation: A Noise Prior for Video Diffusion Models. CoRR abs/2305.10474 (2023) - [i68]Aysegul Dundar, Jun Gao, Andrew Tao, Bryan Catanzaro:
Progressive Learning of 3D Reconstruction Network from 2D GAN Data. CoRR abs/2305.11102 (2023) - [i67]Ahmed Agiza, Rajarshi Roy, Teodor-Dumitru Ene, Saad Godil, Sherief Reda, Bryan Catanzaro:
GraPhSyM: Graph Physical Synthesis Model. CoRR abs/2308.03944 (2023) - [i66]Jie Huang, Wei Ping, Peng Xu, Mohammad Shoeybi, Kevin Chen-Chuan Chang, Bryan Catanzaro:
RAVEN: In-Context Learning with Retrieval Augmented Encoder-Decoder Language Models. CoRR abs/2308.07922 (2023) - [i65]Zhifeng Kong, Wei Ping, Ambrish Dantrey, Bryan Catanzaro:
CleanUNet 2: A Hybrid Speech Denoising Model on Waveform and Spectrogram. CoRR abs/2309.05975 (2023) - [i64]Peng Xu, Wei Ping, Xianchao Wu, Lawrence McAfee, Chen Zhu, Zihan Liu, Sandeep Subramanian, Evelina Bakhturina, Mohammad Shoeybi, Bryan Catanzaro:
Retrieval meets Long Context Large Language Models. CoRR abs/2310.03025 (2023) - [i63]Boxin Wang, Wei Ping, Lawrence McAfee, Peng Xu, Bo Li, Mohammad Shoeybi, Bryan Catanzaro:
InstructRetro: Instruction Tuning post Retrieval-Augmented Pretraining. CoRR abs/2310.07713 (2023) - [i62]Mingjie Liu, Teodor-Dumitru Ene, Robert Kirby, Chris Cheng, Nathaniel Ross Pinckney, Rongjian Liang, Jonah Alben, Himyanshu Anand, Sanmitra Banerjee, Ismet Bayraktaroglu, Bonita Bhaskaran, Bryan Catanzaro, Arjun Chaudhuri, Sharon Clay, Bill Dally, Laura Dang, Parikshit Deshpande, Siddhanth Dhodhi, Sameer Halepete, Eric Hill, Jiashang Hu, Sumit Jain, Brucek Khailany, Kishor Kunal, Xiaowei Li, Hao Liu, Stuart F. Oberman, Sujeet Omar, Sreedhar Pratty, Jonathan Raiman, Ambar Sarkar, Zhengjiang Shao, Hanfei Sun, Pratik P. Suthar, Varun Tej, Kaizhe Xu, Haoxing Ren:
ChipNeMo: Domain-Adapted LLMs for Chip Design. CoRR abs/2311.00176 (2023)
- 2022
- [j5]Aysegul Dundar, Kevin J. Shih, Animesh Garg, Robert Pottorf, Andrew Tao, Bryan Catanzaro:
Unsupervised Disentanglement of Pose, Appearance and Background from Images and Videos. IEEE Trans. Pattern Anal. Mach. Intell. 44(7): 3883-3894 (2022) - [c49]Zihan Liu, Mostofa Patwary, Ryan Prenger, Shrimai Prabhumoye, Wei Ping, Mohammad Shoeybi, Bryan Catanzaro:
Multi-Stage Prompting for Knowledgeable Dialogue Generation. ACL (Findings) 2022: 1317-1337 - [c48]Peng Xu, Mostofa Patwary, Shrimai Prabhumoye, Virginia Adams, Ryan Prenger, Wei Ping, Nayeon Lee, Mohammad Shoeybi, Bryan Catanzaro:
Evaluating Parameter Efficient Learning for Generation. EMNLP 2022: 4824-4833 - [c47]Rohan Badlani, Adrian Lancucki, Kevin J. Shih, Rafael Valle, Wei Ping, Bryan Catanzaro:
One TTS Alignment to Rule Them All. ICASSP 2022: 6092-6096 - [c46]Zhifeng Kong, Wei Ping, Ambrish Dantrey, Bryan Catanzaro:
Speech Denoising in the Waveform Domain With Self-Attention. ICASSP 2022: 7867-7871 - [c45]John Guibas, Morteza Mardani, Zongyi Li, Andrew Tao, Anima Anandkumar, Bryan Catanzaro:
Efficient Token Mixing for Transformers via Adaptive Fourier Neural Operators. ICLR 2022 - [c44]Nayeon Lee, Wei Ping, Peng Xu, Mostofa Patwary, Pascale Fung, Mohammad Shoeybi, Bryan Catanzaro:
Factuality Enhanced Language Models for Open-Ended Text Generation. NeurIPS 2022 - [c43]Boxin Wang, Wei Ping, Chaowei Xiao, Peng Xu, Mostofa Patwary, Mohammad Shoeybi, Bo Li, Anima Anandkumar, Bryan Catanzaro:
Exploring the Limits of Domain-Adaptive Training for Detoxifying Large-Scale Language Models. NeurIPS 2022 - [i61]Shaden Smith, Mostofa Patwary, Brandon Norick, Patrick LeGresley, Samyam Rajbhandari, Jared Casper, Zhun Liu, Shrimai Prabhumoye, George Zerveas, Vijay Korthikanti, Elton Zheng, Rewon Child, Reza Yazdani Aminabadi, Julie Bernauer, Xia Song, Mohammad Shoeybi, Yuxiong He, Michael Houston, Saurabh Tiwary, Bryan Catanzaro:
Using DeepSpeed and Megatron to Train Megatron-Turing NLG 530B, A Large-Scale Generative Language Model. CoRR abs/2201.11990 (2022) - [i60]Max Ehrlich, Jon Barker, Namitha Padmanabhan, Larry Davis, Andrew Tao, Bryan Catanzaro, Abhinav Shrivastava:
Leveraging Bitstream Metadata for Fast and Accurate Video Compression Correction. CoRR abs/2202.00011 (2022) - [i59]Boxin Wang, Wei Ping, Chaowei Xiao, Peng Xu, Mostofa Patwary, Mohammad Shoeybi, Bo Li, Anima Anandkumar, Bryan Catanzaro:
Exploring the Limits of Domain-Adaptive Training for Detoxifying Large-Scale Language Models. CoRR abs/2202.04173 (2022) - [i58]Zhifeng Kong, Wei Ping, Ambrish Dantrey, Bryan Catanzaro:
Speech Denoising in the Waveform Domain with Self-Attention. CoRR abs/2202.07790 (2022) - [i57]Kevin J. Shih, Rafael Valle, Rohan Badlani, João Felipe Santos, Bryan Catanzaro:
Generative Modeling for Low Dimensional Speech Attributes with Neural Spline Flows. CoRR abs/2203.01786 (2022) - [i56]Zihan Liu, Mostofa Patwary, Ryan Prenger, Shrimai Prabhumoye, Wei Ping, Mohammad Shoeybi, Bryan Catanzaro:
Multi-Stage Prompting for Knowledgeable Dialogue Generation. CoRR abs/2203.08745 (2022) - [i55]Aysegul Dundar, Jun Gao, Andrew Tao, Bryan Catanzaro:
Fine Detailed Texture Learning for 3D Meshes with Generative Models. CoRR abs/2203.09362 (2022) - [i54]Vijay Korthikanti, Jared Casper, Sangkug Lym, Lawrence McAfee, Michael Andersch, Mohammad Shoeybi, Bryan Catanzaro:
Reducing Activation Recomputation in Large Transformer Models. CoRR abs/2205.05198 (2022) - [i53]Rajarshi Roy, Jonathan Raiman, Neel Kant, Ilyas Elkin, Robert Kirby, Michael Y. Siu, Stuart F. Oberman, Saad Godil, Bryan Catanzaro:
PrefixRL: Optimization of Parallel Prefix Circuits using Deep Reinforcement Learning. CoRR abs/2205.07000 (2022) - [i52]Nayeon Lee, Wei Ping, Peng Xu, Mostofa Patwary, Mohammad Shoeybi, Bryan Catanzaro:
Factuality Enhanced Language Models for Open-Ended Text Generation. CoRR abs/2206.04624 (2022) - [i51]Sang-gil Lee, Wei Ping, Boris Ginsburg, Bryan Catanzaro, Sungroh Yoon:
BigVGAN: A Universal Neural Vocoder with Large-Scale Training. CoRR abs/2206.04658 (2022) - [i50]Dan Su, Mostofa Patwary, Shrimai Prabhumoye, Peng Xu, Ryan Prenger, Mohammad Shoeybi, Pascale Fung, Anima Anandkumar, Bryan Catanzaro:
Context Generation Improves Open Domain Question Answering. CoRR abs/2210.06349 (2022) - [i49]Peng Xu, Mostofa Patwary, Shrimai Prabhumoye, Virginia Adams, Ryan J. Prenger, Wei Ping, Nayeon Lee, Mohammad Shoeybi, Bryan Catanzaro:
Evaluating Parameter Efficient Learning for Generation. CoRR abs/2210.13673 (2022) - [i48]Yogesh Balaji, Seungjun Nah, Xun Huang, Arash Vahdat, Jiaming Song, Karsten Kreis, Miika Aittala, Timo Aila, Samuli Laine, Bryan Catanzaro, Tero Karras, Ming-Yu Liu:
eDiff-I: Text-to-Image Diffusion Models with an Ensemble of Expert Denoisers. CoRR abs/2211.01324 (2022)
- 2021
- [c42]Devendra Singh Sachan, Mostofa Patwary, Mohammad Shoeybi, Neel Kant, Wei Ping, William L. Hamilton, Bryan Catanzaro:
End-to-End Training of Neural Retrievers for Open-Domain Question Answering. ACL/IJCNLP (1) 2021: 6648-6662 - [c41]Anand Bhattad, Aysegul Dundar, Guilin Liu, Andrew Tao, Bryan Catanzaro:
View Generalization for Single Image Textured 3D Models. CVPR 2021: 6081-6090 - [c40]Rajarshi Roy, Jonathan Raiman, Neel Kant, Ilyas Elkin, Robert Kirby, Michael Y. Siu, Stuart F. Oberman, Saad Godil, Bryan Catanzaro:
PrefixRL: Optimization of Parallel Prefix Circuits using Deep Reinforcement Learning. DAC 2021: 853-858 - [c39]Ning Yu, Guilin Liu, Aysegul Dundar, Andrew Tao, Bryan Catanzaro, Larry Davis, Mario Fritz:
Dual Contrastive Loss and Attention for GANs. ICCV 2021: 6711-6722 - [c38]Zhifeng Kong, Wei Ping, Jiaji Huang, Kexin Zhao, Bryan Catanzaro:
DiffWave: A Versatile Diffusion Model for Audio Synthesis. ICLR 2021 - [c37]Rafael Valle, Kevin J. Shih, Ryan Prenger, Bryan Catanzaro:
Flowtron: an Autoregressive Flow-based Generative Network for Text-to-Speech Synthesis. ICLR 2021 - [c36]Chen Zhu, Wei Ping, Chaowei Xiao, Mohammad Shoeybi, Tom Goldstein, Anima Anandkumar, Bryan Catanzaro:
Long-Short Transformer: Efficient Transformers for Language and Vision. NeurIPS 2021: 17723-17736 - [c35]Deepak Narayanan, Mohammad Shoeybi, Jared Casper, Patrick LeGresley, Mostofa Patwary, Vijay Korthikanti, Dmitri Vainbrand, Prethvi Kashinkunti, Julie Bernauer, Bryan Catanzaro, Amar Phanishayee, Matei Zaharia:
Efficient large-scale language model training on GPU clusters using megatron-LM. SC 2021: 58 - [i47]Devendra Singh Sachan, Mostofa Patwary, Mohammad Shoeybi, Neel Kant, Wei Ping, William L. Hamilton, Bryan Catanzaro:
End-to-End Training of Neural Retrievers for Open-Domain Question Answering. CoRR abs/2101.00408 (2021) - [i46]Ning Yu, Guilin Liu, Aysegul Dundar, Andrew Tao, Bryan Catanzaro, Larry Davis, Mario Fritz:
Dual Contrastive Loss and Attention for GANs. CoRR abs/2103.16748 (2021) - [i45]Deepak Narayanan, Mohammad Shoeybi, Jared Casper, Patrick LeGresley, Mostofa Patwary, Vijay Korthikanti, Dmitri Vainbrand, Prethvi Kashinkunti, Julie Bernauer, Bryan Catanzaro, Amar Phanishayee, Matei Zaharia:
Efficient Large-Scale Language Model Training on GPU Clusters. CoRR abs/2104.04473 (2021) - [i44]Anand Bhattad, Aysegul Dundar, Guilin Liu, Andrew Tao, Bryan Catanzaro:
View Generalization for Single Image Textured 3D Models. CoRR abs/2106.06533 (2021) - [i43]Chen Zhu, Wei Ping, Chaowei Xiao, Mohammad Shoeybi, Tom Goldstein, Anima Anandkumar, Bryan Catanzaro:
Long-Short Transformer: Efficient Transformers for Language and Vision. CoRR abs/2107.02192 (2021) - [i42]Rohan Badlani, Adrian Lancucki, Kevin J. Shih, Rafael Valle, Wei Ping, Bryan Catanzaro:
One TTS Alignment To Rule Them All. CoRR abs/2108.10447 (2021) - [i41]Robert Kirby, Kolby Nottingham, Rajarshi Roy, Saad Godil, Bryan Catanzaro:
Guiding Global Placement With Reinforcement Learning. CoRR abs/2109.02631 (2021) - [i40]John Guibas, Morteza Mardani, Zongyi Li, Andrew Tao, Anima Anandkumar, Bryan Catanzaro:
Adaptive Fourier Neural Operators: Efficient Token Mixers for Transformers. CoRR abs/2111.13587 (2021) - [i39]Shrimai Prabhumoye, Rafal Kocielnik, Mohammad Shoeybi, Anima Anandkumar, Bryan Catanzaro:
Few-shot Instruction Prompts for Pretrained Language Models to Detect Social Biases. CoRR abs/2112.07868 (2021)
- 2020
- [j4]Brucek Khailany, Haoxing Ren, Steve Dai, Saad Godil, Ben Keller, Robert Kirby, Alicia Klinefelter, Rangharajan Venkatesan, Yanqing Zhang, Bryan Catanzaro, William J. Dally:
Accelerating Chip Design With Machine Learning. IEEE Micro 40(6): 23-32 (2020) - [c34]Alex Boyd, Raul Puri, Mohammad Shoeybi, Mostofa Patwary, Bryan Catanzaro:
Large Scale Multi-Actor Generative Dialog Modeling. ACL 2020: 66-84 - [c33]Aysegul Dundar, Karan Sapra, Guilin Liu, Andrew Tao, Bryan Catanzaro:
Panoptic-Based Image Synthesis. CVPR 2020: 8067-8076 - [c32]Peng Xu, Mostofa Patwary, Mohammad Shoeybi, Raul Puri, Pascale Fung, Anima Anandkumar, Bryan Catanzaro:
MEGATRON-CNTRL: Controllable Story Generation with External Knowledge Using Large-Scale Language Models. EMNLP (1) 2020: 2831-2845 - [c31]Raul Puri, Ryan Spring, Mohammad Shoeybi, Mostofa Patwary, Bryan Catanzaro:
Training Question Answering Models From Synthetic Data. EMNLP (1) 2020: 5811-5826 - [c30]Rafael Valle, Jason Li, Ryan Prenger, Bryan Catanzaro:
Mellotron: Multispeaker Expressive Voice Synthesis by Conditioning on Rhythm, Pitch and Global Style Tokens. ICASSP 2020: 6189-6193 - [c29]Vitaly Kurin, Saad Godil, Shimon Whiteson, Bryan Catanzaro:
Can Q-Learning with Graph Networks Learn a Generalizable Branching Heuristic for a SAT Solver? NeurIPS 2020 - [c28]Morteza Mardani, Guilin Liu, Aysegul Dundar, Shiqiu Liu, Andrew Tao, Bryan Catanzaro:
Neural FFTs for Universal Texture Image Synthesis. NeurIPS 2020 - [i38]Aysegul Dundar, Kevin J. Shih, Animesh Garg, Robert Pottorf, Andrew Tao, Bryan Catanzaro:
Unsupervised Disentanglement of Pose, Appearance and Background from Images and Videos. CoRR abs/2001.09518 (2020) - [i37]Raul Puri, Ryan Spring, Mostofa Patwary, Mohammad Shoeybi, Bryan Catanzaro:
Training Question Answering Models From Synthetic Data. CoRR abs/2002.09599 (2020) - [i36]Aysegul Dundar, Karan Sapra, Guilin Liu, Andrew Tao, Bryan Catanzaro:
Panoptic-based Image Synthesis. CoRR abs/2004.10289 (2020) - [i35]Rafael Valle, Kevin J. Shih, Ryan Prenger, Bryan Catanzaro:
Flowtron: an Autoregressive Flow-based Generative Network for Text-to-Speech Synthesis. CoRR abs/2005.05957 (2020) - [i34]Alex Boyd, Raul Puri, Mohammad Shoeybi, Mostofa Patwary, Bryan Catanzaro:
Large Scale Multi-Actor Generative Dialog Modeling. CoRR abs/2005.06114 (2020) - [i33]Andrew Tao, Karan Sapra, Bryan Catanzaro:
Hierarchical Multi-Scale Attention for Semantic Segmentation. CoRR abs/2005.10821 (2020) - [i32]Guilin Liu, Rohan Taori, Ting-Chun Wang, Zhiding Yu, Shiqiu Liu, Fitsum A. Reda, Karan Sapra, Andrew Tao, Bryan Catanzaro:
Transposer: Universal Texture Synthesis Using Feature Maps as Transposed Convolution Filter. CoRR abs/2007.07243 (2020) - [i31]Zhifeng Kong, Wei Ping, Jiaji Huang, Kexin Zhao, Bryan Catanzaro:
DiffWave: A Versatile Diffusion Model for Audio Synthesis. CoRR abs/2009.09761 (2020) - [i30]Peng Xu, Mostofa Patwary, Mohammad Shoeybi, Raul Puri, Pascale Fung, Anima Anandkumar, Bryan Catanzaro:
MEGATRON-CNTRL: Controllable Story Generation with External Knowledge Using Large-Scale Language Models. CoRR abs/2010.00840 (2020) - [i29]Sashank Santhanam, Wei Ping, Raul Puri, Mohammad Shoeybi, Mostofa Patwary, Bryan Catanzaro:
Local Knowledge Powered Conversational Agents. CoRR abs/2010.10150 (2020)
2010 – 2019
- 2019
- [c27]Yi Zhu, Karan Sapra, Fitsum A. Reda, Kevin J. Shih, Shawn D. Newsam, Andrew Tao, Bryan Catanzaro:
Improving Semantic Segmentation via Video Propagation and Label Relaxation. CVPR 2019: 8856-8865 - [c26]Ji Zhang, Kevin J. Shih, Ahmed Elgammal, Andrew Tao, Bryan Catanzaro:
Graphical Contrastive Losses for Scene Graph Parsing. CVPR 2019: 11535-11543 - [c25]Ryan Prenger, Rafael Valle, Bryan Catanzaro:
Waveglow: A Flow-based Generative Network for Speech Synthesis. ICASSP 2019: 3617-3621 - [c24]Fitsum A. Reda, Deqing Sun, Aysegul Dundar, Mohammad Shoeybi, Guilin Liu, Kevin J. Shih, Andrew Tao, Jan Kautz, Bryan Catanzaro:
Unsupervised Video Interpolation Using Cycle Consistency. ICCV 2019: 892-900 - [c23]Ting-Chun Wang, Ming-Yu Liu, Andrew Tao, Guilin Liu, Bryan Catanzaro, Jan Kautz:
Few-shot Video-to-Video Synthesis. NeurIPS 2019: 5014-5025 - [c22]Robert Kirby, Saad Godil, Rajarshi Roy, Bryan Catanzaro:
CongestionNet: Routing Congestion Prediction Using Deep Graph Neural Networks. VLSI-SoC 2019: 217-222 - [i28]Ji Zhang, Kevin J. Shih, Ahmed Elgammal, Andrew Tao, Bryan Catanzaro:
Graphical Contrastive Losses for Scene Graph Generation. CoRR abs/1903.02728 (2019) - [i27]Alexander Ratner, Dan Alistarh, Gustavo Alonso, David G. Andersen, Peter Bailis, Sarah Bird, Nicholas Carlini, Bryan Catanzaro, Eric S. Chung, Bill Dally, Jeff Dean, Inderjit S. Dhillon, Alexandros G. Dimakis, Pradeep Dubey, Charles Elkan, Grigori Fursin, Gregory R. Ganger, Lise Getoor, Phillip B. Gibbons, Garth A. Gibson, Joseph E. Gonzalez, Justin Gottschlich, Song Han, Kim M. Hazelwood, Furong Huang, Martin Jaggi, Kevin G. Jamieson, Michael I. Jordan, Gauri Joshi, Rania Khalaf, Jason Knight, Jakub Konecný, Tim Kraska, Arun Kumar, Anastasios Kyrillidis, Jing Li, Samuel Madden, H. Brendan McMahan, Erik Meijer, Ioannis Mitliagkas, Rajat Monga, Derek Gordon Murray, Dimitris S. Papailiopoulos, Gennady Pekhimenko, Theodoros Rekatsinas, Afshin Rostamizadeh, Christopher Ré, Christopher De Sa, Hanie Sedghi, Siddhartha Sen, Virginia Smith, Alex Smola, Dawn Song, Evan Randall Sparks, Ion Stoica, Vivienne Sze, Madeleine Udell, Joaquin Vanschoren, Shivaram Venkataraman, Rashmi Vinayak, Markus Weimer, Andrew Gordon Wilson, Eric P. Xing, Matei Zaharia, Ce Zhang, Ameet Talwalkar:
SysML: The New Frontier of Machine Learning Systems. CoRR abs/1904.03257 (2019) - [i26]Fitsum A. Reda, Deqing Sun, Aysegul Dundar, Mohammad Shoeybi, Guilin Liu, Kevin J. Shih, Andrew Tao, Jan Kautz, Bryan Catanzaro:
Unsupervised Video Interpolation Using Cycle Consistency. CoRR abs/1906.05928 (2019) - [i25]Kevin J. Shih, Aysegul Dundar, Animesh Garg, Robert Pottorf, Andrew Tao, Bryan Catanzaro:
Video Interpolation and Prediction with Unsupervised Landmarks. CoRR abs/1909.02749 (2019) - [i24]Mohammad Shoeybi, Mostofa Patwary, Raul Puri, Patrick LeGresley, Jared Casper, Bryan Catanzaro:
Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism. CoRR abs/1909.08053 (2019) - [i23]Vitaly Kurin, Saad Godil, Shimon Whiteson, Bryan Catanzaro:
Improving SAT Solver Heuristics with Graph Networks and Reinforcement Learning. CoRR abs/1909.11830 (2019) - [i22]Rafael Valle, Jason Li, Ryan Prenger, Bryan Catanzaro:
Mellotron: Multispeaker expressive voice synthesis by conditioning on rhythm, pitch and global style tokens. CoRR abs/1910.11997 (2019) - [i21]Ting-Chun Wang, Ming-Yu Liu, Andrew Tao, Guilin Liu, Jan Kautz, Bryan Catanzaro:
Few-shot Video-to-Video Synthesis. CoRR abs/1910.12713 (2019) - [i20]Raul Puri, Bryan Catanzaro:
Zero-shot Text Classification With Generative Language Models. CoRR abs/1912.10165 (2019) - [i19]Rafael Valle, Fitsum A. Reda, Mohammad Shoeybi, Patrick LeGresley, Andrew Tao, Bryan Catanzaro:
Neural ODEs for Image Segmentation with Level Sets. CoRR abs/1912.11683 (2019)
- 2018
- [c21]Edward Raff, Jon Barker, Jared Sylvester, Robert Brandon, Bryan Catanzaro, Charles K. Nicholas:
Malware Detection by Eating a Whole EXE. AAAI Workshops 2018: 268-276 - [c20]Ting-Chun Wang, Ming-Yu Liu, Jun-Yan Zhu, Andrew Tao, Jan Kautz, Bryan Catanzaro:
High-Resolution Image Synthesis and Semantic Manipulation With Conditional GANs. CVPR 2018: 8798-8807 - [c19]Guilin Liu, Fitsum A. Reda, Kevin J. Shih, Ting-Chun Wang, Andrew Tao, Bryan Catanzaro:
Image Inpainting for Irregular Holes Using Partial Convolutions. ECCV (11) 2018: 89-105 - [c18]Fitsum A. Reda, Guilin Liu, Kevin J. Shih, Robert Kirby, Jon Barker, David Tarjan, Andrew Tao, Bryan Catanzaro:
SDC-Net: Video Prediction Using Spatially-Displaced Convolution. ECCV (7) 2018: 747-763 - [c17]Ting-Chun Wang, Ming-Yu Liu, Jun-Yan Zhu, Nikolai Yakovenko, Andrew Tao, Jan Kautz, Bryan Catanzaro:
Video-to-Video Synthesis. NeurIPS 2018: 1152-1164 - [c16]Raul Puri, Robert Kirby, Nikolai Yakovenko, Bryan Catanzaro:
Large Scale Language Modeling: Converging on 40GB of Text in Four Hours. SBAC-PAD 2018: 290-297 - [i18]Guilin Liu, Fitsum A. Reda, Kevin J. Shih, Ting-Chun Wang, Andrew Tao, Bryan Catanzaro:
Image Inpainting for Irregular Holes Using Partial Convolutions. CoRR abs/1804.07723 (2018) - [i17]Raul Puri, Robert Kirby, Nikolai Yakovenko, Bryan Catanzaro:
Large Scale Language Modeling: Converging on 40GB of Text in Four Hours. CoRR abs/1808.01371 (2018) - [i16]Ting-Chun Wang, Ming-Yu Liu, Jun-Yan Zhu, Guilin Liu, Andrew Tao, Jan Kautz, Bryan Catanzaro:
Video-to-Video Synthesis. CoRR abs/1808.06601 (2018) - [i15]Ryan Prenger, Rafael Valle, Bryan Catanzaro:
WaveGlow: A Flow-based Generative Network for Speech Synthesis. CoRR abs/1811.00002 (2018) - [i14]Ji Zhang, Kevin J. Shih, Andrew Tao, Bryan Catanzaro, Ahmed Elgammal:
Introduction to the 1st Place Winning Model of OpenImages Relationship Detection Challenge. CoRR abs/1811.00662 (2018) - [i13]Fitsum A. Reda, Guilin Liu, Kevin J. Shih, Robert Kirby, Jon Barker, David Tarjan, Andrew Tao, Bryan Catanzaro:
SDCNet: Video Prediction Using Spatially-Displaced Convolution. CoRR abs/1811.00684 (2018) - [i12]Ji Zhang, Kevin J. Shih, Andrew Tao, Bryan Catanzaro, Ahmed Elgammal:
An Interpretable Model for Scene Graph Generation. CoRR abs/1811.09543 (2018) - [i11]Guilin Liu, Kevin J. Shih, Ting-Chun Wang, Fitsum A. Reda, Karan Sapra, Zhiding Yu, Andrew Tao, Bryan Catanzaro:
Partial Convolution based Padding. CoRR abs/1811.11718 (2018) - [i10]Neel Kant, Raul Puri, Nikolai Yakovenko, Bryan Catanzaro:
Practical Text Classification With Large Pre-Trained Language Models. CoRR abs/1812.01207 (2018) - [i9]Yi Zhu, Karan Sapra, Fitsum A. Reda, Kevin J. Shih, Shawn D. Newsam, Andrew Tao, Bryan Catanzaro:
Improving Semantic Segmentation via Video Propagation and Label Relaxation. CoRR abs/1812.01593 (2018) - 2017
- [c15]Song Han, Jeff Pool, Sharan Narang, Huizi Mao, Enhao Gong, Shijian Tang, Erich Elsen, Peter Vajda, Manohar Paluri, John Tran, Bryan Catanzaro, William J. Dally:
DSD: Dense-Sparse-Dense Training for Deep Neural Networks. ICLR (Poster) 2017 - [i8]Edward Raff, Jon Barker, Jared Sylvester, Robert Brandon, Bryan Catanzaro, Charles K. Nicholas:
Malware Detection by Eating a Whole EXE. CoRR abs/1710.09435 (2017) - [i7]Ting-Chun Wang, Ming-Yu Liu, Jun-Yan Zhu, Andrew Tao, Jan Kautz, Bryan Catanzaro:
High-Resolution Image Synthesis and Semantic Manipulation with Conditional GANs. CoRR abs/1711.11585 (2017) - 2016
- [c14]Dario Amodei, Sundaram Ananthanarayanan, Rishita Anubhai, Jingliang Bai, Eric Battenberg, Carl Case, Jared Casper, Bryan Catanzaro, Jingdong Chen, Mike Chrzanowski, Adam Coates, Greg Diamos, Erich Elsen, Jesse H. Engel, Linxi Fan, Christopher Fougner, Awni Y. Hannun, Billy Jun, Tony Han, Patrick LeGresley, Xiangang Li, Libby Lin, Sharan Narang, Andrew Y. Ng, Sherjil Ozair, Ryan Prenger, Sheng Qian, Jonathan Raiman, Sanjeev Satheesh, David Seetapun, Shubho Sengupta, Chong Wang, Yi Wang, Zhiqian Wang, Bo Xiao, Yan Xie, Dani Yogatama, Jun Zhan, Zhenyao Zhu:
Deep Speech 2 : End-to-End Speech Recognition in English and Mandarin. ICML 2016: 173-182 - [c13]Greg Diamos, Shubho Sengupta, Bryan Catanzaro, Mike Chrzanowski, Adam Coates, Erich Elsen, Jesse H. Engel, Awni Y. Hannun, Sanjeev Satheesh:
Persistent RNNs: Stashing Recurrent Weights On-Chip. ICML 2016: 2024-2033 - [i6]Song Han, Jeff Pool, Sharan Narang, Huizi Mao, Shijian Tang, Erich Elsen, Bryan Catanzaro, John Tran, William J. Dally:
DSD: Regularizing Deep Neural Networks with Dense-Sparse-Dense Training Flow. CoRR abs/1607.04381 (2016) - 2015
- [c12]Saurav Muralidharan, Michael Garland, Bryan Catanzaro, Albert Sidelnik, Mary W. Hall:
A collection-oriented programming model for performance portability. PPoPP 2015: 263-264 - [i5]Dario Amodei, Rishita Anubhai, Eric Battenberg, Carl Case, Jared Casper, Bryan Catanzaro, Jingdong Chen, Mike Chrzanowski, Adam Coates, Greg Diamos, Erich Elsen, Jesse H. Engel, Linxi Fan, Christopher Fougner, Tony Han, Awni Y. Hannun, Billy Jun, Patrick LeGresley, Libby Lin, Sharan Narang, Andrew Y. Ng, Sherjil Ozair, Ryan Prenger, Jonathan Raiman, Sanjeev Satheesh, David Seetapun, Shubho Sengupta, Yi Wang, Zhiqian Wang, Chong Wang, Bo Xiao, Dani Yogatama, Jun Zhan, Zhenyao Zhu:
Deep Speech 2: End-to-End Speech Recognition in English and Mandarin. CoRR abs/1512.02595 (2015) - 2014
- [c11]Saurav Muralidharan, Manu Shantharam, Mary W. Hall, Michael Garland, Bryan Catanzaro:
Nitro: A Framework for Adaptive Code Variant Tuning. IPDPS 2014: 501-512 - [c10]Bryan Catanzaro, Alexander Keller, Michael Garland:
A decomposition for in-place matrix transposition. PPoPP 2014: 193-206 - [i4]Sharan Chetlur, Cliff Woolley, Philippe Vandermersch, Jonathan Cohen, John Tran, Bryan Catanzaro, Evan Shelhamer:
cuDNN: Efficient Primitives for Deep Learning. CoRR abs/1410.0759 (2014) - [i3]Awni Y. Hannun, Carl Case, Jared Casper, Bryan Catanzaro, Greg Diamos, Erich Elsen, Ryan Prenger, Sanjeev Satheesh, Shubho Sengupta, Adam Coates, Andrew Y. Ng:
Deep Speech: Scaling up end-to-end speech recognition. CoRR abs/1412.5567 (2014) - 2013
- [c9]Adam Coates, Brody Huval, Tao Wang, David J. Wu, Bryan Catanzaro, Andrew Y. Ng:
Deep learning with COTS HPC systems. ICML (3) 2013: 1337-1345 - [i2]Andreas Klöckner, Nicolas Pinto, Bryan Catanzaro, Yunsup Lee, Paul Ivanov, Ahmed Fasih:
GPU Scripting and Code Generation with PyCUDA. CoRR abs/1304.5553 (2013) - 2012
- [j3]Andreas Klöckner, Nicolas Pinto, Yunsup Lee, Bryan Catanzaro, Paul Ivanov, Ahmed Fasih:
PyCUDA and PyOpenCL: A scripting-based approach to GPU run-time code generation. Parallel Comput. 38(3): 157-174 (2012) - 2011
- [b1]Bryan Christopher Catanzaro:
Compilation Techniques for Embedded Data Parallel Languages. University of California, Berkeley, USA, 2011 - [c8]Michael J. Anderson, Bryan Catanzaro, Jike Chong, Ekaterina Gonina, Kurt Keutzer, Chao-Yue Lai, Mark Murphy, David Sheffield, Bor-Yiing Su, Narayanan Sundaram:
Considerations When Evaluating Microprocessor Platforms. HotPar 2011 - [c7]Bryan Catanzaro, Michael Garland, Kurt Keutzer:
Copperhead: compiling an embedded data parallel language. PPoPP 2011: 47-56 - [p1]Michael J. Anderson, Bryan Catanzaro, Jike Chong, Ekaterina Gonina, Kurt Keutzer, Chao-Yue Lai, Mark Murphy, Bor-Yiing Su, Narayanan Sundaram:
PALLAS: Mapping Applications onto Manycore. Multiprocessor System-on-Chip 2011: 89-113 - 2010
- [j2]Bryan Catanzaro, Kurt Keutzer:
Parallel computing with patterns and frameworks. XRDS 17(1): 22-27 (2010) - [j1]Bryan Catanzaro, Armando Fox, Kurt Keutzer, David A. Patterson, Bor-Yiing Su, Marc Snir, Kunle Olukotun, Pat Hanrahan, Hassan Chafi:
Ubiquitous Parallel Computing from Berkeley, Illinois, and Stanford. IEEE Micro 30(2): 41-55 (2010)
2000 – 2009
- 2009
- [c6]Bryan Catanzaro, Bor-Yiing Su, Narayanan Sundaram, Yunsup Lee, Mark Murphy, Kurt Keutzer:
Efficient, high-quality image contour detection. ICCV 2009: 2381-2388 - [i1]Andreas Klöckner, Nicolas Pinto, Yunsup Lee, Bryan Catanzaro, Paul Ivanov, Ahmed Fasih:
PyCUDA: GPU Run-Time Code Generation for High-Performance Computing. CoRR abs/0911.3456 (2009) - 2008
- [c5]Bryan Catanzaro, Kurt Keutzer, Bor-Yiing Su:
Parallelizing CAD: a timely research agenda for EDA. DAC 2008: 12-17 - [c4]Bryan Catanzaro, Narayanan Sundaram, Kurt Keutzer:
Fast support vector machine training and classification on graphics processors. ICML 2008: 104-111 - 2007
- [c3]Jike Chong, Nadathur Satish, Bryan Catanzaro, Kaushik Ravindran, Kurt Keutzer:
Efficient Parallelization of H.264 Decoding with Macro Block Level Scheduling. ICME 2007: 1874-1877 - 2005
- [c2]Bryan Catanzaro, Brent E. Nelson:
Higher Radix Floating-Point Representations for FPGA-Based Arithmetic. FCCM 2005: 161-170 - [c1]Bryan Catanzaro, Brent E. Nelson:
Choice of base revisited: higher radices for FPGA-based floating-point computation (abstract only). FPGA 2005: 279
last updated on 2024-12-02 22:33 CET by the dblp team
all metadata released as open data under CC0 1.0 license