Esha Choukse
2020 – today
- 2024
- [c15] Pratyush Patel, Esha Choukse, Chaojie Zhang, Íñigo Goiri, Brijesh Warrier, Nithish Mahalingam, Ricardo Bianchini: Characterizing Power Management Opportunities for LLMs in the Cloud. ASPLOS (3) 2024: 207-222
- [c14] Pratyush Patel, Esha Choukse, Chaojie Zhang, Aashaka Shah, Íñigo Goiri, Saeed Maleki, Ricardo Bianchini: Splitwise: Efficient Generative LLM Inference Using Phase Splitting. ISCA 2024: 118-132
- [c13] Jovan Stojkovic, Pulkit A. Misra, Íñigo Goiri, Sam Whitlock, Esha Choukse, Mayukh Das, Chetan Bansal, Jason Lee, Zoey Sun, Haoran Qiu, Reed Zimmermann, Savyasachi Samal, Brijesh Warrier, Ashish Raniwala, Ricardo Bianchini: SmartOClock: Workload- and Risk-Aware Overclocking in the Cloud. ISCA 2024: 437-451
- [c12] Jaylen Wang, Daniel S. Berger, Fiodar Kazhamiaka, Celine Irvene, Chaojie Zhang, Esha Choukse, Kali Frost, Rodrigo Fonseca, Brijesh Warrier, Chetan Bansal, Jonathan Stern, Ricardo Bianchini, Akshitha Sriraman: Designing Cloud Servers for Lower Carbon. ISCA 2024: 452-470
- [c11] Gagandeep Panwar, Muhammad Laghari, Esha Choukse, Xun Jian: DyLeCT: Achieving Huge-page-like Translation Performance for Hardware-compressed Memory. ISCA 2024: 1129-1143
- [c10] Muhammad Laghari, Yuqing Liu, Gagandeep Panwar, David Bears, Chandler Jearls, Raghavendra Srinivas, Esha Choukse, Kirk W. Cameron, Ali Raza Butt, Xun Jian: Memory Allocation Under Hardware Compression. MICRO 2024: 966-982
- [c9] Jovan Stojkovic, Esha Choukse, Enrique Saurez, Íñigo Goiri, Josep Torrellas: Mosaic: Harnessing the Micro-Architectural Resources of Servers in Serverless Environments. MICRO 2024: 1397-1412
- [c8] Joshua Fried, Gohar Irfan Chaudhry, Enrique Saurez, Esha Choukse, Íñigo Goiri, Sameh Elnikety, Rodrigo Fonseca, Adam Belay: Making Kernel Bypass Practical for the Cloud with Junction. NSDI 2024: 55-73
- [c7] Theo Gregersen, Pratyush Patel, Esha Choukse: Input-Dependent Power Usage in GPUs. SC Workshops 2024: 1872-1877
- [i11] Enrique Saurez, Joshua Fried, Gohar Irfan Chaudhry, Esha Choukse, Íñigo Goiri, Sameh Elnikety, Adam Belay, Rodrigo Fonseca: Junctiond: Extending FaaS Runtimes with Kernel-Bypass. CoRR abs/2403.03377 (2024)
- [i10] Jovan Stojkovic, Esha Choukse, Chaojie Zhang, Íñigo Goiri, Josep Torrellas: Towards Greener LLMs: Bringing Energy-Efficiency to the Forefront of LLM Inference. CoRR abs/2403.20306 (2024)
- [i9] Jovan Stojkovic, Chaojie Zhang, Íñigo Goiri, Josep Torrellas, Esha Choukse: DynamoLLM: Designing LLM Inference Clusters for Performance and Energy Efficiency. CoRR abs/2408.00741 (2024)
- [i8] Kunal Jain, Anjaly Parayil, Ankur Mallick, Esha Choukse, Xiaoting Qin, Jue Zhang, Íñigo Goiri, Rujia Wang, Chetan Bansal, Victor Rühle, Anoop Kulkarni, Steve Kofsky, Saravan Rajmohan: Intelligent Router for LLM Workloads: Improving Performance Through Workload-Aware Scheduling. CoRR abs/2408.13510 (2024)
- [i7] Amey Agrawal, Junda Chen, Íñigo Goiri, Ramachandran Ramjee, Chaojie Zhang, Alexey Tumanov, Esha Choukse: Mnemosyne: Parallelization Strategies for Efficiently Serving Multi-Million Context Length LLM Inference Requests Without Approximations. CoRR abs/2409.17264 (2024)
- [i6] Theo Gregersen, Pratyush Patel, Esha Choukse: Input-Dependent Power Usage in GPUs. CoRR abs/2409.18324 (2024)
- [i5] Yuhan Liu, Esha Choukse, Shan Lu, Junchen Jiang, Madan Musuvathi: DroidSpeak: Enhancing Cross-LLM Communication. CoRR abs/2411.02820 (2024)
- 2023
- [j3] Pratyush Patel, Zibo Gong, Syeda Rizvi, Esha Choukse, Pulkit A. Misra, Thomas E. Anderson, Akshitha Sriraman: Towards Improved Power Management in Cloud GPUs. IEEE Comput. Archit. Lett. 22(2): 141-144 (2023)
- [c6] Jialun Lyu, Jaylen Wang, Kali Frost, Chaojie Zhang, Celine Irvene, Esha Choukse, Rodrigo Fonseca, Ricardo Bianchini, Fiodar Kazhamiaka, Daniel S. Berger: Myths and Misconceptions Around Reducing Carbon Embedded in Cloud Platforms. HotCarbon 2023: 7:1-7:7
- [i4] Pratyush Patel, Esha Choukse, Chaojie Zhang, Íñigo Goiri, Brijesh Warrier, Nithish Mahalingam, Ricardo Bianchini: POLCA: Power Oversubscription in LLM Cloud Providers. CoRR abs/2308.12908 (2023)
- [i3] Pratyush Patel, Esha Choukse, Chaojie Zhang, Íñigo Goiri, Aashaka Shah, Saeed Maleki, Ricardo Bianchini: Splitwise: Efficient generative LLM inference using phase splitting. CoRR abs/2311.18677 (2023)
- 2022
- [j2] Pulkit A. Misra, Ioannis Manousakis, Esha Choukse, Majid Jalili, Íñigo Goiri, Ashish Raniwala, Brijesh Warrier, Husam Alissa, Bharath Ramakrishnan, Phillip Tuma, Christian Belady, Marcus Fontoura, Ricardo Bianchini: Overclocking in Immersion-Cooled Datacenters. IEEE Micro 42(4): 10-17 (2022)
- [c5] Gagandeep Panwar, Muhammad Laghari, David Bears, Yuqing Liu, Chandler Jearls, Esha Choukse, Kirk W. Cameron, Ali Raza Butt, Xun Jian: Translation-optimized Memory Compression for Capacity. MICRO 2022: 992-1011
- 2020
- [c4] Esha Choukse, Michael B. Sullivan, Mike O'Connor, Mattan Erez, Jeff Pool, David W. Nellans, Stephen W. Keckler: Buddy Compression: Enabling Larger Memory for Deep Learning and HPC Workloads on GPUs. ISCA 2020: 926-939
2010 – 2019
- 2019
- [c3] Sangkug Lym, Esha Choukse, Siavash Zangeneh, Wei Wen, Sujay Sanghavi, Mattan Erez: PruneTrain: fast neural network training by dynamic sparse model reconfiguration. SC 2019: 36:1-36:13
- [i2] Sangkug Lym, Esha Choukse, Siavash Zangeneh, Wei Wen, Mattan Erez, Sujay Sanghavi: PruneTrain: Gradual Structured Pruning from Scratch for Faster Neural Network Training. CoRR abs/1901.09290 (2019)
- [i1] Esha Choukse, Michael B. Sullivan, Mike O'Connor, Mattan Erez, Jeff Pool, David W. Nellans, Stephen W. Keckler: Buddy Compression: Enabling Larger Memory for Deep Learning and HPC Workloads on GPUs. CoRR abs/1903.02596 (2019)
- 2018
- [j1] Esha Choukse, Mattan Erez, Alaa R. Alameldeen: CompressPoints: An Evaluation Methodology for Compressed Memory Systems. IEEE Comput. Archit. Lett. 17(2): 126-129 (2018)
- [c2] Esha Choukse, Mattan Erez, Alaa R. Alameldeen: Compresso: Pragmatic Main Memory Compression. MICRO 2018: 546-558
- 2016
- [c1] Jungrae Kim, Michael B. Sullivan, Esha Choukse, Mattan Erez: Bit-Plane Compression: Transforming Data for Better Compression in Many-Core Architectures. ISCA 2016: 329-340
last updated on 2025-01-22 21:28 CET by the dblp team
all metadata released as open data under CC0 1.0 license