


default search action
ASPLOS 2025: Rotterdam, The Netherlands
- Lieven Eeckhout, Georgios Smaragdakis, Kaitai Liang, Adrian Sampson, Martha A. Kim, Christopher J. Rossbach:
Proceedings of the 30th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 1, ASPLOS 2025, Rotterdam, The Netherlands, 30 March 2025 - 3 April 2025. ACM 2025, ISBN 979-8-4007-0698-1 - Zhuoran Ji, Jianyu Zhao, Peimin Gao, Xiangkai Yin, Lei Ju:
Accelerating Number Theoretic Transform with Multi-GPU Systems for Efficient Zero Knowledge Proof. 1-14 - Derrick Quinn, Mohammad Nouri, Neel Patel, John Salihu, Alireza Salemi, Sukhan Lee, Hamed Zamani, Mohammad Alian:
Accelerating Retrieval-Augmented Generation. 15-32 - Wonkyo Choe, Rongxiang Wang, Felix Xiaozhu Lin:
AnA: An Attentive Autonomous Driving System. 33-46 - Chanyoung Park, Jungho Lee, Chun-Yi Liu, Kyungtae Kang, Mahmut Taylan Kandemir, Wonil Choi:
AnyKey: A Key-Value SSD for All Workload Types. 47-63 - Sankeerth Durvasula, Adrian Zhao, Fan Chen, Ruofan Liang, Pawan Kumar Sanjaya, Yushi Guan, Christina Giannoula, Nandita Vijaykumar:
ARC: Warp-level Adaptive Atomic Reduction in GPUs to Accelerate Differentiable Rendering. 64-83 - Rohan Yadav, Michael Bauer, David Broman, Michael Garland, Alex Aiken, Fredrik Kjolstad:
Automatic Tracing in Task-Based Runtime Systems. 84-99 - Tao Lu, Yuxun Chen, Zonghui Wang, Xiaohang Wang, Wenzhi Chen, Jiaheng Zhang:
BatchZK: A Fully Pipelined GPU-Accelerated System for Batch Generation of Zero-Knowledge Proofs. 100-115 - Shaobo Li, Yirui Eric Zhou, Hao Ren, Jian Huang:
ByteFS: System Support for (CXL-based) Memory-Semantic Solid-State Drives. 116-132 - Siddharth Jayashankar, Edward Chen, Tom Tang, Wenting Zheng, Dimitrios Skarlatos:
Cinnamon: A Framework for Scale-Out Encrypted AI. 133-150 - Rishi Ranjan, Ian Paterson, Matthew Hicks:
ClosureX: Compiler Support for Correct Persistent Fuzzing. 151-163 - Benjamin Reidys, Pantea Zardoshti, Íñigo Goiri, Celine Irvene, Daniel S. Berger, Haoran Ma, Kapil Arya, Eli Cortez, Taylor Stark, Eugene Bak, Mehmet Iyigun, Stanko Novakovic, Lisa Hsu, Karel Trueba, Abhisek Pan, Chetan Bansal, Saravan Rajmohan, Jian Huang, Ricardo Bianchini:
Coach: Exploiting Temporal Patterns for All-Resource Oversubscription in Cloud Platforms. 164-181 - Rohan Yadav, Shiv Sundram, Wonchan Lee, Michael Garland, Michael Bauer, Alex Aiken, Fredrik Kjolstad:
Composing Distributed Computations Through Task and Kernel Fusion. 182-197 - Shenggan Cheng, Shengjie Lin, Lansong Diao, Hao Wu, Siyu Wang, Chang Si, Ziming Liu, Xuanlei Zhao, Jiangsu Du, Wei Lin, Yang You:
Concerto: Automatic Communication Optimization and Scheduling for Large-Scale Deep Learning. 198-213 - Kapil Agrawal, Sangeetha Abdu Jyothi:
Cooperative Graceful Degradation in Containerized Clouds. 214-232 - Divyanshu Saxena, William Zhang, Shankara Pailoor, Isil Dillig, Aditya Akella:
Copper and Wire: Bridging Expressiveness and Performance for Service Mesh Policies. 233-248 - Jiahui Xu, Lana Josipovic:
CRUSH: A Credit-Based Approach for Functional Unit Sharing in Dynamically Scheduled HLS. 249-263 - Rohan Basu Roy, Vijay Gadepally, Devesh Tiwari:
DarwinGame: Playing Tournaments for Tuning Applications in Noisy Cloud Environments. 264-279 - Yibiao Yang, Maolin Sun, Jiangchang Wu, Qingyang Li, Yuming Zhou:
Debugger Toolchain Validation via Cross-Level Debugging. 280-294 - Kaiqiang Xu, Decang Sun, Hao Wang, Zhenghang Ren, Xinchen Wan, Xudong Liao, Zilong Wang, Junxue Zhang, Kai Chen:
Design and Operation of Shared Machine Learning Clusters on Campus. 295-310 - Cunchi Lv, Xiao Shi, Zhengyu Lei, Jinyue Huang, Wenting Tan, Xiaohui Zheng, Xiaofang Zhao:
Dilu: Enabling GPU Resourcing-on-Demand for Serverless DL Serving via Introspective Elasticity. 311-325 - Yuanpei Wu, Dong Du, Chao Xu, Yubin Xia, Ming Fu, Binyu Zang, Haibo Chen:
D-VSync: Decoupled Rendering and Displaying for Smartphone Graphics. 326-341 - Pu (Luke) Yi, Yifan Yang, Chae Young Lee, Sara Achour:
Early Termination for Hyperdimensional Computing Using Inferential Statistics. 342-360 - Kuntai Du, Yihua Cheng, Peder A. Olsen, Shadi A. Noghabi, Junchen Jiang:
Earth+: On-Board Satellite Imagery Compression Leveraging Historical Earth Observations. 361-376 - Weigao Su, Vishal Shrivastav:
EDM: An Ultra-Low Latency Ethernet Fabric for Memory Disaggregation. 377-394 - Noushin Azami, Alex Fallin, Martin Burtscher:
Efficient Lossless Compression of Scientific Floating-Point Data on CPUs and GPUs. 395-409 - Zhaoying Li, Pranav Dangi, Chenyang Yin, Thilini Kaushalya Bandara, Rohan Juneja, Cheng Tan, Zhenyu Bai, Tulika Mitra:
Enhancing CGRA Efficiency Through Aligned Compute and Communication Provisioning. 410-425 - Yuka Ikarashi, Kevin Qian, Samir Droubi, Alex Reinking, Gilbert Louis Bernstein, Jonathan Ragan-Kelley:
Exo 2: Growing a Scheduling Language. 426-444 - Daliang Xu, Hao Zhang, Liming Yang, Ruiqi Liu, Gang Huang, Mengwei Xu, Xuanzhe Liu:
Fast On-device LLM Inference with NPUs. 445-462 - Xuran Cai, Amir Kafshdar Goharshady, S. Hitarth, Chun Kit Lam:
Faster Chaitin-like Register Allocation via Grammatical Decompositions of Control-Flow Graphs. 463-477 - Jinghan Sun, Benjamin Reidys, Daixuan Li, Jichuan Chang, Marc Snir, Jian Huang:
FleetIO: Managing Multi-Tenant Cloud Storage with Multi-Agent Reinforcement Learning. 478-492 - Seonho Lee, Amar Phanishayee, Divya Mahajan:
Forecasting GPU Performance for Deep Learning Training and Inference. 493-508 - Minhui Xie, Shaoxun Zeng, Hao Guo, Shiwei Gao, Youyou Lu:
Frugal: Efficient and Economic Embedding Model Training with Commodity GPUs. 509-523 - Xinglin Pan, Wenxiang Lin, Lin Zhang, Shaohuai Shi, Zhenheng Tang, Rui Wang, Bo Li, Xiaowen Chu:
FSMoE: A Flexible and Scalable Training System for Sparse Mixture-of-Experts Models. 524-539 - Jianan Lu, Ashwini Raina, Asaf Cidon, Michael J. Freedman:
Fusion: An Analytics Object Store Optimized for Query Pushdown. 540-556 - Byungsoo Jeon, Mengdi Wu, Shiyi Cao, Sunghyun Kim, Sunghyun Park, Neeraj Aggarwal, Colin Unger, Daiyaan Arfeen, Peiyuan Liao, Xupeng Miao, Mohammad Alizadeh, Gregory R. Ganger, Tianqi Chen, Zhihao Jia:
GraphPipe: Improving Performance and Scalability of DNN Training with Graph Pipeline Parallelism. 557-571 - Seonyoung Cheon, Yongwoo Lee, Hoyun Youm, Dongkwan Kim, Sungwoo Yun, Kunmo Jeong, Dongyoon Lee, Hanjun Kim:
HALO: Loop-aware Bootstrapping Management for Fully Homomorphic Encryption. 572-585 - Yixuan Mei, Yonghao Zhuang, Xupeng Miao, Juncheng Yang, Zhihao Jia, Rashmi Vinayak:
Helix: Serving Large Language Models over Heterogeneous GPUs and Network via Max-Flow. 586-602 - Sushant Dinesh, Yongye Zhu, Christopher W. Fletcher:
H-Houdini: Scalable Invariant Learning. 603-618 - Dimitrios Chasapis, Georgios Vavouliotis, Daniel A. Jiménez, Marc Casas:
Instruction-Aware Cooperative TLB and Cache Replacement Policies. 619-636 - Seungmin Baek, Minbok Wi, Seonyong Park, Hwayong Nam, Michael Jaemin Kim, Nam Sung Kim, Jung Ho Ahn:
Marionette: A RowHammer Attack via Row Coupling. 637-652 - Shaoxun Zeng, Minhui Xie, Shiwei Gao, Youmin Chen, Youyou Lu:
Medusa: Accelerating Serverless LLM Inference with Materialization. 653-668 - Weikai Lin, Yu Feng, Yuhao Zhu:
MetaSapiens: Real-Time Neural Rendering with Efficiency-Aware Pruning and Accelerated Foveated Rendering. 669-682 - Haiyu Huang, Cheng Chen, Kunyi Chen, Pengfei Chen, Guangba Yu, Zilong He, Yilun Wang, Huxing Zhang, Qi Zhou:
Mint: Cost-Efficient Tracing with All Requests Collection via Commonality and Variability Analysis. 683-697 - Moinuddin Qureshi, Salman Qazi:
MOAT: Securely Mitigating Rowhammer with Per-Row Activation Counters. 698-714 - Shiyi Cao, Shu Liu, Tyler Griggs, Peter Schafhalter, Xiaoxuan Liu, Ying Sheng, Joseph E. Gonzalez, Matei Zaharia, Ion Stoica:
MoE-Lightning: High-Throughput MoE Inference on Memory-constrained GPUs. 715-730 - Shuaiting Li, Chengxuan Wang, Juncan Deng, Zeyu Wang, Zewen Ye, Zongsheng Wang, Haibin Shen, Kejie Huang:
MVQ: Towards Efficient DNN Compression and Acceleration with Masked Vector Quantization. 731-745 - Wei Hao, Zixi Wang, Lauren Hong, Lingxiao Li, Nader Karayanni, AnMei Dasbach-Prisk, Chengzhi Mao, Junfeng Yang, Asaf Cidon:
Nazar: Monitoring and Adapting ML Models on Mobile Devices. 746-761 - Yihao Sun, Ahmedur Rahman Shovon, Thomas Gilray, Sidharth Kumar, Kristopher K. Micinski:
Optimizing Datalog for the GPU. 762-776 - Amanda Xu, Abtin Molavi, Swamit Tannu, Aws Albarghouthi:
Optimizing Quantum Circuits, Fast and Slow. 777-793 - Sami Alabed, Daniel Belov, Bart Chrzaszcz, Juliana Franco, Dominik Grewe, Dougal Maclaurin, James Molloy, Tom Natan, Tamara Norman, Xiaoyue Pan, Adam Paszke, Norman A. Rink, Michael Schaarschmidt, Timur Sitdikov, Agnieszka Swietlik, Dimitrios Vytiniotis, Joel Wee:
PartIR: Composing SPMD Partitioning Strategies for Machine Learning. 794-810 - Foteini Strati, Michal Friedman, Ana Klimovic:
PCcheck: Persistent Concurrent Checkpointing for ML. 811-827 - Shaofeng Wu, Qiang Su, Zhixiong Niu, Hong Xu:
Performance Prediction of On-NIC Network Functions with Multi-Resource Contention and Traffic Awareness. 828-842 - Yifan Tan, Cheng Tan, Zeyu Mi, Haibo Chen:
PipeLLM: Fast and Confidential Large Language Model Services with Speculative Pipelined Encryption. 843-857 - Yupeng Tang, Seung-Seob Lee, Abhishek Bhattacharjee, Anurag Khandelwal:
pulse: Accelerating Distributed Pointer-Traversals on Disaggregated Memory. 858-875 - Keyi Yin, Hezi Zhang, Xiang Fang, Yunong Shi, Travis S. Humble, Ang Li, Yufei Ding:
QECC-Synth: A Layout Synthesizer for Quantum Error Correction Codes on Sparse Architectures. 876-890 - Anagha Molakalmur Anil Kumar, Aditya Prasanna, Arrvindh Shriraman:
RANGE-BLOCKS: A Synchronization Facility for Domain-Specific Architectures. 891-906 - Anirudh Jain, Pulkit Gupta, Thomas M. Conte:
RASSM: Residue-based Acceleration of Single Sparse Matrix Computation via Adaptive Tiling. 907-923 - Yan Liu, Jianxin Lai, Long Li, Tianxiang Sui, Linjie Xiao, Peng Yuan, Xiaojing Zhang, Qing Zhu, Wenguang Chen, Jingling Xue:
ReSBM: Region-based Scale and Minimal-Level Bootstrapping Management for FHE via Min-Cut. 924-939 - Stephen M. Blackburn, Zixian Cai, Rui Chen, Xi Yang, John Zhang, John N. Zigman:
Rethinking Java Performance Analysis. 940-954 - Zhilei Han, Fei He:
Robustness Verification for Checking Crash Consistency of Non-volatile Memory. 955-969 - Qinhan Tan, Yuheng Yang, Thomas Bourgeat, Sharad Malik, Mengjia Yan:
RTL Verification for Secure Speculation Using Contract Shadow Logic. 970-986 - Shravan Narayan, Tal Garfinkel, Evan Johnson, Zachary Yedidia, Yingchen Wang, Andrew Brown, Anjo Vahldiek-Oberwagner, Michael LeMay, Wenyong Huang, Xin Wang, Mingqiu Sun, Dean M. Tullsen, Deian Stefan:
Segue & ColorGuard: Optimizing SFI Performance and Scalability on Modern Architectures. 987-1002 - Huan Zhao, Dylan Wolff, Umang Mathur, Abhik Roychoudhury:
Selectively Uniform Concurrency Testing. 1003-1019 - Yaohui Cai, Kaixin Yang, Chenhui Deng, Cunxi Yu, Zhiru Zhang:
SmoothE: Differentiable E-Graph Extraction. 1020-1034 - Seah Kim, Roger Hsiao, Borivoje Nikolic, James Demmel, Yakun Sophia Shao:
SuperNoVA: Algorithm-Hardware Co-Design for Resource-Aware SLAM. 1035-1051 - Wei Zhao, Anand Jayarajan, Gennady Pekhimenko:
Tally: Non-Intrusive Performance Isolation for Concurrent Deep Learning Workloads. 1052-1068 - Brett Saiki, Jackson Brough, Jonas Regehr, Jesús Ponce, Varun Pradeep, Aditya Akhileshwaran, Zachary Tatlock, Pavel Panchekha:
Target-Aware Implementation of Real Expressions. 1069-1083 - Difan Tan, Jiawei Li, Hua Wang, Xiaoxiao Li, Wenbo Liu, Zijin Qin, Ke Zhou, Ming Xie, Mengling Tao:
Tela: A Temporal Load-Aware Cloud Virtual Disk Placement Scheme. 1084-1100 - Cheng Wang, Mingyu Gao:
UniZK: Accelerating Zero-Knowledge Proof with Unified Hardware and Flexible Kernel Mapping. 1101-1117 - Zibo Wang, Yijia Zhang, Fuchun Wei, Bingqiang Wang, Yanlin Liu, Zhiheng Hu, Jingyi Zhang, Xiaoxin Xu, Jian He, Xiaoliang Wang, Wanchun Dou, Guihai Chen, Chen Tian:
Using Analytical Performance/Power Model and Fine-Grained DVFS to Enhance AI Accelerator Energy Efficiency. 1118-1132 - Ramya Prabhu, Ajay Nayak, Jayashree Mohan, Ramachandran Ramjee, Ashish Panwar:
vAttention: Dynamic Memory Management for Serving LLMs without PagedAttention. 1133-1150 - Minwook Kim, Seongyeop Jeong, Jin-Soo Kim:
ZRAID: Leveraging Zone Random Write Area (ZRWA) for Alleviating Partial Parity Tax in ZNS RAID. 1151-1165

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.