


30th HiPC 2023: Goa, India
- 30th IEEE International Conference on High Performance Computing, Data, and Analytics, HiPC 2023, Goa, India, December 18-21, 2023. IEEE 2023, ISBN 979-8-3503-8322-5
- Sunita Sarawagi:
Modern AI for Analyzing Large Structured Databases: Opportunities and Challenges. xxii
- Priyanka Sharma:
High Performance and Energy Efficient Processor for Next Generation Data Centres: FUJITSU - MONAKA. xxiii
- Manish Parashar:
Computing Everywhere, All at Once: Harnessing the Computing Continuum for Science. xxiv
- Vittal Setty:
Addressing Exponential Scale Problems at Infosys. xxv
- Bahareh Khabbazan, Marc Riera, Antonio González:
DNA-TEQ: An Adaptive Exponential Quantization of Tensors for DNN Inference. 1-10
- Gian Singh, Sanmukh R. Kuppannagari, Sarma B. K. Vrudhula:
PARAG: PIM Architecture for Real-Time Acceleration of GCNs. 11-20
- Jake Choi, Jaejin Lee, Sunchul Jung, Heon Young Yeom:
Hybrid CUDA Unified Memory Management in Fully Homomorphic Encryption Workloads. 21-30
- Shaik Jani Basha, Sandani Shaik, Nazrinbanu Nagori, Veerendra Shetty:
Mobile Gaming Experience: An Approach Based on Thread Scheduler & Thread Priority Manager. 31-40
- Shulei Xu, Goutham Kalikrishna Reddy Kuncham, Mustafa Abduljabbar, Hari Subramoni, Dhabaleswar K. D. K. Panda:
Optimized All-to-All Connection Establishment for High-Performance MPI Libraries Over InfiniBand. 41-50
- Sirui Qi, Dejan S. Milojicic, Cullen E. Bash, Sudeep Pasricha:
MOSAIC: A Multi-Objective Optimization Framework for Sustainable Datacenter Management. 51-60
- Mengtian Yang, Yipeng Wang, Jaydeep P. Kulkarni:
A 118 GOPS/mm² 3D eDRAM TensorCore Architecture for Large-scale Matrix Multiplication. 61-65
- Zhihui Du, Oliver Alvarado Rodriguez, Fuhuan Li, Mohammad Dindoost, David A. Bader:
Contour Algorithm for Connectivity. 66-75
- Henk Dreuning, Kees Verstoep, Henri E. Bal, Rob V. van Nieuwpoort:
CAPTURE: Memory-Centric Partitioning for Distributed DNN Training with Hybrid Parallelism. 76-86
- Daegun Yoon, Sangyoon Oh:
MiCRO: Near-Zero Cost Gradient Sparsification for Scaling and Accelerating Distributed DNN Training. 87-96
- Robert Underwood, Meghana Madhyastha, Randal C. Burns, Bogdan Nicolae:
Understanding Patterns of Deep Learning Model Evolution in Network Architecture Search. 97-106
- Jinghan Yao, Nawras Alnaasan, Tian Chen, Aamir Shafi, Hari Subramoni, Dhabaleswar K. D. K. Panda:
Flover: A Temporal Fusion Framework for Efficient Autoregressive Model Parallel Inference. 107-116
- Pu Jiao, Sheng Di, Jinyang Liu, Xin Liang, Franck Cappello:
Characterization and Detection of Artifacts for Error-Controlled Lossy Compressors. 117-126
- Prashanthi S. K, Vinayaka Hegde, Keerthana Patchava, Ankita Das, Yogesh Simmhan:
Performance Characterization of Containerized DNN Training and Inference on Edge Accelerators. 127-131
- Arham Khan, Sheng Di, Kai Zhao, Jinyang Liu, Kyle Chard, Ian T. Foster, Franck Cappello:
SECRE: Surrogate-Based Error-Controlled Lossy Compression Ratio Estimation Framework. 132-142
- Tania Banerjee, Jaemoon Lee, Jong Choi, Qian Gong, Jieyang Chen, Scott Klasky, Anand Rangarajan, Sanjay Ranka:
Fast Algorithms for Scientific Data Compression. 143-152
- Alberto Riccardo Martinelli, Massimo Torquati, Marco Aldinucci, Iacopo Colonnelli, Barbara Cantalupo:
CAPIO: a Middleware for Transparent I/O Streaming in Data-Intensive Workflows. 153-163
- Akshin Singh, Smruti R. Sarangi:
JASS: A Tunable Checkpointing System for NVM-Based Systems. 164-173
- Shashank Khobragade, Santi Gopal Mondal, Kalyan Gunda:
Multi-Streamed Metadata-Integrity Verification For Cloud Migration In Deduplication Systems. 174-178
- Mathialakan Thavappiragasam, Vivek Kale:
CPU-GPU Tuning for Modern Scientific Applications using Node-Level Heterogeneity. 179-183
- Hari Sharan, Mythili Vutukuru, Biswabandan Panda:
DDIOSim: A Microarchitecture Simulator for Data Direct I/O Technology. 184-188
- Ankit Choudhary, S. K. Vaibhav Kodavati, B. Mythili, R. V. G. Anjaneyulu, Manju Sarma. M:
FPGA Accelerated Bi-Cubic Convolution for Image Interpolation. 189-193
- Shuai Yang, Changyou Zhang, Ji Ma:
DeltaSPARSE: High-Performance Sparse General Matrix-Matrix Multiplication on Multi-GPU Systems. 194-202
- Koushik Sen, Sathish Vadhiyar, P. N. Vinayachandran:
Strategies for Fast I/O Throughput in Large-Scale Climate Modeling Applications. 203-212
- Kyle Marino, Pengmiao Zhang, Viktor K. Prasanna:
ME-ViT: A Single-Load Memory-Efficient FPGA Accelerator for Vision Transformers. 213-223
- Vinícius Vitor dos Santos Dias, Samuel Ferraz, Aditya Vadlamani, Mahdi Erfanian, Carlos H. C. Teixeira, Dorgival O. Guedes, Wagner Meira Jr., Srinivasan Parthasarathy:
Graph Pattern Mining Paradigms: Consolidation and Renewed Bearing. 224-233
- Arafath Nihar, Thomas G. Ciardi, Rounak Chawla, Olatunde Akanbi, Vipin Chaudhary, Yinghui Wu, Roger H. French:
Accelerating Time to Science using CRADLE: A Framework for Materials Data Science. 234-245
- Kevin Assogba, Bogdan Nicolae, M. Mustafa Rafique:
Optimizing the Training of Co-Located Deep Learning Models Using Cache-Aware Staggering. 246-255
- Avinash Maurya, Bogdan Nicolae, M. Mustafa Rafique, Franck Cappello:
Towards Efficient I/O Pipelines Using Accumulated Compression. 256-265
- Jan-Harm L. F. Betting, Chris I. De Zeeuw, Christos Strydis:
Oikonomos-II: A Reinforcement-Learning, Resource-Recommendation System for Cloud HPC. 266-276
- Zainul Abideen Sayed, Jaroslaw Zola:
SCoOL - Scalable Common Optimization Library. 277-287
- Satanu Maity, Mayank Goel, Manojit Ghose:
Data Locality Aware Computation Offloading in Near Memory Processing Architecture for Big Data Applications. 288-297
- Philip E. Davis, Jacob S. Merson, Pradeep Subedi, Lee F. Ricketson, Cameron W. Smith, Mark S. Shephard, Manish Parashar:
Benesh: a Framework for Choreographic Coordination of In Situ Workflows. 298-308
- Shubhradeep Roy, Suvarthi Sarkar, Aryabartta Sahu:
Profit Maximization Using Collaborative Storage Management in Multi-Tier Edge-Cloud System. 309-318
- Jiwoo Bang, Chungyong Kim, Eun-Kyu Byun, Hanul Sung, Jaehwan Lee, Hyeonsang Eom:
Towards Enhanced I/O Performance of NVM File Systems. 319-323
- Shruti Shivakumar, Ilya Amburg, Sinan G. Aksoy, Jiajia Li, Stephen J. Young, Srinivas Aluru:
Fast Parallel Tensor Times Same Vector for Hypergraphs. 324-334
- Ullas A, Rupesh Nasre, R. Govindarajan:
Reduce, Reuse, and Adapt: Accelerating Graph Processing on GPUs. 335-346
- Zhiyi Zhang, Pengfei Zhang, Zhuopin Xu, Qi Wang:
Reduce Computational Complexity for Convolutional Layers by Skipping Zeros. 347-356
- Lisheng Xie, Jianwei Xue, Liangshun Wu, Faquan Chen, Qingyang Tian, Yifan Zhou, Rendong Ying, Peilin Liu:
SpikeNC: An Accurate and Scalable Simulator for Spiking Neural Network on Multi-Core Neuromorphic Hardware. 357-366
- Anubhav Jana, Purushottam Kulkarni, Umesh Bellur:
DAGit: A Platform For Enabling Serverless Applications. 367-376
- Mohammad Zubair, Desh Ranjan, Aaron Walden, Gabriel Nastac, Eric J. Nielsen, Boris Diskin, Marc F. Paterno, Samuel Jung, Joshua Hoke Davis:
Efficient GPU Implementation of Automatic Differentiation for Computational Fluid Dynamics. 377-386
- Ajeya Bhat, Sai Manasa Chadalavada, Nagakishore Jammula, Chirag Jain, Yogesh Simmhan:
A Lossless Compression Pipeline for Petabyte-Scale Whole Genome Sequencing Data. 387-391
