IEEE Transactions on Parallel and Distributed Systems, Volume 34
Volume 34, Number 1, January 2023
- Davide Frey, Achour Mostéfaoui, Matthieu Perrin, Pierre-Louis Roman, François Taïani:
Differentiated Consistency for Worldwide Gossips. 1-15 - Qingguo Lü, Xiaofeng Liao, Shaojiang Deng, Huaqing Li:
Asynchronous Algorithms for Decentralized Resource Allocation Over Directed Networks. 16-32 - Xu Jiang, Haochun Liang, Nan Guan, Yue Tang, Lei Qiao, Wang Yi:
Scheduling Parallel Real-Time Tasks on Virtual Processors. 33-47 - Xiaocan Li, Kun Xie, Xin Wang, Gaogang Xie, Kenli Li, Jiannong Cao, Dafang Zhang, Jigang Wen:
Tripartite Graph Aided Tensor Completion For Sparse Network Measurement. 48-62 - Yanqing Chen, Chen Tian, Jiaqing Dong, Song Feng, Xu Zhang, Chang Liu, Peiwen Yu, Nai Xia, Wanchun Dou, Guihai Chen:
Swing: Providing Long-Range Lossless RDMA via PFC-Relay. 63-75 - Bin Yu, Cong Tian, Xu Lu, Nan Zhang, Zhenhua Duan:
A Distributed Network-Based Runtime Verification of Full Regular Temporal Properties. 76-91 - Mingyue Wang, Yinbin Miao, Yu Guo, Hejiao Huang, Cong Wang, Xiaohua Jia:
AESM2 Attribute-Based Encrypted Search for Multi-Owner and Multi-User Distributed Systems. 92-107 - Sabrina De Capitani di Vimercati, Sara Foresti, Sushil Jajodia, Stefano Paraboschi, Pierangela Samarati, Roberto Sassi:
Sentinels and Twins: Effective Integrity Assessment for Distributed Computation. 108-122 - Chen Wang, Yanfei Guo, Pavan Balaji, Marc Snir:
Near-Lossless MPI Tracing and Proxy Application Autogeneration. 123-140 - Xiaoyu Hao, Tao Fang, Junshi Chen, Jun Gu, Jiawang Feng, Hong An, Chun Zhao:
swMPAS-A: Scaling MPAS-A to 39 Million Heterogeneous Cores on the New Generation Sunway Supercomputer. 141-153 - Tengfei Li, Jianfeng Chu, Liang Hu:
CIA: A Collaborative Integrity Auditing Scheme for Cloud Data With Multi-Replica on Multi-Cloud Storage Providers. 154-162 - Young Sik Lee, Yong Wook Kim, Tae Hee Han:
MRCN: Throughput-Oriented Multicast Routing for Customized Network-on-Chips. 163-179 - Changyuan Lin, Nima Mahmoudi, Caixiang Fan, Hamzeh Khazaei:
Fine-Grained Performance and Cost Modeling and Optimization for FaaS Applications. 180-194 - Kan Liu, Wei Xue:
A Novel Compute-Efficient Tridiagonal Solver for Many-Core Architectures. 195-206 - Nazmus Saquib, Chandra Krintz, Rich Wolski:
Replicated Versioned Data Structures for Wide-Area Distributed Systems. 207-224 - Hui Zhang, Rong-Xia Hao, Xiao-Wen Qin, Cheng-Kuan Lin, Sun-Yuan Hsieh:
The High Faulty Tolerant Capability of the Alternating Group Graphs. 225-233 - Hailiang Zhao, Shuiguang Deng, Feiyi Chen, Jianwei Yin, Schahram Dustdar, Albert Y. Zomaya:
Learning to Schedule Multi-Server Jobs With Fluctuated Processing Speeds. 234-245 - Wei Sun, Ang Li, Tong Geng, Sander Stuijk, Henk Corporaal:
Dissecting Tensor Cores via Microbenchmarks: Latency, Throughput and Numeric Behaviors. 246-261 - Yudi Qiu, Jie Jiao, Xiaoyang Zeng, Yibo Fan:
Tag-Sharer-Fusion Directory: A Scalable Coherence Directory With Flexible Entry Formats. 262-274 - Preyesh Dalmia, Rohan Mahapatra, Jeremy Intan, Dan Negrut, Matthew D. Sinclair:
Improving the Scalability of GPU Synchronization Primitives. 275-290 - Eleonora D'Arnese, Davide Conficconi, Emanuele Del Sozzo, Luigi Fusco, Donatella Sciuto, Marco Domenico Santambrogio:
Faber: A Hardware/SoftWare Toolchain for Image Registration. 291-303 - Jiarui Fang, Zilin Zhu, Shenggui Li, Hui Su, Yang Yu, Jie Zhou, Yang You:
Parallel Training of Pre-Trained Models via Chunk-Based Dynamic Memory Management. 304-315 - Pallav Kumar Deb, Anandarup Mukherjee, Digvijay Singh, Sudip Misra:
Loop-the-Loops: Fragmented Learning Over Networks for Constrained IoT Devices. 316-327 - Zichuan Xu, Dapeng Zhao, Weifa Liang, Omer F. Rana, Pan Zhou, Mingchu Li, Wenzheng Xu, Hao Li, Qiufen Xia:
HierFedML: Aggregator Placement and UE Assignment for Hierarchical Federated Learning in Mobile Edge Computing. 328-345 - Shaojun Zhang, Chen Wang, Albert Y. Zomaya:
Robustness Analysis and Enhancement of Deep Reinforcement Learning-Based Schedulers. 346-357 - Wanchun Jiang, Yujia Qiu, Yucheng Chen, Fa Ji, HaiMing Xie, Xiangqian Zhou, Jialiang Chen, Jiawei Huang, Jianxin Wang, Yan Li:
Accelerated Information Dissemination for Replica Selection in Distributed Key-Value Store Systems. 358-371 - Roberto R. Osorio:
Floating Point Calculation of the Cube Function on FPGAs. 372-382 - Weiguang Liu, Jinhua Cui, Tiantian Li, Junwei Liu, Laurence T. Yang:
A Space-Efficient Fair Cache Scheme Based on Machine Learning for NVMe SSDs. 383-399 - Yu Huang, Tao Wang, Zihui Yin, Eric Mercer, Benjamin Ogles:
Improving the Efficiency of Deadlock Detection in MPI Programs Through Trace Compression. 400-415 - Sunil Kumar, Krishna Kumar Mohbey:
A Utility-Based Distributed Pattern Mining Algorithm With Reduced Shuffle Overhead. 416-428
Volume 34, Number 2, February 2023
- Manish Parashar:
Editorial. 429-430 - Tongfeng Weng, Xu Zhou, Kenli Li, Kian-Lee Tan, Keqin Li:
Distributed Approaches to Butterfly Analysis on Large Dynamic Bipartite Graphs. 431-445 - Sheng-Hao Chiang, Chih-Hang Wang, De-Nian Yang, Wanjiun Liao, Wen-Tsuen Chen:
Distributed Multicast Traffic Engineering for Multi-Domain Software-Defined Networks. 446-462 - Shuai Zhang, Zite Jiang, Xingzhong Hou, Mingyu Li, Mengting Yuan, Haihang You:
DRONE: An Efficient Distributed Subgraph-Centric Framework for Processing Large-Scale Power-law Graphs. 463-474 - Huan Zhou, Mingze Li, Ning Wang, Geyong Min, Jie Wu:
Accelerating Deep Learning Inference via Model Parallelism and Partial Computation Offloading. 475-488 - Xiaoli Wang, Bharadwaj Veeravalli, Jiaming Song, Honghu Liu:
On the Design and Evaluation of an Optimal Security-and-Time Cognizant Data Placement for Dynamic Fog Environments. 489-500 - Mingyu Liu, Li Pan, Shijun Liu:
RLTiering: A Cost-Driven Auto-Tiering System for Two-Tier Cloud Storage Using Deep Reinforcement Learning. 501-518 - Tian Xia, Gelin Fu, Chenyang Li, Zhongpei Luo, Lucheng Zhang, Ruiyang Chen, Wenzhe Zhao, Nanning Zheng, Pengju Ren:
A Comprehensive Performance Model of Sparse Matrix-Vector Multiplication to Guide Kernel Optimization. 519-534 - Ji Liu, Juncheng Jia, Beichen Ma, Chendi Zhou, Jingbo Zhou, Yang Zhou, Huaiyu Dai, Dejing Dou:
Multi-Job Intelligent Scheduling With Cross-Device Federated Learning. 535-551 - Jianghong Wei, Xiaofeng Chen, Jianfeng Wang, Xinyi Huang, Willy Susilo:
Securing Fine-Grained Data Sharing and Erasure in Outsourced Storage Systems. 552-566 - Hai Jin, Dongshan Bai, Dezhong Yao, Yutong Dai, Lin Gu, Chen Yu, Lichao Sun:
Personalized Edge Intelligence via Federated Self-Knowledge Distillation. 567-580 - Christie L. Alappat, Georg Hager, Olaf Schenk, Gerhard Wellein:
Level-Based Blocking for Sparse Matrices: Sparse Matrix-Power-Vector Multiplication. 581-597 - Zhichao Lu, Chuntao Ding, Felix Juefei-Xu, Vishnu Naresh Boddeti, Shangguang Wang, Yun Yang:
TFormer: A Transmission-Friendly ViT Model for IoT Devices. 598-610 - Marcin Rogowski, Samar Aseeri, David E. Keyes, Lisandro Dalcín:
mpi4py.futures: MPI-Based Asynchronous Task Execution for Python. 611-622 - Ayesha Afzal, Georg Hager, Gerhard Wellein:
The Role of Idle Waves, Desynchronization, and Bottleneck Evasion in the Performance of Parallel Programs. 623-638 - Zhongteng Cai, Junyuan Liang, Wuhui Chen, Zicong Hong, Hong-Ning Dai, Jianting Zhang, Zibin Zheng:
Benzene: Scaling Blockchain With Cooperation-Based Sharding. 639-654 - Xiaocan Li, Kun Xie, Xin Wang, Gaogang Xie, Kenli Li, Jiannong Cao, Dafang Zhang, Hongbo Jiang, Jigang Wen:
Neighbor Graph Based Tensor Recovery For Accurate Internet Anomaly Detection. 655-674 - Rituparna Saha, Sudip Misra, Aishwariya Chakraborty, Chandranath Chatterjee, Pallav Kumar Deb:
Data-Centric Client Selection for Federated Learning Over Distributed Edge Networks. 675-686 - Zhigang Wang, Yilei Tu, Ning Wang, Lixin Gao, Jie Nie, Zhiqiang Wei, Yu Gu, Ge Yu:
FSP: Towards Flexible Synchronous Parallel Frameworks for Distributed Machine Learning. 687-703 - Paula Olaya, Dominic Kennedy, Ricardo M. Llamas, Leobardo Valera, Rodrigo Vargas, Jay F. Lofstead, Michela Taufer:
Building Trust in Earth Science Findings through Data Traceability and Results Explainability. 704-717 - Sangeet Saha, Shounak Chakraborty, Sukarn Agarwal, Rahul Gangopadhyay, Magnus Själander, Klaus D. McDonald-Maier:
DELICIOUS: Deadline-Aware Approximate Computing in Cache-Conscious Multicore. 718-733 - Tingfang Wu, Linqiang Pan:
Spiking Neural P Systems With Communication on Request and Mute Rules. 734-745 - Songlin He, Yuan Lu, Qiang Tang, Guiling Wang, Chase Qishi Wu:
Blockchain-Based P2P Content Delivery With Monetary Incentivization and Fairness Guarantee. 746-765
Volume 34, Number 3, March 2023
- Hang Cao, Liang Yuan, He Zhang, Yunquan Zhang, Baodong Wu, Kun Li, Shigang Li, Minghua Zhang, Pengqi Lu, Junmin Xiao:
AGCM-3DLF: Accelerating Atmospheric General Circulation Model via 3-D Parallelization and Leap-Format. 766-780 - Haolin Liu, Saiqin Long, Zhetao Li, Yu Fu, Yong Zuo, Xinglin Zhang:
Revenue Maximizing Online Service Function Chain Deployment in Multi-Tier Computing Network. 781-796 - Wenjie Liu, Xubin He, Qing Liu:
Exploring Memory Access Similarity to Improve Irregular Application Performance for Distributed Hybrid Memory Systems. 797-809 - Rajesh Devaraj, Arnab Sarkar:
Comments on "IPPTS: An Efficient Algorithm for Scientific Workflow Scheduling in Heterogeneous Computing Systems". 810-811 - Fei Xu, Jianian Xu, Jiabin Chen, Li Chen, Ruitao Shang, Zhi Zhou, Fangming Liu:
iGniter: Interference-Aware GPU Resource Provisioning for Predictable DNN Inference in the Cloud. 812-827 - Zecheng Li, Bin Xiao, Songtao Guo, Yuanyuan Yang:
Securing Deployed Smart Contracts and DeFi With Distributed TEE Cluster. 828-842 - Yiwen Zhang, Jian Zhou, Xinhao Min, Song Ge, Jiguang Wan, Ting Yao, Daohui Wang:
PetaKV: Building Efficient Key-Value Store for File System Metadata on Persistent Memory. 843-855 - Alessandro Staffolani, Victor-Alexandru Darvariu, Paolo Bellavista, Mirco Musolesi:
RLQ: Workload Allocation With Reinforcement Learning in Distributed Queues. 856-868 - Giorgio Audrito, Federico Terraneo, William Fornaciari:
FCPP+Miosix: Scaling Aggregate Programming to Embedded Systems. 869-880 - Xiaohui Duan, Qi Shao, Junben Weng, Bertil Schmidt, Lin Gan, Guohui Li, Haohuan Fu, Wei Xue, Weiguo Liu, Guangwen Yang:
Bio-ESMD: A Data Centric Implementation for Large-Scale Biological System Simulation on Sunway TaihuLight Supercomputer. 881-893 - Liang Yuan, Qiang He, Siyu Tan, Bo Li, Jiangshan Yu, Feifei Chen, Yun Yang:
CoopEdge+: Enabling Decentralized, Secure and Cooperative Multi-Access Edge Computing Based on Blockchain. 894-908 - Zhenheng Tang, Shaohuai Shi, Bo Li, Xiaowen Chu:
GossipFL: A Decentralized Federated Learning Framework With Sparsified and Adaptive Communication. 909-922 - Haoyu Jin, Donglei Wu, Shuyu Zhang, Xiangyu Zou, Sian Jin, Dingwen Tao, Qing Liao, Wen Xia:
Design of a Quantization-Based DNN Delta Compression Framework for Model Snapshots and Federated Learning. 923-937 - Ziheng Wang, Xiaoshe Dong, Heng Chen, Yan Kang:
Efficient GPU Implementations of Post-Quantum Signature XMSS. 938-954 - YuAng Chen, Yeh-Ching Chung:
An Unequal Caching Strategy for Shared-Memory Graph Analytics. 955-967 - Andrea Detti, Ludovico Funari, Luca Petrucci:
μBench: An Open-Source Factory of Benchmark Microservice Applications. 968-980 - Qiang-Sheng Hua, Xiaohui Zhang, Hai Jin, Hong Huang:
Revisiting Core Maintenance for Dynamic Hypergraphs. 981-994 - Patrik Omland, Alessio Netti, Yang Peng, Andrea Baldovin, Michael Paulitsch, Gustavo Espinosa, Jorge Parra, Gereon Hinz, Alois C. Knoll:
HPC Hardware Design Reliability Benchmarking With HDFIT. 995-1006 - Yu Zhang, Duo Liu, Moming Duan, Li Li, Xianzhang Chen, Ao Ren, Yujuan Tan, Chengliang Wang:
FedMDS: An Efficient Model Discrepancy-Aware Semi-Asynchronous Clustered Federated Learning Framework. 1007-1019 - Haiying Shen, Haoyu Wang, Jiechao Gao, Rajkumar Buyya:
An Instability-Resilient Renewable Energy Allocation System for a Cloud Datacenter. 1020-1034 - Zhonghui Mei:
Minimizing the Average Packet Access Time of the Application Layer for Buffered Instantly Decodable Network Coding. 1035-1046 - Hui Cai, Yuanyuan Yang, Weibei Fan, Fu Xiao, Yanmin Zhu:
Towards Correlated Data Trading for High-Dimensional Private Data. 1047-1059
Volume 34, Number 4, April 2023
- Florian Van Daalen, Lianne Ippel, Andre Dekker, Iñigo Bermejo:
Privacy Preserving n-Party Scalar Product Protocol. 1060-1066 - Zhiwei Wang, Peinan Li, Rui Hou, Zhihao Li, Jiangfeng Cao, XiaoFeng Wang, Dan Meng:
HE-Booster: An Efficient Polynomial Arithmetic Acceleration on GPUs for Fully Homomorphic Encryption. 1067-1081 - Yu Yao, Yukun Song, Ying Huang, Wei Ni, Duoli Zhang:
A Memory-Constraint-Aware List Scheduling Algorithm for Memory-Constraint Heterogeneous Muti-Processor System. 1082-1099 - Sourabh Kulkarni, Csaba Andras Moritz:
Improving Effectiveness of Simulation-Based Inference in the Massively Parallel Regime. 1100-1114 - Manish Kumar, Anisur Rahaman Molla:
On the Message Complexity of Fault-Tolerant Computation: Leader Election and Agreement. 1115-1127 - Stefan Bora, Brenton D. Walker, Markus Fidler:
The Tiny-Tasks Granularity Trade-Off: Balancing Overhead Versus Performance in Parallel Systems. 1128-1144 - Renhao Lu, Weizhe Zhang, Yan Wang, Qiong Li, Xiaoxiong Zhong, Hongwei Yang, Desheng Wang:
Auction-Based Cluster Federated Learning in Mobile Edge Computing Systems. 1145-1158 - Runze Lei, Pinghui Wang, Junzhou Zhao, Lin Lan, Jing Tao, Chao Deng, Junlan Feng, Xidian Wang, Xiaohong Guan:
Federated Learning Over Coupled Graphs. 1159-1172 - Xiaoxin Su, Yipeng Zhou, Laizhong Cui, Jiangchuan Liu:
On Model Transmission Strategies in Federated Learning With Lossy Communications. 1173-1185 - Nan He, Song Yang, Fan Li, Stojan Trajanovski, Liehuang Zhu, Yu Wang, Xiaoming Fu:
Leveraging Deep Reinforcement Learning With Attention Mechanism for Virtual Network Function Placement and Routing. 1186-1201 - Jie Xue, Liwen Ren, Bosheng Song, Yujie Guo, Jie Lu, Xiyu Liu, Guanzhong Gong, Dengwang Li:
Hypergraph-Based Numerical Neural-Like P Systems for Medical Image Segmentation. 1202-1214 - Laércio Lima Pilla:
Scheduling Algorithms for Federated Learning With Minimal Energy Consumption. 1215-1226 - Lennart Bamberg, Arash Pourtaherian, Luc Waeijen, Anupam Chahar, Orlando Moreira:
Synapse Compression for Event-Based Convolutional-Neural-Network Accelerators. 1227-1240 - Kristof Jannes, Emad Heydari Beni, Bert Lagaisse, Wouter Joosen:
BeauForT: Robust Byzantine Fault Tolerance for Client-Centric Mobile Web Applications. 1241-1252 - Jiankang Song, Limei Lin, Yanze Huang, Sun-Yuan Hsieh:
Intermittent Fault Diagnosis of Split-Star Networks and its Applications. 1253-1264 - Weibei Fan, Fu Xiao, Mengjie Lv, Lei Han, Junchang Wang, Xin He:
Node Essentiality Assessment and Distributed Collaborative Virtual Network Embedding in Datacenters. 1265-1280 - Shiwei Zhang, Xiaodong Yi, Lansong Diao, Chuan Wu, Siyu Wang, Wei Lin:
Expediting Distributed DNN Training With Device Topology-Aware Graph Deployment. 1281-1293 - Ke Cheng, Sheng Zhang, Chenghong Tu, Xiaohang Shi, Zhaoheng Yin, Sanglu Lu, Yu Liang, Qing Gu:
ProScale: Proactive Autoscaling for Microservice With Time-Varying Workload at the Edge. 1294-1312 - Deepika Saxena, Jitendra Kumar, Ashutosh Kumar Singh, Stefan Schmid:
Performance Analysis of Machine Learning Centered Workload Prediction Models for Cloud. 1313-1330 - Yeting Guo, Fang Liu, Tongqing Zhou, Zhiping Cai, Nong Xiao:
Privacy vs. Efficiency: Achieving Both Through Adaptive Hierarchical Federated Learning. 1331-1342 - Shuo Qin, Dechang Pi, Zhongshi Shao, Yue Xu, Yang Chen:
Reliability-Aware Multi-Objective Memetic Algorithm for Workflow Scheduling Problem in Multi-Cloud System. 1343-1361 - Yahui Li, Han Zhang, Jilong Wang, Zhiliang Wang, Xia Yin, Xingang Shi, Jianping Wu:
A General Approach to Generate Test Packets With Network Configurations. 1362-1375
Volume 34, Number 5, May 2023
- Tim Shaffer, Thanh Son Phung, Kyle Chard, Douglas Thain:
Landlord: Coordinating Dynamic Software Environments to Reduce Container Sprawl. 1376-1389 - Zhaohua Wang, Zhenyu Li, Heng Pan, Guangming Liu, Yunfei Chen, Qinghua Wu, Gareth Tyson, Gang Cheng:
Large-Scale Measurements and Prediction of DC-WAN Traffic. 1390-1405 - Yu Zhou, Yanli Ren, Mengtian Xu, Guorui Feng:
An Improved NSGA-III Algorithm Based on Deep Q-Networks for Cloud Storage Optimization of Blockchain. 1406-1419 - Ruikun Luo, Hai Jin, Qiang He, Song Wu, Xiaoyu Xia:
Enabling Balanced Data Deduplication in Mobile Edge Computing. 1420-1431 - Fanxin Li, Shixiong Zhao, Yuhao Qing, Xusheng Chen, Xiuxian Guan, Sen Wang, Gong Zhang, Heming Cui:
Fold3D: Rethinking and Parallelizing Computational and Communicational Tasks in the Training of Large DNN Models. 1432-1449 - An Zou, Jing Li, Christopher D. Gill, Xuan Zhang:
RTGPU: Real-Time GPU Scheduling of Hard Deadline Parallel Tasks With Fine-Grain Utilization. 1450-1465 - Zhiquan Lai, Shengwei Li, Xudong Tang, Keshi Ge, Weijie Liu, Yabo Duan, Linbo Qiao, Dongsheng Li:
Merak: An Efficient Distributed DNN Training Framework With Automated 3D Parallelism for Giant Foundation Models. 1466-1478 - Hyeonjin Kim, William J. Song:
LAS: Locality-Aware Scheduling for GEMM-Accelerated Convolutions in GPUs. 1479-1494 - Yang Liu, Huanle Xu, Wing Cheong Lau:
Cloud Configuration Optimization for Recurring Batch-Processing Applications. 1495-1507 - Gongming Zhao, Jiawei Liu, Yutong Zhai, Hongli Xu, He Huang:
Alleviating the Impact of Abnormal Events Through Multi-Constrained VM Placement. 1508-1523 - Kim Liegeois, Sivasankaran Rajamanickam, Luc Berger-Vergiat:
Performance Portable Batched Sparse Linear Solvers. 1524-1535 - Zhilin Wang, Qin Hu, Ruinian Li, Minghui Xu, Zehui Xiong:
Incentive Mechanism Design for Joint Resource Allocation in Blockchain-Based Federated Learning. 1536-1547 - Feijie Wu, Song Guo, Haozhao Wang, Haobo Zhang, Zhihao Qu, Jie Zhang, Ziming Liu:
From Deterioration to Acceleration: A Calibration Approach to Rehabilitating Step Asynchronism in Federated Optimization. 1548-1559 - Qiong Wu, Xu Chen, Tao Ouyang, Zhi Zhou, Xiaoxi Zhang, Shusen Yang, Junshan Zhang:
HiFlash: Communication-Efficient Hierarchical Federated Learning With Adaptive Staleness Control and Heterogeneity-Aware Client-Edge Association. 1560-1579 - Andreas Thune, Sven-Arne Reinemo, Tor Skeie, Xing Cai:
Detailed Modeling of Heterogeneous and Contention-Constrained Point-to-Point MPI Communication. 1580-1593 - Muhammad Aditya Sasongko, Milind Chabbi, Paul H. J. Kelly, Didem Unat:
Precise Event Sampling on AMD Versus Intel: Quantitative and Qualitative Comparison. 1594-1608 - Guillermo G. Trabes, Gabriel A. Wainer, Veronica Gil-Costa:
A Parallel Algorithm to Accelerate DEVS Simulations in Shared Memory Architectures. 1609-1620 - Philippos Papaphilippou, Kentaro Sano, Boma Anantasatya Adhi, Wayne Luk:
Experimental Survey of FPGA-Based Monolithic Switches and a Novel Queue Balancer. 1621-1634 - Xin Li, Junsong Zhou, Xin Wei, Dawei Li, Zhuzhong Qian, Jie Wu, Xiaolin Qin, Sanglu Lu:
Topology-Aware Scheduling Framework for Microservice Applications in Cloud. 1635-1649 - Qingyang Duan, Chao Peng, Zeqin Wang, Yuedong Xu, Shaoteng Liu, Jun Wu, John C. S. Lui:
Accelerating Distributed DNN Training via Transport Layer Scheduling. 1650-1666 - Shaowei Wang, Xuandi Luo, Yuqiu Qian, Youwen Zhu, Kongyang Chen, Qi Chen, Bangzhou Xin, Wei Yang:
Shuffle Differential Private Data Aggregation for Random Population. 1667-1681 - Danushka Menikkumbura, Parvin Taheri, Erico Vanini, Sonia Fahmy, Patrick Eugster, Tom Edsall:
Congestion Control for Datacenter Networks: A Control-Theoretic Approach. 1682-1696
Volume 34, Number 6, June 2023
- Le Mai Weakley, Tim Robinson:
Guest Editorial Reproducibility Initiative at the SC Conference Series: A Preface to the Special Section. 1697-1698 - Ankit Srivastava, Sriram P. Chockalingam, Srinivas Aluru:
A Parallel Framework for Constraint-Based Bayesian Network Learning via Markov Blanket Discovery. 1699-1715 - Guancheng Li, Songhui Cao, Chuyi Zhao, Siyuan Zhang, Yuchen Ji, Haotian Jing, Zecheng Li, Jiajun Cheng, Yiwei Yang, Shu Yin:
Critique of "A Parallel Framework for Constraint-Based Bayesian Network Learning via Markov Blanket Discovery" by SCC Team From ShanghaiTech University. 1716-1719 - Jiaqi Si, Junyi Guo, Zhewen Hao, Wenyang He, Ruihan Li, Yueyang Pan, Zhenxin Fu, Chun Fan:
Critique of "A Parallel Framework for Constraint-Based Bayesian Network Learning via Markov Blanket Discovery" by SCC Team From Peking University. 1720-1722 - Juncheng Cao, Kaiyuan Rong, Mingshu Zhai, Zeyu Song, Yanyu Ren, Yuxi Zhu, Wentao Han, Jidong Zhai:
Critique of "A Parallel Framework for Constraint-Based Bayesian Network Learning via Markov Blanket Discovery" by SCC Team From Tsinghua University. 1723-1726 - Arunav Gupta, John Ge, John Li, Zihao Kong, Kaiwen He, Matthew Mikhailov, Bryan Chin, Xiaochen Li, Maximilian Apodaca, Paul Rodríguez, Mahidhar Tatineni, Mary P. Thomas, Santosh Bhatt:
Critique of: "A Parallel Framework for Constraint-Based Bayesian Network Learning via Markov Blanket Discovery" by SCC Team From UC San Diego. 1727-1730 - Se-Young Yu, Qingyang Zeng, Jim Chen, Yan Chen, Joe Mambretti:
AIDTN: Towards a Real-Time AI Optimized DTN System With NVMeoF. 1731-1742 - Zhuan Liu, Ruichen Han, Yansong Zhang, Yu Zhang, Xi Tang, Gang Deng, Tao Zhong, Roman Dementiev, Yunfei Lu, Mingjian Que:
Exploring Fine-Grained In-Memory Database Performance for Modern CPUs. 1757-1772 - Zhaolong Jian, Ye Lu, Youyang Qiao, Yaozheng Fang, Xueshuo Xie, Dayi Yang, Zhiyuan Zhou, Tao Li:
TSC-VEE: A TrustZone-Based Smart Contract Virtual Execution Environment. 1773-1788 - N. Jagan Mohan, R. Murugan, Tripti Goel, Parthapratim Roy:
DRFL: Federated Learning in Diabetic Retinopathy Grading Using Fundus Images. 1789-1801 - Hongbin Zhuang, Xiao-Yan Li, Jou-Ming Chang, Dajin Wang:
An Efficient Algorithm for Hamiltonian Path Embedding of k-Ary n-Cubes Under the Partitioned Edge Fault Model. 1802-1815 - Chuan Luo, Sizhao Wang, Tianrui Li, Hongmei Chen, Jiancheng Lv, Zhang Yi:
RHDOFS: A Distributed Online Algorithm Towards Scalable Streaming Feature Selection. 1830-1847 - Jin Wang, Jia Hu, Jed Mills, Geyong Min, Ming Xia, Nektarios Georgalas:
Federated Ensemble Model-Based Reinforcement Learning in Edge Computing. 1848-1859 - Maciej Besta, Marc Fischer, Vasiliki Kalavri, Michael Kapralov, Torsten Hoefler:
Practice of Streaming Processing of Dynamic Graphs: Concepts, Models, and Systems. 1860-1876 - Zahra Najafabadi Samani, Narges Mehran, Dragi Kimovski, Shajulin Benedict, Nishant Saurabh, Radu Prodan:
Incremental Multilayer Resource Partitioning for Application Placement in Dynamic Fog. 1877-1896 - Saeid Alirezazadeh, Luís A. Alexandre:
Static Algorithm Allocation With Duplication in Robotic Network Cloud Systems. 1897-1908 - Jie Xu, Qingyuan Xie, Sen Peng, Cong Wang, Xiaohua Jia:
AdaptChain: Adaptive Scaling Blockchain With Transaction Deduplication. 1909-1922 - Qian Chen, Zilong Wang, Jiawei Chen, Haonan Yan, Xiaodong Lin:
Dap-FL: Federated Learning Flourishes by Adaptive Tuning and Secure Aggregation. 1923-1941 - Wentai Wu, Ligang He, Weiwei Lin, Carsten Maple:
FedProf: Selective Federated Learning Based on Distributional Representation Profiling. 1942-1953 - Shiyang Li, Ruiqi Tang, Jingyu Zhu, Ziyi Zhao, Xiaoli Gong, Wenwen Wang, Jin Zhang, Pen-Chung Yew:
Liberator: A Data Reuse Framework for Out-of-Memory Graph Computing on GPUs. 1954-1967 - Wenchao Wu, Xuanhua Shi, Ligang He, Hai Jin:
TurboMGNN: Improving Concurrent GNN Training Tasks on GPU With Fine-Grained Kernel Fusion. 1968-1981 - Zining Zhang, Yao Chen, Bingsheng He, Zhenjie Zhang:
NIOT: A Novel Inference Optimization of Transformers on Modern CPUs. 1982-1995 - Shruti Shivakumar, Jiajia Li, Ramakrishnan Kannan, Srinivas Aluru:
Sparse Symmetric Format for Tucker Decomposition. 1743-1756 - Chi Zhang, Yuan Meng, Viktor K. Prasanna:
A Framework for Mapping DRL Algorithms With Prioritized Replay Buffer Onto Heterogeneous Platforms. 1816-1829
Volume 34, Number 7, July 2023
- Lei Xu, Honghui Shang, Xin Chen, Yunquan Zhang, Lifang Wang, Xingyu Gao, Haifeng Song:
Redesigning OpenKMC for Multi-Component Trillion-Atom Simulations on the New Sunway Supercomputer. 1997-2010 - Pierre Balty, Philippe Chatelain, Thomas Gillis:
FLUPS - A Flexible and Performant Massively Parallel Fourier Transform Library. 2011-2024 - Andrea Fresa, Jaya Prakash Champati:
Offloading Algorithms for Maximizing Inference Accuracy on Edge Device in an Edge Intelligence System. 2025-2039 - Wenqi Wei, Ling Liu, Jingya Zhou, Ka Ho Chow, Yanzhao Wu:
Securing Distributed SGD Against Gradient Leakage Threats. 2040-2054 - Rui-Xiao Zhang, Changpeng Yang, Xiaochan Wang, Tianchi Huang, Chenglei Wu, Jiangchuan Liu, Lifeng Sun:
Practical Cloud-Edge Scheduling for Large-Scale Crowdsourced Live Streaming. 2055-2071 - Hui Sun, Yiru Chen, Kewei Sha, Shaoyuan Huang, Xiaofei Wang, Weisong Shi:
A Proactive On-Demand Content Placement Strategy in Edge Intelligent Gateways. 2072-2090 - Chunyang Wang, Yuebin Bai, Desen Sun:
CD-MSA: Cooperative and Deadline-Aware Scheduling for Efficient Multi-Tenancy on DNN Accelerators. 2091-2106 - Jason Kennedy, Vishal Sharma, Blesson Varghese, Carlos Reaño:
Multi-Tier GPU Virtualization for Deep Learning in Cloud-Edge Systems. 2107-2123 - Mingjun Dai, Jialong Yuan, Qingwen Huang, Xiaohui Lin, Hui Wang:
Distributed Encoding and Updating for SAZD Coded Distributed Training. 2124-2137 - Yuchen Li, Weifa Liang, Jing Li, Xiuzhen Cheng, Dongxiao Yu, Albert Y. Zomaya, Song Guo:
Energy-Aware, Device-to-Device Assisted Federated Learning in Edge Computing. 2138-2154 - Guangming Cui, Qiang He, Xiaoyu Xia, Feifei Chen, Yun Yang:
EESaver: Saving Energy Dynamically for Green Multi-Access Edge Computing. 2155-2166 - Chengyu Sun, Huizhang Luo, Hong Jiang, Jeff Zhang, Kenli Li:
COFFEE: Cross-Layer Optimization for Fast and Efficient Executions of Sinkhorn-Knopp Algorithm on HPC Systems. 2167-2179 - Andrzej Jackowski, Lukasz Slusarczyk, Krzysztof Lichota, Michal Welnicki, Rafal Wijata, Mateusz Kielar, Tadeusz Kopec, Cezary Dubnicki, Konrad Iwanicki:
ObjDedup: High-Throughput Object Storage Layer for Backup Systems With Block-Level Deduplication. 2180-2197 - Jed Mills, Jia Hu, Geyong Min:
Faster Federated Learning With Decaying Number of Local SGD Steps. 2198-2207 - Damian Borowiec, Gingfung Yeung, Adrian Friday, Richard Harper, Peter Garraghan:
DOPpler: Parallel Measurement Infrastructure for Auto-Tuning Deep Learning Tensor Programs. 2208-2220 - Jiazhi Jiang, Jiangsu Du, Dan Huang, Zhiguang Chen, Yutong Lu, Xiangke Liao:
Full-Stack Optimizing Transformer Inference on ARM Many-Core CPU. 2221-2235
Volume 34, Number 8, August 2023
- Tian Xie, Sanchal Thakkar, Ting He, Patrick D. McDaniel, Quinn Burke:
Joint Caching and Routing in Cache Networks With Arbitrary Topology. 2237-2250 - Hai Zhou, Dan Feng:
Boosting Erasure-Coded Multi-Stripe Repair in Rack Architecture and Heterogeneous Clusters: Design and Analysis. 2251-2264 - Md. Arifuzzaman, Brian Bockelman, Jim Basney, Engin Arslan:
Falcon: Fair and Efficient Online File Transfer Optimization. 2265-2278 - Chanho Park, Bogil Kim, Sungmin Ryu, William J. Song:
NeuroSpector: Systematic Optimization of Dataflow Scheduling in DNN Accelerators. 2279-2294 - Ting Li, Jiyan Sun, Yinlong Liu, Xu Zhang, Dali Zhu, Zhaorui Guo, Liru Geng:
ESMO: Joint Frame Scheduling and Model Caching for Edge Video Analytics. 2295-2310 - Hongjing Huang, Yingtao Li, Jie Sun, Xueying Zhu, Jie Zhang, Liang Luo, Jialin Li, Zeke Wang:
P4SGD: Programmable Switch Enhanced Model-Parallel Training on Generalized Linear Models on Distributed FPGAs. 2311-2324 - Chenle Yu, Sara Royuela, Eduardo Quiñones:
Taskgraph: A Low Contention OpenMP Tasking Framework. 2325-2336 - Chenyi Liu, Pingfei Wu, Mingwei Xu, Yuan Yang, Nan Geng:
Scalable Deep Reinforcement Learning-Based Online Routing for Multi-Type Service Requirements. 2337-2351 - Fujun He, Eiji Oki:
Preventive Priority Setting Against Multiple Controller Failures in Software Defined Networks. 2352-2364 - Massimo Bernaschi, Alessandro Celestini, Flavio Vella, Pasqua D'Ambra:
A Multi-GPU Aggregation-Based AMG Preconditioner for Iterative Linear Solvers. 2365-2376 - Peng Liang, Yu Tang, Xiaoda Zhang, Youhui Bai, Teng Su, Zhiquan Lai, Linbo Qiao, Dongsheng Li:
A Survey on Auto-Parallelism of Large-Scale Deep Learning Training. 2377-2390 - Biao Hu, Xincheng Yang, Mingguo Zhao:
Energy-Minimized Scheduling of Intermittent Real-Time Tasks in a CPU-GPU Cloud Computing Platform. 2391-2402 - Zan Zong, Li Lin, Leilei Lin, Lijie Wen, Yu Sun:
STR: Hybrid Tensor Re-Generation to Break Memory Wall for DNN Training. 2403-2418 - Haotian Wang, Wangdong Yang, Rong Hu, Renqiu Ouyang, Kenli Li, Keqin Li:
A Novel Parallel Algorithm for Sparse Tensor Matrix Chain Multiplication via TCU-Acceleration. 2419-2432 - Bo Li, Lei Cui, Zhiyu Hao, Lun Li, Yongji Liu, Yongnan Li:
eHotSnap: An Efficient and Hot Distributed Snapshots System for Virtual Machine Cluster. 2433-2447 - Heng Yu, Han Zhang, Junxian Shen, Yantao Geng, Jilong Wang, Congcong Miao, Mingwei Xu:
Serpens: A High Performance FaaS Platform for Network Functions. 2448-2463 - Hailu Xu, Pinchao Liu, Sarker Tanzir Ahmed, Dilma Da Silva, Liting Hu:
Adaptive Fragment-Based Parallel State Recovery for Stream Processing Systems. 2464-2478 - Xinhan Wang, Huanlai Xing, Fuhong Song, Shouxi Luo, Penglin Dai, Bowen Zhao:
On Jointly Optimizing Partial Offloading and SFC Mapping: A Cooperative Dual-Agent Deep Reinforcement Learning Approach. 2479-2497
Volume 34, Number 9, September 2023
- Siling Yang, Shuibing He, Hexiao Duan, Weijian Chen, Xuechen Zhang, Tong Wu, Yanlong Yin:
APQ: Automated DNN Pruning and Quantization for ReRAM-Based Accelerators. 2498-2511 - Jie Cui, Hu Sun, Hong Zhong, Jing Zhang, Lu Wei, Irina Bolodurina, Debiao He:
Collaborative Intrusion Detection System for SDVN: A Fairness Federated Deep Learning Approach. 2512-2528 - Peng Qu, Hui Lin, Meng Pang, Xiaofei Liu, Weimin Zheng, Youhui Zhang:
ENLARGE: An Efficient SNN Simulation Framework on GPU Clusters. 2529-2540 - Zhao Liu, Mengquan Li, Mincan Li, Lei Liao, Kenli Li:
An Efficient Hierarchical-Reduction Architecture for Aggregation in Route Travel Time Estimation. 2541-2552 - Zhenqian Chen, Xinkui Zhao, Chen Zhi, Jianwei Yin:
DeepBoot: Dynamic Scheduling System for Training and Inference Deep Learning Tasks in GPU Cluster. 2553-2567 - Xiaozhong Jin, Haikun Liu, Chencheng Ye, Xiaofei Liao, Hai Jin, Yu Zhang:
Accelerating Content-Defined Chunking for Data Deduplication Based on Speculative Jump. 2568-2579 - Bingyi Zhang, Hanqing Zeng, Viktor K. Prasanna:
GraphAGILE: An FPGA-Based Overlay Accelerator for Low-Latency GNN Inference. 2580-2597 - Yanhong Wang, Tianchan Guan, Dimin Niu, Qiaosha Zou, Hongzhong Zheng, Chuanjin Richard Shi, Yuan Xie:
Accelerating Distributed GNN Training by Codes. 2598-2614
Volume 34, Number 10, October 2023
- Kenta Ishiguro, Naoki Yasuno, Pierre-Louis Aublin, Kenji Kono:
Revisiting VM-Agnostic KVM vCPU Scheduler for Mitigating Excessive vCPU Spinning. 2615-2628 - Zhengjie Yang, Sen Fu, Wei Bao, Dong Yuan, Albert Y. Zomaya:
Hierarchical Federated Learning With Momentum Acceleration in Multi-Tier Networks. 2629-2641 - Mingyang Song, Zhongyun Hua, Yifeng Zheng, Tao Xiang, Xiaohua Jia:
FCDedup: A Two-Level Deduplication System for Encrypted Data in Fog Computing. 2642-2656 - Changwu Zhang, Hao Sun, Shuman Li, Yaohua Wang, Haiyan Chen, Hengzhu Liu:
A Survey of Memory-Centric Energy Efficient Computer Architecture. 2657-2670 - Weiguo Zhu, Yongqi Sun, Rongqiang Fang, Baomin Xu:
A Low-Memory Community Detection Algorithm With Hybrid Sparse Structure and Structural Information for Large-Scale Networks. 2671-2683 - Nikolaos Tampouratzis, Panagiotis Mousouliotis, Ioannis Papaefstathiou:
A Novel Integrated Simulation Framework for Cyber-Physical Systems Modelling. 2684-2698 - Yihua Hu, Feng Zhang, Yifei Xia, Zhiming Yao, Letian Zeng, Haipeng Ding, Zhewei Wei, Xiao Zhang, Jidong Zhai, Xiaoyong Du, Siqi Ma:
Enabling Efficient Random Access to Hierarchically Compressed Text Data on Diverse GPU Platforms. 2699-2717 - Xun Wang, Ruibao Song, Junmin Xiao, Tong Li, Xueqi Li:
Accelerating k-Shape Time Series Clustering Algorithm Using GPU. 2718-2734 - Lanlan Rui, Shiyou Chen, Shuyun Wang, Zhipeng Gao, Xuesong Qiu, Wenjing Li, Shaoyong Guo:
SFC Orchestration Method for Edge Cloud and Central Cloud Collaboration: QoS and Energy Consumption Joint Optimization Combined With Reputation Assessment. 2735-2748 - Geyao Cheng, Lailong Luo, Junxu Xia, Deke Guo, Yuchen Sun:
When Deduplication Meets Migration: An Efficient and Adaptive Strategy in Distributed Storage Systems. 2749-2766 - Huaipeng Zhang, Nhut-Minh Ho, Dogukan Yigit Polat, Peng Chen, Mohamed Wahib, Truong Thao Nguyen, Jintao Meng, Rick Siow Mong Goh, Satoshi Matsuoka, Tao Luo, Weng-Fai Wong:
Simeuro: A Hybrid CPU-GPU Parallel Simulator for Neuromorphic Computing Chips. 2767-2782 - Francieli Boito, Guillaume Pallez, Luan Teylo, Nicolas Vidal:
IO-Sets: Simple and Efficient Approaches for I/O Bandwidth Management. 2783-2796 - Mingzhe Li, Wei Wang, Jin Zhang:
LB-Chain: Load-Balanced and Low-Latency Blockchain Sharding via Account Migration. 2797-2810 - Junxu Xia, Geyao Cheng, Lailong Luo, Deke Guo, Pin Lv, Bowen Sun:
The Doctrine of MEAN: Realizing Deduplication Storage at Unreliable Edge. 2811-2826 - Zhida Jiang, Yang Xu, Hongli Xu, Lun Wang, Chunming Qiao, Liusheng Huang:
Joint Model Pruning and Topology Construction for Accelerating Decentralized Machine Learning. 2827-2842
Volume 34, Number 11, November 2023
- Li Li, Jiajie Shen, Bochun Wu, Yangfan Zhou, Xin Wang, Keqin Li:
Adaptive Data Placement in Multi-Cloud Storage: A Non-Stationary Combinatorial Bandit Approach. 2843-2859 - Rongxin Han, Dezhi Chen, Song Guo, Jingyu Wang, Qi Qi, Lu Lu, Jianxin Liao:
Multi-SP Network Slicing Parallel Relieving Edge Network Conflict. 2860-2875 - Kaicheng Yang, Sheng Long, Qilong Shi, Yuanpeng Li, Zirui Liu, Yuhan Wu, Tong Yang, Zhengyi Jia:
SketchINT: Empowering INT With TowerSketch for Per-Flow Per-Switch Measurement. 2876-2894 - Yulong Wu, Weizhe Zhang, Nan Guan, Yehan Ma:
TDTA: Topology-Based Real-Time DAG Task Allocation on Identical Multiprocessor Platforms. 2895-2909 - Zhijie Yang, Lei Wang, Wei Shi, Yao Wang, Junbo Tie, Feng Wang, Xiang Yu, Linghui Peng, Chao Xiao, Xun Xiao, Yao Yao, Gan Zhou, Xuhu Yu, Rui Gong, Xia Zhao, Yuhua Tang, Weixia Xu:
Back to Homogeneous Computing: A Tightly-Coupled Neuromorphic Processor With Neuromorphic ISA. 2910-2927 - Carlos Bilbao, Juan Carlos Saez, Manuel Prieto-Matías:
Divide&Content: A Fair OS-Level Resource Manager for Contention Balancing on NUMA Multicores. 2928-2945 - Rong Gu, Zhihao Xu, Yang Che, Xu Wang, Haipeng Dai, Kai Zhang, Bin Fan, Haojun Hou, Li Yi, Yu Ding, Yihua Huang, Guihai Chen:
High-Level Data Abstraction and Elastic Data Caching for Data-Intensive AI Applications on Cloud-Native Platforms. 2946-2964 - Yilian Zhang, Yao Liu, Penglong Jiao, Yiping Zhou, Tongquan Wei:
Automatic Multi-Parameter Performance Modeling of HPC Applications on a New Sunway Supercomputer. 2965-2977 - Yizhi Huang, Yan Liu, Yang Bai, Si Chen, Renfa Li:
UMA-MF: A Unified Multi-CPU/GPU Asynchronous Computing Framework for SGD-Based Matrix Factorization. 2978-2993 - Yi Hu, Hao Wang, Liangyuan Wang, Menglan Hu, Kai Peng, Bharadwaj Veeravalli:
Joint Deployment and Request Routing for Microservice Call Graphs in Data Centers. 2994-3011
Volume 34, Number 12, December 2023
- Wanchun Jiang, Haoyang Li, Yulong Yan, Fa Ji, Jiawei Huang, Jianxin Wang, Tong Zhang:
Consistent Low Latency Scheduler for Distributed Key-Value Stores. 3012-3027 - Shuyu Pei, Jigang Wen, Kun Xie, Gaogang Xie, Kenli Li:
On-Line Network Traffic Anomaly Detection Based on Tensor Sketch. 3028-3045 - Hao Liu, Wenxin Li, Yiren Pang, Renjie Pei, Yitao Hu, Yuan Liu, Lide Suo, Keqiu Li:
Accelerating Data Delivery of Latency-Sensitive Applications in Container Overlay Network. 3046-3058 - Longlong Chen, Jianfeng Zhu, Guiqiang Peng, Mingxu Liu, Shaojun Wei, Leibo Liu:
GEM: Ultra-Efficient Near-Memory Reconfigurable Acceleration for Read Mapping by Dividing and Predictive Scattering. 3059-3072 - Yihong Li, Xiaoxi Zhang, Tianyu Zeng, Jingpu Duan, Chuan Wu, Di Wu, Xu Chen:
Task Placement and Resource Allocation for Edge Machine Learning: A GNN-Based Multi-Agent Reinforcement Learning Paradigm. 3073-3089 - Lei Yang, Can Zheng, Xiaoyuan Shen, Guoqi Xie:
OfpCNN: On-Demand Fine-Grained Partitioning for CNN Inference Acceleration in Heterogeneous Devices. 3090-3103 - Vasilios I. Kelefouras, Georgios Keramidas:
Design and Implementation of Deep Learning 2D Convolutions on Modern CPUs. 3104-3116 - Ping Gao, Xiaohui Duan, Bertil Schmidt, Wubing Wan, Jiaxu Guo, Wusheng Zhang, Lin Gan, Haohuan Fu, Wei Xue, Weiguo Liu, Guangwen Yang:
Redesign and Accelerate the AIREBO Bond-Order Potential on the New Sunway Supercomputer. 3117-3132 - Bosheng Liu, Zhuoshen Jiang, Yalan Wu, Jigang Wu, Xiaoming Chen, Peng Liu, Qingguo Zhou, Yinhe Han:
Frequency-Domain Inference Acceleration for Convolutional Neural Networks Using ReRAMs. 3133-3146 - Paul Scheffler, Florian Zaruba, Fabian Schuiki, Torsten Hoefler, Luca Benini:
Sparse Stream Semantic Registers: A Lightweight ISA Extension Accelerating General Sparse Linear Algebra. 3147-3161 - Qingyang Zhang, Zhiming Zhang, Jie Cui, Hong Zhong, Yang Li, Chengjie Gu, Debiao He:
Efficient Blockchain-Based Data Integrity Auditing for Multi-Copy in Decentralized Storage. 3162-3173 - Zhipeng Cheng, Xiaoyu Xia, Minghui Liwang, Xuwei Fan, Yanglong Sun, Xianbin Wang, Lianfen Huang:
CHEESE: Distributed Clustering-Based Hybrid Federated Split Learning Over Edge Networks. 3174-3191 - Zhaorui Wu, Yuhui Deng, Yi Zhou, Lin Cui, Xiao Qin:
HashCache: Accelerating Serverless Computing by Skipping Duplicated Function Execution. 3192-3206 - Yujia Zhai, Elisabeth Giem, Kai Zhao, Jinyang Liu, Jiajun Huang, Bryan M. Wong, Christian R. Shelton, Zizhong Chen:
FT-BLAS: A Fault Tolerant High Performance BLAS Implementation on x86 CPUs. 3207-3223 - Zhuoran Song, Wanzhen Liu, Tao Yang, Fangxin Liu, Naifeng Jing, Xiaoyao Liang:
A Point Cloud Video Recognition Acceleration Framework Based on Tempo-Spatial Information. 3224-3237 - Jialiang Ma, Li Li, Cheng-Zhong Xu:
AutoRS: Environment-Dependent Real-Time Scheduling for End-to-End Autonomous Driving. 3238-3252 - Zhihua Fan, Wenming Li, Zhen Wang, Tianyu Liu, Haibin Wu, Yanhuan Liu, Meng Wu, Xinxin Wu, Xiaochun Ye, Dongrui Fan, Ninghui Sun, Xuejun An:
Accelerating Convolutional Neural Networks by Exploiting the Sparsity of Output Activation. 3253-3265 - Xiaofeng Lu, Chao Liu, Senhao Zhu, Yilu Mao, Pietro Lio, Pan Hui:
RLPTO: A Reinforcement Learning-Based Performance-Time Optimized Task and Resource Scheduling Mechanism for Distributed Machine Learning. 3266-3279 - Li'an Xie, Ting Wang, Shuyi Du, Haibin Cai:
CERT-DF: A Computing-Efficient and Robust Distributed Deep Forest Framework With Low Communication Overhead. 3280-3293 - Enda Yu, Dezun Dong, Xiangke Liao:
Communication Optimization Algorithms for Distributed Deep Learning Systems: A Survey. 3294-3308 - Lei Yang, Yuwei Liao, Xin Cheng, Mengyuan Xia, Guoqi Xie:
Efficient Edge Data Management Framework for IIoT via Prediction-Based Data Reduction. 3309-3322