default search action
45th ICPP 2016: Philadelphia, PA, USA
- 45th International Conference on Parallel Processing, ICPP 2016, Philadelphia, PA, USA, August 16-19, 2016. IEEE Computer Society 2016, ISBN 978-1-5090-2823-8
Session 1A: Data Center and Cloud 1
- Jun Duan, Yuanyuan Yang:
Efficient Virtual Network Embedding for Variable Size Virtual Machines in Fat-Tree Data Centers. 1-10 - Tingwei Zhu, Dan Feng, Yu Hua, Fang Wang, Qingyu Shi, Jiahao Liu:
MIC: An Efficient Anonymous Communication System in Data Center Networks. 11-20 - Dian Shen, Junzhou Luo, Fang Dong, Junxue Zhang:
AppBag: Application-Aware Bandwidth Allocation for Virtual Machines in Cloud Environment. 21-30 - Leonardo Piga, Indrani Paul, Wei Huang:
Performance Boosting Opportunities under Communication Imbalance in Power-Constrained HPC Clusters. 31-40 - Zhenhua Li, Yuanyuan Yang:
RRect: A Novel Server-centric Data Center Network with High Availability. 41-46
Session 1B: Architecture 1
- Yi Lin, Po-Chun Huang, Duo Liu, Xiao Zhu, Liang Liang:
Making In-Memory Frequent Pattern Mining Durable and Energy Efficient. 47-56 - Qingda Hu, Jiwu Shu, Jie Fan, Youyou Lu:
Run-Time Performance Estimation and Fairness-Oriented Scheduling Policy for Concurrent GPGPU Applications. 57-66 - Xiaqing Li, Guangyan Zhang, H. Howie Huang, Zhufan Wang, Weimin Zheng:
Performance Analysis of GPU-Based Convolutional Neural Networks. 67-76 - Shuang Song, Meng Li, Xinnian Zheng, Michael LeBeane, Jee Ho Ryoo, Reena Panda, Andreas Gerstlauer, Lizy K. John:
Proxy-Guided Load Balancing of Graph Processing Workloads on Heterogeneous Clusters. 77-86 - Lei Cui, Zhiyu Hao, Chonghua Wang, Haiqiang Fei, Zhenquan Ding:
Piccolo: A Fast and Efficient Rollback System for Virtual Machine Clusters. 87-92
Session 2A: Parallel Algorithms
- Patrick Mackey, Robert R. Lewis:
Parallel k-Means++ for Multiple Shared-Memory Architectures. 93-102 - Oguz Kaya, Bora Uçar:
High Performance Parallel Algorithms for the Tucker Decomposition of Sparse Tensors. 103-112 - Moohyeon Nam, Jinwoong Kim, Beomseok Nam:
Parallel Tree Traversal for Nearest Neighbor Query on the GPU. 113-122 - Anne Benoit, Loic Pottier, Yves Robert:
Resilient Application Co-scheduling with Processor Redistribution. 123-132 - Jessica McClintock, Anthony Wirth:
Efficient Parallel Algorithms for k-Center Clustering. 133-138
Session 2B: Architecture 2
- Xin Wang, Xiaofeng Ji, Yunping Lu, Yi Li, Weijia Zhou, Weihua Zhang, Wenyun Zhao:
Understanding the Architectural Characteristics of EDA Algorithms. 139-148 - Jing Wang, Yanjun Liu, Weigong Zhang, Kezhong Lu, Keni Qiu, Xin Fu, Tao Li:
Exploring Variation-Aware Fault-Tolerant Cache under Near-Threshold Computing. 149-158 - Zheng Li, Fang Wang, Dan Feng, Yu Hua, Wei Tong, Jingning Liu, Xiang Liu:
Tetris Write: Exploring More Write Parallelism Considering PCM Asymmetries. 159-168 - Ping Huang, Wenjie Liu, Kun Tang, Xubin He, Ke Zhou:
ROP: Alleviating Refresh Overheads via Reviving the Memory System in Frozen Cycles. 169-178 - Zhibin Yu, Lieven Eeckhout, Cheng-Zhong Xu:
Thread Similarity Matrix: Visualizing Branch Divergence in GPGPU Programs. 179-184
Session 3A: Programming Techniques 1
- Sayan Ghosh, Jeff R. Hammond, Antonio J. Peña, Pavan Balaji, Assefaw Hadish Gebremedhin, Barbara M. Chapman:
One-Sided Interface for Matrix Operations Using MPI-3 RMA: A Case Study with Elemental. 185-194 - Jintao Meng, Sangmin Seo, Pavan Balaji, Yanjie Wei, Bingqiang Wang, Shengzhong Feng:
SWAP-Assembler 2: Optimization of De Novo Genome Assembler at Extreme Scale. 195-204 - Indranil Roy, Ankit Srivastava, Srinivas Aluru:
Programming Techniques for the Automata Processor. 205-210 - Jinsu Park, Woongki Baek:
RCHC: A Holistic Runtime System for Concurrent Heterogeneous Computing. 211-216
Session 3B: Parallel Algorithms 2
- Matthew Graichen, Joseph Izraelevitz, Michael L. Scott:
An Unbounded Nonblocking Double-Ended Queue. 217-226 - Jian-Jun Han, Xin Tao, Dakai Zhu, Hakan Aydin:
Criticality-Aware Partitioning for Multicore Mixed-Criticality Systems. 227-235 - Dominique LaSalle, George Karypis:
A Parallel Hill-Climbing Refinement Algorithm for Graph Partitioning. 236-241 - Evangelia A. Sitaridi, René Müller, Tim Kaldewey, Guy M. Lohman, Kenneth A. Ross:
Massively-Parallel Lossless Data Decompression. 242-247
Session 4A: Data Cloud and Cloud 2
- Prasanna Balaprakash, Vitali A. Morozov, Rajkumar Kettimuthu, Kalyan Kumaran, Ian T. Foster:
Improving Data Transfer Throughput with Direct Search Optimization. 248-257 - Alexandre Denis, François Trahay:
MPI Overlap: Benchmark and Analysis. 258-267 - Jie Zhang, Xiaoyi Lu, Dhabaleswar K. Panda:
High Performance MPI Library for Container-Based HPC Cloud on InfiniBand Clusters. 268-277 - Rui Han, Siguang Huang, Fei Tang, Fu-Gui Chang, Jianfeng Zhan:
AccuracyTrader: Accuracy-Aware Approximate Processing for Low Tail Latency and High Result Accuracy in Cloud Online Services. 278-287 - Pradeep Subedi, Ping Huang, Tong Liu, Joseph Moore, Stan Skelton, Xubin He:
CoARC: Co-operative, Aggressive Recovery and Caching for Failures in Erasure Coded Hadoop. 288-293
Session 4B: Cyberphysical Systems 1
- Guoju Gao, Mingjun Xiao, Zhenhua Zhao:
Optimal Multi-taxi Dispatch for Mobile Taxi-Hailing Systems. 294-303 - Jia Liu, Bin Xiao, Xuan Liu, Lijun Chen:
Fast RFID Polling Protocols. 304-313 - Zongjian He, Daqiang Zhang, Jiannong Cao, Xuefeng Liu, Xiaopeng Fan, Cheng-Zhong Xu:
Exploiting Real-Time Traffic Light Scheduling with Taxi Traces. 314-323 - Ankur Sarker, Chenxi Qiu, Haiying Shen, Andrea Gil, Joachim Taiber, Mashrur Chowdhury, Jim Martin, Mac Devine, Andrew J. Rindos:
An Efficient Wireless Power Transfer System to Balance the State of Charge of Electric Vehicles. 324-333 - Huijie Chen, Fan Li, Yu Wang:
EchoLoc: Accurate Device-Free Hand Localization Using COTS Devices. 334-339
Session 5A: Parallel Algorithms 3
- Koji Nakano, Daisuke Takafuji, Satoshi Fujita, Hiroki Matsutani, Ikki Fujiwara, Michihiro Koibuchi:
Randomly Optimized Grid Graph for Low-Latency Interconnection Networks. 340-349 - Davide Frey, Hicham Lakhlef, Michel Raynal:
Optimal Collision/Conflict-Free Distance-2 Coloring in Wireless Synchronous Broadcast/Receive Tree Networks. 350-359 - Bapi Chatterjee, Ivan Walulya, Philippas Tsigas:
Help-Optimal and Language-Portable Lock-Free Concurrent Data Structures. 360-369 - Zhengyuan Xue, Ruixuan Li, Heng Zhang, Xiwu Gu, Zhiyong Xu:
DC-Top-k: A Novel Top-k Selecting Algorithm and Its Parallelization. 370-379 - Napath Pitaksirianan, Zhila Nouri, Yi-Cheng Tu:
Efficient 2-Body Statistics Computation on GPUs: Parallelization & Beyond. 380-385
Session 5B: Storage Systems
- Peter R. Denz, Matthew Curtis-Maury, Vinay Devadas:
Think Global, Act Local: A Buffer Cache Design for Global Ordering and Parallel Processing in the WAFL File System. 386-395 - Chu Li, Dan Feng, Yu Hua, Fang Wang:
Improving RAID Performance Using an Endurable SSD Cache. 396-405 - Houjun Tang, Suren Byna, Steve Harenberg, Wenzhao Zhang, Xiaocheng Zou, Daniel F. Martin, Bin Dong, Dharshi Devendran, Kesheng Wu, David Trebotich, Scott Klasky, Nagiza F. Samatova:
In Situ Storage Layout Optimization for AMR Spatio-temporal Read Accesses. 406-415 - Sagar Thapaliya, Purushotham V. Bangalore, Jay F. Lofstead, Kathryn M. Mohror, Adam Moody:
Managing I/O Interference in a Shared Burst Buffer System. 416-425 - Hao Wen, David Hung-Chang Du, Milan Shetti, Doug Voigt, Shanshan Li:
Guaranteed Bang for the Buck: Modeling VDI Applications with Guaranteed Quality of Service. 426-431
Session 6A: Programming Techniques 2
- Benoît Pradelle, Benoît Meister, Muthu Manikandan Baskaran, Athanasios Konstantinidis, Thomas Henretty, Richard Lethin:
Scalable Hierarchical Polyhedral Compilation. 432-441 - Jingna Zeng, João Pedro Barreto, Seif Haridi, Luís E. T. Rodrigues, Paolo Romano:
The Future(s) of Transactional Memory. 442-451 - Sanjay Chatterjee, Nick Vrvilo, Zoran Budimlic, Kathleen Knobe, Vivek Sarkar:
Declarative Tuning for Locality in Parallel Programs. 452-457 - Vivekanandan Balasubramanian, Antons Treikalis, Ole Weidner, Shantenu Jha:
Ensemble Toolkit: Scalable and Flexible Execution of Ensembles of Tasks. 458-463
Session 6B: Cyberphysical Systems 2
- Ziqi Zhao, Fan Wu, Shaolei Ren, Xiaofeng Gao, Guihai Chen, Yong Cui:
TECH: A Thermal-Aware and Cost Efficient Mechanism for Colocation Demand Response. 464-473 - Houssem Chihoub, Christine Collet:
A Scalability Comparison Study of Data Management Approaches for Smart Metering Systems. 474-483 - John W. Romein:
A Comparison of Accelerator Architectures for Radio-Astronomical Signal-Processing Algorithms. 484-489 - Kang Chen, Haiying Shen:
MobiSensing: Exploiting Human Mobility for Multi-application Mobile Data Sensing with Low User Intervention. 490-495
Session 7A: Performance Modeling
- Akrem Benatia, Weixing Ji, Yizhuo Wang, Feng Shi:
Sparse Matrix Format Selection with Multiclass SVM for SpMV on GPU. 496-505 - Jeffrey Daily, Ananth Kalyanaraman, Sriram Krishnamoorthy, Bin Ren:
On the Impact of Widening Vector Registers on Sequence Alignment. 506-515 - Rong Ge, Xizhou Feng, Yangyang He, Pengfei Zou:
The Case for Cross-Component Power Coordination on Power Bounded Systems. 516-525 - Shi Sha, Wujie Wen, Ming Fan, Shaolei Ren, Gang Quan:
Performance Maximization via Frequency Oscillation on Temperature Constrained Multi-core Processors. 526-535 - Panfeng Zhang, Ping Huang, Xubin He, Hua Wang, Lingyu Yan, Ke Zhou:
RMD: A Resemblance and Mergence Based Approach for High Performance Deduplication. 536-541
Session 7B: GPU Applications
- Cen Chen, Kenli Li, Aijia Ouyang, Zhuo Tang, Keqin Li:
GFlink: An In-Memory Computing Architecture on Heterogeneous CPU-GPU Clusters for Big Data. 542-551 - Ming-Hsiang Huang, Wuu Yang:
Partial Flattening: A Compilation Technique for Irregular Nested Parallelism on GPGPUs. 552-561 - Feng Zhang, Peng Di, Hao Zhou, Xiangke Liao, Jingling Xue:
RegTT: Accelerating Tree Traversals on GPUs by Exploiting Regularities. 562-571 - Xiaonan Tian, Dounia Khaldi, Deepak Eachempati, Rengan Xu, Barbara M. Chapman:
Optimizing GPU Register Usage: Extensions to OpenACC and Compiler Optimizations. 572-581 - Yi Yang, Min Feng, Srimat T. Chakradhar:
HppCnn: A High-Performance, Portable Deep-Learning Library for GPGPUs. 582-587
Session 8A: Applications
- Guillaume Aupy, JeongHyung Park, Padma Raghavan:
Locality-Aware Laplacian Mesh Smoothing. 588-597 - Sameh Shohdy, Abhinav Vishnu, Gagan Agrawal:
Fault Tolerant Support Vector Machines. 598-607 - Juliette Pardue, Andrey N. Chernikov:
Parallel Two-Dimensional Unstructured Anisotropic Delaunay Mesh Generation of Complex Domains for Aerospace Applications. 608-617
Session 8B: Scalable Software
- Sudip K. Seal, Steven P. Hirshman, Andreas Wingen, Robert S. Wilcox, Mark R. Cianciosa, Ezekial A. Unterberg:
PARVMEC: An Efficient, Scalable Implementation of the Variational Moments Equilibrium Code. 618-627 - Antons Treikalis, André Merzky, Haoyuan Chen, Tai-Sung Lee, Darrin M. York, Shantenu Jha:
RepEx: A Flexible Framework for Scalable Replica Exchange Molecular Dynamics Simulations. 628-637 - Huan Feng, David M. Eyers, Steven Mills, Yongwei Wu, Zhiyi Huang:
PCAF: Scalable, High Precision k-NN Search Using Principal Component Analysis Based Filtering. 638-647
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.