IPDPS 2020: New Orleans, LA, USA - Workshops
- 2020 IEEE International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2020, New Orleans, LA, USA, May 18-22, 2020. IEEE 2020, ISBN 978-1-7281-7445-7
HCW: Heterogeneity in Computing Workshop
- Behrooz A. Shirazi:
Message from the HCW Steering Committee Chair. 2 - John K. Antonio:
Message from the HCW General Chair. 3 - Florina M. Ciorba:
Message from the HCW Technical Program Committee Chair. 4 - Albert Y. Zomaya:
HCW 2020 Keynote Speaker Edge Intelligence Empowering IoT Data Analytics. 1 - Achim Lösch, Marco Platzner:
MigHEFT: DAG-based Scheduling of Migratable Tasks on Heterogeneous Compute Nodes. 6-16 - Ali Mokhtari, Chavit Denninnart, Mohsen Amini Salehi:
Autonomous Task Dropping Mechanism to Achieve Robustness in Heterogeneous Computing Systems. 17-26 - Mitsuo Yokokawa, Ayano Nakai, Kazuhiko Komatsu, Yuta Watanabe, Yasuhisa Masaoka, Yoko Isobe, Hiroaki Kobayashi:
I/O Performance of the SX-Aurora TSUBASA. 27-35 - Marcelo Brandalero, Hector Gerardo Muñoz Hernandez, Mitko Veleski, Muhammed Al Kadi, Paolo Rech, Michael Hübner:
(Special Topic Submission) Enabling Domain-Specific Architectures with an Open-Source Soft-Core GPGPU. 36-43 - Joshua Mack, Nirmal Kumbhare, Anish Krishnakumar, Ümit Y. Ogras, Ali Akoglu:
User-Space Emulation Framework for Domain-Specific SoC Design. 44-53 - Giuseppe Ascia, Vincenzo Catania, John Jose, Salvatore Monteleone, Maurizio Palesi, Davide Patti:
Improving Inference Latency and Energy of Network-on-Chip based Convolutional Neural Networks through Weights Compression. 54-63
RAW: Reconfigurable Architectures Workshop
- Jincheng Yu, Feng Gao, Jianfei Cao, Chao Yu, Zhaoliang Zhang, Zhengfeng Huang, Yu Wang, Huazhong Yang:
CNN-based Monocular Decentralized SLAM on embedded FPGA. 66-73 - Johanna Rohde, Karsten Müller, Christian Hochberger:
Improving HLS Generated Accelerators Through Relaxed Memory Access Scheduling. 74-81 - Stephen Tridgell, David Boland, Philip H. W. Leong, Ryan Kastner, Alireza Khodamoradi, Siddhartha:
Real-time Automatic Modulation Classification using RFSoC. 82-89 - Spencer Valancius, Edward Richter, Ruben Purdy, Kris Rockowitz, Michael Inouye, Joshua Mack, Nirmal Kumbhare, Kaitlin Lindsay Fair, John Mixter, Ali Akoglu:
FPGA Based Emulation Environment for Neuromorphic Architectures. 90-97 - Dionysios Diamantopoulos, Mitra Purandare, Burkhard Ringlein, Christoph Hagleitner:
PHRYCTORIA: A Messaging System for Transprecision OpenCAPI-attached FPGA Accelerators. 98-106 - Yuan Meng, Sanmukh R. Kuppannagari, Rachit Rajat, Ajitesh Srivastava, Rajgopal Kannan, Viktor K. Prasanna:
QTAccel: A Generic FPGA based Design for Q-Table based Reinforcement Learning Accelerators. 107-114 - Martin Langhammer, Gregg Baeckler, Sergey Gribok:
SpiderWeb - High Performance FPGA NoC. 115-118 - Håkan Englund, Niklas Lindskog:
Secure acceleration on cloud-based FPGAs - FPGA enclaves. 119-122 - Laurent Gantel, Alexandre Duc, Lucie Steiner, Fabien Vannel, Andres Upegui, Florent Gluck:
A FPGA-Based Post-Processing and Validation Platform for Random Number Generators. 123-126 - Tingyu Zhou, Tieyuan Pan, Michael Conrad Meyer, Yiping Dong, Takahiro Watanabe:
An Interval-based Mapping Algorithm for Multi-shape Tasks on Dynamic Partial Reconfigurable FPGAs. 127-130 - Jessica Leoni, Asia Ciallella, Luca Stornaiuolo, Marco D. Santambrogio, Donatella Sciuto:
EMPhASIS: An EMbedded Public Attention Stress Identification System. 131-134 - Giorgia Fiscaletti, Marco Speziali, Luca Stornaiuolo, Marco D. Santambrogio, Donatella Sciuto:
Hardware resources analysis of BNNs splitting for FARD-based multi-FPGAs Distributed Systems. 135-138 - Qian Zhao, Yasuhiro Nakahara, Motoki Amagasaki, Masahiro Iida, Takaichi Yoshida:
A Microcode-based Control Unit for Deep Learning Processors. 139-142 - Youki Sada, Naoto Soga, Masayuki Shimoda, Akira Jinguji, Shimpei Sato, Hiroki Nakahara:
Fast Monocular Depth Estimation on an FPGA. 143-146 - Lorenzo Di Tucci, Riyadh Baghdadi, Saman P. Amarasinghe, Marco D. Santambrogio:
SALSA: A Domain Specific Architecture for Sequence Alignment. 147-150 - Seung-Hun Chung, Tarek S. Abdelrahman:
Optimizing OpenCL Kernels and Runtime for DNN Inference on FPGAs. 151-154 - Guido Walter Di Donato, Alberto Zeni, Lorenzo Di Tucci, Marco D. Santambrogio:
Leveraging Succinct Data Structures for DNA Sequence Mapping on FPGA. 155-158
HiCOMB: High Performance Computational Biology
- Brandon Gildemaster, Prerana Ghalsasi, Sanjay V. Rajopadhye:
A Tropical Semiring Multiple Matrix-Product Library on GPUs: (not just) a step towards RNA-RNA Interaction Computations. 160-169 - Morgan Lee, George M. Slota:
Fast and High Quality Graph Alignment via Treelets. 170-173 - Francesco Peverelli, Lorenzo Di Tucci, Marco D. Santambrogio, Nan Ding, Steven A. Hofmeyr, Aydin Buluç, Leonid Oliker, Katherine A. Yelick:
GPU accelerated partial order multiple sequence alignment for long reads self-correction. 174-182 - Patricia H. Kovatch, Lili Gai, Hyung Min Cho, Eugene Fluder, Dansha Jiang:
Optimizing High-Performance Computing Systems for Biomedical Workloads. 183-192 - M. Stanley Fujimoto, Cole A. Lyman, Mark J. Clement:
Kcollections: A Fast and Efficient Library for K-mers. 193-198
GrAPL: Graphs, Architectures, Programming, and Learning
- Scott McMillan, Manoj Kumar, Danai Koutra, Mahantesh Halappanavar, Tim Mattson, Antonino Tumeo:
Message from the workshop chairs. 199-200 - George Karypis:
GrAPL 2020 Keynote Speaker Deep Graph Library: Overview, Updates, and Future Developments. 201 - Saman P. Amarasinghe:
GrAPL 2020 Keynote Speaker The GraphIt Universal Graph Framework: Achieving High Performance across Algorithms, Graph Types, and Architectures. 202 - Márton Elekes, Gábor Szárnyas:
An incremental GraphBLAS solution for the 2018 TTC Social Media case study. 203-206 - Jeremy Kepner, Tim Davis, Chansup Byun, William Arcand, David Bestor, William Bergeron, Vijay Gadepally, Matthew Hubbell, Michael Houle, Michael Jones, Anna Klein, Peter Michaleas, Lauren Milechin, Julie Mullen, Andrew Prout, Antonio Rosa, Siddharth Samsi, Charles Yee, Albert Reuther:
75,000,000,000 Streaming Inserts/Second Using Hierarchical Hypersparse GraphBLAS Matrices. 207-210 - Jovan Blanusa, Radu Stoica, Paolo Ienne, Kubilay Atasu:
Parallelizing Maximal Clique Enumeration on Modern Manycore Processors. 211-214 - Benjamin Brock, Aydin Buluç, Timothy G. Mattson, Scott McMillan, José E. Moreira, Roger Pearce, Oguz Selvitopi, Trevor Steil:
Considerations for a Distributed GraphBLAS API. 215-218 - Benjamin Brock, Aydin Buluç, Timothy G. Mattson, Scott McMillan, José E. Moreira:
A Roadmap for the GraphBLAS C++ API. 219-222 - Tze Meng Low, Daniele G. Spampinato, Scott McMillan, Michel Pelletier:
Linear Algebraic Louvain Method in Python. 223-226 - M. Yusuf Özkaya, Muhammed Fatih Balin, Ali Pinar, Ümit V. Çatalyürek:
A scalable graph generation algorithm to sample over a given shell distribution. 227-236 - Trevor Steil, Scott McMillan, Geoffrey Sanders, Roger Pearce, Benjamin Priest:
Kronecker Graph Generation with Ground Truth for 4-Cycles and Dense Structure in Bipartite Graphs. 237-246
EduPar: NSF/TCPP Workshop on Parallel and Distributed Computing Education
- Sushil K. Prasad, Tia Newhall, David P. Bunde, Martina Barnas, Satish Puri:
Message from the EduPar-20 Workshop Chairs. 247-249 - Martin Langhammer:
EduPar-20 Keynote Speaker. 250 - Henry A. Gabb, Andrew Lumsdaine, Margaret Martonosi, Arnold L. Rosenberg, Martina Barnas:
EduPar-20 Invited Panel. 251 - Joel C. Adams:
Retrospective: A Look Back at 20+ Years of Experience in Parallel Computing Education. 252-260 - David W. Brown, Vitaly Ford, Sheikh K. Ghafoor:
A Framework for the Evaluation of Parallel and Distributed Computing Educational Resources. 261-268 - Neftali Watkinson, Preston Tai, Alexandru Nicolau, Alexander V. Veidenbaum:
NumbaSummarizer: A Python Library for Simplified Vectorization Reports. 269-275 - Alice Lasserre, Raymond Namyst, Pierre-André Wacrenier:
EASYPAP: a Framework for Learning Parallel Programming. 276-283 - Suzanne J. Matthews:
PDCunplugged: A Free Repository of Unplugged Parallel Distributed Computing Activities. 284-291 - Mark C. Lewis, Lisa L. Lacher:
Teaching Modern Multithreading in CS2 with Actors. 292-299 - Cosimo Anglano, Massimo Canonico, Marco Guazzone:
Teaching Cloud Computing: Motivations, Challenges and Tools. 300-306 - Patrick J. McGee, Rade Latinovich, Dennis Brylow:
Using Embedded Xinu and the Raspberry Pi 3 to Teach Operating Systems. 307-315
HIPS: High-level Parallel Programming Models and Supportive Environments
- Dong Li, Heike Jagode:
Workshop 6: HIPS High-level Parallel Programming Models and Supportive Environments. 316 - Akshay Bhosale, Rudolf Eigenmann:
Compile-time Parallelization of Subscripted Subscript Patterns. 317-325 - Adrien Faure, Giorgio Lucarelli, Olivier Richard, Denis Trystram:
Online Scheduling with Redirection for Parallel Jobs. 326-329 - Colleen Bertoni, JaeHyuk Kwack, Thomas Applencourt, Yasaman Ghadar, Brian Homerding, Christopher Knight, Brice Videau, Huihuo Zheng, Vitali A. Morozov, Scott Parker:
Performance Portability Evaluation of OpenCL Benchmarks across Intel and NVIDIA Platforms. 330-339 - Shaohua Duan, Manish Parashar:
Scalable Crash Consistency for Staging-based In-situ Scientific Workflows. 340-348 - Robert Mijakovic, Michael Gerndt:
Automatic Selection of Tuning Plugins in PTF Using Machine Learning. 349-358 - Steffen Christgau, Thomas Steinke:
Porting a Legacy CUDA Stencil Code to oneAPI. 359-367 - Zheming Jin, Vitali A. Morozov, Hal Finkel:
A Case Study on the HACCmk Routine in SYCL on Integrated Graphics. 368-374 - Virginia Niculescu, Darius Bufnea, Adrian Sterca:
Enhancing Java Streams API with PowerList Computation. 375-384
HPBDC: High-Performance Big Data and Cloud Computing
- Xiaoyi Lu, Jianfeng Zhan:
Workshop 7: HPBDC High-Performance Big Data and Cloud Computing. 385 - Marat Dukhan, Artsiom Ablavatski:
Two-Pass Softmax Algorithm. 386-395 - Jia Guo, Gagan Agrawal:
Smart Streaming: A High-Throughput Fault-tolerant Online Processing System. 396-405 - Houjun Tang, Suren Byna, Bin Dong, Quincey Koziol:
Parallel Query Service for Object-centric Data Management Systems. 406-415 - Chen Zeng, Yifan Wang, Fan Liang, Xiaohui Peng:
Pinocchio: A Blockchain-Based Algorithm for Sensor Fault Tolerance in Low Trust Environment. 416-425 - Daniel Nobre Pinheiro, Samuel Xavier de Souza, Daniel Aloise:
Scaling Optimizations for Large-Scale Distributed Data with Lightweight Coresets. 426-429
AsHES: Accelerators and Hybrid Exascale Systems
- Min Si, Lena Oden, Simon Garcia De Gonzalo:
Workshop 8: AsHES Accelerators and Hybrid Exascale Systems. 430 - Taisuke Boku:
AsHES 2020 Keynote Speaker (5:30 pm CDT). 431 - Zheming Jin, Hal Finkel:
Population Count on Intel® CPU, GPU and FPGA. 432-439 - Seher Acer, Erik G. Boman, Sivasankaran Rajamanickam:
SPHYNX: Spectral Partitioning for HYbrid aNd aXelerator-enabled systems. 440-449 - Norihisa Fujita, Ryohei Kobayashi, Yoshiki Yamaguchi, Tomohiro Ueno, Kentaro Sano, Taisuke Boku:
Performance Evaluation of Pipelined Communication Combined with Computation in OpenCL Programming on FPGA. 450-459 - Jacob Lambert, Seyong Lee, Jeffrey S. Vetter, Allen D. Malony:
In-Depth Optimization with the OpenACC-to-FPGA Framework on an Arria 10 FPGA. 460-470 - Matthias Diener, Laxmikant V. Kalé:
Unified data movement for offloading Charm++ applications. 471-474 - John W. Lawson:
Towards automated kernel selection in machine learning systems: A SYCL case study. 475-478 - Federico Favaro, Juan P. Oliver, Ernesto Dufrechou, Pablo Ezzatti:
Understanding the Performance of Elementary NLA Kernels in FPGAs. 479-482 - Brian A. Page, Peter M. Kogge:
Scalability of Sparse Matrix Dense Vector Multiply (SpMV) on a Migrating Thread Architecture. 483-488
PDCO: Parallel / Distributed Combinatorics and Optimization
- Grégoire Danoy, Didier El Baz, Vincent Boyer, Bernabé Dorronsoro, Laurence T. Yang, Keqin Li:
Workshop 9: PDCO Parallel / Distributed Combinatorics and Optimization. 489 - Roger L. Goodwin:
Load Balancing Run-Times and Space Usage for Computing the Power Set. 490-501 - Thomas Charest, Robert C. Green II:
Implementing Central Force optimization on the Intel Xeon Phi. 502-511 - Emiliano Pérez, Sergio Nesmachnow, Jamal Toutouh, Erik Hemberg, Una-May O'Reilly:
Parallel/distributed implementation of cellular training for generative adversarial neural networks. 512-518 - Abdoul Wahid Mainassara Chekaraou, Xavier Besseron, Alban Rousset, Emmanuel Kieffer, Bernhard Peters:
Predicting near-optimal skin distance in Verlet buffer approach for Discrete Element Method. 519-527 - Daniel H. Stolfi, Matthias R. Brust, Grégoire Danoy, Pascal Bouvry:
Competitive Evolution of a UAV Swarm for Improving Intruder Detection Rates. 528-535
APDCM: Advances in Parallel and Distributed Computational Models
- Jacir Luiz Bordim, Koji Nakano, Susumu Matsumae, Masahiro Shibata:
Workshop 10: APDCM Advances in Parallel and Distributed Computational Models. 536-537 - Henry Zhu, Nik Sultana, Boon Thau Loo:
Debugging strongly-compartmentalized distributed systems. 538-547 - Hiroki Kataoka, Kohei Yamashita, Yasuaki Ito, Koji Nakano, Akihiko Kasagi, Tsuguchika Tabaru:
An Efficient Multicore CPU Implementation for Convolution-Pooling Computation in CNNs. 548-556 - Masaki Tao, Koji Nakano, Yasuaki Ito, Ryota Yasudo, Masaru Tatekawa, Ryota Katsuki, Takashi Yazane, Yoko Inaba:
A Work-Time Optimal Parallel Exhaustive Search Algorithm for the QUBO and the Ising model, with GPU implementation. 557-566 - Anne Benoit, Valentin Le Fèvre, Padma Raghavan, Yves Robert, Hongyang Sun:
Design and Comparison of Resilient Scheduling Heuristics for Parallel Jobs. 567-576 - Martti Forsell, Jussi Roivainen, Jesper Larsson Träff:
Optimizing Memory Access in TCF Processors with Compute-Update Operations. 577-586 - Yi Zhou, Yangyang Liu, Chaowei Zhang, Xiaopu Peng, Xiao Qin:
TOSS: A Topology-based Scheduler for Storm Clusters. 587-596 - Gabriel Bathie, Loris Marchal, Yves Robert, Samuel Thibault:
Revisiting dynamic DAG scheduling under memory constraints for shared-memory platforms. 597-606 - Gokarna Sharma, Ramachandran Vaidyanathan, Jerry L. Trahan:
Optimal Randomized Complete Visibility on a Grid for Asynchronous Robots with Lights. 607-616 - Chung-Hsing Hsu, Neena Imam, Akhil Langer, Sreeram Potluri, Chris J. Newburn:
An Initial Assessment of NVSHMEM for High Performance Computing. 617-626 - Hideharu Kojima, Naoto Yanai:
A Model Checking Method for Secure Routing Protocols by SPIN with State Space Reduction. 627-635 - André Luckow, Shantenu Jha:
Methods and Experiences for Developing Abstractions for Data-intensive, Scientific Applications. 636-645
JSSPP 2020 - 23rd Workshop on Job Scheduling Strategies for Parallel Processing
- Dalibor Klusácek, Walfredo Cirne, Narayan Desai:
JSSPP 2020 - 23rd Workshop on Job Scheduling Strategies for Parallel Processing. 646-647
CHIUW: Chapel Implementers and Users Workshop
- Benjamin Robbins:
CHIUW 2020 The Seventh Annual Chapel Implementers and Users Workshop. 648-649 - William Reus:
CHIUW 2020 Keynote Arkouda: Chapel-Powered, Interactive Supercomputing for Data Science. 650 - Matthieu Parenteau, Simon Bourgault-Cote, Frederic Plante, Eric Laurendeau:
Development of Parallel CFD Applications on Distributed Memory with Chapel. 651-658 - Garvit Dewan, Louis Jenkins:
Paving the way for Distributed Non-Blocking Algorithms and Data Structures in the Partitioned Global Address Space model. 659-666 - Jesun Sahariar Firoz, Louis Jenkins, Cliff A. Joslyn, Brenda Praggastis, Emilie Purvine, Mark Raugas:
Computing Hypergraph Homology in Chapel. 667-670 - Engin Kayraklioglu, Tarek A. El-Ghazawi:
An Automated Machine Learning Approach for Data Locality Optimizations in Chapel. 671 - Richard F. Barrett, Jeanine E. Cook, Stephen L. Olivier, Omar Aaziz, Christopher D. Jenkins, Courtenay T. Vaughan:
Exploring Chapel Productivity Using Some Graph Algorithms. 672 - Lydia Duncan:
Visibility Control: Use and Import Statement Improvements. 673 - Michael P. Ferguson:
Towards Stability in the Chapel Language. 674 - Akihiro Hayashi, Sri Raj Paul, Vivek Sarkar:
Exploring a multi-resolution GPU programming model for Chapel. 675 - Benjamin Albrecht:
Random Forests in Chapel. 676 - Elliot Ronaghan:
Squeezing performance out of Arkouda. 677 - Nikhil Padmanabhan, Elliot Ronaghan, J. Luna Zagorac, Richard Easther:
Simulating Ultralight Dark Matter in Chapel. 678 - Rahul Ghangas, Josh Milthorpe:
Chapel on Accelerators. 679
PDSEC: Parallel and Distributed Scientific and Engineering Computing
- Raphaël Couturier, Peter Strazdins, Eric Aubanel, Sabine Roller, Laurence T. Yang, Thomas Rauber, Gudula Rünger:
Workshop 13: PDSEC Parallel and Distributed Scientific and Engineering Computing. 680-681 - Manvi Saxena, Shweta Jha, Saba Khan, John Rodgers, Peggy Lindner, Edgar Gabriel:
Comparison of MPI and Spark for Data Science Applications. 682-690 - Emmanuel Jeannot, Richard Sartori:
Improving MPI Application Communication Time with an Introspection Monitoring Library. 691-700 - Nathan Vaughn, Leighton Wilson, Robert Krasny:
A GPU-Accelerated Barycentric Lagrange Treecode. 701-710 - Jean-Matthieu Gallard, Leonhard Rannabauer, Anne Reinarz, Michael Bader:
Vectorization and Minimization of Memory Footprint for Linear High-Order Discontinuous Galerkin Schemes. 711-720 - Yu Pei, Qinglei Cao, George Bosilca, Piotr Luszczek, Victor Eijkhout, Jack J. Dongarra:
Communication Avoiding 2D Stencil Implementations over PaRSEC Task-Based Runtime. 721-729 - Ming Li, Peter J. Hawrylak, John Hale:
Implementing an Attack Graph Generator in CUDA. 730-738 - Huda Alrammah, Yi Gu, Zhifeng Liu:
Tri-Objective Workflow Scheduling and Optimization in Heterogeneous Cloud Environments. 739-748 - Nicholas Chaimov, Sameer Shende, Allen D. Malony, Neena Imam:
Identifying Optimization Opportunities Using Memory Access Tracing in OpenSHMEM Runtimes with the TAU Performance System. 749-756 - Rocío Carratalá-Sáez, Mathieu Faverge, Grégoire Pichon, Guillaume Sylvand, Enrique S. Quintana-Ortí:
Tiled Algorithms for Efficient Task-Parallel ℌ-Matrix Solvers. 757-766
iWAPT: Automatic Performance Tuning
- I-Hsin Chung, Kazuhiko Komatsu:
Workshop 14: iWAPT Automatic Performance Tuning. 767-768 - Mayuko Koezuka, Yusuke Shirota, Satoshi Shirai, Tatsunori Kanai:
Machine Learning-Based Prefetching for SCM Main Memory System. 769-776 - Amir Haderbache, Koichi Shirahata, Takuji Yamamoto, Yasumoto Tomita, Hiroshi Okuda:
Acceleration of Structural Analysis Simulations using CNN-based Auto-Tuning of Solver Tolerance. 777-786 - Wenju Zhou, Jiepeng Zhang, Jingwei Sun, Guangzhong Sun:
Using Small-Scale History Data to Predict Large-Scale Performance of HPC Application. 787-795 - Carl Pearson, Mert Hidayetoglu, Mohammad Almasri, Omer Anjum, I-Hsin Chung, Jinjun Xiong, Wen-Mei W. Hwu:
Node-Aware Stencil Communication for Heterogeneous Supercomputers. 796-805 - Suhang Jiang, Mulya Agung, Ryusuke Egawa, Hiroyuki Takizawa:
Task Priority Control for the HPX Runtime System. 806-813 - Ayse Bagbaba:
Improving Collective I/O Performance with Machine Learning Supported Auto-tuning. 814-821 - Naoki Ebata, Ryusuke Egawa, Yoko Isobe, Ryoji Takaki, Hiroyuki Takizawa:
Automatically Avoiding Memory Access Conflicts on SX-Aurora TSUBASA. 822-829 - Takumi Kishitani, Kazuhiko Komatsu, Masayuki Sato, Akihiro Musa, Hiroaki Kobayashi:
Importance of Selecting Data Layouts in the Tsunami Simulation Code. 830-837
MPP: Parallel Programming Models - Special Edition Machine Learning Performance and Security
- Leandro A. J. Marzulo, Tiago A. O. Alves, Cristiana Bentes, Gabriele Mencagli:
Workshop 15: MPP Parallel Programming Models - Special Edition Machine Learning Performance and Security. 838-839 - Taha Soliman, Armin Runge, Leonardo Ecco:
Enhancing the Utilization of Dot-Product Engines in Deep Learning Accelerators. 840-843 - Guilherme C. De Lello, Juliano F. Caldeira, Mauricio Aredes, Felipe M. G. França, Priscila M. V. Lima:
Weightless Neural Networks Applied to Nonintrusive Load Monitoring. 844-851 - Robert Schmid, Bjarne Pfitzner, Jossekin Beilharz, Bert Arnrich, Andreas Polze:
Tangle Ledger for Decentralized Learning. 852-859 - Raphael N. C. B. Rocha, Leopoldo Lusquino Filho, Mauricio Aredes, Felipe M. G. França, Priscila M. V. Lima:
Regression WiSARD application of controller on DC STATCOM converter under fault conditions. 860-867
SNACS: Scalable Networks for Advanced Computing Systems
- Ilkay Altintas, Dorian C. Arnold, Martin Schulz, Matthew G. F. Dosanjh, Ryan E. Grant, Taylor L. Groves:
Workshop 16: SNACS Scalable Networks for Advanced Computing Systems. 868 - Amit Ruhela, Shulei Xu, Karthik Vadambacheri Manian, Hari Subramoni, Dhabaleswar K. Panda:
Analyzing and Understanding the Impact of Interconnect Performance on HPC, Big Data, and Deep Learning Applications: A Case Study with InfiniBand EDR and HDR. 869-878 - Scott Levy, Patrick M. Widener, Craig D. Ulmer, Todd Kordenbrock:
The Case for Explicit Reuse Semantics for RDMA Communication. 879-888 - Victor Eijkhout:
Performance of MPI Sends of Non-Contiguous Data. 889-895 - Kaushik Kandadi Suresh, Bharath Ramesh, Seyedeh Mahdieh Ghazimirsaeed, Mohammadreza Bayatpour, Jahanzeb Maqbool Hashmi, Hari Subramoni, Dhabaleswar K. Panda:
Performance Characterization of Network Mechanisms for Non-Contiguous Data Transfers in MPI. 896-905
PAISE: Parallel AI and Systems for the Edge
- Peter H. Beckman, Rajesh Sankaran:
Workshop 17: PAISE Parallel AI and Systems for the Edge. 906-907 - Zheming Jin, Hal Finkel:
Analyzing Deep Learning Model Inferences for Image Classification using OpenVINO. 908-911 - Mohit Kumar, Xingzhou Zhang, Liangkai Liu, Yifan Wang, Weisong Shi:
Energy-Efficient Machine Learning on the Edges. 912-921 - Marat Dukhan:
Indirect Deconvolution Algorithm. 922-926 - Luke Jacobs, Akhil Kodumuri, Jim James, Seongha Park, Yongho Kim:
Multiperspective Automotive Labeling. 927-936 - Syed Badruddoja, Ram Dantu, Logan Widick, Zachary Zaccagni, Kritagya Upadhyay:
Integrating DOTS With Blockchain Can Secure Massive IoT Sensors. 937-946
RADR: Resource Arbitration for Dynamic Runtimes
- Peter H. Beckman, Emmanuel Jeannot, Swann Perarnau:
Workshop on Resource Arbitration for Dynamic Runtimes (RADR). 947-949 - Jirí Dokulil, Siegfried Benkner:
NUMA-aware CPU core allocation in cooperating dynamic applications. 950-957 - Cassandra Rocha Barbosa, Pierre Lemarinier, Marc Sergent, Guillaume Papauré, Marc Pérache:
Overlapping MPI communications with Intel TBB computation. 958-966 - Florian Schmaus, Sebastian Maier, Tobias Langer, Jonas Rabenstein, Timo Hönig, Wolfgang Schröder-Preikschat, Lars Bauer, Jörg Henkel:
System Software for Resource Arbitration on Future Many-* Architectures. 967-975 - Atsushi Hori, Balazs Gerofi, Yutaka Ishikawa:
An Implementation of User-Level Processes using Address Space Sharing. 976-984
ScaDL: Scalable Deep Learning over Parallel and Distributed Infrastructures
- Ashish Verma, Christopher D. Carothers, K. R. Jayaram, Parijat Dube:
Workshop 19: ScaDL Scalable Deep Learning over Parallel and Distributed Infrastructures. 985-986 - Manish Gupta:
"A Stitch in Time": A Grand Challenge for Distributed Machine Learning. 987 - Geoffrey C. Fox:
High Performance Computing: From Deep Learning to Data Engineering. 988 - Wen-Mei Hwu:
Advancing Computing Infrastructure for Very Large-Scale Deep Learning at C3SR. 989 - Minsik Cho:
Scalable Deep Learning Inference: Algorithmic Approach. 990 - Pankaj Rajak, Kuang Liu, Aravind Krishnamoorthy, Rajiv K. Kalia, Aiichiro Nakano, Ken-ichi Nomura, Subodh C. Tiwari, Priya Vashishta:
Neural Network Molecular Dynamics at Scale. 991-994 - Florent Lopez, Edmond Chow, Stanimire Tomov, Jack J. Dongarra:
Asynchronous SGD for DNN training on Shared-memory Parallel Architectures. 995-998 - Saritha Vinod, M. Naveen, Asis K. Patra, Anto Ajay Raj John:
Accelerating Towards Larger Deep Learning Models and Datasets - A System Platform View Point. 999-1005 - Naw Safrin Sattar, Shaikh Anfuzzaman:
Data Parallel Large Sparse Deep Neural Network on GPU. 1006-1014 - Quentin Anthony, Ammar Ahmad Awan, Arpan Jain, Hari Subramoni, Dhabaleswar K. Panda:
Efficient Training of Semantic Image Segmentation on Summit using Horovod and MVAPICH2-GDR. 1015-1023
HPS: High-Performance Storage
- Kathryn M. Mohror, Marc Snir:
First IEEE International Workshop on High-Performance Storage (HPS). 1024-1026 - Francois Tessier, Maxime Martinasso, Matteo Chesi, Mark Klein, Miguel Gila:
Dynamic Provisioning of Storage Resources: A Case Study with Burst Buffers. 1027-1035 - Tonmoy Dey, Kento Sato, Bogdan Nicolae, Jian Guo, Jens Domke, Weikuan Yu, Franck Cappello, Kathryn M. Mohror:
Optimizing Asynchronous Multi-Level Checkpoint/Restart Configurations with Machine Learning. 1036-1043 - Raafat Feki, Edgar Gabriel:
On Overlapping Communication and File I/O in Collective Write Operation. 1044-1051 - Chen Wang, Jinghan Sun, Marc Snir, Kathryn M. Mohror, Elsa Gonsiorowski:
Recorder 2.0: Efficient Parallel I/O Tracing and Analysis. 1052-1059 - Qingyue Liu, Peter J. Varman:
Silent Data Access Protocol for NVRAM + RDMA Distributed Storage. 1060-1069 - Tanaya Roy, Krishna Kant:
Enhancing Endurance of SSD Based high-performance Storage Systems using Emerging NVM Technologies. 1070-1079 - Kohei Sugihara, Osamu Tatebe:
Design of Locality-aware MPI-IO for Scalable Shared File Write Performance. 1080-1089
ParSocial: Parallel and Distributed Processing for Computational Social Systems
- Jack Garbus, Christopher Brissette, George M. Slota:
Parallel Generation of Simple Null Graph Models. 1091-1100 - Konstantin Pogorelov, Daniel Thilo Schroeder, Petra Filkuková, Johannes Langguth:
A System for High Performance Mining on GDELT Data. 1101-1111 - George M. Slota, Jack Garbus:
A Parallel LFR-like Benchmark for Evaluating Community Detection Algorithms. 1112-1115 - Bhavani Thuraisingham:
The Role of Artificial Intelligence and Cyber Security for Social Media. 1116-1118 - Joseph Kready, Shishila Awung Shimray, Muhammad Nihal Hussain, Nitin Agarwal:
YouTube Data Collection Using Parallel Processing. 1119-1122 - Eunice E. Santos, Vairavan Murugappan, John Korah:
New Approaches for Performance Optimization and Analysis of Large-Scale Dynamic Social Network Analysis using Anytime Anywhere Algorithms. 1123-1128