Jens Domke
2020 – today
- 2024
- [c27] Ivan R. Ivanov, Oleksandr Zinenko, Jens Domke, Toshio Endo, William S. Moses:
Retargeting and Respecializing GPU Workloads for Performance Portability. CGO 2024: 119-132
- [c26] Ivan R. Ivanov, Jens Domke, Toshio Endo, Johannes Doerfert:
Automatic Parallelization and OpenMP Offloading of Fortran Array Notation. IWOMP 2024: 197-209
- [c25] Nils Blach, Maciej Besta, Daniele De Sensi, Jens Domke, Hussein Harake, Shigang Li, Patrick Iff, Marek Konieczny, Kartik Lakhotia, Ales Kubicek, Marcel Ferrari, Fabrizio Petrini, Torsten Hoefler:
A High-Performance Design, Implementation, Deployment, and Evaluation of The Slim Fly Network. NSDI 2024
- [c24] Semih Burak, Ivan R. Ivanov, Jens Domke, Matthias S. Müller:
SPMD IR: Unifying SPMD and Multi-value IR Showcased for Static Verification of Collectives. EuroMPI 2024: 3-20
- [c23] Wei-Chen Lin, Jens Domke:
Benchmarking in the Datacenter (BID): Expanding to the Cloud. ICPE (Companion) 2024: 94
- 2023
- [j6] Satoshi Matsuoka, Jens Domke, Mohamed Wahib, Aleksandr Drozd, Torsten Hoefler:
Myths and legends in high-performance computing. Int. J. High Perform. Comput. Appl. 37(3-4): 245-259 (2023)
- [j5] Jens Domke, Emil Vatai, Balazs Gerofi, Yuetsu Kodama, Mohamed Wahib, Artur Podobas, Sparsh Mittal, Miquel Pericàs, Lingqi Zhang, Peng Chen, Aleksandr Drozd, Satoshi Matsuoka:
At the Locus of Performance: Quantifying the Effects of Copious 3D-Stacked Cache on HPC Workloads. ACM Trans. Archit. Code Optim. 20(4): 57:1-57:26 (2023)
- [c22] William S. Moses, Ivan R. Ivanov, Jens Domke, Toshio Endo, Johannes Doerfert, Oleksandr Zinenko:
High-Performance GPU-to-CPU Transpilation and Optimization via High-Level Parallel Constructs. PPoPP 2023: 119-134
- [c21] Olga Pearce, Alec Scott, Gregory Becker, Riyaz Haque, Nathan Hanford, Stephanie Brink, Doug Jacobsen, Heidi Poxon, Jens Domke, Todd Gamblin:
Towards Collaborative Continuous Benchmarking for HPC. SC Workshops 2023: 627-635
- [c20] Francesco Antici, Keiji Yamamoto, Jens Domke, Zeynep Kiziltan:
Augmenting ML-based Predictive Modelling with NLP to Forecast a Job's Power Consumption. SC Workshops 2023: 1820-1830
- [i12] Satoshi Matsuoka, Jens Domke, Mohamed Wahib, Aleksandr Drozd, Torsten Hoefler:
Myths and Legends in High-Performance Computing. CoRR abs/2301.02432 (2023)
- [i11] Nils Blach, Maciej Besta, Daniele De Sensi, Jens Domke, Hussein Harake, Shigang Li, Patrick Iff, Marek Konieczny, Kartik Lakhotia, Ales Kubicek, Marcel Ferrari, Fabrizio Petrini, Torsten Hoefler:
A High-Performance Design, Implementation, Deployment, and Evaluation of The Slim Fly Network. CoRR abs/2310.03742 (2023)
- 2022
- [j4] Satoshi Matsuoka, Jens Domke, Mohamed Wahib, Aleksandr Drozd, Andrew A. Chien, Raymond Bair, Jeffrey S. Vetter, John Shalf:
Preparing for the Future - Rethinking Proxy Applications. Comput. Sci. Eng. 24(2): 85-90 (2022)
- [c19] Truong Thao Nguyen, François Trahay, Jens Domke, Aleksandr Drozd, Emil Vatai, Jianwei Liao, Mohamed Wahib, Balazs Gerofi:
Why Globally Re-shuffle? Revisiting Data Shuffling in Large Scale Deep Learning. IPDPS 2022: 1085-1096
- [i10] Jens Domke, Emil Vatai, Balazs Gerofi, Yuetsu Kodama, Mohamed Wahib, Artur Podobas, Sparsh Mittal, Miquel Pericàs, Lingqi Zhang, Peng Chen, Aleksandr Drozd, Satoshi Matsuoka:
At the Locus of Performance: A Case Study in Enhancing CPUs with Copious 3D-Stacked Cache. CoRR abs/2204.02235 (2022)
- [i9] Satoshi Matsuoka, Jens Domke, Mohamed Wahib, Aleksandr Drozd, Ray Bair, Andrew A. Chien, Jeffrey S. Vetter, John Shalf:
Preparing for the Future - Rethinking Proxy Apps. CoRR abs/2204.07336 (2022)
- [i8] William S. Moses, Ivan R. Ivanov, Jens Domke, Toshio Endo, Johannes Doerfert, Oleksandr Zinenko:
High-Performance GPU-to-CPU Transpilation and Optimization via High-Level Parallel Constructs. CoRR abs/2207.00257 (2022)
- 2021
- [j3] Maciej Besta, Jens Domke, Marcel Schneider, Marek Konieczny, Salvatore Di Girolamo, Timo Schneider, Ankit Singla, Torsten Hoefler:
High-Performance Routing With Multipathing and Path Diversity in Ethernet and HPC Networks. IEEE Trans. Parallel Distributed Syst. 32(4): 943-959 (2021)
- [c18] Jens Domke:
A64FX - Your Compiler You Must Decide! CLUSTER 2021: 736-740
- [c17] Jens Domke, Emil Vatai, Aleksandr Drozd, Peng Chen, Yosuke Oyama, Lingqi Zhang, Shweta Salaria, Daichi Mukunoki, Artur Podobas, Mohamed Wahib, Satoshi Matsuoka:
Matrix Engines for High Performance Computing: A Paragon of Performance or Grasping at Straws? IPDPS 2021: 1056-1065
- [c16] Steven Farrell, Murali Emani, Jacob Balma, Lukas Drescher, Aleksandr Drozd, Andreas Fink, Geoffrey C. Fox, David Kanter, Thorsten Kurth, Peter Mattson, Dawei Mu, Amit Ruhela, Kento Sato, Koichi Shirahata, Tsuguchika Tabaru, Aristeidis Tsaris, Jan Balewski, Ben Cumming, Takumi Danjo, Jens Domke, Takaaki Fukai, Naoto Fukumoto, Tatsuya Fukushi, Balazs Gerofi, Takumi Honda, Toshiyuki Imamura, Akihiko Kasagi, Kentaro Kawakami, Shuhei Kudo, Akiyoshi Kuroda, Maxime Martinasso, Satoshi Matsuoka, Henrique Mendonça, Kazuki Minami, Prabhat Ram, Takashi Sawada, Mallikarjun Shankar, Tom St. John, Akihiro Tabuchi, Venkatram Vishwanath, Mohamed Wahib, Masafumi Yamazaki, Junqi Yin:
MLPerf™ HPC: A Holistic Benchmark Suite for Scientific Machine Learning on HPC Systems. MLHPC@SC 2021: 33-45
- [i7] Jens Domke:
A64FX - Your Compiler You Must Decide! CoRR abs/2107.07157 (2021)
- [i6] Steven Farrell, Murali Emani, Jacob Balma, Lukas Drescher, Aleksandr Drozd, Andreas Fink, Geoffrey C. Fox, David Kanter, Thorsten Kurth, Peter Mattson, Dawei Mu, Amit Ruhela, Kento Sato, Koichi Shirahata, Tsuguchika Tabaru, Aristeidis Tsaris, Jan Balewski, Ben Cumming, Takumi Danjo, Jens Domke, Takaaki Fukai, Naoto Fukumoto, Tatsuya Fukushi, Balazs Gerofi, Takumi Honda, Toshiyuki Imamura, Akihiko Kasagi, Kentaro Kawakami, Shuhei Kudo, Akiyoshi Kuroda, Maxime Martinasso, Satoshi Matsuoka, Henrique Mendonça, Kazuki Minami, Prabhat Ram, Takashi Sawada, Mallikarjun Shankar, Tom St. John, Akihiro Tabuchi, Venkatram Vishwanath, Mohamed Wahib, Masafumi Yamazaki, Junqi Yin:
MLPerf HPC: A Holistic Benchmark Suite for Scientific Machine Learning on HPC Systems. CoRR abs/2110.11466 (2021)
- 2020
- [c15] Tonmoy Dey, Kento Sato, Bogdan Nicolae, Jian Guo, Jens Domke, Weikuan Yu, Franck Cappello, Kathryn M. Mohror:
Optimizing Asynchronous Multi-Level Checkpoint/Restart Configurations with Machine Learning. IPDPS Workshops 2020: 1036-1043
- [c14] Mohamed Wahib, Haoyu Zhang, Truong Thao Nguyen, Aleksandr Drozd, Jens Domke, Lingqi Zhang, Ryousei Takano, Satoshi Matsuoka:
Scaling distributed deep learning workloads beyond the memory capacity with KARMA. SC 2020: 19
- [i5] Roman Iakymchuk, Daichi Mukunoki, Artur Podobas, Fabienne Jézéquel, Toshiyuki Imamura, Norihisa Fujita, Jens Huthmann, Shuhei Kudo, Yiyu Tan, Jens Domke, Kai Torben Ohlhus, Takeshi Fukaya, Takeo Hoshi, Yuki Murakami, Maho Nakata, Takeshi Ogita, Kentaro Sano, Taisuke Boku:
White Paper from Workshop on Large-scale Parallel Numerical Computing Technology (LSPANC 2020): HPC and Computer Arithmetic toward Minimal-Precision Computing. CoRR abs/2004.04628 (2020)
- [i4] Maciej Besta, Jens Domke, Marcel Schneider, Marek Konieczny, Salvatore Di Girolamo, Timo Schneider, Ankit Singla, Torsten Hoefler:
High-Performance Routing with Multipathing and Path Diversity in Supercomputers and Data Centers. CoRR abs/2007.03776 (2020)
- [i3] Mohamed Wahib, Haoyu Zhang, Truong Thao Nguyen, Aleksandr Drozd, Jens Domke, Lingqi Zhang, Ryousei Takano, Satoshi Matsuoka:
Scaling Distributed Deep Learning Workloads beyond the Memory Capacity with KARMA. CoRR abs/2008.11421 (2020)
- [i2] Jens Domke, Emil Vatai, Aleksandr Drozd, Peng Chen, Yosuke Oyama, Lingqi Zhang, Shweta Salaria, Daichi Mukunoki, Artur Podobas, Mohamed Wahib, Satoshi Matsuoka:
Matrix Engines for High Performance Computing: A Paragon of Performance or Grasping at Straws? CoRR abs/2010.14373 (2020)
2010 – 2019
- 2019
- [c13] Jens Domke, Satoshi Matsuoka, Ivan Radanov, Yuki Tsushima, Tomoya Yuki, Akihiro Nomura, Shin'ichi Miura, Nic McDonald, Dennis Lee Floyd, Nicolas Dubé:
The First Supercomputer with HyperX Topology: A Viable Alternative to Fat-Trees? Hot Interconnects 2019: 1-4
- [c12] Jens Domke, Kazuaki Matsumura, Mohamed Wahib, Haoyu Zhang, Keita Yashima, Toshiki Tsuchikawa, Yohei Tsuji, Artur Podobas, Satoshi Matsuoka:
Double-Precision FPUs in High-Performance Computing: An Embarrassment of Riches? IPDPS 2019: 78-88
- [c11] Jens Domke, Satoshi Matsuoka, Ivan R. Ivanov, Yuki Tsushima, Tomoya Yuki, Akihiro Nomura, Shin'ichi Miura, Nic McDonald, Dennis Lee Floyd, Nicolas Dubé:
HyperX topology: first at-scale implementation and comparison to the fat-tree. SC 2019: 40:1-40:23
- 2018
- [j2] Harsh Bhatia, Nikhil Jain, Abhinav Bhatele, Yarden Livnat, Jens Domke, Valerio Pascucci, Peer-Timo Bremer:
Interactive Investigation of Traffic Congestion on Fat-Tree Networks Using TreeScope. Comput. Graph. Forum 37(3): 561-572 (2018)
- [c10] Staci A. Smith, Clara E. Cromey, David K. Lowenthal, Jens Domke, Nikhil Jain, Jayaraman J. Thiagarajan, Abhinav Bhatele:
Mitigating inter-job interference using adaptive flow-aware routing. SC 2018: 27:1-27:15
- [i1] Jens Domke, Kazuaki Matsumura, Mohamed Wahib, Haoyu Zhang, Keita Yashima, Toshiki Tsuchikawa, Yohei Tsuji, Artur Podobas, Satoshi Matsuoka:
Double-precision FPUs in High-Performance Computing: an Embarrassment of Riches? CoRR abs/1810.09330 (2018)
- 2017
- [b1] Jens Domke:
Routing on the Channel Dependency Graph: A New Approach to Deadlock-Free, Destination-Based, High-Performance Routing for Lossless Interconnection Networks. Dresden University of Technology, Germany, 2017
- [c9] Noah Wolfe, Misbah Mubarak, Nikhil Jain, Jens Domke, Abhinav Bhatele, Christopher D. Carothers, Robert B. Ross:
Preliminary Performance Analysis of Multi-rail Fat-tree Networks. CCGrid 2017: 258-261
- [c8] Misbah Mubarak, Nikhil Jain, Jens Domke, Noah Wolfe, Caitlin Ross, Jianping Kelvin Li, Abhinav Bhatele, Christopher D. Carothers, Kwan-Liu Ma, Robert B. Ross:
Toward reliable validation of HPC network simulation models. WSC 2017: 659-674
- 2016
- [j1] Dali Wang, Jens Domke, Jiafu Mao, Xiaoying Shi, Daniel M. Ricciuto:
A scalable framework for the global offline community land model ensemble simulation. Int. J. Comput. Sci. Eng. 12(1): 73-85 (2016)
- [c7] Jens Domke, Torsten Hoefler, Satoshi Matsuoka:
Routing on the Dependency Graph: A New Approach to Deadlock-Free High-Performance Routing. HPDC 2016: 3-14
- [c6] Jens Domke, Torsten Hoefler:
Scheduling-aware routing for supercomputers. SC 2016: 142-153
- 2015
- [c5] Kevin A. Brown, Jens Domke, Satoshi Matsuoka:
Hardware-Centric Analysis of Network Performance for MPI Applications. ICPADS 2015: 692-699
- 2014
- [c4] Kevin A. Brown, Jens Domke, Satoshi Matsuoka:
Tracing Data Movements within MPI Collectives. EuroMPI/ASIA 2014: 117
- [c3] Jens Domke, Torsten Hoefler, Satoshi Matsuoka:
Fail-in-Place Network Design: Interaction Between Topology, Routing Algorithm and Failures. SC 2014: 597-608
- 2012
- [c2] Jens Domke, Dali Wang:
Runtime Tracing of the Community Earth System Model: Feasibility Study and Benefits. ICCS 2012: 1950-1958
- 2011
- [c1] Jens Domke, Torsten Hoefler, Wolfgang E. Nagel:
Deadlock-Free Oblivious Routing for Arbitrary Topologies. IPDPS 2011: 616-627
last updated on 2024-10-23 21:28 CEST by the dblp team
all metadata released as open data under CC0 1.0 license