default search action
Ahmad Afsahi
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c53]Hamed Sharifian, Amir Hossein Sojoodi, Ahmad Afsahi:
A Topology- and Load-Aware Design for Neighborhood Allgather. CLUSTER 2024: 131-142 - [c52]Yiltan Hassan Temuçin, Mahdieh Gazimirsaeed, Ryan E. Grant, Ahmad Afsahi:
ROCm-Aware Leader-based Designs for MPI Neighbourhood Collectives. ISC 2024: 1-12 - 2023
- [c51]Yiltan Hassan Temuçin, Scott Levy, Whit Schonbein, Ryan E. Grant, Ahmad Afsahi:
A Dynamic Network-Native MPI Partitioned Aggregation Over InfiniBand Verbs. CLUSTER 2023: 259-270 - 2022
- [j17]Yiltan Hassan Temuçin, Amir Hossein Sojoodi, Pedram Alizadeh, Benjamin Kitor, Ahmad Afsahi:
Accelerating Deep Learning Using Interconnect-Aware UCX Communication for MPI Collectives. IEEE Micro 42(2): 68-76 (2022) - [c50]Yiltan Hassan Temuçin, Ryan E. Grant, Ahmad Afsahi:
Micro-Benchmarking MPI Partitioned Point-to-Point Communication. ICPP 2022: 64:1-64:12 - [c49]Pedram Alizadeh, Amir Hossein Sojoodi, Yiltan Hassan Temuçin, Ahmad Afsahi:
Efficient Process Arrival Pattern Aware Collective Communication for Deep Learning. EuroMPI 2022: 68-78 - 2021
- [c48]Yiltan Hassan Temuçin, Amir Hossein Sojoodi, Pedram Alizadeh, Ahmad Afsahi:
Efficient Multi-Path NVLink/PCIe-Aware UCX based Collective Communication for Deep Learning. HOTI 2021: 25-34 - 2020
- [j16]S. Mahdieh Ghazimirsaeed, Seyed Hessam Mirsadeghi, Ahmad Afsahi:
Communication-aware message matching in MPI. Concurr. Comput. Pract. Exp. 32(3) (2020)
2010 – 2019
- 2019
- [j15]S. Mahdieh Ghazimirsaeed, Ryan E. Grant, Ahmad Afsahi:
A dynamic, unified design for dedicated message matching engines for collective and point-to-point communications. Parallel Comput. 89 (2019) - [c47]Matthew G. F. Dosanjh, Whit Schonbein, Ryan E. Grant, Patrick G. Bridges, S. Mahdieh Ghazimirsaeed, Ahmad Afsahi:
Fuzzy Matching: Hardware Accelerated MPI Communication Middleware. CCGRID 2019: 210-220 - [c46]S. Mahdieh Ghazimirsaeed, Seyed Hessam Mirsadeghi, Ahmad Afsahi:
An Efficient Collaborative Communication Mechanism for MPI Neighborhood Collectives. IPDPS 2019: 781-792 - 2018
- [j14]Iman Faraji, Ahmad Afsahi:
Design considerations for GPU-aware collective communications in MPI. Concurr. Comput. Pract. Exp. 30(17) (2018) - [c45]Matthew G. F. Dosanjh, S. Mahdieh Ghazimirsaeed, Ryan E. Grant, Whit Schonbein, Michael J. Levenhagen, Patrick G. Bridges, Ahmad Afsahi:
The Case for Semi-Permanent Cache Occupancy: Understanding the Impact of Data Locality on Network Processing. ICPP 2018: 73:1-73:11 - [c44]S. Mahdieh Ghazimirsaeed, Ryan E. Grant, Ahmad Afsahi:
A Dedicated Message Matching Mechanism for Collective Communications. ICPP Workshops 2018: 26:1-26:10 - 2017
- [j13]Iman Faraji, Seyed Hessam Mirsadeghi, Ahmad Afsahi:
Exploiting heterogeneity of communication channels for efficient GPU selection on multi-GPU nodes. Parallel Comput. 68: 3-16 (2017) - [c43]Seyed Hessam Mirsadeghi, Jesper Larsson Träff, Pavan Balaji, Ahmad Afsahi:
Exploiting Common Neighborhoods to Optimize MPI Neighborhood Collectives. HiPC 2017: 348-357 - 2016
- [c42]Seyed Hessam Mirsadeghi, Ahmad Afsahi:
PTRAM: A Parallel Topology-and Routing-Aware Mapping Framework for Large-Scale HPC Systems. IPDPS Workshops 2016: 386-396 - [c41]Iman Faraji, Seyed Hessam Mirsadeghi, Ahmad Afsahi:
Topology-Aware GPU Selection on Multi-GPU Nodes. IPDPS Workshops 2016: 712-720 - [c40]Seyed Hessam Mirsadeghi, Ahmad Afsahi:
Topology-Aware Rank Reordering for MPI Collectives. IPDPS Workshops 2016: 1759-1768 - [c39]Seyed Hessam Mirsadeghi, Iman Faraji, Ahmad Afsahi:
MAGC: A Mapping Approach for GPU Clusters. SBAC-PAD 2016: 50-58 - 2015
- [j12]Ryan E. Grant, Mohammad J. Rashti, Pavan Balaji, Ahmad Afsahi:
Scalable connectionless RDMA over unreliable datagrams. Parallel Comput. 48: 15-39 (2015) - [c38]Iman Faraji, Ahmad Afsahi:
Hyper-Q aware intranode MPI collectives on the GPU. ESPM@SC 2015: 47-50 - [r1]Ryan E. Grant, Mohammad J. Rashti, Pavan Balaji, Ahmad Afsahi:
Scalable Network Communication Using Unreliable RDMA. Handbook on Data Centers 2015: 393-424 - 2014
- [j11]Judicael A. Zounmevo, Ahmad Afsahi:
A fast and resource-conscious MPI message queue mechanism for large-scale jobs. Future Gener. Comput. Syst. 30: 265-290 (2014) - [j10]Judicael A. Zounmevo, Dries Kimpe, Robert B. Ross, Ahmad Afsahi:
Extreme-scale computing services over MPI: Experiences, observations and features proposal for next-generation message passing interface. Int. J. High Perform. Comput. Appl. 28(4): 435-449 (2014) - [c37]Judicael A. Zounmevo, Ahmad Afsahi:
Intra-Epoch Message Scheduling To Exploit Unused or Residual Overlapping Potential. EuroMPI/ASIA 2014: 13 - [c36]Iman Faraji, Ahmad Afsahi:
GPU-Aware Intranode MPI_Allreduce. EuroMPI/ASIA 2014: 45 - [c35]Judicael A. Zounmevo, Xin Zhao, Pavan Balaji, William Gropp, Ahmad Afsahi:
Nonblocking Epochs in MPI One-Sided Communication. SC 2014: 475-486 - 2013
- [c34]Xin Zhao, Darius Buntinas, Judicael A. Zounmevo, James Dinan, David Goodell, Pavan Balaji, Rajeev Thakur, Ahmad Afsahi, William Gropp:
Toward Asynchronous and MPI-Interoperable Active Messages. CCGRID 2013: 87-94 - [c33]Jérome Soumagne, Dries Kimpe, Judicael A. Zounmevo, Mohamad Chaarawi, Quincey Koziol, Ahmad Afsahi, Robert B. Ross:
Mercury: Enabling remote procedure call for high-performance computing. CLUSTER 2013: 1-8 - [c32]Judicael A. Zounmevo, Dries Kimpe, Robert B. Ross, Ahmad Afsahi:
Using MPI in high-performance computing services. EuroMPI 2013: 43-48 - 2012
- [c31]Grigori Inozemtsev, Ahmad Afsahi:
Designing an Offloaded Nonblocking MPI_Allgather Collective Using CORE-Direct. CLUSTER 2012: 477-485 - [c30]Reza Zamani, Ahmad Afsahi:
A study of hardware performance monitoring counter selection in power modeling of computing systems. IGCC 2012: 1-10 - [c29]Judicael A. Zounmevo, Ahmad Afsahi:
An Efficient MPI Message Queue Mechanism for Large-scale Jobs. ICPADS 2012: 464-471 - 2011
- [j9]Mohammad J. Rashti, Ahmad Afsahi:
Exploiting application buffer reuse to improve MPI small message transfer protocols over RDMA-enabled networks. Clust. Comput. 14(4): 345-356 (2011) - [j8]Ying Qian, Ahmad Afsahi:
Process Arrival Pattern Aware Alltoall and Allgather on InfiniBand Clusters. Int. J. Parallel Program. 39(4): 473-493 (2011) - [c28]Judicael A. Zounmevo, Ahmad Afsahi:
Investigating Scenario-Conscious Asynchronous Rendezvous over RDMA. CLUSTER 2011: 542-546 - [c27]Ryan E. Grant, Mohammad J. Rashti, Ahmad Afsahi, Pavan Balaji:
RDMA Capable iWARP over Datagrams. IPDPS 2011: 628-639 - [c26]Mohammad J. Rashti, Jonathan Green, Pavan Balaji, Ahmad Afsahi, William Gropp:
Multi-core and Network Aware MPI Topology Functions. EuroMPI 2011: 50-60 - 2010
- [j7]Reza Zamani, Ahmad Afsahi:
Adaptive estimation and prediction of power and performance in high performance computing. Comput. Sci. Res. Dev. 25(3-4): 177-186 (2010) - [c25]Mohammad J. Rashti, Ryan E. Grant, Ahmad Afsahi, Pavan Balaji:
iWARP redefined: Scalable connectionless communication over high-speed Ethernet. HiPC 2010: 1-10 - [c24]Ryan E. Grant, Pavan Balaji, Ahmad Afsahi:
A study of hardware assisted IP over InfiniBand and its impact on enterprise data center performance. ISPASS 2010: 144-153
2000 – 2009
- 2009
- [j6]Ryan E. Grant, Ahmad Afsahi:
Improving energy efficiency of asymmetric chip multithreaded multiprocessors through reduced OS noise scheduling. Concurr. Comput. Pract. Exp. 21(18): 2355-2376 (2009) - [j5]Mohammad J. Rashti, Ahmad Afsahi:
A Speculative and Adaptive MPI Rendezvous Protocol Over RDMA-enabled Interconnects. Int. J. Parallel Program. 37(2): 223-246 (2009) - [c23]Ryan E. Grant, Ahmad Afsahi, Pavan Balaji:
Evaluation of ConnectX Virtual Protocol Interconnect for Data Centers. ICPADS 2009: 57-64 - [c22]Mohammad J. Rashti, Ahmad Afsahi:
Improving RDMA-based MPI eager protocol for frequently-used buffers. IPDPS 2009: 1-8 - [c21]Ying Qian, Ahmad Afsahi:
Process Arrival Pattern and Shared Memory Aware Alltoall on InfiniBand. PVM/MPI 2009: 250-260 - 2008
- [j4]Ying Qian, Ahmad Afsahi:
Efficient shared memory and RDMA based collectives on multi-rail QsNetII SMP clusters. Clust. Comput. 11(4): 341-354 (2008) - [c20]Mohammad J. Rashti, Ahmad Afsahi:
Improving Communication Progress and Overlap in MPI Rendezvous Protocol over RDMA-enabled Interconnects. HPCS 2008: 95-101 - [c19]Ryan E. Grant, Mohammad J. Rashti, Ahmad Afsahi:
An Analysis of QoS Provisioning for Sockets Direct Protocol vs. IPoIB over Modern InfiniBand Networks. ICPP Workshops 2008: 79-86 - 2007
- [c18]Reza Zamani, Ahmad Afsahi, Ying Qian, V. Carl Hamacher:
A feasibility analysis of power-awareness and energy minimization in modern interconnects for high-performance computing. CLUSTER 2007: 118-128 - [c17]Ryan E. Grant, Ahmad Afsahi:
Improving system efficiency through scheduling and power management. CLUSTER 2007: 478-479 - [c16]Mohammad J. Rashti, Ahmad Afsahi:
Assessing the Ability of Computation/Communication Overlap and Communication Progress in Modern Interconnects. Hot Interconnects 2007: 117-124 - [c15]Ying Qian, Ahmad Afsahi:
High Performance RDMA-based Multi-port All-gather on Multi-rail QsNet II. HPCS 2007: 3 - [c14]Ying Qian, Ahmad Afsahi:
RDMA-based and SMP-aware Multi-port All-Gather on Multi-rail QsNet^II SMP Clusters. ICPP 2007: 48 - [c13]Ryan E. Grant, Ahmad Afsahi:
A Comprehensive Analysis of OpenMP Applications on Dual-Core Intel Xeon SMPs. IPDPS 2007: 1-8 - [c12]Mohammad J. Rashti, Ahmad Afsahi:
10-Gigabit iWARP Ethernet: Comparative Performance Analysis with InfiniBand and Myrinet-10G. IPDPS 2007: 1-8 - 2006
- [j3]Ying Qian, Ahmad Afsahi, Nathan R. Fredrickson, Reza Zamani:
Performance evaluation of the Sun Fire Link SMP clusters. Int. J. High Perform. Comput. Netw. 4(5/6): 209-221 (2006) - [c11]Ryan E. Grant, Ahmad Afsahi:
Power-performance efficiency of asymmetric multiprocessors for multi-threaded scientific applications. IPDPS 2006 - [c10]Ying Qian, Ahmad Afsahi:
Efficient RDMA-based multi-port collectives on multi-rail QsNetII clusters. IPDPS 2006 - 2005
- [c9]Reza Zamani, Ahmad Afsahi:
Communication Characteristics of Message-Passing Scientific and Engineering Applications. IASTED PDCS 2005: 644-649 - 2004
- [c8]Ying Qian, Ahmad Afsahi, Nathan R. Fredrickson, Reza Zamani:
Performance Evaluation of the Sun Fire Link SMP Clusters. HPCS 2004: 145-156 - [c7]Ying Qian, Ahmad Afsahi, Reza Zamani:
Myrinet Networks: A Performance Study. NCA 2004: 323-328 - 2003
- [c6]Nathan R. Fredrickson, Ahmad Afsahi, Ying Qian:
Performance characteristics of openMP constructs, and application benchmarks on a large symmetric multiprocessor. ICS 2003: 140-149 - 2002
- [j2]Ahmad Afsahi, Nikitas J. Dimopoulos:
Efficient communication using message prediction for clusters of multiprocessors. Concurr. Comput. Pract. Exp. 14(10): 859-883 (2002) - [j1]Ahmad Afsahi, Nikitas J. Dimopoulos:
Analysis of a Latency Hiding Broadcasting Algorithm on a Reconfigurable Optical Interconnect. Parallel Process. Lett. 12(1): 41-50 (2002) - [c5]Ahmad Afsahi, Nikitas J. Dimopoulos:
Architectural Extensions to Support Efficient Communication Using Message Prediction. HPCS 2002: 20-27 - 2000
- [c4]Ahmad Afsahi, Nikitas J. Dimopoulos:
Efficient Communication Using Message Prediction for Cluster Multiprocessors. CANPC 2000: 162-178
1990 – 1999
- 1999
- [c3]Ahmad Afsahi, Nikitas J. Dimopoulos:
Hiding Communication Latency in Reconfigurable Message-Passing Environments. IPPS/SPDP 1999: 55-60 - 1998
- [c2]Ahmad Afsahi, Nikitas J. Dimopoulos:
Communications Latency Hiding Techniques for a Reconfigurable Optical Interconnect: Benchmark Studies. PARA 1998: 1-6 - 1997
- [c1]Ahmad Afsahi, Nikitas J. Dimopoulos:
Collective Communications on a Reconfigurable Optical Interconnect. OPODIS 1997: 167-182
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-11-20 21:57 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint