Juan Gómez-Luna
2020 – today
- 2024
- [j39]Ivan Fernandez, Christina Giannoula, Aditya Manglik, Ricardo Quislant, Nika Mansouri-Ghiasi, Juan Gómez-Luna, Eladio Gutiérrez, Oscar G. Plata, Onur Mutlu:
MATSA: An MRAM-Based Energy-Efficient Accelerator for Time Series Analysis. IEEE Access 12: 36727-36742 (2024) - [j38]Can Firtina, Kamlesh R. Pillai, Gurpreet S. Kalsi, Bharathwaj Suresh, Damla Senol Cali, Jeremie S. Kim, Taha Shahroodi, Meryem Banu Cavlak, Joël Lindegger, Mohammed Alser, Juan Gómez-Luna, Sreenivas Subramoney, Onur Mutlu:
ApHMM: Accelerating Profile Hidden Markov Models for Fast and Energy-efficient Genome Analysis. ACM Trans. Archit. Code Optim. 21(1): 19:1-19:29 (2024) - [j37]Hajar Falahati, Mohammad Sadrosadati, Qiumin Xu, Juan Gómez-Luna, Banafsheh Saber Latibari, Hyeran Jeon, Shaahin Hessabi, Hamid Sarbazi-Azad, Onur Mutlu, Murali Annavaram, Massoud Pedram:
Cross-core Data Sharing for Energy-efficient GPUs. ACM Trans. Archit. Code Optim. 21(3): 42:1-42:32 (2024) - [j36]Lois Orosa, Skanda Koppula, Yaman Umuroglu, Konstantinos Kanellopoulos, Juan Gómez-Luna, Michaela Blott, Kees A. Vissers, Onur Mutlu:
EcoFlow: Efficient Convolutional Dataflows on Low-Power Neural Network Accelerators. IEEE Trans. Computers 73(9): 2275-2289 (2024) - [j35]Jie Zhang, Hongjing Huang, Jie Sun, Juan Gómez-Luna, Onur Mutlu, Zeke Wang:
SparseACC: A Generalized Linear Model Accelerator for Sparse Datasets. IEEE Trans. Comput. Aided Des. Integr. Circuits Syst. 43(3): 840-853 (2024) - [c69]Steve Rhyner, Haocong Luo, Juan Gómez-Luna, Mohammad Sadrosadati, Jiawei Jiang, Ataberk Olgun, Harshita Gupta, Ce Zhang, Onur Mutlu:
PIM-Opt: Demystifying Distributed Optimization Algorithms on a Real-World Processing-In-Memory System. PACT 2024: 201-218 - [c68]Ataberk Olgun, Majd Osseiran, A. Giray Yaglikçi, Yahya Can Tugrul, Haocong Luo, Steve Rhyner, Behzad Salami, Juan Gómez-Luna, Onur Mutlu:
Read Disturbance in High Bandwidth Memory: A Detailed Experimental Study on HBM2 DRAM Chips. DSN 2024: 75-89 - [c67]Ismail Emir Yüksel, Yahya Can Tugrul, F. Nisa Bostanci, Geraldo F. Oliveira, A. Giray Yaglikçi, Ataberk Olgun, Melina Soysal, Haocong Luo, Juan Gómez-Luna, Mohammad Sadrosadati, Onur Mutlu:
Simultaneous Many-Row Activation in Off-the-Shelf DRAM Chips: Experimental Characterization and Analysis. DSN 2024: 99-114 - [c66]Geraldo F. Oliveira, Ataberk Olgun, Abdullah Giray Yaglikçi, F. Nisa Bostanci, Juan Gómez-Luna, Saugata Ghose, Onur Mutlu:
MIMDRAM: An End-to-End Processing-Using-DRAM System for High-Throughput, Energy-Efficient and Programmer-Transparent Multiple-Instruction Multiple-Data Computing. HPCA 2024: 186-203 - [c65]Ismail Emir Yüksel, Yahya Can Tugrul, Ataberk Olgun, F. Nisa Bostanci, Abdullah Giray Yaglikçi, Geraldo F. Oliveira, Haocong Luo, Juan Gómez-Luna, Mohammad Sadrosadati, Onur Mutlu:
Functionally-Complete Boolean Logic in Real DRAM Chips: Experimental Characterization and Analysis. HPCA 2024: 280-296 - [c64]Kailash Gogineni, Sai Santosh Dayapule, Juan Gómez-Luna, Karthikeya Gogineni, Peng Wei, Tian Lan, Mohammad Sadrosadati, Onur Mutlu, Guru Venkataramani:
SwiftRL: Towards Efficient Reinforcement Learning on Real Processing-In-Memory Systems. ISPASS 2024: 217-229 - [i74]Christina Giannoula, Peiming Yang, Ivan Fernandez Vega, Jiacheng Yang, Yu Xin Li, Juan Gómez-Luna, Mohammad Sadrosadati, Onur Mutlu, Gennady Pekhimenko:
Accelerating Graph Neural Networks on Real Processing-In-Memory Systems. CoRR abs/2402.16731 (2024) - [i73]Ismail Emir Yuksel, Yahya Can Tugrul, Ataberk Olgun, F. Nisa Bostanci, Abdullah Giray Yaglikçi, Geraldo F. Oliveira, Haocong Luo, Juan Gómez-Luna, Mohammad Sadrosadati, Onur Mutlu:
Functionally-Complete Boolean Logic in Real DRAM Chips: Experimental Characterization and Analysis. CoRR abs/2402.18736 (2024) - [i72]Geraldo F. Oliveira, Ataberk Olgun, Abdullah Giray Yaglikçi, F. Nisa Bostanci, Juan Gómez-Luna, Saugata Ghose, Onur Mutlu:
MIMDRAM: An End-to-End Processing-Using-DRAM System for High-Throughput, Energy-Efficient and Programmer-Transparent Multiple-Instruction Multiple-Data Processing. CoRR abs/2402.19080 (2024) - [i71]Geraldo F. Oliveira, Emanuele G. Esposito, Juan Gómez-Luna, Onur Mutlu:
PUMA: Efficient and Low-Cost Memory Allocation and Alignment Support for Processing-Using-Memory Architectures. CoRR abs/2403.04539 (2024) - [i70]Steve Rhyner, Haocong Luo, Juan Gómez-Luna, Mohammad Sadrosadati, Jiawei Jiang, Ataberk Olgun, Harshita Gupta, Ce Zhang, Onur Mutlu:
Analysis of Distributed Optimization Algorithms on a Real Processing-In-Memory System. CoRR abs/2404.07164 (2024) - [i69]Kailash Gogineni, Sai Santosh Dayapule, Juan Gómez-Luna, Karthikeya Gogineni, Peng Wei, Tian Lan, Mohammad Sadrosadati, Onur Mutlu, Guru Venkataramani:
SwiftRL: Towards Efficient Reinforcement Learning on Real Processing-In-Memory Systems. CoRR abs/2405.03967 (2024) - [i68]Ismail Emir Yuksel, Yahya Can Tugrul, F. Nisa Bostanci, Geraldo F. Oliveira, Abdullah Giray Yaglikçi, Ataberk Olgun, Melina Soysal, Haocong Luo, Juan Gómez-Luna, Mohammad Sadrosadati, Onur Mutlu:
Simultaneous Many-Row Activation in Off-the-Shelf DRAM Chips: Experimental Characterization and Analysis. CoRR abs/2405.06081 (2024) - [i67]Maciej Besta, Robert Gerstenberger, Patrick Iff, Pournima Sonawane, Juan Gómez-Luna, Raghavendra Kanakagiri, Rui Min, Onur Mutlu, Torsten Hoefler, Raja Appuswamy, Aidan O'Mahony:
Hardware Acceleration for Knowledge Graph Processing: Challenges & Recent Developments. CoRR abs/2408.12173 (2024) - 2023
- [j34]Alain Denzler, Geraldo F. Oliveira, Nastaran Hajinazar, Rahul Bera, Gagandeep Singh, Juan Gómez-Luna, Onur Mutlu:
Casper: Accelerating Stencil Computations Using Near-Cache Processing. IEEE Access 11: 22136-22154 (2023) - [j33]Geraldo F. Oliveira, Saugata Ghose, Juan Gómez-Luna, Amirali Boroumand, Alexis Savery, Sonny Rao, Salman Qazi, Gwendal Grignou, Rahul Thakur, Eric Shiu, Onur Mutlu:
Extending Memory Capacity in Modern Consumer Systems With Emerging Non-Volatile Memory: Experimental Analysis and Characterization Using the Intel Optane SSD. IEEE Access 11: 105843-105871 (2023) - [j32]Safaa Diab, Amir Nassereldine, Mohammed Alser, Juan Gómez-Luna, Onur Mutlu, Izzat El Hajj:
A framework for high-throughput sequence alignment using real processing-in-memory systems. Bioinform. 39(5) (2023) - [j31]Joël Lindegger, Damla Senol Cali, Mohammed Alser, Juan Gómez-Luna, Nika Mansouri-Ghiasi, Onur Mutlu:
Scrooge: a fast and memory-frugal genomic sequence aligner for CPUs, GPUs, and ASICs. Bioinform. 39(5) (2023) - [j30]Ataberk Olgun, Juan Gómez-Luna, Konstantinos Kanellopoulos, Behzad Salami, Hasan Hassan, Oguz Ergin, Onur Mutlu:
PiDRAM: A Holistic End-to-end FPGA-based Framework for Processing-in-DRAM. ACM Trans. Archit. Code Optim. 20(1): 8:1-8:31 (2023) - [j29]Nika Mansouri-Ghiasi, Nandita Vijaykumar, Geraldo F. Oliveira, Lois Orosa, Ivan Fernandez, Mohammad Sadrosadati, Konstantinos Kanellopoulos, Nastaran Hajinazar, Juan Gómez-Luna, Onur Mutlu:
ALP: Alleviating CPU-Memory Data Movement Overheads in Memory-Centric Systems. IEEE Trans. Emerg. Top. Comput. 11(2): 388-403 (2023) - [j28]Antonio Fuentes-Alventosa, Juan Gómez-Luna, Rafael Medina Carnicer:
GVLE: a highly optimized GPU-based implementation of variable-length encoding. J. Supercomput. 79(8): 8447-8474 (2023) - [c63]Jinfan Chen, Juan Gómez-Luna, Izzat El Hajj, Yuxin Guo, Onur Mutlu:
SimplePIM: A Software Framework for Productive and Efficient Processing-in-Memory. PACT 2023: 99-111 - [c62]Ataberk Olgun, Majd Osseiran, Abdullah Giray Yaglikçi, Yahya Can Tugrul, Haocong Luo, Steve Rhyner, Behzad Salami, Juan Gómez-Luna, Onur Mutlu:
An Experimental Analysis of RowHammer in HBM2 DRAM Chips. DSN-S 2023: 151-156 - [c61]Gagandeep Singh, Alireza Khodamoradi, Kristof Denolf, Jack Lo, Juan Gómez-Luna, Joseph Melber, Andra Bisca, Henk Corporaal, Onur Mutlu:
SPARTA: Spatial Acceleration for Efficient and Scalable Horizontal Diffusion Weather Stencil Computation. ICS 2023: 463-476 - [c60]Harshita Gupta, Mayank Kabra, Juan Gómez-Luna, Konstantinos Kanellopoulos, Onur Mutlu:
Evaluating Homomorphic Operations on a Real-World Processing-In-Memory System. IISWC 2023: 211-215 - [c59]Rakesh Nadig, Mohammad Sadrosadati, Haiyu Mao, Nika Mansouri-Ghiasi, Arash Tavakkol, Jisung Park, Hamid Sarbazi-Azad, Juan Gómez-Luna, Onur Mutlu:
Venice: Improving Solid-State Drive Parallelism at Low Cost via Conflict-Free Accesses. ISCA 2023: 36:1-36:16 - [c58]Juan Gómez-Luna, Yuxin Guo, Sylvan Brocard, Julien Legriel, Remy Cimadomo, Geraldo F. Oliveira, Gagandeep Singh, Onur Mutlu:
Evaluating Machine Learning Workloads on Memory-Centric Computing Systems. ISPASS 2023: 35-49 - [c57]Maurus Item, Geraldo F. Oliveira, Juan Gómez-Luna, Mohammad Sadrosadati, Yuxin Guo, Onur Mutlu:
TransPimLib: Efficient Transcendental Functions for Processing-in-Memory Systems. ISPASS 2023: 235-247 - [c56]Lukas Breitwieser, Ahmad Hesam, Fons Rademakers, Juan Gómez-Luna, Onur Mutlu:
High-Performance and Scalable Agent-Based Simulation with BioDynaMo. PPoPP 2023: 174-188 - [i66]Lukas Breitwieser, Ahmad Hesam, Fons Rademakers, Juan Gómez-Luna, Onur Mutlu:
High-Performance and Scalable Agent-Based Simulation with BioDynaMo. CoRR abs/2301.06984 (2023) - [i65]Gagandeep Singh, Alireza Khodamoradi, Kristof Denolf, Jack Lo, Juan Gómez-Luna, Joseph Melber, Andra Bisca, Henk Corporaal, Onur Mutlu:
SPARTA: Spatial Acceleration for Efficient and Scalable Horizontal Diffusion Weather Stencil Computation. CoRR abs/2303.03509 (2023) - [i64]Maurus Item, Juan Gómez-Luna, Yuxin Guo, Geraldo F. Oliveira, Mohammad Sadrosadati, Onur Mutlu:
TransPimLib: A Library for Efficient Transcendental Functions on Processing-in-Memory Systems. CoRR abs/2304.01951 (2023) - [i63]Rakesh Nadig, Mohammad Sadrosadati, Haiyu Mao, Nika Mansouri-Ghiasi, Arash Tavakkol, Jisung Park, Hamid Sarbazi-Azad, Juan Gómez-Luna, Onur Mutlu:
Venice: Improving Solid-State Drive Parallelism at Low Cost via Conflict-Free Accesses. CoRR abs/2305.07768 (2023) - [i62]Ataberk Olgun, Majd Osseiran, Abdullah Giray Yaglikçi, Yahya Can Tugrul, Haocong Luo, Steve Rhyner, Behzad Salami, Juan Gómez-Luna, Onur Mutlu:
An Experimental Analysis of RowHammer in HBM2 DRAM Chips. CoRR abs/2305.17918 (2023) - [i61]Harshita Gupta, Mayank Kabra, Juan Gómez-Luna, Konstantinos Kanellopoulos, Onur Mutlu:
Evaluating Homomorphic Operations on a Real-World Processing-In-Memory System. CoRR abs/2309.06545 (2023) - [i60]Jinfan Chen, Juan Gómez-Luna, Izzat El Hajj, Yuxin Guo, Onur Mutlu:
SimplePIM: A Software Framework for Productive and Efficient Processing-in-Memory. CoRR abs/2310.01893 (2023) - [i59]Geraldo F. Oliveira, Alain Kohli, David Novo, Juan Gómez-Luna, Onur Mutlu:
DaPPA: A Data-Parallel Framework for Processing-in-Memory Architectures. CoRR abs/2310.10168 (2023) - [i58]Ataberk Olgun, Majd Osseiran, Abdullah Giray Yaglikçi, Yahya Can Tugrul, Haocong Luo, Steve Rhyner, Behzad Salami, Juan Gómez-Luna, Onur Mutlu:
Understanding Read Disturbance in High Bandwidth Memory: An Experimental Analysis of Real HBM2 DRAM Chips. CoRR abs/2310.14665 (2023) - [i57]Ismail Emir Yuksel, Yahya Can Tugrul, F. Nisa Bostanci, Abdullah Giray Yaglikçi, Ataberk Olgun, Geraldo F. Oliveira, Melina Soysal, Haocong Luo, Juan Gómez-Luna, Mohammad Sadrosadati, Onur Mutlu:
PULSAR: Simultaneous Many-Row Activation for Reliable and High-Performance Computing in Off-the-Shelf DRAM Chips. CoRR abs/2312.02880 (2023) - 2022
- [j27]Juan Gómez-Luna, Izzat El Hajj, Ivan Fernandez, Christina Giannoula, Geraldo F. Oliveira, Onur Mutlu:
Benchmarking a New Paradigm: Experimental Analysis and Characterization of a Real Processing-in-Memory System. IEEE Access 10: 52565-52608 (2022) - [j26]Antonio Fuentes-Alventosa, Juan Gómez-Luna, Rafael Medina Carnicer:
GUD-Canny: a real-time GPU-based unsupervised and distributed Canny edge detector. J. Real Time Image Process. 19(3): 591-605 (2022) - [j25]Geraldo F. Oliveira, Juan Gómez-Luna, Saugata Ghose, Amirali Boroumand, Onur Mutlu:
Accelerating Neural Network Inference With Processing-in-DRAM: From the Edge to the Cloud. IEEE Micro 42(6): 25-38 (2022) - [j24]Christina Giannoula, Ivan Fernandez, Juan Gómez-Luna, Nectarios Koziris, Georgios I. Goumas, Onur Mutlu:
SparseP: Towards Efficient Sparse Matrix Vector Multiplication on Real Processing-In-Memory Architectures. Proc. ACM Meas. Anal. Comput. Syst. 6(1): 21:1-21:49 (2022) - [j23]Antonio Fuentes-Alventosa, Juan Gómez-Luna, José María González-Linares, Nicolás Guil, Rafael Medina Carnicer:
CAVLCU: an efficient GPU-based implementation of CAVLC. J. Supercomput. 78(6): 7556-7590 (2022) - [j22]Gagandeep Singh, Dionysios Diamantopoulos, Juan Gómez-Luna, Christoph Hagleitner, Sander Stuijk, Henk Corporaal, Onur Mutlu:
Accelerating Weather Prediction Using Near-Memory Reconfigurable Fabric. ACM Trans. Reconfigurable Technol. Syst. 15(4): 39:1-39:27 (2022) - [c55]Mhd Ghaith Olabi, Juan Gómez-Luna, Onur Mutlu, Wen-Mei Hwu, Izzat El Hajj:
A Compiler Framework for Optimizing Dynamic Parallelism on GPUs. CGO 2022: 1-13 - [c54]Gagandeep Singh, Dionysios Diamantopoulos, Juan Gómez-Luna, Sander Stuijk, Henk Corporaal, Onur Mutlu:
LEAPER: Fast and Accurate FPGA-based System Performance Prediction via Transfer Learning. ICCD 2022: 499-508 - [c53]Joël Lindegger, Damla Senol Cali, Mohammed Alser, Juan Gómez-Luna, Onur Mutlu:
Algorithmic Improvement and GPU Acceleration of the GenASM Algorithm. IPDPS Workshops 2022: 162 - [c52]Safaa Diab, Amir Nassereldine, Mohammed Alser, Juan Gómez-Luna, Onur Mutlu, Izzat El Hajj:
High-throughput Pairwise Alignment with the Wavefront Algorithm using Processing-in-Memory. IPDPS Workshops 2022: 163 - [c51]Gagandeep Singh, Rakesh Nadig, Jisung Park, Rahul Bera, Nastaran Hajinazar, David Novo, Juan Gómez-Luna, Sander Stuijk, Henk Corporaal, Onur Mutlu:
Sibyl: adaptive and extensible data placement in hybrid storage systems using online reinforcement learning. ISCA 2022: 320-336 - [c50]Damla Senol Cali, Konstantinos Kanellopoulos, Joël Lindegger, Zülal Bingöl, Gurpreet S. Kalsi, Ziyi Zuo, Can Firtina, Meryem Banu Cavlak, Jeremie S. Kim, Nika Mansouri-Ghiasi, Gagandeep Singh, Juan Gómez-Luna, Nour Almadhoun Alserr, Mohammed Alser, Sreenivas Subramoney, Can Alkan, Saugata Ghose, Onur Mutlu:
SeGraM: a universal hardware accelerator for genomic sequence-to-graph and sequence-to-sequence mapping. ISCA 2022: 638-655 - [c49]Geraldo F. Oliveira, Juan Gómez-Luna, Saugata Ghose, Onur Mutlu:
Methodologies, Workloads, and Tools for Processing-in-Memory: Enabling the Adoption of Data-Centric Architectures. ISVLSI 2022: 261-266 - [c48]Ataberk Olgun, Juan Gómez-Luna, Konstantinos Kanellopoulos, Behzad Salami, Hasan Hassan, Oguz Ergin, Onur Mutlu:
PiDRAM: An FPGA-based Framework for End-to-end Evaluation of Processing-in-DRAM Techniques. ISVLSI 2022: 267-272 - [c47]Geraldo F. Oliveira, Amirali Boroumand, Saugata Ghose, Juan Gómez-Luna, Onur Mutlu:
Heterogeneous Data-Centric Architectures for Modern Data-Intensive Applications: Case Studies in Machine Learning and Databases. ISVLSI 2022: 273-278 - [c46]Ivan Fernandez, Ricardo Quislant, Christina Giannoula, Mohammed Alser, Juan Gómez-Luna, Eladio Gutiérrez, Oscar G. Plata, Onur Mutlu:
Exploiting Near-Data Processing to Accelerate Time Series Analysis. ISVLSI 2022: 279-282 - [c45]Christina Giannoula, Ivan Fernandez, Juan Gómez-Luna, Nectarios Koziris, Georgios I. Goumas, Onur Mutlu:
SparseP: Efficient Sparse Matrix Vector Multiplication on Real Processing-In-Memory Architectures. ISVLSI 2022: 288-291 - [c44]Juan Gómez-Luna, Yuxin Guo, Sylvan Brocard, Julien Legriel, Remy Cimadomo, Geraldo F. Oliveira, Gagandeep Singh, Onur Mutlu:
Machine Learning Training on a Real Processing-in-Memory System. ISVLSI 2022: 292-295 - [c43]Sina Darabi, Mohammad Sadrosadati, Negar Akbarzadeh, Joël Lindegger, Mohammad Hosseini, Jisung Park, Juan Gómez-Luna, Onur Mutlu, Hamid Sarbazi-Azad:
Morpheus: Extending the Last Level Cache Capacity in GPU Systems Using Idle GPU Core Resources. MICRO 2022: 228-244 - [c42]João Dinis Ferreira, Gabriel Falcão, Juan Gómez-Luna, Mohammed Alser, Lois Orosa, Mohammad Sadrosadati, Jeremie S. Kim, Geraldo F. Oliveira, Taha Shahroodi, Anant Nori, Onur Mutlu:
pLUTo: Enabling Massively Parallel Computation in DRAM via Lookup Tables. MICRO 2022: 900-919 - [c41]Jisung Park, Roknoddin Azizi, Geraldo F. Oliveira, Mohammad Sadrosadati, Rakesh Nadig, David Novo, Juan Gómez-Luna, Myungsuk Kim, Onur Mutlu:
Flash-Cosmos: In-Flash Bulk Bitwise Operations Using Inherent Computation Capability of NAND Flash Memory. MICRO 2022: 937-955 - [c40]Christina Giannoula, Ivan Fernandez, Juan Gómez-Luna, Nectarios Koziris, Georgios I. Goumas, Onur Mutlu:
Towards Efficient Sparse Matrix Vector Multiplication on Real Processing-In-Memory Architectures. SIGMETRICS (Abstracts) 2022: 33-34 - [d1]João Dinis Ferreira, Gabriel Falcão, Juan Gómez-Luna, Mohammed Alser, Lois Orosa, Mohammad Sadrosadati, Jeremie S. Kim, Geraldo F. Oliveira, Taha Shahroodi, Anant Nori, Onur Mutlu:
pLUTo: Enabling Massively Parallel Computation In DRAM via Lookup Tables. Zenodo, 2022 - [i56]Mhd Ghaith Olabi, Juan Gómez-Luna, Onur Mutlu, Wen-Mei W. Hwu, Izzat El Hajj:
A Compiler Framework for Optimizing Dynamic Parallelism on GPUs. CoRR abs/2201.02789 (2022) - [i55]Christina Giannoula, Ivan Fernandez, Juan Gómez-Luna, Nectarios Koziris, Georgios I. Goumas, Onur Mutlu:
SparseP: Towards Efficient Sparse Matrix Vector Multiplication on Real Processing-In-Memory Systems. CoRR abs/2201.05072 (2022) - [i54]Lois Orosa, Skanda Koppula, Yaman Umuroglu, Konstantinos Kanellopoulos, Juan Gómez-Luna, Michaela Blott, Kees A. Vissers, Onur Mutlu:
EcoFlow: Efficient Convolutional Dataflows for Low-Power Neural Network Accelerators. CoRR abs/2202.02310 (2022) - [i53]Joël Lindegger, Damla Senol Cali, Mohammed Alser, Juan Gómez-Luna, Onur Mutlu:
Algorithmic Improvement and GPU Acceleration of the GenASM Algorithm. CoRR abs/2203.15561 (2022) - [i52]Christina Giannoula, Ivan Fernandez, Juan Gómez-Luna, Nectarios Koziris, Georgios I. Goumas, Onur Mutlu:
Towards Efficient Sparse Matrix Vector Multiplication on Real Processing-In-Memory Systems. CoRR abs/2204.00900 (2022) - [i51]Safaa Diab, Amir Nassereldine, Mohammed Alser, Juan Gómez-Luna, Onur Mutlu, Izzat El Hajj:
High-throughput Pairwise Alignment with the Wavefront Algorithm using Processing-in-Memory. CoRR abs/2204.02085 (2022) - [i50]Damla Senol Cali, Konstantinos Kanellopoulos, Joël Lindegger, Zülal Bingöl, Gurpreet S. Kalsi, Ziyi Zuo, Can Firtina, Meryem Banu Cavlak, Jeremie S. Kim, Nika Mansouri-Ghiasi, Gagandeep Singh, Juan Gómez-Luna, Nour Almadhoun Alserr, Mohammed Alser, Sreenivas Subramoney, Can Alkan, Saugata Ghose, Onur Mutlu:
SeGraM: A Universal Hardware Accelerator for Genomic Sequence-to-Graph and Sequence-to-Sequence Mapping. CoRR abs/2205.05883 (2022) - [i49]Gagandeep Singh, Rakesh Nadig, Jisung Park, Rahul Bera, Nastaran Hajinazar, David Novo, Juan Gómez-Luna, Sander Stuijk, Henk Corporaal, Onur Mutlu:
Sibyl: Adaptive and Extensible Data Placement in Hybrid Storage Systems Using Online Reinforcement Learning. CoRR abs/2205.07394 (2022) - [i48]Mohammed Alser, Joël Lindegger, Can Firtina, Nour Almadhoun, Haiyu Mao, Gagandeep Singh, Juan Gómez-Luna, Onur Mutlu:
Going From Molecules to Genomic Variations to Scientific Discovery: Intelligent Algorithms and Architectures for Intelligent Genome Analysis. CoRR abs/2205.07957 (2022) - [i47]Geraldo F. Oliveira, Juan Gómez-Luna, Saugata Ghose, Onur Mutlu:
Methodologies, Workloads, and Tools for Processing-in-Memory: Enabling the Adoption of Data-Centric Architectures. CoRR abs/2205.14647 (2022) - [i46]Geraldo F. Oliveira, Amirali Boroumand, Saugata Ghose, Juan Gómez-Luna, Onur Mutlu:
Heterogeneous Data-Centric Architectures for Modern Data-Intensive Applications: Case Studies in Machine Learning and Databases. CoRR abs/2205.14664 (2022) - [i45]Ataberk Olgun, Juan Gómez-Luna, Konstantinos Kanellopoulos, Behzad Salami, Hasan Hassan, Oguz Ergin, Onur Mutlu:
PiDRAM: An FPGA-based Framework for End-to-end Evaluation of Processing-in-DRAM Techniques. CoRR abs/2206.00263 (2022) - [i44]Ivan Fernandez, Ricardo Quislant, Christina Giannoula, Mohammed Alser, Juan Gómez-Luna, Eladio Gutiérrez, Oscar G. Plata, Onur Mutlu:
Exploiting Near-Data Processing to Accelerate Time Series Analysis. CoRR abs/2206.00938 (2022) - [i43]Juan Gómez-Luna, Yuxin Guo, Sylvan Brocard, Julien Legriel, Remy Cimadomo, Geraldo F. Oliveira, Gagandeep Singh, Onur Mutlu:
Machine Learning Training on a Real Processing-in-Memory System. CoRR abs/2206.06022 (2022) - [i42]Juan Gómez-Luna, Yuxin Guo, Sylvan Brocard, Julien Legriel, Remy Cimadomo, Geraldo F. Oliveira, Gagandeep Singh, Onur Mutlu:
An Experimental Evaluation of Machine Learning Training on a Real Processing-in-Memory System. CoRR abs/2207.07886 (2022) - [i41]Can Firtina, Kamlesh R. Pillai, Gurpreet S. Kalsi, Bharathwaj Suresh, Damla Senol Cali, Jeremie S. Kim, Taha Shahroodi, Meryem Banu Cavlak, Joël Lindegger, Mohammed Alser, Juan Gómez-Luna, Sreenivas Subramoney, Onur Mutlu:
ApHMM: Accelerating Profile Hidden Markov Models for Fast and Energy-Efficient Genome Analysis. CoRR abs/2207.09765 (2022) - [i40]Safaa Diab, Amir Nassereldine, Mohammed Alser, Juan Gómez-Luna, Onur Mutlu, Izzat El Hajj:
A Framework for High-throughput Sequence Alignment using Real Processing-in-Memory Systems. CoRR abs/2208.01243 (2022) - [i39]Joël Lindegger, Damla Senol Cali, Mohammed Alser, Juan Gómez-Luna, Nika Mansouri-Ghiasi, Onur Mutlu:
Scrooge: A Fast and Memory-Frugal Genomic Sequence Aligner for CPUs, GPUs, and ASICs. CoRR abs/2208.09985 (2022) - [i38]Gagandeep Singh, Dionysios Diamantopoulos, Juan Gómez-Luna, Sander Stuijk, Henk Corporaal, Onur Mutlu:
LEAPER: Modeling Cloud FPGA-based Systems via Transfer Learning. CoRR abs/2208.10606 (2022) - [i37]Jisung Park, Roknoddin Azizi, Geraldo F. Oliveira, Mohammad Sadrosadati, Rakesh Nadig, David Novo, Juan Gómez-Luna, Myungsuk Kim, Onur Mutlu:
Flash-Cosmos: In-Flash Bulk Bitwise Operations Using Inherent Computation Capability of NAND Flash Memory. CoRR abs/2209.05566 (2022) - [i36]Geraldo F. Oliveira, Juan Gómez-Luna, Saugata Ghose, Amirali Boroumand, Onur Mutlu:
Accelerating Neural Network Inference with Processing-in-DRAM: From the Edge to the Cloud. CoRR abs/2209.08938 (2022) - [i35]Sina Darabi, Mohammad Sadrosadati, Joël Lindegger, Negar Akbarzadeh, Mohammad Hosseini, Jisung Park, Juan Gómez-Luna, Hamid Sarbazi-Azad, Onur Mutlu:
Morpheus: Extending the Last Level Cache Capacity in GPU Systems Using Idle GPU Core Resources. CoRR abs/2209.10914 (2022) - [i34]Nika Mansouri-Ghiasi, Mohammad Sadrosadati, Geraldo F. Oliveira, Konstantinos Kanellopoulos, Rachata Ausavarungnirun, Juan Gómez-Luna, Aditya Manglik, João Dinis Ferreira, Jeremie S. Kim, Christina Giannoula, Nandita Vijaykumar, Jisung Park, Onur Mutlu:
RevaMp3D: Architecting the Processor Core and Cache Hierarchy for Systems with Monolithically-Integrated Logic and Memory. CoRR abs/2210.08508 (2022) - [i33]Ivan Fernandez, Aditya Manglik, Christina Giannoula, Ricardo Quislant, Nika Mansouri-Ghiasi, Juan Gómez-Luna, Eladio Gutiérrez, Oscar G. Plata, Onur Mutlu:
Accelerating Time Series Analysis via Processing using Non-Volatile Memories. CoRR abs/2211.04369 (2022) - [i32]Nika Mansouri-Ghiasi, Nandita Vijaykumar, Geraldo F. Oliveira, Lois Orosa, Ivan Fernandez, Mohammad Sadrosadati, Konstantinos Kanellopoulos, Nastaran Hajinazar, Juan Gómez-Luna, Onur Mutlu:
ALP: Alleviating CPU-Memory Data Movement Overheads in Memory-Centric Systems. CoRR abs/2212.06292 (2022) - 2021
- [j21]Geraldo F. Oliveira, Juan Gómez-Luna, Lois Orosa, Saugata Ghose, Nandita Vijaykumar, Ivan Fernandez, Mohammad Sadrosadati, Onur Mutlu:
DAMOV: A New Methodology and Benchmark Suite for Evaluating Data Movement Bottlenecks. IEEE Access 9: 134457-134502 (2021) - [j20]Mohammed Alser, Taha Shahroodi, Juan Gómez-Luna, Can Alkan, Onur Mutlu:
SneakySnake: a fast and accurate universal genome pre-alignment filter for CPUs, GPUs and FPGAs. Bioinform. 36(22-23): 5282-5290 (2021) - [j19]Gagandeep Singh, Mohammed Alser, Damla Senol Cali, Dionysios Diamantopoulos, Juan Gómez-Luna, Henk Corporaal, Onur Mutlu:
FPGA-Based Near-Memory Acceleration of Modern Data-Intensive Applications. IEEE Micro 41(4): 39-48 (2021) - [c39]Nastaran Hajinazar, Geraldo F. Oliveira, Sven Gregorio, João Dinis Ferreira, Nika Mansouri-Ghiasi, Minesh Patel, Mohammed Alser, Saugata Ghose, Juan Gómez-Luna, Onur Mutlu:
SIMDRAM: a framework for bit-serial SIMD processing using DRAM. ASPLOS 2021: 329-345 - [c38]Gagandeep Singh, Dionysios Diamantopoulos, Juan Gómez-Luna, Sander Stuijk, Onur Mutlu, Henk Corporaal:
Modeling FPGA-Based Systems via Few-Shot Learning. FPGA 2021: 146 - [c37]Juan Gómez-Luna, Izzat El Hajj, Ivan Fernandez, Christina Giannoula, Geraldo F. Oliveira, Onur Mutlu:
Benchmarking Memory-Centric Computing Systems: Analysis of Real Processing-In-Memory Hardware. IGSC (Workshops) 2021: 1-7 - [c36]Christina Giannoula, Nandita Vijaykumar, Nikela Papadopoulou, Vasileios Karakostas, Ivan Fernandez, Juan Gómez-Luna, Lois Orosa, Nectarios Koziris, Georgios I. Goumas, Onur Mutlu:
SynCron: Efficient Synchronization Support for Near-Data-Processing Architectures. HPCA 2021: 263-276 - [c35]Lois Orosa, Yaohua Wang, Mohammad Sadrosadati, Jeremie S. Kim, Minesh Patel, Ivan Puddu, Haocong Luo, Kaveh Razavi, Juan Gómez-Luna, Hasan Hassan, Nika Mansouri-Ghiasi, Saugata Ghose, Onur Mutlu:
CODIC: A Low-Cost Substrate for Enabling Custom In-DRAM Functionalities and Optimizations. ISCA 2021: 484-497 - [c34]Jawad Haj-Yahya, Lois Orosa, Jeremie S. Kim, Juan Gómez-Luna, Abdullah Giray Yaglikçi, Mohammed Alser, Ivan Puddu, Onur Mutlu:
IChannels: Exploiting Current Management Mechanisms to Create Covert Channels in Modern Processors. ISCA 2021: 985-998 - [c33]Jawad Haj-Yahya, Jisung Park, Rahul Bera, Juan Gómez-Luna, Efraim Rotem, Taha Shahroodi, Jeremie S. Kim, Onur Mutlu:
BurstLink: Techniques for Energy-Efficient Video Display for Conventional and Virtual Reality Systems. MICRO 2021: 155-169 - [c32]Maciej Besta, Raghavendra Kanakagiri, Grzegorz Kwasniewski, Rachata Ausavarungnirun, Jakub Beránek, Konstantinos Kanellopoulos, Kacper Janda, Zur Vonarburg-Shmaria, Lukas Gianinazzi, Ioana Stefan, Juan Gómez-Luna, Jakub Golinowski, Marcin Copik, Lukas Kapp-Schwoerer, Salvatore Di Girolamo, Nils Blach, Marek Konieczny, Onur Mutlu, Torsten Hoefler:
SISA: Set-Centric Instruction Set Architecture for Graph Mining on Processing-in-Memory Systems. MICRO 2021: 282-297 - [i31]Christina Giannoula, Nandita Vijaykumar, Nikela Papadopoulou, Vasileios Karakostas, Ivan Fernandez, Juan Gómez-Luna, Lois Orosa, Nectarios Koziris, Georgios I. Goumas, Onur Mutlu:
SynCron: Efficient Synchronization Support for Near-Data-Processing Architectures. CoRR abs/2101.07557 (2021) - [i30]Jawad Haj-Yahya, Jisung Park, Rahul Bera, Juan Gómez-Luna, Efraim Rotem, Taha Shahroodi, Jeremie S. Kim, Onur Mutlu:
BurstLink: Techniques for Energy-Efficient Conventional and Virtual Reality Video Display. CoRR abs/2104.05119 (2021) - [i29]Maciej Besta, Raghavendra Kanakagiri, Grzegorz Kwasniewski, Rachata Ausavarungnirun, Jakub Beránek, Konstantinos Kanellopoulos, Kacper Janda, Zur Vonarburg-Shmaria, Lukas Gianinazzi, Ioana Stefan, Juan Gómez-Luna, Marcin Copik, Lukas Kapp-Schwoerer, Salvatore Di Girolamo, Marek Konieczny, Onur Mutlu, Torsten Hoefler:
SISA: Set-Centric Instruction Set Architecture for Graph Mining on Processing-in-Memory Systems. CoRR abs/2104.07582 (2021) - [i28]João Dinis Ferreira, Gabriel Falcão, Juan Gómez-Luna, Mohammed Alser, Lois Orosa, Mohammad Sadrosadati, Jeremie S. Kim, Geraldo F. Oliveira, Taha Shahroodi, Anant Nori, Onur Mutlu:
pLUTo: In-DRAM Lookup Tables to Enable Massively Parallel General-Purpose Computation. CoRR abs/2104.07699 (2021) - [i27]Geraldo F. Oliveira, Juan Gómez-Luna, Lois Orosa, Saugata Ghose, Nandita Vijaykumar, Ivan Fernandez, Mohammad Sadrosadati, Onur Mutlu:
DAMOV: A New Methodology and Benchmark Suite for Evaluating Data Movement Bottlenecks. CoRR abs/2105.03725 (2021) - [i26]Juan Gómez-Luna, Izzat El Hajj, Ivan Fernandez, Christina Giannoula, Geraldo F. Oliveira, Onur Mutlu:
Benchmarking a New Paradigm: An Experimental Analysis of a Real Processing-in-Memory Architecture. CoRR abs/2105.03814 (2021) - [i25]Nastaran Hajinazar, Geraldo F. Oliveira, Sven Gregorio, João Dinis Ferreira, Nika Mansouri-Ghiasi, Minesh Patel, Mohammed Alser, Saugata Ghose, Juan Gómez-Luna, Onur Mutlu:
SIMDRAM: An End-to-End Framework for Bit-Serial SIMD Computing in DRAM. CoRR abs/2105.12839 (2021) - [i24]Jawad Haj-Yahya, Jeremie S. Kim, Abdullah Giray Yaglikçi, Ivan Puddu, Lois Orosa, Juan Gómez-Luna, Mohammed Alser, Onur Mutlu:
IChannels: Exploiting Current Management Mechanisms to Create Covert Channels in Modern Processors. CoRR abs/2106.05050 (2021) - [i23]Lois Orosa, Yaohua Wang, Mohammad Sadrosadati, Jeremie S. Kim, Minesh Patel, Ivan Puddu, Haocong Luo, Kaveh Razavi, Juan Gómez-Luna, Hasan Hassan, Nika Mansouri-Ghiasi, Saugata Ghose, Onur Mutlu:
CODIC: A Low-Cost Substrate for Enabling Custom In-DRAM Functionalities and Optimizations. CoRR abs/2106.05632 (2021) - [i22]Gagandeep Singh, Mohammed Alser, Damla Senol Cali, Dionysios Diamantopoulos, Juan Gómez-Luna, Henk Corporaal, Onur Mutlu:
FPGA-based Near-Memory Acceleration of Modern Data-Intensive Applications. CoRR abs/2106.06433 (2021) - [i21]Gagandeep Singh, Dionysios Diamantopoulos, Juan Gómez-Luna, Christoph Hagleitner, Sander Stuijk, Henk Corporaal, Onur Mutlu:
NERO: Accelerating Weather Prediction using Near-Memory Reconfigurable Fabric. CoRR abs/2107.08716 (2021) - [i20]Juan Gómez-Luna, Izzat El Hajj, Ivan Fernandez, Christina Giannoula, Geraldo F. Oliveira, Onur Mutlu:
Benchmarking Memory-Centric Computing Systems: Analysis of Real Processing-in-Memory Hardware. CoRR abs/2110.01709 (2021) - [i19]Ataberk Olgun, Juan Gómez-Luna, Konstantinos Kanellopoulos, Behzad Salami, Hasan Hassan, Oguz Ergin, Onur Mutlu:
PiDRAM: A Holistic End-to-end FPGA-based Framework for Processing-in-DRAM. CoRR abs/2111.00082 (2021) - [i18]Geraldo F. Oliveira, Saugata Ghose, Juan Gómez-Luna, Amirali Boroumand, Alexis Savery, Sonny Rao, Salman Qazi, Gwendal Grignou, Rahul Thakur, Eric Shiu, Onur Mutlu:
Extending Memory Capacity in Consumer Devices with Emerging Non-Volatile Memory: An Experimental Study. CoRR abs/2111.02325 (2021) - [i17]Alain Denzler, Rahul Bera, Nastaran Hajinazar, Gagandeep Singh, Geraldo F. Oliveira, Juan Gómez-Luna, Onur Mutlu:
Casper: Accelerating Stencil Computation using Near-cache Processing. CoRR abs/2112.14216 (2021) - 2020
- [j18]Nitin Satpute, Juan Gómez-Luna, Joaquín Olivares:
Accelerating Chan-Vese model with cross-modality guided contrast enhancement for liver segmentation. Comput. Biol. Medicine 124: 103930 (2020) - [j17]Orestis Zachariadis, Nitin Satpute, Juan Gómez-Luna, Joaquín Olivares:
Accelerating sparse matrix-matrix multiplication with GPU Tensor Cores. Comput. Electr. Eng. 88: 106848 (2020) - [j16]Nitin Satpute, Rabia Naseem, Egidijus Pelanis, Juan Gómez-Luna, Faouzi Alaya Cheikh, Ole Jakob Elle, Joaquín Olivares:
GPU acceleration of liver enhancement for tumor segmentation. Comput. Methods Programs Biomed. 184: 105285 (2020) - [j15]Nitin Satpute, Rabia Naseem, Rafael Palomar, Orestis Zachariadis, Juan Gómez-Luna, Faouzi Alaya Cheikh, Joaquín Olivares:
Fast parallel vessel segmentation. Comput. Methods Programs Biomed. 192: 105430 (2020) - [j14]Orestis Zachariadis, Andrea Teatini, Nitin Satpute, Juan Gómez-Luna, Onur Mutlu, Ole Jakob Elle, Joaquín Olivares:
Accelerating B-spline interpolation on GPUs: Application to medical image registration. Comput. Methods Programs Biomed. 193: 105431 (2020) - [c31]Jiantong Jiang, Zeke Wang, Xue Liu, Juan Gómez-Luna, Nan Guan, Qingxu Deng, Wei Zhang, Onur Mutlu:
Boyi: A Systematic Framework for Automatically Deciding the Right Execution Model of OpenCL Applications on FPGAs. FPGA 2020: 299-309 - [c30]Gagandeep Singh, Dionysios Diamantopoulos, Christoph Hagleitner, Juan Gómez-Luna, Sander Stuijk, Onur Mutlu, Henk Corporaal:
NERO: A Near High-Bandwidth Memory Stencil Accelerator for Weather Prediction Modeling. FPL 2020: 9-17 - [c29]Ivan Fernandez, Ricardo Quislant, Eladio Gutiérrez, Oscar G. Plata, Christina Giannoula, Mohammed Alser, Juan Gómez-Luna, Onur Mutlu:
NATSA: A Near-Data Processing Accelerator for Time Series Analysis. ICCD 2020: 120-129 - [c28]Yaohua Wang, Lois Orosa, Xiangjun Peng, Yang Guo, Saugata Ghose, Minesh Patel, Jeremie S. Kim, Juan Gómez-Luna, Mohammad Sadrosadati, Nika Mansouri-Ghiasi, Onur Mutlu:
FIGARO: Improving System Performance via Fine-Grained In-DRAM Data Relocation and Caching. MICRO 2020: 313-328 - [c27]Damla Senol Cali, Gurpreet S. Kalsi, Zülal Bingöl, Can Firtina, Lavanya Subramanian, Jeremie S. Kim, Rachata Ausavarungnirun, Mohammed Alser, Juan Gómez-Luna, Amirali Boroumand, Anant Nori, Allison Scibisz, Sreenivas Subramoney, Can Alkan, Saugata Ghose, Onur Mutlu:
GenASM: A High-Performance, Low-Power Approximate String Matching Acceleration Framework for Genome Sequence Analysis. MICRO 2020: 951-966 - [i16]Orestis Zachariadis, Andrea Teatini, Nitin Satpute, Juan Gómez-Luna, Onur Mutlu, Ole Jakob Elle, Joaquín Olivares:
Accelerating B-spline Interpolation on GPUs: Application to Medical Image Registration. CoRR abs/2004.05962 (2020) - [i15]Damla Senol Cali, Gurpreet S. Kalsi, Zülal Bingöl, Can Firtina, Lavanya Subramanian, Jeremie S. Kim, Rachata Ausavarungnirun, Mohammed Alser, Juan Gómez-Luna, Amirali Boroumand, Anant Nori, Allison Scibisz, Sreenivas Subramoney, Can Alkan, Saugata Ghose, Onur Mutlu:
GenASM: A High-Performance, Low-Power Approximate String Matching Acceleration Framework for Genome Sequence Analysis. CoRR abs/2009.07692 (2020) - [i14]Gagandeep Singh, Dionysios Diamantopoulos, Christoph Hagleitner, Juan Gómez-Luna, Sander Stuijk, Onur Mutlu, Henk Corporaal:
NERO: A Near High-Bandwidth Memory Stencil Accelerator for Weather Prediction Modeling. CoRR abs/2009.08241 (2020) - [i13]Yaohua Wang, Lois Orosa, Xiangjun Peng, Yang Guo, Saugata Ghose, Minesh Patel, Jeremie S. Kim, Juan Gómez-Luna, Mohammad Sadrosadati, Nika Mansouri-Ghiasi, Onur Mutlu:
FIGARO: Improving System Performance via Fine-Grained In-DRAM Data Relocation and Caching. CoRR abs/2009.08437 (2020) - [i12]Orestis Zachariadis, Nitin Satpute, Juan Gómez-Luna, Joaquín Olivares:
Accelerating Sparse Matrix-Matrix Multiplication with GPU Tensor Cores. CoRR abs/2009.14600 (2020) - [i11]Ivan Fernandez, Ricardo Quislant, Christina Giannoula, Mohammed Alser, Juan Gómez-Luna, Eladio Gutiérrez, Oscar G. Plata, Onur Mutlu:
NATSA: A Near-Data Processing Accelerator for Time Series Analysis. CoRR abs/2010.02079 (2020) - [i10]Onur Mutlu, Saugata Ghose, Juan Gómez-Luna, Rachata Ausavarungnirun:
A Modern Primer on Processing in Memory. CoRR abs/2012.03112 (2020) - [i9]Nastaran Hajinazar, Geraldo F. Oliveira, Sven Gregorio, João Dinis Ferreira, Nika Mansouri-Ghiasi, Minesh Patel, Mohammed Alser, Saugata Ghose, Juan Gómez-Luna, Onur Mutlu:
SIMDRAM: A Framework for Bit-Serial SIMD Processing Using DRAM. CoRR abs/2012.11890 (2020)
2010 – 2019
- 2019
- [j13]Saugata Ghose, Amirali Boroumand, Jeremie S. Kim, Juan Gómez-Luna, Onur Mutlu:
Processing-in-memory: A workload-driven perspective. IBM J. Res. Dev. 63(6): 3:1-3:19 (2019) - [j12]Onur Mutlu, Saugata Ghose, Juan Gómez-Luna, Rachata Ausavarungnirun:
Processing data where it makes sense: Enabling in-memory computation. Microprocess. Microsystems 67: 28-41 (2019) - [c26]Simon Garcia De Gonzalo, Sitao Huang, Juan Gómez-Luna, Simon D. Hammond, Onur Mutlu, Wen-Mei Hwu:
Automatic Generation of Warp-Level Primitives and Atomic Instructions for Fast and Portable Parallel Reduction on GPUs. CGO 2019: 73-84 - [c25]Onur Mutlu, Saugata Ghose, Juan Gómez-Luna, Rachata Ausavarungnirun:
Enabling Practical Processing in and near Memory for Data-Intensive Computing. DAC 2019: 21 - [c24]Gagandeep Singh, Juan Gómez-Luna, Giovanni Mariani, Geraldo F. Oliveira, Stefano Corda, Sander Stuijk, Onur Mutlu, Henk Corporaal:
NAPEL: Near-Memory Computing Application Performance Prediction via Ensemble Learning. DAC 2019: 27 - [c23]Konstantinos Kanellopoulos, Nandita Vijaykumar, Christina Giannoula, Roknoddin Azizi, Skanda Koppula, Nika Mansouri-Ghiasi, Taha Shahroodi, Juan Gómez-Luna, Onur Mutlu:
SMASH: Co-designing Software Compression and Hardware-Accelerated Indexing for Efficient Sparse Matrix Operations. MICRO 2019: 600-614 - [c22]Sitao Huang, Li-Wen Chang, Izzat El Hajj, Simon Garcia De Gonzalo, Juan Gómez-Luna, Sai Rahul Chalamalasetti, Mohamed El-Hadedy, Dejan S. Milojicic, Onur Mutlu, Deming Chen, Wen-Mei W. Hwu:
Analysis and Modeling of Collaborative Execution Strategies for Heterogeneous CPU-FPGA Architectures. ICPE 2019: 79-90 - [i8]Lois Orosa, Yaohua Wang, Ivan Puddu, Mohammad Sadrosadati, Kaveh Razavi, Juan Gómez-Luna, Hasan Hassan, Nika Mansouri-Ghiasi, Arash Tavakkol, Minesh Patel, Jeremie S. Kim, Vivek Seshadri, Uksong Kang, Saugata Ghose, Rodolfo Azevedo, Onur Mutlu:
Dataplant: In-DRAM Security Mechanisms for Low-Cost Devices. CoRR abs/1902.07344 (2019) - [i7]Onur Mutlu, Saugata Ghose, Juan Gómez-Luna, Rachata Ausavarungnirun:
Processing Data Where It Makes Sense: Enabling In-Memory Computation. CoRR abs/1903.03988 (2019) - [i6]Onur Mutlu, Saugata Ghose, Juan Gómez-Luna, Rachata Ausavarungnirun:
Enabling Practical Processing in and near Memory for Data-Intensive Computing. CoRR abs/1905.04376 (2019) - [i5]Saugata Ghose, Amirali Boroumand, Jeremie S. Kim, Juan Gómez-Luna, Onur Mutlu:
A Workload and Programming Ease Driven Perspective of Processing-in-Memory. CoRR abs/1907.12947 (2019) - [i4]Mohammed Alser, Taha Shahroodi, Juan Gómez-Luna, Can Alkan, Onur Mutlu:
SneakySnake: A Fast and Accurate Universal Genome Pre-Alignment Filter for CPUs, GPUs, and FPGAs. CoRR abs/1910.09020 (2019) - [i3]Konstantinos Kanellopoulos, Nandita Vijaykumar, Christina Giannoula, Roknoddin Azizi, Skanda Koppula, Nika Mansouri-Ghiasi, Taha Shahroodi, Juan Gómez-Luna, Onur Mutlu:
SMASH: Co-designing Software Compression and Hardware-Accelerated Indexing for Efficient Sparse Matrix Operations. CoRR abs/1910.10776 (2019) - 2018
- [j11]Rafael Palomar, Juan Gómez-Luna, Faouzi Alaya Cheikh, Joaquín Olivares Bueno, Ole Jakob Elle:
High-Performance Computation of Bézier Surfaces on Parallel and Heterogeneous Platforms. Int. J. Parallel Program. 46(6): 1035-1062 (2018) - [j10]José M. Cecilia, Antonio Llanes, José L. Abellán, Juan Gómez-Luna, Li-Wen Chang, Wen-Mei W. Hwu:
High-throughput Ant Colony Optimization on graphics processing units. J. Parallel Distributed Comput. 113: 261-274 (2018) - [c21]Arash Tavakkol, Juan Gómez-Luna, Mohammad Sadrosadati, Saugata Ghose, Onur Mutlu:
MQSim: A Framework for Enabling Realistic Studies of Modern Multi-Queue SSD Devices. FAST 2018: 49-66 - [c20]Arash Tavakkol, Mohammad Sadrosadati, Saugata Ghose, Jeremie S. Kim, Yixin Luo, Yaohua Wang, Nika Mansouri-Ghiasi, Lois Orosa, Juan Gómez-Luna, Onur Mutlu:
FLIN: Enabling Fairness and Enhancing Performance in Modern NVMe Solid State Drives. ISCA 2018: 397-410 - [i2]A. J. Lázaro-Muñoz, José María González-Linares, Juan Gómez-Luna, Nicolás Guil:
Improving tasks throughput on accelerators using OpenCL command concurrency. CoRR abs/1806.10113 (2018) - [i1]Arash Tavakkol, Aasheesh Kolli, Stanko Novakovic, Kaveh Razavi, Juan Gómez-Luna, Hasan Hassan, Claude Barthels, Yaohua Wang, Mohammad Sadrosadati, Saugata Ghose, Ankit Singla, Pratap Subrahmanyam, Onur Mutlu:
Enabling Efficient RDMA-based Synchronous Mirroring of Persistent Memory Transactions. CoRR abs/1810.09360 (2018) - 2017
- [j9]A. J. Lázaro-Muñoz, José María González-Linares, Juan Gómez-Luna, Nicolás Guil:
A tasks reordering model to reduce transfers overhead on GPUs. J. Parallel Distributed Comput. 109: 258-271 (2017) - [c19]A. J. Lázaro-Muñoz, José María González-Linares, Juan Gómez-Luna, Nicolás Guil:
Efficient OpenCL-based concurrent tasks offloading on accelerators. ICCS 2017: 2353-2357 - [c18]Juan Gómez-Luna, Izzat El Hajj, Li-Wen Chang, Victor Garcia-Flores, Simon Garcia De Gonzalo, Thomas B. Jablin, Antonio J. Peña, Wen-mei W. Hwu:
Chai: Collaborative heterogeneous applications for integrated-architectures. ISPASS 2017: 43-54 - [c17]Li-Wen Chang, Juan Gómez-Luna, Izzat El Hajj, Sitao Huang, Deming Chen, Wen-mei W. Hwu:
Collaborative Computing for Heterogeneous Integrated Systems. ICPE 2017: 385-388 - 2016
- [j8]Gert-Jan van den Braak, Juan Gómez-Luna, José María González-Linares, Henk Corporaal, Nicolás Guil:
Configurable XOR Hash Functions for Banked Scratchpad Memories in GPUs. IEEE Trans. Computers 65(7): 2045-2058 (2016) - [j7]Juan Gómez-Luna, I-Jui Sung, Li-Wen Chang, José María González-Linares, Nicolás Guil, Wen-mei W. Hwu:
In-Place Matrix Transposition on GPUs. IEEE Trans. Parallel Distributed Syst. 27(3): 776-788 (2016) - [c16]Victor Garcia, Juan Gómez-Luna, Thomas Grass, Alejandro Rico, Eduard Ayguadé, Antonio J. Peña:
Evaluating the effect of last-level cache sharing on integrated GPU-CPU systems with heterogeneous applications. IISWC 2016: 168-177 - [c15]Li-Wen Chang, Izzat El Hajj, Christopher I. Rodrigues, Juan Gómez-Luna, Wen-mei W. Hwu:
Efficient kernel synthesis for performance portable programming. MICRO 2016: 12:1-12:13 - [c14]Izzat El Hajj, Juan Gómez-Luna, Cheng Li, Li-Wen Chang, Dejan S. Milojicic, Wen-mei W. Hwu:
KLAP: Kernel launch aggregation and promotion for optimizing dynamic parallelism. MICRO 2016: 13:1-13:12 - [c13]Li-Wen Chang, Izzat El Hajj, Hee-Seok Kim, Juan Gómez-Luna, Abdul Dakkak, Wen-mei W. Hwu:
A programming system for future proofing performance critical libraries. PPoPP 2016: 32:1-32:2 - 2015
- [j6]Julián Ramos Cózar, Manuel J. Marín-Jiménez, José María González-Linares, Nicolás Guil, Juan Gómez-Luna:
Calculation of dense trajectory descriptors on a heterogeneous embedded architecture. J. Syst. Archit. 61(10): 659-667 (2015) - [c12]Juan Gómez-Luna, Li-Wen Chang, I-Jui Sung, Wen-mei W. Hwu, Nicolás Guil:
In-Place Data Sliding Algorithms for Many-Core Architectures. ICPP 2015: 210-219 - 2014
- [c11]Antonio Fuentes-Alventosa, Juan Gómez-Luna, José María González-Linares, Nicolás Guil:
CUVLE: Variable-length encoding on CUDA. DASIP 2014: 1-6 - [c10]Salvador Ibarra-Delgado, Julián Ramos Cózar, José María González-Linares, Juan Gómez-Luna, Nicolás Guil:
Low-textured regions detection for improving stereoscopy algorithms. HPCS 2014: 676-680 - [c9]I-Jui Sung, Juan Gómez-Luna, José María González-Linares, Nicolás Guil, Wen-mei W. Hwu:
In-place transposition of rectangular matrices on accelerators. PPoPP 2014: 207-218 - 2013
- [j5]Juan Gómez-Luna, José María González-Linares, José Ignacio Benavides, Nicolás Guil:
An optimized approach to histogram computation on GPU. Mach. Vis. Appl. 24(5): 899-908 (2013) - [j4]Juan Gómez-Luna, José María González-Linares, José Ignacio Benavides Benítez, Nicolás Guil Mata:
Performance Modeling of Atomic Additions on GPU Scratchpad Memory. IEEE Trans. Parallel Distributed Syst. 24(11): 2273-2282 (2013) - [c8]Gert-Jan van den Braak, Juan Gómez-Luna, Henk Corporaal, José María González-Linares, Nicolás Guil:
Simulation and architecture improvements of atomic operations on GPU scratchpad memory. ICCD 2013: 357-362 - [c7]Salvador Ibarra-Delgado, Manuel Hernandez Calviño, Nicolás Guil Mata, Juan Gómez-Luna:
A robust and low resource FPGA-based stereoscopic vision algorithm. ReConFig 2013: 1-6 - 2012
- [j3]Juan Gómez-Luna, José María González-Linares, José Ignacio Benavides, Nicolás Guil:
Performance models for asynchronous data transfers on consumer Graphics Processing Units. J. Parallel Distributed Comput. 72(9): 1117-1126 (2012) - 2011
- [j2]Juan Gómez-Luna, José María González-Linares, José Ignacio Benavides, Emilio L. Zapata, Nicolás Guil Mata:
Load Balancing versus Occupancy Maximization on Graphics Processing Units: The Generalized Hough Transform as a Case Study. Int. J. High Perform. Comput. Appl. 25(2): 205-222 (2011) - [c6]Juan Gómez-Luna, Holger Endt, Walter Stechele, José María González-Linares, José Ignacio Benavides, Nicolás Guil:
Egomotion compensation and moving objects detection algorithm on GPU. PARCO 2011: 183-190 - [c5]Carlos García-García, Juan Gómez-Luna, Ezequiel Herruzo Gomez, José Ignacio Benavides Benítez:
simARQ, An Automatic Repeat Request Simulator for Teaching Purposes. IT Revolutions 2011: 202-211 - 2010
- [c4]Rafael Palomar, José M. Palomares, José M. Castillo, Joaquín Olivares, Juan Gómez-Luna:
Parallelizing and Optimizing LIP-Canny Using NVIDIA CUDA. IEA/AIE (3) 2010: 389-398
2000 – 2009
- 2009
- [j1]Juan Gómez-Luna, Ezequiel Herruzo, José Ignacio Benavides:
MESI Cache Coherence Simulator for Teaching Purposes. CLEI Electron. J. 12(1) (2009) - [c3]Juan Gómez-Luna, José María González-Linares, José I. Benavides, Nicolás Guil:
Parallelization of a Video Segmentation Algorithm on CUDA-Enabled Graphics Processing Units. Euro-Par 2009: 924-935 - [c2]Sergio Ruben Geninatti, José Ignacio Benavides Benítez, Manuel Hernandez Calviño, Nicolás Guil Mata, Juan Gómez-Luna:
FPGA Implementation of the Generalized Hough Transform. ReConFig 2009: 172-177 - 2008
- [c1]Joaquín Olivares, Juan Gómez-Luna, José M. Palomares, Miguel A. Montijano:
Biprocessor SoC in an FPGA for Teaching Purposes. ICALT 2008: 250-251
last updated on 2024-12-08 02:23 CET by the dblp team
all metadata released as open data under CC0 1.0 license