Ambit: In-memory accelerator for bulk bitwise operations using commodity DRAM technology V Seshadri, D Lee, T Mullins, H Hassan, A Boroumand, J Kim, MA Kozuch, ... Proceedings of the 50th Annual IEEE/ACM International Symposium on …, 2017 | 656 | 2017 |
Google workloads for consumer devices: Mitigating data movement bottlenecks A Boroumand, S Ghose, Y Kim, R Ausavarungnirun, E Shiu, R Thakur, ... Proceedings of the Twenty-Third International Conference on Architectural …, 2018 | 426 | 2018 |
Accelerating pointer chasing in 3D-stacked memory: Challenges, mechanisms, evaluation K Hsieh, S Khan, N Vijaykumar, KK Chang, A Boroumand, S Ghose, ... 2016 IEEE 34th International Conference on Computer Design (ICCD), 25-32, 2016 | 256 | 2016 |
Fast bulk bitwise AND and OR in DRAM V Seshadri, K Hsieh, A Boroum, D Lee, MA Kozuch, O Mutlu, PB Gibbons, ... IEEE Computer Architecture Letters 14 (2), 127-131, 2015 | 250 | 2015 |
LazyPIM: An efficient cache coherence mechanism for processing-in-memory A Boroumand, S Ghose, M Patel, H Hassan, B Lucia, K Hsieh, KT Malladi, ... IEEE Computer Architecture Letters 16 (1), 46-50, 2016 | 220 | 2016 |
Processing-in-memory: A workload-driven perspective S Ghose, A Boroumand, JS Kim, J Gómez-Luna, O Mutlu IBM Journal of Research and Development 63 (6), 3: 1-3: 19, 2019 | 208 | 2019 |
Gather-scatter DRAM: In-DRAM address translation to improve the spatial locality of non-unit strided accesses V Seshadri, T Mullins, A Boroumand, O Mutlu, PB Gibbons, MA Kozuch, ... Proceedings of the 48th International Symposium on Microarchitecture, 267-280, 2015 | 168 | 2015 |
GenASM: A high-performance, low-power approximate string matching acceleration framework for genome sequence analysis DS Cali, GS Kalsi, Z Bingöl, C Firtina, L Subramanian, JS Kim, ... 2020 53rd Annual IEEE/ACM International Symposium on Microarchitecture …, 2020 | 147 | 2020 |
CoNDA: Efficient cache coherence support for near-data accelerators A Boroumand, S Ghose, M Patel, H Hassan, B Lucia, R Ausavarungnirun, ... Proceedings of the 46th International Symposium on Computer Architecture …, 2019 | 135 | 2019 |
Google neural network models for edge devices: Analyzing and mitigating machine learning inference bottlenecks A Boroumand, S Ghose, B Akin, R Narayanaswami, GF Oliveira, X Ma, ... 2021 30th International Conference on Parallel Architectures and Compilation …, 2021 | 91 | 2021 |
Enabling the adoption of processing-in-memory: Challenges, mechanisms, future research directions S Ghose, K Hsieh, A Boroumand, R Ausavarungnirun, O Mutlu arXiv preprint arXiv:1802.00320, 2018 | 89 | 2018 |
Buddy-RAM: Improving the performance and efficiency of bulk bitwise operations using DRAM V Seshadri, D Lee, T Mullins, H Hassan, A Boroumand, J Kim, MA Kozuch, ... arXiv preprint arXiv:1611.09988, 2016 | 88 | 2016 |
Polynesia: Enabling effective hybrid transactional/analytical databases with specialized hardware/software co-design A Boroumand, S Ghose, GF Oliveira, O Mutlu arXiv preprint arXiv:2103.00798, 2021 | 42 | 2021 |
The processing-in-memory paradigm: Mechanisms to enable adoption S Ghose, K Hsieh, A Boroumand, R Ausavarungnirun, O Mutlu Beyond-CMOS Technologies for Next Generation Computer Design, 133-194, 2019 | 37 | 2019 |
Mitigating edge machine learning inference bottlenecks: An empirical study on accelerating google edge models A Boroumand, S Ghose, B Akin, R Narayanaswami, GF Oliveira, X Ma, ... arXiv preprint arXiv:2103.00768, 2021 | 33 | 2021 |
LazyPIM: Efficient support for cache coherence in processing-in-memory architectures A Boroumand, S Ghose, M Patel, H Hassan, B Lucia, N Hajinazar, ... arXiv preprint arXiv:1706.03162, 2017 | 23 | 2017 |
Practical Mechanisms for Reducing Processor–Memory Data Movement in Modern Workloads A Boroumand Carnegie Mellon University, 2020 | 22 | 2020 |
Accelerating neural network inference with processing-in-dram: From the edge to the cloud GF Oliveira, J Gómez-Luna, S Ghose, A Boroumand, O Mutlu IEEE Micro 42 (6), 25-38, 2022 | 21 | 2022 |
Polynesia: Enabling High-Performance and Energy-Efficient Hybrid Transactional/Analytical Databases with Hardware/Software Co-Design A Boroumand, S Ghose, GF Oliveira, O Mutlu 2022 IEEE 38th International Conference on Data Engineering (ICDE), 2997-3011, 2022 | 16 | 2022 |
Using ECC DRAM to adaptively increase memory capacity Y Luo, S Ghose, T Li, S Govindan, B Sharma, B Kelly, A Boroumand, ... arXiv preprint arXiv:1706.08870, 2017 | 11 | 2017 |