Abdullah Kayi
Experimental evaluation of emerging multi-core architectures
A Kayi, Y Yao, T El-Ghazawi, G Newby
2007 IEEE International Parallel and Distributed Processing Symposium, 1-6, 2007
Performance issues in emerging homogeneous multi-core architectures
A Kayi, T El-Ghazawi, GB Newby
Simulation Modelling Practice and Theory 17 (9), 1485-1499, 2009
Comparing runtime systems with exascale ambitions using the parallel research kernels
RF Van der Wijngaart, A Kayi, JR Hammond, G Jost, T St. John, ...
High Performance Computing: 31st International Conference, ISC High …, 2016
A Highly-Efficient Distributed Deep Learning System For Automatic Speech Recognition
W Zhang, X Cui, U Finkler, G Saon, A Kayi, ...
Interspeech, 2019
Adaptive cache coherence mechanisms with producer–consumer sharing optimization for chip multiprocessors
A Kayi, O Serres, T El-Ghazawi
IEEE Transactions on Computers 64 (2), 316-328, 2013
Improving efficiency in large-scale decentralized distributed training
W Zhang, X Cui, A Kayi, M Liu, U Finkler, B Kingsbury, G Saon, Y Mroueh, ...
ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and …, 2020
Address translation optimization for Unified Parallel C multi-dimensional arrays
O Serres, A Anbar, SG Merchant, A Kayi, T El-Ghazawi
2011 IEEE International Symposium on Parallel and Distributed Processing …, 2011
An adaptive cache coherence protocol for chip multiprocessors
A Kayi, T El-Ghazawi
Proceedings of the Second International Forum on Next-Generation Multicore …, 2010
Using the parallel research kernels to study PGAS models
RF Van der Wijngaart, S Sridharan, A Kayi, G Jost, JR Hammond, ...
2015 9th International Conference on Partitioned Global Address Space …, 2015
Performance evaluation of clusters with ccNUMA nodes - a case study
A Kayi, E Kornkven, T El-Ghazawi, S Al-Bahra, GB Newby
2008 10th IEEE International Conference on High Performance Computing and …, 2008
Disaggregated system domain
JA Kahle, CR Johns, C Evangelinos, A Kayi
US Patent 11,561,844, 2023
Application performance tuning for clusters with ccNUMA nodes
A Kayi, E Kornkven, T El-Ghazawi, G Newby
2008 11th IEEE International Conference on Computational Science and …, 2008
Asynchronous decentralized distributed training of acoustic models
X Cui, W Zhang, A Kayi, M Liu, U Finkler, B Kingsbury, G Saon, D Kung
IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 3565-3576, 2021
Enabling PGAS productivity with hardware support for shared address mapping: A UPC case study
O Serres, A Kayi, A Anbar, T El-Ghazawi
ACM Transactions on Architecture and Code Optimization (TACO) 12 (4), 1-26, 2015
Hardware support for address mapping in PGAS languages: a UPC case study
O Serres, A Kayi, A Anbar, T El-Ghazawi
Proceedings of the 11th ACM Conference on Computing Frontiers, 1-2, 2014
Performance analysis and tuning for clusters with ccNUMA nodes for scientific computing--A case study
A Kayi, E Kornkven, T El-Ghazawi, S Al-Bahra, GB Newby
Computer Systems Science and Engineering 24 (5), 291, 2010
Dynamic computation rates for distributed deep learning
W Zhang, X Cui, A Kayi, A Buyuktosunoglu
US Patent 11,977,986, 2024
Updating of statistical sets for decentralized distributed training of a machine learning model
X Cui, W Zhang, M Liu, A Kayi, Y Mroueh, A Buyuktosunoglu
US Patent 11,636,280, 2023
Data movement accelerator engines on a prototype Power10 processor
Y Sugawara, D Chen, RA Haring, A Kayi, E Ratzlaff, RM Senger, ...
IEEE Micro 43 (1), 67-75, 2022
Data shuffling with hierarchical tuple spaces
CHA Costa, A Kayi, Y Park, C Johns
US Patent 10,956,125, 2021