Follow
Abdullah Kayi
Title
Cited by
Cited by
Year
Experimental evaluation of emerging multi-core architectures
A Kayi, Y Yao, T El-Ghazawi, G Newby
2007 IEEE International Parallel and Distributed Processing Symposium, 1-6, 2007
392007
Performance issues in emerging homogeneous multi-core architectures
A Kayi, T El-Ghazawi, GB Newby
Simulation Modelling Practice and Theory 17 (9), 1485-1499, 2009
332009
Comparing runtime systems with exascale ambitions using the parallel research kernels
RF Van der Wijngaart, A Kayi, JR Hammond, G Jost, T St. John, ...
High Performance Computing: 31st International Conference, ISC High …, 2016
262016
A Highly-Efficient Distributed Deep Learning System For Automatic Speech Recognition
MP Wei Zhang, Xiaodong Cui, Ulrich Finkler, George Saon, Abdullah Kayi ...
Interspeech, 2019
22*2019
Adaptive cache coherence mechanisms with producer–consumer sharing optimization for chip multiprocessors
A Kayi, O Serres, T El-Ghazawi
IEEE Transactions on Computers 64 (2), 316-328, 2013
192013
Improving efficiency in large-scale decentralized distributed training
W Zhang, X Cui, A Kayi, M Liu, U Finkler, B Kingsbury, G Saon, Y Mroueh, ...
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
152020
Address translation optimization for Unified Parallel C multi-dimensional arrays
O Serres, A Anbar, SG Merchant, A Kayi, T El-Ghazawi
2011 IEEE International Symposium on Parallel and Distributed Processing …, 2011
152011
An adaptive cache coherence protocol for chip multiprocessors
A Kayi, T El-Ghazawi
Proceedings of the Second International Forum on Next-Generation Multicore …, 2010
152010
Using the parallel research kernels to study PGAS models
RF Van der Wijngaart, S Sridharan, A Kayi, G Jost, JR Hammond, ...
2015 9th International Conference on Partitioned Global Address Space …, 2015
142015
Performance evaluation of clusters with ccnuma nodes-a case study
A Kayi, E Kornkven, T El-Ghazawi, S Al-Bahra, GB Newby
2008 10th IEEE International Conference on High Performance Computing and …, 2008
122008
Disaggregated system domain
JA Kahle, CR Johns, C Evangelinos, A Kayi
US Patent 11,561,844, 2023
102023
Application performance tuning for clusters with ccnuma nodes
A Kayi, E Kornkven, T El-Ghazawi, G Newby
2008 11th IEEE international conference on computational science and …, 2008
102008
Asynchronous decentralized distributed training of acoustic models
X Cui, W Zhang, A Kayi, M Liu, U Finkler, B Kingsbury, G Saon, D Kung
IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 3565-3576, 2021
42021
Enabling PGAS productivity with hardware support for shared address mapping: A UPC case study
O Serres, A Kayi, A Anbar, T El-Ghazawi
ACM Transactions on Architecture and Code Optimization (TACO) 12 (4), 1-26, 2015
42015
Hardware support for address mapping in PGAS languages: a UPC case study
O Serres, A Kayi, A Anbar, T El-Ghazawi
Proceedings of the 11th ACM Conference on Computing Frontiers, 1-2, 2014
42014
Performance analysis and tuning for clusters with ccNUMA nodes for scientific computing--A case study
A Kayi, E Kornkven, T El-Ghazawi, S Al-Bahra, GB Newby
Computer Systems Science and Engineering 24 (5), 291, 2010
42010
Dynamic computation rates for distributed deep learning
W Zhang, X Cui, A Kayi, A Buyuktosunoglu
US Patent 11,977,986, 2024
22024
Updating of statistical sets for decentralized distributed training of a machine learning model
X Cui, W Zhang, M Liu, A Kayi, Y Mroueh, A Buyuktosunoglu
US Patent 11,636,280, 2023
22023
Data movement accelerator engines on a prototype power10 processor
Y Sugawara, D Chen, RA Haring, A Kayi, E Ratzlaff, RM Senger, ...
IEEE Micro 43 (1), 67-75, 2022
22022
Data shuffling with hierarchical tuple spaces
CHA Costa, A Kayi, Y Park, C Johns
US Patent 10,956,125, 2021
22021
The system can't perform the operation now. Try again later.
Articles 1–20