Maxim Naumov
Title
Cited by
Cited by
Year
Atomistic simulation of realistically sized nanodevices using NEMO 3-D—Part I: Models and benchmarks
G Klimeck, SS Ahmed, H Bae, N Kharche, S Clark, B Haley, S Lee, ...
IEEE Transactions on Electron Devices 54 (9), 2079-2089, 2007
2612007
Parallel solution of sparse triangular linear systems in the preconditioned iterative methods on the GPU
M Naumov
Nvidia Technical Report NVR-2011-001, 2011
1222011
Incomplete-LU and Cholesky preconditioned iterative methods using CUSPARSE and CUBLAS
M Naumov
Nvidia White Paper, 2011
852011
Deep Learning Recommendation Model for Personalization and Recommendation Systems
M Naumov, D Mudigere, HJM Shi, J Huang, N Sundaraman, J Park, ...
arXiv preprint arXiv:1906.00091, 2019
692019
Multimillion Atom Simulation of Electronic and Optical Properties of Nanoscale Devices Using NEMO 3-D
S Ahmed, N Kharche, R Rahman, M Usman, S Lee, H Ryu, H Bae, ...
Encyclopedia of Complexity and Systems Science, 1-69, 2015
69*2015
AdaBatch: Adaptive Batch Sizes for Training Deep Neural Networks
A Devarakonda, M Naumov, M Garland
arXiv preprint arXiv:1712.02029, 2017
662017
AmgX: A Library for GPU Accelerated Algebraic Multigrid and Preconditioned Iterative Methods
M Naumov, M Arsaev, P Castonguay, J Cohen, J Demouth, J Eaton, ...
SIAM Journal on Scientific Computing 37 (5), S602-S626, 2015
602015
Deep Learning Inference in Facebook Data Centers: Characterization, Performance Optimizations and Hardware Implications
J Park, M Naumov, P Basu, S Deng, A Kalaiah, D Khudia, J Law, P Malani, ...
arXiv preprint arXiv:1811.09886, 2018
582018
CUSPARSE Library: A Set of Basic Linear Algebra Subroutines for Sparse Matrices
M Naumov, LS Chien, P Vandermersch, U Kapasi
GPU Technology Conference (GTC), 2010
49*2010
Parallel Graph Coloring with Applications to the Incomplete-LU Factorization on the GPU
M Naumov, P Castonguay, J Cohen
Nvidia Technical Report NVR-2015-001, 2015
442015
The architectural implications of Facebook's DNN-based personalized recommendation
U Gupta, CJ Wu, X Wang, M Naumov, B Reagen, D Brooks, B Cottel, ...
IEEE International Symposium on High Performance Computer Architecture (HPCA†…, 2020
412020
Exact calculation of entanglement in a 19-site two-dimensional spin system
Q Xu, S Kais, M Naumov, A Sameh
Physical Review A 81 (2), 022324, 2010
262010
Parallel incomplete-LU and Cholesky factorization in the preconditioned iterative methods on the GPU
M Naumov
NVIDIA Technical Report NVR-2012-003, 2012
242012
Bandana: Using Non-volatile Memory for Storing Deep Learning Models
A Eisenman, M Naumov, D Gardner, M Smelyanskiy, S Pupyrev, ...
Conference on Machine Learning and Systems (MLSys), 2019
212019
A tearing-based hybrid parallel banded linear system solver
M Naumov, A Sameh
Journal of Computational and Applied Mathematics 226 (2), 306-318, 2009
212009
Preconditioned Block‐Iterative Methods on GPUs
M Naumov
Proc. Applied Mathematics and Mechanics 12 (1), 11-14, 2012
152012
A tearing-based hybrid parallel sparse linear system solver
M Naumov, M Manguoglu, A Sameh
Journal of Computational and Applied Mathematics 234 (10), 3025-3038, 2010
132010
Parallel Spectral Graph Partitioning
M Naumov, T Moon
Nvidia Technical Report NVR-2016-001, 2016
112016
On the Dimensionality of Embeddings for Sparse Features and Data
M Naumov
arXiv preprint arXiv:1901.02103, 2019
102019
Eigenvalue solvers for atomistic simulations of electronic structures with NEMO-3D
M Naumov, S Lee, B Haley, H Bae, S Clark, R Rahman, H Ryu, F Saied, ...
Journal of Computational Electronics 7 (3), 297-300, 2008
102008
The system can't perform the operation now. Try again later.
Articles 1–20