Folgen
Steve Keckler
Steve Keckler
Vice President of Architecture Research, NVIDIA
Bestätigte E-Mail-Adresse bei cs.utexas.edu - Startseite
Titel
Zitiert von
Zitiert von
Jahr
Modeling the effect of technology trends on the soft error rate of combinational logic
P Shivakumar, M Kistler, SW Keckler, D Burger, L Alvisi
Proceedings International Conference on Dependable Systems and Networks, 389-398, 2002
20522002
Exascale computing study: Technology challenges in achieving exascale systems
P Kogge, K Bergman, S Borkar, D Campbell, W Carson, W Dally, ...
1492*2008
SCNN: An accelerator for compressed-sparse convolutional neural networks
A Parashar, M Rhu, A Mukkara, A Puglielli, R Venkatesan, B Khailany, ...
ACM SIGARCH computer architecture news 45 (2), 27-40, 2017
14812017
An adaptive, non-uniform cache structure for wire-delay dominated on-chip caches
C Kim, D Burger, SW Keckler
ACM SIGPLAN Notices 37 (10), 211-222, 2002
10792002
Clock rate versus IPC: The end of the road for conventional microarchitectures
V Agarwal, MS Hrishikesh, SW Keckler, D Burger
Proceedings of the 27th annual international symposium on Computer …, 2000
10172000
GPUs and the future of parallel computing
SW Keckler, WJ Dally, B Khailany, M Garland, D Glasco
IEEE micro 31 (5), 7-17, 2011
8502011
Exploiting ILP, TLP, and DLP with the polymorphous TRIPS architecture
K Sankaralingam, R Nagarajan, H Liu, C Kim, J Huh, D Burger, ...
Computer Architecture, 2003. Proceedings. 30th Annual International …, 2003
7562003
Research challenges for on-chip interconnection networks
JD Owens, WJ Dally, R Ho, DN Jayasimha, SW Keckler, LS Peh
IEEE micro 27 (5), 96-108, 2007
6562007
Understanding error propagation in deep learning neural network (DNN) accelerators and applications
G Li, SKS Hari, M Sullivan, T Tsai, K Pattabiraman, J Emer, SW Keckler
Proceedings of the International Conference for High Performance Computing …, 2017
6242017
Scaling to the End of Silicon with EDGE Architectures
D Burger, SW Keckler, KS McKinley, M Dahlin, LK John, C Lin, CR Moore, ...
Computer 37 (7), 44-55, 2004
5462004
vDNN: Virtualized deep neural networks for scalable, memory-efficient neural network design
M Rhu, N Gimelshein, J Clemons, A Zulfiqar, SW Keckler
2016 49th Annual IEEE/ACM International Symposium on Microarchitecture …, 2016
5232016
Regional congestion awareness for load balance in networks-on-chip
P Gratz, B Grot, SW Keckler
High Performance Computer Architecture, 2008. HPCA 2008. IEEE 14th …, 2008
5022008
A NUCA substrate for flexible CMP cache sharing
J Huh, C Kim, H Shafi, L Zhang, D Burger, SW Keckler
ACM International Conference on Supercomputing 25th Anniversary Volume, 380-389, 2005
4952005
Timeloop: A systematic approach to dnn accelerator evaluation
A Parashar, P Raina, YS Shao, YH Chen, VA Ying, A Mukkara, ...
2019 IEEE international symposium on performance analysis of systems and …, 2019
4642019
Simba: Scaling deep-learning inference with multi-chip-module-based architecture
YS Shao, J Clemons, R Venkatesan, B Zimmer, M Fojtik, N Jiang, B Keller, ...
Proceedings of the 52nd Annual IEEE/ACM International Symposium on …, 2019
4502019
Sparse convolutional neural network accelerator
WJ Dally, A Parashar, JS Emer, SW Keckler, LR Dennison
US Patent 10,891,538, 2021
3922021
The optimal logic depth per pipeline stage is 6 to 8 FO4 inverter delays
MS Hrishikesh, D Burger, NP Jouppi, SW Keckler, KI Farkas, ...
ACM SIGARCH Computer Architecture News 30 (2), 14-24, 2002
3692002
Energy-efficient mechanisms for managing thread context in throughput processors
M Gebhart, DR Johnson, D Tarjan, SW Keckler, WJ Dally, E Lindholm, ...
Proceedings of the 38th annual international symposium on Computer …, 2011
3502011
Transparent offloading and mapping (TOM) enabling programmer-transparent near-data processing in GPU systems
K Hsieh, E Ebrahimi, G Kim, N Chatterjee, M O'Connor, N Vijaykumar, ...
ACM SIGARCH Computer Architecture News 44 (3), 204-216, 2016
3272016
The m-machine multicomputer
M Fillo, SW Keckler, WJ Dally, NP Carter, A Chang, Y Gurevich, WS Lee
International Journal of Parallel Programming 25 (3), 183-212, 1997
3231997
Das System kann den Vorgang jetzt nicht ausführen. Versuchen Sie es später erneut.
Artikel 1–20