Nitish Shirish Keskar

Cited by

	All	Since 2019
Citations	11975	11286
h-index	28	27
i10-index	41	41

Citations per year
Year	2017	2018	2019	2020	2021	2022	2023	2024
Citations	136	504	1004	1556	1693	2174	3005	1843

Public access

5 articles available
0 articles not available

Based on funding mandates

Co-authors

Richard Socher	you.com	Verified email at stanford.edu
Caiming Xiong	Salesforce Research	Verified email at salesforce.com
Bryan McCann	You.com	Verified email at you.com
Jorge Nocedal	Professor, Industrial Engineering, Northwestern University	Verified email at NORTHWESTERN.EDU
Dheevatsa Mudigere	Distinguished Engineer, NVIDIA	Verified email at nvidia.com
Mikhail Smelyanskiy	Facebook	Verified email at intel.com
Lav R. Varshney	University of Illinois Urbana-Champaign	Verified email at illinois.edu
Stephen Merity		Verified email at smerity.com
Nikhil Naik	MIT	Verified email at mit.edu
Akhilesh Deepak Gotmare	Salesforce Research	Verified email at salesforce.com
Ali Madani	Profluent Bio	Verified email at berkeley.edu
Nazneen Rajani	Hugging Face	Verified email at huggingface.co
Huan Wang	Salesforce Research	Verified email at yale.edu
Semih Yavuz	Salesforce Research	Verified email at salesforce.com
Albert S. Berahas	Assistant Professor, University of Michigan	Verified email at umich.edu
Karim Ahmed	Dartmouth College, Samsung Research America	Verified email at dartmouth.edu
Tong Niu	Salesforce Research	Verified email at salesforce.com
Raphael R Eguchi	Stanford University	Verified email at alumni.stanford.edu
Jasdeep Singh	Stanford University	Verified email at stanford.edu
Wojciech Kryściński	Cohere	Verified email at cohere.com


Nitish Shirish Keskar

OpenAI

Verified email at openai.com

Deep Learning, Mathematical Optimization, Natural Language Processing


Title	Authors	Venue	Cited by	Year
On large-batch training for deep learning: Generalization gap and sharp minima	NS Keskar, D Mudigere, J Nocedal, M Smelyanskiy, PTP Tang	arXiv preprint arXiv:1609.04836, 2016	3342	2016
GPT-4 technical report	J Achiam, S Adler, S Agarwal, L Ahmad, I Akkaya, FL Aleman, D Almeida, ...	arXiv preprint arXiv:2303.08774, 2023	1372*	2023
Regularizing and optimizing LSTM language models	S Merity, NS Keskar, R Socher	arXiv preprint arXiv:1708.02182, 2017	1266	2017
CTRL: A conditional transformer language model for controllable generation	NS Keskar, B McCann, LR Varshney, C Xiong, R Socher	arXiv preprint arXiv:1909.05858, 2019	1093	2019
Beyond the imitation game: Quantifying and extrapolating the capabilities of language models	A Srivastava, A Rastogi, A Rao, AAM Shoeb, A Abid, A Fisch, AR Brown, ...	arXiv preprint arXiv:2206.04615, 2022	732	2022
The natural language decathlon: Multitask learning as question answering	B McCann, NS Keskar, C Xiong, R Socher	arXiv preprint arXiv:1806.08730, 2018	650	2018
Improving generalization performance by switching from Adam to SGD	NS Keskar, R Socher	arXiv preprint arXiv:1712.07628, 2017	624	2017
Neural text summarization: A critical evaluation	W Kryściński, NS Keskar, B McCann, C Xiong, R Socher	arXiv preprint arXiv:1908.08960, 2019	377	2019
GeDi: Generative discriminator guided sequence generation	B Krause, AD Gotmare, B McCann, NS Keskar, S Joty, R Socher, ...	arXiv preprint arXiv:2009.06367, 2020	298	2020
A closer look at deep learning heuristics: Learning rate restarts, warmup and distillation	A Gotmare, NS Keskar, C Xiong, R Socher	arXiv preprint arXiv:1810.13243, 2018	276	2018
ProGen: Language modeling for protein generation	A Madani, B McCann, N Naik, NS Keskar, N Anand, RR Eguchi, ...	arXiv preprint arXiv:2004.03497, 2020	232	2020
An analysis of neural language modeling at multiple scales	S Merity, NS Keskar, R Socher	arXiv preprint arXiv:1803.08240, 2018	188	2018
Deep learning-enabled breast cancer hormonal receptor status determination from base-level H&E stains	N Naik, A Madani, A Esteva, NS Keskar, MF Press, D Ruderman, DB Agus, ...	Nature Communications 11 (1), 5727, 2020	175	2020
Weighted transformer network for machine translation	K Ahmed, NS Keskar, R Socher	arXiv preprint arXiv:1711.02132, 2017	155	2017
Balancing communication and computation in distributed optimization	AS Berahas, R Bollapragada, NS Keskar, E Wei	IEEE Transactions on Automatic Control 64 (8), 3141-3155, 2018	114	2018
Sequence-to-sequence prediction using a neural network model	NS Keskar, K Ahmed, R Socher	US Patent 11,928,600, 2024	107	2024
Multitask learning as question answering	NS Keskar, B McCann, C Xiong, R Socher	US Patent 11,501,076, 2022	86	2022
Multitask learning as question answering	B McCann, NS Keskar, C Xiong, R Socher	US Patent 10,776,581, 2020	83	2020
Hybrid training of deep networks	NS Keskar, R Socher	US Patent 11,276,002, 2022	78	2022
XLDA: Cross-lingual data augmentation for natural language inference and question answering	J Singh, B McCann, NS Keskar, C Xiong, R Socher	arXiv preprint arXiv:1905.11471, 2019	77	2019
