Punit Singh Koura
Meta AI
Verified email at fb.com
Title · Cited by · Year
Llama 2: Open foundation and fine-tuned chat models
H Touvron, L Martin, K Stone, P Albert, A Almahairi, Y Babaei, ...
arXiv preprint arXiv:2307.09288, 2023
12424 · 2023
OPT: Open pre-trained transformer language models
S Zhang, S Roller, N Goyal, M Artetxe, M Chen, S Chen, C Dewan, ...
arXiv preprint arXiv:2205.01068, 2022
3677* · 2022
The Llama 3 herd of models
A Dubey, A Jauhri, A Pandey, A Kadian, A Al-Dahle, A Letman, A Mathur, ...
arXiv preprint arXiv:2407.21783, 2024
2281 · 2024
Efficient large scale language modeling with mixtures of experts
M Artetxe, S Bhosale, N Goyal, T Mihaylov, M Ott, S Shleifer, XV Lin, J Du, ...
arXiv preprint arXiv:2112.10684, 2021
149* · 2021
Few-shot Learning with Multilingual Generative Language Models
XV Lin, T Mihaylov, M Artetxe, T Wang, S Chen, D Simig, M Ott, N Goyal, ...
Proceedings of the 2022 Conference on Empirical Methods in Natural Language …, 2022
131* · 2022
OPT-IML: Scaling language model instruction meta learning through the lens of generalization
S Iyer, XV Lin, R Pasunuru, T Mihaylov, D Simig, P Yu, K Shuster, T Wang, ...
arXiv preprint arXiv:2212.12017, 2022
102 · 2022
A theory on Adam instability in large-scale machine learning
I Molybog, P Albert, M Chen, Z DeVito, D Esiobu, N Goyal, PS Koura, ...
arXiv preprint arXiv:2304.09871, 2023
26 · 2023
Llama 2: Open foundation and fine-tuned chat models. arXiv [Preprint] (2023)
H Touvron, L Martin, K Stone, P Albert, A Almahairi, Y Babaei, ...
URL https://arxiv.org/abs/2307.09288
14
BTS: Harmonizing Specialized Experts into a Generalist LLM
Q Zhang, P Bhargava, C Bi, CX Cai, J Foerster, J Fu, PS Koura, R Silva, ...
arXiv preprint arXiv:2502.00075, 2025
2025
Optimizing Pretraining Data Mixtures with LLM-Estimated Utility
W Held, B Paranjape, PS Koura, M Lewis, F Zhang, T Mihaylov
arXiv preprint arXiv:2501.11747, 2025
2025
Articles 1–10