VideoCLIP: Contrastive pre-training for zero-shot video-text understanding. H Xu, G Ghosh, PY Huang, D Okhonko, A Aghajanyan, F Metze, et al. arXiv preprint arXiv:2109.14084, 2021. Cited by 253.
Muppet: Massive multi-task representations with pre-finetuning. A Aghajanyan, A Gupta, A Shrivastava, X Chen, L Zettlemoyer, S Gupta. arXiv preprint arXiv:2101.11038, 2021. Cited by 201.
InCoder: A generative model for code infilling and synthesis. D Fried, A Aghajanyan, J Lin, S Wang, E Wallace, F Shi, R Zhong, W Yih, et al. arXiv preprint arXiv:2204.05999, 2022. Cited by 191.
Better fine-tuning by reducing representational collapse. A Aghajanyan, A Shrivastava, A Gupta, N Goyal, L Zettlemoyer, S Gupta. arXiv preprint arXiv:2008.03156, 2020. Cited by 166.
Intrinsic dimensionality explains the effectiveness of language model fine-tuning. A Aghajanyan, L Zettlemoyer, S Gupta. arXiv preprint arXiv:2012.13255, 2020. Cited by 154.
Pre-training via paraphrasing. M Lewis, M Ghazvininejad, G Ghosh, A Aghajanyan, S Wang, et al. Advances in Neural Information Processing Systems 33, 18470-18481, 2020. Cited by 122.
CM3: A causal masked multimodal model of the internet. A Aghajanyan, B Huang, C Ross, V Karpukhin, H Xu, N Goyal, D Okhonko, et al. arXiv preprint arXiv:2201.07520, 2022. Cited by 68.
Memorization without overfitting: Analyzing the training dynamics of large language models. K Tirumala, A Markosyan, L Zettlemoyer, A Aghajanyan. Advances in Neural Information Processing Systems 35, 38274-38290, 2022. Cited by 56.
Conversational semantic parsing. A Aghajanyan, J Maillard, A Shrivastava, K Diedrick, M Haeger, H Li, et al. arXiv preprint arXiv:2009.13655, 2020. Cited by 48.
HTLM: Hyper-text pre-training and prompting of language models. A Aghajanyan, D Okhonko, M Lewis, M Joshi, H Xu, G Ghosh. International Conference on Learning Representations, 2022. Cited by 42*.
Improving passage retrieval with zero-shot question generation. DS Sachan, M Lewis, M Joshi, A Aghajanyan, W Yih, J Pineau, et al. arXiv preprint arXiv:2204.07496, 2022. Cited by 29.
On-device convolutional neural network models for assistant systems. A Aly, A Babu, A Aghajanyan. US Patent 11,314,941, 2022. Cited by 23.
Semantic representations using structural ontology for assistant systems. A Aghajanyan, S Gupta, B Moran, TF Levin, CANSH Nakatsu, D Difranco, et al. US Patent 11,651,449, 2023. Cited by 22.
SoftTarget regularization: An effective technique to reduce over-fitting in neural networks. A Aghajanyan. 2017 3rd IEEE International Conference on Cybernetics (CYBCONF), 1-5, 2017. Cited by 20.
Scaling laws for generative mixed-modal language models. A Aghajanyan, L Yu, A Conneau, WN Hsu, K Hambardzumyan, S Zhang, et al. arXiv preprint arXiv:2301.03728, 2023. Cited by 18.
Non-autoregressive semantic parsing for compositional task-oriented dialog. A Babu, A Shrivastava, A Aghajanyan, A Aly, A Fan, M Ghazvininejad. arXiv preprint arXiv:2104.04923, 2021. Cited by 17.
RetroNLU: Retrieval augmented task-oriented semantic parsing. V Gupta, A Shrivastava, A Sagar, A Aghajanyan, D Savenkov. arXiv preprint arXiv:2109.10410, 2021. Cited by 16.
Retrieval-augmented multimodal language modeling. M Yasunaga, A Aghajanyan, W Shi, R James, J Leskovec, P Liang, et al. 2023. Cited by 10.
MEGABYTE: Predicting million-byte sequences with multiscale transformers. L Yu, D Simig, C Flaherty, A Aghajanyan, L Zettlemoyer, M Lewis. arXiv preprint arXiv:2305.07185, 2023. Cited by 10.
Convolution aware initialization. A Aghajanyan. arXiv preprint arXiv:1702.06295, 2017. Cited by 9.