Po-Yao (Bernie) Huang

Cited by

	All	Since 2019
Citations	4894	4788
h-index	24	24
i10-index	40	38

Citations per year
Year	Citations
2017	37
2018	44
2019	95
2020	152
2021	426
2022	976
2023	2067
2024	1050

Public access (based on funding mandates)
17 articles	available
1 article	not available

Co-authors

Xiaojun Chang, Director of The ReLER Lab and Professor in Artificial Intelligence, University of Technology Sydney (verified email at uts.edu.au)
Alex Hauptmann, Carnegie Mellon University (verified email at cs.cmu.edu)
Florian Metze, Carnegie Mellon University; Meta AI (verified email at andrew.cmu.edu)
Christoph Feichtenhofer, Meta, FAIR (verified email at fb.com)
Hu Xu, Meta AI (FAIR Labs) (verified email at meta.com)
Luke Zettlemoyer, University of Washington; Meta (verified email at cs.washington.edu)
Junwei Liang, Assistant Professor, HKUST (Guangzhou) | CSE, HKUST | Ph.D. @CMU (verified email at hkust-gz.edu.cn)
Mandela Patrick, PhD Student, University of Oxford (verified email at robots.ox.ac.uk)
Billy li (Juncheng), Carnegie Mellon University (verified email at cs.cmu.edu)
Armen Aghajanyan, Facebook AI Research (verified email at fb.com)
Junjie Hu, Assistant Professor, University of Wisconsin-Madison (verified email at wisc.edu)
Yuki M. Asano, Assistant Professor, University of Amsterdam (verified email at uva.nl)
Shang-Wen Daniel Li, FAIR - Research manager (verified email at fb.com)
Jitendra MALIK, Professor of EECS, UC Berkeley (verified email at eecs.berkeley.edu)
Chris Dyer, DeepMind, Carnegie Mellon (verified email at google.com)
Graham Neubig, Carnegie Mellon University (verified email at cs.cmu.edu)

Po-Yao (Bernie) Huang

Other names: Bernie Huang, Poyao Huang

FAIR, Meta

Verified email at fb.com

Research interests: Multimodal machine learning, Multi-modal learning, natural language processing


Title	Authors	Venue	Cited by	Year
A survey of deep active learning	P Ren, Y Xiao, X Chang, PY Huang, Z Li, BB Gupta, X Chen, X Wang	ACM Computing Surveys (CSUR) 54 (9), 1-40, 2021	982	2021
DINOv2: Learning robust visual features without supervision	M Oquab, T Darcet, T Moutakanni, H Vo, M Szafraniec, V Khalidov, ...	arXiv preprint arXiv:2304.07193, 2023	812*	2023
A comprehensive survey of neural architecture search: Challenges and solutions	P Ren, Y Xiao, X Chang, PY Huang, Z Li, X Chen, X Wang	ACM Computing Surveys (CSUR) 54 (4), 1-34, 2021	558	2021
VideoCLIP: Contrastive pre-training for zero-shot video-text understanding	H Xu, G Ghosh, PY Huang, D Okhonko, A Aghajanyan, F Metze, ...	EMNLP 2021, 2021	396	2021
Support-set bottlenecks for video-text representation learning	M Patrick, PY Huang, Y Asano*, F Metze, A Hauptmann, J Henriques, ...	ICLR 2021, 2020	245	2020
Self-Supervised Deep Correlation Tracking	D Yuan, X Chang, PY Huang, Q Liu, Z He	IEEE Transactions on Image Processing (TIP), 2020	232	2020
Attention-based multimodal neural machine translation	PY Huang, F Liu, SR Shiang, J Oh, C Dyer	First Conference on Machine Translation (WMT16), 2016	217	2016
Masked autoencoders that listen	PY Huang, H Xu, J Li, A Baevski, M Auli, W Galuba, F Metze, ...	NeurIPS 2022, 2022	161	2022
Structural analysis and optimization of convolutional neural networks with a small sample size	RN D’souza, PY Huang, FC Yeh	Scientific Reports 10 (1), 834, 2020	119	2020
RCAA: Relational context-aware agents for person search	X Chang, PY Huang, YD Shen, X Liang, Y Yang, AG Hauptmann	ECCV 2018, 2018	112	2018
CM3: A causal masked multimodal model of the internet	A Aghajanyan, B Huang, C Ross, V Karpukhin, H Xu, N Goyal, D Okhonko, ...	arXiv preprint arXiv:2201.07520, 2022	111	2022
VLM: Task-agnostic Video-Language Model Pre-training for Video Understanding	H Xu, G Ghosh, PY Huang, P Arora, M Aminzadeh, C Feichtenhofer, ...	ACL-Findings 2021, 2021	108	2021
Video pivoting unsupervised multi-modal machine translation	M Li, PY Huang, X Chang, J Hu, Y Yang, A Hauptmann	IEEE Transactions on Pattern Analysis and Machine Intelligence 45 (3), 3918-3932, 2022	100	2022
Entity hierarchy embedding	Z Hu, PY Huang, Y Deng, Y Gao, E Xing	ACL 2015, 2015	88	2015
Multilingual Multimodal Pre-training for Zero-Shot Cross-Lingual Transfer of Vision-Language Models	PY Huang, M Patrick, J Hu, G Neubig, F Metze, A Hauptmann	NAACL 2021, 2021	56	2021
Unsupervised Multimodal Neural Machine Translation with Pseudo Visual Pivoting	PY Huang, J Hu, X Chang, A Hauptmann	ACL 2020, 2020	42	2020
Argus: Efficient activity detection system for extended video analysis	W Liu, G Kang, PY Huang*, X Chang, Y Qian, J Liang, L Gui, J Wen, ...	WACVW 2020, 2020	40	2020
Space-time crop & attend: Improving cross-modal video representation learning	M Patrick, PY Huang, I Misra, F Metze, A Vedaldi, YM Asano*, ...	ICCV 2021, 2021	36	2021
SeamlessM4T-Massively Multilingual & Multimodal Machine Translation	L Barrault, YA Chung, MC Meglioli, D Dale, N Dong, PA Duquenne, ...	arXiv preprint arXiv:2308.11596, 2023	34	2023
RWR-GAE: Random Walk Regularization for Graph Auto Encoders	Vaibhav, PY Huang, R Frederking	arXiv preprint arXiv:1908.04003, 2019	34*	2019
