Po-Yao (Bernie) Huang
Po-Yao (Bernie) Huang
Other namesBernie Huang, Poyao Huang
Research Scientist, FAIR Labs, Meta AI
Verified email at - Homepage
Cited by
Cited by
A survey of deep active learning
P Ren, Y Xiao, X Chang, PY Huang, Z Li, BB Gupta, X Chen, X Wang
ACM computing surveys (CSUR) 54 (9), 1-40, 2021
A comprehensive survey of neural architecture search: Challenges and solutions
P Ren, Y Xiao, X Chang, PY Huang, Z Li, X Chen, X Wang
ACM Computing Surveys (CSUR) 54 (4), 1-34, 2021
Videoclip: Contrastive pre-training for zero-shot video-text understanding
H Xu, G Ghosh, PY Huang, D Okhonko, A Aghajanyan, F Metze, ...
EMNLP 2021, 2021
Attention-based multimodal neural machine translation
PY Huang, F Liu, SR Shiang, J Oh, C Dyer
First Conference on Machine Translation (WMT16), 2016
Support-set bottlenecks for video-text representation learning
M Patrick*, PY Huang*, Y Asano*, F Metze, A Hauptmann, J Henriques, ...
ICLR 2021, 2020
Self-Supervised Deep Correlation Tracking
D Yuan, X Chang, PY Huang, Q Liu, Z He
IEEE Transactions on Image Processing (TIP), 2020
Structural analysis and optimization of convolutional neural networks with a small sample size
RN D’souza, PY Huang, FC Yeh
Scientific reports 10 (1), 1-13, 2020
Rcaa: Relational context-aware agents for person search
X Chang, PY Huang, YD Shen, X Liang, Y Yang, AG Hauptmann
ECCV 2018, 2018
Entity hierarchy embedding
Z Hu, PY Huang, Y Deng, Y Gao, E Xing
ACL 2015, 2015
VLM: Task-agnostic Video-Language Model Pre-training for Video Understanding
H Xu, G Ghosh, PY Huang, P Arora, M Aminzadeh, C Feichtenhofer, ...
ACL-Findings 2021, 2021
Cm3: A causal masked multimodal model of the internet
A Aghajanyan, B Huang, C Ross, V Karpukhin, H Xu, N Goyal, D Okhonko, ...
arXiv preprint arXiv:2201.07520, 2022
Masked autoencoders that listen
PY Huang, H Xu, J Li, A Baevski, M Auli, W Galuba, F Metze, ...
NeurIPS 2022, 2022
Multilingual Multimodal Pre-training for Zero-Shot Cross-Lingual Transfer of Vision-Language Models
PY Huang*, M Patrick*, J Hu, G Neubig, F Metze, A Hauptmann
NAACL 2021, 2021
Unsupervised Multimodal Neural Machine Translation with Pseudo Visual Pivoting
PY Huang, J Hu, X Chang, A Hauptmann
ACL 2020, 2020
A survey of scene graph: Generation and application
P Xu, X Chang, L Guo, PY Huang, X Chen, AG Hauptmann
IEEE Trans. Neural Netw. Learn. Syst 1, 2020
Argus: Efficient activity detection system for extended video analysis
W Liu*, G Kang*, PY Huang*, X Chang, Y Qian, J Liang, L Gui, J Wen, ...
WACVW 2020, 2020
RWR-GAE: Random Walk Regularization for Graph Auto Encoders
Vaibhav, PY Huang, R Frederking
arXiv preprint arXiv:1908.04003, 2019
Multi-Head Attention with Diversity for Learning Grounded Multilingual Multimodal Representations
PY Huang, X Chang, A Hauptmann
EMNLP 2019, 2019
Space-time crop & attend: Improving cross-modal video representation learning
M Patrick*, PY Huang*, I Misra, F Metze, A Vedaldi, YM Asano*, ...
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021
Multimodal filtering of social media for temporal monitoring and event analysis
PY Huang, J Liang, JB Lamare, AG Hauptmann
ACM ICMR 2018, 2018
The system can't perform the operation now. Try again later.
Articles 1–20