Linjie (Lindsey) Li

Zitiert von

	Alle	Seit 2019
Zitate	8794	8779
h-index	34	34
i10-index	41	40

3200

1600

800

2400

20192020202120222023202425 231 814 1521 3059 3118

Öffentlicher Zugriff

Alle anzeigen

8 Artikel

0 Artikel

verfügbar

nicht verfügbar

Basierend auf Fördermandaten

Koautoren

Lijuan WangMicrosoft GenAIBestätigte E-Mail-Adresse bei microsoft.com
Zhe GanResearch Scientist, AppleBestätigte E-Mail-Adresse bei apple.com
Zicheng LiuMicrosoftBestätigte E-Mail-Adresse bei microsoft.com
Kevin LinMicrosoftBestätigte E-Mail-Adresse bei microsoft.com
Zhengyuan YangResearcher, MicrosoftBestätigte E-Mail-Adresse bei microsoft.com
Jianfeng WangMicrosoftBestätigte E-Mail-Adresse bei microsoft.com
Yu ChengThe Chinese University of Hong KongBestätigte E-Mail-Adresse bei cse.cuhk.edu.hk
Yen-Chun ChenResearcher, MicrosoftBestätigte E-Mail-Adresse bei microsoft.com
Licheng Yu 虞立成Research Scientist and Manager, Facebook AIBestätigte E-Mail-Adresse bei fb.com
Jianfeng GaoMicrosoft Research, RedmondBestätigte E-Mail-Adresse bei microsoft.com
Jianwei YangPrincipal Researcher, Microsoft Research, RedmondBestätigte E-Mail-Adresse bei microsoft.com
Jie Lei 雷杰Research Scientist, Meta AIBestätigte E-Mail-Adresse bei fb.com
Chunyuan LiMicrosoft Research, RedmondBestätigte E-Mail-Adresse bei microsoft.com
Siqi SunAssociate Professor; Fudan University, Shanghai AI LabBestätigte E-Mail-Adresse bei fudan.edu.cn

Folgen

Linjie (Lindsey) Li

Senior Researcher, Microsoft

Bestätigte E-Mail-Adresse bei microsoft.com

Vision+Language


Titel Nach Zitationen sortieren Nach Jahr sortieren Nach Titel sortieren	Zitiert von Zitiert von	Jahr
UNITER: Learning UNiversal Image-TExt Representations YC Chen, L Li, L Yu, AE Kholy, F Ahmed, Z Gan, Y Cheng, J Liu ECCV 2020, 2020	2404*	2020
Less is More: ClipBERT for Video-and-Language Learning via Sparse Sampling J Lei, L Li, L Zhou, Z Gan, TL Berg, M Bansal, J Liu CVPR 2021, 2021	641	2021
HERO: Hierarchical Encoder for Video+ Language Omni-representation Pre-training L Li, YC Chen, Y Cheng, Z Gan, L Yu, J Liu EMNLP 2020, 2020	499	2020
Large-Scale Adversarial Training for Vision-and-Language Representation Learning Z Gan, YC Chen, L Li, C Zhu, Y Cheng, J Liu NeurIPS 2020, 2020	488	2020
GIT: A Generative Image-to-text Transformer for Vision and Language J Wang, Z Yang, X Hu, L Li, K Lin, Z Gan, Z Liu, C Liu, L Wang TMLR, 2022	419	2022
Relation-aware graph attention network for visual question answering L Li, Z Gan, Y Cheng, J Liu ICCV 2019, 2019	397	2019
Improving image generation with better captions J Betker, G Goh, L Jing, T Brooks, J Wang, L Li, L Ouyang, J Zhuang, ... Computer Science. https://cdn. openai. com/papers/dall-e-3. pdf 2 (3), 8, 2023	361	2023
The dawn of lmms: Preliminary explorations with gpt-4v (ision) Z Yang, L Li, K Lin, J Wang, CC Lin, Z Liu, L Wang arXiv preprint arXiv:2309.17421 9, 1, 2023	345	2023
Segment Everything Everywhere All at Once X Zou, J Yang, H Zhang, F Li, L Li, J Gao, YJ Lee NeurIPS 2023, 2023	314	2023
MM-REACT: Prompting ChatGPT for Multimodal Reasoning and Action Z Yang, L Li, J Wang, K Lin, E Azarnasab, F Ahmed, Z Liu, C Liu, M Zeng, ... arXiv preprint arXiv:2303.11381, 2023	240	2023
SwinBERT: End-to-End Transformers with Sparse Attention for Video Captioning K Lin, L Li, CC Lin, F Ahmed, Z Gan, Z Liu, Y Lu, L Wang CVPR 2022, 2021	223	2021
MM-Vet: Evaluating Large Multimodal Models for Integrated Capabilities W Yu, Z Yang, L Li, J Wang, K Lin, Z Liu, X Wang, L Wang ICML 2024, 2023	208	2023
Mitigating hallucination in large multi-modal models via robust instruction tuning F Liu, K Lin, L Li, J Wang, Y Yacoob, L Wang ICLR 2024, 2023	204*	2023
VIOLET: End-to-End Video-Language Transformers with Masked Visual-token Modeling TJ Fu, L Li, Z Gan, K Lin, WY Wang, L Wang, Z Liu arXiv preprint arXiv:2111.12681, 2021	189	2021
Generalized Decoding for Pixel, Image, and Language X Zou, ZY Dou, J Yang, Z Gan, L Li, C Li, X Dai, H Behl, J Wang, L Yuan, ... CVPR 2023, 2022	170	2022
Graph Optimal Transport for Cross-Domain Alignment L Chen, Z Gan, Y Cheng, L Li, L Carin, J Liu ICML 2020, 2020	166	2020
Vision-Language Pre-training: Basics, Recent Advances, and Future Trends Z Gan, L Li, C Li, L Wang, Z Liu, J Gao Foundations and Trends® in Computer Graphics and Vision 14 (3–4), 163-352, 2022	145	2022
Multi-step reasoning via recurrent dual attention for visual dialog Z Gan, Y Cheng, AEI Kholy, L Li, J Liu, J Gao ACL 2019, 2019	112	2019
Multimodal foundation models: From specialists to general-purpose assistants C Li, Z Gan, Z Yang, J Yang, L Li, L Wang, J Gao Foundations and Trends® in Computer Graphics and Vision 16.1-2 (2024): 1-214., 2023	106	2023
VALUE: A Multi-Task Benchmark for Video-and-Language Understanding Evaluation L Li, J Lei, Z Gan, L Yu, YC Chen, R Pillai, Y Cheng, L Zhou, XE Wang, ... NeurIPS 2021 Data and Benchmark Track, 2021	103	2021

Das System kann den Vorgang jetzt nicht ausführen. Versuchen Sie es später erneut.

Artikel 1–20

Zitate pro Jahr

Doppelte Zitate

Zusammengeführte Zitate

Koautor hinzufügenKoautoren

Folgen

Zitiert von

Koautoren