Folgen
Leda Sarı
Leda Sarı
Research Scientist, Meta AI
Bestätigte E-Mail-Adresse bei fb.com
Titel
Zitiert von
Zitiert von
Jahr
Ego4d: Around the world in 3,000 hours of egocentric video
K Grauman, A Westbury, E Byrne, Z Chavis, A Furnari, R Girdhar, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
5172022
Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale
M Le, A Vyas, B Shi, B Karrer, L Sari, R Moritz, M Williamson, V Manohar, ...
arXiv preprint arXiv:2306.15687, 2023
592023
A Multi-View Approach to Audio-Visual Speaker Verification
L Sarı, K Singh, J Zhou, L Torresani, N Singhal, Y Saraf
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
422021
Unsupervised Speaker Adaptation Using Attention-Based Speaker Memory for End-to-End ASR
L Sarı, N Moritz, T Hori, J Le Roux
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
382020
Towards Measuring Fairness in Speech Recognition: Casual Conversations Dataset Transcriptions
C Liu, M Picheny, L Sarı, P Chitkara, A Xiao, X Zhang, M Chou, ...
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
292022
Fusion of LVCSR and posteriorgram based keyword search
L Sarı, B Gündoğdu, M Saraçlar
Sixteenth Annual Conference of the International Speech Communication …, 2015
172015
Counterfactually fair automatic speech recognition
L Sarı, M Hasegawa-Johnson, CD Yoo
IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 3515-3525, 2021
162021
Pre-training of Speaker Embeddings for Low-latency Speaker Change Detection in Broadcast News
L Sari, S Thomas, M Hasegawa-Johnson, M Picheny
2019 IEEE International Conference on Acoustics, Speech and Signal …, 2019
162019
Training Spoken Language Understanding Systems with Non-Parallel Speech and Text
L Sarı, S Thomas, M Hasegawa-Johnson
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
142020
Template-based keyword search with pseudo posteriorgrams
B Gündoğdu, L Sarı, G Çetinkaya, M Saraçlar
Signal Processing and Communication Application Conference (SIU), 2016 24th …, 2016
142016
Auxiliary Networks for Joint Speaker Adaptation and Speaker Change Detection
L Sari, M Hasegawa-Johnson, S Thomas
IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 324-333, 2021
102021
Learning Speaker Aware Offsets for Speaker Adaptation of Neural Networks
L Sari, S Thomas, M Hasegawa-Johnson
Interspeech, 769-773, 2019
92019
Elisa system description for lorehlt 2017
L Cheung, T Gowda, U Hermjakob, N Liu, J May, A Mayn, ...
Proc. Low Resource Human Lang. Technol, 51-59, 2017
92017
Seamless equal accuracy ratio for inclusive CTC speech recognition
H Gao, X Wang, S Kang, R Mina, D Issa, J Harvill, L Sari, ...
Speech Communication 136, 76-83, 2022
82022
Texture defect detection using independent vector analysis in wavelet domain
L Sari, A Ertüzün
2014 22nd International Conference on Pattern Recognition, 1639-1644, 2014
82014
Self-Supervised Representations for Singing Voice Conversion
T Jayashankar, J Wu, L Sari, D Kant, V Manohar, Q He
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
62023
Worldly Wise (WoW)-Cross-Lingual Knowledge Fusion for Fact-based Visual Spoken-Question Answering
K Ramnath, L Sari, M Hasegawa-Johnson, C Yoo
Proceedings of the 2021 Conference of the North American Chapter of the …, 2021
52021
Posteriorgram based approaches in keyword search
L Sarı, B Gündoğdu, M Saraçlar
2015 23nd Signal Processing and Communications Applications Conference (SIU …, 2015
52015
Speaker Adaptation with an Auxiliary Network
L Sarı, M Hasegawa-Johnson
Machine Learning in Speech and Language Processing Workshop (MLSLP), 2018
42018
Biased Self-supervised learning for ASR
FL Kreyssig, Y Shi, J Guo, L Sari, A Mohamed, PC Woodland
arXiv preprint arXiv:2211.02536, 2022
22022
Das System kann den Vorgang jetzt nicht ausführen. Versuchen Sie es später erneut.
Artikel 1–20