Yatharth Saraf
Facebook AI
XLS-R: Self-supervised cross-lingual speech representation learning at scale
A Babu, C Wang, A Tjandra, K Lakhotia, Q Xu, N Goyal, K Singh, ...
arXiv preprint arXiv:2111.09296, 2021
Contextual RNN-T for open domain ASR
M Jain, G Keren, J Mahadeokar, G Zweig, F Metze, Y Saraf
arXiv preprint arXiv:2006.03411, 2020
Providing entity-specific content in response to a search query
AJ Berntson, N Agrawal, S Zhou, Y Saraf, T Joshi, KR Mcdonald, ...
US Patent App. 12/876,638, 2012
Contextualized streaming end-to-end speech recognition with trie-based deep biasing and shallow fusion
D Le, M Jain, G Keren, S Kim, Y Shi, J Mahadeokar, J Chan, ...
arXiv preprint arXiv:2104.02194, 2021
A multi-view approach to audio-visual speaker verification
L Sarı, K Singh, J Zhou, L Torresani, N Singhal, Y Saraf
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
Improved language identification through cross-lingual self-supervised learning
A Tjandra, DG Choudhury, F Zhang, K Singh, A Conneau, A Baevski, ...
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Improving RNN transducer based ASR with auxiliary tasks
C Liu, F Zhang, D Le, S Kim, Y Saraf, G Zweig
2021 IEEE Spoken Language Technology Workshop (SLT), 172-179, 2021
Dual application of speech enhancement for automatic speech recognition
A Pandey, C Liu, Y Wang, Y Saraf
2021 IEEE Spoken Language Technology Workshop (SLT), 223-228, 2021
Towards measuring fairness in speech recognition: Casual conversations dataset transcriptions
C Liu, M Picheny, L Sarı, P Chitkara, A Xiao, X Zhang, M Chou, ...
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Faster, simpler and more accurate hybrid ASR systems using wordpieces
F Zhang, Y Wang, X Zhang, C Liu, Y Saraf, G Zweig
arXiv preprint arXiv:2005.09150, 2020
Multilingual graphemic hybrid ASR with massive data augmentation
C Liu, Q Zhang, X Zhang, K Singh, Y Saraf, G Zweig
arXiv preprint arXiv:1909.06522, 2019
Contextualizing ASR lattice rescoring with hybrid pointer network language model
DR Liu, C Liu, F Zhang, G Synnaeve, Y Saraf, G Zweig
arXiv preprint arXiv:2005.07394, 2020
Kaizen: Continuously improving teacher using exponential moving average for semi-supervised speech recognition
V Manohar, T Likhomanenko, Q Xu, WN Hsu, R Collobert, Y Saraf, ...
2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021
Benchmarking LF-MMI, CTC and RNN-T criteria for streaming ASR
X Zhang, F Zhang, C Liu, K Schubert, J Chan, P Prakash, J Liu, CF Yeh, ...
2021 IEEE Spoken Language Technology Workshop (SLT), 46-51, 2021
Conformer-based self-supervised learning for non-speech audio tasks
S Srivastava, Y Wang, A Tjandra, A Kumar, C Liu, K Singh, Y Saraf
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Scaling ASR improves zero and few shot learning
A Xiao, W Zheng, G Keren, D Le, F Zhang, C Fuegen, O Kalinli, Y Saraf, ...
arXiv preprint arXiv:2111.05948, 2021
Accent-robust automatic speech recognition using supervised and unsupervised wav2vec embeddings
J Li, V Manohar, P Chitkara, A Tjandra, M Picheny, F Zhang, X Zhang, ...
arXiv preprint arXiv:2110.03520, 2021
Search result driven query intent identification
F Radlinski, N Craswell, B Billerbeck, M Shokouhi, S Ahari, N Agrawal, ...
US Patent App. 12/813,376, 2011
Algorithms for image segmentation
Y Saraf
Birla Institute of Technology and Science, 2006
Using domain intent to provide more search results that correspond to a domain
TC Hoad, D Vijaywargi, Y Saraf
US Patent 8,504,561, 2013