Follow
Yusuke Kida
Yusuke Kida
Dell Technologies
Verified email at dell.com
Title
Cited by
Cited by
Year
Sound source direction estimation apparatus, sound source direction estimation method and computer program product
N Ding, Y Kida
US Patent 9,473,849, 2016
1202016
Voice activity detection: Merging source and filter-based information
T Drugman, Y Stylianou, Y Kida, M Akamine
IEEE Signal Processing Letters 23 (2), 252-256, 2015
1032015
Voice activity detection based on optimally weighted combination of multiple features.
Y Kida, T Kawahara
INTERSPEECH, 2621-2624, 2005
512005
Television apparatus and a remote operation apparatus
K Ouchi, A Kawamura, M Sakai, K Suzuki, Y Kida
US Patent 9,154,848, 2015
312015
Neural diarization with non-autoregressive intermediate attractors
Y Fujita, T Komatsu, R Scheibler, Y Kida, T Ogawa
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
112023
Apparatus, method and computer program product for feature extraction
Y Kida, T Masuko
US Patent 8,073,686, 2011
102011
Apparatus and method for discriminating speech, and computer readable medium
K Suzuki, M Sakai, Y Kida
US Patent 9,330,682, 2016
92016
Evaluation of voice activity detection by combining multiple features with weight adaptation.
Y Kida, T Kawahara
INTERSPEECH, 2006
92006
Apparatus and method for discriminating speech of acoustic signal with exclusion of disturbance sound, and non-transitory computer readable medium
K Suzuki, M Sakai, Y Kida
US Patent 9,330,683, 2016
82016
Speaker selective beamformer with keyword mask estimation
Y Kida, D Tran, M Omachi, T Taniguchi, Y Fujita
2018 IEEE Spoken Language Technology Workshop (SLT), 528-534, 2018
72018
Minimum classification error interactive training for speaker identification [interactive robot applications]
Y Kida, H Yamamoto, C Miyajima, K Tokuda, T Kitamura
Proceedings.(ICASSP'05). IEEE International Conference on Acoustics, Speech …, 2005
72005
Robust F0 estimation based on log-time scale autocorrelation and its application to Mandarin tone recognition
Y Kida, M Sakai, T Masuko, A Kawamura
Tenth Annual Conference of the International Speech Communication Association, 2009
62009
Simultaneous Detection and Localization of a Wake-Up Word Using Multi-Task Learning of the Duration and Endpoint.
T Maekaku, Y Kida, A Sugiyama
INTERSPEECH, 4240-4244, 2019
52019
Tourist guidance robot based on HyperCLOVA
T Yamazaki, K Yoshikawa, T Kawamoto, M Ohagi, T Mizumoto, S Ichimura, ...
arXiv preprint arXiv:2210.10400, 2022
42022
Multi-sequence intermediate conditioning for ctc-based asr
Y Fujita, T Komatsu, Y Kida
arXiv preprint, 2022
42022
Using duration and pitch for mandarin digit string recognition
R Zhao, Y Kida, X Yan, P Ding, L He
2010 IEEE International Conference on Acoustics, Speech and Signal …, 2010
42010
Better intermediates improve CTC inference
T Komatsu, Y Fujita, J Lee, L Lee, S Watanabe, Y Kida
arXiv preprint arXiv:2204.00176, 2022
22022
InterAug: Augmenting Noisy Intermediate Predictions for CTC-based ASR
Y Nakagome, T Komatsu, Y Fujita, S Ichimura, Y Kida
arXiv preprint arXiv:2204.00174, 2022
22022
Label-Synchronous Speech-to-Text Alignment for ASR Using Forward and Backward Transformers
Y Kida, T Komatsu, M Togami
arXiv preprint arXiv:2104.10328, 2021
22021
Creating device, creating method, and non-transitory computer readable storage medium
Y Kida, D Tran
US Patent App. 16/131,561, 2019
22019
The system can't perform the operation now. Try again later.
Articles 1–20