Voice conversion in high-order eigen space using deep belief nets. T Nakashika, R Takashima, T Takiguchi, Y Ariki Interspeech, 369-372, 2013 | 165 | 2013 |
Exemplar-based voice conversion in noisy environment R Takashima, T Takiguchi, Y Ariki 2012 IEEE Spoken Language Technology Workshop (SLT), 313-317, 2012 | 156 | 2012 |
GMM-based emotional voice conversion using spectrum and prosody features R Aihara, R Takashima, T Takiguchi, Y Ariki American Journal of Signal Processing 2 (5), 134-138, 2012 | 115 | 2012 |
Voice conversion using RNN pre-trained by recurrent temporal restricted Boltzmann machines T Nakashika, T Takiguchi, Y Ariki IEEE/ACM Transactions on Audio, Speech, and Language Processing 23 (3), 580-587, 2014 | 101 | 2014 |
Object recognition and segmentation using SIFT and Graph Cuts A Suga, K Fukuda, T Takiguchi, Y Ariki 2008 19th International Conference on Pattern Recognition, 1-4, 2008 | 86 | 2008 |
Automatic production system of soccer sports video by digital camera work based on situation recognition Y Ariki, S Kubota, M Kumano Eighth IEEE International Symposium on Multimedia (ISM'06), 851-860, 2006 | 84 | 2006 |
Automatic classification of TV news articles based on telop character recognition Y Ariki, K Matsuura Proceedings IEEE International Conference on Multimedia Computing and …, 1999 | 78 | 1999 |
3D human posture estimation using the HOG features from monocular image K Onishi, T Takiguchi, Y Ariki 2008 19th International Conference on Pattern Recognition, 1-4, 2008 | 73 | 2008 |
High-order sequence modeling using speaker-dependent recurrent temporal restricted boltzmann machines for voice conversion. T Nakashika, T Takiguchi, Y Ariki Interspeech, 2278-2282, 2014 | 68 | 2014 |
Extraction of TV news articles based on scene cut detection using DCT clustering Y Ariki, Y Saito Proceedings of 3rd IEEE International Conference on Image Processing 3, 847-850, 1996 | 65 | 1996 |
Robust feature extraction using kernel PCA T Takiguchi, Y Ariki 2006 IEEE International Conference on Acoustics Speech and Signal Processing …, 2006 | 59 | 2006 |
Topic segmentation and retrieval system for lecture videos based on spontaneous speech recognition. N Yamamoto, J Ogata, Y Ariki INTERSPEECH, 961-964, 2003 | 59 | 2003 |
Two-step acoustic model adaptation for dysarthric speech recognition R Takashima, T Takiguchi, Y Ariki ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 58 | 2020 |
Exemplar-based voice conversion using sparse representation in noisy environments R Takashima, T Takiguchi, Y Ariki IEICE Transactions on Fundamentals of Electronics, Communications and …, 2013 | 58 | 2013 |
Lip reading using a dynamic feature of lip images and convolutional neural networks Y Li, Y Takashima, T Takiguchi, Y Ariki 2016 IEEE/ACIS 15th International Conference on Computer and Information …, 2016 | 55 | 2016 |
Noisy speech recognition using noise reduction method based on Kalman filter M Fujimoto, Y Ariki 2000 IEEE international conference on acoustics, speech, and signal …, 2000 | 55 | 2000 |
PCA-Based Speech Enhancement for Distorted Speech Recognition. T Takiguchi, Y Ariki Journal of multimedia 2 (5), 2007 | 52 | 2007 |
Emotional voice conversion using deep neural networks with MCC and F0 features Z Luo, T Takiguchi, Y Ariki 2016 IEEE/ACIS 15th International Conference on Computer and Information …, 2016 | 50 | 2016 |
Acoustic model adaptation using first-order linear prediction for reverberant speech T Takiguchi, M Nishimura, Y Ariki IEICE transactions on information and systems 89 (3), 908-914, 2006 | 47 | 2006 |
Voice conversion based on non-negative matrix factorization using phoneme-categorized dictionary R Aihara, T Nakashika, T Takiguchi, Y Ariki 2014 IEEE International Conference on Acoustics, Speech and Signal …, 2014 | 46 | 2014 |