Sound-space recording and binaural presentation system based on a 252-channel microphone array S Sakamoto, S Hongo, T Okamoto, Y Iwaya, Y Suzuki Acoustical Science and technology 36 (6), 516-526, 2015 | 37 | 2015 |
Real-Time Neural Text-to-Speech with Sequence-to-Sequence Acoustic Model and WaveGlow or Single Gaussian WaveRNN Vocoders. T Okamoto, T Toda, Y Shiga, H Kawai INTERSPEECH, 1308-1312, 2019 | 31 | 2019 |
Experimental validation of spatial Fourier transform-based multiple sound zone generation with a linear loudspeaker array T Okamoto, A Sakaguchi The Journal of the Acoustical Society of America 141 (3), 1769-1780, 2017 | 31 | 2017 |
High order Ambisonic decoding method for irregular loudspeaker arrays J Trevino, T Okamoto, Y Iwaya, Y Suzuki Proceedings of 20th International Congress on Acoustics, 23-27, 2010 | 31 | 2010 |
An investigation of subband WaveNet vocoder covering entire audible frequency range with limited acoustic features T Okamoto, K Tachibana, T Toda, Y Shiga, H Kawai 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 30 | 2018 |
Estimation of sound source positions using a surrounding microphone array T Okamoto, R Nishimura, Y Iwaya Acoustical science and technology 28 (3), 181-189, 2007 | 27 | 2007 |
Tacotron-based acoustic model using phoneme alignment for practical neural text-to-speech systems T Okamoto, T Toda, Y Shiga, H Kawai 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019 | 24 | 2019 |
Quasi-periodic parallel WaveGAN: a non-autoregressive raw waveform generative model with pitch-dependent dilated convolution neural network YC Wu, T Hayashi, T Okamoto, H Kawai, T Toda IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 792-806, 2021 | 23 | 2021 |
3D spatial sound systems compatible with human's active listening to realize rich high-level kansei information Y Suzuki, T Okamoto, J Trevino, ZL Cui, Y Iwaya, S Sakamoto, M Otani Interdisciplinary information sciences 18 (2), 71-82, 2012 | 23 | 2012 |
Generation of multiple sound zones by spatial filtering in wavenumber domain using a linear array of loudspeakers T Okamoto 2014 IEEE International Conference on Acoustics, Speech and Signal …, 2014 | 22 | 2014 |
Subband WaveNet with overlapped single-sideband filterbanks T Okamoto, K Tachibana, T Toda, Y Shiga, H Kawai 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2017 | 20 | 2017 |
Analytical methods of generating multiple sound zones for open and baffled circular loudspeaker arrays T Okamoto 2015 IEEE Workshop on Applications of Signal Processing to Audio and …, 2015 | 20 | 2015 |
Improving FFTNet vocoder with noise shaping and subband approaches T Okamoto, T Toda, Y Shiga, H Kawai 2018 IEEE Spoken Language Technology Workshop (SLT), 304-311, 2018 | 19 | 2018 |
2.5 D higher order ambisonics for a sound field described by angular spectrum coefficients T Okamoto 2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016 | 19 | 2016 |
Text-to-speech synthesis Y Shiga, J Ni, K Tachibana, T Okamoto Speech-to-Speech Translation, 39-52, 2020 | 16 | 2020 |
Analytical approach to 2.5 D sound field control using a circular double-layer array of fixed-directivity loudspeakers T Okamoto 2017 IEEE International Conference on Acoustics, Speech and Signal …, 2017 | 16 | 2017 |
Transformer-based text-to-speech with weighted forced attention T Okamoto, T Toda, Y Shiga, H Kawai ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 15 | 2020 |
Least squares approach in wavenumber domain for sound field recording and reproduction using multiple parallel linear arrays T Okamoto, S Enomoto, R Nishimura Applied acoustics 86, 95-103, 2014 | 14 | 2014 |
Implementation of a high-definition 3D audio-visual display based on higher-order Ambisonics using a 157-loudspeaker array combined with a 3D projection display T Okamoto, ZL Cui, Y Iwaya, Y Suzuki 2010 2nd IEEE InternationalConference on Network Infrastructure and Digital …, 2010 | 14 | 2010 |
Angular spectrum decomposition-based 2.5 D higher-order spherical harmonic sound field synthesis with a linear loudspeaker array T Okamoto 2017 IEEE Workshop on Applications of Signal Processing to Audio and …, 2017 | 13 | 2017 |