Takuma Okamoto

Cited by

	All	Since 2019
Citations	831	517
h-index	16	14
i10-index	30	19

120

20092010201120122013201420152016201720182019202020212022202320247 21 16 22 29 14 29 37 64 67 83 80 114 104 99 33

Takuma Okamoto

National Institute of Information and Communications Technology

Verified email at nict.go.jp - Homepage

Signal processing Sound field reproduction Sound field control Acoustic signal processing


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Sound-space recording and binaural presentation system based on a 252-channel microphone array S Sakamoto, S Hongo, T Okamoto, Y Iwaya, Y Suzuki Acoustical Science and technology 36 (6), 516-526, 2015	40	2015
Real-Time Neural Text-to-Speech with Sequence-to-Sequence Acoustic Model and WaveGlow or Single Gaussian WaveRNN Vocoders. T Okamoto, T Toda, Y Shiga, H Kawai INTERSPEECH, 1308-1312, 2019	33	2019
High order Ambisonic decoding method for irregular loudspeaker arrays J Trevino, T Okamoto, Y Iwaya, Y Suzuki Proceedings of 20th International Congress on Acoustics, 23-27, 2010	33	2010
An investigation of subband WaveNet vocoder covering entire audible frequency range with limited acoustic features T Okamoto, K Tachibana, T Toda, Y Shiga, H Kawai 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018	32	2018
Experimental validation of spatial Fourier transform-based multiple sound zone generation with a linear loudspeaker array T Okamoto, A Sakaguchi The Journal of the Acoustical Society of America 141 (3), 1769-1780, 2017	31	2017
Tacotron-based acoustic model using phoneme alignment for practical neural text-to-speech systems T Okamoto, T Toda, Y Shiga, H Kawai 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019	29	2019
Estimation of sound source positions using a surrounding microphone array T Okamoto, R Nishimura, Y Iwaya Acoustical science and technology 28 (3), 181-189, 2007	27	2007
Quasi-periodic parallel WaveGAN: A non-autoregressive raw waveform generative model with pitch-dependent dilated convolution neural network YC Wu, T Hayashi, T Okamoto, H Kawai, T Toda IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 792-806, 2021	26	2021
3D spatial sound systems compatible with human's active listening to realize rich high-level kansei information Y Suzuki, T Okamoto, J Trevino, ZL Cui, Y Iwaya, S Sakamoto, M Otani Interdisciplinary information sciences 18 (2), 71-82, 2012	24	2012
Generation of multiple sound zones by spatial filtering in wavenumber domain using a linear array of loudspeakers T Okamoto 2014 IEEE International Conference on Acoustics, Speech and Signal …, 2014	22	2014
Text-to-speech synthesis Y Shiga, J Ni, K Tachibana, T Okamoto Speech-to-Speech Translation, 39-52, 2020	21	2020
Improving FFTNet vocoder with noise shaping and subband approaches T Okamoto, T Toda, Y Shiga, H Kawai 2018 IEEE Spoken Language Technology Workshop (SLT), 304-311, 2018	21	2018
Subband WaveNet with overlapped single-sideband filterbanks T Okamoto, K Tachibana, T Toda, Y Shiga, H Kawai 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2017	21	2017
Transformer-based text-to-speech with weighted forced attention T Okamoto, T Toda, Y Shiga, H Kawai ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	20	2020
Analytical methods of generating multiple sound zones for open and baffled circular loudspeaker arrays T Okamoto 2015 IEEE Workshop on Applications of Signal Processing to Audio and …, 2015	20	2015
2.5 D higher order ambisonics for a sound field described by angular spectrum coefficients T Okamoto 2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016	19	2016
Analytical approach to 2.5 D sound field control using a circular double-layer array of fixed-directivity loudspeakers T Okamoto 2017 IEEE International Conference on Acoustics, Speech and Signal …, 2017	16	2017
Least squares approach in wavenumber domain for sound field recording and reproduction using multiple parallel linear arrays T Okamoto, S Enomoto, R Nishimura Applied acoustics 86, 95-103, 2014	15	2014
Implementation of a high-definition 3D audio-visual display based on higher-order Ambisonics using a 157-loudspeaker array combined with a 3D projection display T Okamoto, ZL Cui, Y Iwaya, Y Suzuki 2010 2nd IEEE InternationalConference on Network Infrastructure and Digital …, 2010	15	2010
Multi-stream HiFi-GAN with data-driven waveform decomposition T Okamoto, T Toda, H Kawai 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021	14	2021

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by