Takuma OKAMOTO, Ph.D.
Last update: 4 June 2024 (Our proposal was accepted by Interspeech 2024.)
Introduction
Research Interests
- Sound field control
- Speech synthesis
- Microphone / loudspeaker array signal processing
- Spoken language processing
Publications (Google Scholar Citations, ResearchGate, researchmap)
- Journal Papers
- H. Yamashita, T. Okamoto, R. Takashima, Y. Ohtani, T. Takiguchi, T. Toda and H. Kawai,
"Fast neural speech waveform generative models with fully-connected layer-based upsampling,"
IEEE Access, vol. 12, pp. 31409–31421, 2024. [IEEE Xplore] [Open access (PDF)]
- K. Matsubara, T. Okamoto, R. Takashima, T. Takiguchi, T. Toda, and H. Kawai,
"Harmonic-Net: Fundamental frequency and speech rate controllable fast neural vocoder,"
IEEE/ACM Trans. Audio Speech Lang. Process., vol. 31, pp. 1902–1915, 2023. [IEEE Xplore] [Open access (PDF)]
- R. Komatsu, S. Gao, W. Hou, M. Zhang, T. Tanaka, K. Toyoda, Y. Kimura, K. Hino, Y. Iwamoto, K. Mori, T. Okamoto, and T. Shinozaki,
"Automatic spoken language acquisition based on observation and dialogue,"
IEEE J. Sel. Top. Signal Process., vol. 16, no. 6, pp. 1480–1492, Oct. 2022. [IEEE Xplore] [Open access (PDF)]
Special Issue on Self-Supervised Learning for Speech and Audio Processing
- T. Okamoto, K. Matsubara, T. Toda, Y. Shiga, and H. Kawai,
"Neural speech-rate conversion with multispeaker WaveNet vocoder,"
Speech Commun., vol. 138, pp. 1–12, Mar. 2022. [ScienceDirect] [Open access (PDF)] [Demo samples]
- K. Matsubara, T. Okamoto, R. Takashima, T. Takiguchi, T. Toda, and H. Kawai,
"Comparison of real-time multi-speaker neural vocoders on CPUs,"
Acoust. Sci. & Tech., vol. 43, no. 2, pp. 121–124, Mar. 2022. [Open access (PDF)]
- K. Matsubara, T. Okamoto, R. Takashima, T. Takiguchi, T. Toda, Y. Shiga and H. Kawai,
"Full-band LPCNet: A real-time neural vocoder for 48 kHz audio with a CPU,"
IEEE Access, vol. 9, pp. 94923–94933, 2021. [IEEE Xplore] [Open access (PDF)]
- Y.-C. Wu, T. Hayashi, T. Okamoto, H. Kawai and T. Toda,
"Quasi-Periodic Parallel WaveGAN: A non-autoregressive raw waveform generative model with pitch-dependent dilated convolution neural network,"
IEEE/ACM Trans. Audio Speech Lang. Process., vol. 29, pp. 792–806, 2021. [IEEE Xplore] [Open access (PDF)]
- K. Matsubara, T. Okamoto, R. Takashima, T. Takiguchi, T. Toda, Y. Shiga and H. Kawai,
"Investigation of training data size for real-time neural vocoders on CPUs,"
Acoust. Sci. & Tech., vol. 42, no. 1, pp. 65–68, Jan. 2021. [Open access (PDF)]
- T. Okamoto,
"Mode-matching-based sound field recording and synthesis with circular double-layer arrays,"
Appl. Sci., vol. 8, no. 7, 1048, July 2018. [Open access (PDF)]
- T. Okamoto, K. Tachibana, T. Toda, Y. Shiga and H. Kawai,
"Deep neural network-based power spectrum reconstruction to improve quality of vocoded speech with limited acoustic parameters,"
Acoust. Sci. & Tech., vol. 39, no. 2, pp. 163–166, Mar. 2018. [Open access (PDF)]
- T. Okamoto,
"Localized sound zone generation based on external radiation canceller,"
J. Inf. Hiding Multimed. Signal Process., vol. 8, no. 6, pp. 1335–1351, Nov. 2017. [Open access (PDF)]
- T. Okamoto,
"Horizontal local sound field propagation based on sound source dimension mismatch,"
J. Inf. Hiding Multimed. Signal Process., vol. 8, no. 5, pp. 1069–1081, Sept. 2017. [Open access (PDF)]
- T. Okamoto and A. Sakaguchi,
"Experimental validation of spatial Fourier transform-based multiple sound zone generation with a linear loudspeaker array,"
J. Acoust. Soc. Am., vol. 141, no. 3, pp. 1769–1780, Mar. 2017. [Open access (PDF)]
- T. Okamoto, A. Hiroe and H. Kawai,
"Reducing latency for language identification based on large-vocabulary continuous speech recognition,"
Acoust. Sci. & Tech., vol. 38, no. 1, pp. 38–41, Jan. 2017. [Open access (PDF)]
- S. Sakamoto, S. Hongo, T. Okamoto, Y. Iwaya and Y. Suzuki,
"Sound-space recording and binaural presentation system based on a 252ch spherical microphone array,"
Acoust. Sci. & Tech., vol. 36, no. 6, pp. 516–526, Nov. 2015. [Open access (PDF)] (The 57th Sato Prize Paper Award in ASJ)
- J. Trevino, T. Okamoto, Y. Iwaya, J. Li and Y. Suzuki,
"A spatial extrapolation method to derive high-order Ambisonics data from stereo sources,"
J. Inf. Hiding Multimed. Signal Process., vol. 6, no. 6, pp. 1100–1116, Nov. 2015. [Open access (PDF)]
- T. Okamoto, S. Enomoto and R. Nishimura,
"Least squares approach in wavenumber domain for sound field recording and reproduction using multiple parallel linear arrays,"
Appl. Acoust., vol. 86, pp. 95–103, Dec. 2014.
- J. Trevino, T. Okamoto, Y. Iwaya and Y. Suzuki,
"Sound field reproduction using Ambisonics and irregular loudspeaker arrays,"
IEICE Trans. Fund. Electron. Comm. Comput. Sci., vol. 97-A, no. 9, pp. 1832–1839, Sept. 2014.
- Y. Suzuki, T. Okamoto, J. Trevino, Z.-L. Cui, S. Sakamoto, Y. Iwaya and M. Otani,
"3D spatial sound systems compatible with human's active listening to realize rich high-level kansei information,"
Interdiscip. Inf. Sci., vol. 18, no. 2, pp. 71–82, Dec. 2012. [Open access (PDF)]
- C-S. Han, T. Okamoto, Y. Iwaya and Y. Suzuki,
"Loudspeaker distributions suitable for crosstalk cancellers robust to head rotation,"
Acoust. Sci. & Tech., vol. 33, no. 4, pp. 266–269, July 2012. [Open access (PDF)]
- T. Kimura, Y. Yamakata, M. Katsumoto, T. Okamoto, S. Yairi, Y. Iwaya and Y. Suzuki,
"Three-dimensional radiated sound field display system using directional loudspeakers and wave field synthesis,"
Acoust. Sci. & Tech., vol. 33, no. 1, pp. 11–20, Jan. 2012. [Open access (PDF)]
- T. Okamoto, Y. Iwaya and Y. Suzuki,
"Wide-band dereverberation method based on multichannel linear prediction using prewhitening filter,"
Appl. Acoust., vol. 73, no. 1, pp. 50–55, Jan. 2012.
- H-S. Wey, A. Ito, T. Okamoto and Y. Suzuki,
"Multiple description coding using time domain division for MP3 coded sound signal,"
J. Inf. Hiding Multimed. Signal Process., vol. 1, no. 4, pp. 269–285, Oct. 2010. [Open access (PDF)]
- T. Okamoto, R. Nishimura and Y. Iwaya,
"Estimation of sound source positions using a surrounding microphone array,"
Acoust. Sci. & Tech., vol. 28, no. 3, pp. 181–189, May 2007. [Open access (PDF)]
- Book Chapters
- Y. Shiga, J. Ni, K. Tachibana and T. Okamoto,
"Text-to-Speech Synthesis,"
Book chapter of Speech-to-Speech Translation, pp. 39–52, 2020.
- T. Okamoto, Y. Iwaya and Y. Suzuki,
"Estimation of high-resolution sound properties for realizing an editable sound-space system,"
Book chapter of Principles and applications on spatial hearing, pp. 407–416, Mar. 2011.
- T. Okamoto, B. FG Katz, M. Noisternig, Y. Iwaya and Y. Suzuki,
"Implementation of real-time room auralization using a surrounding 157 loudspeaker array,"
Book chapter of Principles and applications on spatial hearing, pp. 373–382, Mar. 2011.
- S. Sakamoto, J. Kodama, S. Hongo, T. Okamoto, Y. Iwaya and Y. Suzuki,
"Effects of microphone arrangement on the accuracy of a spherical microphone array (SENZI) in acquiring high-definition 3D sound space information,"
Book chapter of Principles and applications on spatial hearing, pp. 314–323, Mar. 2011.
- Invited talks
- T. Okamoto,
"Spatial Fourier transform-based localized sound zone generation methods with loudspeaker arrays,"
J. Acoust. Soc. Am., vol. 146, no. 4, pp. 2761–2762, Oct. 2019. [Abstract] [Presentation slide (SlideShare)]
178th meeting of the Acoustical Society of America, Dec. 2019.
- T. Okamoto,
"Over 100 channels sound field recording and reproduction systems and their applications,"
Centre for Noise and Vibration Control Seminar in KAIST, Mar. 2012.
- Tutorial
- T. Shinozaki, T. Okamoto, and S. Mori,
"Toward the realization of automatic spoken language acquisition mechanism for human-symbiotic robots,"
APSIPA ASC 2021, Dec. 2021.
- Demonstrations
- T. Okamoto, K. Ueno, T. Okabe, K. Tani, Y. Yoshikata, M. Sudo, M. Kuwahara, and K. Hikita,
"Improving portable multiple sound spot synthesis system with a baffled circular array of 16 loudspeakers,"
WASPAA 2023 Demonstrations, Oct. 2023. [Manuscript (PDF)]
- T. Okamoto, K. Ueno, T. Okabe, K. Tani, Y. Yoshikata, M. Sudo, M. Kuwahara, and K. Hikita,
"Portable multilingual sound spot synthesis system with a compact circular array of 16 loudspeakers,"
ICASSP 2023 Show & Tell Demo Session, June 2023.
- Proceedings of International Conferences and Workshops
- T. Okamoto, Y. Ohtani, S. Shimizu, T. Toda and H. Kawai,
"Challenge of singing voice synthesis using only text-to-speech corpus with FIRNet source-filter neural vocoder,"
in Proc. Interspeech, Sept. 2024, pp. 1870–1874. [ISCA Archive] [Open access (PDF)] [Presentation slides (Speaker Deck)] [Demo page]
- T. Okamoto, Y. Ohtani and H. Kawai,
"Mobile PresenTra: NICT fast neural text-to-speech system on smartphones with incremental inference of MS-FC-HiFi-GAN for low-latency synthesis,"
in Proc. Interspeech, Sept. 2024, pp. 997–998. (Show & Tell). [ISCA Archive] [Open access (PDF)] [NICT Press Release]
- T. Okamoto, Y. Ohtani, T. Toda and H. Kawai,
"ConvNeXt-TTS and ConvNeXt-VC: ConvNeXt-based fast end-to-end sequence-to-sequence text-to-speech and voice conversion,"
in Proc. ICASSP, Apr. 2024, pp. 12456–12460. [IEEE Xplore] [Preprint (PDF)] [Demo page]
- Y. Ohtani, T. Okamoto, T. Toda and H. Kawai,
"FIRNet: Fundamental frequency controllable fast neural vocoder with trainable finite impulse response filter,"
in Proc. ICASSP, Apr. 2024, pp. 10871–10875. [IEEE Xplore] [Preprint (PDF)] [Demo page]
- T. Okamoto, H. Yamashita, Y. Ohtani, T. Toda and H. Kawai,
"WaveNeXt: ConvNeXt-based fast neural vocoder without iSTFT layer,"
in Proc. ASRU, Dec. 2023. [IEEE Xplore] [Preprint (PDF)] [Demo page]
- T. Okamoto, T. Toda and H. Kawai,
"E2E-S2S-VC: End-to-end sequence-to-sequence voice conversion,"
in Proc. Interspeech, Aug. 2023, pp. 2043–2047. [ISCA Archive] [Open access (PDF)] [Poster (PDF)] [Demo page]
- T. Okamoto,
"Multilingual sound spot synthesis systems,"
in Proc. Internoise, Aug. 2023, pp. 5861–5865. (invited) [Presentation slides (Speaker Deck)]
- R. Komatsu, Y. Kimura, T. Okamoto and T. Shinozaki,
"Continuous action space-based spoken language acquisition agent using residual sentence embedding and transformer decoder,"
in Proc. ICASSP, June 2023. [IEEE Xplore]
- T. Tanaka, R. Komatsu, T. Okamoto and T. Shinozaki,
"Pronunciation adaptive self speaking agent using WaveGrad,"
in Proc. AAAI SAS, Feb. 2022. [Open access (PDF)]
- T. Okamoto, T. Toda and H. Kawai,
"Multi-stream HiFi-GAN with data-driven waveform decomposition,"
in Proc. ASRU, Dec. 2021, pp. 610–617. [IEEE Xplore] [Preprint (PDF)] [Presentation slides (Speaker Deck)] [Demo page]
- T. Okamoto,
"2D multizone sound field synthesis with interior-exterior Ambisonics,"
in Proc. WASPAA, Oct. 2021, pp. 276–280. [IEEE Xplore] [Preprint (PDF)]
- K. Matsubara, T. Okamoto, R. Takashima, T. Takiguchi, T. Toda, Y. Shiga and H. Kawai,
"High-intelligibility speech synthesis for dysarthric speakers with LPCNet-based TTS and CycleVAE-based VC,"
in Proc. ICASSP, June 2021, pp. 7058–7062. [IEEE Xplore] [Preprint (PDF)]
- T. Okamoto, T. Toda, Y. Shiga and H. Kawai,
"Noise level limited sub-modeling for diffusion probabilistic vocoders,"
in Proc. ICASSP, June 2021, pp. 6029–6033. [IEEE Xplore] [Preprint (PDF)]
- T. Okamoto,
"Close-talking recording with planarly distributed microphones,"
in Proc. ICASSP, June 2021, pp. 4470–4474. [IEEE Xplore] [Preprint (PDF)] [MATLAB code (Code Ocean)]
- Y.-C. Wu, T. Hayashi, T. Okamoto, H. Kawai and T. Toda,
"Quasi-periodic parallel WaveGAN vocoder: A non-autoregressive pitch-dependent dilated convolution model for parametric speech generation,"
in Proc. Interspeech, Oct. 2020, pp. 3535–3539. [ISCA Archive] [Open access (PDF)]
- T. Okamoto, T. Toda, Y. Shiga and H. Kawai,
"Transformer-based text-to-speech with weighted forced attention,"
in Proc. ICASSP, May 2020, pp. 6729–6733. [IEEE Xplore] [Preprint (PDF)] [Poster (PDF in SigPort)] [Presentation video] [Demo samples]
- T. Okamoto, T. Toda, Y. Shiga and H. Kawai,
"Tacotron-based acoustic model using phoneme alignment for practical neural text-to-speech systems,"
in Proc. ASRU, Dec. 2019, pp. 214–221. [IEEE Xplore] [Preprint (PDF)] [Poster (PDF)]
- T. Okamoto,
"3D localized sound zone generation with a planar omni-directional loudspeaker array,"
in Proc. WASPAA, Oct. 2019, pp. 110–114. [IEEE Xplore] [Preprint (PDF)] [Poster (PDF in SigPort)]
- T. Okamoto, T. Toda, Y. Shiga and H. Kawai,
"Real-time neural text-to-speech with sequence-to-sequence acoustic model and WaveGlow or single Gaussian WaveRNN vocoders,"
in Proc. Interspeech, Sept. 2019, pp. 1308–1312. [ISCA Archive] [Open access (PDF)] [Presentation slide (SlideShare)]
- T. Okamoto, T. Toda, Y. Shiga and H. Kawai,
"Investigations of real-time Gaussian FFTNet and parallel WaveNet neural vocoders with simple acoustic features,"
in Proc. ICASSP, May 2019, pp. 7020–7024. [IEEE Xplore] [Preprint (PDF)] [Poster (PDF in SigPort)]
- T. Okamoto,
"Horizontal 3D sound field recording and 2.5D synthesis with omni-directional circular arrays,"
in Proc. ICASSP, May 2019, pp. 960–964. [IEEE Xplore] [Preprint (PDF)] [Poster (PDF in SigPort)]
- T. Okamoto, T. Toda, Y. Shiga and H. Kawai,
"Improving FFTNet vocoder with noise shaping and subband approaches,"
in Proc. SLT, Dec. 2018, pp. 304–311. [IEEE Xplore] [Preprint (PDF)] [Poster (PDF)]
- T. Okamoto,
"2.5D localized sound zone generation with a circular array of fixed-directivity loudspeakers,"
in Proc. IWAENC, Sept. 2018, pp. 321–325. [IEEE Xplore] [Preprint (PDF)] [Poster (PDF)]
- T. Okamoto, K. Tachibana, T. Toda, Y. Shiga and H. Kawai,
"An investigation of subband WaveNet vocoder covering entire audible frequency range with limited acoustic features,"
in Proc. ICASSP, Apr. 2018, pp. 5654–5658. [IEEE Xplore] [Preprint (PDF)] [Poster (PDF in SigPort)]
- T. Okamoto, K. Tachibana, T. Toda, Y. Shiga and H. Kawai,
"Subband WaveNet with overlapped single-sideband filterbanks,"
in Proc. ASRU, Dec. 2017, pp. 698–704. [IEEE Xplore] [Preprint (PDF)] [Poster (PDF)]
- T. Okamoto,
"Angular spectrum decomposition-based 2.5D higher-order spherical harmonic sound field synthesis with a linear loudspeaker array,"
in Proc. WASPAA, Oct. 2017, pp. 180–184. [IEEE Xplore] [Preprint (PDF)] [Poster (PDF)]
- T. Okamoto,
"Analytical approach to 2.5D sound field control using a circular double-layer array of fixed-directivity loudspeakers,"
in Proc. ICASSP, Mar. 2017, pp. 91–95. [IEEE Xplore] [Preprint (PDF)] [Presentation slide (PDF)]
- T. Okamoto,
"2.5D higher-order Ambisonics for a sound field described by angular spectrum coefficients,"
in Proc. ICASSP, Mar. 2016, pp. 326–330. [IEEE Xplore] [Preprint (PDF)] [Poster (PDF)]
- T. Okamoto,
"Analytical methods of generating multiple sound zones for open and baffled circular loudspeaker arrays,"
in Proc. WASPAA, Oct. 2015. [IEEE Xplore] [Preprint (PDF)] [Poster (PDF)]
- T. Okamoto, N. Kanda and C. Hori,
"Recurrent neural network scenario modeling for WFST based statistical dialogue management,"
in Proc. MLSLP, Sept. 2015.
- T. Okamoto,
"Near-field sound propagation based on a circular and linear array combination,"
in Proc. ICASSP, Apr. 2015, pp. 624–628. [IEEE Xplore] [Preprint (PDF)] [Poster (PDF)]
- T. Okamoto,
"Generation of multiple sound zones by spatial filtering in wavenumber domain using a linear array of loudspeakers,"
in Proc. ICASSP, May 2014, pp. 4733–4737. [IEEE Xplore] [Preprint (PDF)] [Presentation slide (PDF)]
- J. Trevino, T. Okamoto, C. Salvador, Y. Iwaya, Z.-L. Cui, S. Sakamoto and Y. Suzuki,
"High-order Ambisonics auditory displays for the scalable presentation of immersive 3D audio-visual contents,"
Proc. ICAT2013, Dec. 2013.
- J. Trevino, T. Okamoto, Y. Iwaya, J. Li and Y. Suzuki,
"Extrapolation of horizontal Ambisonics data from mainstream stereo sources,"
Proc. IIH-MSP 2013, pp. 302–305, Oct. 2013. (invited paper)
- Y. Suzuki, J. Trevino, T. Okamoto, Z.-L. Cui, S. Sakamoto and Y. Iwaya,
"High definition 3D auditory displays and microphone arrays for the use with future 3D TV,"
Proc. 3DSA2013, June 2013. (invited talk)
- S. Sakamoto, S. Hongo, T. Okamoto, Y. Iwaya and Y. Suzuki,
"Improvement of accuracy of 3D sound space synthesized by real-time "SENZI", a sound space information acquisition system using spherical array with numerous microphones,"
Proc. ICA 2013, 055051, pp. 1–9, June 2013. (invited paper)
- J. Trevino, T. Okamoto, Y. Iwaya and Y. Suzuki,
"Evaluation of different spatial windows for a multi-channel audio interpolation system,"
Proc. ICA 2013, 055028, pp. 1–9, June 2013.
- J. Trevino, T. Okamoto, Y. Iwaya and Y. Suzuki,
"Reproducing discrete multi-channel audio using arbitrary loudspeaker configurations,"
Proc. AES Japan Sec. Conf. in Sendai, Oct. 2012.
- J. Trevino, T. Okamoto, Y. Iwaya and Y. Suzuki,
"Ambisonic synthesis of directional sources using non-spherical loudspeaker arrays,"
Proc. AES 25th UK Conf. and 4th Int. Symp. on Ambisonics and Spherical Acoust., pp. 10–1–5, Mar. 2012.
- Y. Suzuki, T. Okamoto, J. Trevino, T. Kimura, S. Sakamoto, Z.-L. Cui, M. Katsumoto and Y. Iwaya,
"Toward 3D spatial audio systems with high sense-of-presence,"
Proc. 5th Int. Univers. Commun. Symp., Oct. 2011. (invited lecture)
- C-S. Han, T. Okamoto, Y. Iwaya and Y. Suzuki,
"Loudspeaker distributions suitable for crosstalk cancellers robust to changes in the listening position,"
Proc. 5th Int. Univers. Commun. Symp., Oct. 2011.
- Y. Iwaya, T. Okamoto, J. Trevino, S. Sakamoto and Y. Suzuki,
"Measurement/reproduction of high-definition sound space information using numerous microphones/loudspeakers,"
Proc. inter-noise 2011, Sept. 2011. (invited lecture)
- J. Trevino, T. Okamoto, Y. Iwaya and Y. Suzuki,
"Evaluation of a new ambisonic decoder for irregular loudspeaker arrays using interaural cues,"
Proc. 3rd Int. Symp. on Ambisonics and Spherical Acoust., June 2011. [Presentation slide (PDF)]
- T. Kimura, Y. Yamakata, M. Katsumoto, T. Okamoto, S. Yairi, Y. Iwaya and Y. Suzuki,
"Comparative performance evaluation of near 3D sound field reproduction system with directional loudspeakers and wave field synthesis,"
Proc. 4th Int. Univers. Commun. Symp., pp. 220–227, Oct. 2010.
- T. Okamoto, Z.-L. Cui, Y. Iwaya and Y. Suzuki,
"Implementation of a high-definition 3D audio-visual display based on higher-order Ambisonics using a 157-loudspeaker array combined with a 3D projection display,"
Proc. IEEE IC-NIDC 2010, pp. 179–183, Sept. 2010. [Presentation slide (PDF)]
- D. Cabrera, T. Okamoto, B. FG Katz, M. Noisternig, Y. Iwaya and Y. Suzuki,
"Considerations in characterising an almost anechoic room for interactive spatial audio reproduction,"
Proc. Int. Symp. on Room Acoust. 2010, P4g, Aug. 2010.
- T. Okamoto, Y. Iwaya and Y. Suzuki,
"Blind directivity estimation of a sound source in a room using a surrounding microphone array,"
Proc. ICA 2010, Aug. 2010. [Presentation slide (PDF)]
- S. Sakamoto, J. Kodama, S. Hongo, T. Okamoto, Y. Iwaya and Y. Suzuki,
"SENZI, a 3D sound-space recording system using spherical microphone array with 252-ch microphones,"
Proc. ICA 2010, Aug. 2010.
- Y. Iwaya, W. Sato, T. Okamoto, M. Otani and Y. Suzuki,
"Interpolation method of head-related transfer functions in the z-plane domain using a common-pole-zero model,"
Proc. ICA 2010, Aug. 2010.
- J. Trevino, T. Okamoto, Y. Iwaya and Y. Suzuki,
"Higher order Ambisonic decoding method for irregular loudspeaker arrays,"
Proc. ICA 2010, Aug. 2010.
- H-S. Wey, A. Ito, T. Okamoto and Y. Suzuki,
"Multiple description coding for MP3 coded sound signal,"
Proc. ICA 2010, Aug. 2010.
- T. Okamoto, D. Cabrera, M. Noisternig, B. FG Katz, Y. Iwaya and Y. Suzuki,
"Improving sound field reproduction in a small room based on higher-order Ambisonics with a 157-loudspeaker array,"
Proc. 2nd Int. Symp. on Ambisonics and Spherical Acoust., Poster 5, May 2010. [Poster (PDF)]
- T. Okamoto, B. FG Katz, M. Noisternig, Y. Iwaya and Y. Suzuki,
"Implementation of real-time room auralization using a surrounding 157 loudspeaker array,"
Proc. IWPASH 2009 (eProceedings), Nov. 2009. [Poster (PDF)]
- T. Okamoto, Y. Iwaya and Y. Suzuki,
"Toward an editable sound-space system using high-resolution sound properties,"
Proc. IWPASH 2009 (eProceedings), Nov. 2009. [Poster (PDF)]
- J. Kodama, S. Sakamoto, S. Hongo, T. Okamoto, Y. Iwaya and Y. Suzuki,
"Effects of microphone arrangements on the accuracy of a spherical microphone array (SENZI) in acquiring high-definition 3D sound space information,"
Proc. IWPASH 2009 (eProceedings), Nov. 2009.
- T. Kimura, Y. Yamakata, M. Katsumoto, T. Okamoto, S. Yairi, Y. Iwaya and Y. Suzuki,
"Development of real system in near 3D sound field reproduction system using directional loudspeakers and wave field synthesis,"
Proc. WESPAC X, ID: 0164, Sept. 2009.
- Abstracts of International Conferences and Symposiums
- J. Trevino, T. Okamoto, Y. Iwaya, S. Sakamoto and Y. Suzuki,
"Evaluation of a high-order Ambisonics decoder for irregular loudspeaker arrays through reproduced field measurements,"
J. Acoust. Soc. Am., vol. 135, no. 4, p. 2394, Apr. 2014.
167th meeting of the Acoustical Society of America, May 2014.
- T. Okamoto, Y. Iwaya and Y. Suzuki,
"Acoustic privacy area generation based on simple summation of numerous loudspeaker signals,"
J. Acoust. Soc. Am., vol. 131, no. 4, p. 3481, Apr. 2012.
Acoustics 2012, May 2012. [Presentation slide (PDF)]
- S. Sakamoto, J. Kodama, S. Hongo, T. Okamoto, Y. Iwaya and Y. Suzuki,
"Realization of sound space information acquisition system using a 252ch spherical microphone array,"
J. Acoust. Soc. Am., vol. 131, no. 4, p. 3256, Apr. 2012.
Acoustics 2012, May 2012.
- Proceedings of International Conferences and Symposiums (not peer-reviewed)
- T. Okamoto, Y. Iwaya and Y. Suzuki,
"Acoustic privacy technique based on simple summation by multichannel loudspeaker array,"
Proc. The Joint Int. Conf. of the 5th Int. Symp. and the 4th Student-Organizing Int. Mini-Conf. on Inf. Electr. Syst., pp. 282–283, Feb. 2012.
- J. Trevino, T. Okamoto, Y. Iwaya and Y. Suzuki,
"Three dimensional auditory display using Ambisonics with irregular loudspeaker arrays,"
Proc. The Joint Int. Conf. of the 5th Int. Symp. and the 4th Student-Organizing Int. Mini-Conf. on Inf. Electr. Syst., pp. 280–281, Feb. 2012.
- C-S. Han, T. Okamoto, Y. Iwaya and Y. Suzuki,
"Loudspeaker arrangements for transaural auditory displays robust to listener's head rotation,"
Proc. The Joint Int. Conf. of the 5th Int. Symp. and the 4th Student-Organizing Int. Mini-Conf. on Inf. Electr. Syst., pp. 278–279, Feb. 2012.
- Y. Suzuki, T. Okamoto, J. Trevino, Z.-L. Cui, Y. Iwaya, S. Sakamoto and M. Otani,
"Toward realizing high sense-of-presence communications with 3D spatial sound systems,"
Proc. The Joint Int. Conf. of the 5th Int. Symp. and the 4th Student-Organizing Int. Mini-Conf. on Inf. Electr. Syst., pp. 124–149, Feb. 2012.
- T. Okamoto, Y. Iwaya, S. Sakamoto and Y. Suzuki,
"Implementation of higher order Ambisonics recording array with 121 microphones,"
Proc. 3rd Student Organizing Int. Mini-Conf. on Inf. Electr. Syst., pp. 71–72, Oct. 2010. [Presentation slide (PDF)]
- Y. Suzuki, Y. Iwaya, S. Sakamoto and T. Okamoto,
"Development of acoustic systems realizing communications with high sense-of-presence,"
Proc. 4th Int. Symp. on Inf. Electr. Syst., pp. 52–59, July 2010.
- J. Trevino, T. Okamoto, Y. Iwaya and Y. Suzuki,
"Development of a higher order ambisonics encoder and decoder for a 157-channel surround speaker array,"
Proc. 5th Int. Symp. on Medical, Bio- and Nano-Electr., pp. 137–138, Feb. 2010.
- C-S. Han, T. Okamoto, Y. Iwaya and Y. Suzuki,
"Dynamic crosstalk canceller for a transaural system responsive to head rotation,"
Proc. 5th Int. Symp. on Medical, Bio- and Nano-Electr., pp. 135–136, Feb. 2010.
- S-J. Hahm, T. Okamoto, H-S. Wey, Y. Suzuki and S. Makino,
"New approaches to the network-oriented information technologies for the pleasant sound communications,"
Proc. 1st Student Organizing Int. Mini-Conf. on Inf. Electr. Syst., p. 219, Oct. 2008.
- H-S. Wey, T. Okamoto, S-J. Hahm, and D-G. Kang,
"Toward the realization of network-oriented pleasant sound communication systems,"
Proc. 1st Int. Symp. on Center of Education and Research for Inf. Electr. Syst. Global Center of Excellence 2007, PO-27, Nov. 2007.
- T. Okamoto, R. Nishimura and Y. Iwaya,
"Estimation of sound source positions using a surrounding microphone array,"
Proc. Joint Int. Conf. of 4th Int. Symp. on Syst. Construction of Global-Network-Oriented Inform. Electr. and Student-Organizing Int. Mini-Conf. on Inf. Electr. Syst., pp. 328–329, Jan. 2007. [Presentation slide (PDF)]
Awards
- The 57th Sato Prize Paper Award
- The 32nd Awaya Prize Young Researcher Award
Grants
- Grant-in-Aid for Scientific Research (C) : (18K11387)
from JSPS, Apr. 2018 to Mar. 2022.
- Grant-in-Aid for Young Scientists (B) : (15K21674)
from JSPS, Apr. 2015 to Mar. 2018.
- Grant-in-Aid for Young Scientists (B) : (25871208)
from JSPS, Apr. 2013 to Mar. 2015.
- ICA Young Scientist Award
- Grant-in-Aid for JSPS Fellows (DC2) : (19-55071)
from JSPS, Nov. 2007 to Mar. 2009.
Educational Background
- 2000.4 – 2004.3: Bachelor student, School of Engineering, Tohoku University, Japan
- 2004.4 – 2006.3: Master student, Graduate School of Information Sciences, Tohoku University, Japan
- 2006.4 – 2009.3: Ph.D. student, Graduate School of Information Sciences, Tohoku University, Japan
Professional Career
- 2007.6 – 2007.10: Research Assistant, Tohoku University, Japan
- 2007.11 – 2009.3: Research Fellow (DC2), Japan Society for the Promotion of Science, Japan
- 2009.4 – 2009.6: Postdoctoral Research Fellow, Tohoku University, Japan
- 2009.7 – 2012.3: COE Fellow, Tohoku University, Japan
- 2012.4 – 2020.6: Researcher, National Institute of Information and Communications Technology, Japan
- 2020.7 – 2024.3: Senior Researcher, National Institute of Information and Communications Technology, Japan
- 2024.4 – present: Research Manager, National Institute of Information and Communications Technology, Japan
- 2013.9 – 2015.3: Part-time Lecturer, Doshisha University, Japan
- 2019.9 – 2020.3: Part-time Lecturer, Kansai University, Japan
- 2023.4 – present: Visiting Associate Professor, Tohoku University, Japan
Academic Society Membership
Peer Reviewing
- IEEE Journal of Selected Topics in Signal Processing (1)
- IEEE/ACM Transactions on Audio, Speech, and Language Processing (17)
- IEEE Signal Processing Letters (9)
- Journal of the Acoustical Society of America (3)
- Speech Communication (5)
- Applied Acoustics (4)
- Applied Sciences (3)
- EURASIP Journal on Audio, Speech, and Music Processing (1)
- IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences (E) (3)
- Acoustical Science and Technology (16)
- Advanced Robotics (1)
- Journal of Information Hiding and Multimedia Signal Processing (1)
- Book chapters of Multimedia Information Hiding Technologies and Methodologies for Controlling Data (2)
- Proceedings of ICASSP, 2020 (2), 2021 (4), 2022 (5), 2023 (6), 2024 (4)
- Proceedings of Interspeech, 2022 (5), 2023 (6), 2024 (10)
- Proceedings of WASPAA, 2021 (1)
- Proceedings of ASRU, 2023 (4)
- Proceedings of SLT, 2022 (3)
- Proceedings of International Symposium on Room Acoustics 2010 (1)
- Journal of the Acoustical Society of Japan (2)
- IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences (J) (8)
- Journal of the Virtual Reality Society of Japan (2)