Shusuke Takahashi

Cited by

	All	Since 2021
Citations	1157	1106
h-index	17	17
i10-index	25	24

440

220

110

330

201720182019202020212022202320242025202611 6 7 3 44 124 198 300 428 9

Public access

View all

2 articles

0 articles

available

not available

Based on funding mandates

Shusuke Takahashi

Sony Group Corporation

Verified email at sony.com

audio signal processing


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
ACCDOA: Activity-coupled cartesian direction of arrival representation for sound event localization and detection K Shimada, Y Koyama, N Takahashi, S Takahashi, Y Mitsufuji ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	153	2021
Multi-ACCDOA: Localizing and detecting overlapping sounds from the same class with auxiliary duplicating permutation invariant training K Shimada, Y Koyama, S Takahashi, N Takahashi, E Tsunoo, Y Mitsufuji ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	134	2022
STARSS22: A dataset of spatial recordings of real scenes with spatiotemporal annotations of sound events A Politis, K Shimada, P Sudarsanam, S Adavanne, D Krause, Y Koyama, ... arXiv preprint arXiv:2206.01948, 2022	120	2022
STARSS23: An Audio-Visual Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events K Shimada, A Politis, P Sudarsanam, D Krause, K Uchida, S Adavanne, ... arXiv preprint arXiv:2306.09126, 2023	103	2023
SQ-VAE: Variational Bayes on Discrete Representation with Self-annealed Stochastic Quantization Y Takida, T Shibuya, WH Liao, CH Lai, J Ohmura, T Uesaka, N Murata, ... arXiv preprint arXiv:2205.07547, 2022	101	2022
All for one and one for all: Improving music separation by bridging networks R Sawata, S Uhlich, S Takahashi, Y Mitsufuji ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	76	2021
Ensemble of ACCDOA-and EINV2-based systems with D3Nets and impulse response simulation for sound event localization and detection K Shimada, N Takahashi, Y Koyama, S Takahashi, E Tsunoo, ... arXiv preprint arXiv:2106.10806, 2021	37	2021
Diffusion-Based Speech Enhancement with Joint Generative and Predictive Decoders H Shi, K Shimada, M Hirano, T Shibuya, Y Koyama, Z Zhong, S Takahashi, ... arXiv preprint arXiv:2305.10734, 2023	33	2023
Diffroll: Diffusion-Based Generative Music Transcription with Unsupervised Pretraining Capability KW Cheuk, R Sawata, T Uesaka, N Murata, N Takahashi, S Takahashi, ... ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	27	2023
Diffiner: A Versatile Diffusion-based Generative Refiner for Speech Enhancement R Sawata, N Murata, Y Takida, T Uesaka, T Shibuya, S Takahashi, ...	27*
Preventing oversmoothing in VAE via generalized variance parameterization Y Takida, WH Liao, CH Lai, T Uesaka, S Takahashi, Y Mitsufuji Neurocomputing 509, 137-156, 2022	26	2022
An Attention-Based Approach to Hierarchical Multi-Label Music Instrument Classification Z Zhong, M Hirano, K Shimada, K Tateishi, S Takahashi, Y Mitsufuji ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	23	2023
Sound event localization and detection using activity-coupled cartesian DOA vector and RD3NET K Shimada, N Takahashi, S Takahashi, Y Mitsufuji arXiv preprint arXiv:2006.12014, 2020	23	2020
Spatial data augmentation with simulated room impulse responses for sound event localization and detection Y Koyama, K Shigemi, M Takahashi, K Shimada, N Takahashi, E Tsunoo, ... ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	20	2022
Improving Character Error Rate is Not Equal to Having Clean Speech: Speech Enhancement for ASR Systems with Black-Box Acoustic Models R Sawata, Y Kashiwagi, S Takahashi ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	19	2022
Preventing posterior collapse induced by oversmoothing in gaussian vae Y Takida, WH Liao, T Uesaka, S Takahashi, Y Mitsufuji arXiv e-prints, arXiv: 2102.08663, 2021	19	2021
Elementary real-time implementation of a virtual acoustic display based on ADVISE S Takane, S Takahashi, Y Suzuki, T Miyajima Acoustical science and technology 24 (5), 304-310, 2003	19	2003
Specmaskgit: Masked generative modeling of audio spectrograms for efficient audio synthesis and beyond M Comunità, Z Zhong, A Takahashi, S Yang, M Zhao, K Saito, Y Ikemiya, ... arXiv preprint arXiv:2406.17672, 2024	17	2024
The Sound Demixing Challenge 2023$\unicode {x2013} $ Cinematic Demixing Track S Uhlich, G Fabbro, M Hirano, S Takahashi, G Wichern, JL Roux, ... arXiv preprint arXiv:2308.06981, 2023	15	2023
Zero-and Few-shot Sound Event Localization and Detection K Shimada, K Uchida, Y Koyama, T Shibuya, S Takahashi, Y Mitsufuji, ... arXiv preprint arXiv:2309.09223, 2023	14	2023

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by