| ACCDOA: Activity-coupled cartesian direction of arrival representation for sound event localization and detection K Shimada, Y Koyama, N Takahashi, S Takahashi, Y Mitsufuji ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 153 | 2021 |
| Multi-ACCDOA: Localizing and detecting overlapping sounds from the same class with auxiliary duplicating permutation invariant training K Shimada, Y Koyama, S Takahashi, N Takahashi, E Tsunoo, Y Mitsufuji ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 134 | 2022 |
| STARSS22: A dataset of spatial recordings of real scenes with spatiotemporal annotations of sound events A Politis, K Shimada, P Sudarsanam, S Adavanne, D Krause, Y Koyama, ... arXiv preprint arXiv:2206.01948, 2022 | 120 | 2022 |
| STARSS23: An Audio-Visual Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events K Shimada, A Politis, P Sudarsanam, D Krause, K Uchida, S Adavanne, ... arXiv preprint arXiv:2306.09126, 2023 | 103 | 2023 |
| SQ-VAE: Variational Bayes on Discrete Representation with Self-annealed Stochastic Quantization Y Takida, T Shibuya, WH Liao, CH Lai, J Ohmura, T Uesaka, N Murata, ... arXiv preprint arXiv:2205.07547, 2022 | 101 | 2022 |
| All for one and one for all: Improving music separation by bridging networks R Sawata, S Uhlich, S Takahashi, Y Mitsufuji ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 76 | 2021 |
| Ensemble of ACCDOA-and EINV2-based systems with D3Nets and impulse response simulation for sound event localization and detection K Shimada, N Takahashi, Y Koyama, S Takahashi, E Tsunoo, ... arXiv preprint arXiv:2106.10806, 2021 | 37 | 2021 |
| Diffusion-Based Speech Enhancement with Joint Generative and Predictive Decoders H Shi, K Shimada, M Hirano, T Shibuya, Y Koyama, Z Zhong, S Takahashi, ... arXiv preprint arXiv:2305.10734, 2023 | 33 | 2023 |
| Diffroll: Diffusion-Based Generative Music Transcription with Unsupervised Pretraining Capability KW Cheuk, R Sawata, T Uesaka, N Murata, N Takahashi, S Takahashi, ... ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 27 | 2023 |
| Diffiner: A Versatile Diffusion-based Generative Refiner for Speech Enhancement R Sawata, N Murata, Y Takida, T Uesaka, T Shibuya, S Takahashi, ... | 27* | |
| Preventing oversmoothing in VAE via generalized variance parameterization Y Takida, WH Liao, CH Lai, T Uesaka, S Takahashi, Y Mitsufuji Neurocomputing 509, 137-156, 2022 | 26 | 2022 |
| An Attention-Based Approach to Hierarchical Multi-Label Music Instrument Classification Z Zhong, M Hirano, K Shimada, K Tateishi, S Takahashi, Y Mitsufuji ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 23 | 2023 |
| Sound event localization and detection using activity-coupled cartesian DOA vector and RD3NET K Shimada, N Takahashi, S Takahashi, Y Mitsufuji arXiv preprint arXiv:2006.12014, 2020 | 23 | 2020 |
| Spatial data augmentation with simulated room impulse responses for sound event localization and detection Y Koyama, K Shigemi, M Takahashi, K Shimada, N Takahashi, E Tsunoo, ... ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 20 | 2022 |
| Improving Character Error Rate is Not Equal to Having Clean Speech: Speech Enhancement for ASR Systems with Black-Box Acoustic Models R Sawata, Y Kashiwagi, S Takahashi ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 19 | 2022 |
| Preventing posterior collapse induced by oversmoothing in gaussian vae Y Takida, WH Liao, T Uesaka, S Takahashi, Y Mitsufuji arXiv e-prints, arXiv: 2102.08663, 2021 | 19 | 2021 |
| Elementary real-time implementation of a virtual acoustic display based on ADVISE S Takane, S Takahashi, Y Suzuki, T Miyajima Acoustical science and technology 24 (5), 304-310, 2003 | 19 | 2003 |
| Specmaskgit: Masked generative modeling of audio spectrograms for efficient audio synthesis and beyond M Comunità, Z Zhong, A Takahashi, S Yang, M Zhao, K Saito, Y Ikemiya, ... arXiv preprint arXiv:2406.17672, 2024 | 17 | 2024 |
| The Sound Demixing Challenge 2023$\unicode {x2013} $ Cinematic Demixing Track S Uhlich, G Fabbro, M Hirano, S Takahashi, G Wichern, JL Roux, ... arXiv preprint arXiv:2308.06981, 2023 | 15 | 2023 |
| Zero-and Few-shot Sound Event Localization and Detection K Shimada, K Uchida, Y Koyama, T Shibuya, S Takahashi, Y Mitsufuji, ... arXiv preprint arXiv:2309.09223, 2023 | 14 | 2023 |