| Fsd50k: an open dataset of human-labeled sound events E Fonseca, X Favory, J Pons, F Font, X Serra IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 829-852, 2021 | 760 | 2021 |
| A Wavenet for speech denoising D Rethage, J Pons, X Serra International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2018 | 632 | 2018 |
| Freesound Datasets: a platform for the creation of open audio datasets E Fonseca, J Pons, X Favory, F Font, D Bogdanov, A Ferraro, S Oramas, ... International Society for Music Information Retrieval Conference (ISMIR), 2017 | 329 | 2017 |
| End-to-end learning for music audio tagging at scale J Pons, O Nieto, M Prockup, E Schmidt, A Ehmann, X Serra International Society for Music Information Retrieval Conference (ISMIR), 2018 | 284 | 2018 |
| General-purpose tagging of freesound audio with audioset labels: Task description, dataset, and baseline E Fonseca, M Plakal, F Font, DPW Ellis, X Favory, J Pons, X Serra DCASE Workshop, 2018 | 231 | 2018 |
| Fast timing-conditioned latent audio diffusion Z Evans, CJ Carr, J Taylor, SH Hawley, J Pons Forty-first International Conference on Machine Learning, 2024 | 218 | 2024 |
| Experimenting with musically motivated convolutional neural networks J Pons, T Lidy, X Serra International Workshop on Content-Based Multimedia Indexing (CBMI), 1-6, 2016 | 218 | 2016 |
| Stable audio open Z Evans, JD Parker, CJ Carr, Z Zukowski, J Taylor, J Pons ICASSP 2025-2025 IEEE International Conference on Acoustics, Speech and …, 2025 | 198 | 2025 |
| Timbre analysis of music audio signals with convolutional neural networks J Pons, O Slizovskaia, E Gómez Gutiérrez, X Serra European Signal Processing Conference (EUSIPCO), 2813-7, 2017 | 191 | 2017 |
| MusiCNN: pre-trained convolutional neural networks for music audio tagging J Pons, X Serra Late breaking/demo session of the International Society for Music …, 2019 | 151 | 2019 |
| Universal speech enhancement with score-based diffusion J Serrà, S Pascual, J Pons, RO Araz, D Scaini arXiv preprint arXiv:2206.03065, 2022 | 145 | 2022 |
| Randomly weighted CNNs for (music) audio classification J Pons, X Serra International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2019 | 138 | 2019 |
| Long-form music generation with latent diffusion Z Evans, JD Parker, CJ Carr, Z Zukowski, J Taylor, J Pons arXiv preprint arXiv:2404.10301, 2024 | 102 | 2024 |
| End-to-end music source separation: Is it possible in the waveform domain? F Lluís, J Pons, X Serra arXiv preprint arXiv:1810.12187, 2018 | 100 | 2018 |
| Upsampling artifacts in neural audio synthesis J Pons, S Pascual, G Cengarle, J Serrà ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 98 | 2021 |
| Training neural audio classifiers with few data J Pons, J Serrà, X Serra International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2019 | 92 | 2019 |
| Automatic multitrack mixing with a differentiable mixing console of neural audio effects CJ Steinmetz, J Pons, S Pascual, J Serrà ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 88 | 2021 |
| Designing efficient architectures for modeling temporal features with convolutional neural networks J Pons, X Serra International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2017 | 88 | 2017 |
| Remixing music using source separation algorithms to improve the musical experience of cochlear implant users J Pons, J Janer, T Rode, W Nogueira The Journal of the Acoustical Society of America 140 (6), 4338-4349, 2016 | 81 | 2016 |
| An empirical study of Conv-TasNet B Kadioglu, M Horgan, X Liu, J Pons, D Darcy, V Kumar International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2020 | 68* | 2020 |