Darius Petermann
Machine Learning @ Netflix | Ex. Apple, Google, Amazon, MERL
Verified email at iu.edu
Title
Cited by
Year
The cocktail fork problem: Three-stem audio separation for real-world soundtracks
D Petermann, G Wichern, ZQ Wang, J Le Roux
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Cited by: 49 | Year: 2022
Deep learning based source separation applied to choir ensembles
D Petermann, P Chandna, H Cuesta, J Bonada, E Gómez
arXiv preprint arXiv:2008.07645, 2020
Cited by: 42 | Year: 2020
VERSA: A versatile evaluation toolkit for speech, audio, and music
J Shi, H Shim, J Tian, S Arora, H Wu, D Petermann, JQ Yip, Y Zhang, ...
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of …, 2025
Cited by: 40 | Year: 2025
Harp-net: Hyper-autoencoded reconstruction propagation for scalable neural audio coding
D Petermann, S Beack, M Kim
2021 IEEE Workshop on applications of signal processing to audio and …, 2021
Cited by: 24 | Year: 2021
Hyperbolic audio source separation
D Petermann, G Wichern, A Subramanian, J Le Roux
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
Cited by: 22 | Year: 2023
Tackling the cocktail fork problem for separation and transcription of real-world soundtracks
D Petermann, G Wichern, AS Subramanian, ZQ Wang, J Le Roux
IEEE/ACM Transactions on Audio, Speech, and Language Processing 31, 2592-2605, 2023
Cited by: 13 | Year: 2023
Discrete Audio Tokens: More Than a Survey!
P Mousavi, G Maimon, A Moumen, D Petermann, J Shi, H Wu, H Yang, ...
arXiv preprint arXiv:2506.10274, 2025
Cited by: 11 | Year: 2025
A deep-learning based framework for source separation, analysis, and synthesis of choral ensembles
P Chandna, H Cuesta, D Petermann, E Gómez
Frontiers in Signal Processing 2, 808594, 2022
Cited by: 10 | Year: 2022
Hyperbolic distance-based speech separation
D Petermann, M Kim
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
Cited by: 6 | Year: 2024
Audio Source Separation using Hyperbolic Embeddings
G Wichern, J Le Roux, D Petermann, AS Subramanian
US Patent App. 18/191,417, 2024
Cited by: 5 | Year: 2024
SpaIn-Net: Spatially-informed stereophonic music source separation
D Petermann, M Kim
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Cited by: 3 | Year: 2022
Native multi-band audio coding within hyper-autoencoded reconstruction propagation networks
D Petermann, I Jang, M Kim
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
Cited by: 2 | Year: 2023
Seeing Sound: Assembling Sounds from Visuals for Audio-to-Image Generation
D Petermann, MM Kalayeh
arXiv preprint arXiv:2501.05413, 2025
Year: 2025
Audio signal encoding/decoding method and apparatus for performing the same
W Lim, SK Beack, J Sung, TJ Lee, B CHO, M Kim, D Petermann
US Patent App. 18/473,791, 2024
Year: 2024
SATB Voice Segregation For Monoaural Recordings
D Petermann
https://zenodo.org/record/4091247, 2020
Year: 2020
Source Number Estimation for Monoaural Choral Recording
D Petermann
An Overview of the Current Streaming Spectral Capabilities and Associated Parallel Computing and Research in Csound, and How It Relates To “Research in High Performance …
D Petermann