Stars
FFmpeg实现视频裁剪、水印、转码、编解码、转Gif动图;FFmpeg本地推流、H264与RTMP实时推流直播;OpenGL滤镜特效,视频拍摄。音视频学习路线,音视频知识总结、流媒体协议
The official implementation of GTCRN, an ultra-lite speech enhancement model.
Graph Neural Networks for Sound Source Localization
Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement
ManyEars Sound Source Localization, Tracking and Separation
Noise supression using deep filtering
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
嵌入式经典书籍分享,C程序员常读书单整理,含下载地址,成体系提升技术能力。书籍资源包括电子基础、C/C++、Arm架构、Linux、网络、设计模式、各类行业报告等等。
This rep contains awesome adaptive filter algorithms in 3 classic books.
Recurrent neural network for audio noise reduction
This is a Real-time howling detection and suppression algorithm using Matlab simulink.
An interactive introduction to the polyphase filterbank technique for radio astronomy spectrometers
Mixed-Radix Cooley-Tukey FFT and Inverse FFT in Matlab
This is the newest virtual test environment for PEM-AFC2 in MATLAB
A New Perspective of Auxiliary-Function-Based Independent Component Analysis in Acoustic Echo Cancellation
chapro is a library of modular functions that implement signal processing intended for hearing aids.
Classical algorithms of sound source localization with beamforming, TDOA and high-resolution spectral estimation.
Generalised Sidelobe Canceler beamformer for an array of microphones using matlab
An official reimplementation of the method described in the INTERSPEECH 2021 paper - Speech Resynthesis from Discrete Disentangled Self-Supervised Representations.
Unofficial Multi-microphone complex spectral mapping for utterance-wise and continuous speech separation(MISO-BF-MISO)
This code can simulate the MATLAB environment of uniform linear microphone array. It can define room size, reverberation degree, the number and location of microphones, and reduce the dependence on…
Submission to the 1st Acoustic Echo Cancellation Challenge, Microsoft and ICASSP 2021.
Spectral Subtraction, Wiener Filtering, MMSE
Unofficial implementation of PercepNet: A Perceptually-Motivated Approach for Low-Complexity, Real-Time Enhancement of Fullband Speech
Real-time GCC-NMF Blind Speech Separation and Enhancement
Convert a mono input stream to a stereo one