[go: up one dir, main page]

Skip to content
View framsc's full-sized avatar
😀
😀

Block or report framsc

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Deep Learning for humans

Python 62,065 19,483 Updated Nov 18, 2024

C++ image processing and machine learning library with using of SIMD: SSE, AVX, AVX-512, AMX for x86/x64, NEON for ARM.

C++ 2,070 416 Updated Nov 18, 2024

FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/

C++ 1,210 501 Updated Nov 18, 2024

Data manipulation and transformation for audio signal processing, powered by PyTorch

Python 2,542 654 Updated Nov 18, 2024

An Open Source Machine Learning Framework for Everyone

C++ 186,422 74,312 Updated Nov 18, 2024
1 Updated Nov 18, 2024

Google Research

Jupyter Notebook 34,318 7,920 Updated Nov 15, 2024

A PyTorch-based Speech Toolkit

Python 8,934 1,398 Updated Nov 14, 2024

A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.

Python 1,874 191 Updated Nov 13, 2024

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook 6,339 781 Updated Nov 11, 2024

Fast, modern C++ DSP framework, FFT, Sample Rate Conversion, FIR/IIR/Biquad Filters (SSE, AVX, AVX-512, ARM NEON)

C++ 1,670 256 Updated Nov 10, 2024

Recurrent neural network for audio noise reduction

Rust 249 19 Updated Nov 9, 2024

Production First and Production Ready End-to-End Speech Recognition Toolkit

Python 4,180 1,077 Updated Nov 8, 2024

《机器学习》(西瓜书)公式详解

24,058 4,756 Updated Nov 8, 2024

Tools for handling speech data in machine learning projects.

Python 955 218 Updated Nov 7, 2024

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 35,765 4,081 Updated Nov 7, 2024

Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement

Python 316 45 Updated Oct 28, 2024

🌎 machine learning tutorials (mainly in Python3)

HTML 3,194 650 Updated Oct 24, 2024

This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsupervised Segment-Based Robust Voice Activity Detection Method.

Python 126 23 Updated Oct 23, 2024

Unsupervised Speech Decomposition Via Triple Information Bottleneck

Python 647 92 Updated Oct 23, 2024

Noise supression using deep filtering

Python 2,537 235 Updated Oct 17, 2024

kaldi-asr/kaldi is the official location of the Kaldi project.

Shell 14,291 5,324 Updated Oct 4, 2024

《金庸群侠传》c++复刻版,已完工

C++ 2,612 372 Updated Oct 2, 2024

A flexible and efficient deep neural network (DNN) compiler that generates high-performance executable from a DNN model description.

C++ 961 163 Updated Sep 19, 2024

Google's Engineering Practices documentation

20,010 1,952 Updated Sep 19, 2024

Tengine is a lite, high performance, modular inference engine for embedded device

C++ 4,653 998 Updated Sep 15, 2024

Trax — Deep Learning with Clear Code and Speed

Python 8,100 816 Updated Sep 10, 2024

Different implementations of "Weighted Prediction Error" for speech dereverberation

Python 493 164 Updated Sep 10, 2024

Espresso: A Fast End-to-End Neural Speech Recognition Toolkit

Python 942 116 Updated Sep 4, 2024

CREPE: A Convolutional REpresentation for Pitch Estimation -- pre-trained model (ICASSP 2018)

Python 1,122 160 Updated Aug 19, 2024
Next