    Repositories list

    • Open source, local, and self-hosted highly optimized language inference server supporting ASR/STT, TTS, and LLM across WebRTC, REST, and WS
      Python
      35 · 0 · 0 · 0 · Updated May 28, 2023
    • FastChat (Public)
      The release repo for "Vicuna: An Open Chatbot Impressing GPT-4"
      Python
      Apache License 2.0
      4.6k · 0 · 0 · 0 · Updated May 24, 2023
    • Robust Speech Recognition via Large-Scale Weak Supervision
      C
      MIT License
      43 · 0 · 0 · 0 · Updated May 11, 2023
    • Vietnamese Speech Recognition
      Python
      48 · 0 · 0 · 0 · Updated Apr 7, 2023
    • Fine-tune Wav2vec 2.0 for Speech Recognition
      Python
      24 · 0 · 0 · 0 · Updated Apr 7, 2023
    • Speech recognition app powered by whisper.cpp
      C++
      GNU General Public License v3.0
      2 · 0 · 0 · 0 · Updated Apr 7, 2023
    • Faster Whisper transcription with CTranslate2
      Python
      MIT License
      1k · 0 · 0 · 0 · Updated Apr 7, 2023
    • capgenx (Public)
      A minimal GUI application that generates transcriptions for audio and video files using the Whisper neural network.
      C++
      MIT License
      2 · 0 · 0 · 0 · Updated Apr 7, 2023
    • Instruct-tune LLaMA on consumer hardware
      Jupyter Notebook
      Apache License 2.0
      2.2k · 0 · 0 · 0 · Updated Apr 7, 2023
    • Code and documentation to train Stanford's Alpaca models, and generate the data.
      Python
      Apache License 2.0
      4.1k · 0 · 0 · 0 · Updated Apr 7, 2023
    • gpt4all (Public)
      gpt4all: a chatbot trained on a massive collection of clean assistant data including code, stories and dialogue
      Python
      MIT License
      7.7k · 0 · 0 · 0 · Updated Apr 7, 2023
    • Alpaca-LoRA as Chatbot service
      Python
      Apache License 2.0
      381 · 0 · 0 · 0 · Updated Mar 24, 2023
    • Python
      1 · 0 · 0 · 0 · Updated Mar 10, 2023
    • HTML
      2 · 0 · 0 · 0 · Updated Mar 8, 2023
    • This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.
      Python
      BSD 3-Clause "New" or "Revised" License
      3 · 0 · 0 · 0 · Updated Feb 22, 2023
    • Whisper fine-tuning event script adapted to use multiple Hugging Face datasets
      Python
      7 · 0 · 0 · 0 · Updated Jan 11, 2023
    • Our old fine-tuning code based on ColossalAI.
      Python
      Apache License 2.0
      3 · 0 · 0 · 0 · Updated Jan 8, 2023
    • Repository containing code to fine-tune the Whisper ASR model
      HTML
      MIT License
      8 · 0 · 0 · 0 · Updated Dec 16, 2022
    • Python
      8 · 0 · 0 · 0 · Updated Nov 30, 2022
    • onnxt5 (Public)
      Summarization, translation, sentiment-analysis, text-generation and more at blazing speed using a T5 version implemented in ONNX.
      Python
      Apache License 2.0
      30 · 0 · 0 · 0 · Updated Nov 2, 2022
    • This is an optimized implementation of OpenAI's Whisper for multilingual transcription.
      Python
      9 · 0 · 0 · 0 · Updated Oct 31, 2022
    • 3D Passive Face Liveness Detection (Anti-Spoofing). Only a single image is needed to compute the liveness score. 99.67% accuracy on our dataset and perfect scores on multiple public datasets (NUAA, CASIA FASD, MSU...).
      C++
      48 · 0 · 0 · 0 · Updated Oct 16, 2022
    • Deploy your model with TensorRT quickly.
      C++
      100 · 0 · 0 · 0 · Updated Oct 16, 2022
    • NATSpeech (Public)
      A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including the official PyTorch implementations of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)
      Python
      MIT License
      100 · 0 · 0 · 0 · Updated Oct 16, 2022
    • Small C++ library to quickly deploy models using onnxruntime
      C++
      MIT License
      49 · 0 · 0 · 0 · Updated Oct 16, 2022
    • STT (Public)
      The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.
      C++
      Mozilla Public License 2.0
      278 · 0 · 0 · 0 · Updated Oct 16, 2022
    • buzz (Public)
      Buzz transcribes audio from your computer's microphones to text using OpenAI's Whisper
      Python
      MIT License
      945 · 0 · 0 · 0 · Updated Oct 16, 2022
    • OpenVINO version of openai/whisper
      Jupyter Notebook
      MIT License
      8.5k · 0 · 0 · 0 · Updated Oct 1, 2022
    • HuggingSound: A toolkit for speech-related tasks based on HuggingFace's tools
      Python
      MIT License
      43 · 0 · 0 · 0 · Updated Sep 16, 2022
    • Python ONNX inference sample for ByteTrack (Multi-Object Tracking by Associating Every Detection Box)
      Python
      MIT License
      5 · 0 · 0 · 0 · Updated Sep 1, 2022