- Taipei, Taiwan
- kwchang.org
Highlights
- Pro
-
speech-trident Public
Awesome speech/audio LLMs, representation learning, and codec models
-
Codec-SUPERB Public
Forked from voidful/Codec-SUPERBAudio Codec Speech processing Universal PERformance Benchmark
Python UpdatedJun 8, 2024 -
awesome-llm-role-playing-with-persona Public
Forked from Neph0s/awesome-llm-role-playing-with-personaAwesome-llm-role-playing-with-persona: a curated list of resources for large language models for role-playing with assigned personas
UpdatedMay 30, 2024 -
seamless_communication_emo Public
Forked from facebookresearch/seamless_communicationFoundational Models for State-of-the-Art Speech and Text Translation
-
ML2021-Spring Public
**Official** 李宏毅 (Hung-yi Lee) 機器學習 Machine Learning 2021 Spring
-
SpeechPrompt-v2 Public
《SpeechPrompt v2: Prompt Tuning for Speech Classification Tasks》Speech processing with prompting paradigm
-
AudioCodec-Hub Public
AudioCodec-Hub is a Python library for encoding and decoding audio data, supporting various neural audio codec models
-
AudioDec Public
Forked from facebookresearch/AudioDecAn Open-source Streaming High-fidelity Neural Audio Codec
Python Other UpdatedSep 18, 2023 -
SpeechPrompt Public
**Interspeech 2022** 《SpeechPrompt: An Exploration of Prompt Tuning on Generative Spoken Language Model for Speech Processing Tasks》Speech processing with prompting paradigm
-
Speech-Prompts-Adapters Public
This Repository surveys the paper focusing on Prompting and Adapters for Speech Processing.
-
speech-language-model Public
Forked from umbertocappellazzo/speech-language-modelA collection of papers related to speech language models
1 UpdatedJul 28, 2023 -
-
SpeechGen Public
《SpeechGen: Unlocking the Generative Power of Speech Language Models with Prompts》
-
-
-
Taiwanese-Whisper Public
fine-tune Whipser model for Taiwanese speech recognition
-
vision Public
Forked from pytorch/visionDatasets, Transforms and Models specific to Computer Vision
Python BSD 3-Clause "New" or "Revised" License UpdatedFeb 19, 2023 -
-
Taiwanese-Translation Public
Taiwanese Translation with BERT based model and RNN. Collection of Taiwanese text corpus
-
FastSpeech2 Public
Multi-Speaker Pytorch FastSpeech2: Fast and High-Quality End-to-End Text to Speech ✊
-
Taiwanese-Speech-Synthesis Public
Taiwanese Speech Synthesis with Tacotron2
-
RobustVC Public
**ICASSP 2022** 《Toward Degradation-Robust Voice Conversion》Using speech enhancement and end-to-end denoising training to improve degradation / adversarial robustness of VC models.
-
-
-
s3prl Public
Forked from s3prl/s3prlSelf-Supervised Speech Pre-training and Representation Learning Toolkit.
-
-
-
FlappyBird Public
🔥 Super Flappy Bird in p5.js
-
-
TensorFlowTTS Public
Forked from TensorSpeech/TensorFlowTTS😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, Korean, Chinese and Easy to adapt for other languages)