Kai-Wei Chang (張凱爲) ga642381

🎯

Focusing

✊ Ph.D. student @ NTU ✊ Research Scientist Intern @ Meta

241 followers · 102 following

Taipei, Taiwan
kwchang.org

speech-trident Public

Awesome speech/audio LLMs, representation learning, and codec models

701 35 Updated Nov 18, 2024
Codec-SUPERB Public
Forked from voidful/Codec-SUPERB

Audio Codec Speech processing Universal PERformance Benchmark

Python Updated Jun 8, 2024
awesome-llm-role-playing-with-persona Public
Forked from Neph0s/awesome-llm-role-playing-with-persona

Awesome-llm-role-playing-with-persona: a curated list of resources for large language models for role-playing with assigned personas

Updated May 30, 2024
seamless_communication_emo Public
Forked from facebookresearch/seamless_communication

Foundational Models for State-of-the-Art Speech and Text Translation

Jupyter Notebook 2 Other Updated Mar 26, 2024
ML2021-Spring Public

**Official** 李宏毅 (Hung-yi Lee) 機器學習 Machine Learning 2021 Spring

machine-learning deep-learning

Jupyter Notebook 839 324 Updated Nov 9, 2023
SpeechPrompt-v2 Public

《SpeechPrompt v2: Prompt Tuning for Speech Classification Tasks》Speech processing with prompting paradigm

deep-learning prompt speech speech-processing large-language-models prompt-engineering

Python 81 4 Updated Oct 19, 2023
AudioCodec-Hub Public

AudioCodec-Hub is a Python library for encoding and decoding audio data, supporting various neural audio codec models

audio deep-learning speech pytorch audio-compression speech-codec residual-vector-quantization

Python 22 2 MIT License Updated Sep 26, 2023
AudioDec Public
Forked from facebookresearch/AudioDec

An Open-source Streaming High-fidelity Neural Audio Codec

Python Other Updated Sep 18, 2023
SpeechPrompt Public

**Interspeech 2022** 《SpeechPrompt: An Exploration of Prompt Tuning on Generative Spoken Language Model for Speech Processing Tasks》Speech processing with prompting paradigm

nlp deep-learning prompt pytorch speech-processing large-language-models prompting

Python 97 8 Updated Aug 25, 2023
Speech-Prompts-Adapters Public

This repository surveys papers on prompting and adapters for speech processing.

adapter prompt speech awesome-list papers reprogramming parameter-efficient-learning

103 5 Updated Aug 4, 2023
speech-language-model Public
Forked from umbertocappellazzo/speech-language-model

A collection of papers related to speech language models

1 Updated Jul 28, 2023
speech_quality Public

Jupyter Notebook 1 Updated Jul 6, 2023
SpeechGen Public

《SpeechGen: Unlocking the Generative Power of Speech Language Models with Prompts》

deep-learning prompt speech-processing speech-generation large-language-models speech-llm

74 5 Updated Jun 9, 2023
Kai-Wei-Chang-Talks Public

A repository sharing slides of the talks I gave

6 Updated Jun 6, 2023
Linguistics-111 Public

Jupyter Notebook Updated May 31, 2023
Taiwanese-Whisper Public

Fine-tune the Whisper model for Taiwanese speech recognition

speech speech-recognition openai whisper asr

Python 27 8 Updated Mar 23, 2023
vision Public
Forked from pytorch/vision

Datasets, Transforms and Models specific to Computer Vision

Python BSD 3-Clause "New" or "Revised" License Updated Feb 19, 2023
FinanceWeb Public

JavaScript 5 1 Updated Jan 4, 2023
Taiwanese-Translation Public

Taiwanese translation with a BERT-based model and an RNN. Includes a collection of Taiwanese text corpora.

bert taiwanese

Python 11 1 Updated Oct 15, 2022
FastSpeech2 Public

Multi-Speaker Pytorch FastSpeech2: Fast and High-Quality End-to-End Text to Speech ✊

text-to-speech pytorch tts waveglow melgan multi-speaker-tts fastspeech2

Python 93 16 Updated Oct 14, 2022
Taiwanese-Speech-Synthesis Public

Taiwanese Speech Synthesis with Tacotron2

Python 18 5 MIT License Updated Oct 2, 2022
RobustVC Public

**ICASSP 2022** 《Toward Degradation-Robust Voice Conversion》Using speech enhancement and end-to-end denoising training to improve degradation / adversarial robustness of VC models.

deep-learning speech-processing speech-enhancement

Python 23 2 MIT License Updated Sep 27, 2022
S2VC Public
Forked from howard1337/S2VC

Python 2 1 Updated Jul 20, 2022
moth Public

虫我研所 (Moth Institute), Young Designers' Exhibition (新一代設計展) https://ga642381.github.io/moth

website reactjs visual-design

8 Updated Jul 8, 2022
s3prl Public
Forked from s3prl/s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit.

Python 1 MIT License Updated Sep 26, 2021
CA2021-Final Public

Jupyter Notebook 2 Updated Jun 29, 2021
neurips2021-sas-react Public

JavaScript 2 MIT License Updated Jun 22, 2021
FlappyBird Public

🔥 Super Flappy Bird in p5.js

game bird flappy-bird p5js p5js-game

JavaScript 9 1 Updated Mar 8, 2021
TaiwaneseTTS Public

Python 8 2 Updated Dec 15, 2020
TensorFlowTTS Public
Forked from TensorSpeech/TensorFlowTTS

😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for TensorFlow 2 (supports English, Korean, and Chinese, and is easy to adapt to other languages)

Python 1 Apache License 2.0 Updated Nov 27, 2020
