1. ChatGPT Alternative
  2. /ChatTTS

ChatTTS

ChatTTS is a text-to-speech model designed specifically for dialogue scenarios such as LLM assistant.

View Website
Repository details:
  • Stars

    31,658
  • Forks

    3,445
  • Last commit

    1 month ago
Written in:
Python (99%)
View Repository
Screenshot of ChatTTS website

ChatTTS

A generative speech model for daily dialogue.

Introduction

[!Note] This repo contains the algorithm infrastructure and some simple examples.

[!Tip] For the extended end-user products, please refer to the index repo Awesome-ChatTTS maintained by the community.

ChatTTS is a text-to-speech model designed specifically for dialogue scenarios such as LLM assistant.

Supported Languages

  • [x] English
  • [x] Chinese
  • [ ] Coming Soon...

Highlights

You can refer to this video on Bilibili for the detailed description.

  1. Conversational TTS: ChatTTS is optimized for dialogue-based tasks, enabling natural and expressive speech synthesis. It supports multiple speakers, facilitating interactive conversations.
  2. Fine-grained Control: The model could predict and control fine-grained prosodic features, including laughter, pauses, and interjections.
  3. Better Prosody: ChatTTS surpasses most of open-source TTS models in terms of prosody. We provide pretrained models to support further research and development.

Dataset & Model

[!Important] The released model is for academic purposes only.

  • The main model is trained with Chinese and English audio data of 100,000+ hours.
  • The open-source version on HuggingFace is a 40,000 hours pre-trained model without SFT.

Roadmap

  • [x] Open-source the 40k-hours-base model and spk_stats file.
  • [x] Streaming audio generation.
  • [x] Open-source DVAE encoder and zero shot inferring code.
  • [ ] Multi-emotion controlling.
  • [ ] ChatTTS.cpp (new repo in 2noise org is welcomed)
Share:
Repository details:
  • Stars

    31,658
  • Forks

    3,445
  • Last commit

    1 month ago
Written in:
Python (99%)
View Repository
back