Skip to main content
0
C

ChatTTS

A generative speech model for daily dialogue.

Rating

0.0

Votes

0

score

Downloads

0

total

Price

Free

Access token required

Works With

Claude CodeCursorWindsurfVS CodeDeveloper tool

About

ChatTTS

A generative speech model for daily dialogue.

](https://github.com/2noise/ChatTTS/blob/main/LICENSE) [

](https://huggingface.co/2Noise/ChatTTS) [ [](https://discord.gg/Ud5Jxgx5yD)

English | **简体中文** | **日本語** | **Русский** | **Español** | **Français** | **한국어**

Introduction

[!Note] This repo contains the algorithm infrastructure and some simple examples.
!Tip] For the extended end-user products, please refer to the index repo [Awesome-ChatTTS maintained by the community. You can find a diagram visualization of the codebase here.

ChatTTS is a text-to-speech model designed specifically for dialogue scenarios such as LLM assistant.

Supported Languages

  • [x] English
  • [x] Chinese
  • [ ] Coming Soon...

Highlights

You can refer to [this video on Bilibili](https://www.bilibili.com/video/BV1zn4y1o7iV) for the detailed description.
  1. 1.Conversational TTS: ChatTTS is optimized for dialogue-based tasks, enabling natural and expressive speech synthesis. It supports multiple speakers, facilitating interactive conversations.
  2. 2.Fine-grained Control: The model could predict and control fine-grained prosodic features, including laughter, pauses, and interjections.
  3. 3.Better Prosody: ChatTTS surpasses most of open-source TTS models in terms of prosody. We provide pretrained models to support further research and development.

Dataset & Model

[!Important] The released model is for academic purposes only.
  • The main model is trained with Chinese and English audio data of 100,000+ hours.
  • The open-source version on [HuggingFace](https://huggingface.co/2Noise/ChatTTS) is a 40,000 hours pre-trained model without SFT.

Roadmap

  • [x] Open-source the 40k-hours-base model and spk_stats file.
  • [x] Streaming audio generation.
  • [x] Open-source DVAE encoder and zero shot inferring code.
  • [ ] Multi-emotion controlling.
  • [ ] ChatTTS.cpp (new repo in 2noise org is welcomed)

Licenses

#### The Code

The code is published under AGPLv3+ license.

#### The model

The model is published under CC BY-NC 4.0 license. It is intended for educational and research use, and should not be used for any commercial or illegal purposes. The authors do not guarantee the accuracy, completeness, or reliability of the information. The information and data used in this repo, are for academic and research purposes only. The data obtained from publicly available sources, and the authors do not claim any ownership or copyright over the data.

Disclaimer

Don't lose this

Three weeks from now, you'll want ChatTTS again. Will you remember where to find it?

Save it to your library and the next time you need ChatTTS, it’s one tap away — from any AI app you use. Group it into a bench with the rest of the team for that kind of task and you can pull the whole stack at once.

⚡ Pro tip for geeks: add a-gnt 🤵🏻‍♂️ as a custom connector in Claude or a custom GPT in ChatGPT — one click and your library is right there in the chat. Or, if you’re in an editor, install the a-gnt MCP server and say “use my [bench name]” in Claude Code, Cursor, VS Code, or Windsurf.

🤵🏻‍♂️

a-gnt's Take

Our honest review

A generative speech model for daily dialogue. Best for anyone looking to make their AI assistant more capable in communication. It's completely free and works across most major AI apps. This one just landed in the catalog — worth trying while it's fresh.

Tips for getting started

1

Tap "Get" above, pick your AI app, and follow the steps. Most installs take under 30 seconds.

What's New

Version 1.0.06 days ago

Imported from GitHub

Ratings & Reviews

0.0

out of 5

0 ratings

No reviews yet. Be the first to share your experience.