Spaces:

mrtroydev
/

audio-webui

No application file

File size: 1,258 Bytes

3883c60

# Features
* [x] 🔊 Text-to-audio
  * [x] 🗣 Text-to-speech
    * [x] 🐶 [Bark](https://github.com/suno-ai/bark)
      * [x] 🗣 Speech generation
      * [x] 🧬 Voice cloning
        * [x] 👍 Basic voice cloning
        * [x] 🧬 [Accurate voice cloning](https://github.com/gitmylo/bark-voice-cloning-HuBERT-quantizer)
      * [x] 🤣 Disable stopping token option to let the AI decide how it wants to continue
  * [x] 🎵 [AudioLDM](https://github.com/haoheliu/AudioLDM) text-to-audio generation
  * [x] 🎵 [AudioCraft](https://github.com/facebookresearch/audiocraft) text-to-audio generation
* [x] 🔊 Audio-to-audio
  * [x] 🐶 Bark audio-to-audio using [a custom quantizer](https://github.com/gitmylo/bark-voice-cloning-HuBERT-quantizer) to deconstruct audio for bark input
  * [x] 😎 [RVC](https://github.com/RVC-Project/Retrieval-based-voice-conversion-webui) (retrieval based voice conversion)
    * [x] 🧬 RVC training
    * [x] 🐸 [coqui-ai/TTS](https://github.com/coqui-ai/TTS) text-to-speech
* [x] 🎤 Automatic-speech-recognition
  * [x] 🎤 [Whisper](https://github.com/openai/whisper) speech recognition
* [x] 🚀 [Extensions](extensions/index.md)
  * [x] 🐍 Python
  * [x] 📜 Javascript
  * [x] 🖌️ Styling