Spaces:
No application file
No application file
File size: 1,258 Bytes
3883c60 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 |
# Features
* [x] π Text-to-audio
* [x] π£ Text-to-speech
* [x] πΆ [Bark](https://github.com/suno-ai/bark)
* [x] π£ Speech generation
* [x] 𧬠Voice cloning
* [x] π Basic voice cloning
* [x] 𧬠[Accurate voice cloning](https://github.com/gitmylo/bark-voice-cloning-HuBERT-quantizer)
* [x] π€£ Disable stopping token option to let the AI decide how it wants to continue
* [x] π΅ [AudioLDM](https://github.com/haoheliu/AudioLDM) text-to-audio generation
* [x] π΅ [AudioCraft](https://github.com/facebookresearch/audiocraft) text-to-audio generation
* [x] π Audio-to-audio
* [x] πΆ Bark audio-to-audio using [a custom quantizer](https://github.com/gitmylo/bark-voice-cloning-HuBERT-quantizer) to deconstruct audio for bark input
* [x] π [RVC](https://github.com/RVC-Project/Retrieval-based-voice-conversion-webui) (retrieval based voice conversion)
* [x] 𧬠RVC training
* [x] πΈ [coqui-ai/TTS](https://github.com/coqui-ai/TTS) text-to-speech
* [x] π€ Automatic-speech-recognition
* [x] π€ [Whisper](https://github.com/openai/whisper) speech recognition
* [x] π [Extensions](extensions/index.md)
* [x] π Python
* [x] π Javascript
* [x] ποΈ Styling
|