
Features

  • 🔊 Text-to-audio
    • 🗣 Text-to-speech
      • 🐶 Bark
        • 🗣 Speech generation
        • 🧬 Voice cloning
        • 🤣 Option to disable the stopping token and let the AI decide how it wants to continue
    • 🎵 AudioLDM text-to-audio generation
    • 🎵 AudioCraft text-to-audio generation
  • 🔊 Audio-to-audio
    • 🐶 Bark audio-to-audio, using a custom quantizer to deconstruct audio for Bark input
    • 😎 RVC (retrieval-based voice conversion)
      • 🧬 RVC training
      • 🐸 coqui-ai/TTS text-to-speech
  • 🎤 Automatic speech recognition
    • 🎤 Whisper speech recognition
  • 🚀 Extensions
    • 🐍 Python
    • 📜 JavaScript
    • 🖌️ Styling
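
The "disable the stopping token" option listed under Bark can be illustrated with a toy sampler. This is only a sketch of the general idea (masking the end-of-sequence logit so generation cannot stop early), not Bark's or audio-webui's actual code. The names `EOS`, `sample_tokens` and `toy_logits` are hypothetical and exist only for this example.

```python
import math
import random

EOS = 0  # hypothetical id of the stopping token in this toy vocabulary


def sample_tokens(logits_fn, max_len, allow_eos=True, seed=0):
    """Sample token ids until EOS is drawn or max_len is reached.

    With allow_eos=False the EOS logit is masked to -inf, so the sampler
    can never stop early and always returns max_len tokens.
    """
    rng = random.Random(seed)
    tokens = []
    while len(tokens) < max_len:
        logits = list(logits_fn(tokens))
        if not allow_eos:
            logits[EOS] = -math.inf  # probability of stopping becomes zero
        # Softmax weights, shifted by the max logit for numerical stability.
        peak = max(logits)
        weights = [math.exp(x - peak) for x in logits]
        token = rng.choices(range(len(logits)), weights=weights)[0]
        if token == EOS:
            break  # the stopping token itself is not kept in the output
        tokens.append(token)
    return tokens


def toy_logits(prefix):
    # Toy "model" over 4 tokens that strongly prefers to stop after 3 tokens.
    return [8.0 if len(prefix) >= 3 else -8.0, 1.0, 1.0, 1.0]


if __name__ == "__main__":
    print(len(sample_tokens(toy_logits, max_len=10, allow_eos=True)))
    print(len(sample_tokens(toy_logits, max_len=10, allow_eos=False)))
```

With the stopping token allowed, the toy model usually ends shortly after three tokens. With it disabled, the sampler always runs to `max_len`, which is the "let the AI decide how it wants to continue" behavior in miniature.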