Spaces:

kevinwang676
/

Bark-with-Voice-Cloning

Running

Apply for community grant: Personal project

by kevinwang676 - opened Apr 28, 2023

Owner Apr 28, 2023

Combine the powerful text-to-audio model Bark with real-time voice cloning, which can generate highly realistic audio in a custom voice uploaded by the users.

SSPProduction

Apr 28, 2023

Hi, we tried this but the processing on the cloning part seems to be way too fast, and although it changes the voice timbre and quality a bit, it sounds absolutely nothing like the reference voice. Is it only supposed to replicate voice tone and timbre, or accent and other aspects of the voice as well? If the latter, it doesn't seem to be working at all for us.

kevinwang676

Owner Apr 28, 2023

Hi, the voice cloning part requires you to upload longer audio (~90s) as the reference audio in order to impove the quality of the cloned speech. You can check out the demo of YourTTS here: https://huggingface.co/spaces/ramkamal2000/voice-conversion-yourtts. Thanks for reaching out!

yoinked

Apr 28, 2023

looks like a good idea; havent tested it too well, but since bark is a pain to deal when finetuning; could be a good project to give a gpu to!

kevinwang676

Owner Apr 28, 2023

Thanks for your comments! I've added an example of voice cloning to the space. Please check it out. It would be amazing if this space can be used to demo Bark with voice cloning and for people to try it.

yoinked

Apr 28, 2023

CPU inference not only for bark but for cloning takes very long (400+ seconds for a 3 second output)

kevinwang676

Owner Apr 28, 2023

Sorry for the inconvenience. That's also the reason why I'd like to apply for community grant😂However, you can always duplicate and use it with a GPU in your own space.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment