File size: 854 Bytes
685622d cad914d 685622d cad914d c8f3ce3 685622d cad914d 493e78f cad914d c8f3ce3 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 |
---
language:
- ja
license: apache-2.0
model_type: transformer
tags:
- text-to-speech
---
# Kotoba-Speech-v0.1
Kotoba-Speech v0.1 is a 1.2B Transformer-based speech generative model. It supports the following properties:
1. Fluent text-to-speech generation in Japanese
2. One-shot voice cloning through speech prompt
![logo](./logo.webp)
## Usage
Plesae check out our HF Spaces [demo](https://huggingface.co/spaces/kotoba-tech/Kotoba-Speech?logs=build).
## Model Details
* **Model type**: Our model is end-to-end transformers.
* **Language(s)**: Japanese
* **Library**: We'll releasde our training code soon. Inference and model code are largely adopted from [metavoice](https://github.com/metavoiceio/metavoice-src).
## Acknowledgements
- We thank meta-voice for opensourcing their code.
## License
Apache License Version 2.0, January 2004 |