---
language:
- ja
license: apache-2.0
model_type: transformer
tags:
- text-to-speech
---

# Kotoba-Speech-v0.1

Kotoba-Speech v0.1 is a 1.2B-parameter Transformer-based generative speech model. It supports:
1. Fluent text-to-speech generation in Japanese
2. One-shot voice cloning from a speech prompt

![logo](./logo.webp)


## Usage
Please check out our HF Spaces [demo](https://huggingface.co/spaces/kotoba-tech/Kotoba-Speech?logs=build).


## Model Details

* **Model type**: End-to-end Transformer.
* **Language(s)**: Japanese 
* **Library**: We'll release our training code soon. The inference and model code are largely adapted from [metavoice](https://github.com/metavoiceio/metavoice-src).


## Acknowledgements
- We thank MetaVoice for open-sourcing their code.

## License
Apache License Version 2.0, January 2004