InigoHierroMuga
committed on
Commit
·
b513a2e
1
Parent(s):
0306645
sabela
README.md
CHANGED
|
@@ -1,3 +1,78 @@
|
|
| 1 |
-
|
| 2 |
-
|
| 3 |
-
|
| 1 |
+
# Aholab TTS Synthesis models - Sabela [gl]
|
| 2 |
+
## Description
|
| 3 |
+
This repository contains the Sabela TTS model for Galician.
|
| 4 |
+
This model is part of a collection of text-to-speech (TTS) models in Basque (eu), Galician (gl), Catalan (ca) and Spanish (es). All voices in this collection are based on the VITS architecture proposed by [Kim et al. (2021)](https://arxiv.org/abs/2106.06103).
|
| 5 |
+
|
| 6 |
+
* Basque [eu]:
|
| 7 |
+
- antton
|
| 8 |
+
- maider
|
| 9 |
+
* Galician [gl]:
|
| 10 |
+
- brais
|
| 11 |
+
- celtia
|
| 12 |
+
- iago
|
| 13 |
+
- icia
|
| 14 |
+
- paulo
|
| 15 |
+
- sabela
|
| 16 |
+
* Catalan [ca]:
|
| 17 |
+
- pau
|
| 18 |
+
- ona
|
| 19 |
+
- bet
|
| 20 |
+
- eli
|
| 21 |
+
- eva
|
| 22 |
+
- jan
|
| 23 |
+
- mar
|
| 24 |
+
- pep
|
| 25 |
+
- pol
|
| 26 |
+
* Spanish [es]:
|
| 27 |
+
- laura
|
| 28 |
+
- alejandro
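
The voice list above can also be expressed as a small lookup table, which helps when choosing the `-l` and `-m` values for a given voice. This is a minimal sketch: the `VOICES` mapping and the `language_of` helper are hypothetical names introduced here for illustration and are not part of the repository.

```python
# Voice names copied from the list above, keyed by language code.
# VOICES and language_of are illustrative names, not part of the repository.
VOICES = {
    "eu": ["antton", "maider"],
    "gl": ["brais", "celtia", "iago", "icia", "paulo", "sabela"],
    "ca": ["pau", "ona", "bet", "eli", "eva", "jan", "mar", "pep", "pol"],
    "es": ["laura", "alejandro"],
}


def language_of(voice: str) -> str:
    """Return the language code that a voice name belongs to."""
    name = voice.lower()
    for lang, names in VOICES.items():
        if name in names:
            return lang
    raise ValueError(f"unknown voice: {voice!r}")
```

For example, `language_of("sabela")` looks up the Galician entry, so the matching language flag would be `-l gl`.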
|
| 29 |
+
|
| 30 |
+
## Uses
|
| 31 |
+
These models are intended for speech synthesis in Basque, Galician, Catalan and Spanish.
|
| 32 |
+
## How to use
|
| 33 |
+
### Python
|
| 34 |
+
Use the synthesize.py script to generate speech. All available models are listed in the Description section above. Before running the script, navigate to the repository directory:
|
| 35 |
+
```cd Ahotts```
|
| 36 |
+
|
| 37 |
+
For help:
|
| 38 |
+
```python3 synthesize.py -h```
|
| 39 |
+
Example commands:
|
| 40 |
+
```bash
|
| 41 |
+
python3 synthesize.py -t "Antton naiz, zer moduz zaude." -l eu -m antton -o audio_name
|
| 42 |
+
python3 synthesize.py -t "Soy Laura, qué tal estás?" -l es -m laura -o audio_name
|
| 43 |
+
python3 synthesize.py -t "Sóc Ona, com estàs." -l ca -m ona -o audio_name
|
| 44 |
+
python3 synthesize.py -t "Son Brais, como estás." -l gl -m brais -o audio_name
|
| 45 |
+
```
|
| 46 |
+
|
| 47 |
+
The synthesized audio is saved as a .wav file inside the **output/** directory.
|
| 48 |
+
Use ```--output``` / ```-o``` to specify the filename.
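
The same commands can be driven from Python, for instance to synthesize several sentences in a loop. The sketch below only uses the `-t`, `-l`, `-m` and `-o` flags shown in the examples above. The function names, the default `repo_dir` value, and the returned path (which assumes the script appends `.wav` to the given name inside `output/` relative to the repository directory) are assumptions made for illustration, so check them against `python3 synthesize.py -h`.

```python
import subprocess
from pathlib import Path


def build_command(text: str, lang: str, model: str, output: str) -> list[str]:
    """Build the argument list for synthesize.py using the flags from the README."""
    return ["python3", "synthesize.py", "-t", text, "-l", lang, "-m", model, "-o", output]


def synthesize(text: str, lang: str, model: str, output: str, repo_dir: str = "Ahotts") -> Path:
    """Run synthesize.py from the repository directory and return the expected file.

    The returned path is an assumption: it supposes the script writes
    output/<output>.wav relative to repo_dir.
    """
    # check=True raises CalledProcessError if the script exits with an error.
    subprocess.run(build_command(text, lang, model, output), cwd=repo_dir, check=True)
    return Path(repo_dir) / "output" / f"{output}.wav"
```

A call such as `synthesize("Son Sabela, como estás.", "gl", "sabela", "audio_name")` would then be the Python counterpart of the Galician example command, with the voice swapped for the one in this repository.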
|
| 49 |
+
|
| 50 |
+
## Additional information
|
| 51 |
+
### Voice Resource Licenses and references
|
| 52 |
+
* Galician
|
| 53 |
+
- Celtia
|
| 54 |
+
Public Creative Commons Attribution 4.0 International License
|
| 55 |
+
[Vázquez Abuín, M., García Díaz, N., Vladu, A. I., Magariños, C., Vidal Miguéns, A., & Fernández Rei, E. (2023). Nos_Celtia-GL: Galician TTS corpus (1.0.0.) [Data set]. Zenodo.](https://doi.org/10.5281/zenodo.7716958)
|
| 56 |
+
- Brais
|
| 57 |
+
Public Creative Commons Attribution 4.0 International License
|
| 58 |
+
[Vladu, A. I., García Díaz, N., Regueira Fernández, X. L., Magariños, C., Moscoso Sánchez, A., Fernández López, D., Fernández Rei, E., & Dubert-García, F. (2025). Nos_Brais-GL: Galician TTS corpus [Data set]. Zenodo](https://doi.org/10.5281/zenodo.14265241)
|
| 59 |
+
- Sabela/Icia/Iago/Paulo
|
| 60 |
+
Public Creative Commons Attribution 4.0 International License
|
| 61 |
+
[Centro Ramón Piñeiro para a Investigación en Humanidades (CRPIH), & Multimedia Technology Group (GTM) – atlanTTic Research Center for Telecommunication Technologies. (2023). CRPIH_UVigo-GL-Voices: Galician TTS dataset (1.0.0.) [Data set]. Zenodo.](https://doi.org/10.5281/zenodo.8027725)
|
| 62 |
+
* Catalan
|
| 63 |
+
- Creative Commons Attribution-ShareAlike 4.0 International Public License [festcat_trimmed_denoised](https://huggingface.co/datasets/projecte-aina/festcat_trimmed_denoised)
|
| 64 |
+
* Basque
|
| 65 |
+
- Maider, Antton: developed by HiTZ with funding from Project ILENIA. Public Creative Commons Attribution 4.0
|
| 66 |
+
* Spanish
|
| 67 |
+
- Alejandro: Developed at HiTZ from the [OpenSLR dataset](https://openslr.org/39/).
|
| 68 |
+
- Laura: Acquired from [ELRA ID: ELRA-S0309](https://catalog.elra.info/en-us/repository/browse/ELRA-S0309/)
|
| 69 |
+
### Authors
|
| 70 |
+
HiTZ Basque Center for Language Technology - Aholab Signal Processing Laboratory, University of the Basque Country EHU.
|
| 71 |
+
### Contact information
|
| 72 |
+
Ibon Saratxaga: ibon.saratxaga@ehu.eus
|
| 73 |
+
### Licensing Information
|
| 74 |
+
[Apache License, Version 2.0](https://www.apache.org/licenses/LICENSE-2.0)
|
| 75 |
+
### Funding
|
| 76 |
+
The Catalan and Galician models have been funded by the projects with reference numbers 2022/TL22/00215337, 2022/TL22/00215336, 2022/TL22/00215335, and 2022/TL22/00215334, funded by the Ministry of Digital Transformation and by the Recovery, Transformation and Resilience Plan – Funded by the European Union – NextGenerationEU.
|
| 77 |
+
### Citation information
|
| 78 |
+
Hernaez, I., Navas, E., Murugarren, J.L., Etxebarria, B. (2001) Description of the AhoTTS system for the Basque language. Proc. 4th ISCA ITRW on Speech Synthesis (SSW 4), paper 202
|
vits.onnx
ADDED
|
@@ -0,0 +1,3 @@
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:b7df43263c2eefffa79b841431d542957c8c6478b41448be679ef36036087f22
|
| 3 |
+
size 60333923
|
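
The three lines added for vits.onnx form a Git LFS pointer: the spec version, the SHA-256 object id, and the size in bytes of the real model file. After downloading the model, the pointer can be used to check that the file arrived intact. The following is a minimal sketch; `parse_pointer` and `matches_pointer` are hypothetical helper names, and the local path passed to `matches_pointer` is up to the reader.

```python
import hashlib
from pathlib import Path

# Pointer contents copied from the diff above.
POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:b7df43263c2eefffa79b841431d542957c8c6478b41448be679ef36036087f22
size 60333923
"""


def parse_pointer(text: str) -> dict:
    """Split a Git LFS pointer into version, hash algorithm, digest and size."""
    fields = dict(line.split(" ", 1) for line in text.splitlines() if line.strip())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path: str, pointer: dict) -> bool:
    """Return True if the file at path has the size and hash the pointer declares."""
    file_path = Path(path)
    if file_path.stat().st_size != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with file_path.open("rb") as handle:
        # Read in 1 MiB chunks so a large model file is never fully in memory.
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]
```

With the model downloaded locally, `matches_pointer("vits.onnx", parse_pointer(POINTER))` would report whether the file matches the committed pointer.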