InigoHierroMuga commited on
Commit
b513a2e
·
1 Parent(s): 0306645
Files changed (2) hide show
  1. README.md +78 -3
  2. vits.onnx +3 -0
README.md CHANGED
@@ -1,3 +1,78 @@
1
- ---
2
- license: apache-2.0
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # Aholab TTS Synthesis models - Sabela [gl]
2
+ ## Description
3
+ This repository contains Sabela TTS model in Galician.
4
+ This model is part of a collection of text-to-speech (TTS) models in Basque (eu), Galician (gl), Catalan (ca) and Spanish (es). All voices in this collections are based on the VITS architecture proposed by [Kim et al. (2021)](https://arxiv.org/abs/2106.06103).
5
+
6
+ * Basque [eu]:
7
+ - antton
8
+ - maider
9
+ * Galician [gl]:
10
+ - brais
11
+ - celtia
12
+ - iago
13
+ - icia
14
+ - paulo
15
+ - sabela
16
+ * Catalan [ca]:
17
+ - pau
18
+ - ona
19
+ - bet
20
+ - eli
21
+ - eva
22
+ - jan
23
+ - mar
24
+ - pep
25
+ - pol
26
+ * Spanish [es]:
27
+ - laura
28
+ - alejandro
29
+
30
+ ## Uses
31
+ This models are intented to be used for speech synthesis in Basque, Galician, Catalan and Spanish.
32
+ ## How to use
33
+ ### Python
34
+ Use the synthesize.py script to generate speech. All avalaible models are listed in the sections above. Before running the script, navigate to the repository directory:
35
+ ```cd Ahotts```
36
+
37
+ For help:
38
+ ```python3 synthesize.py -h```
39
+ Example commands:
40
+ ```bash
41
+ python3 synthesize.py -t "Antton naiz, zer moduz zaude." -l eu -m antton -o audio_name
42
+ python3 synthesize.py -t "Soy Laura, qué tal estás?" -l es -m laura -o audio_name
43
+ python3 synthesize.py -t "Sóc Ona, com estàs." -l ca -m ona -o audio_name
44
+ python3 synthesize.py -t "Son Brais, como estás." -l gl -m brais -o audio_name
45
+ ```
46
+
47
+ The synthesized audio is saved as a .wav file inside the **output/** directory.
48
+ Use ```--output``` / ```-o``` to specify the filename.
49
+
50
+ ## Additional information
51
+ ### Voice Resource Licenses and references
52
+ * Galician
53
+ - Celtia
54
+ Public Creative Commond Attribution 4.0 International License
55
+ [Vázquez Abuín, M., García Díaz, N., Vladu, A. I., Magariños, C., Vidal Miguéns, A., & Fernández Rei, E. (2023). Nos_Celtia-GL: Galician TTS corpus (1.0.0.) [Data set]. Zenodo.](https://doi.org/10.5281/zenodo.7716958)
56
+ - Brais
57
+ Public Creative Commond Attribution 4.0 International License
58
+ [Vladu, A. I., García Díaz, N., Regueira Fernández, X. L., Magariños, C., Moscoso Sánchez, A., Fernández López, D., Fernández Rei, E., & Dubert-García, F. (2025). Nos_Brais-GL: Galician TTS corpus [Data set]. Zenodo](https://doi.org/10.5281/zenodo.14265241)
59
+ - Sabela/Icia/Iago/Paulo
60
+ Public Creative Commond Attribution 4.0 International License
61
+ [Centro Ramón Piñeiro para a Investigación en Humanidades (CRPIH), & Multimedia Technology Group (GTM) – atlanTTic Research Center for Telecommunication Technologies. (2023). CRPIH_UVigo-GL-Voices: Galician TTS dataset (1.0.0.) [Data set]. Zenodo.](https://doi.org/10.5281/zenodo.8027725)
62
+ * Catalan
63
+ - Creative Commons Attribution-ShareAlike 4.0 International Public License [festcat_trimmed_denoised](https://huggingface.co/datasets/projecte-aina/festcat_trimmed_denoised)
64
+ * Basque
65
+ - Maider, Antton: developed by HiTZ with funding from Project ILENIA. Public Creative Commond Attribution 4.0
66
+ * Spanish
67
+ - Alejandro: Developed in HiTZ from [openSLR dataset.](https://openslr.org/39/)
68
+ - Laura: Acquired in [ELRA ID: ELRA-S0309](https://catalog.elra.info/en-us/repository/browse/ELRA-S0309/)
69
+ ### Authors
70
+ HiTZ Basque Center for Language Technology - Aholab Signal Processing Laboratory, University of the Basque Country EHU.
71
+ ### Contact information
72
+ Ibon Saratxaga: ibon.saratxaga@ehu.eus
73
+ ### Licensing Information
74
+ [Apache License, Version 2.0](https://www.apache.org/licenses/LICENSE-2.0)
75
+ ### Funding
76
+ Catalan and Galician have been funded by the project with reference numbers 2022/TL22/00215337, 2022/TL22/00215336, 2022/TL22/00215335, and 2022/TL22/00215334 is funded by the Ministry of Digital Transformation and by the Recovery, Transformation and Resilience Plan – Funded by the European Union – NextGenerationEU.
77
+ ### Citation information
78
+ Hernaez, I., Navas, E., Murugarren, J.L., Etxebarria, B. (2001) Description of the AhoTTS system for the Basque language. Proc. 4th ISCA ITRW on Speech Synthesis (SSW 4), paper 202
vits.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b7df43263c2eefffa79b841431d542957c8c6478b41448be679ef36036087f22
3
+ size 60333923