hcsolakoglu commited on
Commit
9c29a36
·
verified ·
1 Parent(s): 0cd16d3

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +100 -0
README.md ADDED
@@ -0,0 +1,100 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ language: tr
4
+ tags:
5
+ - text-to-speech
6
+ - turkish
7
+ - tts
8
+ - f5-tts
9
+ datasets:
10
+ - custom
11
+ pipeline_tag: text-to-speech
12
+ ---
13
+
14
+ # Orkhon-TTS
15
+
16
+ ## Model Details
17
+
18
+ * **Model Type:** Text-to-Speech (TTS)
19
+ * **Architecture:** F5 TTS
20
+ * **Language:** Turkish (tr)
21
+ * **Developed by:** Hasan Can Solakoğlu
22
+ * **Model Version:** v1.0 (Alpha)
23
+ * **License:** Apache License 2.0
24
+ * **Demo:** [Orkhon-TTS Hugging Face Space](https://huggingface.co/spaces/hcsolakoglu/Orkhon-TTS)
25
+
26
+ ## Model Description
27
+
28
+ Orkhon-TTS is a Turkish Text-to-Speech model based on the F5 TTS architecture. It has been trained by Hasan Can Solakoğlu. This model is currently in its alpha stage.
29
+
30
+ The primary goal of this model is to provide a high-quality Turkish TTS voice for researchers, companies, and students.
31
+
32
+ ### Voice Cloning Capabilities
33
+
34
+ This model possesses voice cloning capabilities. Users are expected to use these features responsibly and ethically.
35
+
36
+ ## Training Data
37
+
38
+ Model was trained on high-quality single-speaker Turkish speech data. All training data was meticulously prepared and curated by Hasan Can Solakoğlu.
39
+
40
+
41
+ ## Intended Uses & Limitations
42
+
43
+ ### Intended Uses
44
+
45
+ * Generating Turkish speech from text for various applications.
46
+ * Research in Turkish speech synthesis.
47
+ * Educational purposes for understanding TTS models.
48
+ * Prototyping voice-enabled applications for Turkish users.
49
+
50
+ ### Limitations and Bias
51
+
52
+ * **Alpha Stage:** The model is currently in an alpha stage, meaning it may produce artifacts or unnatural-sounding speech in some cases.
53
+ * **Pronunciation of Abbreviations and Numbers:** The current version may not optimally handle the pronunciation of abbreviations or the verbalization of numbers written in digit form. These are planned improvements for v2.
54
+ * **Single Speaker:** The current public model is based on a single speaker.
55
+
56
+ ### Out-of-Scope Uses
57
+
58
+ * Generating speech for illegal or unethical purposes.
59
+ * Impersonating individuals without their explicit consent.
60
+ * Creating hate speech or misleading content.
61
+
62
+ ## How to Get Started with the Model
63
+
64
+ Usage instructions and code examples will be provided in the repository associated with this model. For a live demo, please visit the [Hugging Face Space](https://huggingface.co/spaces/hcsolakoglu/Orkhon-TTS).
65
+
66
+ ## Future Plans (v2)
67
+
68
+ * Improved handling of abbreviations.
69
+ * Enhanced pronunciation of numbers (reading numerical digits).
70
+ * Training on a larger dataset.
71
+ * Longer training duration for potentially higher quality.
72
+
73
+ ## Author Contact
74
+
75
+ For questions or feedback about the model, please contact Hasan Can Solakoğlu via [Twitter/X](https://x.com/HCSolakoglu).
76
+
77
+ ## Citation
78
+
79
+ If you use this model in your research or project, please consider citing it (details to be provided upon official release or publication).
80
+
81
+ ```bibtex
82
+ @misc{orkhon_tts_hcsolakoglu_2025,
83
+ author = {Solakoğlu, Hasan Can},
84
+ title = {Orkhon-TTS: A Turkish Text-to-Speech Model},
85
+ year = {2025},
86
+ publisher = {Hugging Face},
87
+ journal = {Hugging Face Model Hub},
88
+ howpublished = {\url{https://huggingface.co/hcsolakoglu/Orkhon-TTS}}
89
+ }
90
+ ```
91
+
92
+ ## Disclaimer
93
+
94
+ This model is provided "as-is" without any warranty, express or implied. The developers and contributors are not responsible for any damages or losses arising from the use of this model.
95
+
96
+ **Responsible Use of Voice Cloning:** The voice cloning capabilities of this model must be used responsibly and ethically. Users are solely responsible for the content they generate and any consequences arising from its use. The model creator (Hasan Can Solakoğlu) cannot be held liable for any misuse of the model or its outputs. Do not use this model to impersonate individuals without their explicit consent or for any malicious purposes.
97
+
98
+ ## License
99
+
100
+ This model is licensed under the Apache License, Version 2.0. See the [LICENSE](LICENSE) file for more details.