├── AUDIO_EXAMPLES.MD
├── LICENSE
└── README.md
/AUDIO_EXAMPLES.MD:
--------------------------------------------------------------------------------
1 | Results and Comparisons
2 |
3 | Adding more examples soon...
4 | Prompt: we are testing this model for our project.
5 |
6 | Tortoise-tts cloning an Indian accent
7 |
8 | https://github.com/ahmedHanzala/urdu-voice-cloning/assets/105395393/fae434cb-df10-4b58-8b7d-6e4c50115e32
9 |
10 | Our fine-tuned model
11 |
12 | https://github.com/ahmedHanzala/urdu-voice-cloning/assets/105395393/05ca7d27-87fd-4001-b62e-26ee71a76d5b
13 |
14 |
15 | Urdu script and Urdu text-to-speech testing
16 |
17 | Prompt: seecs ایک بہت اچھا ڈیپارٹمنٹ ہے
18 |
19 | On Tortoise-tts base model
20 |
21 | https://github.com/ahmedHanzala/urdu-voice-cloning/assets/105395393/31dcefce-fc8d-436e-8c16-11d60de140b7
22 |
23 | On our fine-tuned model
24 |
25 | https://github.com/ahmedHanzala/urdu-voice-cloning/assets/105395393/5394a4b4-d685-4e87-a254-7ea9436c3545
26 |
27 |
28 |
--------------------------------------------------------------------------------
/LICENSE:
--------------------------------------------------------------------------------
1 | MIT License
2 |
3 | Copyright (c) 2023 Ahmed Hanzala
4 |
5 | Permission is hereby granted, free of charge, to any person obtaining a copy
6 | of this software and associated documentation files (the "Software"), to deal
7 | in the Software without restriction, including without limitation the rights
8 | to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
9 | copies of the Software, and to permit persons to whom the Software is
10 | furnished to do so, subject to the following conditions:
11 |
12 | The above copyright notice and this permission notice shall be included in all
13 | copies or substantial portions of the Software.
14 |
15 | THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
16 | IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
17 | FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
18 | AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
19 | LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
20 | OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
21 | SOFTWARE.
22 |
--------------------------------------------------------------------------------
/README.md:
--------------------------------------------------------------------------------
1 | Generative Urdu Speech Synthesis
2 |
3 |
4 |
5 |
6 |
7 |
8 |
9 |
10 | Announcement
11 | The weights are open-sourced here: https://huggingface.co/zohann/urdu-tts
12 |
13 |
14 |
15 | Features
16 |
17 | - South-Asian Accent Voice Cloning: tortoise-tts captures European and American accents well, but when it clones a non-native speaker's voice it Americanizes the voice and fails to preserve the accent. This model has been fine-tuned on a South Asian accent dataset to better capture and reproduce Indian accents, so it clones voices across a variety of Indian English accents accurately.
18 |   It also offers improved voice quality and natural-sounding speech synthesis for a more authentic result.
19 | - Urdu Text-to-Speech: The model also supports Urdu text-to-speech. It reads text written in the Arabic-based Urdu script and produces Urdu speech from it.
20 |
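Because the Urdu text-to-speech feature depends on the prompt being written in Urdu script, it can help to check a prompt before synthesis. The sketch below is not part of this repository or of tortoise-tts. It is a standard-library illustration, and the helper names `script_profile` and `contains_urdu_script` are hypothetical. It assumes that counting letters whose Unicode names begin with `ARABIC` is a good enough proxy for Urdu script, which also matches Arabic and Persian text.

```python
import unicodedata


def script_profile(text: str) -> dict:
    """Count the letters in `text` by script family (hypothetical helper).

    The script is inferred from the Unicode character name, so letters
    used in Urdu (for example U+06CC or U+06D2) count as "arabic".
    """
    counts = {"arabic": 0, "latin": 0, "other": 0}
    for ch in text:
        if not ch.isalpha():
            continue  # skip spaces, digits and punctuation
        name = unicodedata.name(ch, "")
        if name.startswith("ARABIC"):
            counts["arabic"] += 1
        elif name.startswith("LATIN"):
            counts["latin"] += 1
        else:
            counts["other"] += 1
    return counts


def contains_urdu_script(text: str) -> bool:
    """Return True if the prompt has at least one Arabic-script letter."""
    return script_profile(text)["arabic"] > 0


if __name__ == "__main__":
    # The two prompts used in AUDIO_EXAMPLES.MD.
    for prompt in (
        "we are testing this model for our project.",
        "seecs ایک بہت اچھا ڈیپارٹمنٹ ہے",
    ):
        print(contains_urdu_script(prompt), script_profile(prompt))
```

The second prompt mixes a Latin-script word with Urdu text, which is why the sketch reports per-script counts rather than a single label.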
21 | Results
22 | Results and audio samples are available here: https://ahmedhanzala.github.io/urdu-tts/
23 |
24 |
25 | References
26 |
27 | - https://github.com/152334H/DL-Art-School
28 | - https://github.com/neonbjb/tortoise-tts
29 |
30 | License
31 | This project is licensed under the MIT License. Feel free to use and modify the code according to your needs.
32 |