├── wavs ├── 5K_jeong_5.wav ├── 5K_kang_5.wav ├── 5K_yoon_5.wav ├── D12-250_4.wav ├── D12-250_5.wav ├── D12-500_4.wav ├── D12-500_5.wav ├── D13-250_4.wav ├── D13-250_5.wav ├── D13-500_4.wav ├── D13-500_5.wav ├── D14-250_4.wav ├── D14-250_5.wav ├── D14-500_4.wav ├── D14-500_5.wav ├── judy-250_2.wav ├── judy-250_3.wav ├── judy-250_5.wav ├── judy-500_2.wav ├── judy-500_3.wav ├── judy-500_5.wav ├── judy_5K_3.wav ├── kang_250_4.wav ├── kang_250_5.wav ├── kang_500_4.wav ├── kang_500_5.wav ├── mary-250_2.wav ├── mary-250_3.wav ├── mary-250_5.wav ├── mary-500_2.wav ├── mary-500_3.wav ├── mary-500_5.wav ├── mary_5K_3.wav ├── yoon_250_4.wav ├── yoon_250_5.wav ├── yoon_500_4.wav ├── yoon_500_5.wav ├── D12-500-5k_4.wav ├── D12-500-5k_5.wav ├── D13-500-5k_4.wav ├── D13-500-5k_5.wav ├── D14-500-5k_4.wav ├── D14-500-5k_5.wav ├── jeong_250_4.wav ├── jeong_250_5.wav ├── jeong_500_4.wav ├── jeong_500_5.wav ├── miller-250_2.wav ├── miller-250_3.wav ├── miller-250_5.wav ├── miller-500_2.wav ├── miller-500_3.wav ├── miller-500_5.wav └── miller_5K_3.wav ├── styles.css └── index.html /wavs/5K_jeong_5.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/DSAIL-SKKU/Transfer-Learning/HEAD/wavs/5K_jeong_5.wav -------------------------------------------------------------------------------- /wavs/5K_kang_5.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/DSAIL-SKKU/Transfer-Learning/HEAD/wavs/5K_kang_5.wav -------------------------------------------------------------------------------- /wavs/5K_yoon_5.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/DSAIL-SKKU/Transfer-Learning/HEAD/wavs/5K_yoon_5.wav -------------------------------------------------------------------------------- /wavs/D12-250_4.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/DSAIL-SKKU/Transfer-Learning/HEAD/wavs/D12-250_4.wav -------------------------------------------------------------------------------- /wavs/D12-250_5.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/DSAIL-SKKU/Transfer-Learning/HEAD/wavs/D12-250_5.wav -------------------------------------------------------------------------------- /wavs/D12-500_4.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/DSAIL-SKKU/Transfer-Learning/HEAD/wavs/D12-500_4.wav -------------------------------------------------------------------------------- /wavs/D12-500_5.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/DSAIL-SKKU/Transfer-Learning/HEAD/wavs/D12-500_5.wav -------------------------------------------------------------------------------- /wavs/D13-250_4.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/DSAIL-SKKU/Transfer-Learning/HEAD/wavs/D13-250_4.wav -------------------------------------------------------------------------------- /wavs/D13-250_5.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/DSAIL-SKKU/Transfer-Learning/HEAD/wavs/D13-250_5.wav -------------------------------------------------------------------------------- /wavs/D13-500_4.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/DSAIL-SKKU/Transfer-Learning/HEAD/wavs/D13-500_4.wav -------------------------------------------------------------------------------- /wavs/D13-500_5.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/DSAIL-SKKU/Transfer-Learning/HEAD/wavs/D13-500_5.wav -------------------------------------------------------------------------------- /wavs/D14-250_4.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/DSAIL-SKKU/Transfer-Learning/HEAD/wavs/D14-250_4.wav -------------------------------------------------------------------------------- /wavs/D14-250_5.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/DSAIL-SKKU/Transfer-Learning/HEAD/wavs/D14-250_5.wav -------------------------------------------------------------------------------- /wavs/D14-500_4.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/DSAIL-SKKU/Transfer-Learning/HEAD/wavs/D14-500_4.wav -------------------------------------------------------------------------------- /wavs/D14-500_5.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/DSAIL-SKKU/Transfer-Learning/HEAD/wavs/D14-500_5.wav -------------------------------------------------------------------------------- /wavs/judy-250_2.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/DSAIL-SKKU/Transfer-Learning/HEAD/wavs/judy-250_2.wav -------------------------------------------------------------------------------- /wavs/judy-250_3.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/DSAIL-SKKU/Transfer-Learning/HEAD/wavs/judy-250_3.wav -------------------------------------------------------------------------------- /wavs/judy-250_5.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/DSAIL-SKKU/Transfer-Learning/HEAD/wavs/judy-250_5.wav -------------------------------------------------------------------------------- /wavs/judy-500_2.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/DSAIL-SKKU/Transfer-Learning/HEAD/wavs/judy-500_2.wav -------------------------------------------------------------------------------- /wavs/judy-500_3.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/DSAIL-SKKU/Transfer-Learning/HEAD/wavs/judy-500_3.wav -------------------------------------------------------------------------------- /wavs/judy-500_5.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/DSAIL-SKKU/Transfer-Learning/HEAD/wavs/judy-500_5.wav -------------------------------------------------------------------------------- /wavs/judy_5K_3.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/DSAIL-SKKU/Transfer-Learning/HEAD/wavs/judy_5K_3.wav -------------------------------------------------------------------------------- /wavs/kang_250_4.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/DSAIL-SKKU/Transfer-Learning/HEAD/wavs/kang_250_4.wav -------------------------------------------------------------------------------- /wavs/kang_250_5.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/DSAIL-SKKU/Transfer-Learning/HEAD/wavs/kang_250_5.wav -------------------------------------------------------------------------------- /wavs/kang_500_4.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/DSAIL-SKKU/Transfer-Learning/HEAD/wavs/kang_500_4.wav -------------------------------------------------------------------------------- /wavs/kang_500_5.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/DSAIL-SKKU/Transfer-Learning/HEAD/wavs/kang_500_5.wav -------------------------------------------------------------------------------- /wavs/mary-250_2.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/DSAIL-SKKU/Transfer-Learning/HEAD/wavs/mary-250_2.wav -------------------------------------------------------------------------------- /wavs/mary-250_3.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/DSAIL-SKKU/Transfer-Learning/HEAD/wavs/mary-250_3.wav -------------------------------------------------------------------------------- /wavs/mary-250_5.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/DSAIL-SKKU/Transfer-Learning/HEAD/wavs/mary-250_5.wav -------------------------------------------------------------------------------- /wavs/mary-500_2.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/DSAIL-SKKU/Transfer-Learning/HEAD/wavs/mary-500_2.wav -------------------------------------------------------------------------------- /wavs/mary-500_3.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/DSAIL-SKKU/Transfer-Learning/HEAD/wavs/mary-500_3.wav -------------------------------------------------------------------------------- /wavs/mary-500_5.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/DSAIL-SKKU/Transfer-Learning/HEAD/wavs/mary-500_5.wav -------------------------------------------------------------------------------- /wavs/mary_5K_3.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/DSAIL-SKKU/Transfer-Learning/HEAD/wavs/mary_5K_3.wav -------------------------------------------------------------------------------- /wavs/yoon_250_4.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/DSAIL-SKKU/Transfer-Learning/HEAD/wavs/yoon_250_4.wav -------------------------------------------------------------------------------- /wavs/yoon_250_5.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/DSAIL-SKKU/Transfer-Learning/HEAD/wavs/yoon_250_5.wav -------------------------------------------------------------------------------- /wavs/yoon_500_4.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/DSAIL-SKKU/Transfer-Learning/HEAD/wavs/yoon_500_4.wav -------------------------------------------------------------------------------- /wavs/yoon_500_5.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/DSAIL-SKKU/Transfer-Learning/HEAD/wavs/yoon_500_5.wav -------------------------------------------------------------------------------- /wavs/D12-500-5k_4.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/DSAIL-SKKU/Transfer-Learning/HEAD/wavs/D12-500-5k_4.wav -------------------------------------------------------------------------------- /wavs/D12-500-5k_5.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/DSAIL-SKKU/Transfer-Learning/HEAD/wavs/D12-500-5k_5.wav -------------------------------------------------------------------------------- /wavs/D13-500-5k_4.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/DSAIL-SKKU/Transfer-Learning/HEAD/wavs/D13-500-5k_4.wav -------------------------------------------------------------------------------- /wavs/D13-500-5k_5.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/DSAIL-SKKU/Transfer-Learning/HEAD/wavs/D13-500-5k_5.wav -------------------------------------------------------------------------------- /wavs/D14-500-5k_4.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/DSAIL-SKKU/Transfer-Learning/HEAD/wavs/D14-500-5k_4.wav -------------------------------------------------------------------------------- /wavs/D14-500-5k_5.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/DSAIL-SKKU/Transfer-Learning/HEAD/wavs/D14-500-5k_5.wav -------------------------------------------------------------------------------- /wavs/jeong_250_4.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/DSAIL-SKKU/Transfer-Learning/HEAD/wavs/jeong_250_4.wav -------------------------------------------------------------------------------- /wavs/jeong_250_5.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/DSAIL-SKKU/Transfer-Learning/HEAD/wavs/jeong_250_5.wav -------------------------------------------------------------------------------- /wavs/jeong_500_4.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/DSAIL-SKKU/Transfer-Learning/HEAD/wavs/jeong_500_4.wav -------------------------------------------------------------------------------- /wavs/jeong_500_5.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/DSAIL-SKKU/Transfer-Learning/HEAD/wavs/jeong_500_5.wav -------------------------------------------------------------------------------- /wavs/miller-250_2.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/DSAIL-SKKU/Transfer-Learning/HEAD/wavs/miller-250_2.wav -------------------------------------------------------------------------------- /wavs/miller-250_3.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/DSAIL-SKKU/Transfer-Learning/HEAD/wavs/miller-250_3.wav -------------------------------------------------------------------------------- /wavs/miller-250_5.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/DSAIL-SKKU/Transfer-Learning/HEAD/wavs/miller-250_5.wav -------------------------------------------------------------------------------- /wavs/miller-500_2.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/DSAIL-SKKU/Transfer-Learning/HEAD/wavs/miller-500_2.wav -------------------------------------------------------------------------------- /wavs/miller-500_3.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/DSAIL-SKKU/Transfer-Learning/HEAD/wavs/miller-500_3.wav -------------------------------------------------------------------------------- /wavs/miller-500_5.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/DSAIL-SKKU/Transfer-Learning/HEAD/wavs/miller-500_5.wav -------------------------------------------------------------------------------- /wavs/miller_5K_3.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/DSAIL-SKKU/Transfer-Learning/HEAD/wavs/miller_5K_3.wav -------------------------------------------------------------------------------- /styles.css: -------------------------------------------------------------------------------- 1 | .heading { 2 | margin-top: 40px; 3 | font-weight: 900; 4 | } 5 | span { 6 | font-weight: 900; 7 | } 8 | .audio-list { 9 | display: flex; 10 | /* justify-content: space-between; */ 11 | } 12 | .audio-element { 13 | margin-right: 10px; 14 | } 15 | h3 { 16 | margin-top: 40px; 17 | } 18 | .footer { 19 | margin-top: 40px; 20 | } 21 | -------------------------------------------------------------------------------- /index.html: -------------------------------------------------------------------------------- 1 | 2 | 3 |
4 | 5 | 6 | 7 |Authors: Jeewoo Yoon, Seong Choi, Taihu Li, and Jinyoung Han*
12 |Abstract: To synthesize natural and intelligible speech with a small amount of data, transfer learning with well-maintained and pre-trained data has been known to be useful. However, little attention has been paid to answer the following research questions with empirically-grounded evidence, ``How much pre-trained (source) speech data (e.g., 10 K utterances or 10 hours) used in transfer learning is enough for generating natural and intelligible speech?'' and ``For generating natural and intelligible speech, how much (target) speech data should at least be provided?'', which are essential for the quality of speech synthesis. To answer these questions, this paper conducts extensive experiments on speech synthesis with multiple source and target data with different lengths, speakers, and languages. We show that intelligible and natural speech can be synthesized with only 500 utterances of target data using transfer learning. Our work also reveals that at least 5000 utterances of source pre-trained data are required to synthesize decent speech.
13 |The model was pre-trained with 10K utterances and fine-tuned with 250 and 500 utterances
16 |E-SPK1-250
20 | 23 |E-SPK2-250
26 | 29 |E-SPK3-250
32 | 35 |E-SPK1-500
40 | 43 |E-SPK2-500
46 | 49 |E-SPK3-500
52 | 55 |E-SPK1-250
63 | 66 |E-SPK2-250
69 | 72 |E-SPK3-250
75 | 78 |E-SPK1-500
83 | 86 |E-SPK2-500
89 | 92 |E-SPK3-500
95 | 98 |K-SPK1-250
105 | 108 |K-SPK2-250
111 | 114 |K-SPK3-250
117 | 120 |K-SPK1-500
125 | 128 |K-SPK2-500
131 | 134 |K-SPK3-500
137 | 140 |K-SPK1-250
147 | 150 |K-SPK2-250
153 | 156 |K-SPK3-250
159 | 162 |K-SPK1-500
167 | 170 |K-SPK2-500
173 | 176 |K-SPK3-500
179 | 182 |C-SPK1-250
189 | 192 |C-SPK2-250
195 | 198 |C-SPK3-250
201 | 204 |C-SPK1-500
209 | 212 |C-SPK2-500
215 | 218 |C-SPK3-500
221 | 224 |C-SPK1-250
231 | 234 |C-SPK2-250
237 | 240 |C-SPK3-250
243 | 246 |C-SPK1-500
251 | 254 |C-SPK2-500
257 | 260 |C-SPK3-500
263 | 266 |The model was pre-trained with 5K utterances and fine-tuned with 500 utterances
274 |E-SPK1-500
278 | 281 |E-SPK2-500
284 | 287 |E-SPK3-500
290 | 293 |K-SPK1-500
300 | 303 |K-SPK2-500
306 | 309 |K-SPK3-500
312 | 315 |C-SPK1-500
322 | 325 |C-SPK2-500
328 | 331 |C-SPK3-500
334 | 337 |