├── README.md ├── _config.yml ├── images ├── DurIAN_4S.png ├── DurIAN_SC.png └── logo.png ├── index.md └── wavs ├── 025_000009.wav ├── 025_r_1.wav ├── 025_r_2.wav ├── 025_r_3.wav ├── 025_r_4.wav ├── 025_r_5.wav ├── daj_000023.wav ├── daj_r_1.wav ├── daj_r_2.wav ├── daj_r_3.wav ├── daj_r_4.wav ├── daj_r_5.wav ├── dx_r_1.wav ├── dx_r_2.wav ├── dx_r_3.wav ├── dx_r_4.wav ├── dx_r_5.wav ├── gui_10000224.wav ├── gui_r_1.wav ├── gui_r_2.wav ├── gui_r_3.wav ├── gui_r_4.wav ├── gui_r_5.wav ├── inset_dvec ├── dvec_101.wav ├── dvec_1161.wav ├── dvec_1189.wav ├── dvec_1512.wav ├── dvec_238.wav └── dvec_658.wav ├── inset_emb ├── lut_101.wav ├── lut_1161.wav ├── lut_1189.wav ├── lut_1512.wav ├── lut_238.wav └── lut_658.wav ├── inset_ref ├── 101.wav ├── 1161.wav ├── 1189.wav ├── 1512.wav ├── 238.wav └── 658.wav ├── liu_88010505.wav ├── liu_r_1.wav ├── liu_r_2.wav ├── liu_r_3.wav ├── liu_r_4.wav ├── liu_r_5.wav ├── lpl_r_1.wav ├── lpl_r_2.wav ├── lpl_r_3.wav ├── lpl_r_4.wav ├── lpl_r_5.wav ├── lpl_sample.wav ├── outset_dvec ├── dvec_10001.wav ├── dvec_10003.wav ├── dvec_10004.wav └── dvec_10005.wav ├── song_dx_801000013.wav ├── speech_and_singing ├── speech_and_sing_10001.wav ├── speech_and_sing_10002.wav ├── speech_and_sing_10003.wav ├── speech_and_sing_10004.wav ├── speech_and_sing_10005.wav └── speech_and_sing_10006.wav ├── speech_only ├── speech_10001.wav ├── speech_10002.wav ├── speech_10003.wav ├── speech_10004.wav ├── speech_10005.wav └── speech_10006.wav ├── speech_ref ├── 10001.wav ├── 10002.wav ├── 10003.wav ├── 10004.wav ├── 10005.wav └── 10006.wav ├── ssx_08010642.wav ├── ssx_r_1.wav ├── ssx_r_2.wav ├── ssx_r_3.wav ├── ssx_r_4.wav └── ssx_r_5.wav /README.md: -------------------------------------------------------------------------------- 1 | # DurIAN : DurIAN-SC: Duration Informed Attention Network based Singing Voice Conversion System 2 | 3 | Project page for our paper "DurIAN : DurIAN-SC: Duration Informed Attention Network based Singing Voice Conversion System". 
Link is: https://tencent-ailab.github.io/learning_singing_from_speech/ 4 | -------------------------------------------------------------------------------- /_config.yml: -------------------------------------------------------------------------------- 1 | theme: jekyll-theme-cayman 2 | github: 3 | is_project_page: false 4 | title: [DurIAN_SC] 5 | -------------------------------------------------------------------------------- /images/DurIAN_4S.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/images/DurIAN_4S.png -------------------------------------------------------------------------------- /images/DurIAN_SC.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/images/DurIAN_SC.png -------------------------------------------------------------------------------- /images/logo.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/images/logo.png -------------------------------------------------------------------------------- /index.md: -------------------------------------------------------------------------------- 1 | #
DurIAN-SC: Duration Informed Attention Network based Singing Voice Conversion System
2 | 3 |
Liqiang Zhang, Chengzhu Yu, Heng Lu, Chao Weng, Chunlei Zhang, Yusong Wu, Xiang Xie, Zijin Li, Dong Yu
4 |
Tencent AI Lab
5 | 6 | 7 | ## Abstract 8 | 9 |
Singing voice conversion converts the timbre of the source singing to the target speaker's voice while keeping the singing content the same. However, singing data for a target speaker is much more difficult to collect than normal speech data. In this paper, we introduce a singing voice conversion algorithm that is capable of generating high-quality singing in the target speaker's voice using only his/her normal speech data. First, we integrate the training and conversion processes of speech and singing into one framework by unifying the features used in standard speech synthesis and singing synthesis systems. In this way, normal speech data can also contribute to singing voice conversion training, making the singing voice conversion system more robust, especially when the singing database is small. Moreover, in order to achieve one-shot singing voice conversion, a speaker embedding module is developed using both speech and singing data, which provides target speaker identity information during conversion. Experiments indicate that the proposed singing conversion system can convert source singing to the target speaker's high-quality singing with only 20 seconds of the target speaker's enrollment speech data.
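The page does not include model code, so the following is only a minimal sketch of the two speaker-embedding styles compared in the samples, not the authors' implementation. It assumes a look-up table (LUT) stores one vector per training speaker, and that a d-vector is formed by averaging per-utterance embeddings from a speaker encoder and L2-normalizing the result. All function names and values are hypothetical.

```python
import math


def l2_normalize(vec):
    """Scale a vector to unit length (returned unchanged if it is all zeros)."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm > 0 else list(vec)


def lut_embedding(table, speaker_id):
    """LUT lookup: only speakers seen during training have an entry."""
    return table[speaker_id]  # raises KeyError for an unseen speaker


def d_vector(utterance_embeddings):
    """Average per-utterance embeddings and L2-normalize the mean."""
    count = len(utterance_embeddings)
    dim = len(utterance_embeddings[0])
    mean = [sum(e[i] for e in utterance_embeddings) / count for i in range(dim)]
    return l2_normalize(mean)


def cosine_similarity(a, b):
    """Cosine similarity between two embeddings."""
    return sum(x * y for x, y in zip(l2_normalize(a), l2_normalize(b)))


# Hypothetical toy values; a real system would obtain the per-utterance
# embeddings from a trained speaker encoder run on the enrollment speech.
lut = {"in_set_singer": [0.6, 0.8]}
enrollment = [[0.5, 0.9], [0.7, 0.7]]
unseen_speaker = d_vector(enrollment)
similarity = cosine_similarity(unseen_speaker, lut_embedding(lut, "in_set_singer"))
```

Under these assumptions, the LUT can only serve in-set speakers, while the d-vector path can produce an embedding for a new speaker from a short enrollment recording, which is what the out-of-set samples on this page exercise.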
10 | 11 |
12 | 13 | ![arch](images/DurIAN_SC.png) 14 | 15 |
16 | 17 | ## Look Up Table(LUT) based speaker embedding & D-vector based speaker embedding 18 | 19 |
 20 | \* Note: All samples are in Mandarin Chinese. 21 |
22 | \* There are 6 in-set singers---3 male singers and 3 female singers, shown here. 23 |
 24 | \* The "Reference Voice", a recording of the target singer's singing, is provided for the timbre similarity test. 25 | 26 | 27 |
28 | 29 | 30 | 31 | 32 | 33 | 34 | 35 | 36 | 37 | 38 | 39 | 40 | 41 | 42 | 44 | 46 | 48 | 49 | 50 | 51 | 53 | 55 | 57 | 58 | 59 | 60 | 62 | 64 | 66 | 67 | 68 | 69 | 71 | 73 | 75 | 76 | 77 | 78 | 80 | 82 | 84 | 85 | 86 | 87 | 89 | 91 | 93 | 94 | 95 |
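| | Reference Voice | LUT Sample | D-vector Sample |
|---|---|---|---|
| Female Singer1 | (audio) | (audio) | (audio) |
| Female Singer2 | (audio) | (audio) | (audio) |
| Female Singer3 | (audio) | (audio) | (audio) |
| Male Singer1 | (audio) | (audio) | (audio) |
| Male Singer2 | (audio) | (audio) | (audio) |
| Male Singer3 | (audio) | (audio) | (audio) |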
96 | 97 |
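The audio players for the in-set comparison are not preserved here, but the repository tree lists matching files under `wavs/inset_ref`, `wavs/inset_emb`, and `wavs/inset_dvec`. The sketch below shows how one row of such a comparison could be assembled from a sample ID. It assumes these three directories back the three columns, which the file names suggest but the page does not state; which ID belongs to which singer label is not recoverable from the listing, and `in_set_row` is a hypothetical helper.

```python
# Sample IDs as they appear in the repository tree under wavs/inset_ref.
IN_SET_IDS = ["101", "1161", "1189", "1512", "238", "658"]


def in_set_row(sample_id, root="wavs"):
    """Build the three file paths for one in-set sample ID.

    Assumption: reference, LUT, and d-vector samples share the same ID.
    """
    return {
        "reference": f"{root}/inset_ref/{sample_id}.wav",
        "lut": f"{root}/inset_emb/lut_{sample_id}.wav",
        "d_vector": f"{root}/inset_dvec/dvec_{sample_id}.wav",
    }


rows = [in_set_row(sample_id) for sample_id in IN_SET_IDS]
```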
98 | 99 | ## Out-of-set test of D-vector based speaker embedding 100 | 101 |
 102 | \* Note: All samples are in Mandarin Chinese. 103 |
104 | \* There are 4 out-of-set speakers---2 male speakers and 2 female speakers, shown here. 105 |
 106 | \* The "Register Voice", a recording of the target speaker's speech, is provided for the similarity test. 107 | 108 | 109 |
110 | 111 | 112 | 113 | 114 | 115 | 116 | 117 | 118 | 119 | 120 | 121 | 122 | 124 | 126 | 127 | 128 | 129 | 131 | 133 | 134 | 135 | 136 | 138 | 140 | 141 | 142 | 143 | 145 | 147 | 148 | 149 |
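| | Register Voice | D-vector Sample |
|---|---|---|
| Female Speaker1 | (audio) | (audio) |
| Female Speaker2 | (audio) | (audio) |
| Male Speaker1 | (audio) | (audio) |
| Male Speaker2 | (audio) | (audio) |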
150 | 151 |
152 | 153 | 154 | ## Training with speech corpus 155 | 156 |
 157 | \* Note: All samples are in Mandarin Chinese. 158 |
159 | \* There are 6 in-set speakers---3 male speakers and 3 female speakers, shown here. 160 |
 161 | \* The "Reference Voice", a recording of the target speaker's speech, is provided for the timbre similarity test. 162 |
163 | \* "Speech only" means training only with speech data, while "Speech & Singing" means training with speech data and other singers' singing data. 164 | 165 |
166 | 167 | 168 | 169 | 170 | 171 | 172 | 173 | 174 | 175 | 176 | 177 | 178 | 179 | 180 | 182 | 184 | 186 | 187 | 188 | 189 | 191 | 193 | 195 | 196 | 197 | 198 | 200 | 202 | 204 | 205 | 206 | 207 | 209 | 211 | 213 | 214 | 215 | 216 | 218 | 220 | 222 | 223 | 224 | 225 | 227 | 229 | 231 | 232 | 233 |
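| | Reference Voice | Speech Only | Speech & Singing |
|---|---|---|---|
| Male Speaker1 | (audio) | (audio) | (audio) |
| Male Speaker2 | (audio) | (audio) | (audio) |
| Male Speaker3 | (audio) | (audio) | (audio) |
| Female Speaker1 | (audio) | (audio) | (audio) |
| Female Speaker2 | (audio) | (audio) | (audio) |
| Female Speaker3 | (audio) | (audio) | (audio) |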
234 | 235 |
-------------------------------------------------------------------------------- /wavs/025_000009.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/025_000009.wav -------------------------------------------------------------------------------- /wavs/025_r_1.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/025_r_1.wav -------------------------------------------------------------------------------- /wavs/025_r_2.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/025_r_2.wav -------------------------------------------------------------------------------- /wavs/025_r_3.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/025_r_3.wav -------------------------------------------------------------------------------- /wavs/025_r_4.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/025_r_4.wav -------------------------------------------------------------------------------- /wavs/025_r_5.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/025_r_5.wav -------------------------------------------------------------------------------- 
/wavs/daj_000023.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/daj_000023.wav -------------------------------------------------------------------------------- /wavs/daj_r_1.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/daj_r_1.wav -------------------------------------------------------------------------------- /wavs/daj_r_2.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/daj_r_2.wav -------------------------------------------------------------------------------- /wavs/daj_r_3.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/daj_r_3.wav -------------------------------------------------------------------------------- /wavs/daj_r_4.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/daj_r_4.wav -------------------------------------------------------------------------------- /wavs/daj_r_5.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/daj_r_5.wav -------------------------------------------------------------------------------- /wavs/dx_r_1.wav: 
-------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/dx_r_1.wav -------------------------------------------------------------------------------- /wavs/dx_r_2.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/dx_r_2.wav -------------------------------------------------------------------------------- /wavs/dx_r_3.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/dx_r_3.wav -------------------------------------------------------------------------------- /wavs/dx_r_4.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/dx_r_4.wav -------------------------------------------------------------------------------- /wavs/dx_r_5.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/dx_r_5.wav -------------------------------------------------------------------------------- /wavs/gui_10000224.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/gui_10000224.wav -------------------------------------------------------------------------------- /wavs/gui_r_1.wav: -------------------------------------------------------------------------------- 
https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/gui_r_1.wav -------------------------------------------------------------------------------- /wavs/gui_r_2.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/gui_r_2.wav -------------------------------------------------------------------------------- /wavs/gui_r_3.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/gui_r_3.wav -------------------------------------------------------------------------------- /wavs/gui_r_4.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/gui_r_4.wav -------------------------------------------------------------------------------- /wavs/gui_r_5.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/gui_r_5.wav -------------------------------------------------------------------------------- /wavs/inset_dvec/dvec_101.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/inset_dvec/dvec_101.wav -------------------------------------------------------------------------------- /wavs/inset_dvec/dvec_1161.wav: -------------------------------------------------------------------------------- 
https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/inset_dvec/dvec_1161.wav -------------------------------------------------------------------------------- /wavs/inset_dvec/dvec_1189.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/inset_dvec/dvec_1189.wav -------------------------------------------------------------------------------- /wavs/inset_dvec/dvec_1512.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/inset_dvec/dvec_1512.wav -------------------------------------------------------------------------------- /wavs/inset_dvec/dvec_238.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/inset_dvec/dvec_238.wav -------------------------------------------------------------------------------- /wavs/inset_dvec/dvec_658.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/inset_dvec/dvec_658.wav -------------------------------------------------------------------------------- /wavs/inset_emb/lut_101.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/inset_emb/lut_101.wav -------------------------------------------------------------------------------- /wavs/inset_emb/lut_1161.wav: 
-------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/inset_emb/lut_1161.wav -------------------------------------------------------------------------------- /wavs/inset_emb/lut_1189.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/inset_emb/lut_1189.wav -------------------------------------------------------------------------------- /wavs/inset_emb/lut_1512.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/inset_emb/lut_1512.wav -------------------------------------------------------------------------------- /wavs/inset_emb/lut_238.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/inset_emb/lut_238.wav -------------------------------------------------------------------------------- /wavs/inset_emb/lut_658.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/inset_emb/lut_658.wav -------------------------------------------------------------------------------- /wavs/inset_ref/101.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/inset_ref/101.wav -------------------------------------------------------------------------------- 
/wavs/inset_ref/1161.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/inset_ref/1161.wav -------------------------------------------------------------------------------- /wavs/inset_ref/1189.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/inset_ref/1189.wav -------------------------------------------------------------------------------- /wavs/inset_ref/1512.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/inset_ref/1512.wav -------------------------------------------------------------------------------- /wavs/inset_ref/238.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/inset_ref/238.wav -------------------------------------------------------------------------------- /wavs/inset_ref/658.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/inset_ref/658.wav -------------------------------------------------------------------------------- /wavs/liu_88010505.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/liu_88010505.wav -------------------------------------------------------------------------------- /wavs/liu_r_1.wav: 
-------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/liu_r_1.wav -------------------------------------------------------------------------------- /wavs/liu_r_2.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/liu_r_2.wav -------------------------------------------------------------------------------- /wavs/liu_r_3.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/liu_r_3.wav -------------------------------------------------------------------------------- /wavs/liu_r_4.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/liu_r_4.wav -------------------------------------------------------------------------------- /wavs/liu_r_5.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/liu_r_5.wav -------------------------------------------------------------------------------- /wavs/lpl_r_1.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/lpl_r_1.wav -------------------------------------------------------------------------------- /wavs/lpl_r_2.wav: -------------------------------------------------------------------------------- 
https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/lpl_r_2.wav -------------------------------------------------------------------------------- /wavs/lpl_r_3.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/lpl_r_3.wav -------------------------------------------------------------------------------- /wavs/lpl_r_4.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/lpl_r_4.wav -------------------------------------------------------------------------------- /wavs/lpl_r_5.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/lpl_r_5.wav -------------------------------------------------------------------------------- /wavs/lpl_sample.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/lpl_sample.wav -------------------------------------------------------------------------------- /wavs/outset_dvec/dvec_10001.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/outset_dvec/dvec_10001.wav -------------------------------------------------------------------------------- /wavs/outset_dvec/dvec_10003.wav: -------------------------------------------------------------------------------- 
https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/outset_dvec/dvec_10003.wav -------------------------------------------------------------------------------- /wavs/outset_dvec/dvec_10004.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/outset_dvec/dvec_10004.wav -------------------------------------------------------------------------------- /wavs/outset_dvec/dvec_10005.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/outset_dvec/dvec_10005.wav -------------------------------------------------------------------------------- /wavs/song_dx_801000013.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/song_dx_801000013.wav -------------------------------------------------------------------------------- /wavs/speech_and_singing/speech_and_sing_10001.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/speech_and_singing/speech_and_sing_10001.wav -------------------------------------------------------------------------------- /wavs/speech_and_singing/speech_and_sing_10002.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/speech_and_singing/speech_and_sing_10002.wav 
-------------------------------------------------------------------------------- /wavs/speech_and_singing/speech_and_sing_10003.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/speech_and_singing/speech_and_sing_10003.wav -------------------------------------------------------------------------------- /wavs/speech_and_singing/speech_and_sing_10004.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/speech_and_singing/speech_and_sing_10004.wav -------------------------------------------------------------------------------- /wavs/speech_and_singing/speech_and_sing_10005.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/speech_and_singing/speech_and_sing_10005.wav -------------------------------------------------------------------------------- /wavs/speech_and_singing/speech_and_sing_10006.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/speech_and_singing/speech_and_sing_10006.wav -------------------------------------------------------------------------------- /wavs/speech_only/speech_10001.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/speech_only/speech_10001.wav -------------------------------------------------------------------------------- /wavs/speech_only/speech_10002.wav: 
-------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/speech_only/speech_10002.wav -------------------------------------------------------------------------------- /wavs/speech_only/speech_10003.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/speech_only/speech_10003.wav -------------------------------------------------------------------------------- /wavs/speech_only/speech_10004.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/speech_only/speech_10004.wav -------------------------------------------------------------------------------- /wavs/speech_only/speech_10005.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/speech_only/speech_10005.wav -------------------------------------------------------------------------------- /wavs/speech_only/speech_10006.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/speech_only/speech_10006.wav -------------------------------------------------------------------------------- /wavs/speech_ref/10001.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/speech_ref/10001.wav 
-------------------------------------------------------------------------------- /wavs/speech_ref/10002.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/speech_ref/10002.wav -------------------------------------------------------------------------------- /wavs/speech_ref/10003.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/speech_ref/10003.wav -------------------------------------------------------------------------------- /wavs/speech_ref/10004.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/speech_ref/10004.wav -------------------------------------------------------------------------------- /wavs/speech_ref/10005.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/speech_ref/10005.wav -------------------------------------------------------------------------------- /wavs/speech_ref/10006.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/speech_ref/10006.wav -------------------------------------------------------------------------------- /wavs/ssx_08010642.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/ssx_08010642.wav 
-------------------------------------------------------------------------------- /wavs/ssx_r_1.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/ssx_r_1.wav -------------------------------------------------------------------------------- /wavs/ssx_r_2.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/ssx_r_2.wav -------------------------------------------------------------------------------- /wavs/ssx_r_3.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/ssx_r_3.wav -------------------------------------------------------------------------------- /wavs/ssx_r_4.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/ssx_r_4.wav -------------------------------------------------------------------------------- /wavs/ssx_r_5.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tencent-ailab/learning_singing_from_speech/f17488de7849c50b779839b5e6fc05674a3eabd6/wavs/ssx_r_5.wav --------------------------------------------------------------------------------