A list of direct speech-to-speech translation papers. Recommendations of more awesome papers are welcome 😀.

---

## Dataset
- CVSS Corpus and Massively Multilingual Speech-to-Speech Translation, [[paper]](https://arxiv.org/abs/2201.03713). Ye Jia, et al.
- SpeechMatrix: A Large-Scale Mined Corpus of Multilingual Speech-to-Speech Translations, [[paper]](https://arxiv.org/abs/2201.03713). Paul-Ambroise Duquenne, et al.

## Paper List

- Direct speech-to-speech translation with a sequence-to-sequence model, **InterSpeech-2019**, [[paper]](https://arxiv.org/abs/1904.06037). Ye Jia, et al.
- Speech-To-Speech Translation Between Untranscribed Unknown Languages, **ASRU-2019**, [[paper]](https://arxiv.org/abs/1910.00795). Andros Tjandra, et al.
- UWSpeech: Speech to Speech Translation for Unwritten Languages, **AAAI**, [[paper]](https://arxiv.org/abs/2006.07926). Chen Zhang, et al.
- Transformer-Based Direct Speech-To-Speech Translation With Transcoder, **SLT-2021**, [[paper]](https://ahcweb01.naist.jp/papers/conference/2021/202101_SLT_takatomo-k/202101_SLT_takatomo-k.paper.pdf). Takatomo Kano, et al.
- Direct Speech-To-Speech Translation With Discrete Units, **ACL 2022**, [[paper]](https://arxiv.org/abs/2107.05604). Ann Lee, et al.
- Translatotron 2: Robust Direct Speech-To-Speech Translation, **ICML 2022**, [[paper]](https://arxiv.org/abs/2107.08661). Ye Jia, et al.
- Direct Simultaneous Speech To Speech Translation, **Arxiv-2021**, [[paper]](https://arxiv.org/abs/2110.08250). Xutai Ma, et al.
- Textless Speech-to-Speech Translation on Real Data, **NAACL 2022**, [[paper]](https://arxiv.org/abs/2112.08352). Ann Lee, et al.
- Enhanced Direct Speech-to-Speech Translation Using Self-supervised Pre-training and Data Augmentation, **InterSpeech-2022**, [[paper]](https://arxiv.org/abs/2204.02967). Sravya Popuri, et al.
- Leveraging unsupervised and weakly-supervised data to improve direct speech-to-speech translation, **InterSpeech-2022**, [[paper]](https://arxiv.org/abs/2203.13339). Ye Jia, et al.
- TranSpeech: Speech-to-Speech Translation With Bilateral Perturbation, **ICLR 2023**, [[paper]](https://arxiv.org/abs/2205.12523). Rongjie Huang, et al.
- UnitY: Two-pass Direct Speech-to-speech Translation with Discrete Units, **Arxiv 2022**, [[paper]](https://arxiv.org/abs/2212.08055). Hirofumi Inaguma, et al.
- Speech-to-Speech Translation For A Real-world Unwritten Language, **Arxiv 2022**, [[paper]](https://arxiv.org/abs/2211.06474). Peng-Jen Chen, et al.
- Simple and Effective Unsupervised Speech Translation, **Arxiv 2022**, [[paper]](https://arxiv.org/abs/2210.10191). Changhan Wang, et al.