├── .gitignore ├── LICENSE ├── README.md ├── api.py ├── configs ├── E2TTS_Base_train.yaml ├── E2TTS_Small_train.yaml ├── F5TTS_Base_train.yaml └── F5TTS_Small_train.yaml ├── data ├── Emilia_ZH_EN_pinyin │ └── vocab.txt └── librispeech_pc_test_clean_cross_sentence.lst ├── f5_tts ├── api.py ├── configs │ ├── E2TTS_Base_train.yaml │ ├── E2TTS_Small_train.yaml │ ├── F5TTS_Base_train.yaml │ └── F5TTS_Small_train.yaml ├── eval │ ├── README.md │ ├── ecapa_tdnn.py │ ├── eval_infer_batch.py │ ├── eval_infer_batch.sh │ ├── eval_librispeech_test_clean.py │ ├── eval_seedtts_testset.py │ ├── eval_utmos.py │ └── utils_eval.py ├── infer │ ├── README.md │ ├── SHARED.md │ ├── examples │ │ ├── basic │ │ │ ├── basic.toml │ │ │ ├── basic_ref_en.wav │ │ │ └── basic_ref_zh.wav │ │ ├── multi │ │ │ ├── country.flac │ │ │ ├── main.flac │ │ │ ├── story.toml │ │ │ ├── story.txt │ │ │ └── town.flac │ │ └── vocab.txt │ ├── infer_cli.py │ ├── infer_gradio.py │ ├── speech_edit.py │ └── utils_infer.py ├── model │ ├── __init__.py │ ├── backbones │ │ ├── README.md │ │ ├── dit.py │ │ ├── mmdit.py │ │ └── unett.py │ ├── cfm.py │ ├── dataset.py │ ├── modules.py │ ├── trainer.py │ └── utils.py ├── scripts │ ├── count_max_epoch.py │ └── count_params_gflops.py ├── socket_server.py └── train │ ├── README.md │ ├── datasets │ ├── prepare_csv_wavs.py │ ├── prepare_emilia.py │ ├── prepare_libritts.py │ ├── prepare_ljspeech.py │ └── prepare_wenetspeech4tts.py │ ├── finetune_cli.py │ ├── finetune_gradio.py │ └── train.py ├── requirements.txt ├── run-api.bat ├── run-webui.bat ├── runtest.bat ├── test.py ├── testcuda.py └── 测试GPU是否可用.bat /.gitignore: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jianchang512/f5-tts-api/HEAD/.gitignore -------------------------------------------------------------------------------- /LICENSE: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jianchang512/f5-tts-api/HEAD/LICENSE -------------------------------------------------------------------------------- /README.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jianchang512/f5-tts-api/HEAD/README.md -------------------------------------------------------------------------------- /api.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jianchang512/f5-tts-api/HEAD/api.py -------------------------------------------------------------------------------- /configs/E2TTS_Base_train.yaml: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jianchang512/f5-tts-api/HEAD/configs/E2TTS_Base_train.yaml -------------------------------------------------------------------------------- /configs/E2TTS_Small_train.yaml: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jianchang512/f5-tts-api/HEAD/configs/E2TTS_Small_train.yaml -------------------------------------------------------------------------------- /configs/F5TTS_Base_train.yaml: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jianchang512/f5-tts-api/HEAD/configs/F5TTS_Base_train.yaml -------------------------------------------------------------------------------- /configs/F5TTS_Small_train.yaml: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jianchang512/f5-tts-api/HEAD/configs/F5TTS_Small_train.yaml -------------------------------------------------------------------------------- /data/Emilia_ZH_EN_pinyin/vocab.txt: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jianchang512/f5-tts-api/HEAD/data/Emilia_ZH_EN_pinyin/vocab.txt -------------------------------------------------------------------------------- /data/librispeech_pc_test_clean_cross_sentence.lst: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jianchang512/f5-tts-api/HEAD/data/librispeech_pc_test_clean_cross_sentence.lst -------------------------------------------------------------------------------- /f5_tts/api.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jianchang512/f5-tts-api/HEAD/f5_tts/api.py -------------------------------------------------------------------------------- /f5_tts/configs/E2TTS_Base_train.yaml: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jianchang512/f5-tts-api/HEAD/f5_tts/configs/E2TTS_Base_train.yaml -------------------------------------------------------------------------------- /f5_tts/configs/E2TTS_Small_train.yaml: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jianchang512/f5-tts-api/HEAD/f5_tts/configs/E2TTS_Small_train.yaml -------------------------------------------------------------------------------- /f5_tts/configs/F5TTS_Base_train.yaml: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jianchang512/f5-tts-api/HEAD/f5_tts/configs/F5TTS_Base_train.yaml -------------------------------------------------------------------------------- /f5_tts/configs/F5TTS_Small_train.yaml: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jianchang512/f5-tts-api/HEAD/f5_tts/configs/F5TTS_Small_train.yaml -------------------------------------------------------------------------------- /f5_tts/eval/README.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jianchang512/f5-tts-api/HEAD/f5_tts/eval/README.md -------------------------------------------------------------------------------- /f5_tts/eval/ecapa_tdnn.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jianchang512/f5-tts-api/HEAD/f5_tts/eval/ecapa_tdnn.py -------------------------------------------------------------------------------- /f5_tts/eval/eval_infer_batch.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jianchang512/f5-tts-api/HEAD/f5_tts/eval/eval_infer_batch.py -------------------------------------------------------------------------------- /f5_tts/eval/eval_infer_batch.sh: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jianchang512/f5-tts-api/HEAD/f5_tts/eval/eval_infer_batch.sh -------------------------------------------------------------------------------- /f5_tts/eval/eval_librispeech_test_clean.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jianchang512/f5-tts-api/HEAD/f5_tts/eval/eval_librispeech_test_clean.py -------------------------------------------------------------------------------- /f5_tts/eval/eval_seedtts_testset.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jianchang512/f5-tts-api/HEAD/f5_tts/eval/eval_seedtts_testset.py -------------------------------------------------------------------------------- /f5_tts/eval/eval_utmos.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jianchang512/f5-tts-api/HEAD/f5_tts/eval/eval_utmos.py -------------------------------------------------------------------------------- /f5_tts/eval/utils_eval.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jianchang512/f5-tts-api/HEAD/f5_tts/eval/utils_eval.py -------------------------------------------------------------------------------- /f5_tts/infer/README.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jianchang512/f5-tts-api/HEAD/f5_tts/infer/README.md -------------------------------------------------------------------------------- /f5_tts/infer/SHARED.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jianchang512/f5-tts-api/HEAD/f5_tts/infer/SHARED.md -------------------------------------------------------------------------------- /f5_tts/infer/examples/basic/basic.toml: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jianchang512/f5-tts-api/HEAD/f5_tts/infer/examples/basic/basic.toml -------------------------------------------------------------------------------- /f5_tts/infer/examples/basic/basic_ref_en.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jianchang512/f5-tts-api/HEAD/f5_tts/infer/examples/basic/basic_ref_en.wav -------------------------------------------------------------------------------- /f5_tts/infer/examples/basic/basic_ref_zh.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jianchang512/f5-tts-api/HEAD/f5_tts/infer/examples/basic/basic_ref_zh.wav -------------------------------------------------------------------------------- /f5_tts/infer/examples/multi/country.flac: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jianchang512/f5-tts-api/HEAD/f5_tts/infer/examples/multi/country.flac -------------------------------------------------------------------------------- /f5_tts/infer/examples/multi/main.flac: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jianchang512/f5-tts-api/HEAD/f5_tts/infer/examples/multi/main.flac -------------------------------------------------------------------------------- /f5_tts/infer/examples/multi/story.toml: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jianchang512/f5-tts-api/HEAD/f5_tts/infer/examples/multi/story.toml -------------------------------------------------------------------------------- /f5_tts/infer/examples/multi/story.txt: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jianchang512/f5-tts-api/HEAD/f5_tts/infer/examples/multi/story.txt -------------------------------------------------------------------------------- /f5_tts/infer/examples/multi/town.flac: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jianchang512/f5-tts-api/HEAD/f5_tts/infer/examples/multi/town.flac -------------------------------------------------------------------------------- /f5_tts/infer/examples/vocab.txt: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jianchang512/f5-tts-api/HEAD/f5_tts/infer/examples/vocab.txt -------------------------------------------------------------------------------- /f5_tts/infer/infer_cli.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jianchang512/f5-tts-api/HEAD/f5_tts/infer/infer_cli.py -------------------------------------------------------------------------------- /f5_tts/infer/infer_gradio.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jianchang512/f5-tts-api/HEAD/f5_tts/infer/infer_gradio.py -------------------------------------------------------------------------------- /f5_tts/infer/speech_edit.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jianchang512/f5-tts-api/HEAD/f5_tts/infer/speech_edit.py -------------------------------------------------------------------------------- /f5_tts/infer/utils_infer.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jianchang512/f5-tts-api/HEAD/f5_tts/infer/utils_infer.py -------------------------------------------------------------------------------- /f5_tts/model/__init__.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jianchang512/f5-tts-api/HEAD/f5_tts/model/__init__.py -------------------------------------------------------------------------------- /f5_tts/model/backbones/README.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jianchang512/f5-tts-api/HEAD/f5_tts/model/backbones/README.md -------------------------------------------------------------------------------- /f5_tts/model/backbones/dit.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jianchang512/f5-tts-api/HEAD/f5_tts/model/backbones/dit.py -------------------------------------------------------------------------------- /f5_tts/model/backbones/mmdit.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jianchang512/f5-tts-api/HEAD/f5_tts/model/backbones/mmdit.py -------------------------------------------------------------------------------- /f5_tts/model/backbones/unett.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jianchang512/f5-tts-api/HEAD/f5_tts/model/backbones/unett.py -------------------------------------------------------------------------------- /f5_tts/model/cfm.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jianchang512/f5-tts-api/HEAD/f5_tts/model/cfm.py -------------------------------------------------------------------------------- /f5_tts/model/dataset.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jianchang512/f5-tts-api/HEAD/f5_tts/model/dataset.py -------------------------------------------------------------------------------- /f5_tts/model/modules.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jianchang512/f5-tts-api/HEAD/f5_tts/model/modules.py -------------------------------------------------------------------------------- /f5_tts/model/trainer.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jianchang512/f5-tts-api/HEAD/f5_tts/model/trainer.py -------------------------------------------------------------------------------- /f5_tts/model/utils.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jianchang512/f5-tts-api/HEAD/f5_tts/model/utils.py -------------------------------------------------------------------------------- /f5_tts/scripts/count_max_epoch.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jianchang512/f5-tts-api/HEAD/f5_tts/scripts/count_max_epoch.py -------------------------------------------------------------------------------- /f5_tts/scripts/count_params_gflops.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jianchang512/f5-tts-api/HEAD/f5_tts/scripts/count_params_gflops.py -------------------------------------------------------------------------------- /f5_tts/socket_server.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jianchang512/f5-tts-api/HEAD/f5_tts/socket_server.py -------------------------------------------------------------------------------- /f5_tts/train/README.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jianchang512/f5-tts-api/HEAD/f5_tts/train/README.md -------------------------------------------------------------------------------- /f5_tts/train/datasets/prepare_csv_wavs.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jianchang512/f5-tts-api/HEAD/f5_tts/train/datasets/prepare_csv_wavs.py -------------------------------------------------------------------------------- /f5_tts/train/datasets/prepare_emilia.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jianchang512/f5-tts-api/HEAD/f5_tts/train/datasets/prepare_emilia.py -------------------------------------------------------------------------------- /f5_tts/train/datasets/prepare_libritts.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jianchang512/f5-tts-api/HEAD/f5_tts/train/datasets/prepare_libritts.py -------------------------------------------------------------------------------- /f5_tts/train/datasets/prepare_ljspeech.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jianchang512/f5-tts-api/HEAD/f5_tts/train/datasets/prepare_ljspeech.py -------------------------------------------------------------------------------- /f5_tts/train/datasets/prepare_wenetspeech4tts.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jianchang512/f5-tts-api/HEAD/f5_tts/train/datasets/prepare_wenetspeech4tts.py -------------------------------------------------------------------------------- /f5_tts/train/finetune_cli.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jianchang512/f5-tts-api/HEAD/f5_tts/train/finetune_cli.py -------------------------------------------------------------------------------- /f5_tts/train/finetune_gradio.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jianchang512/f5-tts-api/HEAD/f5_tts/train/finetune_gradio.py -------------------------------------------------------------------------------- /f5_tts/train/train.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jianchang512/f5-tts-api/HEAD/f5_tts/train/train.py -------------------------------------------------------------------------------- /requirements.txt: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jianchang512/f5-tts-api/HEAD/requirements.txt -------------------------------------------------------------------------------- /run-api.bat: -------------------------------------------------------------------------------- 1 | @echo off 2 | 3 | 4 | call %cd%/runtime/python api.py 5 | 6 | pause -------------------------------------------------------------------------------- /run-webui.bat: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jianchang512/f5-tts-api/HEAD/run-webui.bat -------------------------------------------------------------------------------- /runtest.bat: -------------------------------------------------------------------------------- 1 | @echo off 2 | 3 | 4 | call %cd%/runtime/python test.py 5 | 6 | pause -------------------------------------------------------------------------------- /test.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jianchang512/f5-tts-api/HEAD/test.py -------------------------------------------------------------------------------- /testcuda.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jianchang512/f5-tts-api/HEAD/testcuda.py -------------------------------------------------------------------------------- /测试GPU是否可用.bat: -------------------------------------------------------------------------------- 1 | @echo off 2 | 3 | 4 | call %cd%/runtime/python testcuda.py 5 | 6 | pause --------------------------------------------------------------------------------