├── .dockerignore ├── .gitignore ├── Dockerfile ├── LICENSE-CODE ├── LICENSE-MODEL ├── README.md ├── RealEdit.txt ├── cog.yaml ├── config.py ├── data ├── __init__.py ├── gigaspeech.py ├── phonemize_encodec_encode_hf.py └── tokenizer.py ├── demo ├── 5895_34622_000026_000002.wav ├── 84_121550_000074_000000.wav ├── pam.wav └── temp │ ├── 84_121550_000074_000000.txt │ └── mfa_alignments │ ├── 5895_34622_000026_000002.csv │ └── 84_121550_000074_000000.csv ├── edit_utils.py ├── environment.yml ├── gradio_app.ipynb ├── gradio_app.py ├── gradio_requirements.txt ├── inference_speech_editing.ipynb ├── inference_speech_editing_scale.py ├── inference_tts.ipynb ├── inference_tts_scale.py ├── main.py ├── models ├── codebooks_patterns.py ├── modules │ ├── __init__.py │ ├── activation.py │ ├── embedding.py │ ├── sampling.py │ ├── scaling.py │ ├── transformer.py │ └── utils.py └── voicecraft.py ├── predict.py ├── pretrained_models └── .gitkeep ├── start-jupyter.bat ├── start-jupyter.sh ├── steps ├── __init__.py ├── optim.py ├── trainer.py └── trainer_utils.py ├── tts_demo.py ├── voicecraft-gradio-colab.ipynb └── z_scripts ├── e830M.sh └── e830M_ft.sh /.dockerignore: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jasonppy/VoiceCraft/HEAD/.dockerignore -------------------------------------------------------------------------------- /.gitignore: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jasonppy/VoiceCraft/HEAD/.gitignore -------------------------------------------------------------------------------- /Dockerfile: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jasonppy/VoiceCraft/HEAD/Dockerfile -------------------------------------------------------------------------------- /LICENSE-CODE: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jasonppy/VoiceCraft/HEAD/LICENSE-CODE -------------------------------------------------------------------------------- /LICENSE-MODEL: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jasonppy/VoiceCraft/HEAD/LICENSE-MODEL -------------------------------------------------------------------------------- /README.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jasonppy/VoiceCraft/HEAD/README.md -------------------------------------------------------------------------------- /RealEdit.txt: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jasonppy/VoiceCraft/HEAD/RealEdit.txt -------------------------------------------------------------------------------- /cog.yaml: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jasonppy/VoiceCraft/HEAD/cog.yaml -------------------------------------------------------------------------------- /config.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jasonppy/VoiceCraft/HEAD/config.py -------------------------------------------------------------------------------- /data/__init__.py: -------------------------------------------------------------------------------- 1 | -------------------------------------------------------------------------------- /data/gigaspeech.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jasonppy/VoiceCraft/HEAD/data/gigaspeech.py -------------------------------------------------------------------------------- /data/phonemize_encodec_encode_hf.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jasonppy/VoiceCraft/HEAD/data/phonemize_encodec_encode_hf.py -------------------------------------------------------------------------------- /data/tokenizer.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jasonppy/VoiceCraft/HEAD/data/tokenizer.py -------------------------------------------------------------------------------- /demo/5895_34622_000026_000002.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jasonppy/VoiceCraft/HEAD/demo/5895_34622_000026_000002.wav -------------------------------------------------------------------------------- /demo/84_121550_000074_000000.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jasonppy/VoiceCraft/HEAD/demo/84_121550_000074_000000.wav -------------------------------------------------------------------------------- /demo/pam.wav: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jasonppy/VoiceCraft/HEAD/demo/pam.wav -------------------------------------------------------------------------------- /demo/temp/84_121550_000074_000000.txt: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jasonppy/VoiceCraft/HEAD/demo/temp/84_121550_000074_000000.txt -------------------------------------------------------------------------------- /demo/temp/mfa_alignments/5895_34622_000026_000002.csv: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jasonppy/VoiceCraft/HEAD/demo/temp/mfa_alignments/5895_34622_000026_000002.csv -------------------------------------------------------------------------------- /demo/temp/mfa_alignments/84_121550_000074_000000.csv: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jasonppy/VoiceCraft/HEAD/demo/temp/mfa_alignments/84_121550_000074_000000.csv -------------------------------------------------------------------------------- /edit_utils.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jasonppy/VoiceCraft/HEAD/edit_utils.py -------------------------------------------------------------------------------- /environment.yml: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jasonppy/VoiceCraft/HEAD/environment.yml -------------------------------------------------------------------------------- /gradio_app.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jasonppy/VoiceCraft/HEAD/gradio_app.ipynb -------------------------------------------------------------------------------- /gradio_app.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jasonppy/VoiceCraft/HEAD/gradio_app.py -------------------------------------------------------------------------------- /gradio_requirements.txt: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jasonppy/VoiceCraft/HEAD/gradio_requirements.txt -------------------------------------------------------------------------------- /inference_speech_editing.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jasonppy/VoiceCraft/HEAD/inference_speech_editing.ipynb -------------------------------------------------------------------------------- /inference_speech_editing_scale.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jasonppy/VoiceCraft/HEAD/inference_speech_editing_scale.py -------------------------------------------------------------------------------- /inference_tts.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jasonppy/VoiceCraft/HEAD/inference_tts.ipynb -------------------------------------------------------------------------------- /inference_tts_scale.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jasonppy/VoiceCraft/HEAD/inference_tts_scale.py -------------------------------------------------------------------------------- /main.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jasonppy/VoiceCraft/HEAD/main.py -------------------------------------------------------------------------------- /models/codebooks_patterns.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jasonppy/VoiceCraft/HEAD/models/codebooks_patterns.py -------------------------------------------------------------------------------- /models/modules/__init__.py: -------------------------------------------------------------------------------- 1 | -------------------------------------------------------------------------------- /models/modules/activation.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jasonppy/VoiceCraft/HEAD/models/modules/activation.py -------------------------------------------------------------------------------- /models/modules/embedding.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jasonppy/VoiceCraft/HEAD/models/modules/embedding.py -------------------------------------------------------------------------------- /models/modules/sampling.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jasonppy/VoiceCraft/HEAD/models/modules/sampling.py -------------------------------------------------------------------------------- /models/modules/scaling.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jasonppy/VoiceCraft/HEAD/models/modules/scaling.py -------------------------------------------------------------------------------- /models/modules/transformer.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jasonppy/VoiceCraft/HEAD/models/modules/transformer.py -------------------------------------------------------------------------------- /models/modules/utils.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jasonppy/VoiceCraft/HEAD/models/modules/utils.py -------------------------------------------------------------------------------- /models/voicecraft.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jasonppy/VoiceCraft/HEAD/models/voicecraft.py -------------------------------------------------------------------------------- /predict.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jasonppy/VoiceCraft/HEAD/predict.py -------------------------------------------------------------------------------- /pretrained_models/.gitkeep: -------------------------------------------------------------------------------- 1 | -------------------------------------------------------------------------------- /start-jupyter.bat: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jasonppy/VoiceCraft/HEAD/start-jupyter.bat -------------------------------------------------------------------------------- /start-jupyter.sh: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jasonppy/VoiceCraft/HEAD/start-jupyter.sh -------------------------------------------------------------------------------- /steps/__init__.py: -------------------------------------------------------------------------------- 1 | -------------------------------------------------------------------------------- /steps/optim.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jasonppy/VoiceCraft/HEAD/steps/optim.py -------------------------------------------------------------------------------- /steps/trainer.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jasonppy/VoiceCraft/HEAD/steps/trainer.py -------------------------------------------------------------------------------- /steps/trainer_utils.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jasonppy/VoiceCraft/HEAD/steps/trainer_utils.py -------------------------------------------------------------------------------- /tts_demo.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jasonppy/VoiceCraft/HEAD/tts_demo.py -------------------------------------------------------------------------------- /voicecraft-gradio-colab.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jasonppy/VoiceCraft/HEAD/voicecraft-gradio-colab.ipynb -------------------------------------------------------------------------------- /z_scripts/e830M.sh: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jasonppy/VoiceCraft/HEAD/z_scripts/e830M.sh -------------------------------------------------------------------------------- /z_scripts/e830M_ft.sh: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jasonppy/VoiceCraft/HEAD/z_scripts/e830M_ft.sh --------------------------------------------------------------------------------