├── .gitmodules
├── README.md
├── Vocoder
    └── README.md
├── asr.py
├── generate_waveform_from_code.py
├── img
    └── Img.png
├── pretrained
    └── README.md
├── score.py
├── src
    ├── data
    │   ├── dataset_IM.py
    │   ├── dataset_IMunit.py
    │   ├── token_transform.py
    │   └── utils.py
    ├── lr_scheduler.py
    └── models
    │   ├── GIT_IMunit.py
    │   ├── IM_Speech.py
    │   ├── IMunit_Speech.py
    │   ├── SeiT.py
    │   └── __init__.py
├── test_Im_Sp.py
├── test_Im_Sp.sh
├── test_Im_Sp_unit.py
├── test_Im_Sp_unit.sh
└── train_Im_Sp.py


/.gitmodules:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ms-dot-k/Image-to-Speech/HEAD/.gitmodules


--------------------------------------------------------------------------------
/README.md:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ms-dot-k/Image-to-Speech/HEAD/README.md


--------------------------------------------------------------------------------
/Vocoder/README.md:
--------------------------------------------------------------------------------
1 | Put vocoder checkpoint and config files here.
2 | 


--------------------------------------------------------------------------------
/asr.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ms-dot-k/Image-to-Speech/HEAD/asr.py


--------------------------------------------------------------------------------
/generate_waveform_from_code.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ms-dot-k/Image-to-Speech/HEAD/generate_waveform_from_code.py


--------------------------------------------------------------------------------
/img/Img.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ms-dot-k/Image-to-Speech/HEAD/img/Img.png


--------------------------------------------------------------------------------
/pretrained/README.md:
--------------------------------------------------------------------------------
1 | Put pretrained img tokenizer and codebook here.
2 | 


--------------------------------------------------------------------------------
/score.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ms-dot-k/Image-to-Speech/HEAD/score.py


--------------------------------------------------------------------------------
/src/data/dataset_IM.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ms-dot-k/Image-to-Speech/HEAD/src/data/dataset_IM.py


--------------------------------------------------------------------------------
/src/data/dataset_IMunit.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ms-dot-k/Image-to-Speech/HEAD/src/data/dataset_IMunit.py


--------------------------------------------------------------------------------
/src/data/token_transform.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ms-dot-k/Image-to-Speech/HEAD/src/data/token_transform.py


--------------------------------------------------------------------------------
/src/data/utils.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ms-dot-k/Image-to-Speech/HEAD/src/data/utils.py


--------------------------------------------------------------------------------
/src/lr_scheduler.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ms-dot-k/Image-to-Speech/HEAD/src/lr_scheduler.py


--------------------------------------------------------------------------------
/src/models/GIT_IMunit.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ms-dot-k/Image-to-Speech/HEAD/src/models/GIT_IMunit.py


--------------------------------------------------------------------------------
/src/models/IM_Speech.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ms-dot-k/Image-to-Speech/HEAD/src/models/IM_Speech.py


--------------------------------------------------------------------------------
/src/models/IMunit_Speech.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ms-dot-k/Image-to-Speech/HEAD/src/models/IMunit_Speech.py


--------------------------------------------------------------------------------
/src/models/SeiT.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ms-dot-k/Image-to-Speech/HEAD/src/models/SeiT.py


--------------------------------------------------------------------------------
/src/models/__init__.py:
--------------------------------------------------------------------------------
1 | 


--------------------------------------------------------------------------------
/test_Im_Sp.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ms-dot-k/Image-to-Speech/HEAD/test_Im_Sp.py


--------------------------------------------------------------------------------
/test_Im_Sp.sh:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ms-dot-k/Image-to-Speech/HEAD/test_Im_Sp.sh


--------------------------------------------------------------------------------
/test_Im_Sp_unit.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ms-dot-k/Image-to-Speech/HEAD/test_Im_Sp_unit.py


--------------------------------------------------------------------------------
/test_Im_Sp_unit.sh:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ms-dot-k/Image-to-Speech/HEAD/test_Im_Sp_unit.sh


--------------------------------------------------------------------------------
/train_Im_Sp.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ms-dot-k/Image-to-Speech/HEAD/train_Im_Sp.py


--------------------------------------------------------------------------------