├── .cursorignore ├── .gitignore ├── .vscode └── launch.json ├── FIND_BUG.md ├── LICENSE ├── README.md ├── config ├── input_tokenizer.json ├── multilingual_tokenizer.json ├── output_tokenizer.json └── symbols.json ├── pyproject.toml ├── screenshots ├── Screenshot from 2025-06-02 22-54-48.png └── Screenshot from 2025-06-06 09-14-06.png └── src ├── __init__.py ├── arch ├── __init__.py ├── model copy.py ├── model.py └── positional_encoding.py ├── helper ├── __init__.py ├── dataset.py ├── datasetfrom_lmdb.py ├── sentence_destructor.py ├── text_cleaner.py └── translation_dataset.py ├── main.py ├── sentence_length_statistics.ipynb ├── tools ├── __init__.py ├── generate_individual_words.ipynb ├── generate_input_tokenizer.ipynb ├── generate_output_tokenizer.ipynb └── prepare_dataset.ipynb └── train.py /.cursorignore: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/dream-1ab/spell_corrector/HEAD/.cursorignore -------------------------------------------------------------------------------- /.gitignore: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/dream-1ab/spell_corrector/HEAD/.gitignore -------------------------------------------------------------------------------- /.vscode/launch.json: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/dream-1ab/spell_corrector/HEAD/.vscode/launch.json -------------------------------------------------------------------------------- /FIND_BUG.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/dream-1ab/spell_corrector/HEAD/FIND_BUG.md -------------------------------------------------------------------------------- /LICENSE: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/dream-1ab/spell_corrector/HEAD/LICENSE -------------------------------------------------------------------------------- /README.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/dream-1ab/spell_corrector/HEAD/README.md -------------------------------------------------------------------------------- /config/input_tokenizer.json: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/dream-1ab/spell_corrector/HEAD/config/input_tokenizer.json -------------------------------------------------------------------------------- /config/multilingual_tokenizer.json: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/dream-1ab/spell_corrector/HEAD/config/multilingual_tokenizer.json -------------------------------------------------------------------------------- /config/output_tokenizer.json: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/dream-1ab/spell_corrector/HEAD/config/output_tokenizer.json -------------------------------------------------------------------------------- /config/symbols.json: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/dream-1ab/spell_corrector/HEAD/config/symbols.json -------------------------------------------------------------------------------- /pyproject.toml: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/dream-1ab/spell_corrector/HEAD/pyproject.toml -------------------------------------------------------------------------------- /screenshots/Screenshot from 2025-06-02 22-54-48.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/dream-1ab/spell_corrector/HEAD/screenshots/Screenshot from 2025-06-02 22-54-48.png -------------------------------------------------------------------------------- /screenshots/Screenshot from 2025-06-06 09-14-06.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/dream-1ab/spell_corrector/HEAD/screenshots/Screenshot from 2025-06-06 09-14-06.png -------------------------------------------------------------------------------- /src/__init__.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/dream-1ab/spell_corrector/HEAD/src/__init__.py -------------------------------------------------------------------------------- /src/arch/__init__.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/dream-1ab/spell_corrector/HEAD/src/arch/__init__.py -------------------------------------------------------------------------------- /src/arch/model copy.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/dream-1ab/spell_corrector/HEAD/src/arch/model copy.py -------------------------------------------------------------------------------- /src/arch/model.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/dream-1ab/spell_corrector/HEAD/src/arch/model.py -------------------------------------------------------------------------------- /src/arch/positional_encoding.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/dream-1ab/spell_corrector/HEAD/src/arch/positional_encoding.py -------------------------------------------------------------------------------- /src/helper/__init__.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/dream-1ab/spell_corrector/HEAD/src/helper/__init__.py -------------------------------------------------------------------------------- /src/helper/dataset.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/dream-1ab/spell_corrector/HEAD/src/helper/dataset.py -------------------------------------------------------------------------------- /src/helper/datasetfrom_lmdb.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/dream-1ab/spell_corrector/HEAD/src/helper/datasetfrom_lmdb.py -------------------------------------------------------------------------------- /src/helper/sentence_destructor.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/dream-1ab/spell_corrector/HEAD/src/helper/sentence_destructor.py -------------------------------------------------------------------------------- /src/helper/text_cleaner.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/dream-1ab/spell_corrector/HEAD/src/helper/text_cleaner.py -------------------------------------------------------------------------------- /src/helper/translation_dataset.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/dream-1ab/spell_corrector/HEAD/src/helper/translation_dataset.py -------------------------------------------------------------------------------- /src/main.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/dream-1ab/spell_corrector/HEAD/src/main.py -------------------------------------------------------------------------------- /src/sentence_length_statistics.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/dream-1ab/spell_corrector/HEAD/src/sentence_length_statistics.ipynb -------------------------------------------------------------------------------- /src/tools/__init__.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/dream-1ab/spell_corrector/HEAD/src/tools/__init__.py -------------------------------------------------------------------------------- /src/tools/generate_individual_words.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/dream-1ab/spell_corrector/HEAD/src/tools/generate_individual_words.ipynb -------------------------------------------------------------------------------- /src/tools/generate_input_tokenizer.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/dream-1ab/spell_corrector/HEAD/src/tools/generate_input_tokenizer.ipynb -------------------------------------------------------------------------------- /src/tools/generate_output_tokenizer.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/dream-1ab/spell_corrector/HEAD/src/tools/generate_output_tokenizer.ipynb -------------------------------------------------------------------------------- /src/tools/prepare_dataset.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/dream-1ab/spell_corrector/HEAD/src/tools/prepare_dataset.ipynb -------------------------------------------------------------------------------- /src/train.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/dream-1ab/spell_corrector/HEAD/src/train.py --------------------------------------------------------------------------------