├── .gitignore
├── README.md
├── corrector.py
├── data
    ├── common_char_set.txt
    ├── custom_confusion.txt
    ├── dict.txt
    ├── people_chars_lm.klm
    ├── pinyin2word.model
    ├── same_pinyin.txt
    └── same_stroke.txt
├── lm
    ├── DLM.py
    ├── NLM.py
    └── __init__.py
├── tokenizer
    └── __init__.py
└── utils
    ├── __init__.py
    ├── logger.py
    └── text_utils.py


/.gitignore:
--------------------------------------------------------------------------------
.idea
__pycache__

--------------------------------------------------------------------------------
/README.md:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/hiyoung123/YoungCorrector/HEAD/README.md
--------------------------------------------------------------------------------
/corrector.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/hiyoung123/YoungCorrector/HEAD/corrector.py
--------------------------------------------------------------------------------
/data/common_char_set.txt:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/hiyoung123/YoungCorrector/HEAD/data/common_char_set.txt
--------------------------------------------------------------------------------
/data/custom_confusion.txt:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/hiyoung123/YoungCorrector/HEAD/data/custom_confusion.txt
--------------------------------------------------------------------------------
/data/dict.txt:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/hiyoung123/YoungCorrector/HEAD/data/dict.txt
--------------------------------------------------------------------------------
/data/people_chars_lm.klm:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/hiyoung123/YoungCorrector/HEAD/data/people_chars_lm.klm
--------------------------------------------------------------------------------
/data/pinyin2word.model:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/hiyoung123/YoungCorrector/HEAD/data/pinyin2word.model
--------------------------------------------------------------------------------
/data/same_pinyin.txt:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/hiyoung123/YoungCorrector/HEAD/data/same_pinyin.txt
--------------------------------------------------------------------------------
/data/same_stroke.txt:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/hiyoung123/YoungCorrector/HEAD/data/same_stroke.txt
--------------------------------------------------------------------------------
/lm/DLM.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/hiyoung123/YoungCorrector/HEAD/lm/DLM.py
--------------------------------------------------------------------------------
/lm/NLM.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/hiyoung123/YoungCorrector/HEAD/lm/NLM.py
--------------------------------------------------------------------------------
/lm/__init__.py:
--------------------------------------------------------------------------------
#!/usr/bin/env python
#-*- coding:utf-8 -*-
--------------------------------------------------------------------------------
/tokenizer/__init__.py:
--------------------------------------------------------------------------------
#!/usr/bin/env python
#-*- coding:utf-8 -*-
--------------------------------------------------------------------------------
/utils/__init__.py:
--------------------------------------------------------------------------------
#!/usr/bin/env python
#-*- coding:utf-8 -*-
--------------------------------------------------------------------------------
/utils/logger.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/hiyoung123/YoungCorrector/HEAD/utils/logger.py
--------------------------------------------------------------------------------
/utils/text_utils.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/hiyoung123/YoungCorrector/HEAD/utils/text_utils.py
--------------------------------------------------------------------------------