├── .gitignore
├── LICENSE
├── README.md
├── moducorpus_sanitizer
│   ├── __init__.py
│   ├── about.py
│   ├── cli.py
│   ├── modu_news.py
│   └── utils.py
├── requirements.txt
└── setup.py

/.gitignore:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ko-nlp/moducorpus-sanitizer/HEAD/.gitignore
--------------------------------------------------------------------------------
/LICENSE:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ko-nlp/moducorpus-sanitizer/HEAD/LICENSE
--------------------------------------------------------------------------------
/README.md:
--------------------------------------------------------------------------------
# moducorpus-sanitizer: Sanitizing the Modu Corpus (모두의 말뭉치)
--------------------------------------------------------------------------------
/moducorpus_sanitizer/__init__.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ko-nlp/moducorpus-sanitizer/HEAD/moducorpus_sanitizer/__init__.py
--------------------------------------------------------------------------------
/moducorpus_sanitizer/about.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ko-nlp/moducorpus-sanitizer/HEAD/moducorpus_sanitizer/about.py
--------------------------------------------------------------------------------
/moducorpus_sanitizer/cli.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ko-nlp/moducorpus-sanitizer/HEAD/moducorpus_sanitizer/cli.py
--------------------------------------------------------------------------------
/moducorpus_sanitizer/modu_news.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ko-nlp/moducorpus-sanitizer/HEAD/moducorpus_sanitizer/modu_news.py
--------------------------------------------------------------------------------
/moducorpus_sanitizer/utils.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ko-nlp/moducorpus-sanitizer/HEAD/moducorpus_sanitizer/utils.py
--------------------------------------------------------------------------------
/requirements.txt:
--------------------------------------------------------------------------------
dataclasses>=0.6
tqdm>=4.46.0
--------------------------------------------------------------------------------
/setup.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ko-nlp/moducorpus-sanitizer/HEAD/setup.py
--------------------------------------------------------------------------------