├── .gitignore
├── README.md
├── demo.py
├── jiagu
    ├── __init__.py
    ├── __main__.py
    ├── analyze.py
    ├── cluster
    │   ├── __init__.py
    │   ├── base.py
    │   ├── dbscan.py
    │   ├── kmeans.py
    │   └── text.py
    ├── data
    │   └── stopwords.txt
    ├── findword.py
    ├── mmseg.py
    ├── model
    │   ├── kg.model
    │   ├── ner.model
    │   └── pos.model
    ├── normal
    │   └── README.md
    ├── perceptron.py
    ├── segment
    │   ├── README.md
    │   ├── dict
    │   │   ├── jiagu.dict
    │   │   └── user.dict
    │   ├── model
    │   │   └── cws.model
    │   └── nroute.py
    ├── sentiment
    │   ├── README.md
    │   ├── bayes.py
    │   └── model
    │   │   └── sentiment.model
    ├── textrank.py
    ├── topic
    │   └── lda.py
    └── utils.py
├── license
├── setup.py
├── test
    ├── test_cluster.py
    ├── test_findword.py
    ├── test_pos.py
    └── test_textrank.py
└── train
    ├── README.md
    ├── data
        ├── test.txt
        └── train.txt
    ├── model
        └── ap.model
    └── perceptron.py

/.gitignore:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ownthink/Jiagu/HEAD/.gitignore
--------------------------------------------------------------------------------
/README.md:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ownthink/Jiagu/HEAD/README.md
--------------------------------------------------------------------------------
/demo.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ownthink/Jiagu/HEAD/demo.py
--------------------------------------------------------------------------------
/jiagu/__init__.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ownthink/Jiagu/HEAD/jiagu/__init__.py
--------------------------------------------------------------------------------
/jiagu/__main__.py:
--------------------------------------------------------------------------------
#!/usr/bin/env python3
# -*-coding:utf-8-*-
--------------------------------------------------------------------------------
/jiagu/analyze.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ownthink/Jiagu/HEAD/jiagu/analyze.py
--------------------------------------------------------------------------------
/jiagu/cluster/__init__.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ownthink/Jiagu/HEAD/jiagu/cluster/__init__.py
--------------------------------------------------------------------------------
/jiagu/cluster/base.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ownthink/Jiagu/HEAD/jiagu/cluster/base.py
--------------------------------------------------------------------------------
/jiagu/cluster/dbscan.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ownthink/Jiagu/HEAD/jiagu/cluster/dbscan.py
--------------------------------------------------------------------------------
/jiagu/cluster/kmeans.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ownthink/Jiagu/HEAD/jiagu/cluster/kmeans.py
--------------------------------------------------------------------------------
/jiagu/cluster/text.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ownthink/Jiagu/HEAD/jiagu/cluster/text.py
--------------------------------------------------------------------------------
/jiagu/data/stopwords.txt:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ownthink/Jiagu/HEAD/jiagu/data/stopwords.txt
--------------------------------------------------------------------------------
/jiagu/findword.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ownthink/Jiagu/HEAD/jiagu/findword.py
--------------------------------------------------------------------------------
/jiagu/mmseg.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ownthink/Jiagu/HEAD/jiagu/mmseg.py
--------------------------------------------------------------------------------
/jiagu/model/kg.model:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ownthink/Jiagu/HEAD/jiagu/model/kg.model
--------------------------------------------------------------------------------
/jiagu/model/ner.model:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ownthink/Jiagu/HEAD/jiagu/model/ner.model
--------------------------------------------------------------------------------
/jiagu/model/pos.model:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ownthink/Jiagu/HEAD/jiagu/model/pos.model
--------------------------------------------------------------------------------
/jiagu/normal/README.md:
--------------------------------------------------------------------------------

Text normalization

Includes Chinese-to-pinyin conversion,

full-width/half-width conversion, etc.

--------------------------------------------------------------------------------
/jiagu/perceptron.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ownthink/Jiagu/HEAD/jiagu/perceptron.py
--------------------------------------------------------------------------------
/jiagu/segment/README.md:
--------------------------------------------------------------------------------

Chinese word segmentation

--------------------------------------------------------------------------------
/jiagu/segment/dict/jiagu.dict:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ownthink/Jiagu/HEAD/jiagu/segment/dict/jiagu.dict
--------------------------------------------------------------------------------
/jiagu/segment/dict/user.dict:
--------------------------------------------------------------------------------

--------------------------------------------------------------------------------
/jiagu/segment/model/cws.model:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ownthink/Jiagu/HEAD/jiagu/segment/model/cws.model
--------------------------------------------------------------------------------
/jiagu/segment/nroute.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ownthink/Jiagu/HEAD/jiagu/segment/nroute.py
--------------------------------------------------------------------------------
/jiagu/sentiment/README.md:
--------------------------------------------------------------------------------

Sentiment analysis

--------------------------------------------------------------------------------
/jiagu/sentiment/bayes.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ownthink/Jiagu/HEAD/jiagu/sentiment/bayes.py
--------------------------------------------------------------------------------
/jiagu/sentiment/model/sentiment.model:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ownthink/Jiagu/HEAD/jiagu/sentiment/model/sentiment.model
--------------------------------------------------------------------------------
/jiagu/textrank.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ownthink/Jiagu/HEAD/jiagu/textrank.py
--------------------------------------------------------------------------------
/jiagu/topic/lda.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ownthink/Jiagu/HEAD/jiagu/topic/lda.py
--------------------------------------------------------------------------------
/jiagu/utils.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ownthink/Jiagu/HEAD/jiagu/utils.py
--------------------------------------------------------------------------------
/license:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ownthink/Jiagu/HEAD/license
--------------------------------------------------------------------------------
/setup.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ownthink/Jiagu/HEAD/setup.py
--------------------------------------------------------------------------------
/test/test_cluster.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ownthink/Jiagu/HEAD/test/test_cluster.py
--------------------------------------------------------------------------------
/test/test_findword.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ownthink/Jiagu/HEAD/test/test_findword.py
--------------------------------------------------------------------------------
/test/test_pos.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ownthink/Jiagu/HEAD/test/test_pos.py
--------------------------------------------------------------------------------
/test/test_textrank.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ownthink/Jiagu/HEAD/test/test_textrank.py
--------------------------------------------------------------------------------
/train/README.md:
--------------------------------------------------------------------------------
# Training the Jiagu natural language processing toolkit

1. Put your data in the `data` directory; see `data/train.txt` for the format.
2. Run `python3 perceptron.py` to train.

--------------------------------------------------------------------------------
/train/data/test.txt:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ownthink/Jiagu/HEAD/train/data/test.txt
--------------------------------------------------------------------------------
/train/data/train.txt:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ownthink/Jiagu/HEAD/train/data/train.txt
--------------------------------------------------------------------------------
/train/model/ap.model:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ownthink/Jiagu/HEAD/train/model/ap.model
--------------------------------------------------------------------------------
/train/perceptron.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ownthink/Jiagu/HEAD/train/perceptron.py
--------------------------------------------------------------------------------