├── LICENSE ├── README.md ├── _config.yml ├── dataset ├── annotated_dataset.csv └── topics.csv ├── get_indices.py ├── model_run ├── cand_generation.py ├── data │ ├── readme.txt │ └── readme.txt~ ├── get_labels.py ├── pre_trained_models │ └── readme.txt ├── supervised_labels.py ├── support_files │ ├── doc2vec_indices │ ├── readme.txt │ ├── svm_model │ └── word2vec_indices ├── toy_data │ ├── cand_label_output │ └── toytopics.csv └── unsupervised_labels.py ├── pruned_documents.py ├── train_svm_model.py ├── training ├── additional_files │ ├── readme.txt │ └── readme.txt~ ├── create_ngrams.py ├── doc2vectrain.py ├── dump │ └── readme.txt ├── extract.py ├── main_train.py ├── processed_documents │ └── readme.txt ├── support_packages │ └── readme.txt ├── tokenisation.py ├── trained_models │ └── readme.txt └── word2vectrain.py └── word2vec_phrases.py /LICENSE: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/sb1992/NETL-Automatic-Topic-Labelling-/HEAD/LICENSE -------------------------------------------------------------------------------- /README.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/sb1992/NETL-Automatic-Topic-Labelling-/HEAD/README.md -------------------------------------------------------------------------------- /_config.yml: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/sb1992/NETL-Automatic-Topic-Labelling-/HEAD/_config.yml -------------------------------------------------------------------------------- /dataset/annotated_dataset.csv: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/sb1992/NETL-Automatic-Topic-Labelling-/HEAD/dataset/annotated_dataset.csv -------------------------------------------------------------------------------- /dataset/topics.csv: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/sb1992/NETL-Automatic-Topic-Labelling-/HEAD/dataset/topics.csv -------------------------------------------------------------------------------- /get_indices.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/sb1992/NETL-Automatic-Topic-Labelling-/HEAD/get_indices.py -------------------------------------------------------------------------------- /model_run/cand_generation.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/sb1992/NETL-Automatic-Topic-Labelling-/HEAD/model_run/cand_generation.py -------------------------------------------------------------------------------- /model_run/data/readme.txt: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/sb1992/NETL-Automatic-Topic-Labelling-/HEAD/model_run/data/readme.txt -------------------------------------------------------------------------------- /model_run/data/readme.txt~: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/sb1992/NETL-Automatic-Topic-Labelling-/HEAD/model_run/data/readme.txt~ -------------------------------------------------------------------------------- /model_run/get_labels.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/sb1992/NETL-Automatic-Topic-Labelling-/HEAD/model_run/get_labels.py -------------------------------------------------------------------------------- /model_run/pre_trained_models/readme.txt: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/sb1992/NETL-Automatic-Topic-Labelling-/HEAD/model_run/pre_trained_models/readme.txt -------------------------------------------------------------------------------- /model_run/supervised_labels.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/sb1992/NETL-Automatic-Topic-Labelling-/HEAD/model_run/supervised_labels.py -------------------------------------------------------------------------------- /model_run/support_files/doc2vec_indices: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/sb1992/NETL-Automatic-Topic-Labelling-/HEAD/model_run/support_files/doc2vec_indices -------------------------------------------------------------------------------- /model_run/support_files/readme.txt: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/sb1992/NETL-Automatic-Topic-Labelling-/HEAD/model_run/support_files/readme.txt -------------------------------------------------------------------------------- /model_run/support_files/svm_model: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/sb1992/NETL-Automatic-Topic-Labelling-/HEAD/model_run/support_files/svm_model -------------------------------------------------------------------------------- /model_run/support_files/word2vec_indices: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/sb1992/NETL-Automatic-Topic-Labelling-/HEAD/model_run/support_files/word2vec_indices -------------------------------------------------------------------------------- /model_run/toy_data/cand_label_output: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/sb1992/NETL-Automatic-Topic-Labelling-/HEAD/model_run/toy_data/cand_label_output -------------------------------------------------------------------------------- /model_run/toy_data/toytopics.csv: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/sb1992/NETL-Automatic-Topic-Labelling-/HEAD/model_run/toy_data/toytopics.csv -------------------------------------------------------------------------------- /model_run/unsupervised_labels.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/sb1992/NETL-Automatic-Topic-Labelling-/HEAD/model_run/unsupervised_labels.py -------------------------------------------------------------------------------- /pruned_documents.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/sb1992/NETL-Automatic-Topic-Labelling-/HEAD/pruned_documents.py -------------------------------------------------------------------------------- /train_svm_model.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/sb1992/NETL-Automatic-Topic-Labelling-/HEAD/train_svm_model.py -------------------------------------------------------------------------------- /training/additional_files/readme.txt: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/sb1992/NETL-Automatic-Topic-Labelling-/HEAD/training/additional_files/readme.txt -------------------------------------------------------------------------------- /training/additional_files/readme.txt~: -------------------------------------------------------------------------------- 1 | -------------------------------------------------------------------------------- /training/create_ngrams.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/sb1992/NETL-Automatic-Topic-Labelling-/HEAD/training/create_ngrams.py -------------------------------------------------------------------------------- /training/doc2vectrain.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/sb1992/NETL-Automatic-Topic-Labelling-/HEAD/training/doc2vectrain.py -------------------------------------------------------------------------------- /training/dump/readme.txt: -------------------------------------------------------------------------------- 1 | Place your input Wikipedia XML dump over here 2 | -------------------------------------------------------------------------------- /training/extract.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/sb1992/NETL-Automatic-Topic-Labelling-/HEAD/training/extract.py -------------------------------------------------------------------------------- /training/main_train.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/sb1992/NETL-Automatic-Topic-Labelling-/HEAD/training/main_train.py -------------------------------------------------------------------------------- /training/processed_documents/readme.txt: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/sb1992/NETL-Automatic-Topic-Labelling-/HEAD/training/processed_documents/readme.txt -------------------------------------------------------------------------------- /training/support_packages/readme.txt: -------------------------------------------------------------------------------- 1 | Place WikiExtractor.py, stanford-parser in this directory. 2 | -------------------------------------------------------------------------------- /training/tokenisation.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/sb1992/NETL-Automatic-Topic-Labelling-/HEAD/training/tokenisation.py -------------------------------------------------------------------------------- /training/trained_models/readme.txt: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/sb1992/NETL-Automatic-Topic-Labelling-/HEAD/training/trained_models/readme.txt -------------------------------------------------------------------------------- /training/word2vectrain.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/sb1992/NETL-Automatic-Topic-Labelling-/HEAD/training/word2vectrain.py -------------------------------------------------------------------------------- /word2vec_phrases.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/sb1992/NETL-Automatic-Topic-Labelling-/HEAD/word2vec_phrases.py --------------------------------------------------------------------------------