├── .gitignore
├── LICENSE
├── README.md
├── WikiExtractor.py
├── aligning-docs-by-interlinks-demo.py
├── aligning-docs-by-interlinks-demo2.py
├── aligning-docs-by-lsi-demo.py
├── aligning-sentences-by-lsi-demo.py
├── arabic-morphological-analysis-demo.py
├── dict-demo.py
├── requirements.txt
├── split_processed_wikipedia_file.py
├── stopwords.txt
├── test-text-files
│   ├── dict-out.txt
│   ├── dict-test-ar-input.txt
│   ├── dict-test-en-input.txt
│   ├── test-in.ar.txt
│   └── test-out.ar.txt
├── textpro.py
└── wordnet
    ├── wn-data-arb.tab
    └── wn-data-eng.tab

/.gitignore:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/motazsaad/comparable-text-miner/HEAD/.gitignore
--------------------------------------------------------------------------------
/LICENSE:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/motazsaad/comparable-text-miner/HEAD/LICENSE
--------------------------------------------------------------------------------
/README.md:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/motazsaad/comparable-text-miner/HEAD/README.md
--------------------------------------------------------------------------------
/WikiExtractor.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/motazsaad/comparable-text-miner/HEAD/WikiExtractor.py
--------------------------------------------------------------------------------
/aligning-docs-by-interlinks-demo.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/motazsaad/comparable-text-miner/HEAD/aligning-docs-by-interlinks-demo.py
--------------------------------------------------------------------------------
/aligning-docs-by-interlinks-demo2.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/motazsaad/comparable-text-miner/HEAD/aligning-docs-by-interlinks-demo2.py
--------------------------------------------------------------------------------
/aligning-docs-by-lsi-demo.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/motazsaad/comparable-text-miner/HEAD/aligning-docs-by-lsi-demo.py
--------------------------------------------------------------------------------
/aligning-sentences-by-lsi-demo.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/motazsaad/comparable-text-miner/HEAD/aligning-sentences-by-lsi-demo.py
--------------------------------------------------------------------------------
/arabic-morphological-analysis-demo.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/motazsaad/comparable-text-miner/HEAD/arabic-morphological-analysis-demo.py
--------------------------------------------------------------------------------
/dict-demo.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/motazsaad/comparable-text-miner/HEAD/dict-demo.py
--------------------------------------------------------------------------------
/requirements.txt:
--------------------------------------------------------------------------------
nltk
sklearn
bs4
--------------------------------------------------------------------------------
/split_processed_wikipedia_file.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/motazsaad/comparable-text-miner/HEAD/split_processed_wikipedia_file.py
--------------------------------------------------------------------------------
/stopwords.txt:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/motazsaad/comparable-text-miner/HEAD/stopwords.txt
--------------------------------------------------------------------------------
/test-text-files/dict-out.txt:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/motazsaad/comparable-text-miner/HEAD/test-text-files/dict-out.txt
--------------------------------------------------------------------------------
/test-text-files/dict-test-ar-input.txt:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/motazsaad/comparable-text-miner/HEAD/test-text-files/dict-test-ar-input.txt
--------------------------------------------------------------------------------
/test-text-files/dict-test-en-input.txt:
--------------------------------------------------------------------------------
car library pharmacy door charge
--------------------------------------------------------------------------------
/test-text-files/test-in.ar.txt:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/motazsaad/comparable-text-miner/HEAD/test-text-files/test-in.ar.txt
--------------------------------------------------------------------------------
/test-text-files/test-out.ar.txt:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/motazsaad/comparable-text-miner/HEAD/test-text-files/test-out.ar.txt
--------------------------------------------------------------------------------
/textpro.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/motazsaad/comparable-text-miner/HEAD/textpro.py
--------------------------------------------------------------------------------
/wordnet/wn-data-arb.tab:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/motazsaad/comparable-text-miner/HEAD/wordnet/wn-data-arb.tab
--------------------------------------------------------------------------------
/wordnet/wn-data-eng.tab:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/motazsaad/comparable-text-miner/HEAD/wordnet/wn-data-eng.tab
--------------------------------------------------------------------------------