├── NLP ├── README.md ├── SH_split.py ├── SH_split.pyc ├── check_doc_cluster.py ├── check_tfidf_word2vec_cluster.py ├── compute_DF.py ├── compute_class_distr_GMM.py ├── compute_text_class_distri.py ├── compute_word2vec_distr_kmeans.py ├── count.py ├── count_word_num_from_tfidf.py ├── create_DFdict_from_IDF.py ├── create_LSTM_sequence.py ├── create_selected_train_txt.py ├── create_train_txt.py ├── create_train_txt.pyc ├── create_vocab_from_keywordList.py ├── create_vocab_pkl.py ├── data │ ├── vocabulary_10w.txt │ └── vocabulary_nv_4w.txt ├── demo.py ├── extract_keyword_from_tfidf.py ├── extract_text_concept.py ├── extract_word_from_topic.py ├── gen_dummy_match.py ├── gen_vocab_from_seg.py ├── id2word.py ├── kmeans_word2vec.py ├── log │ ├── doc_cluster_word2vec_maxOneHot │ ├── doc_cluster_word2vec_topOneHot │ ├── kmeans_word2vec.log │ ├── test_ourword2vec.log │ └── word2vec_distr.log ├── merge_vocab.py ├── random_choose_cluster.py ├── run_SH_split.py ├── run_create_train_txt.py ├── setup.py ├── shuffle_list.py ├── test.py ├── test_read_h5_speed.py ├── test_word2vec.py ├── tfidf_from_seg.py ├── tfidf_weighted_word2vec.py ├── word2id.py └── word_type.py ├── README.MD ├── Tensorflow ├── BidirectionNet_4wtfidf.py ├── BidirectionNet_lstm.py ├── BidirectionNet_tfidf.py ├── Kmeans.py ├── inspect_data.py ├── merge_3crop_data.py └── test_match_pairList.py └── caffe ├── README.MD └── extract_feature.py /NLP/README.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/WeitaoVan/Sohu-LuckData-Image-Text-Matching-Competition/HEAD/NLP/README.md -------------------------------------------------------------------------------- /NLP/SH_split.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/WeitaoVan/Sohu-LuckData-Image-Text-Matching-Competition/HEAD/NLP/SH_split.py -------------------------------------------------------------------------------- /NLP/SH_split.pyc: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/WeitaoVan/Sohu-LuckData-Image-Text-Matching-Competition/HEAD/NLP/SH_split.pyc -------------------------------------------------------------------------------- /NLP/check_doc_cluster.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/WeitaoVan/Sohu-LuckData-Image-Text-Matching-Competition/HEAD/NLP/check_doc_cluster.py -------------------------------------------------------------------------------- /NLP/check_tfidf_word2vec_cluster.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/WeitaoVan/Sohu-LuckData-Image-Text-Matching-Competition/HEAD/NLP/check_tfidf_word2vec_cluster.py -------------------------------------------------------------------------------- /NLP/compute_DF.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/WeitaoVan/Sohu-LuckData-Image-Text-Matching-Competition/HEAD/NLP/compute_DF.py -------------------------------------------------------------------------------- /NLP/compute_class_distr_GMM.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/WeitaoVan/Sohu-LuckData-Image-Text-Matching-Competition/HEAD/NLP/compute_class_distr_GMM.py -------------------------------------------------------------------------------- /NLP/compute_text_class_distri.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/WeitaoVan/Sohu-LuckData-Image-Text-Matching-Competition/HEAD/NLP/compute_text_class_distri.py -------------------------------------------------------------------------------- /NLP/compute_word2vec_distr_kmeans.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/WeitaoVan/Sohu-LuckData-Image-Text-Matching-Competition/HEAD/NLP/compute_word2vec_distr_kmeans.py -------------------------------------------------------------------------------- /NLP/count.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/WeitaoVan/Sohu-LuckData-Image-Text-Matching-Competition/HEAD/NLP/count.py -------------------------------------------------------------------------------- /NLP/count_word_num_from_tfidf.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/WeitaoVan/Sohu-LuckData-Image-Text-Matching-Competition/HEAD/NLP/count_word_num_from_tfidf.py -------------------------------------------------------------------------------- /NLP/create_DFdict_from_IDF.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/WeitaoVan/Sohu-LuckData-Image-Text-Matching-Competition/HEAD/NLP/create_DFdict_from_IDF.py -------------------------------------------------------------------------------- /NLP/create_LSTM_sequence.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/WeitaoVan/Sohu-LuckData-Image-Text-Matching-Competition/HEAD/NLP/create_LSTM_sequence.py -------------------------------------------------------------------------------- /NLP/create_selected_train_txt.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/WeitaoVan/Sohu-LuckData-Image-Text-Matching-Competition/HEAD/NLP/create_selected_train_txt.py -------------------------------------------------------------------------------- /NLP/create_train_txt.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/WeitaoVan/Sohu-LuckData-Image-Text-Matching-Competition/HEAD/NLP/create_train_txt.py -------------------------------------------------------------------------------- /NLP/create_train_txt.pyc: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/WeitaoVan/Sohu-LuckData-Image-Text-Matching-Competition/HEAD/NLP/create_train_txt.pyc -------------------------------------------------------------------------------- /NLP/create_vocab_from_keywordList.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/WeitaoVan/Sohu-LuckData-Image-Text-Matching-Competition/HEAD/NLP/create_vocab_from_keywordList.py -------------------------------------------------------------------------------- /NLP/create_vocab_pkl.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/WeitaoVan/Sohu-LuckData-Image-Text-Matching-Competition/HEAD/NLP/create_vocab_pkl.py -------------------------------------------------------------------------------- /NLP/data/vocabulary_10w.txt: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/WeitaoVan/Sohu-LuckData-Image-Text-Matching-Competition/HEAD/NLP/data/vocabulary_10w.txt -------------------------------------------------------------------------------- /NLP/data/vocabulary_nv_4w.txt: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/WeitaoVan/Sohu-LuckData-Image-Text-Matching-Competition/HEAD/NLP/data/vocabulary_nv_4w.txt -------------------------------------------------------------------------------- /NLP/demo.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/WeitaoVan/Sohu-LuckData-Image-Text-Matching-Competition/HEAD/NLP/demo.py -------------------------------------------------------------------------------- /NLP/extract_keyword_from_tfidf.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/WeitaoVan/Sohu-LuckData-Image-Text-Matching-Competition/HEAD/NLP/extract_keyword_from_tfidf.py -------------------------------------------------------------------------------- /NLP/extract_text_concept.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/WeitaoVan/Sohu-LuckData-Image-Text-Matching-Competition/HEAD/NLP/extract_text_concept.py -------------------------------------------------------------------------------- /NLP/extract_word_from_topic.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/WeitaoVan/Sohu-LuckData-Image-Text-Matching-Competition/HEAD/NLP/extract_word_from_topic.py -------------------------------------------------------------------------------- /NLP/gen_dummy_match.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/WeitaoVan/Sohu-LuckData-Image-Text-Matching-Competition/HEAD/NLP/gen_dummy_match.py -------------------------------------------------------------------------------- /NLP/gen_vocab_from_seg.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/WeitaoVan/Sohu-LuckData-Image-Text-Matching-Competition/HEAD/NLP/gen_vocab_from_seg.py -------------------------------------------------------------------------------- /NLP/id2word.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/WeitaoVan/Sohu-LuckData-Image-Text-Matching-Competition/HEAD/NLP/id2word.py -------------------------------------------------------------------------------- /NLP/kmeans_word2vec.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/WeitaoVan/Sohu-LuckData-Image-Text-Matching-Competition/HEAD/NLP/kmeans_word2vec.py -------------------------------------------------------------------------------- /NLP/log/doc_cluster_word2vec_maxOneHot: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/WeitaoVan/Sohu-LuckData-Image-Text-Matching-Competition/HEAD/NLP/log/doc_cluster_word2vec_maxOneHot -------------------------------------------------------------------------------- /NLP/log/doc_cluster_word2vec_topOneHot: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/WeitaoVan/Sohu-LuckData-Image-Text-Matching-Competition/HEAD/NLP/log/doc_cluster_word2vec_topOneHot -------------------------------------------------------------------------------- /NLP/log/kmeans_word2vec.log: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/WeitaoVan/Sohu-LuckData-Image-Text-Matching-Competition/HEAD/NLP/log/kmeans_word2vec.log -------------------------------------------------------------------------------- /NLP/log/test_ourword2vec.log: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/WeitaoVan/Sohu-LuckData-Image-Text-Matching-Competition/HEAD/NLP/log/test_ourword2vec.log -------------------------------------------------------------------------------- /NLP/log/word2vec_distr.log: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/WeitaoVan/Sohu-LuckData-Image-Text-Matching-Competition/HEAD/NLP/log/word2vec_distr.log -------------------------------------------------------------------------------- /NLP/merge_vocab.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/WeitaoVan/Sohu-LuckData-Image-Text-Matching-Competition/HEAD/NLP/merge_vocab.py -------------------------------------------------------------------------------- /NLP/random_choose_cluster.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/WeitaoVan/Sohu-LuckData-Image-Text-Matching-Competition/HEAD/NLP/random_choose_cluster.py -------------------------------------------------------------------------------- /NLP/run_SH_split.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/WeitaoVan/Sohu-LuckData-Image-Text-Matching-Competition/HEAD/NLP/run_SH_split.py -------------------------------------------------------------------------------- /NLP/run_create_train_txt.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/WeitaoVan/Sohu-LuckData-Image-Text-Matching-Competition/HEAD/NLP/run_create_train_txt.py -------------------------------------------------------------------------------- /NLP/setup.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/WeitaoVan/Sohu-LuckData-Image-Text-Matching-Competition/HEAD/NLP/setup.py -------------------------------------------------------------------------------- /NLP/shuffle_list.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/WeitaoVan/Sohu-LuckData-Image-Text-Matching-Competition/HEAD/NLP/shuffle_list.py -------------------------------------------------------------------------------- /NLP/test.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/WeitaoVan/Sohu-LuckData-Image-Text-Matching-Competition/HEAD/NLP/test.py -------------------------------------------------------------------------------- /NLP/test_read_h5_speed.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/WeitaoVan/Sohu-LuckData-Image-Text-Matching-Competition/HEAD/NLP/test_read_h5_speed.py -------------------------------------------------------------------------------- /NLP/test_word2vec.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/WeitaoVan/Sohu-LuckData-Image-Text-Matching-Competition/HEAD/NLP/test_word2vec.py -------------------------------------------------------------------------------- /NLP/tfidf_from_seg.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/WeitaoVan/Sohu-LuckData-Image-Text-Matching-Competition/HEAD/NLP/tfidf_from_seg.py -------------------------------------------------------------------------------- /NLP/tfidf_weighted_word2vec.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/WeitaoVan/Sohu-LuckData-Image-Text-Matching-Competition/HEAD/NLP/tfidf_weighted_word2vec.py -------------------------------------------------------------------------------- /NLP/word2id.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/WeitaoVan/Sohu-LuckData-Image-Text-Matching-Competition/HEAD/NLP/word2id.py -------------------------------------------------------------------------------- /NLP/word_type.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/WeitaoVan/Sohu-LuckData-Image-Text-Matching-Competition/HEAD/NLP/word_type.py -------------------------------------------------------------------------------- /README.MD: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/WeitaoVan/Sohu-LuckData-Image-Text-Matching-Competition/HEAD/README.MD -------------------------------------------------------------------------------- /Tensorflow/BidirectionNet_4wtfidf.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/WeitaoVan/Sohu-LuckData-Image-Text-Matching-Competition/HEAD/Tensorflow/BidirectionNet_4wtfidf.py -------------------------------------------------------------------------------- /Tensorflow/BidirectionNet_lstm.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/WeitaoVan/Sohu-LuckData-Image-Text-Matching-Competition/HEAD/Tensorflow/BidirectionNet_lstm.py -------------------------------------------------------------------------------- /Tensorflow/BidirectionNet_tfidf.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/WeitaoVan/Sohu-LuckData-Image-Text-Matching-Competition/HEAD/Tensorflow/BidirectionNet_tfidf.py -------------------------------------------------------------------------------- /Tensorflow/Kmeans.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/WeitaoVan/Sohu-LuckData-Image-Text-Matching-Competition/HEAD/Tensorflow/Kmeans.py -------------------------------------------------------------------------------- /Tensorflow/inspect_data.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/WeitaoVan/Sohu-LuckData-Image-Text-Matching-Competition/HEAD/Tensorflow/inspect_data.py -------------------------------------------------------------------------------- /Tensorflow/merge_3crop_data.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/WeitaoVan/Sohu-LuckData-Image-Text-Matching-Competition/HEAD/Tensorflow/merge_3crop_data.py -------------------------------------------------------------------------------- /Tensorflow/test_match_pairList.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/WeitaoVan/Sohu-LuckData-Image-Text-Matching-Competition/HEAD/Tensorflow/test_match_pairList.py -------------------------------------------------------------------------------- /caffe/README.MD: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/WeitaoVan/Sohu-LuckData-Image-Text-Matching-Competition/HEAD/caffe/README.MD -------------------------------------------------------------------------------- /caffe/extract_feature.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/WeitaoVan/Sohu-LuckData-Image-Text-Matching-Competition/HEAD/caffe/extract_feature.py --------------------------------------------------------------------------------