├── .idea
    ├── .name
    ├── MultiTurnResponseSelection.iml
    ├── encodings.xml
    ├── misc.xml
    ├── modules.xml
    ├── vcs.xml
    └── workspace.xml
├── README.md
├── tensorflow_src
    ├── Evaluate.py
    ├── SCN.PY
    └── utils.py
├── test.txt
├── theano_src
    ├── CNN.py
    ├── Classifier.py
    ├── Optimization.py
    ├── PreProcess.py
    ├── RNN.py
    ├── SMN_Dynamic.py
    ├── SMN_Last.py
    ├── SMN_Static.py
    ├── SimAsImage.py
    └── logistic_sgd.py
└── train.sample


/.idea/.name:
--------------------------------------------------------------------------------
1 | MultiTurnResponseSelection
--------------------------------------------------------------------------------
/README.md:
--------------------------------------------------------------------------------
1 | # Douban Conversation Corpus
2 | 
3 | ## Data set
4 | We release the Douban Conversation Corpus, comprising a training set, a development set, and a test set for retrieval-based chatbots. The statistics of the corpus are shown in the following table.
5 | 
6 | | | Train | Val | Test |
7 | | ------------- |:-------------:|:-------------:|:-------------:|
8 | | Session-response pairs | 1M | 50k | 10k |
9 | | Avg. positive responses per session | 1 | 1 | 1.18 |
10 | | Fleiss' Kappa | N/A | N/A | 0.41 |
11 | | Min turns per session | 3 | 3 | 3 |
12 | | Max turns per session | 98 | 91 | 45 |
13 | | Average turns per session | 6.69 | 6.75 | 5.95 |
14 | | Average words per utterance | 18.56 | 18.50 | 20.74 |
15 | 
16 | 
17 | The test data contains 1,000 dialogue contexts, and for each context we create 10 responses as candidates. We recruited three labelers to judge whether a candidate is a proper response to the session. A proper response is one that can naturally reply to the message given the context. Each pair received three labels, and the majority label was taken as the final decision.
18 | 
19 | 
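The labeling rule above can be sketched as a majority vote over three binary judgments. This is only an illustration, not the authors' annotation tooling; the function name `majority_label` and the 0/1 encoding of the judgments are assumptions.

```python
from collections import Counter


def majority_label(labels):
    """Return the label chosen by most annotators.

    Assumes an odd number of binary (0/1) labels, so a tie cannot occur.
    """
    if len(labels) % 2 == 0:
        raise ValueError("expected an odd number of labels")
    # most_common(1) yields the single (label, count) pair with the highest count.
    return Counter(labels).most_common(1)[0][0]


# Two of the three hypothetical labelers accept the candidate response.
print(majority_label([1, 0, 1]))
```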
20 | As far as we know, this is the first human-labeled test set for retrieval-based chatbots. The entire corpus is available at https://www.dropbox.com/s/90t0qtji9ow20ca/DoubanConversaionCorpus.zip?dl=0
21 | 
22 | 
23 | ## Data template
24 | label \t conversation utterances (split by \t) \t response
25 | 
26 | 
27 | ## Source Code
28 | We also release our source code to help others reproduce our results. The code has been tested under Ubuntu 14.04 with Python 2.7.
29 | 
30 | First edit PreProcess.py so that it points to the correct data paths, then run it; it produces a .bin file. After that, run SMN_Last.py with the generated .bin file, and the training loss will be printed to the screen. If you set train_flag = False, it outputs the predicted scores of your model.
31 | 
32 | Some tips:
33 | 
34 | The 200-d word embedding is shared at https://1drv.ms/u/s!AtcxwlQuQjw1jF0bjeaKHEUNwitA . The shared file is a list with 3 elements, one of which is a word2vec file. Please download it and replace the input path (training data) in my script.
35 | 
36 | TensorFlow resources:
37 | 
38 | The TensorFlow code requires several data sets, which have been uploaded to the following locations:
39 | 
40 | Resource file: https://1drv.ms/u/s!AtcxwlQuQjw1jGn5kPzsH03lnG6U
41 | 
42 | Worddict file: https://1drv.ms/u/s!AtcxwlQuQjw1jGrCjg8liK1wE-N9
43 | 
44 | Requirement: tensorflow>=1.3
45 | 
46 | 
47 | ## Reference
48 | Please cite our paper if you use the data or code in this repository.
49 | 
50 | Wu, Yu, et al. "Sequential Matching Network: A New Architecture for Multi-turn Response Selection in Retrieval-based Chatbots." ACL. 2017.
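A line in the format given under "Data template" could be read with a sketch like the one below. It assumes, as the template states, that each line holds a label, one or more context utterances, and a final response, all separated by tabs. The function name `parse_line` and the sample line are hypothetical and are not taken from the released code.

```python
def parse_line(line):
    """Split one corpus line into (label, context utterances, response)."""
    fields = line.rstrip("\n").split("\t")
    if len(fields) < 3:
        raise ValueError("expected a label, at least one utterance, and a response")
    label = int(fields[0])    # 1 for a proper response, 0 otherwise
    context = fields[1:-1]    # every field between the label and the response
    response = fields[-1]     # the candidate response is the last field
    return label, context, response


# Hypothetical sample line, for illustration only.
sample = "1\thello\thow are you\tfine , thanks\n"
print(parse_line(sample))
```

Treating every field between the first and the last as context keeps the parser independent of the number of turns, which varies from session to session in the corpus.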
51 | 
--------------------------------------------------------------------------------
/tensorflow_src/Evaluate.py:
--------------------------------------------------------------------------------
1 | def ComputeR10_1(scores,labels,count = 10):
2 |     total = 0
3 |     correct = 0
4 |     for i in range(len(labels)):
5 |         if labels[i] == 1:
6 |             total = total+1
7 |             sublist = scores[i:i+count]
8 |             if max(sublist) == scores[i]:
9 |                 correct = correct + 1
10 |     print(float(correct)/ total )
11 | 
12 | def ComputeR2_1(scores,labels,count = 2):
13 |     total = 0
14 |     correct = 0
15 |     for i in range(len(labels)):
16 |         if labels[i] == 1:
17 |             total = total+1
18 |             sublist = scores[i:i+count]
19 |             if max(sublist) == scores[i]:
20 |                 correct = correct + 1
21 |     print(float(correct)/ total )
--------------------------------------------------------------------------------
/tensorflow_src/SCN.PY:
--------------------------------------------------------------------------------
1 | import tensorflow as tf
2 | import pickle
3 | import utils
4 | from keras.preprocessing.sequence import pad_sequences
5 | import numpy as np
6 | import Evaluate
7 | 
8 | embedding_file = r"D:\data\Ubuntu\embedding.pkl"
9 | evaluate_file = r"D:\data\Ubuntu\Evaluate.pkl"
10 | response_file =r"D:\data\Ubuntu\responses.pkl"
11 | history_file = r"D:\data\Ubuntu\utterances.pkl"
12 | 
13 | class SCN():
14 |     def __init__(self):
15 |         self.max_num_utterance = 10
16 |         self.negative_samples = 1
17 |         self.max_sentence_len = 50
18 |         self.word_embedding_size = 200
19 |         self.rnn_units = 200
20 |         self.total_words = 434511
21 |         self.batch_size = 40
22 | 
23 |     def LoadModel(self):
24 |         #init = tf.global_variables_initializer()
25 |         saver = tf.train.Saver()
26 |         sess = tf.Session()
27 |         #with tf.Session() as sess:
28 |             #sess.run(init)
29 |         saver.restore(sess,"neg5model\\model.5")
30 |         return sess
31 |         # Later, launch the model, use the saver to restore variables from disk, and
32 |         # do some work with the model.
33 | # with tf.Session() as sess: 34 | # # Restore variables from disk. 35 | # saver.restore(sess, "/model/model.5") 36 | # print("Model restored.") 37 | 38 | def BuildModel(self): 39 | self.utterance_ph = tf.placeholder(tf.int32, shape=(None, self.max_num_utterance, self.max_sentence_len)) 40 | self.response_ph = tf.placeholder(tf.int32, shape=(None, self.max_sentence_len)) 41 | self.y_true = tf.placeholder(tf.int32, shape=(None,)) 42 | self.embedding_ph = tf.placeholder(tf.float32, shape=(self.total_words, self.word_embedding_size)) 43 | self.response_len = tf.placeholder(tf.int32, shape=(None,)) 44 | self.all_utterance_len_ph = tf.placeholder(tf.int32, shape=(None, self.max_num_utterance)) 45 | word_embeddings = tf.get_variable('word_embeddings_v', shape=(self.total_words,self. 46 | word_embedding_size), dtype=tf.float32, trainable=False) 47 | self.embedding_init = word_embeddings.assign(self.embedding_ph) 48 | all_utterance_embeddings = tf.nn.embedding_lookup(word_embeddings, self.utterance_ph) 49 | response_embeddings = tf.nn.embedding_lookup(word_embeddings, self.response_ph) 50 | sentence_GRU = tf.nn.rnn_cell.GRUCell(self.rnn_units, kernel_initializer=tf.orthogonal_initializer()) 51 | all_utterance_embeddings = tf.unstack(all_utterance_embeddings, num=self.max_num_utterance, axis=1) 52 | all_utterance_len = tf.unstack(self.all_utterance_len_ph, num=self.max_num_utterance, axis=1) 53 | A_matrix = tf.get_variable('A_matrix_v', shape=(self.rnn_units, self.rnn_units), initializer=tf.contrib.layers.xavier_initializer(), dtype=tf.float32) 54 | final_GRU = tf.nn.rnn_cell.GRUCell(self.rnn_units, kernel_initializer=tf.orthogonal_initializer()) 55 | reuse = None 56 | 57 | response_GRU_embeddings, _ = tf.nn.dynamic_rnn(sentence_GRU, response_embeddings, sequence_length=self.response_len, dtype=tf.float32, 58 | scope='sentence_GRU') 59 | self.response_embedding_save = response_GRU_embeddings 60 | response_embeddings = tf.transpose(response_embeddings, perm=[0, 2, 1]) 61 
| response_GRU_embeddings = tf.transpose(response_GRU_embeddings, perm=[0, 2, 1]) 62 | matching_vectors = [] 63 | for utterance_embeddings, utterance_len in zip(all_utterance_embeddings, all_utterance_len): 64 | matrix1 = tf.matmul(utterance_embeddings, response_embeddings) 65 | utterance_GRU_embeddings, _ = tf.nn.dynamic_rnn(sentence_GRU, utterance_embeddings, sequence_length=utterance_len, dtype=tf.float32, 66 | scope='sentence_GRU') 67 | matrix2 = tf.einsum('aij,jk->aik', utterance_GRU_embeddings, A_matrix) # TODO:check this 68 | matrix2 = tf.matmul(matrix2, response_GRU_embeddings) 69 | matrix = tf.stack([matrix1, matrix2], axis=3, name='matrix_stack') 70 | conv_layer = tf.layers.conv2d(matrix, filters=8, kernel_size=(3, 3), padding='VALID', 71 | kernel_initializer=tf.contrib.keras.initializers.he_normal(), 72 | activation=tf.nn.relu, reuse=reuse, name='conv') # TODO: check other params 73 | pooling_layer = tf.layers.max_pooling2d(conv_layer, (3, 3), strides=(3, 3), 74 | padding='VALID', name='max_pooling') # TODO: check other params 75 | matching_vector = tf.layers.dense(tf.contrib.layers.flatten(pooling_layer), 50, 76 | kernel_initializer=tf.contrib.layers.xavier_initializer(), 77 | activation=tf.tanh, reuse=reuse, name='matching_v') # TODO: check wthether this is correct 78 | if not reuse: 79 | reuse = True 80 | matching_vectors.append(matching_vector) 81 | _, last_hidden = tf.nn.dynamic_rnn(final_GRU, tf.stack(matching_vectors, axis=0, name='matching_stack'), dtype=tf.float32, 82 | time_major=True, scope='final_GRU') # TODO: check time_major 83 | logits = tf.layers.dense(last_hidden, 2, kernel_initializer=tf.contrib.layers.xavier_initializer(), name='final_v') 84 | self.y_pred = tf.nn.softmax(logits) 85 | self.total_loss = tf.reduce_mean(tf.nn.sparse_softmax_cross_entropy_with_logits(labels=self.y_true, logits=logits)) 86 | tf.summary.scalar('loss', self.total_loss) 87 | optimizer = tf.train.AdamOptimizer(learning_rate=0.001) 88 | self.train_op = 
optimizer.minimize(self.total_loss) 89 | 90 | def Evaluate(self,sess): 91 | with open(evaluate_file, 'rb') as f: 92 | history, true_utt,labels = pickle.load(f) 93 | self.all_candidate_scores = [] 94 | history, history_len = utils.multi_sequences_padding(history, self.max_sentence_len) 95 | history, history_len = np.array(history), np.array(history_len) 96 | true_utt_len = np.array(utils.get_sequences_length(true_utt, maxlen=self.max_sentence_len)) 97 | true_utt = np.array(pad_sequences(true_utt, padding='post', maxlen=self.max_sentence_len)) 98 | low = 0 99 | while True: 100 | feed_dict = {self.utterance_ph: np.concatenate([history[low:low + 200]], axis=0), 101 | self.all_utterance_len_ph: np.concatenate([history_len[low:low + 200]], axis=0), 102 | self.response_ph: np.concatenate([true_utt[low:low + 200]], axis=0), 103 | self.response_len: np.concatenate([true_utt_len[low:low + 200]], axis=0), 104 | } 105 | candidate_scores = sess.run(self.y_pred, feed_dict=feed_dict) 106 | self.all_candidate_scores.append(candidate_scores[:, 1]) 107 | low = low + 200 108 | if low >= history.shape[0]: 109 | break 110 | all_candidate_scores = np.concatenate(self.all_candidate_scores, axis=0) 111 | Evaluate.ComputeR10_1(all_candidate_scores,labels) 112 | Evaluate.ComputeR2_1(all_candidate_scores,labels) 113 | 114 | 115 | 116 | 117 | def TrainModel(self,countinue_train = False, previous_modelpath = "model"): 118 | init = tf.global_variables_initializer() 119 | saver = tf.train.Saver() 120 | merged = tf.summary.merge_all() 121 | with tf.Session() as sess: 122 | writer = tf.summary.FileWriter("output2", sess.graph) 123 | train_writer = tf.summary.FileWriter('output2', sess.graph) 124 | with open(response_file, 'rb') as f: 125 | actions = pickle.load(f) 126 | with open(embedding_file, 'rb') as f: 127 | embeddings = pickle.load(f,encoding="bytes") 128 | with open(history_file, 'rb') as f: 129 | history, true_utt = pickle.load(f) 130 | # with open("data/biglearn_test_small.txt", 
encoding="utf8") as f: 131 | # lines = f.readlines() 132 | # history, true_utt = utils.build_evaluate_data(lines) 133 | history, history_len = utils.multi_sequences_padding(history, self.max_sentence_len) 134 | true_utt_len = np.array(utils.get_sequences_length(true_utt, maxlen=self.max_sentence_len)) 135 | true_utt = np.array(pad_sequences(true_utt, padding='post', maxlen=self.max_sentence_len)) 136 | actions_len = np.array(utils.get_sequences_length(actions, maxlen=self.max_sentence_len)) 137 | actions = np.array(pad_sequences(actions, padding='post', maxlen=self.max_sentence_len)) 138 | history, history_len = np.array(history), np.array(history_len) 139 | if countinue_train == False: 140 | sess.run(init) 141 | sess.run(self.embedding_init, feed_dict={self.embedding_ph: embeddings}) 142 | else: 143 | saver.restore(sess,previous_modelpath) 144 | low = 0 145 | epoch = 1 146 | while epoch < 10: 147 | n_sample = min(low + self.batch_size, history.shape[0]) - low 148 | negative_indices = [np.random.randint(0, actions.shape[0], n_sample) for _ in range(self.negative_samples)] 149 | negs = [actions[negative_indices[i], :] for i in range(self.negative_samples)] 150 | negs_len = [actions_len[negative_indices[i]] for i in range(self.negative_samples)] 151 | feed_dict = {self.utterance_ph: np.concatenate([history[low:low + n_sample]] * (self.negative_samples + 1), axis=0), 152 | self.all_utterance_len_ph: np.concatenate([history_len[low:low + n_sample]] * (self.negative_samples + 1), axis=0), 153 | self.response_ph: np.concatenate([true_utt[low:low + n_sample]] + negs, axis=0), 154 | self.response_len: np.concatenate([true_utt_len[low:low + n_sample]] + negs_len, axis=0), 155 | self.y_true: np.concatenate([np.ones(n_sample)] + [np.zeros(n_sample)] * self.negative_samples, axis=0) 156 | } 157 | _, summary = sess.run([self.train_op, merged], feed_dict=feed_dict) 158 | train_writer.add_summary(summary) 159 | low += n_sample 160 | if low % 102400 == 0: 161 | 
print("loss",sess.run(self.total_loss, feed_dict=feed_dict)) 162 | self.Evaluate(sess) 163 | if low >= history.shape[0]: 164 | low = 0 165 | saver.save(sess,"model/model.{0}".format(epoch)) 166 | print(sess.run(self.total_loss, feed_dict=feed_dict)) 167 | print('epoch={i}'.format(i=epoch)) 168 | epoch += 1 169 | 170 | if __name__ == "__main__": 171 | scn =SCN() 172 | scn.BuildModel() 173 | scn.TrainModel() 174 | #sess = scn.LoadModel() 175 | #scn.Evaluate(sess) 176 | #results = scn.BuildIndex(sess) 177 | #print(len(results)) 178 | 179 | #scn.TrainModel() -------------------------------------------------------------------------------- /tensorflow_src/utils.py: -------------------------------------------------------------------------------- 1 | import concurrent.futures 2 | import pickle 3 | import numpy as np 4 | from keras.preprocessing.sequence import pad_sequences 5 | from keras.preprocessing.text import text_to_word_sequence 6 | 7 | 8 | def build_data(lines, word_dict, tid=0): 9 | def word2id(c): 10 | if c in word_dict: 11 | return word_dict[c] 12 | else: 13 | return 0 14 | 15 | cnt = 0 16 | history = [] 17 | true_utt = [] 18 | for line in lines: 19 | fields = line.rstrip().lower().split('\t') 20 | utterance = fields[1].split('###') 21 | history.append([list(map(word2id, text_to_word_sequence(each_utt))) for each_utt in utterance]) 22 | true_utt.append(list(map(word2id, text_to_word_sequence(fields[2])))) 23 | cnt += 1 24 | if cnt % 10000 == 0: 25 | print(tid, cnt) 26 | return history, true_utt 27 | 28 | 29 | def build_evaluate_data(lines, tid=0): 30 | with open('worddata/word_dict.pkl', 'rb') as f: 31 | word_dict = pickle.load(f) 32 | 33 | def word2id(c): 34 | if c in word_dict: 35 | return word_dict[c] 36 | else: 37 | return 0 38 | 39 | cnt = 0 40 | history = [] 41 | true_utt = [] 42 | for line in lines: 43 | fields = line.rstrip().lower().split('\t') 44 | utterance = fields[-1].split('###') 45 | history.append([list(map(word2id, 
text_to_word_sequence(each_utt))) for each_utt in utterance]) 46 | true_utt.append(list(map(word2id, text_to_word_sequence(fields[0])))) 47 | cnt += 1 48 | if cnt % 10000 == 0: 49 | print(tid, cnt) 50 | return history, true_utt 51 | 52 | 53 | def multi_sequences_padding(all_sequences, max_sentence_len=50): 54 | max_num_utterance = 10 55 | PAD_SEQUENCE = [0] * max_sentence_len 56 | padded_sequences = [] 57 | sequences_length = [] 58 | for sequences in all_sequences: 59 | sequences_len = len(sequences) 60 | sequences_length.append(get_sequences_length(sequences, maxlen=max_sentence_len)) 61 | if sequences_len < max_num_utterance: 62 | sequences += [PAD_SEQUENCE] * (max_num_utterance - sequences_len) 63 | sequences_length[-1] += [0] * (max_num_utterance - sequences_len) 64 | else: 65 | sequences = sequences[-max_num_utterance:] 66 | sequences_length[-1] = sequences_length[-1][-max_num_utterance:] 67 | sequences = pad_sequences(sequences, padding='post', maxlen=max_sentence_len) 68 | padded_sequences.append(sequences) 69 | return padded_sequences, sequences_length 70 | 71 | 72 | def get_sequences_length(sequences, maxlen): 73 | sequences_length = [min(len(sequence), maxlen) for sequence in sequences] 74 | return sequences_length 75 | 76 | 77 | def load_data(total_words): 78 | process_num = 10 79 | executor = concurrent.futures.ProcessPoolExecutor(process_num) 80 | base = 0 81 | results = [] 82 | history = [] 83 | true_utt = [] 84 | word_dict = dict() 85 | vectors = [] 86 | with open('data/glove.twitter.27B.200d.txt', encoding='utf8') as f: 87 | lines = f.readlines() 88 | for i, line in enumerate(lines): 89 | line = line.split(' ') 90 | word_dict[line[0]] = i 91 | vectors.append(line[1:]) 92 | if i > total_words: 93 | break 94 | with open('worddata/embedding_matrix.pkl', "wb") as f: 95 | pickle.dump(vectors, f) 96 | with open("data/biglearn_train.old.txt", encoding="utf8") as f: 97 | lines = f.readlines() 98 | total_num = 1000000 99 | print(total_num) 100 | low = 0 101 
| step = total_num // process_num 102 | print(step) 103 | while True: 104 | if low < total_num: 105 | results.append(executor.submit(build_data, lines[low:low + step], word_dict, base)) 106 | else: 107 | break 108 | base += 1 109 | low += step 110 | 111 | for result in results: 112 | h, t = result.result() 113 | history += h 114 | true_utt += t 115 | print(len(history)) 116 | print(len(true_utt)) 117 | pickle.dump([history, true_utt], open("worddata/train.pkl", "wb")) 118 | actions_id = [] 119 | with open('emb/actions.txt', encoding='utf8') as f: 120 | actions = f.readlines() 121 | 122 | def word2id(c): 123 | if c in word_dict: 124 | return word_dict[c] 125 | else: 126 | return 0 127 | 128 | for action in actions: 129 | actions_id.append([word2id(word) for word in text_to_word_sequence(action)]) 130 | with open('worddata/actions_embeddings.pkl', 'wb') as f: 131 | pickle.dump(actions_id, f) 132 | 133 | 134 | def evaluate(test_file, sess, actions, actions_len, max_sentence_len, utterance_ph, all_utterance_len_ph, 135 | response_ph, response_len, y_pred): 136 | each_test_run = len(actions) // 3 137 | acc1 = [0.0] * 10 138 | rank1 = 0.0 139 | cnt = 0 140 | print('evaluating') 141 | 142 | with open(test_file, encoding="utf8") as f: 143 | lines = f.readlines() 144 | low = 0 145 | history, true_utt = build_evaluate_data(lines) 146 | history, history_len = multi_sequences_padding(history, max_sentence_len) 147 | true_utt_len = np.array(get_sequences_length(true_utt, maxlen=max_sentence_len)) 148 | true_utt = np.array(pad_sequences(true_utt,padding='post', maxlen=max_sentence_len)) 149 | history, history_len = np.array(history), np.array(history_len) 150 | feed_dict = {utterance_ph: history, 151 | all_utterance_len_ph: history_len, 152 | response_ph: true_utt, 153 | response_len: true_utt_len 154 | } 155 | true_scores = sess.run(y_pred, feed_dict=feed_dict) 156 | true_scores = true_scores[:, 1] 157 | for i in range(true_scores.shape[0]): 158 | all_candidate_scores = [] 159 
| for j in range(3): 160 | feed_dict = {utterance_ph: np.concatenate([history[low:low + 1]] * each_test_run, axis=0), 161 | all_utterance_len_ph: np.concatenate([history_len[low:low + 1]] * each_test_run, axis=0), 162 | response_ph: actions[each_test_run * j:each_test_run * (j + 1)], 163 | response_len: actions_len[each_test_run * j:each_test_run * (j + 1)] 164 | } 165 | candidate_scores = sess.run(y_pred, feed_dict=feed_dict) 166 | all_candidate_scores.append(candidate_scores[:, 1]) 167 | all_candidate_scores = np.concatenate(all_candidate_scores, axis=0) 168 | pos1 = np.sum(true_scores[i] + 1e-8 < all_candidate_scores) 169 | if pos1 < 10: 170 | acc1[pos1] += 1 171 | rank1 += pos1 172 | low += 1 173 | cnt += true_scores.shape[0] 174 | print([a / cnt for a in acc1]) # rank top 1 to top 10 acc 175 | print(rank1 / cnt) # average rank 176 | print(np.sum(acc1[:3]) * 1.0 / cnt) # top 3 acc 177 | 178 | 179 | if __name__ == '__main__': 180 | load_data(500000) 181 | -------------------------------------------------------------------------------- /theano_src/CNN.py: -------------------------------------------------------------------------------- 1 | import numpy as np 2 | import scipy.sparse as sp 3 | from collections import defaultdict, OrderedDict 4 | import sys, re, cPickle, random, logging, argparse 5 | import datetime 6 | 7 | import theano 8 | import theano.tensor as T 9 | from theano.tensor.nnet import conv 10 | 11 | def ReLU(x): 12 | y = T.maximum(0.0, x) 13 | return(y) 14 | 15 | def kmaxpooling(input,input_shape,k): 16 | sorted_values = T.argsort(input,axis=3) 17 | topmax_indexes = sorted_values[:,:,:,-k:] 18 | # sort indexes so that we keep the correct order within the sentence 19 | topmax_indexes_sorted = T.sort(topmax_indexes) 20 | 21 | #given that topmax only gives the index of the third dimension, we need to generate the other 3 dimensions 22 | dim0 = T.arange(0,input_shape[0]).repeat(input_shape[1]*input_shape[2]*k) 23 | dim1 = 
T.arange(0,input_shape[1]).repeat(k*input_shape[2]).reshape((1,-1)).repeat(input_shape[0],axis=0).flatten() 24 | dim2 = T.arange(0,input_shape[2]).repeat(k).reshape((1,-1)).repeat(input_shape[0]*input_shape[1],axis=0).flatten() 25 | dim3 = topmax_indexes_sorted.flatten() 26 | return input[dim0,dim1,dim2,dim3].reshape((input_shape[0], input_shape[1], input_shape[2], k)) 27 | 28 | class QALeNetConvPoolLayer(object): 29 | """ Convolution Layer and Pool Layer for Question and Sentence pair """ 30 | 31 | def __init__(self, rng, linp, rinp, filter_shape, poolsize): 32 | """ 33 | Allocate a LeNetConvPoolLayer with shared variable internal parameters. 34 | 35 | :type rng: numpy.random.RandomState 36 | :param rng: a random number generator used to initialize weights 37 | 38 | :type linp: theano.tensor.TensorType 39 | :param linp: symbolic variable that describes the left input of the 40 | architecture (one minibatch) 41 | 42 | :type rinp: theano.tensor.TensorType 43 | :param rinp: symbolic variable that describes the right input of the 44 | architecture (one minibatch) 45 | 46 | :type filter_shape: tuple or list of length 4 47 | :param filter_shape: (number of filters, 1, 48 | filter height,filter width) 49 | 50 | :type poolsize: tuple or list of length 2 51 | :param poolsize: the downsampling (pooling) factor (#rows,#cols) 52 | """ 53 | 54 | self.linp = linp 55 | self.rinp = rinp 56 | self.filter_shape = filter_shape 57 | self.poolsize = poolsize 58 | 59 | # there are "num input feature maps * filter height * filter width" 60 | # inputs to each hidden unit 61 | fan_in = np.prod(filter_shape[1:]) 62 | # each unit in the lower layer receives a gradient from: 63 | # "num output feature maps * filter height * filter width" / 64 | # pooling size 65 | fan_out = (filter_shape[0] * np.prod(filter_shape[2:]) /np.prod(poolsize)) 66 | # initialize weights with random weights 67 | W_bound = np.sqrt(6. 
/ (fan_in + fan_out)) 68 | self.W = theano.shared(np.asarray(rng.uniform(low=-W_bound,high=W_bound,size=filter_shape), 69 | dtype=theano.config.floatX),borrow=True,name="W_conv") 70 | b_values = np.zeros((filter_shape[0],), dtype=theano.config.floatX) 71 | self.b = theano.shared(value=b_values, borrow=True, name="b_conv") 72 | 73 | # convolve input feature maps with filters 74 | lconv_out = conv.conv2d(input=linp, filters=self.W, filter_shape = filter_shape) 75 | rconv_out = conv.conv2d(input=rinp, filters=self.W, filter_shape = filter_shape) 76 | self.lconv_out_tanh = ReLU(lconv_out + self.b.dimshuffle('x', 0, 'x', 'x')) 77 | self.rconv_out_tanh = ReLU(rconv_out + self.b.dimshuffle('x', 0, 'x', 'x')) 78 | self.loutput = theano.tensor.signal.pool.pool_2d(input=self.lconv_out_tanh, ds=self.poolsize, ignore_border=True, mode="max") 79 | self.routput = theano.tensor.signal.pool.pool_2d(input=self.rconv_out_tanh, ds=self.poolsize, ignore_border=True, mode="max") 80 | self.params = [self.W, self.b] 81 | 82 | def predict(self, lnew_data, rnew_data): 83 | """ 84 | predict for new data 85 | """ 86 | lconv_out = conv.conv2d(input=lnew_data, filters=self.W) 87 | rconv_out = conv.conv2d(input=rnew_data, filters=self.W) 88 | lconv_out_tanh = T.tanh(lconv_out + self.b.dimshuffle('x', 0, 'x', 'x')) 89 | rconv_out_tanh = T.tanh(rconv_out + self.b.dimshuffle('x', 0, 'x', 'x')) 90 | loutput = theano.tensor.signal.pool.pool_2d(input=lconv_out_tanh, ds=self.poolsize, ignore_border=True, mode="max") 91 | routput = theano.tensor.signal.pool.pool_2d(input=rconv_out_tanh, ds=self.poolsize, ignore_border=True, mode="max") 92 | return loutput, routput 93 | 94 | class LeNetConvPoolLayer(object): 95 | """Pool Layer of a convolutional network """ 96 | 97 | def __init__(self, rng, input, filter_shape, image_shape, poolsize=(2, 2), non_linear="tanh"): 98 | """ 99 | Allocate a LeNetConvPoolLayer with shared variable internal parameters. 
100 | 101 | :type rng: numpy.random.RandomState 102 | :param rng: a random number generator used to initialize weights 103 | 104 | :type input: theano.tensor.dtensor4 105 | :param input: symbolic image tensor, of shape image_shape 106 | 107 | :type filter_shape: tuple or list of length 4 108 | :param filter_shape: (number of filters, num input feature maps, 109 | filter height,filter width) 110 | 111 | :type image_shape: tuple or list of length 4 112 | :param image_shape: (batch size, num input feature maps, 113 | image height, image width) 114 | 115 | :type poolsize: tuple or list of length 2 116 | :param poolsize: the downsampling (pooling) factor (#rows,#cols) 117 | """ 118 | print 'image shape', image_shape 119 | print 'filter shape', filter_shape 120 | assert image_shape[1] == filter_shape[1] 121 | self.input = input 122 | self.filter_shape = filter_shape 123 | self.image_shape = image_shape 124 | self.poolsize = poolsize 125 | self.non_linear = non_linear 126 | # there are "num input feature maps * filter height * filter width" 127 | # inputs to each hidden unit 128 | fan_in = np.prod(filter_shape[1:]) 129 | # each unit in the lower layer receives a gradient from: 130 | # "num output feature maps * filter height * filter width" / 131 | # pooling size 132 | fan_out = (filter_shape[0] * np.prod(filter_shape[2:]) /np.prod(poolsize)) 133 | # initialize weights with random weights 134 | if self.non_linear=="none" or self.non_linear=="relu": 135 | self.W = theano.shared(np.asarray(rng.uniform(low=-0.01,high=0.01,size=filter_shape), 136 | dtype=theano.config.floatX),borrow=True,name="W_conv") 137 | else: 138 | W_bound = np.sqrt(6. 
/ (fan_in + fan_out))
139 |             self.W = theano.shared(np.asarray(rng.uniform(low=-W_bound, high=W_bound, size=filter_shape),
140 |                 dtype=theano.config.floatX),borrow=True,name="W_conv")
141 |         b_values =np.zeros((filter_shape[0],), dtype=theano.config.floatX)
142 |         self.b = theano.shared(value=b_values, borrow=True, name="b_conv")
143 | 
144 |         # convolve input feature maps with filters
145 |         conv_out = conv.conv2d(input=input, filters=self.W,filter_shape=self.filter_shape, image_shape=self.image_shape)
146 |         if self.non_linear=="tanh":
147 |             conv_out_tanh = T.tanh(conv_out + self.b.dimshuffle('x', 0, 'x', 'x'))
148 |             self.output = theano.tensor.signal.pool.pool_2d(input=conv_out_tanh, ds=self.poolsize, ignore_border=True,mode="max")
149 |         elif self.non_linear=="relu":
150 |             conv_out_tanh = ReLU(conv_out + self.b.dimshuffle('x', 0, 'x', 'x'))
151 |             self.output = theano.tensor.signal.pool.pool_2d(input=conv_out_tanh, ds=self.poolsize, ignore_border=True,mode="max")
152 |         else:
153 |             pooled_out = theano.tensor.signal.pool.pool_2d(input=conv_out, ds=self.poolsize, ignore_border=True,mode="max")
154 |             self.output = pooled_out + self.b.dimshuffle('x', 0, 'x', 'x')
155 |         self.params = [self.W, self.b]
156 | 
157 |     def predict(self, new_data, batch_size):
158 |         """
159 |         predict for new data
160 |         """
161 |         img_shape = (batch_size, 1, self.image_shape[2], self.image_shape[3])
162 |         conv_out = conv.conv2d(input=new_data, filters=self.W, filter_shape=self.filter_shape, image_shape=img_shape)
163 |         if self.non_linear=="tanh":
164 |             conv_out_tanh = T.tanh(conv_out + self.b.dimshuffle('x', 0, 'x', 'x'))
165 |             output = theano.tensor.signal.pool.pool_2d(input=conv_out_tanh, ds=self.poolsize, ignore_border=True)
166 |         elif self.non_linear=="relu":  # elif, so the tanh result is not overwritten by the else branch
167 |             conv_out_tanh = ReLU(conv_out + self.b.dimshuffle('x', 0, 'x', 'x'))
168 |             output =theano.tensor.signal.pool.pool_2d(input=conv_out_tanh, ds=self.poolsize, ignore_border=True)
169 |         else:
170 |             pooled_out = 
theano.tensor.signal.pool.pool_2d(input=conv_out, ds=self.poolsize, ignore_border=True) 171 | output = pooled_out + self.b.dimshuffle('x', 0, 'x', 'x') 172 | return output 173 | 174 | class LeNetConvPoolLayer2(object): 175 | """Pool Layer of a convolutional network """ 176 | 177 | def __init__(self, rng, filter_shape, image_shape, poolsize=(2, 2), non_linear="tanh"): 178 | """ 179 | Allocate a LeNetConvPoolLayer with shared variable internal parameters. 180 | 181 | :type rng: numpy.random.RandomState 182 | :param rng: a random number generator used to initialize weights 183 | 184 | :type input: theano.tensor.dtensor4 185 | :param input: symbolic image tensor, of shape image_shape 186 | 187 | :type filter_shape: tuple or list of length 4 188 | :param filter_shape: (number of filters, num input feature maps, 189 | filter height,filter width) 190 | 191 | :type image_shape: tuple or list of length 4 192 | :param image_shape: (batch size, num input feature maps, 193 | image height, image width) 194 | 195 | :type poolsize: tuple or list of length 2 196 | :param poolsize: the downsampling (pooling) factor (#rows,#cols) 197 | """ 198 | print 'image shape', image_shape 199 | print 'filter shape', filter_shape 200 | assert image_shape[1] == filter_shape[1] 201 | self.filter_shape = filter_shape 202 | self.image_shape = image_shape 203 | self.poolsize = poolsize 204 | self.non_linear = non_linear 205 | # there are "num input feature maps * filter height * filter width" 206 | # inputs to each hidden unit 207 | fan_in = np.prod(filter_shape[1:]) 208 | # each unit in the lower layer receives a gradient from: 209 | # "num output feature maps * filter height * filter width" / 210 | # pooling size 211 | fan_out = (filter_shape[0] * np.prod(filter_shape[2:]) /np.prod(poolsize)) 212 | # initialize weights with random weights 213 | if self.non_linear=="none" or self.non_linear=="relu": 214 | self.W = theano.shared(np.asarray(rng.uniform(low=-0.01,high=0.01,size=filter_shape), 215 | 
dtype=theano.config.floatX),borrow=True,name="W_conv") 216 | else: 217 | W_bound = np.sqrt(6. / (fan_in + fan_out)) 218 | self.W = theano.shared(np.asarray(rng.uniform(low=-W_bound, high=W_bound, size=filter_shape), 219 | dtype=theano.config.floatX),borrow=True,name="W_conv") 220 | b_values =np.zeros((filter_shape[0],), dtype=theano.config.floatX) 221 | self.b = theano.shared(value=b_values, borrow=True, name="b_conv") 222 | self.params = [self.W, self.b] 223 | # convolve input feature maps with filters 224 | 225 | 226 | def __call__(self, input): 227 | conv_out = conv.conv2d(input=input, filters=self.W,filter_shape=self.filter_shape, image_shape=self.image_shape) 228 | if self.non_linear=="tanh": 229 | conv_out_tanh = T.tanh(conv_out + self.b.dimshuffle('x', 0, 'x', 'x')) 230 | self.output = theano.tensor.signal.pool.pool_2d(input=conv_out_tanh, ds=self.poolsize, ignore_border=True,mode="max") 231 | elif self.non_linear=="relu": 232 | conv_out_tanh = ReLU(conv_out + self.b.dimshuffle('x', 0, 'x', 'x')) 233 | self.output =theano.tensor.signal.pool.pool_2d(input=conv_out_tanh, ds=self.poolsize, ignore_border=True,mode="max") 234 | else: 235 | pooled_out = theano.tensor.signal.pool.pool_2d(input=conv_out, ds=self.poolsize, ignore_border=True,mode="max") 236 | self.output = pooled_out + self.b.dimshuffle('x', 0, 'x', 'x') 237 | return self.output 238 | 239 | 240 | def predict(self, new_data, batch_size): 241 | """ 242 | predict for new data 243 | """ 244 | img_shape = (batch_size, 1, self.image_shape[2], self.image_shape[3]) 245 | conv_out = conv.conv2d(input=new_data, filters=self.W, filter_shape=self.filter_shape, image_shape=img_shape) 246 | if self.non_linear=="tanh": 247 | conv_out_tanh = T.tanh(conv_out + self.b.dimshuffle('x', 0, 'x', 'x')) 248 | output = theano.tensor.signal.pool.pool_2d(input=conv_out_tanh, ds=self.poolsize, ignore_border=True) 249 | elif self.non_linear=="relu": 250 | conv_out_tanh = ReLU(conv_out + self.b.dimshuffle('x', 0, 'x', 'x')) 251 | 
output = theano.tensor.signal.pool.pool_2d(input=conv_out_tanh, ds=self.poolsize, ignore_border=True) 252 | else: 253 | pooled_out = theano.tensor.signal.pool.pool_2d(input=conv_out, ds=self.poolsize, ignore_border=True) 254 | output = pooled_out + self.b.dimshuffle('x', 0, 'x', 'x') 255 | return output -------------------------------------------------------------------------------- /theano_src/Classifier.py: -------------------------------------------------------------------------------- 1 | """ 2 | This tutorial introduces the multilayer perceptron using Theano. 3 | 4 | A multilayer perceptron is a logistic regressor where 5 | instead of feeding the input to the logistic regression you insert a 6 | intermediate layer, called the hidden layer, that has a nonlinear 7 | activation function (usually tanh or sigmoid) . One can use many such 8 | hidden layers making the architecture deep. The tutorial will also tackle 9 | the problem of MNIST digit classification. 10 | 11 | .. math:: 12 | 13 | f(x) = G( b^{(2)} + W^{(2)}( s( b^{(1)} + W^{(1)} x))), 14 | 15 | References: 16 | 17 | - textbooks: "Pattern Recognition and Machine Learning" - 18 | Christopher M. 
Bishop, section 5 19 | 20 | """ 21 | 22 | from __future__ import print_function 23 | 24 | __docformat__ = 'restructuredtext en' 25 | 26 | 27 | import os 28 | import sys 29 | import timeit 30 | 31 | import numpy 32 | 33 | import theano 34 | import theano.tensor as T 35 | 36 | 37 | from logistic_sgd import LogisticRegression, load_data 38 | 39 | 40 | 41 | 42 | def _dropout_from_layer(rng, layer, p): 43 | """p is the probability of dropping a unit 44 | """ 45 | srng = theano.tensor.shared_randomstreams.RandomStreams( 46 | rng.randint(999999)) 47 | # p=1-p because 1's indicate keep and p is prob of dropping 48 | mask = srng.binomial(n=1, p=1-p, size=layer.shape) 49 | # The cast is important because 50 | # int * float32 = float64 which pulls things off the gpu 51 | output = layer * T.cast(mask, theano.config.floatX) 52 | return output 53 | # start-snippet-1 54 | class HiddenLayer(object): 55 | def __init__(self, rng, input, n_in, n_out, W=None, b=None, 56 | activation=T.tanh): 57 | """ 58 | Typical hidden layer of a MLP: units are fully-connected and have 59 | sigmoidal activation function. Weight matrix W is of shape (n_in,n_out) 60 | and the bias vector b is of shape (n_out,). 
61 | 62 | NOTE : The nonlinearity used here is tanh 63 | 64 | Hidden unit activation is given by: tanh(dot(input,W) + b) 65 | 66 | :type rng: numpy.random.RandomState 67 | :param rng: a random number generator used to initialize weights 68 | 69 | :type input: theano.tensor.dmatrix 70 | :param input: a symbolic tensor of shape (n_examples, n_in) 71 | 72 | :type n_in: int 73 | :param n_in: dimensionality of input 74 | 75 | :type n_out: int 76 | :param n_out: number of hidden units 77 | 78 | :type activation: theano.Op or function 79 | :param activation: Non linearity to be applied in the hidden 80 | layer 81 | """ 82 | self.input = input 83 | # end-snippet-1 84 | 85 | # `W` is initialized with `W_values`, which is uniformly sampled 86 | # from the interval [-sqrt(6./(n_in+n_out)), sqrt(6./(n_in+n_out))] 87 | # for the tanh activation function 88 | # the output of uniform is converted using asarray to dtype 89 | # theano.config.floatX so that the code is runnable on GPU 90 | # Note : optimal initialization of weights is dependent on the 91 | # activation function used (among other things). 92 | # For example, results presented in [Xavier10] suggest that you 93 | # should use 4 times larger initial weights for sigmoid 94 | # compared to tanh 95 | # We have no info for other functions, so we use the same as 96 | # tanh. 97 | if W is None: 98 | W_values = numpy.asarray( 99 | rng.uniform( 100 | low=-numpy.sqrt(6. / (n_in + n_out)), 101 | high=numpy.sqrt(6. 
/ (n_in + n_out)), 102 | size=(n_in, n_out) 103 | ), 104 | dtype=theano.config.floatX 105 | ) 106 | if activation == theano.tensor.nnet.sigmoid: 107 | W_values *= 4 108 | 109 | W = theano.shared(value=W_values, name='W', borrow=True) 110 | 111 | if b is None: 112 | b_values = numpy.zeros((n_out,), dtype=theano.config.floatX) 113 | b = theano.shared(value=b_values, name='b', borrow=True) 114 | 115 | self.W = W 116 | self.b = b 117 | 118 | lin_output = T.dot(input, self.W) + self.b 119 | self.output = ( 120 | lin_output if activation is None 121 | else activation(lin_output) 122 | ) 123 | #self.output = _dropout_from_layer(rng,self.output,0.3) 124 | 125 | # parameters of the model 126 | self.params = [self.W, self.b] 127 | class HiddenLayer2(object): 128 | def __init__(self, rng, n_in, n_out, W=None, b=None, 129 | activation=T.tanh): 130 | """ 131 | Typical hidden layer of a MLP: units are fully-connected and have 132 | sigmoidal activation function. Weight matrix W is of shape (n_in,n_out) 133 | and the bias vector b is of shape (n_out,). 
134 | 135 | NOTE : The nonlinearity used here is tanh 136 | 137 | Hidden unit activation is given by: tanh(dot(input,W) + b) 138 | 139 | :type rng: numpy.random.RandomState 140 | :param rng: a random number generator used to initialize weights 141 | 142 | :type input: theano.tensor.dmatrix 143 | :param input: a symbolic tensor of shape (n_examples, n_in) 144 | 145 | :type n_in: int 146 | :param n_in: dimensionality of input 147 | 148 | :type n_out: int 149 | :param n_out: number of hidden units 150 | 151 | :type activation: theano.Op or function 152 | :param activation: Non linearity to be applied in the hidden 153 | layer 154 | """ 155 | # end-snippet-1 156 | 157 | # `W` is initialized with `W_values`, which is uniformly sampled 158 | # from the interval [-sqrt(6./(n_in+n_out)), sqrt(6./(n_in+n_out))] 159 | # for the tanh activation function 160 | # the output of uniform is converted using asarray to dtype 161 | # theano.config.floatX so that the code is runnable on GPU 162 | # Note : optimal initialization of weights is dependent on the 163 | # activation function used (among other things). 164 | # For example, results presented in [Xavier10] suggest that you 165 | # should use 4 times larger initial weights for sigmoid 166 | # compared to tanh 167 | # We have no info for other functions, so we use the same as 168 | # tanh. 169 | if W is None: 170 | W_values = numpy.asarray( 171 | rng.uniform( 172 | low=-numpy.sqrt(6. / (n_in + n_out)), 173 | high=numpy.sqrt(6. 
/ (n_in + n_out)), 174 | size=(n_in, n_out) 175 | ), 176 | dtype=theano.config.floatX 177 | ) 178 | if activation == theano.tensor.nnet.sigmoid: 179 | W_values *= 4 180 | 181 | W = theano.shared(value=W_values, name='W', borrow=True) 182 | 183 | if b is None: 184 | b_values = numpy.zeros((n_out,), dtype=theano.config.floatX) 185 | b = theano.shared(value=b_values, name='b', borrow=True) 186 | 187 | self.W = W 188 | self.b = b 189 | self.activation = activation 190 | 191 | self.params = [self.W, self.b] 192 | 193 | def __call__(self, input): 194 | lin_output = T.dot(input, self.W) + self.b 195 | return self.activation(lin_output) 196 | class TensorClassifier(object): 197 | """Multi-Layer Perceptron Class 198 | 199 | A multilayer perceptron is a feedforward artificial neural network model 200 | that has one layer or more of hidden units and nonlinear activations. 201 | Intermediate layers usually have as activation function tanh or the 202 | sigmoid function (defined here by a ``HiddenLayer`` class) while the 203 | top layer is a softmax layer (defined here by a ``LogisticRegression`` 204 | class). 205 | """ 206 | 207 | def __init__(self, rng, n_left,n_right,dim_tensor=1,activate_func = 'tanh'): 208 | self.dim_tensor = dim_tensor 209 | self.activation = activate_func 210 | 211 | W_values = numpy.asarray( 212 | numpy.random.randn(n_left,self.dim_tensor,n_right) * 0.05, 213 | # rng.uniform( 214 | # low=-numpy.sqrt(6. / (n_left+n_right)), 215 | # high=numpy.sqrt(6. / (n_left+n_right)), 216 | # size=(n_left, self.dim_tensor, n_right) 217 | # ), 218 | dtype=theano.config.floatX 219 | ) 220 | 221 | self.W = theano.shared(value=W_values, name='W', borrow=True) 222 | self.W2 = theano.shared(value=numpy.asarray( 223 | rng.uniform( 224 | low=-numpy.sqrt(6. / (n_left+n_right)), 225 | high=numpy.sqrt(6. 
/ (n_left+n_right)), 226 | size=(n_left + n_right, self.dim_tensor) 227 | ), 228 | dtype=theano.config.floatX 229 | ), borrow=True) 230 | self.params = [self.W,self.W2] 231 | 232 | def __call__(self,left,right,batch_size, *args, **kwargs): 233 | tmp1 = T.dot(T.concatenate([left,right],1),self.W2) 234 | tmp2 = T.batched_dot(T.tensordot(left,self.W,[1,0]),right) 235 | tmp2 = theano.tensor.reshape(tmp2,(batch_size,self.dim_tensor)) 236 | if self.activation == 'tanh': 237 | return theano.tensor.tanh(tmp2 + tmp1) 238 | if self.activation == 'relu': 239 | return T.maximum(0.0, tmp2) 240 | # start-snippet-2 241 | class MLP(object): 242 | """Multi-Layer Perceptron Class 243 | 244 | A multilayer perceptron is a feedforward artificial neural network model 245 | that has one layer or more of hidden units and nonlinear activations. 246 | Intermediate layers usually have as activation function tanh or the 247 | sigmoid function (defined here by a ``HiddenLayer`` class) while the 248 | top layer is a softmax layer (defined here by a ``LogisticRegression`` 249 | class). 
250 | """ 251 | 252 | def __init__(self, rng, input, n_in, n_hidden, n_out): 253 | """Initialize the parameters for the multilayer perceptron 254 | 255 | :type rng: numpy.random.RandomState 256 | :param rng: a random number generator used to initialize weights 257 | 258 | :type input: theano.tensor.TensorType 259 | :param input: symbolic variable that describes the input of the 260 | architecture (one minibatch) 261 | 262 | :type n_in: int 263 | :param n_in: number of input units, the dimension of the space in 264 | which the datapoints lie 265 | 266 | :type n_hidden: int 267 | :param n_hidden: number of hidden units 268 | 269 | :type n_out: int 270 | :param n_out: number of output units, the dimension of the space in 271 | which the labels lie 272 | 273 | """ 274 | 275 | # Since we are dealing with a one hidden layer MLP, this will translate 276 | # into a HiddenLayer with a tanh activation function connected to the 277 | # LogisticRegression layer; the activation function can be replaced by 278 | # sigmoid or any other nonlinear function 279 | self.hiddenLayer = HiddenLayer( 280 | rng=rng, 281 | input=input, 282 | n_in=n_in, 283 | n_out=n_hidden, 284 | activation=T.tanh 285 | ) 286 | 287 | # The logistic regression layer gets as input the hidden units 288 | # of the hidden layer 289 | self.logRegressionLayer = LogisticRegression( 290 | input=self.hiddenLayer.output, 291 | n_in=n_hidden, 292 | n_out=n_out, 293 | rng = rng 294 | ) 295 | # end-snippet-2 start-snippet-3 296 | # L1 norm ; one regularization option is to enforce L1 norm to 297 | # be small 298 | self.L1 = ( 299 | abs(self.hiddenLayer.W).sum() 300 | + abs(self.logRegressionLayer.W).sum() 301 | ) 302 | 303 | # square of L2 norm ; one regularization option is to enforce 304 | # square of L2 norm to be small 305 | self.L2_sqr = ( 306 | (self.hiddenLayer.W ** 2).sum() 307 | + (self.logRegressionLayer.W ** 2).sum() 308 | ) 309 | 310 | # negative log likelihood of the MLP is given by the negative 311 | # log 
likelihood of the output of the model, computed in the 312 | # logistic regression layer 313 | self.negative_log_likelihood = ( 314 | self.logRegressionLayer.negative_log_likelihood 315 | ) 316 | # same holds for the function computing the number of errors 317 | self.errors = self.logRegressionLayer.errors 318 | 319 | # the parameters of the model are the parameters of the two layers it is 320 | # made out of 321 | self.params = self.hiddenLayer.params + self.logRegressionLayer.params 322 | # end-snippet-3 323 | 324 | # keep track of model input 325 | self.input = input 326 | def ortho_weight(ndim): 327 | W = numpy.random.randn(ndim, ndim) 328 | u, s, v = numpy.linalg.svd(W) 329 | return u.astype('float32') 330 | class BilinearLR(object): 331 | """ 332 | Bilinear Formed Logistic Regression Class 333 | """ 334 | 335 | def __init__(self, rng, linp, rinp, n_in, n_out, W=None, b=None): 336 | """ Initialize the parameters of the logistic regression 337 | 338 | :type linp: theano.tensor.TensorType 339 | :param linp: symbolic variable that describes the left input of the 340 | architecture (one minibatch) 341 | 342 | :type rinp: theano.tensor.TensorType 343 | :param rinp: symbolic variable that describes the right input of the 344 | architecture (one minibatch) 345 | 346 | :type n_in: int 347 | :param n_in: number of left input units 348 | 349 | :type n_out: int 350 | :param n_out: number of right input units 351 | 352 | """ 353 | 354 | # initialize W with shape (n_in, n_out): orthogonal if n_in == n_out, otherwise uniform in [-W_bound, W_bound] 355 | if W is None: 356 | if n_in == n_out: 357 | self.W = theano.shared(ortho_weight(n_in),borrow=True) 358 | else: 359 | W_bound = numpy.sqrt(6. 
/ (n_in+n_out)) 360 | self.W = theano.shared(numpy.asarray(rng.uniform(low=-W_bound,high=W_bound,size=(n_in,n_out 361 | )), 362 | dtype=theano.config.floatX),borrow=True) 363 | else: 364 | self.W = W 365 | 366 | if b is None: 367 | self.b = theano.shared(value=0., name='b') 368 | self.b = theano.tensor.addbroadcast(self.b) 369 | #self.b = theano.tensor.set_subtensor(self.b,0.) 370 | else: 371 | self.b = b 372 | 373 | # compute vector of class-membership probabilities in symbolic form 374 | self.p_y_given_x = T.nnet.sigmoid(T.batched_dot(T.dot(linp, self.W), rinp)+ self.b) 375 | self.predict_y = T.round(self.p_y_given_x) 376 | 377 | # parameters of the model 378 | self.params = [self.W, self.b] 379 | 380 | def errors(self,y): 381 | if y.dtype.startswith('int'): 382 | return T.mean(T.neq(self.predict_y,y)) 383 | else: 384 | raise NotImplementedError 385 | 386 | def predict(self, ldata, rdata): 387 | p_y_given_x = T.nnet.sigmoid(T.dot(T.dot(ldata, self.W), rdata.T).diagonal() + self.b) 388 | return p_y_given_x 389 | 390 | def get_cost(self, y): 391 | # cross-entropy loss 392 | L = - T.mean(y * T.log(self.p_y_given_x) + (1 - y) * T.log(1 - self.p_y_given_x)) 393 | return L 394 | 395 | 396 | def test_mlp(learning_rate=0.01, L1_reg=0.00, L2_reg=0.0001, n_epochs=1000, 397 | dataset='mnist.pkl.gz', batch_size=20, n_hidden=500): 398 | """ 399 | Demonstrate stochastic gradient descent optimization for a multilayer 400 | perceptron 401 | 402 | This is demonstrated on MNIST. 
403 | 404 | :type learning_rate: float 405 | :param learning_rate: learning rate used (factor for the stochastic 406 | gradient 407 | 408 | :type L1_reg: float 409 | :param L1_reg: L1-norm's weight when added to the cost (see 410 | regularization) 411 | 412 | :type L2_reg: float 413 | :param L2_reg: L2-norm's weight when added to the cost (see 414 | regularization) 415 | 416 | :type n_epochs: int 417 | :param n_epochs: maximal number of epochs to run the optimizer 418 | 419 | :type dataset: string 420 | :param dataset: the path of the MNIST dataset file from 421 | http://www.iro.umontreal.ca/~lisa/deep/data/mnist/mnist.pkl.gz 422 | 423 | 424 | """ 425 | datasets = load_data(dataset) 426 | 427 | train_set_x, train_set_y = datasets[0] 428 | valid_set_x, valid_set_y = datasets[1] 429 | test_set_x, test_set_y = datasets[2] 430 | 431 | # compute number of minibatches for training, validation and testing 432 | n_train_batches = train_set_x.get_value(borrow=True).shape[0] // batch_size 433 | n_valid_batches = valid_set_x.get_value(borrow=True).shape[0] // batch_size 434 | n_test_batches = test_set_x.get_value(borrow=True).shape[0] // batch_size 435 | 436 | ###################### 437 | # BUILD ACTUAL MODEL # 438 | ###################### 439 | print('... 
building the model') 440 | 441 | # allocate symbolic variables for the data 442 | index = T.lscalar() # index to a [mini]batch 443 | x = T.matrix('x') # the data is presented as rasterized images 444 | y = T.ivector('y') # the labels are presented as 1D vector of 445 | # [int] labels 446 | 447 | rng = numpy.random.RandomState(1234) 448 | 449 | # construct the MLP class 450 | classifier = MLP( 451 | rng=rng, 452 | input=x, 453 | n_in=28 * 28, 454 | n_hidden=n_hidden, 455 | n_out=10 456 | ) 457 | 458 | # start-snippet-4 459 | # the cost we minimize during training is the negative log likelihood of 460 | # the model plus the regularization terms (L1 and L2); cost is expressed 461 | # here symbolically 462 | cost = ( 463 | classifier.negative_log_likelihood(y) 464 | + L1_reg * classifier.L1 465 | + L2_reg * classifier.L2_sqr 466 | ) 467 | # end-snippet-4 468 | 469 | # compiling a Theano function that computes the mistakes that are made 470 | # by the model on a minibatch 471 | test_model = theano.function( 472 | inputs=[index], 473 | outputs=classifier.errors(y), 474 | givens={ 475 | x: test_set_x[index * batch_size:(index + 1) * batch_size], 476 | y: test_set_y[index * batch_size:(index + 1) * batch_size] 477 | } 478 | ) 479 | 480 | validate_model = theano.function( 481 | inputs=[index], 482 | outputs=classifier.errors(y), 483 | givens={ 484 | x: valid_set_x[index * batch_size:(index + 1) * batch_size], 485 | y: valid_set_y[index * batch_size:(index + 1) * batch_size] 486 | } 487 | ) 488 | 489 | # start-snippet-5 490 | # compute the gradient of cost with respect to theta (sotred in params) 491 | # the resulting gradients will be stored in a list gparams 492 | gparams = [T.grad(cost, param) for param in classifier.params] 493 | 494 | # specify how to update the parameters of the model as a list of 495 | # (variable, update expression) pairs 496 | 497 | # given two lists of the same length, A = [a1, a2, a3, a4] and 498 | # B = [b1, b2, b3, b4], zip generates a list C of 
same size, where each 499 | # element is a pair formed from the two lists : 500 | # C = [(a1, b1), (a2, b2), (a3, b3), (a4, b4)] 501 | updates = [ 502 | (param, param - learning_rate * gparam) 503 | for param, gparam in zip(classifier.params, gparams) 504 | ] 505 | 506 | # compiling a Theano function `train_model` that returns the cost, but 507 | # in the same time updates the parameter of the model based on the rules 508 | # defined in `updates` 509 | train_model = theano.function( 510 | inputs=[index], 511 | outputs=cost, 512 | updates=updates, 513 | givens={ 514 | x: train_set_x[index * batch_size: (index + 1) * batch_size], 515 | y: train_set_y[index * batch_size: (index + 1) * batch_size] 516 | } 517 | ) 518 | # end-snippet-5 519 | 520 | ############### 521 | # TRAIN MODEL # 522 | ############### 523 | print('... training') 524 | 525 | # early-stopping parameters 526 | patience = 10000 # look as this many examples regardless 527 | patience_increase = 2 # wait this much longer when a new best is 528 | # found 529 | improvement_threshold = 0.995 # a relative improvement of this much is 530 | # considered significant 531 | validation_frequency = min(n_train_batches, patience // 2) 532 | # go through this many 533 | # minibatche before checking the network 534 | # on the validation set; in this case we 535 | # check every epoch 536 | 537 | best_validation_loss = numpy.inf 538 | best_iter = 0 539 | test_score = 0. 
540 | start_time = timeit.default_timer() 541 | 542 | epoch = 0 543 | done_looping = False 544 | 545 | while (epoch < n_epochs) and (not done_looping): 546 | epoch = epoch + 1 547 | for minibatch_index in range(n_train_batches): 548 | 549 | minibatch_avg_cost = train_model(minibatch_index) 550 | # iteration number 551 | iter = (epoch - 1) * n_train_batches + minibatch_index 552 | 553 | if (iter + 1) % validation_frequency == 0: 554 | # compute zero-one loss on validation set 555 | validation_losses = [validate_model(i) for i 556 | in range(n_valid_batches)] 557 | this_validation_loss = numpy.mean(validation_losses) 558 | 559 | print( 560 | 'epoch %i, minibatch %i/%i, validation error %f %%' % 561 | ( 562 | epoch, 563 | minibatch_index + 1, 564 | n_train_batches, 565 | this_validation_loss * 100. 566 | ) 567 | ) 568 | 569 | # if we got the best validation score until now 570 | if this_validation_loss < best_validation_loss: 571 | #improve patience if loss improvement is good enough 572 | if ( 573 | this_validation_loss < best_validation_loss * 574 | improvement_threshold 575 | ): 576 | patience = max(patience, iter * patience_increase) 577 | 578 | best_validation_loss = this_validation_loss 579 | best_iter = iter 580 | 581 | # test it on the test set 582 | test_losses = [test_model(i) for i 583 | in range(n_test_batches)] 584 | test_score = numpy.mean(test_losses) 585 | 586 | print((' epoch %i, minibatch %i/%i, test error of ' 587 | 'best model %f %%') % 588 | (epoch, minibatch_index + 1, n_train_batches, 589 | test_score * 100.)) 590 | 591 | if patience <= iter: 592 | done_looping = True 593 | break 594 | 595 | end_time = timeit.default_timer() 596 | print(('Optimization complete. 
Best validation score of %f %% ' 597 | 'obtained at iteration %i, with test performance %f %%') % 598 | (best_validation_loss * 100., best_iter + 1, test_score * 100.)) 599 | print(('The code for file ' + 600 | os.path.split(__file__)[1] + 601 | ' ran for %.2fm' % ((end_time - start_time) / 60.)), file=sys.stderr) 602 | 603 | 604 | if __name__ == '__main__': 605 | test_mlp() -------------------------------------------------------------------------------- /theano_src/Optimization.py: -------------------------------------------------------------------------------- 1 | import theano 2 | import theano.tensor as T 3 | from collections import defaultdict, OrderedDict 4 | import numpy as np 5 | 6 | def as_floatX(variable): 7 | if isinstance(variable, float): 8 | return np.cast[theano.config.floatX](variable) 9 | 10 | if isinstance(variable, np.ndarray): 11 | return np.cast[theano.config.floatX](variable) 12 | return theano.tensor.cast(variable, theano.config.floatX) 13 | 14 | 15 | 16 | class RMSprop(object): 17 | def RMSprop(self,cost, params, lr=0.001, rho=0.9, epsilon=1e-6): 18 | grads = T.grad(cost=cost, wrt=params) 19 | updates = [] 20 | for p, g in zip(params, grads): 21 | acc = theano.shared(p.get_value() * 0.) 22 | acc_new = rho * acc + (1 - rho) * g ** 2 23 | gradient_scaling = T.sqrt(acc_new + epsilon) 24 | g = g / gradient_scaling 25 | updates.append((acc, acc_new)) 26 | updates.append((p, p - lr * g)) 27 | return updates 28 | 29 | 30 | class Adam(object): 31 | def Adam(self,cost, params, lr=0.0002, b1=0.1, b2=0.001, e=1e-8): 32 | updates = [] 33 | grads = T.grad(cost, params) 34 | i = theano.shared(as_floatX(0.)) 35 | i_t = i + 1. 36 | fix1 = 1. - (1. - b1)**i_t 37 | fix2 = 1. - (1. - b2)**i_t 38 | lr_t = lr * (T.sqrt(fix2) / fix1) 39 | for p, g in zip(params, grads): 40 | m = theano.shared(p.get_value() * 0.) 41 | v = theano.shared(p.get_value() * 0.) 42 | m_t = (b1 * g) + ((1. - b1) * m) 43 | v_t = (b2 * T.sqr(g)) + ((1. 
- b2) * v) 44 | g_t = m_t / (T.sqrt(v_t) + e) 45 | p_t = p - (lr_t * g_t) 46 | updates.append((m, m_t)) 47 | updates.append((v, v_t)) 48 | updates.append((p, p_t)) 49 | updates.append((i, i_t)) 50 | return updates 51 | 52 | class Adadelta(object): 53 | def sgd_updates_adadelta(self,params,cost,rho=0.95,epsilon=1e-6,norm_lim=9,word_vec_name='Words'): 54 | """ 55 | adadelta update rule, mostly from 56 | https://groups.google.com/forum/#!topic/pylearn-dev/3QbKtCumAW4 (for Adadelta) 57 | """ 58 | updates = OrderedDict({}) 59 | exp_sqr_grads = OrderedDict({}) 60 | exp_sqr_ups = OrderedDict({}) 61 | gparams = [] 62 | for param in params: 63 | empty = np.zeros_like(param.get_value()) 64 | exp_sqr_grads[param] = theano.shared(value=as_floatX(empty),name="exp_grad_%s" % param.name) 65 | gp = T.grad(cost, param) 66 | exp_sqr_ups[param] = theano.shared(value=as_floatX(empty), name="exp_grad_%s" % param.name) 67 | gparams.append(gp) 68 | for param, gp in zip(params, gparams): 69 | exp_sg = exp_sqr_grads[param] 70 | exp_su = exp_sqr_ups[param] 71 | up_exp_sg = rho * exp_sg + (1 - rho) * T.sqr(gp) 72 | updates[exp_sg] = up_exp_sg 73 | step = -(T.sqrt(exp_su + epsilon) / T.sqrt(up_exp_sg + epsilon)) * gp 74 | updates[exp_su] = rho * exp_su + (1 - rho) * T.sqr(step) 75 | stepped_param = param + step 76 | if (param.get_value(borrow=True).ndim == 2) and (param.name!='Words'): 77 | col_norms = T.sqrt(T.sum(T.sqr(stepped_param), axis=0)) 78 | desired_norms = T.clip(col_norms, 0, T.sqrt(norm_lim)) 79 | scale = desired_norms / (1e-7 + col_norms) 80 | updates[param] = stepped_param * scale 81 | else: 82 | updates[param] = stepped_param 83 | return updates 84 | -------------------------------------------------------------------------------- /theano_src/PreProcess.py: -------------------------------------------------------------------------------- 1 | import cPickle 2 | from collections import defaultdict 3 | import logging 4 | import theano 5 | import gensim 6 | import numpy as np 7 | 
from random import shuffle 8 | from gensim.models.word2vec import Word2Vec 9 | import codecs 10 | logger = logging.getLogger('relevance_logger') 11 | 12 | 13 | def build_multiturn_data(trainfile, max_len = 100,isshuffle=False): 14 | revs = [] 15 | vocab = defaultdict(float) 16 | total = 1 17 | with codecs.open(trainfile,'r','utf-8') as f: 18 | for line in f: 19 | line = line.replace("_","") 20 | parts = line.strip().split("\t") 21 | 22 | label = parts[0] 23 | message = "" 24 | words = set() 25 | for i in range(1,len(parts)-1,1): 26 | message += "_t_" 27 | message += parts[i] 28 | words.update(set(parts[i].split())) 29 | 30 | response = parts[-1] 31 | 32 | data = {"y" : label, "m":message,"r": response} 33 | revs.append(data) 34 | total += 1 35 | if total % 10000 == 0: 36 | print total 37 | #words = set(message.split()) 38 | words.update(set(response.split())) 39 | 40 | for word in words: 41 | vocab[word] += 1 42 | logger.info("processed dataset with %d question-answer pairs " %(len(revs))) 43 | logger.info("vocab size: %d" %(len(vocab))) 44 | if isshuffle: 45 | shuffle(revs) 46 | return revs, vocab, max_len 47 | 48 | 49 | def build_data(trainfile, max_len = 20,isshuffle=False): 50 | revs = [] 51 | vocab = defaultdict(float) 52 | total = 1 53 | with codecs.open(trainfile,'r','utf-8') as f: 54 | for line in f: 55 | line = line.replace("_","") 56 | parts = line.strip().split("\t") 57 | 58 | topic = parts[0] 59 | topic_r = parts[1] 60 | label = parts[2] 61 | message = parts[-2] 62 | response = parts[-1] 63 | 64 | data = {"y" : label, "m":message,"r": response,"t":topic,"t2":topic_r} 65 | revs.append(data) 66 | total += 1 67 | 68 | words = set(message.split()) 69 | words.update(set(response.split())) 70 | for word in words: 71 | vocab[word] += 1 72 | logger.info("processed dataset with %d question-answer pairs " %(len(revs))) 73 | logger.info("vocab size: %d" %(len(vocab))) 74 | if isshuffle: 75 | shuffle(revs) 76 | return revs, vocab, max_len 77 | 78 | 
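The record format that `build_multiturn_data` reads can be illustrated with a small standalone sketch. It is written for Python 3 and is not part of the repository: the helper names `parse_multiturn_line` and `count_document_frequency` are hypothetical, and the sketch assumes that the vocabulary update happens once per record, which is how the flattened listing above appears to read.

```python
from collections import Counter


def parse_multiturn_line(line):
    """Split one tab-separated record into a data dict and its context turns.

    Assumed layout (taken from the listing): label, turn 1, ..., turn n, response.
    """
    parts = line.replace("_", "").strip().split("\t")
    label, turns, response = parts[0], parts[1:-1], parts[-1]
    # Each context turn is prefixed with the "_t_" marker, as in the original loop.
    message = "".join("_t_" + turn for turn in turns)
    return {"y": label, "m": message, "r": response}, turns


def count_document_frequency(lines):
    """Count, for every word, the number of records in which it occurs."""
    vocab = Counter()
    for line in lines:
        record, turns = parse_multiturn_line(line)
        words = set()
        for turn in turns:
            words.update(turn.split())
        words.update(record["r"].split())
        vocab.update(words)  # each word counted at most once per record
    return vocab


sample = [
    "1\thow are you\tfine thanks\tgood to hear",
    "0\thow are you\tfine thanks\tbuy now",
]
record, _ = parse_multiturn_line(sample[0])
vocab = count_document_frequency(sample)
print(record)
print(vocab["how"], vocab["buy"])
```

Because the words of a record are collected in a set before counting, the resulting numbers are document frequencies rather than raw token counts.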
class WordVecs(object): 79 | def __init__(self, fname, vocab, binary, gensim): 80 | if gensim: 81 | word_vecs = self.load_gensim(fname,vocab) 82 | self.k = len(word_vecs.values()[0]) 83 | self.W, self.word_idx_map = self.get_W(word_vecs, k=self.k) 84 | 85 | def get_W(self, word_vecs, k=300): 86 | """ 87 | Get word matrix. W[i] is the vector for word indexed by i 88 | """ 89 | vocab_size = len(word_vecs) 90 | word_idx_map = dict() 91 | W = np.zeros(shape=(vocab_size+1, k)) 92 | W[0] = np.zeros(k) 93 | i = 1 94 | for word in word_vecs: 95 | W[i] = word_vecs[word] 96 | word_idx_map[word] = i 97 | i += 1 98 | return W, word_idx_map 99 | 100 | def load_gensim(self, fname, vocab): 101 | model = Word2Vec.load(fname) 102 | weights = [[0.] * model.vector_size] 103 | word_vecs = {} 104 | total_inside_new_embed = 0 105 | miss= 0 106 | for pair in vocab: 107 | word = gensim.utils.to_unicode(pair) 108 | if word in model: 109 | total_inside_new_embed += 1 110 | word_vecs[pair] = np.array([w for w in model[word]]) 111 | #weights.append([w for w in model[word]]) 112 | else: 113 | miss = miss + 1 114 | word_vecs[pair] = np.array([0.] * model.vector_size) 115 | #weights.append([0.] 
* model.vector_size) 116 | print 'transfer', total_inside_new_embed, 'words from the embedding file, total', len(vocab), 'candidate' 117 | print 'miss word2vec', miss 118 | return word_vecs 119 | 120 | def createtopicvec(): 121 | max_topicword = 50 122 | model = Word2Vec.load_word2vec_format(r"\\msra-sandvm-001\v-wuyu\Models\W2V\Ubuntu\word2vec.model") 123 | topicmatrix = np.zeros(shape=(100,max_topicword,100),dtype=theano.config.floatX) 124 | file = open(r"\\msra-sandvm-001\v-wuyu\project\pythonproject\ACL2016\mergedic2.txt") 125 | i = 0 126 | miss = 0 127 | for line in file: 128 | tmp = line.strip().split(' ') 129 | for j in range(min(len(tmp),max_topicword)): 130 | if gensim.utils.to_unicode(tmp[j]) in model.vocab: 131 | topicmatrix[i,j,:] = model[gensim.utils.to_unicode(tmp[j])] 132 | else: 133 | miss = miss+1 134 | 135 | i= i+1 136 | print "miss word2vec", miss 137 | return topicmatrix 138 | 139 | def ParseSingleTurn(): 140 | logging.basicConfig(format='%(asctime)s : %(levelname)s : %(message)s', level=logging.INFO) 141 | revs, vocab, max_len = build_data(r"\\msra-sandvm-001\v-wuyu\Data\ubuntu_data\ubuntu_data\train.topic",isshuffle=True) 142 | word2vec = WordVecs(r"\\msra-sandvm-001\v-wuyu\Models\W2V\Ubuntu\word2vec.model", vocab, True, True) 143 | cPickle.dump([revs, word2vec, max_len,createtopicvec()], open("ubuntu_data.test",'wb')) 144 | logger.info("dataset created!") 145 | 146 | def ParseMultiTurn(): 147 | logging.basicConfig(format='%(asctime)s : %(levelname)s : %(message)s', level=logging.INFO) 148 | revs, vocab, max_len = build_multiturn_data(r"\\msra-sandvm-001\v-wuyu\Data\ubuntu_data\ubuntu_data\test.txt",isshuffle=False) 149 | word2vec = WordVecs(r"\\msra-sandvm-001\v-wuyu\Models\W2V\Ubuntu\word2vec.model", vocab, True, True) 150 | cPickle.dump([revs, word2vec, max_len], open("ubuntu_data.mul.test",'wb')) 151 | logger.info("dataset created!") 152 | 153 | if __name__=="__main__": 154 | ParseMultiTurn() 
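The indexing convention used by `WordVecs.get_W` (row 0 is an all-zero vector, real words start at index 1) can be sketched without gensim or Theano. This is a minimal Python 3 illustration, not the repository's code: `build_embedding_matrix` mirrors `get_W`, while `encode` is a hypothetical helper, and its choice to drop unknown words and pad with index 0 is an assumption made for the example.

```python
import numpy as np


def build_embedding_matrix(word_vecs, dim):
    """Return (W, word_idx_map); row 0 of W stays zero and serves as padding."""
    W = np.zeros((len(word_vecs) + 1, dim))
    word_idx_map = {}
    for i, (word, vec) in enumerate(word_vecs.items(), start=1):
        W[i] = vec
        word_idx_map[word] = i
    return W, word_idx_map


def encode(sentence, word_idx_map, max_len):
    """Map words to row indices, skipping unknown words and padding with 0."""
    ids = [word_idx_map[w] for w in sentence.split() if w in word_idx_map]
    ids = ids[:max_len]
    return ids + [0] * (max_len - len(ids))


# Toy vectors standing in for the word2vec lookups done in load_gensim.
toy_vecs = {"hello": np.ones(4), "world": np.full(4, 2.0)}
W, word_idx_map = build_embedding_matrix(toy_vecs, dim=4)
ids = encode("hello there world", word_idx_map, max_len=5)
print(W.shape, ids)
```

Reserving index 0 means that a padded position looks up a zero vector, so padding contributes nothing to dot products computed on the embedded sequence.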
-------------------------------------------------------------------------------- /theano_src/RNN.py: -------------------------------------------------------------------------------- 1 | import numpy as np 2 | import theano 3 | import theano.tensor as T 4 | from sklearn.base import BaseEstimator 5 | import logging 6 | import time 7 | import os 8 | import datetime 9 | import cPickle as pickle 10 | 11 | def ortho_weight(ndim): 12 | W = np.random.randn(ndim, ndim) 13 | u, s, v = np.linalg.svd(W) 14 | return u.astype('float32') 15 | 16 | 17 | # weight initializer, normal by default 18 | def norm_weight(nin, nout=None, scale=0.01, ortho=False): 19 | if nout is None: 20 | nout = nin 21 | if nout == nin and ortho: 22 | W = ortho_weight(nin) 23 | else: 24 | W = scale * np.random.randn(nin, nout) 25 | return W.astype('float32') 26 | def uniform_weight(size,scale=0.1): 27 | return np.random.uniform(size=size,low=-scale, high=scale).astype(theano.config.floatX) 28 | 29 | 30 | def glorot_uniform(size): 31 | fan_in, fan_out = size 32 | s = np.sqrt(6. 
/ (fan_in + fan_out)) 33 | return np.random.uniform(size=size,low=-s, high=s).astype(theano.config.floatX) 34 | 35 | 36 | class BiGRU(object): 37 | def __init__(self, n_in, n_hidden, n_out, activation=T.tanh,inner_activation=T.nnet.sigmoid, 38 | output_type='real',batch_size=200): 39 | 40 | self.gru_1 = GRU(n_in,n_hidden,n_out,batch_size=batch_size) 41 | self.gru_2 = GRU(n_in,n_hidden,n_out,batch_size=batch_size) 42 | 43 | self.params = self.gru_1.params 44 | self.params += self.gru_2.params 45 | 46 | def __call__(self, input, input_lm=None, return_list = False): 47 | reverse_input = input[:,::-1,:] 48 | reverse_mask = input_lm[:,::-1] 49 | 50 | res1 = self.gru_1(input,input_lm,return_list) 51 | if return_list == True: 52 | res2 = self.gru_2(reverse_input,reverse_mask,return_list)[:,::-1,:] 53 | return T.concatenate([res1,res2],2) 54 | else: 55 | res2 = self.gru_2(reverse_input,reverse_mask,return_list) 56 | return T.concatenate([res1,res2],1) 57 | 58 | 59 | class GRU(object): 60 | def __init__(self, n_in, n_hidden, n_out, activation=T.tanh,inner_activation=T.nnet.sigmoid, 61 | output_type='real',batch_size=200): 62 | 63 | self.activation = activation 64 | self.inner_activation = inner_activation 65 | self.output_type = output_type 66 | 67 | self.batch_size = batch_size 68 | self.n_hidden = n_hidden 69 | 70 | # recurrent weights as a shared variable 71 | self.U_z = theano.shared(ortho_weight(n_hidden),borrow=True) 72 | self.W_z = theano.shared(glorot_uniform((n_in,n_hidden)),borrow=True) 73 | self.b_z = theano.shared(value=np.zeros((n_hidden,),dtype=theano.config.floatX),borrow=True) 74 | 75 | self.U_r = theano.shared(ortho_weight(n_hidden),borrow=True) 76 | self.W_r = theano.shared(glorot_uniform((n_in,n_hidden)),borrow=True) 77 | self.b_r = theano.shared(value=np.zeros((n_hidden,),dtype=theano.config.floatX),borrow=True) 78 | 79 | self.U_h = theano.shared(ortho_weight(n_hidden),borrow=True) 80 | self.W_h = 
theano.shared(glorot_uniform((n_in,n_hidden)),borrow=True) 81 | self.b_h = theano.shared(value=np.zeros((n_hidden,),dtype=theano.config.floatX),borrow=True) 82 | 83 | 84 | self.params = [self.W_z,self.W_h,self.W_r, 85 | self.U_h,self.U_r,self.U_z, 86 | self.b_h,self.b_r,self.b_z] 87 | 88 | def __call__(self, input,input_lm=None, return_list = False, Init_input =None,check_gate = False): 89 | # activation function 90 | if Init_input == None: 91 | init = theano.shared(value=np.zeros((self.batch_size,self.n_hidden), 92 | dtype=theano.config.floatX),borrow=True) 93 | else: 94 | init = Init_input 95 | 96 | if check_gate: 97 | self.h_l, _ = theano.scan(self.step3, 98 | sequences=[input.dimshuffle(1,0,2),T.addbroadcast(input_lm.dimshuffle(1,0,'x'), -1)], 99 | outputs_info=[init, theano.shared(value=np.zeros((self.batch_size,self.n_hidden), 100 | dtype=theano.config.floatX),borrow=True)]) 101 | return [self.h_l[0][:,-1,:], self.h_l[1]] 102 | 103 | 104 | 105 | if input_lm == None: 106 | self.h_l, _ = theano.scan(self.step2, 107 | sequences=input.dimshuffle(1,0,2), 108 | outputs_info=init) 109 | else: 110 | self.h_l, _ = theano.scan(self.step, 111 | sequences=[input.dimshuffle(1,0,2),T.addbroadcast(input_lm.dimshuffle(1,0,'x'), -1)], 112 | outputs_info=init) 113 | self.h_l = self.h_l.dimshuffle(1,0,2) 114 | if return_list == True: 115 | return self.h_l 116 | return self.h_l[:,-1,:] 117 | 118 | def step2(self,x_t, h_tm1): 119 | x_z = T.dot(x_t, self.W_z) + self.b_z 120 | x_r = T.dot(x_t, self.W_r) + self.b_r 121 | x_h = T.dot(x_t, self.W_h) + self.b_h 122 | z = self.inner_activation(x_z + T.dot(h_tm1, self.U_z)) 123 | r = self.inner_activation(x_r + T.dot(h_tm1, self.U_r)) 124 | 125 | hh = self.activation(x_h + T.dot(r * h_tm1, self.U_h)) 126 | h = z * h_tm1 + (1 - z) * hh 127 | return h 128 | def step3(self,x_t,mask, h_tm1, gate_tm1): 129 | #h_tm1 = mask * h_tm1 130 | x_z = T.dot(x_t, self.W_z) + self.b_z 131 | x_r = T.dot(x_t, self.W_r) + self.b_r 132 | x_h = T.dot(x_t, 
self.W_h) + self.b_h 133 | z = self.inner_activation(x_z + T.dot(h_tm1, self.U_z)) 134 | r = self.inner_activation(x_r + T.dot(h_tm1, self.U_r)) 135 | 136 | hh = self.activation(x_h + T.dot(r * h_tm1, self.U_h)) 137 | h = z * h_tm1 + (1 - z) * hh 138 | h = mask * h + (1-mask) * h_tm1 139 | 140 | return [h,r] 141 | 142 | def step(self,x_t,mask, h_tm1): 143 | #h_tm1 = mask * h_tm1 144 | x_z = T.dot(x_t, self.W_z) + self.b_z 145 | x_r = T.dot(x_t, self.W_r) + self.b_r 146 | x_h = T.dot(x_t, self.W_h) + self.b_h 147 | z = self.inner_activation(x_z + T.dot(h_tm1, self.U_z)) 148 | r = self.inner_activation(x_r + T.dot(h_tm1, self.U_r)) 149 | 150 | hh = self.activation(x_h + T.dot(r * h_tm1, self.U_h)) 151 | h = z * h_tm1 + (1 - z) * hh 152 | h = mask * h + (1-mask) * h_tm1 153 | 154 | return h 155 | 156 | 157 | class LSTM(object): 158 | def __init__(self,n_in, n_hidden, n_out, activation=T.tanh,inner_activation=T.nnet.sigmoid, 159 | output_type='real',batch_size=200): 160 | self.activation = activation 161 | self.inner_activation = inner_activation 162 | self.output_type = output_type 163 | 164 | self.batch_size = batch_size 165 | self.n_hidden = n_hidden 166 | 167 | self.W_i = theano.shared(glorot_uniform((n_in,n_hidden)),borrow=True) 168 | self.U_i = theano.shared(ortho_weight(n_hidden),borrow=True) 169 | self.b_i = theano.shared(value=np.zeros((n_hidden,),dtype=theano.config.floatX),borrow=True) 170 | 171 | self.W_f = theano.shared(glorot_uniform((n_in,n_hidden)),borrow=True) 172 | self.U_f = theano.shared(ortho_weight(n_hidden),borrow=True) 173 | self.b_f = theano.shared(value=np.zeros((n_hidden,),dtype=theano.config.floatX),borrow=True) 174 | 175 | self.W_c = theano.shared(glorot_uniform((n_in,n_hidden)),borrow=True) 176 | self.U_c = theano.shared(ortho_weight(n_hidden),borrow=True) 177 | self.b_c = theano.shared(value=np.zeros((n_hidden,),dtype=theano.config.floatX),borrow=True) 178 | 179 | self.W_o = theano.shared(glorot_uniform((n_in,n_hidden)),borrow=True) 180 | 
self.U_o = theano.shared(ortho_weight(n_hidden),borrow=True) 181 | self.b_o = theano.shared(value=np.zeros((n_hidden,),dtype=theano.config.floatX),borrow=True) 182 | 183 | self.params = [self.W_i, self.U_i, self.b_i, 184 | self.W_c, self.U_c, self.b_c, 185 | self.W_f, self.U_f, self.b_f, 186 | self.W_o, self.U_o, self.b_o] 187 | def __call__(self, input,input_lm=None, return_list = False): 188 | # activation function 189 | if input_lm == None: 190 | self.h_l, _ = theano.scan(self.step2, 191 | sequences=input.dimshuffle(1,0,2), 192 | outputs_info=[theano.shared(value=np.zeros((self.batch_size,self.n_hidden), 193 | dtype=theano.config.floatX),borrow=True), 194 | theano.shared(value=np.zeros((self.batch_size,self.n_hidden), 195 | dtype=theano.config.floatX),borrow=True)]) 196 | else: 197 | self.h_l, _ = theano.scan(self.step, 198 | sequences=[input.dimshuffle(1,0,2),T.addbroadcast(input_lm.dimshuffle(1,0,'x'), -1)], 199 | outputs_info=[theano.shared(value=np.zeros((self.batch_size,self.n_hidden), 200 | dtype=theano.config.floatX),borrow=True), 201 | theano.shared(value=np.zeros((self.batch_size,self.n_hidden), 202 | dtype=theano.config.floatX),borrow=True)]) 203 | self.h_l = self.h_l[0].dimshuffle(1,0,2) 204 | if return_list == True: 205 | return self.h_l 206 | return self.h_l[:,-1,:] 207 | 208 | def step(self,x_t,mask, h_tm1,c_tm1): 209 | #h_tm1 = mask * h_tm1 210 | #c_tm1 = mask * c_tm1 211 | x_i =T.dot(x_t, self.W_i) + self.b_i 212 | x_f =T.dot(x_t, self.W_f) + self.b_f 213 | x_c =T.dot(x_t, self.W_c) + self.b_c 214 | x_o =T.dot(x_t, self.W_o) + self.b_o 215 | 216 | i = self.inner_activation(x_i + T.dot(h_tm1, self.U_i)) 217 | f = self.inner_activation(x_f + T.dot(h_tm1, self.U_f)) 218 | c = f * c_tm1 + i * self.activation(x_c + T.dot(h_tm1, self.U_c)) 219 | o = self.inner_activation(x_o + T.dot(h_tm1, self.U_o)) 220 | h = o * self.activation(c) 221 | 222 | h = mask * h + (1-mask) * h_tm1 223 | c = mask * c + (1-mask) * c_tm1 224 | 225 | return [h, c] 226 | 227 | 
def step2(self,x_t, h_tm1,c_tm1): 228 | #h_tm1 = mask * h_tm1 229 | x_i =T.dot(x_t, self.W_i) + self.b_i 230 | x_f =T.dot(x_t, self.W_f) + self.b_f 231 | x_c =T.dot(x_t, self.W_c) + self.b_c 232 | x_o =T.dot(x_t, self.W_o) + self.b_o 233 | 234 | i = self.inner_activation(x_i + T.dot(h_tm1, self.U_i)) 235 | f = self.inner_activation(x_f + T.dot(h_tm1, self.U_f)) 236 | c = f * c_tm1 + i * self.activation(x_c + T.dot(h_tm1, self.U_c)) 237 | o = self.inner_activation(x_o + T.dot(h_tm1, self.U_o)) 238 | h = o * self.activation(c) 239 | return [h, c] 240 | 241 | class RNN(object): 242 | def __init__(self, input_l, input_r, n_in, n_hidden, n_out, activation=T.tanh, 243 | output_type='real',batch_size=200,input_lm=None,input_rm=None): 244 | if input_lm == None: 245 | input_lm = theano.shared(value=np.ones((batch_size,20), dtype=theano.config.floatX),borrow=True) 246 | if input_rm == None: 247 | input_rm = theano.shared(value=np.ones((batch_size,20), dtype=theano.config.floatX),borrow=True) 248 | self.activation = activation 249 | self.output_type = output_type 250 | # Parameters are reshaped views of theta 251 | param_idx = 0 # pointer to somewhere along parameter vector 252 | 253 | # recurrent weights as a shared variable 254 | self.W = theano.shared(ortho_weight(n_hidden),borrow=True,name='W') 255 | # input to hidden layer weights 256 | self.W_in = theano.shared(glorot_uniform((n_in,n_hidden)),borrow=True,name='W_in') 257 | 258 | self.h0 = theano.shared(value=np.zeros((batch_size,n_hidden), dtype=theano.config.floatX),borrow=True,name='h0') 259 | self.bh = theano.shared(value=np.zeros((batch_size,n_hidden), dtype=theano.config.floatX),borrow=True,name='bh') 260 | #self.by = theano.shared(value=np.zeros((n_out,), dtype=theano.config.floatX),borrow=True,name='by') 261 | # for convenience 262 | self.params = [self.W, self.W_in, self.bh] 263 | 264 | # activation function 265 | def step(x_t, mask, h_tm1): 266 | h_tm1 = mask * h_tm1 267 | #h_t = h_tm1 + self.bh 268 | h_t = 
T.tanh(T.dot(x_t, self.W_in) + \ 269 | T.dot(h_tm1, self.W) + self.bh) 270 | #y_t = T.dot(h_t, self.W_out) + self.by 271 | return h_t 272 | #a = T.addbroadcast(input_lm.dimshuffle(1,0), -1) 273 | self.h_l, _ = theano.scan(step, 274 | sequences=[input_l.dimshuffle(1,0,2),T.addbroadcast(input_lm.dimshuffle(1,0,'x'), -1)], 275 | outputs_info=theano.shared(value=np.zeros((batch_size,n_hidden), dtype=theano.config.floatX),borrow=True)) 276 | self.h_r, _ = theano.scan(step, 277 | sequences=[input_r.dimshuffle(1,0,2),T.addbroadcast(input_rm.dimshuffle(1,0,'x'), -1)], 278 | outputs_info=theano.shared(value=np.zeros((batch_size,n_hidden), dtype=theano.config.floatX),borrow=True)) 279 | self.h_l = self.h_l.dimshuffle(1,0,2) 280 | self.h_r = self.h_r.dimshuffle(1,0,2) 281 | 282 | 283 | if __name__=="__main__": 284 | input = T.tensor3() 285 | input2 = T.matrix() 286 | rnn = GRU(100,100,100,batch_size=47) 287 | res = rnn(input,input2,check_gate=True) 288 | output = theano.function([input,input2],[res[1]]) 289 | 290 | 291 | print output(np.random.rand(47,20,100).astype('float32'), 292 | np.ones((47,20)).astype('float32'))[0].shape -------------------------------------------------------------------------------- /theano_src/SMN_Dynamic.py: -------------------------------------------------------------------------------- 1 | import cPickle 2 | from RNN import GRU 3 | import numpy as np 4 | import theano 5 | from gensim.models.word2vec import Word2Vec 6 | from PreProcess import WordVecs 7 | from Classifier import LogisticRegression 8 | from Optimization import Adam 9 | import theano.tensor as T 10 | from SimAsImage import ConvSim 11 | 12 | max_turn = 10 13 | def get_idx_from_sent_msg(sents, word_idx_map, max_l=50,mask = False): 14 | """ 15 | Transforms sentence into a list of indices. Pad with zeroes. 16 | """ 17 | turns = [] 18 | for sent in sents.split('_t_'): 19 | x = [0] * max_l 20 | x_mask = [0.] 
* max_l 21 | words = sent.split() 22 | length = len(words) 23 | for i, word in enumerate(words): 24 | if max_l - length + i < 0: continue 25 | if word in word_idx_map: 26 | x[max_l - length + i] = word_idx_map[word] 27 | #if x[max_l - length + i] != 0: 28 | x_mask[max_l - length + i] = 1 29 | if mask: 30 | x += x_mask 31 | turns.append(x) 32 | 33 | final = [0.] * (max_l * 2 * max_turn) 34 | for i in range(max_turn): 35 | if max_turn - i <= len(turns): 36 | for j in range(max_l * 2): 37 | final[i*(max_l*2) + j] = turns[-(max_turn-i)][j] 38 | #print final 39 | #print sents 40 | return final 41 | 42 | def get_idx_from_sent(sent, word_idx_map, max_l=50,mask = False): 43 | """ 44 | Transforms sentence into a list of indices. Pad with zeroes. 45 | """ 46 | x = [0] * max_l 47 | x_mask = [0.] * max_l 48 | words = sent.split() 49 | length = len(words) 50 | for i, word in enumerate(words): 51 | if max_l - length + i < 0: continue 52 | if word in word_idx_map: 53 | x[max_l - length + i] = word_idx_map[word] 54 | #if x[max_l - length + i] != 0: 55 | x_mask[max_l - length + i] = 1 56 | if mask: 57 | x += x_mask 58 | return x 59 | 60 | def _dropout_from_layer(rng, layer, p): 61 | """p is the probability of dropping a unit 62 | """ 63 | srng = theano.tensor.shared_randomstreams.RandomStreams( 64 | rng.randint(999999)) 65 | # p=1-p because 1's indicate keep and p is prob of dropping 66 | mask = srng.binomial(n=1, p=1-p, size=layer.shape) 67 | # The cast is important because 68 | # int * float32 = float64 which pulls things off the gpu 69 | output = layer * T.cast(mask, theano.config.floatX) 70 | return output 71 | 72 | 73 | def predict(datasets, 74 | U, # pre-trained word embeddings 75 | n_epochs=5,batch_size=20,max_l = 100,hidden_size=100,word_embedding_size=100, 76 | session_hidden_size=50,session_input_size =50, model_name = 'SMN_last.bin'): # for optimization 77 | """ 78 | return: a list of dicts of lists, each list contains (ansId, groundTruth, prediction) for a question 79 |
""" 80 | hiddensize = hidden_size 81 | U = U.astype(dtype=theano.config.floatX) 82 | rng = np.random.RandomState(3435) 83 | lsize, rsize = max_l,max_l 84 | sessionmask = T.matrix() 85 | lx = [] 86 | lxmask = [] 87 | for i in range(max_turn): 88 | lx.append(T.matrix()) 89 | lxmask.append(T.matrix()) 90 | 91 | index = T.lscalar() 92 | rx = T.matrix('rx') 93 | rxmask = T.matrix() 94 | y = T.ivector('y') 95 | Words = theano.shared(value = U, name = "Words") 96 | llayer0_input = [] 97 | for i in range(max_turn): 98 | llayer0_input.append(Words[T.cast(lx[i].flatten(),dtype="int32")]\ 99 | .reshape((lx[i].shape[0],lx[i].shape[1],Words.shape[1]))) 100 | 101 | rlayer0_input = Words[T.cast(rx.flatten(),dtype="int32")].reshape((rx.shape[0],rx.shape[1],Words.shape[1])) # input: word embeddings of the mini batch 102 | 103 | 104 | train_set, dev_set, test_set = datasets[0], datasets[1], datasets[2] 105 | 106 | train_set_lx = [] 107 | train_set_lx_mask = [] 108 | q_embedding = [] 109 | offset = 2 * lsize 110 | for i in range(max_turn): 111 | train_set_lx.append(theano.shared(np.asarray(train_set[:,offset*i:offset*i + lsize] 112 | ,dtype=theano.config.floatX),borrow=True)) 113 | train_set_lx_mask.append(theano.shared(np.asarray(train_set[:,offset*i + lsize:offset*i + 2*lsize] 114 | ,dtype=theano.config.floatX),borrow=True)) 115 | train_set_rx = theano.shared(np.asarray(train_set[:,offset*max_turn:offset*max_turn + lsize] 116 | ,dtype=theano.config.floatX),borrow=True) 117 | train_set_rx_mask= theano.shared(np.asarray(train_set[:,offset*max_turn +lsize:offset*max_turn +2 *lsize] 118 | ,dtype=theano.config.floatX),borrow=True) 119 | train_set_session_mask= theano.shared(np.asarray(train_set[:,-max_turn-1:-1] 120 | ,dtype=theano.config.floatX),borrow=True) 121 | train_set_y =theano.shared(np.asarray(train_set[:,-1],dtype="int32"),borrow=True) 122 | 123 | val_set_lx = [] 124 | val_set_lx_mask = [] 125 | for i in range(max_turn): 126 | 
val_set_lx.append(theano.shared(np.asarray(dev_set[:,offset*i:offset*i + lsize] 127 | ,dtype=theano.config.floatX),borrow=True)) 128 | val_set_lx_mask.append(theano.shared(np.asarray(dev_set[:,offset*i + lsize:offset*i + 2*lsize] 129 | ,dtype=theano.config.floatX),borrow=True)) 130 | 131 | val_set_rx = theano.shared(np.asarray(dev_set[:,offset*max_turn:offset*max_turn + lsize],dtype=theano.config.floatX),borrow=True) 132 | val_set_rx_mask = theano.shared(np.asarray(dev_set[:,offset*max_turn +lsize:offset*max_turn +2 *lsize],dtype=theano.config.floatX),borrow=True) 133 | val_set_session_mask = theano.shared(np.asarray(dev_set[:,-max_turn-1:-1] 134 | ,dtype=theano.config.floatX),borrow=True) 135 | val_set_y =theano.shared(np.asarray(dev_set[:,-1],dtype="int32"),borrow=True) 136 | 137 | dic = {} 138 | for i in range(max_turn): 139 | dic[lx[i]] = train_set_lx[i][index*batch_size:(index+1)*batch_size] 140 | dic[lxmask[i]] = train_set_lx_mask[i][index*batch_size:(index+1)*batch_size] 141 | dic[rx] = train_set_rx[index*batch_size:(index+1)*batch_size] 142 | dic[sessionmask] = train_set_session_mask[index*batch_size:(index+1)*batch_size] 143 | dic[rxmask] = train_set_rx_mask[index*batch_size:(index+1)*batch_size] 144 | dic[y] = train_set_y[index*batch_size:(index+1)*batch_size] 145 | 146 | val_dic = {} 147 | for i in range(max_turn): 148 | val_dic[lx[i]] = val_set_lx[i][index*batch_size:(index+1)*batch_size] 149 | val_dic[lxmask[i]] = val_set_lx_mask[i][index*batch_size:(index+1)*batch_size] 150 | val_dic[rx] = val_set_rx[index*batch_size:(index+1)*batch_size] 151 | val_dic[sessionmask] = val_set_session_mask[index*batch_size:(index+1)*batch_size] 152 | val_dic[rxmask] = val_set_rx_mask[index*batch_size:(index+1)*batch_size] 153 | val_dic[y] = val_set_y[index*batch_size:(index+1)*batch_size] 154 | 155 | 156 | sentence2vec = GRU(n_in=word_embedding_size,n_hidden=hiddensize,n_out=hiddensize) 157 | 158 | for i in range(max_turn): 159 | 
q_embedding.append(sentence2vec(llayer0_input[i],lxmask[i],True)) 160 | r_embedding = sentence2vec(rlayer0_input,rxmask,True) 161 | 162 | pooling_layer = ConvSim(rng,max_l,session_input_size,hidden_size=hiddensize) 163 | 164 | poolingoutput = [] 165 | 166 | 167 | for i in range(max_turn): 168 | poolingoutput.append(pooling_layer(llayer0_input[i],rlayer0_input, 169 | q_embedding[i],r_embedding)) 170 | 171 | session2vec = GRU(n_in=session_input_size,n_hidden=session_hidden_size,n_out=session_hidden_size) 172 | res = session2vec(T.stack(poolingoutput,1),sessionmask,True) 173 | 174 | W = theano.shared(ortho_weight(50),borrow = True) 175 | W2 = theano.shared(glorot_uniform((100,50)),borrow=True) 176 | b = theano.shared(value=np.zeros((50,),dtype='float32'),borrow=True) 177 | U_s = theano.shared(glorot_uniform((50,1)),borrow = True) 178 | 179 | final = T.dot(T.tanh(T.dot(res,W) + T.dot(T.stack(q_embedding,1)[:,:,-1,:],W2) + b),U_s) 180 | 181 | weight = T.exp(T.max(final,2)) * sessionmask 182 | weight2 = weight / T.sum(weight,1)[:,None] 183 | 184 | final2 = T.sum(res *weight2[:,:,None],1) 185 | 186 | classifier = LogisticRegression(final2, session_hidden_size, 2, rng) 187 | 188 | 189 | test = theano.function([index], final2 190 | ,givens=val_dic,on_unused_input='ignore') 191 | print test(0).shape 192 | print test(0) 193 | 194 | cost = classifier.negative_log_likelihood(y) 195 | error = classifier.errors(y) 196 | opt = Adam() 197 | params = classifier.params 198 | params += sentence2vec.params 199 | params += session2vec.params 200 | params += pooling_layer.params 201 | params += [Words,W,b,W2,U_s] 202 | 203 | load_params(params,model_name) 204 | 205 | predict = classifier.predict_prob 206 | 207 | val_model = theano.function([index], [y,predict,cost,error], givens=val_dic 208 | ,on_unused_input='ignore') 209 | f = open('result.txt','w') 210 | loss = 0. 
211 | for minibatch_index in xrange(datasets[1].shape[0]/batch_size): 212 | a,b,c,d = val_model(minibatch_index) 213 | print c 214 | loss += c 215 | #print b.shape 216 | for i in range(batch_size): 217 | f.write(str(b[i][1])) 218 | f.write('\t') 219 | f.write(str(a[i])) 220 | f.write('\n') 221 | #print b[i] 222 | print loss/(datasets[1].shape[0]/batch_size) 223 | 224 | def ortho_weight(ndim): 225 | W = np.random.randn(ndim, ndim) 226 | u, s, v = np.linalg.svd(W) 227 | return u.astype('float32') 228 | def load_params(params,filename): 229 | f = open(filename) 230 | num_params = cPickle.load(f) 231 | for p,w in zip(params,num_params): 232 | p.set_value(w.astype('float32'),borrow=True) 233 | print "load successfully" 234 | def glorot_uniform(size): 235 | fan_in, fan_out = size 236 | s = np.sqrt(6. / (fan_in + fan_out)) 237 | print s 238 | return np.random.uniform(size=size,low=-s, high=s).astype(theano.config.floatX) 239 | def train(datasets, 240 | U, # pre-trained word embeddings 241 | n_epochs=5,batch_size=20,max_l = 100,hidden_size=100,word_embedding_size=100, 242 | session_hidden_size=50,session_input_size =50, model_name = 'SMN_last.bin'): 243 | hiddensize = hidden_size 244 | U = U.astype(dtype=theano.config.floatX) 245 | rng = np.random.RandomState(3435) 246 | lsize, rsize = max_l,max_l 247 | sessionmask = T.matrix() 248 | lx = [] 249 | lxmask = [] 250 | for i in range(max_turn): 251 | lx.append(T.matrix()) 252 | lxmask.append(T.matrix()) 253 | 254 | index = T.lscalar() 255 | rx = T.matrix('rx') 256 | rxmask = T.matrix() 257 | y = T.ivector('y') 258 | Words = theano.shared(value = U, name = "Words") 259 | llayer0_input = [] 260 | for i in range(max_turn): 261 | llayer0_input.append(Words[T.cast(lx[i].flatten(),dtype="int32")]\ 262 | .reshape((lx[i].shape[0],lx[i].shape[1],Words.shape[1]))) 263 | 264 | rlayer0_input = Words[T.cast(rx.flatten(),dtype="int32")].reshape((rx.shape[0],rx.shape[1],Words.shape[1])) # input: word embeddings of the mini batch 265 | 266 | 
267 | train_set, dev_set, test_set = datasets[0], datasets[1], datasets[2] 268 | 269 | train_set_lx = [] 270 | train_set_lx_mask = [] 271 | q_embedding = [] 272 | offset = 2 * lsize 273 | for i in range(max_turn): 274 | train_set_lx.append(theano.shared(np.asarray(train_set[:,offset*i:offset*i + lsize] 275 | ,dtype=theano.config.floatX),borrow=True)) 276 | train_set_lx_mask.append(theano.shared(np.asarray(train_set[:,offset*i + lsize:offset*i + 2*lsize] 277 | ,dtype=theano.config.floatX),borrow=True)) 278 | train_set_rx = theano.shared(np.asarray(train_set[:,offset*max_turn:offset*max_turn + lsize] 279 | ,dtype=theano.config.floatX),borrow=True) 280 | train_set_rx_mask= theano.shared(np.asarray(train_set[:,offset*max_turn +lsize:offset*max_turn +2 *lsize] 281 | ,dtype=theano.config.floatX),borrow=True) 282 | train_set_session_mask= theano.shared(np.asarray(train_set[:,-max_turn-1:-1] 283 | ,dtype=theano.config.floatX),borrow=True) 284 | train_set_y =theano.shared(np.asarray(train_set[:,-1],dtype="int32"),borrow=True) 285 | 286 | val_set_lx = [] 287 | val_set_lx_mask = [] 288 | for i in range(max_turn): 289 | val_set_lx.append(theano.shared(np.asarray(dev_set[:,offset*i:offset*i + lsize] 290 | ,dtype=theano.config.floatX),borrow=True)) 291 | val_set_lx_mask.append(theano.shared(np.asarray(dev_set[:,offset*i + lsize:offset*i + 2*lsize] 292 | ,dtype=theano.config.floatX),borrow=True)) 293 | 294 | val_set_rx = theano.shared(np.asarray(dev_set[:,offset*max_turn:offset*max_turn + lsize],dtype=theano.config.floatX),borrow=True) 295 | val_set_rx_mask = theano.shared(np.asarray(dev_set[:,offset*max_turn +lsize:offset*max_turn +2 *lsize],dtype=theano.config.floatX),borrow=True) 296 | val_set_session_mask = theano.shared(np.asarray(dev_set[:,-max_turn-1:-1] 297 | ,dtype=theano.config.floatX),borrow=True) 298 | val_set_y =theano.shared(np.asarray(dev_set[:,-1],dtype="int32"),borrow=True) 299 | 300 | dic = {} 301 | for i in range(max_turn): 302 | dic[lx[i]] = 
train_set_lx[i][index*batch_size:(index+1)*batch_size] 303 | dic[lxmask[i]] = train_set_lx_mask[i][index*batch_size:(index+1)*batch_size] 304 | dic[rx] = train_set_rx[index*batch_size:(index+1)*batch_size] 305 | dic[sessionmask] = train_set_session_mask[index*batch_size:(index+1)*batch_size] 306 | dic[rxmask] = train_set_rx_mask[index*batch_size:(index+1)*batch_size] 307 | dic[y] = train_set_y[index*batch_size:(index+1)*batch_size] 308 | 309 | val_dic = {} 310 | for i in range(max_turn): 311 | val_dic[lx[i]] = val_set_lx[i][index*batch_size:(index+1)*batch_size] 312 | val_dic[lxmask[i]] = val_set_lx_mask[i][index*batch_size:(index+1)*batch_size] 313 | val_dic[rx] = val_set_rx[index*batch_size:(index+1)*batch_size] 314 | val_dic[sessionmask] = val_set_session_mask[index*batch_size:(index+1)*batch_size] 315 | val_dic[rxmask] = val_set_rx_mask[index*batch_size:(index+1)*batch_size] 316 | val_dic[y] = val_set_y[index*batch_size:(index+1)*batch_size] 317 | 318 | 319 | sentence2vec = GRU(n_in=word_embedding_size,n_hidden=hiddensize,n_out=hiddensize) 320 | 321 | for i in range(max_turn): 322 | q_embedding.append(sentence2vec(llayer0_input[i],lxmask[i],True)) 323 | r_embedding = sentence2vec(rlayer0_input,rxmask,True) 324 | 325 | pooling_layer = ConvSim(rng,max_l,session_input_size,hidden_size=hiddensize) 326 | 327 | poolingoutput = [] 328 | 329 | 330 | for i in range(max_turn): 331 | poolingoutput.append(pooling_layer(llayer0_input[i],rlayer0_input, 332 | q_embedding[i],r_embedding)) 333 | 334 | session2vec = GRU(n_in=session_input_size,n_hidden=session_hidden_size,n_out=session_hidden_size) 335 | res = session2vec(T.stack(poolingoutput,1),sessionmask,True) 336 | 337 | W = theano.shared(ortho_weight(50),borrow = True) 338 | W2 = theano.shared(glorot_uniform((100,50)),borrow=True) 339 | b = theano.shared(value=np.zeros((50,),dtype='float32'),borrow=True) 340 | U_s = theano.shared(glorot_uniform((50,1)),borrow = True) 341 | 342 | final = T.dot(T.tanh(T.dot(res,W) + 
T.dot(T.stack(q_embedding,1)[:,:,-1,:],W2) + b),U_s) 343 | 344 | weight = T.exp(T.max(final,2)) * sessionmask 345 | weight2 = weight / T.sum(weight,1)[:,None] 346 | 347 | final2 = T.sum(res *weight2[:,:,None],1) 348 | 349 | classifier = LogisticRegression(final2, session_hidden_size, 2, rng) 350 | 351 | 352 | test = theano.function([index], final2 353 | ,givens=val_dic,on_unused_input='ignore') 354 | print test(0).shape 355 | print test(0) 356 | 357 | cost = classifier.negative_log_likelihood(y) 358 | error = classifier.errors(y) 359 | opt = Adam() 360 | params = classifier.params 361 | params += sentence2vec.params 362 | params += session2vec.params 363 | params += pooling_layer.params 364 | params += [Words,W,b,W2,U_s] 365 | 366 | grad_updates = opt.Adam(cost=cost,params=params,lr = 0.001) #opt.sgd_updates_adadelta(params, cost, lr_decay, 1e-8, sqr_norm_lim) 367 | 368 | train_model = theano.function([index], cost,updates=grad_updates, givens=dic,on_unused_input='ignore') 369 | val_model = theano.function([index], [cost,error], givens=val_dic,on_unused_input='ignore') 370 | best_dev = 1. 371 | n_train_batches = datasets[0].shape[0]/batch_size 372 | for i in xrange(n_epochs): 373 | cost = 0 374 | total = 0. 
375 | for minibatch_index in np.random.permutation(range(n_train_batches)): 376 | batch_cost = train_model(minibatch_index) 377 | total = total + 1 378 | cost = cost + batch_cost 379 | if total % 50 == 0: 380 | print total, cost/total 381 | cost = cost / n_train_batches 382 | print "echo %d loss %f" % (i,cost) 383 | 384 | cost=0 385 | errors = 0 386 | j = 0 387 | for minibatch_index in xrange(datasets[1].shape[0]/batch_size): 388 | tcost, terr = val_model(minibatch_index) 389 | cost += tcost 390 | errors += terr 391 | j = j+1 392 | cost = cost / j 393 | errors = errors / j 394 | if cost < best_dev: 395 | best_dev = cost 396 | save_params(params,model_name) 397 | print "echo %d dev_loss %f" % (i,cost) 398 | print "echo %d dev_accuracy %f" % (i,1 - errors) 399 | 400 | def save_params(params,filename): 401 | num_params = [p.get_value() for p in params] 402 | f = open(filename,'wb') 403 | cPickle.dump(num_params,f) 404 | 405 | def get_session_mask(sents): 406 | session_mask = [0.] * max_turn 407 | turns = [] 408 | for sent in sents.split('_t_'): 409 | words = sent.split() 410 | if len(words) > 0: 411 | turns.append(len(words)) 412 | 413 | for i in range(max_turn): 414 | if max_turn - i <= len(turns): 415 | session_mask[-(max_turn-i)] = 1. 416 | #print session_mask 417 | return session_mask 418 | #print final 419 | 420 | 421 | def make_data(revs, word_idx_map, max_l=50, filter_h=3, val_test_splits=[2,3],validation_num = 50000): 422 | """ 423 | Transforms sentences into a 2-d matrix. 
424 | """ 425 | train, val, test = [], [], [] 426 | for rev in revs: 427 | sent = get_idx_from_sent_msg(rev["m"], word_idx_map, max_l, True) 428 | sent += get_idx_from_sent(rev["r"], word_idx_map, max_l, True) 429 | sent += get_session_mask(rev["m"]) 430 | sent.append(int(rev["y"])) 431 | if len(val) > validation_num: 432 | train.append(sent) 433 | else: 434 | val.append(sent) 435 | 436 | train = np.array(train,dtype="int") 437 | val = np.array(val,dtype="int") 438 | test = np.array(test,dtype="int") 439 | print 'training data', len(train),'val data', len(val) 440 | return [train, val, test] 441 | 442 | if __name__=="__main__": 443 | train_flag = True 444 | max_word_per_utterence = 50 445 | dataset = r"../ubuntu_data.mul.100d.fullw2v.train" 446 | x = cPickle.load(open(dataset,"rb")) 447 | revs, wordvecs, max_l = x[0], x[1], x[2] 448 | 449 | if train_flag == False: 450 | x = cPickle.load(open(r"../ubuntu_data.mul.test","rb")) 451 | revs, wordvecs2, max_l2 = x[0], x[1], x[2] 452 | datasets = make_data(revs,wordvecs.word_idx_map,max_l=max_word_per_utterence) 453 | 454 | if train_flag == True: 455 | train(datasets,wordvecs.W,batch_size=200,max_l=max_word_per_utterence 456 | ,hidden_size=100,word_embedding_size=100,model_name='SMN_Dynamic.bin') 457 | else: 458 | predict(datasets,wordvecs.W,batch_size=200,max_l=max_word_per_utterence 459 | ,hidden_size=100,word_embedding_size=100,model_name='SMN_Dynamic.bin') -------------------------------------------------------------------------------- /theano_src/SMN_Last.py: -------------------------------------------------------------------------------- 1 | import cPickle 2 | from RNN import GRU 3 | import numpy as np 4 | import theano 5 | from gensim.models.word2vec import Word2Vec 6 | from PreProcess import WordVecs 7 | from Classifier import LogisticRegression 8 | from Optimization import Adam 9 | import theano.tensor as T 10 | from SimAsImage import ConvSim 11 | 12 | max_turn = 10 13 | def get_idx_from_sent_msg(sents, 
word_idx_map, max_l=50,mask = False): 14 | """ 15 | Transforms sentence into a list of indices. Pad with zeroes. 16 | """ 17 | turns = [] 18 | for sent in sents.split('_t_'): 19 | x = [0] * max_l 20 | x_mask = [0.] * max_l 21 | words = sent.split() 22 | length = len(words) 23 | for i, word in enumerate(words): 24 | if max_l - length + i < 0: continue 25 | if word in word_idx_map: 26 | x[max_l - length + i] = word_idx_map[word] 27 | #if x[max_l - length + i] != 0: 28 | x_mask[max_l - length + i] = 1 29 | if mask: 30 | x += x_mask 31 | turns.append(x) 32 | 33 | final = [0.] * (max_l * 2 * max_turn) 34 | for i in range(max_turn): 35 | if max_turn - i <= len(turns): 36 | for j in range(max_l * 2): 37 | final[i*(max_l*2) + j] = turns[-(max_turn-i)][j] 38 | #print final 39 | #print sents 40 | return final 41 | 42 | def get_idx_from_sent(sent, word_idx_map, max_l=50,mask = False): 43 | """ 44 | Transforms sentence into a list of indices. Pad with zeroes. 45 | """ 46 | x = [0] * max_l 47 | x_mask = [0.] 
* max_l 48 | words = sent.split() 49 | length = len(words) 50 | for i, word in enumerate(words): 51 | if max_l - length + i < 0: continue 52 | if word in word_idx_map: 53 | x[max_l - length + i] = word_idx_map[word] 54 | #if x[max_l - length + i] != 0: 55 | x_mask[max_l - length + i] = 1 56 | if mask: 57 | x += x_mask 58 | return x 59 | 60 | def _dropout_from_layer(rng, layer, p): 61 | """p is the probability of dropping a unit 62 | """ 63 | srng = theano.tensor.shared_randomstreams.RandomStreams( 64 | rng.randint(999999)) 65 | # p=1-p because 1's indicate keep and p is prob of dropping 66 | mask = srng.binomial(n=1, p=1-p, size=layer.shape) 67 | # The cast is important because 68 | # int * float32 = float64 which pulls things off the gpu 69 | output = layer * T.cast(mask, theano.config.floatX) 70 | return output 71 | 72 | 73 | def predict(datasets, 74 | U, # pre-trained word embeddings 75 | n_epochs=5,batch_size=20,max_l = 100,hidden_size=100,word_embedding_size=100, 76 | session_hidden_size=50,session_input_size =50, model_name = 'SMN_last.bin'): # for optimization 77 | """ 78 | return: a list of dicts of lists, each list contains (ansId, groundTruth, prediction) for a question 79 | """ 80 | hiddensize = hidden_size 81 | U = U.astype(dtype=theano.config.floatX) 82 | rng = np.random.RandomState(3435) 83 | lsize, rsize = max_l,max_l 84 | 85 | sessionmask = T.matrix() 86 | lx = [] 87 | lxmask = [] 88 | for i in range(max_turn): 89 | lx.append(T.matrix()) 90 | lxmask.append(T.matrix()) 91 | 92 | index = T.lscalar() 93 | rx = T.matrix('rx') 94 | rxmask = T.matrix() 95 | y = T.ivector('y') 96 | Words = theano.shared(value = U, name = "Words") 97 | llayer0_input = [] 98 | for i in range(max_turn): 99 | llayer0_input.append(Words[T.cast(lx[i].flatten(),dtype="int32")]\ 100 | .reshape((lx[i].shape[0],lx[i].shape[1],Words.shape[1]))) 101 | 102 | rlayer0_input = Words[T.cast(rx.flatten(),dtype="int32")].reshape((rx.shape[0],rx.shape[1],Words.shape[1])) # input: word 
embeddings of the mini batch 103 | 104 | 105 | train_set, dev_set, test_set = datasets[0], datasets[1], datasets[2] 106 | 107 | train_set_lx = [] 108 | train_set_lx_mask = [] 109 | q_embedding = [] 110 | offset = 2 * lsize 111 | for i in range(max_turn): 112 | train_set_lx.append(theano.shared(np.asarray(train_set[:,offset*i:offset*i + lsize] 113 | ,dtype=theano.config.floatX),borrow=True)) 114 | train_set_lx_mask.append(theano.shared(np.asarray(train_set[:,offset*i + lsize:offset*i + 2*lsize] 115 | ,dtype=theano.config.floatX),borrow=True)) 116 | #print train_set_lx.shape 117 | train_set_rx = theano.shared(np.asarray(train_set[:,offset*max_turn:offset*max_turn + lsize] 118 | ,dtype=theano.config.floatX),borrow=True) 119 | train_set_rx_mask= theano.shared(np.asarray(train_set[:,offset*max_turn +lsize:offset*max_turn +2 *lsize] 120 | ,dtype=theano.config.floatX),borrow=True) 121 | train_set_session_mask= theano.shared(np.asarray(train_set[:,-max_turn-1:-1] 122 | ,dtype=theano.config.floatX),borrow=True) 123 | train_set_y =theano.shared(np.asarray(train_set[:,-1],dtype="int32"),borrow=True) 124 | 125 | val_set_lx = [] 126 | val_set_lx_mask = [] 127 | for i in range(max_turn): 128 | val_set_lx.append(theano.shared(np.asarray(dev_set[:,offset*i:offset*i + lsize] 129 | ,dtype=theano.config.floatX),borrow=True)) 130 | val_set_lx_mask.append(theano.shared(np.asarray(dev_set[:,offset*i + lsize:offset*i + 2*lsize] 131 | ,dtype=theano.config.floatX),borrow=True)) 132 | 133 | val_set_rx = theano.shared(np.asarray(dev_set[:,offset*max_turn:offset*max_turn + lsize],dtype=theano.config.floatX),borrow=True) 134 | val_set_rx_mask = theano.shared(np.asarray(dev_set[:,offset*max_turn +lsize:offset*max_turn +2 *lsize],dtype=theano.config.floatX),borrow=True) 135 | val_set_session_mask = theano.shared(np.asarray(dev_set[:,-max_turn-1:-1] 136 | ,dtype=theano.config.floatX),borrow=True) 137 | val_set_y =theano.shared(np.asarray(dev_set[:,-1],dtype="int32"),borrow=True) 138 | 139 | dic = 
{} 140 | for i in range(max_turn): 141 | dic[lx[i]] = train_set_lx[i][index*batch_size:(index+1)*batch_size] 142 | dic[lxmask[i]] = train_set_lx_mask[i][index*batch_size:(index+1)*batch_size] 143 | dic[rx] = train_set_rx[index*batch_size:(index+1)*batch_size] 144 | dic[sessionmask] = train_set_session_mask[index*batch_size:(index+1)*batch_size] 145 | dic[rxmask] = train_set_rx_mask[index*batch_size:(index+1)*batch_size] 146 | dic[y] = train_set_y[index*batch_size:(index+1)*batch_size] 147 | 148 | val_dic = {} 149 | for i in range(max_turn): 150 | val_dic[lx[i]] = val_set_lx[i][index*batch_size:(index+1)*batch_size] 151 | val_dic[lxmask[i]] = val_set_lx_mask[i][index*batch_size:(index+1)*batch_size] 152 | val_dic[rx] = val_set_rx[index*batch_size:(index+1)*batch_size] 153 | val_dic[sessionmask] = val_set_session_mask[index*batch_size:(index+1)*batch_size] 154 | val_dic[rxmask] = val_set_rx_mask[index*batch_size:(index+1)*batch_size] 155 | val_dic[y] = val_set_y[index*batch_size:(index+1)*batch_size] 156 | 157 | 158 | sentence2vec = GRU(n_in=word_embedding_size,n_hidden=hiddensize,n_out=hiddensize) 159 | 160 | for i in range(max_turn): 161 | q_embedding.append(sentence2vec(llayer0_input[i],lxmask[i],True)) 162 | r_embedding = sentence2vec(rlayer0_input,rxmask,True) 163 | 164 | pooling_layer = ConvSim(rng,max_l,session_input_size,hidden_size=hiddensize) 165 | 166 | poolingoutput = [] 167 | test = theano.function([index],pooling_layer(llayer0_input[-4],rlayer0_input, 168 | q_embedding[i],r_embedding) 169 | ,givens=val_dic,on_unused_input='ignore') 170 | print test(0).shape 171 | print test(0) 172 | 173 | for i in range(max_turn): 174 | poolingoutput.append(pooling_layer(llayer0_input[i],rlayer0_input, 175 | q_embedding[i],r_embedding)) 176 | 177 | session2vec = GRU(n_in=session_input_size,n_hidden=session_hidden_size,n_out=session_hidden_size) 178 | res = session2vec(T.stack(poolingoutput,1),sessionmask) 179 | classifier = LogisticRegression(res, session_hidden_size 
,2,rng) 180 | 181 | cost = classifier.negative_log_likelihood(y) 182 | error = classifier.errors(y) 183 | opt = Adam() 184 | params = classifier.params 185 | params += sentence2vec.params 186 | params += session2vec.params 187 | params += pooling_layer.params 188 | params += [Words] 189 | 190 | 191 | load_params(params,model_name) 192 | 193 | predict = classifier.predict_prob 194 | 195 | val_model = theano.function([index], [y,predict,cost,error], givens=val_dic 196 | ,on_unused_input='ignore') 197 | f = open('result.txt','w') 198 | loss = 0. 199 | for minibatch_index in xrange(datasets[1].shape[0]/batch_size): 200 | a,b,c,d = val_model(minibatch_index) 201 | print c 202 | loss += c 203 | #print b.shape 204 | for i in range(batch_size): 205 | f.write(str(b[i][1])) 206 | f.write('\t') 207 | f.write(str(a[i])) 208 | f.write('\n') 209 | #print b[i] 210 | print loss/(datasets[1].shape[0]/batch_size) 211 | 212 | 213 | def load_params(params,filename): 214 | f = open(filename,'rb') # binary mode, matching the 'wb' used by save_params 215 | num_params = cPickle.load(f) 216 | for p,w in zip(params,num_params): 217 | p.set_value(w.astype('float32'),borrow=True) 218 | print "load successfully" 219 | 220 | def train(datasets, 221 | U, # pre-trained word embeddings 222 | n_epochs=5,batch_size=20,max_l = 100,hidden_size=100,word_embedding_size=100, 223 | session_hidden_size=50,session_input_size =50, model_name = 'SMN_last.bin'): 224 | hiddensize = hidden_size 225 | U = U.astype(dtype=theano.config.floatX) 226 | rng = np.random.RandomState(3435) 227 | lsize, rsize = max_l,max_l 228 | sessionmask = T.matrix() 229 | lx = [] 230 | lxmask = [] 231 | for i in range(max_turn): 232 | lx.append(T.matrix()) 233 | lxmask.append(T.matrix()) 234 | 235 | index = T.lscalar() 236 | rx = T.matrix('rx') 237 | rxmask = T.matrix() 238 | y = T.ivector('y') 239 | Words = theano.shared(value = U, name = "Words") 240 | llayer0_input = [] 241 | for i in range(max_turn): 242 | llayer0_input.append(Words[T.cast(lx[i].flatten(),dtype="int32")]\ 243 | 
.reshape((lx[i].shape[0],lx[i].shape[1],Words.shape[1]))) 244 | 245 | rlayer0_input = Words[T.cast(rx.flatten(),dtype="int32")].reshape((rx.shape[0],rx.shape[1],Words.shape[1])) # input: word embeddings of the mini batch 246 | 247 | 248 | train_set, dev_set, test_set = datasets[0], datasets[1], datasets[2] 249 | 250 | train_set_lx = [] 251 | train_set_lx_mask = [] 252 | q_embedding = [] 253 | offset = 2 * lsize 254 | for i in range(max_turn): 255 | train_set_lx.append(theano.shared(np.asarray(train_set[:,offset*i:offset*i + lsize] 256 | ,dtype=theano.config.floatX),borrow=True)) 257 | train_set_lx_mask.append(theano.shared(np.asarray(train_set[:,offset*i + lsize:offset*i + 2*lsize] 258 | ,dtype=theano.config.floatX),borrow=True)) 259 | train_set_rx = theano.shared(np.asarray(train_set[:,offset*max_turn:offset*max_turn + lsize] 260 | ,dtype=theano.config.floatX),borrow=True) 261 | train_set_rx_mask= theano.shared(np.asarray(train_set[:,offset*max_turn +lsize:offset*max_turn +2 *lsize] 262 | ,dtype=theano.config.floatX),borrow=True) 263 | train_set_session_mask= theano.shared(np.asarray(train_set[:,-max_turn-1:-1] 264 | ,dtype=theano.config.floatX),borrow=True) 265 | train_set_y =theano.shared(np.asarray(train_set[:,-1],dtype="int32"),borrow=True) 266 | 267 | val_set_lx = [] 268 | val_set_lx_mask = [] 269 | for i in range(max_turn): 270 | val_set_lx.append(theano.shared(np.asarray(dev_set[:,offset*i:offset*i + lsize] 271 | ,dtype=theano.config.floatX),borrow=True)) 272 | val_set_lx_mask.append(theano.shared(np.asarray(dev_set[:,offset*i + lsize:offset*i + 2*lsize] 273 | ,dtype=theano.config.floatX),borrow=True)) 274 | 275 | val_set_rx = theano.shared(np.asarray(dev_set[:,offset*max_turn:offset*max_turn + lsize],dtype=theano.config.floatX),borrow=True) 276 | val_set_rx_mask = theano.shared(np.asarray(dev_set[:,offset*max_turn +lsize:offset*max_turn +2 *lsize],dtype=theano.config.floatX),borrow=True) 277 | val_set_session_mask = 
theano.shared(np.asarray(dev_set[:,-max_turn-1:-1] 278 | ,dtype=theano.config.floatX),borrow=True) 279 | val_set_y =theano.shared(np.asarray(dev_set[:,-1],dtype="int32"),borrow=True) 280 | 281 | dic = {} 282 | for i in range(max_turn): 283 | dic[lx[i]] = train_set_lx[i][index*batch_size:(index+1)*batch_size] 284 | dic[lxmask[i]] = train_set_lx_mask[i][index*batch_size:(index+1)*batch_size] 285 | dic[rx] = train_set_rx[index*batch_size:(index+1)*batch_size] 286 | dic[sessionmask] = train_set_session_mask[index*batch_size:(index+1)*batch_size] 287 | dic[rxmask] = train_set_rx_mask[index*batch_size:(index+1)*batch_size] 288 | dic[y] = train_set_y[index*batch_size:(index+1)*batch_size] 289 | 290 | val_dic = {} 291 | for i in range(max_turn): 292 | val_dic[lx[i]] = val_set_lx[i][index*batch_size:(index+1)*batch_size] 293 | val_dic[lxmask[i]] = val_set_lx_mask[i][index*batch_size:(index+1)*batch_size] 294 | val_dic[rx] = val_set_rx[index*batch_size:(index+1)*batch_size] 295 | val_dic[sessionmask] = val_set_session_mask[index*batch_size:(index+1)*batch_size] 296 | val_dic[rxmask] = val_set_rx_mask[index*batch_size:(index+1)*batch_size] 297 | val_dic[y] = val_set_y[index*batch_size:(index+1)*batch_size] 298 | 299 | 300 | sentence2vec = GRU(n_in=word_embedding_size,n_hidden=hiddensize,n_out=hiddensize) 301 | 302 | for i in range(max_turn): 303 | q_embedding.append(sentence2vec(llayer0_input[i],lxmask[i],True)) 304 | r_embedding = sentence2vec(rlayer0_input,rxmask,True) 305 | 306 | pooling_layer = ConvSim(rng,max_l,session_input_size,hidden_size=hiddensize) 307 | 308 | poolingoutput = [] 309 | test = theano.function([index],pooling_layer(llayer0_input[-4],rlayer0_input, 310 | q_embedding[i],r_embedding) 311 | ,givens=val_dic,on_unused_input='ignore') 312 | print test(0).shape 313 | print test(0) 314 | 315 | for i in range(max_turn): 316 | poolingoutput.append(pooling_layer(llayer0_input[i],rlayer0_input, 317 | q_embedding[i],r_embedding)) 318 | 319 | session2vec = 
GRU(n_in=session_input_size,n_hidden=session_hidden_size,n_out=session_hidden_size) 320 | res = session2vec(T.stack(poolingoutput,1),sessionmask) 321 | classifier = LogisticRegression(res, session_hidden_size ,2,rng) 322 | 323 | cost = classifier.negative_log_likelihood(y) 324 | error = classifier.errors(y) 325 | opt = Adam() 326 | params = classifier.params 327 | params += sentence2vec.params 328 | params += session2vec.params 329 | params += pooling_layer.params 330 | params += [Words] 331 | 332 | grad_updates = opt.Adam(cost=cost,params=params,lr = 0.001) #opt.sgd_updates_adadelta(params, cost, lr_decay, 1e-8, sqr_norm_lim) 333 | 334 | train_model = theano.function([index], cost,updates=grad_updates, givens=dic,on_unused_input='ignore') 335 | val_model = theano.function([index], [cost,error], givens=val_dic,on_unused_input='ignore') 336 | best_dev = 1. 337 | n_train_batches = datasets[0].shape[0]/batch_size 338 | for i in xrange(n_epochs): 339 | cost = 0 340 | total = 0. 341 | for minibatch_index in np.random.permutation(range(n_train_batches)): 342 | batch_cost = train_model(minibatch_index) 343 | total = total + 1 344 | cost = cost + batch_cost 345 | if total % 50 == 0: 346 | print total, cost/total 347 | cost = cost / n_train_batches 348 | print "echo %d loss %f" % (i,cost) 349 | 350 | cost=0 351 | errors = 0 352 | j = 0 353 | for minibatch_index in xrange(datasets[1].shape[0]/batch_size): 354 | tcost, terr = val_model(minibatch_index) 355 | cost += tcost 356 | errors += terr 357 | j = j+1 358 | cost = cost / j 359 | errors = errors / j 360 | if cost < best_dev: 361 | best_dev = cost 362 | save_params(params,model_name) 363 | print "echo %d dev_loss %f" % (i,cost) 364 | print "echo %d dev_accuracy %f" % (i,1 - errors) 365 | 366 | def save_params(params,filename): 367 | num_params = [p.get_value() for p in params] 368 | f = open(filename,'wb') 369 | cPickle.dump(num_params,f) 370 | 371 | def get_session_mask(sents): 372 | session_mask = [0.] 
* max_turn 373 | turns = [] 374 | for sent in sents.split('_t_'): 375 | words = sent.split() 376 | if len(words) > 0: 377 | turns.append(len(words)) 378 | 379 | for i in range(max_turn): 380 | if max_turn - i <= len(turns): 381 | session_mask[-(max_turn-i)] = 1. 382 | #print session_mask 383 | return session_mask 384 | #print final 385 | 386 | 387 | def make_data(revs, word_idx_map, max_l=50, filter_h=3, val_test_splits=[2,3],validation_num = 50000): 388 | """ 389 | Transforms each example into one row of a 2-d matrix: context turns, response, session mask, label. 390 | """ 391 | train, val, test = [], [], [] 392 | for rev in revs: 393 | sent = get_idx_from_sent_msg(rev["m"], word_idx_map, max_l, True) 394 | sent += get_idx_from_sent(rev["r"], word_idx_map, max_l, True) 395 | sent += get_session_mask(rev["m"]) 396 | sent.append(int(rev["y"])) 397 | if len(val) > validation_num: # the first validation_num+1 examples become the dev set, the rest training data 398 | train.append(sent) 399 | else: 400 | val.append(sent) 401 | 402 | train = np.array(train,dtype="int") 403 | val = np.array(val,dtype="int") 404 | test = np.array(test,dtype="int") 405 | print 'training data', len(train),'val data', len(val) 406 | return [train, val, test] 407 | 408 | if __name__=="__main__": 409 | train_flag = True 410 | max_word_per_utterence = 50 411 | dataset = r"../ubuntu_data.mul.100d.fullw2v.train" 412 | x = cPickle.load(open(dataset,"rb")) 413 | revs, wordvecs, max_l = x[0], x[1], x[2] 414 | 415 | if train_flag == False: 416 | x = cPickle.load(open(r"../ubuntu_data.mul.test","rb")) 417 | revs, wordvecs2, max_l2 = x[0], x[1], x[2] 418 | datasets = make_data(revs,wordvecs.word_idx_map,max_l=max_word_per_utterence) 419 | 420 | if train_flag == True: 421 | train(datasets,wordvecs.W,batch_size=200,max_l=max_word_per_utterence 422 | ,hidden_size=100,word_embedding_size=100) 423 | else: 424 | predict(datasets,wordvecs.W,batch_size=200,max_l=max_word_per_utterence 425 | ,hidden_size=100,word_embedding_size=100) -------------------------------------------------------------------------------- /theano_src/SMN_Static.py: 
-------------------------------------------------------------------------------- 1 | import cPickle 2 | from RNN import GRU 3 | import numpy as np 4 | import theano 5 | from gensim.models.word2vec import Word2Vec 6 | from PreProcess import WordVecs 7 | from Classifier import LogisticRegression 8 | from Optimization import Adam 9 | import theano.tensor as T 10 | from SimAsImage import ConvSim 11 | 12 | max_turn = 10 13 | def get_idx_from_sent_msg(sents, word_idx_map, max_l=50,mask = False): 14 | """ 15 | Transforms sentence into a list of indices. Pad with zeroes. 16 | """ 17 | turns = [] 18 | for sent in sents.split('_t_'): 19 | x = [0] * max_l 20 | x_mask = [0.] * max_l 21 | words = sent.split() 22 | length = len(words) 23 | for i, word in enumerate(words): 24 | if max_l - length + i < 0: continue 25 | if word in word_idx_map: 26 | x[max_l - length + i] = word_idx_map[word] 27 | #if x[max_l - length + i] != 0: 28 | x_mask[max_l - length + i] = 1 29 | if mask: 30 | x += x_mask 31 | turns.append(x) 32 | 33 | final = [0.] * (max_l * 2 * max_turn) 34 | for i in range(max_turn): 35 | if max_turn - i <= len(turns): 36 | for j in range(max_l * 2): 37 | final[i*(max_l*2) + j] = turns[-(max_turn-i)][j] 38 | #print final 39 | #print sents 40 | return final 41 | 42 | def get_idx_from_sent(sent, word_idx_map, max_l=50,mask = False): 43 | """ 44 | Transforms sentence into a list of indices. Pad with zeroes. 45 | """ 46 | x = [0] * max_l 47 | x_mask = [0.] 
* max_l 48 | words = sent.split() 49 | length = len(words) 50 | for i, word in enumerate(words): 51 | if max_l - length + i < 0: continue 52 | if word in word_idx_map: 53 | x[max_l - length + i] = word_idx_map[word] 54 | #if x[max_l - length + i] != 0: 55 | x_mask[max_l - length + i] = 1 56 | if mask: 57 | x += x_mask 58 | return x 59 | 60 | def _dropout_from_layer(rng, layer, p): 61 | """p is the probability of dropping a unit 62 | """ 63 | srng = theano.tensor.shared_randomstreams.RandomStreams( 64 | rng.randint(999999)) 65 | # p=1-p because 1's indicate keep and p is prob of dropping 66 | mask = srng.binomial(n=1, p=1-p, size=layer.shape) 67 | # The cast is important because 68 | # int * float32 = float64 which pulls things off the gpu 69 | output = layer * T.cast(mask, theano.config.floatX) 70 | return output 71 | 72 | 73 | def predict(datasets, 74 | U, # pre-trained word embeddings 75 | n_epochs=5,batch_size=20,max_l = 100,hidden_size=100,word_embedding_size=100, 76 | session_hidden_size=50,session_input_size =50, model_name = 'SMN_last.bin'): # for optimization 77 | """ 78 | Loads the saved parameters, scores the dev set (datasets[1]), and writes one "score<TAB>label" line per example to result.txt; returns nothing. 79 | """ 80 | hiddensize = hidden_size 81 | U = U.astype(dtype=theano.config.floatX) 82 | rng = np.random.RandomState(3435) 83 | lsize, rsize = max_l,max_l 84 | sessionmask = T.matrix() 85 | lx = [] 86 | lxmask = [] 87 | for i in range(max_turn): 88 | lx.append(T.matrix()) 89 | lxmask.append(T.matrix()) 90 | 91 | index = T.lscalar() 92 | rx = T.matrix('rx') 93 | rxmask = T.matrix() 94 | y = T.ivector('y') 95 | Words = theano.shared(value = U, name = "Words") 96 | llayer0_input = [] 97 | for i in range(max_turn): 98 | llayer0_input.append(Words[T.cast(lx[i].flatten(),dtype="int32")]\ 99 | .reshape((lx[i].shape[0],lx[i].shape[1],Words.shape[1]))) 100 | 101 | rlayer0_input = Words[T.cast(rx.flatten(),dtype="int32")].reshape((rx.shape[0],rx.shape[1],Words.shape[1])) # input: word embeddings of
the mini batch 102 | 103 | 104 | train_set, dev_set, test_set = datasets[0], datasets[1], datasets[2] 105 | 106 | train_set_lx = [] 107 | train_set_lx_mask = [] 108 | q_embedding = [] 109 | offset = 2 * lsize 110 | for i in range(max_turn): 111 | train_set_lx.append(theano.shared(np.asarray(train_set[:,offset*i:offset*i + lsize] 112 | ,dtype=theano.config.floatX),borrow=True)) 113 | train_set_lx_mask.append(theano.shared(np.asarray(train_set[:,offset*i + lsize:offset*i + 2*lsize] 114 | ,dtype=theano.config.floatX),borrow=True)) 115 | train_set_rx = theano.shared(np.asarray(train_set[:,offset*max_turn:offset*max_turn + lsize] 116 | ,dtype=theano.config.floatX),borrow=True) 117 | train_set_rx_mask= theano.shared(np.asarray(train_set[:,offset*max_turn +lsize:offset*max_turn +2 *lsize] 118 | ,dtype=theano.config.floatX),borrow=True) 119 | train_set_session_mask= theano.shared(np.asarray(train_set[:,-max_turn-1:-1] 120 | ,dtype=theano.config.floatX),borrow=True) 121 | train_set_y =theano.shared(np.asarray(train_set[:,-1],dtype="int32"),borrow=True) 122 | 123 | val_set_lx = [] 124 | val_set_lx_mask = [] 125 | for i in range(max_turn): 126 | val_set_lx.append(theano.shared(np.asarray(dev_set[:,offset*i:offset*i + lsize] 127 | ,dtype=theano.config.floatX),borrow=True)) 128 | val_set_lx_mask.append(theano.shared(np.asarray(dev_set[:,offset*i + lsize:offset*i + 2*lsize] 129 | ,dtype=theano.config.floatX),borrow=True)) 130 | 131 | val_set_rx = theano.shared(np.asarray(dev_set[:,offset*max_turn:offset*max_turn + lsize],dtype=theano.config.floatX),borrow=True) 132 | val_set_rx_mask = theano.shared(np.asarray(dev_set[:,offset*max_turn +lsize:offset*max_turn +2 *lsize],dtype=theano.config.floatX),borrow=True) 133 | val_set_session_mask = theano.shared(np.asarray(dev_set[:,-max_turn-1:-1] 134 | ,dtype=theano.config.floatX),borrow=True) 135 | val_set_y =theano.shared(np.asarray(dev_set[:,-1],dtype="int32"),borrow=True) 136 | 137 | dic = {} 138 | for i in range(max_turn): 139 | 
dic[lx[i]] = train_set_lx[i][index*batch_size:(index+1)*batch_size] 140 | dic[lxmask[i]] = train_set_lx_mask[i][index*batch_size:(index+1)*batch_size] 141 | dic[rx] = train_set_rx[index*batch_size:(index+1)*batch_size] 142 | dic[sessionmask] = train_set_session_mask[index*batch_size:(index+1)*batch_size] 143 | dic[rxmask] = train_set_rx_mask[index*batch_size:(index+1)*batch_size] 144 | dic[y] = train_set_y[index*batch_size:(index+1)*batch_size] 145 | 146 | val_dic = {} 147 | for i in range(max_turn): 148 | val_dic[lx[i]] = val_set_lx[i][index*batch_size:(index+1)*batch_size] 149 | val_dic[lxmask[i]] = val_set_lx_mask[i][index*batch_size:(index+1)*batch_size] 150 | val_dic[rx] = val_set_rx[index*batch_size:(index+1)*batch_size] 151 | val_dic[sessionmask] = val_set_session_mask[index*batch_size:(index+1)*batch_size] 152 | val_dic[rxmask] = val_set_rx_mask[index*batch_size:(index+1)*batch_size] 153 | val_dic[y] = val_set_y[index*batch_size:(index+1)*batch_size] 154 | 155 | 156 | sentence2vec = GRU(n_in=word_embedding_size,n_hidden=hiddensize,n_out=hiddensize) 157 | 158 | for i in range(max_turn): 159 | q_embedding.append(sentence2vec(llayer0_input[i],lxmask[i],True)) 160 | r_embedding = sentence2vec(rlayer0_input,rxmask,True) 161 | 162 | pooling_layer = ConvSim(rng,max_l,session_input_size,hidden_size=hiddensize) 163 | 164 | poolingoutput = [] 165 | 166 | 167 | for i in range(max_turn): 168 | poolingoutput.append(pooling_layer(llayer0_input[i],rlayer0_input, 169 | q_embedding[i],r_embedding)) 170 | 171 | session2vec = GRU(n_in=session_input_size,n_hidden=session_hidden_size,n_out=session_hidden_size) 172 | res = session2vec(T.stack(poolingoutput,1),sessionmask,True) 173 | w = theano.shared(value=np.ones((max_turn,),dtype=theano.config.floatX),borrow=True) 174 | 175 | 176 | test = theano.function([index],T.sum(res * w[None,:,None],1) 177 | ,givens=val_dic,on_unused_input='ignore') 178 | print test(0).shape 179 | print test(0) 180 | classifier = 
LogisticRegression(T.sum(res * w[None,:,None],1), session_hidden_size,2,rng) 181 | 182 | cost = classifier.negative_log_likelihood(y) 183 | error = classifier.errors(y) 184 | opt = Adam() 185 | params = classifier.params 186 | params += sentence2vec.params 187 | params += session2vec.params 188 | params += pooling_layer.params 189 | params += [Words,w] 190 | 191 | 192 | load_params(params,model_name) 193 | 194 | predict = classifier.predict_prob 195 | 196 | val_model = theano.function([index], [y,predict,cost,error], givens=val_dic 197 | ,on_unused_input='ignore') 198 | f = open('result.txt','w') 199 | loss = 0. 200 | for minibatch_index in xrange(datasets[1].shape[0]/batch_size): 201 | a,b,c,d = val_model(minibatch_index) 202 | print c 203 | loss += c 204 | #print b.shape 205 | for i in range(batch_size): 206 | f.write(str(b[i][1])) 207 | f.write('\t') 208 | f.write(str(a[i])) 209 | f.write('\n') 210 | #print b[i] 211 | print loss/(datasets[1].shape[0]/batch_size) 212 | 213 | 214 | def load_params(params,filename): 215 | f = open(filename,'rb') # binary mode, matching the 'wb' used by save_params 216 | num_params = cPickle.load(f) 217 | for p,w in zip(params,num_params): 218 | p.set_value(w.astype('float32'),borrow=True) 219 | print "load successfully" 220 | 221 | def train(datasets, 222 | U, # pre-trained word embeddings 223 | n_epochs=5,batch_size=20,max_l = 100,hidden_size=100,word_embedding_size=100, 224 | session_hidden_size=50,session_input_size =50, model_name = 'SMN_last.bin'): 225 | hiddensize = hidden_size 226 | U = U.astype(dtype=theano.config.floatX) 227 | rng = np.random.RandomState(3435) 228 | lsize, rsize = max_l,max_l 229 | sessionmask = T.matrix() 230 | lx = [] 231 | lxmask = [] 232 | for i in range(max_turn): 233 | lx.append(T.matrix()) 234 | lxmask.append(T.matrix()) 235 | 236 | index = T.lscalar() 237 | rx = T.matrix('rx') 238 | rxmask = T.matrix() 239 | y = T.ivector('y') 240 | Words = theano.shared(value = U, name = "Words") 241 | llayer0_input = [] 242 | for i in range(max_turn): 243 | 
llayer0_input.append(Words[T.cast(lx[i].flatten(),dtype="int32")]\ 244 | .reshape((lx[i].shape[0],lx[i].shape[1],Words.shape[1]))) 245 | 246 | rlayer0_input = Words[T.cast(rx.flatten(),dtype="int32")].reshape((rx.shape[0],rx.shape[1],Words.shape[1])) # input: word embeddings of the mini batch 247 | 248 | 249 | train_set, dev_set, test_set = datasets[0], datasets[1], datasets[2] 250 | 251 | train_set_lx = [] 252 | train_set_lx_mask = [] 253 | q_embedding = [] 254 | offset = 2 * lsize 255 | for i in range(max_turn): 256 | train_set_lx.append(theano.shared(np.asarray(train_set[:,offset*i:offset*i + lsize] 257 | ,dtype=theano.config.floatX),borrow=True)) 258 | train_set_lx_mask.append(theano.shared(np.asarray(train_set[:,offset*i + lsize:offset*i + 2*lsize] 259 | ,dtype=theano.config.floatX),borrow=True)) 260 | train_set_rx = theano.shared(np.asarray(train_set[:,offset*max_turn:offset*max_turn + lsize] 261 | ,dtype=theano.config.floatX),borrow=True) 262 | train_set_rx_mask= theano.shared(np.asarray(train_set[:,offset*max_turn +lsize:offset*max_turn +2 *lsize] 263 | ,dtype=theano.config.floatX),borrow=True) 264 | train_set_session_mask= theano.shared(np.asarray(train_set[:,-max_turn-1:-1] 265 | ,dtype=theano.config.floatX),borrow=True) 266 | train_set_y =theano.shared(np.asarray(train_set[:,-1],dtype="int32"),borrow=True) 267 | 268 | val_set_lx = [] 269 | val_set_lx_mask = [] 270 | for i in range(max_turn): 271 | val_set_lx.append(theano.shared(np.asarray(dev_set[:,offset*i:offset*i + lsize] 272 | ,dtype=theano.config.floatX),borrow=True)) 273 | val_set_lx_mask.append(theano.shared(np.asarray(dev_set[:,offset*i + lsize:offset*i + 2*lsize] 274 | ,dtype=theano.config.floatX),borrow=True)) 275 | 276 | val_set_rx = theano.shared(np.asarray(dev_set[:,offset*max_turn:offset*max_turn + lsize],dtype=theano.config.floatX),borrow=True) 277 | val_set_rx_mask = theano.shared(np.asarray(dev_set[:,offset*max_turn +lsize:offset*max_turn +2 
*lsize],dtype=theano.config.floatX),borrow=True) 278 | val_set_session_mask = theano.shared(np.asarray(dev_set[:,-max_turn-1:-1] 279 | ,dtype=theano.config.floatX),borrow=True) 280 | val_set_y =theano.shared(np.asarray(dev_set[:,-1],dtype="int32"),borrow=True) 281 | 282 | dic = {} 283 | for i in range(max_turn): 284 | dic[lx[i]] = train_set_lx[i][index*batch_size:(index+1)*batch_size] 285 | dic[lxmask[i]] = train_set_lx_mask[i][index*batch_size:(index+1)*batch_size] 286 | dic[rx] = train_set_rx[index*batch_size:(index+1)*batch_size] 287 | dic[sessionmask] = train_set_session_mask[index*batch_size:(index+1)*batch_size] 288 | dic[rxmask] = train_set_rx_mask[index*batch_size:(index+1)*batch_size] 289 | dic[y] = train_set_y[index*batch_size:(index+1)*batch_size] 290 | 291 | val_dic = {} 292 | for i in range(max_turn): 293 | val_dic[lx[i]] = val_set_lx[i][index*batch_size:(index+1)*batch_size] 294 | val_dic[lxmask[i]] = val_set_lx_mask[i][index*batch_size:(index+1)*batch_size] 295 | val_dic[rx] = val_set_rx[index*batch_size:(index+1)*batch_size] 296 | val_dic[sessionmask] = val_set_session_mask[index*batch_size:(index+1)*batch_size] 297 | val_dic[rxmask] = val_set_rx_mask[index*batch_size:(index+1)*batch_size] 298 | val_dic[y] = val_set_y[index*batch_size:(index+1)*batch_size] 299 | 300 | 301 | sentence2vec = GRU(n_in=word_embedding_size,n_hidden=hiddensize,n_out=hiddensize) 302 | 303 | for i in range(max_turn): 304 | q_embedding.append(sentence2vec(llayer0_input[i],lxmask[i],True)) 305 | r_embedding = sentence2vec(rlayer0_input,rxmask,True) 306 | 307 | pooling_layer = ConvSim(rng,max_l,session_input_size,hidden_size=hiddensize) 308 | 309 | poolingoutput = [] 310 | 311 | 312 | for i in range(max_turn): 313 | poolingoutput.append(pooling_layer(llayer0_input[i],rlayer0_input, 314 | q_embedding[i],r_embedding)) 315 | 316 | session2vec = GRU(n_in=session_input_size,n_hidden=session_hidden_size,n_out=session_hidden_size) 317 | res = 
session2vec(T.stack(poolingoutput,1),sessionmask,True) 318 | w = theano.shared(value=np.ones((max_turn,),dtype=theano.config.floatX),borrow=True) 319 | 320 | 321 | test = theano.function([index],T.sum(res * w[None,:,None],1) 322 | ,givens=val_dic,on_unused_input='ignore') 323 | print test(0).shape 324 | print test(0) 325 | classifier = LogisticRegression(T.sum(res * w[None,:,None],1), session_hidden_size,2,rng) 326 | 327 | cost = classifier.negative_log_likelihood(y) 328 | error = classifier.errors(y) 329 | opt = Adam() 330 | params = classifier.params 331 | params += sentence2vec.params 332 | params += session2vec.params 333 | params += pooling_layer.params 334 | params += [Words,w] 335 | 336 | grad_updates = opt.Adam(cost=cost,params=params,lr = 0.001) #opt.sgd_updates_adadelta(params, cost, lr_decay, 1e-8, sqr_norm_lim) 337 | 338 | train_model = theano.function([index], cost,updates=grad_updates, givens=dic,on_unused_input='ignore') 339 | val_model = theano.function([index], [cost,error], givens=val_dic,on_unused_input='ignore') 340 | best_dev = 1. 341 | n_train_batches = datasets[0].shape[0]/batch_size 342 | for i in xrange(n_epochs): 343 | cost = 0 344 | total = 0. 
345 | for minibatch_index in np.random.permutation(range(n_train_batches)): 346 | batch_cost = train_model(minibatch_index) 347 | total = total + 1 348 | cost = cost + batch_cost 349 | if total % 50 == 0: 350 | print total, cost/total 351 | cost = cost / n_train_batches 352 | print "echo %d loss %f" % (i,cost) 353 | 354 | cost=0 355 | errors = 0 356 | j = 0 357 | for minibatch_index in xrange(datasets[1].shape[0]/batch_size): 358 | tcost, terr = val_model(minibatch_index) 359 | cost += tcost 360 | errors += terr 361 | j = j+1 362 | cost = cost / j 363 | errors = errors / j 364 | if cost < best_dev: 365 | best_dev = cost 366 | save_params(params,model_name) 367 | print "echo %d dev_loss %f" % (i,cost) 368 | print "echo %d dev_accuracy %f" % (i,1 - errors) 369 | 370 | def save_params(params,filename): 371 | num_params = [p.get_value() for p in params] 372 | f = open(filename,'wb') 373 | cPickle.dump(num_params,f) 374 | 375 | def get_session_mask(sents): 376 | session_mask = [0.] * max_turn 377 | turns = [] 378 | for sent in sents.split('_t_'): 379 | words = sent.split() 380 | if len(words) > 0: 381 | turns.append(len(words)) 382 | 383 | for i in range(max_turn): 384 | if max_turn - i <= len(turns): 385 | session_mask[-(max_turn-i)] = 1. 386 | #print session_mask 387 | return session_mask 388 | #print final 389 | 390 | 391 | def make_data(revs, word_idx_map, max_l=50, filter_h=3, val_test_splits=[2,3],validation_num = 50000): 392 | """ 393 | Transforms sentences into a 2-d matrix. 
394 | """ 395 | train, val, test = [], [], [] 396 | for rev in revs: 397 | sent = get_idx_from_sent_msg(rev["m"], word_idx_map, max_l, True) 398 | sent += get_idx_from_sent(rev["r"], word_idx_map, max_l, True) 399 | sent += get_session_mask(rev["m"]) 400 | sent.append(int(rev["y"])) 401 | if len(val) > validation_num: # the first validation_num+1 examples become the dev set, the rest training data 402 | train.append(sent) 403 | else: 404 | val.append(sent) 405 | 406 | train = np.array(train,dtype="int") 407 | val = np.array(val,dtype="int") 408 | test = np.array(test,dtype="int") 409 | print 'training data', len(train),'val data', len(val) 410 | return [train, val, test] 411 | 412 | if __name__=="__main__": 413 | train_flag = True 414 | max_word_per_utterence = 50 415 | dataset = r"../ubuntu_data.mul.100d.fullw2v.train" 416 | x = cPickle.load(open(dataset,"rb")) 417 | revs, wordvecs, max_l = x[0], x[1], x[2] 418 | 419 | if train_flag == False: 420 | x = cPickle.load(open(r"../ubuntu_data.mul.test","rb")) 421 | revs, wordvecs2, max_l2 = x[0], x[1], x[2] 422 | datasets = make_data(revs,wordvecs.word_idx_map,max_l=max_word_per_utterence) 423 | 424 | if train_flag == True: 425 | train(datasets,wordvecs.W,batch_size=200,max_l=max_word_per_utterence 426 | ,hidden_size=100,word_embedding_size=100) 427 | else: 428 | predict(datasets,wordvecs.W,batch_size=200,max_l=max_word_per_utterence 429 | ,hidden_size=100,word_embedding_size=100) -------------------------------------------------------------------------------- /theano_src/SimAsImage.py: -------------------------------------------------------------------------------- 1 | 2 | import os 3 | import sys 4 | import timeit 5 | 6 | import numpy 7 | from CNN import QALeNetConvPoolLayer,LeNetConvPoolLayer2 8 | from Classifier import HiddenLayer2 9 | import theano 10 | import theano.tensor as T 11 | import numpy as np 12 | def ortho_weight(ndim): 13 | W = np.random.randn(ndim, ndim) 14 | u, s, v = np.linalg.svd(W) 15 | return u.astype('float32') 16 | def kmaxpooling(input,input_shape,k): 17 | sorted_values = 
T.argsort(input,axis=3) 18 | topmax_indexes = sorted_values[:,:,:,-k:] 19 | # sort indexes so that we keep the correct order within the sentence 20 | topmax_indexes_sorted = T.sort(topmax_indexes) 21 | 22 | #given that topmax only gives the index of the third dimension, we need to generate the other 3 dimensions 23 | dim0 = T.arange(0,input_shape[0]).repeat(input_shape[1]*input_shape[2]*k) 24 | dim1 = T.arange(0,input_shape[1]).repeat(k*input_shape[2]).reshape((1,-1)).repeat(input_shape[0],axis=0).flatten() 25 | dim2 = T.arange(0,input_shape[2]).repeat(k).reshape((1,-1)).repeat(input_shape[0]*input_shape[1],axis=0).flatten() 26 | dim3 = topmax_indexes_sorted.flatten() 27 | return input[dim0,dim1,dim2,dim3].reshape((input_shape[0], input_shape[1], input_shape[2], k)) 28 | 29 | class PoolingSim(object): 30 | def __init__(self, rng, n_in, n_out, W=None, b=None, 31 | activation=T.tanh): 32 | self.W = theano.shared(value=ortho_weight(100), name='W', borrow=True) 33 | self.activation = activation 34 | self.hidden_layer = HiddenLayer2(rng,2*5*n_in,n_out) 35 | 36 | self.params = [self.W] + self.hidden_layer.params 37 | 38 | def __call__(self, input_l,input_r,batch_size,max_l): 39 | channel_1 = T.batched_dot(input_l,input_r.dimshuffle(0,2,1)) 40 | channel_2 = T.batched_dot(T.dot(input_l,self.W),input_r.dimshuffle(0,2,1)) 41 | input = T.stack([channel_1,channel_2],axis=1) 42 | poolingoutput = kmaxpooling(input,[batch_size,2,max_l,max_l],5) 43 | mlp_in = T.flatten(poolingoutput,2) 44 | return self.hidden_layer(mlp_in) 45 | 46 | class PoolingSim3(object): 47 | def __init__(self, rng, n_in, n_out, W=None, b=None, 48 | activation=T.tanh,hidden_size=100): 49 | self.W = theano.shared(value=ortho_weight(hidden_size), name='W', borrow=True) 50 | self.activation = activation 51 | self.hidden_layer = HiddenLayer2(rng,2*5*n_in,n_out) 52 | 53 | self.params = [self.W] + self.hidden_layer.params 54 | 55 | def __call__(self,origin_l,origin_r,input_l,input_r,batch_size,max_l): 56 | 
channel_1 = T.batched_dot(origin_l,origin_r.dimshuffle(0,2,1)) 57 | channel_2 = T.batched_dot(T.dot(input_l,self.W),input_r.dimshuffle(0,2,1)) 58 | input = T.stack([channel_1,channel_2],axis=1) 59 | poolingoutput = kmaxpooling(input,[batch_size,2,max_l,max_l],5) 60 | mlp_in = T.flatten(poolingoutput,2) 61 | return self.hidden_layer(mlp_in) 62 | 63 | class PoolingSim2(object): 64 | def __init__(self, rng, n_in, n_out,tensor_num = 3, 65 | activation=T.tanh): 66 | self.tensor_num = tensor_num 67 | self.W = [] 68 | for i in range(tensor_num): 69 | self.W.append(theano.shared(value=ortho_weight(100), borrow=True)) 70 | self.activation = activation 71 | self.hidden_layer = HiddenLayer2(rng,tensor_num*5*n_in,n_out) 72 | 73 | self.params = self.W + self.hidden_layer.params 74 | 75 | def __call__(self, input_l,input_r,batch_size,max_l): 76 | channels = [] 77 | for i in range(self.tensor_num): 78 | channels.append(T.batched_dot(T.dot(input_l,self.W[i]),input_r.dimshuffle(0,2,1))) 79 | 80 | input = T.stack(channels,axis=1) 81 | poolingoutput = kmaxpooling(input,[batch_size,self.tensor_num,max_l,max_l],5) 82 | mlp_in = T.flatten(poolingoutput,2) 83 | return self.hidden_layer(mlp_in) 84 | 85 | class ConvSim(object): 86 | def __init__(self, rng, n_in, n_out, W=None, b=None, 87 | activation=T.tanh,hidden_size=100): 88 | self.W = theano.shared(value=ortho_weight(hidden_size), borrow=True) 89 | self.activation = activation 90 | 91 | self.conv_layer = LeNetConvPoolLayer2(rng,filter_shape=(8,2,3,3), 92 | image_shape=(200,2,50,50) 93 | ,poolsize=(3,3),non_linear='relu') 94 | 95 | self.hidden_layer = HiddenLayer2(rng,2048,n_out) 96 | self.params = [self.W,] + self.conv_layer.params + self.hidden_layer.params 97 | def Get_M2(self,input_l,input_r): 98 | return T.batched_dot(T.dot(input_l,self.W),input_r.dimshuffle(0,2,1)) 99 | 100 | def __call__(self, origin_l,origin_r,input_l,input_r): 101 | channel_1 = T.batched_dot(origin_l,origin_r.dimshuffle(0,2,1)) 102 | channel_2 = 
T.batched_dot(T.dot(input_l,self.W),input_r.dimshuffle(0,2,1)) 103 | input = T.stack([channel_1,channel_2],axis=1) 104 | mlp_in = T.flatten(self.conv_layer(input),2) 105 | 106 | return self.hidden_layer(mlp_in) 107 | 108 | class ConvSim2(object): 109 | def __init__(self, rng, n_in, n_out, W=None, b=None, 110 | activation=T.tanh,hidden_size=100): 111 | self.W = theano.shared(value=ortho_weight(hidden_size), borrow=True) 112 | self.activation = activation 113 | 114 | self.conv_layer = LeNetConvPoolLayer2(rng,filter_shape=(8,1,3,3), 115 | image_shape=(200,1,50,50) 116 | ,poolsize=(3,3),non_linear='relu') 117 | 118 | self.hidden_layer = HiddenLayer2(rng,2048,n_out) 119 | self.params = self.conv_layer.params + self.hidden_layer.params 120 | 121 | def __call__(self, origin_l,origin_r): 122 | channel_1 = T.batched_dot(origin_l,origin_r.dimshuffle(0,2,1)) 123 | input =channel_1.dimshuffle(0,'x',1,2) 124 | mlp_in = T.flatten(self.conv_layer(input),2) 125 | 126 | return self.hidden_layer(mlp_in) -------------------------------------------------------------------------------- /theano_src/logistic_sgd.py: -------------------------------------------------------------------------------- 1 | import cPickle, gzip, numpy 2 | import theano 3 | import theano.tensor as T 4 | 5 | class SumRegression(object): 6 | def __init__(self,input,n_in,n_out,rng): 7 | self.W = theano.shared(value=numpy.ones((n_in,n_out),dtype=theano.config.floatX) 8 | ,borrow=True,name='W') 9 | self.predict_prob = T.nnet.softmax(T.dot(input,self.W)) 10 | self.predict_y = T.argmax(self.predict_prob,axis=1) 11 | self.params=[] 12 | 13 | def negative_log_likelihood(self, y): 14 | #return - T.mean(y * T.log(self.predict_prob) + (1 - y) * T.log(1 - self.predict_prob)) 15 | return -T.mean(T.log(self.predict_prob)[T.arange(y.shape[0]), y]) 16 | 17 | def errors(self,y): 18 | if y.dtype.startswith('int'): 19 | return T.mean(T.neq(self.predict_y,y)) 20 | else: 21 | raise NotImplementedError 22 | 23 | class 
LogisticRegression(object): 24 | def __init__(self,input,n_in,n_out,rng): 25 | self.W = theano.shared( numpy.asarray( 26 | rng.uniform( 27 | low=-numpy.sqrt(6. / (n_in + n_out)), 28 | high=numpy.sqrt(6. / (n_in + n_out)), 29 | size=(n_in, n_out) 30 | ), 31 | dtype=theano.config.floatX 32 | )) 33 | self.b = theano.shared(value=numpy.zeros(n_out,dtype=theano.config.floatX),borrow=True,name='b') 34 | self.predict_prob = T.nnet.softmax(T.dot(input,self.W)+self.b) 35 | self.predict_y = T.argmax(self.predict_prob,axis=1) 36 | self.params=[self.W,self.b] 37 | 38 | def negative_log_likelihood(self, y): 39 | #return - T.mean(y * T.log(self.predict_prob) + (1 - y) * T.log(1 - self.predict_prob)) 40 | return -T.mean(T.log(self.predict_prob)[T.arange(y.shape[0]), y]) 41 | 42 | def errors(self,y): 43 | if y.dtype.startswith('int'): 44 | return T.mean(T.neq(self.predict_y,y)) 45 | else: 46 | raise NotImplementedError 47 | 48 | def load_data(dataset): 49 | def shared_data(data_xy): 50 | data_x,data_y = data_xy 51 | shared_x = theano.shared(data_x) 52 | shared_y = theano.shared(data_y) 53 | return shared_x, T.cast(shared_y,'int32') 54 | 55 | f = gzip.open(dataset) 56 | train_set, dev_set, test_set = cPickle.load(f) 57 | f.close() 58 | 59 | train_set_x, train_set_y = shared_data(train_set) 60 | dev_set_x, dev_set_y = shared_data(dev_set) 61 | test_set_x, test_set_y = shared_data(test_set) 62 | 63 | rval = [(train_set_x,train_set_y),(dev_set_x, dev_set_y ), 64 | (test_set_x,test_set_y)] 65 | return rval 66 | 67 | 68 | def sgd_optimization_mnist(learning_rate=0.13, n_epochs=1000, 69 | dataset='mnist.pkl.gz', 70 | batch_size=600): 71 | data = load_data(dataset) 72 | train_x, train_y = data[0] 73 | dev_x, dev_y = data[1] 74 | test_x, test_y = data[2] 75 | 76 | n_train_batches = train_x.get_value(borrow=True).shape[0]//batch_size 77 | n_dev_batches = dev_x.get_value(borrow=True).shape[0]//batch_size 78 | n_test_batches = test_x.get_value(borrow=True).shape[0]//batch_size 79 | 80 
| print n_dev_batches 81 | 82 | x = T.matrix('x') 83 | y = T.ivector('y') 84 | classifier = LogisticRegression(input=x,n_in=28*28,n_out=10,rng=numpy.random.RandomState(1234)) 85 | cost = classifier.negative_log_likelihood(y) 86 | print 'building model...' 87 | index = T.lscalar() 88 | 89 | g_w = T.grad(cost=cost, wrt= classifier.W) 90 | g_b = T.grad(cost = cost,wrt = classifier.b) 91 | updates = [(classifier.W,classifier.W-learning_rate*g_w), 92 | (classifier.b,classifier.b-learning_rate*g_b)] 93 | train_model = theano.function(inputs=[index],outputs = cost,updates=updates, 94 | givens={ 95 | x: train_x[index*batch_size:(index+1)*batch_size], 96 | y: train_y[index*batch_size:(index+1)*batch_size] 97 | }) 98 | 99 | validate_model = theano.function(inputs=[index],outputs = classifier.errors(y), 100 | givens={ 101 | x: dev_x[index*batch_size:(index+1)*batch_size], 102 | y: dev_y[index*batch_size:(index+1)*batch_size] 103 | }) 104 | epoch = 0 105 | while epoch < n_epochs: 106 | epoch = epoch +1 107 | train_error = 0 108 | for minibatch_index in range(n_train_batches): 109 | minibatch_avg_cost = train_model(minibatch_index) 110 | #print minibatch_avg_cost 111 | train_error = train_error+ minibatch_avg_cost 112 | if minibatch_index == n_train_batches-1: 113 | validation_losses = [validate_model(i) for i in range(n_dev_batches)] 114 | this_validation_losses = numpy.mean(validation_losses) 115 | #print validation_losses 116 | print('epoch %i, minibatch %i, validation error %f'%(epoch,minibatch_index+1,this_validation_losses)) 117 | 118 | if __name__ == '__main__': 119 | sgd_optimization_mnist() -------------------------------------------------------------------------------- /train.sample: -------------------------------------------------------------------------------- 1 | 1 my hometown 的中文歌词很感动英文版是小田亲自翻译的更加感动求 v _url_ thanks 3 2 | 0 my hometown 的中文歌词很感动英文版是小田亲自翻译的更加感动求 v _url_ 是我眼拙心盲 3 | 1 昆明那里配眼镜比较便宜云大附近很多店应该有竞争价格会下来一点的吧给推荐个云大附近的吧谢谢去了就能看到比如云光什么的 4 | 0 
昆明那里配眼镜比较便宜云大附近很多店应该有竞争价格会下来一点的吧给推荐个云大附近的吧谢谢你的他毕竟还是说了我的完全没有任何消息我伤害了他于是 15 天没消息 5 | 1 看原版英文电影学纯正英语大爱老友记反复看了好多次了一样光盘都快被我看花了那你现在的英语应该不错了 6 | 0 看原版英文电影学纯正英语大爱老友记反复看了好多次了一样光盘都快被我看花了我也想知道啊感觉暧昧的时候很大胆真正在一起后怎么小心翼翼的了是不是闷葫芦了我月鱼金秤我不懂啊 7 | 1 视频资源综合帖开头的我只知道这两个请教一下怎么我打开这两个网站之后没有看到播放的字样呢是不是还需要另外下载播放器求解 gracias 下载了 ftv 播放器后还是不能看怎么办好像真的不能看啊 8 | 0 视频资源综合帖开头的我只知道这两个请教一下怎么我打开这两个网站之后没有看到播放的字样呢是不是还需要另外下载播放器求解 gracias 本来就是要坚持做你想永久那是不可能的都是要做很多次才可以 9 | 1 杀人者海明威翻译者是上海人把客字看出来的有阿拉上海宁才会翻得出的词语啊哈哈哈哈哈哈当初读的时候注意到了 10 | 0 杀人者海明威翻译者是上海人把客字看出来的有阿拉上海宁才会翻得出的词语啊哈哈哈主要女孩子的皮肤这个东西尤其脸上的皮肤要是出了什么大问题那真的一辈子都完了女人这方面真的很被动小心点总是好的 11 | 1 小广告专贴有料的小广告还挺多最近想买瑜伽铺巾有推荐的产品没搬个小板凳坐等另外问候迷糊蕊麒和朋友们 de 纯天然橡胶瑜伽垫防滑性超棒舒适耐用瑜伽习练的必备品哦还有漂亮的王燕老师极力推荐你心水了吗 url z0dszsi 你是优胜美地的啊哈哈我去过你们那环境很好 12 | 0 小广告专贴有料的小广告还挺多最近想买瑜伽铺巾有推荐的产品没搬个小板凳坐等另外问候迷糊蕊麒和朋友们 de 纯天然橡胶瑜伽垫防滑性超棒舒适耐用瑜伽习练的必备品哦还有漂亮的王燕老师极力推荐你心水了吗 url z0dszsi 走着 13 | 1 际恒公关公司怎么样我特别想去际恒实习不知可否 · · · · · · grace 现在想来吗简历发给我看下吧是 hr 大人么我也想去你家试一试求介绍我不是 hr 不过可以和你聊聊无意中发现恒际的贴请问您现在还在恒际吗是的还在同行吗是啊同行啊那请问贵公司还招人吗现在招啊我们公司招 14 | 0 际恒公关公司怎么样我特别想去际恒实习不知可否 · · · · · · grace 现在想来吗简历发给我看下吧是 hr 大人么我也想去你家试一试求介绍我不是 hr 不过可以和你聊聊无意中发现恒际的贴请问您现在还在恒际吗是的还在同行吗是啊同行啊那请问贵公司还招人吗现在招啊我刚跟他发短信了他说你实在要的话就法院诉讼去吧你说难道他已经有办法还是之前有人告过他但是没成功 15 | 1 私房菜删帖及封禁相关细则传很久很多菜的一个帖子被删了阿土伯你给我捞出来了我刚回了个人问的问题结果又进去了我才刚释放啊你再给我捞出来吧我就放那自己看不回帖了还是有违禁词与 wangshang gouwu 有关的一切词都是违禁词帖子里确实有违禁词你给我捞出来了我可以进到帖子里把这个词删了吗不会进去修改还没有删掉就又进回收站了吧能删你给我删可以吗 16 | 0 私房菜删帖及封禁相关细则传很久很多菜的一个帖子被删了阿土伯你给我捞出来了我刚回了个人问的问题结果又进去了我才刚释放啊你再给我捞出来吧我就放那自己看不回帖了还是有违禁词与 wangshang gouwu 有关的一切词都是违禁词嗯哼 17 | 1 殷琪卸妆我怎么找不到真的很夸张眼睛也差太多了吧哪里能看到我就是上面给的地址看的不过现在看不了了 18 | 0 殷琪卸妆我怎么找不到真的很夸张眼睛也差太多了吧哪里能看到 s 吧粉色是 s 的 hm 的都会偏大相当于 m 19 | 1 日本早稻田大学图书馆网站有大量中国古籍的扫描版直接点开 pdf 右上角有下载选项另外虽然是多年前的帖子但现在仍然能用非常感谢请教是一张一张下还是全部整个 pdf 一起下不可能一次下一张的恩谢谢 20 | 0 日本早稻田大学图书馆网站有大量中国古籍的扫描版直接点开 pdf 右上角有下载选项另外虽然是多年前的帖子但现在仍然能用非常感谢请教是一张一张下还是全部整个 pdf 一起下不可能一次下一张的我觉得他已经不光是荧幕形象小气巴拉的问题了感觉现实生活中也是个精巴到不行的抠门鬼 21 | 1 
刮痧与拍痧拍痧拍出来的是湿气毒气浊气放心吧我们的身体很聪明的好的东西跑不出来拍完喝些温开水使这些代谢的废物尽快排出体外是不是要等痧退了才能继续拍或者洗澡据说继续拍可以拍散的我没试过一般等他自然消退再继续我的气血水平较低不敢每天都拍几个小时后可以洗澡话说我拍完都三四天了还没退完我出痧多的时候一周才退下去别急病毒不是一天积累起来的也不可能短时间内排完养生可是一生的事业哈哈是滴不过目前只有一些隐隐约约的红点了 22 | 0 刮痧与拍痧拍痧拍出来的是湿气毒气浊气放心吧我们的身体很聪明的好的东西跑不出来拍完喝些温开水使这些代谢的废物尽快排出体外是不是要等痧退了才能继续拍或者洗澡据说继续拍可以拍散的我没试过一般等他自然消退再继续我的气血水平较低不敢每天都拍几个小时后可以洗澡话说我拍完都三四天了还没退完我出痧多的时候一周才退下去别急病毒不是一天积累起来的也不可能短时间内排完养生可是一生的事业哼你才不是呢 23 | 1 刮痧与拍痧拍痧拍出来的是湿气毒气浊气放心吧我们的身体很聪明的好的东西跑不出来拍完喝些温开水使这些代谢的废物尽快排出体外是不是要等痧退了才能继续拍或者洗澡据说继续拍可以拍散的我没试过一般等他自然消退再继续我的气血水平较低不敢每天都拍几个小时后可以洗澡话说我拍完都三四天了还没退完我出痧多的时候一周才退下去别急病毒不是一天积累起来的也不可能短时间内排完养生可是一生的事业哈哈是滴不过目前只有一些隐隐约约的红点了 24 | 0 刮痧与拍痧拍痧拍出来的是湿气毒气浊气放心吧我们的身体很聪明的好的东西跑不出来拍完喝些温开水使这些代谢的废物尽快排出体外是不是要等痧退了才能继续拍或者洗澡据说继续拍可以拍散的我没试过一般等他自然消退再继续我的气血水平较低不敢每天都拍几个小时后可以洗澡话说我拍完都三四天了还没退完我出痧多的时候一周才退下去别急病毒不是一天积累起来的也不可能短时间内排完养生可是一生的事业要有信心啦 25 | 1 东莞哪里有做 diy 蛋糕的地方环境很温馨服务非常好女朋友说蛋糕很好吃从来没吃过这么好吃的蛋糕下次要带女朋友一起去请问后面四个问号啥意思我发的是表情啊怎么变成问号了 26 | 0 东莞哪里有做 diy 蛋糕的地方环境很温馨服务非常好女朋友说蛋糕很好吃从来没吃过这么好吃的蛋糕下次要带女朋友一起去请问后面四个问号啥意思我问了呢 15 天内超过 15 天就不能无条件换机有任何问题都算保修 27 | 1 想听听大家对论摄影看法大家觉得那个版本的论摄影翻译的更好一些呢黄灿然译的第二版好些貌似是 2010年再版的之前湖南美术出的错误奇多黄灿然的版本从字面上的错误少一点从翻译的角度没好多少基本无法达意还是看原著吧 28 | 0 想听听大家对论摄影看法大家觉得那个版本的论摄影翻译的更好一些呢黄灿然译的第二版好些貌似是 2010年再版的之前湖南美术出的错误奇多不然怎么办 29 | 1 身高 167cm 女生 172 爱穿高跟么微跟五公分左右嗯我喜欢的高度看来你 182 你好聪明 30 | 0 身高 167cm 女生 172 爱穿高跟么微跟五公分左右嗯我喜欢的高度看来你 182 是的几天不见就特别想 31 | 1 征同游集中贴性别女时间 2012年 6月 17 到 23 方式自由行目前一人求被捡预算不清楚还没概念机票已买上海广州暹粒广州上海住宿标准无所谓联系豆油性格和善为什么要先去广州东航出的特价机票必须得转一下后来才发现真傻价钱也没咋便宜诶时间搭不上我要等七月份高温假出来以后再定嘿嘿 · 我也准备七月去等到放假可能七月十号左右目前还有一同学到时联系下吧看看时间能不能对上可以同游吼吼 32 | 0 征同游集中贴性别女时间 2012年 6月 17 到 23 方式自由行目前一人求被捡预算不清楚还没概念机票已买上海广州暹粒广州上海住宿标准无所谓联系豆油性格和善为什么要先去广州东航出的特价机票必须得转一下后来才发现真傻价钱也没咋便宜诶时间搭不上我要等七月份高温假出来以后再定嘿嘿 · 我也准备七月去等到放假可能七月十号左右目前还有一同学抢到沙发好开心 33 | 1 济南哪里买明信片今儿去中山公园了啥也没淘到我还有这个想法幸好你去了要不跑冤枉路了那是 2年前的事了从上看下来有去中山公园的冲动了 34 | 0 济南哪里买明信片今儿去中山公园了啥也没淘到我还有这个想法幸好你去了要不跑冤枉路了那是 2年前的事了我睡到现在才起来所以是早安 35 | 1 有一天我一定要买齐所有幾米的本本我在当当网上模拟买几米好几次了喜欢装帧精美的看着精装的就舒服可是好贵啊舍不得要是有人送我一套我就娶她哈哈做梦了有人送我我就嫁他我送了本星空给她我没有错过她愿你幸福啊 36 | 0 
有一天我一定要买齐所有幾米的本本我在当当网上模拟买几米好几次了喜欢装帧精美的看着精装的就舒服可是好贵啊舍不得要是有人送我一套我就娶她哈哈做梦了有人送我我就嫁他我送了本星空给她我没有错过她因为不在家所以没拍照过年咯 37 | 1 家里这么潮湿有啥好办法啊我家的天花残了水滴下来了也掉皮了装好了三年的房子就这么废了不住还好才住不到一个月啊洗手间的天花在滴水厨房的地满地的水不能拖的家里又潮又水又脏看得我想吐唯一不怎么潮的是我的睡房我天天空调抽潮门都不敢打开所以我现在天天在床上呆着这种天气再这样下去会疯的挺悲催的噢貌似潮湿的天气还会持续一段时间但中途会有太阳出来的你家会像十五二十一样整个天花板塌掉么很壮观诶你心态真好还欣赏壮不壮观我家要是这样我得郁闷死 38 | 0 家里这么潮湿有啥好办法啊我家的天花残了水滴下来了也掉皮了装好了三年的房子就这么废了不住还好才住不到一个月啊洗手间的天花在滴水厨房的地满地的水不能拖的家里又潮又水又脏看得我想吐唯一不怎么潮的是我的睡房我天天空调抽潮门都不敢打开所以我现在天天在床上呆着这种天气再这样下去会疯的挺悲催的噢貌似潮湿的天气还会持续一段时间但中途会有太阳出来的你家会像十五二十一样整个天花板塌掉么很壮观诶哈哈没事儿 39 | 1 致所有和双男纠结中的女人我是射手女跟双子男认识一年多了关系平淡属于相敬如宾的那种昨晚心血来潮发了条信息过去 hi baby 你睡了么他竟然回你不能这样喊次奥他这是拒绝暧昧么是因为有女朋友了还是他只是把我当普通朋友求解 lz 都注销了求解个 p 啊晕呃好吧我发帖了但没人回应肥仔不是回了吗问题是你不接受人家说的啊你问这里的人双子男怎么想的谁都不是他肚里的蛔虫谁知道他怎么想的啊而且看你的文字总觉得是很抽风的那种女生内心淡定的人才搞的掂双子男我承认我很抽风本以为射手和双子很合拍呢哎这是我和他的故事您给个话吧 url zj93ips 没加这个组永远也不会加脑子进水了吗好好的日子不过干嘛非折腾男朋友对方不是人吗也是爸妈生养的不怕折腾跑了吗然后再低三下四的求人家回头无语给你的建议是他找你你就回应不找你别犯贱该干嘛干嘛好永远别打这个号码 40 | 0 致所有和双男纠结中的女人我是射手女跟双子男认识一年多了关系平淡属于相敬如宾的那种昨晚心血来潮发了条信息过去 hi baby 你睡了么他竟然回你不能这样喊次奥他这是拒绝暧昧么是因为有女朋友了还是他只是把我当普通朋友求解 lz 都注销了求解个 p 啊晕呃好吧我发帖了但没人回应肥仔不是回了吗问题是你不接受人家说的啊你问这里的人双子男怎么想的谁都不是他肚里的蛔虫谁知道他怎么想的啊而且看你的文字总觉得是很抽风的那种女生内心淡定的人才搞的掂双子男我承认我很抽风本以为射手和双子很合拍呢哎这是我和他的故事您给个话吧 url zj93ips 没加这个组永远也不会加脑子进水了吗好好的日子不过干嘛非折腾男朋友对方不是人吗也是爸妈生养的不怕折腾跑了吗然后再低三下四的求人家回头无语给你的建议是他找你你就回应不找你别犯贱该干嘛干嘛闪电侠你也看完了 41 | 1 咱们学校那个驾校怎么样看到这个 2010年的帖子真亲切我现在已经不再师大了话说我已经 2010年 6 月份就发证了挺好的据说现在人很多的我那时 2600 你是师大毕业的吗什么专业的啊校友啊不过我还在我 07 级历史的哈哈我是 10 出版的我认识你一 07 的学姐呵呵啊我们这最大的是 08 的啊嗯嗯嗯是 08 的我记错了哈哈额嘿嘿嘿嘿是男是女啊女的呵呵 42 | 0 咱们学校那个驾校怎么样看到这个 2010年的帖子真亲切我现在已经不再师大了话说我已经 2010年 6 月份就发证了挺好的据说现在人很多的我那时 2600 你是师大毕业的吗什么专业的啊校友啊不过我还在我 07 级历史的哈哈我是 10 出版的我认识你一 07 的学姐呵呵啊我们这最大的是 08 的啊嗯嗯嗯是 08 的我记错了哈哈额嘿嘿嘿嘿是男是女啊好的发简历吧 43 | 1 左手摁计算器右手记数 d excel 所谓软件就是预算软件直接上完毕我一直都是手写计算 excel 这个方法好像很厉害学习了可以举个例子么我邮箱 url 谢谢我干安装的一样么 44 | 0 左手摁计算器右手记数 d excel 所谓软件就是预算软件直接上完毕我一直都是手写计算 excel 这个方法好像很厉害学习了可以举个例子么我邮箱 url 谢谢熊抱 45 | 1 无争围棋的棋友们大家互相认识一下吧我先介绍一下自己软件工程师现居深圳无争围棋的发起人和主要维护者爱下围棋有近二十年棋龄但水平不高 url 打不开无争网了郁闷已经恢复正常服务器偶有不稳定情况惭愧见谅 46 | 0 
无争围棋的棋友们大家互相认识一下吧我先介绍一下自己软件工程师现居深圳无争围棋的发起人和主要维护者爱下围棋有近二十年棋龄但水平不高 url 打不开无争网了郁闷把他当成你了 47 | 1 每日好习惯随时更新 lz 啊有没有男女不一样的地方啊肯定有啦私以为男人只要不吸烟少喝酒吃早餐午饭不凑合晚上不暴饮暴食少淫欲多运动就是最好的养生啦那我都做到啊岂不是完美了那个吃豆浆的男生好像不好吧虽然对女人特别好但对男人也没有不好啦别太纠结这些细节了太纠结本身也是问题对吧你已经做的挺不错了就继续保持好习惯吧有空的时候把这些观念传达给更多的人会更加快乐我一直本着养生原则 48 | 0 每日好习惯随时更新 lz 啊有没有男女不一样的地方啊肯定有啦私以为男人只要不吸烟少喝酒吃早餐午饭不凑合晚上不暴饮暴食少淫欲多运动就是最好的养生啦那我都做到啊岂不是完美了那个吃豆浆的男生好像不好吧虽然对女人特别好但对男人也没有不好啦别太纠结这些细节了太纠结本身也是问题对吧你已经做的挺不错了就继续保持好习惯吧有空的时候把这些观念传达给更多的人会更加快乐天蝎的美女啊哈哈周围有天蝎的确实感觉神秘啊你的头像就很神秘我见到别的女生也会打招呼聊天但是不会像女朋友那样对付一个女朋友就已经很累了我是不会去找麻烦脚踩 2 只船太麻烦了有那功夫我去干点自己喜欢的事情朋友在我眼里不分男女但是还是哥们多些不会有什么这个颜那个颜的对女生仅限偶尔聊天打招呼什么的不会短信电话不断如果这样就有问题了我的立场还是讲清楚他不妥协不改那就散自己去想吧想清楚也没机会了 2 个人的生活其实就是妥协找到平衡点偶尔冷他天天对他好对你没好处估计你男朋友还不成熟需要经历痛苦 49 | 1 珠海哪间医院看皮肤科比较有名皮防所看痘痘在紫荆园附近是不是招牌是绿色的字体大大间系转角位咩色唔记得招牌好大个好似系皮肤防疫中心对面是个运动场对吧一直以为那家是美容院治痘痘很出名到七八点还是会很多人但是查地图好像是在紫荆豪庭对面马路那边了 50 | 0 珠海哪间医院看皮肤科比较有名皮防所看痘痘在紫荆园附近是不是招牌是绿色的字体大大间系转角位咩色唔记得招牌好大个好似系皮肤防疫中心对面是个运动场对吧一直以为那家是美容院治痘痘很出名到七八点还是会很多人换成注明来自豆瓣注明应聘职位 51 | 1 客观分析天蝎男纯原创 2010 08 30 15 49 53 joy 天蝎男是很贱根据我跟蝎男交往 1年零 3 个月的经验看对付蝎男要做到以下几点 1 不能太懂事他犯浑的时候你要比他还浑 2 不能总给他好脸他乖的时候就对他温柔他犯 sb 的时候就甭搭理他绝对不能依着他 3 记住他说的话他做的事再吵架的时候用他自己的话回他那强盗逻辑让他哑口无言用他对待你的方式对待他等他质问你的时候告诉他我为什么不能这么对待你 4 表面上顺从他但别心理真顺从不要被他同化了 5 适时撒泼耍赖记住他们贱的很 6 爱自己多一点真心赞很赞同 · · · · 现在刚和蝎子开始唉 · · · · 日后还长着蝎子说过我很要强他很喜欢这是有征服欲吗必须的征服欲啊之后就会对你这种个性挑三拣四再后来就是大加讽刺再再后来就是攻击人格达到虐人的快感慢慢来吧不着急被蝎子爱着的时候还是很美好的他可以给你所有的少女幻想我觉得我完蛋了 · · · · · 52 | 0 客观分析天蝎男纯原创 2010 08 30 15 49 53 joy 天蝎男是很贱根据我跟蝎男交往 1年零 3 个月的经验看对付蝎男要做到以下几点 1 不能太懂事他犯浑的时候你要比他还浑 2 不能总给他好脸他乖的时候就对他温柔他犯 sb 的时候就甭搭理他绝对不能依着他 3 记住他说的话他做的事再吵架的时候用他自己的话回他那强盗逻辑让他哑口无言用他对待你的方式对待他等他质问你的时候告诉他我为什么不能这么对待你 4 表面上顺从他但别心理真顺从不要被他同化了 5 适时撒泼耍赖记住他们贱的很 6 爱自己多一点真心赞很赞同 · · · · 现在刚和蝎子开始唉 · · · · 日后还长着蝎子说过我很要强他很喜欢这是有征服欲吗必须的征服欲啊之后就会对你这种个性挑三拣四再后来就是大加讽刺再再后来就是攻击人格达到虐人的快感慢慢来吧不着急被蝎子爱着的时候还是很美好的他可以给你所有的少女幻想人生中总有过客嘛痛过就好了 53 | 1 如何学好古代汉语在下的个人体会学古文最快的方法就是背书时间紧张起码要背四书因为在过去所有读书的人都是背了四书的包括亲手革掉古文命的五四新文学领袖们以及大部分老一辈无产阶级革命家背书的具体方法首先明确每个字的读音再把它大声地念若干遍你会立刻发现汉语的音乐性很快就会喜欢老祖宗编曲的古老歌谣了有一点像疯狂英语但其实疯狂英语是借鉴老办法的你好啊我想问下初接触古文古籍里面那些不认识的字的读音怎样读呀去哪查呢十分感谢有很多办法给你个最快的 
url 54 | 0 如何学好古代汉语在下的个人体会学古文最快的方法就是背书时间紧张起码要背四书因为在过去所有读书的人都是背了四书的包括亲手革掉古文命的五四新文学领袖们以及大部分老一辈无产阶级革命家背书的具体方法首先明确每个字的读音再把它大声地念若干遍你会立刻发现汉语的音乐性很快就会喜欢老祖宗编曲的古老歌谣了有一点像疯狂英语但其实疯狂英语是借鉴老办法的你好啊我想问下初接触古文古籍里面那些不认识的字的读音怎样读呀去哪查呢十分感谢啊啊啊啊啊那都是刷到眼瞎才挑出来的精品啊 55 | 1 业余建筑工作室或者王澍老师的联系方式 13957176938 但是可惜他不接电话不回短信不如找他夫人陆文宇好说话这个是他的号码么没人接 56 | 0 业余建筑工作室或者王澍老师的联系方式 13957176938 但是可惜他不接电话不回短信不如找他夫人陆文宇好说话这个是他的号码么 ca113105 只有 80f 和 85f 拼价 470 57 | 1 找人一起合作写推理小说我还想问下帅楼主是同志吗我总觉得你不是同志会很可惜 eautiful girls q 58 | 0 找人一起合作写推理小说我还想问下帅楼主是同志吗我总觉得你不是同志会很可惜 eautiful girls 那我还是找下一家 59 | 1 月亮摩羯的使命是征服日狮月摩日狮月摩 1 日狮月摩上升天蝎哎外表冷脸没有表情内心火热冲动但大脑又控制自己要冷静征服好生纠结 60 | 0 月亮摩羯的使命是征服日狮月摩日狮月摩 1 日狮月摩上升天蝎哎外表冷脸没有表情内心火热冲动但大脑又控制自己要冷静征服没胆量了吧 61 | 1 假如晓霞活着少平会不会跟她结婚别说了我八年前看的现在想想看到晓霞死那一段的时候哭坏了好巧啊我也是 8年前看的这本书好巧有缘 62 | 0 假如晓霞活着少平会不会跟她结婚别说了我八年前看的现在想想看到晓霞死那一段的时候哭坏了好巧啊我也是 8年前看的这本书我会拼命哒失兄我年底来给你汇报好消息 63 | 1 测测自己生命的颜色宁佳心是蓝色张小伟是黑色呵呵呵呵呵呵呵呵呵你也看了阳光姐姐的单翼天使不孤单啊我也看了我有本好好看啊我也有好看点一百个赞好好看啊 64 | 0 测测自己生命的颜色宁佳心是蓝色张小伟是黑色呵呵呵呵呵呵呵呵呵你也看了阳光姐姐的单翼天使不孤单啊我也看了我有本好好看啊我也有好看点一百个赞不过只是时间上不顺的话还好说 oo 65 | 1 我在北京我要学意大利语请大家指点好的学校我刚学的意大利语在新东方学的还不错的就是课程排的有些紧口语听力得自己练语法讲的非常详细意大利语也只有在北京的新东方有吧都是中教我不知道别的新东方有米有我是在北京学的我们这个班是一个中教一个外教中教主要负责语法外教主要负责语音感觉怎么样啊课程安排的非常紧新东方的学费比较低课时少口语和听力需要课下自己苦练没有上课时间练习但是语法讲的非常好最好不要上集中的课程周末班比较好每周 2 天课每天 6 小时这就差不多是这一周该消化的了老师讲的挺好的每个班 20 个人左右上课都顾及的到只要用心学把每周讲的复习好跟上没问题我初级学完以后还打算在新东方报中级班 66 | 0 我在北京我要学意大利语请大家指点好的学校我刚学的意大利语在新东方学的还不错的就是课程排的有些紧口语听力得自己练语法讲的非常详细意大利语也只有在北京的新东方有吧都是中教我不知道别的新东方有米有我是在北京学的我们这个班是一个中教一个外教中教主要负责语法外教主要负责语音感觉怎么样啊蕾丝少了像泳衣 67 | 1 征集征集不和婆婆住的十大理由我跟我老公外加我婆婆和大姑子去给我老公买裤子我婆婆就当我不存在给她亲儿子选裤子还亲自给拽拽裤腿提提裆真的是托着啦还问他裆部紧不紧我简直是目瞪口呆旁边碰过来碰过去超级呕腥哎还有现在不是夏天吗他妈在家穿睡衣睡衣蛮透的也不知道穿胸罩估计是觉得没胸穿不穿无所谓吧就看两颗大粒的黑葡萄在我面前晃来晃去我就不相信他儿子看不到后来有一次在饭桌上我看了他妈的胸又看看我老公我老公看我眼神异常才发现叫他妈去换衣服去他妈可听他儿子话了就去换了衣服不过这次换了下次还照就然后还有他妈上午睡回笼觉的时候房间门也不关两个房间门是门对门我一开门就看他妈穿个透透的破大裤头翘个屁股在睡觉我好几次都悄悄的关上她的门了那我老公先起不也看到了他儿子也不小了也不知道注意点都说儿大防母女大防 f 父父母就没点自觉吗而且你老公不会觉得不自在估计也都习惯了不过我看到你这么说都恶心想起月子里老公晚上洗完澡穿着内裤和婆婆说话我都觉得别扭 68 | 0 
征集征集不和婆婆住的十大理由我跟我老公外加我婆婆和大姑子去给我老公买裤子我婆婆就当我不存在给她亲儿子选裤子还亲自给拽拽裤腿提提裆真的是托着啦还问他裆部紧不紧我简直是目瞪口呆旁边碰过来碰过去超级呕腥哎还有现在不是夏天吗他妈在家穿睡衣睡衣蛮透的也不知道穿胸罩估计是觉得没胸穿不穿无所谓吧就看两颗大粒的黑葡萄在我面前晃来晃去我就不相信他儿子看不到后来有一次在饭桌上我看了他妈的胸又看看我老公我老公看我眼神异常才发现叫他妈去换衣服去他妈可听他儿子话了就去换了衣服不过这次换了下次还照就然后还有他妈上午睡回笼觉的时候房间门也不关两个房间门是门对门我一开门就看他妈穿个透透的破大裤头翘个屁股在睡觉我好几次都悄悄的关上她的门了那我老公先起不也看到了他儿子也不小了也不知道注意点我的最爱 69 | 1 福州宠物医院哪家好呢我的喵在善化坊那边的精灵仁爱医院包括绝育驱虫咳嗽之类的东西挖坟帖你还回已经有人挖坟了我也跟着挖呗 70 | 0 福州宠物医院哪家好呢我的喵在善化坊那边的精灵仁爱医院包括绝育驱虫咳嗽之类的东西挖坟帖你还回哦哦这样那只能祝你好运咯 71 | 1 日本建筑师事务所官网收集天津银河广场的图书馆是日本设计师设计的吗是山本理显我说呢谢啦 72 | 0 日本建筑师事务所官网收集天津银河广场的图书馆是日本设计师设计的吗是山本理显我和我 ex 不是是高中同学后来异地了 73 | 1 在景区做过两年导游解答各类关于乌镇的问题到了桐乡站怎么到乌镇乘坐 k282 5 元人终点站就是乌镇汽车站时刻表 url 公交么嗯公交车好的谢谢啦不客气哈我在乌镇住两天会不会有点多余住一天两天都可以啊你大概的时候行程安排有吗改时间了住 21 22 号两天一天西栅一天东栅那 ok 你慢慢逛好了 74 | 0 在景区做过两年导游解答各类关于乌镇的问题到了桐乡站怎么到乌镇乘坐 k282 5 元人终点站就是乌镇汽车站时刻表 url 公交么嗯公交车好的谢谢啦不客气哈我在乌镇住两天会不会有点多余住一天两天都可以啊你大概的时候行程安排有吗改时间了住 21 22 号两天一天西栅一天东栅 0 0 好吧语文没学好改之 75 | 1 原创 tal ben shahar 三本英文著作下载地址 _url_ 以上是他三本重要著作的 pdf 下载方式希望对大家有帮助这个网站支持免费下载大家可以自主查询更多积极心理学相关的文献钓鱼链接不要在這裡瞎留言危言聳聽 76 | 0 原创 tal ben shahar 三本英文著作下载地址 _url_ 以上是他三本重要著作的 pdf 下载方式希望对大家有帮助这个网站支持免费下载大家可以自主查询更多积极心理学相关的文献钓鱼链接现在就去暖会不会太早 77 | 1 性与福气 2 一个曾经很阳光的香港男星由于其糜烂的私生活而导致前程尽毁陈冠希他还有很多福报过去世修的福报很大你如何得知啊貌似他是基督徒 78 | 0 性与福气 2 一个曾经很阳光的香港男星由于其糜烂的私生活而导致前程尽毁陈冠希他还有很多福报过去世修的福报很大呵呵谢谢你晓同学因为现在陈晓红了总有些人到处黑他找不到黑料就黑他长相没辨识度神马的真心看了无语你也是学表演的吧祝福你的星路一路顺利 79 | 1 每个人进来说一个西海岸的元素好么雪茄算么黑暗交易黑暗交易就是那种匪帮盗卖军火那种的神马玩意儿哦哦了解 80 | 0 每个人进来说一个西海岸的元素好么雪茄算么黑暗交易黑暗交易就是那种匪帮盗卖军火那种的神马玩意儿我上升天蝎 81 | 1 我喜欢犯贱怎么办典型的犯贱你就想找虐就冲你那句不缺女人我就对你咬牙切齿像你这种不负责任的男人祝你将来被伤的很深顺带一句感觉楼主是实在人说话句句中肯确实很贱的说幸好你还有自知之明我讨厌把女人不当一回事的男人我妈是女人我将来孩他娘也是女人单凭这点不说高尚的男女平等的社会论调不谈高端的人性哲学尊重女性朋友没啥话说咱俩有点跑题了露珠应该表达的是现在不在乎找到找不到女朋友用词不当语境使然配合露珠本篇帖子的本意是你我在此自作多情罢了 82 | 0 我喜欢犯贱怎么办典型的犯贱你就想找虐就冲你那句不缺女人我就对你咬牙切齿像你这种不负责任的男人祝你将来被伤的很深顺带一句感觉楼主是实在人说话句句中肯确实很贱的说幸好你还有自知之明我讨厌把女人不当一回事的男人因为认识一女的也是这配置比凤姐和丽娟还流弊没想到一猜你就是这配置忒落俗套了饭去恁继续高傲 ps 你的腻秤很有趣 83 | 1 同学们我们来总结一下高中的教过你的老师吧高一高二高三 ing 语文唐远霞数学马静英语袁秦物理邓战军化学梁光斐生物廖慧历史杨秋萍政治杨耘地理冷咏松哈哈哈哈我们生物历史政治都一样诶好喜欢啦们哦廖慧就是太瘦哦身体不好有点造孽最喜欢上秋萍姐诶课每次就像时装表演一样杨耘是我高一高二班主任蛮好诶是勒不晓得廖慧现在生崽没得杨姨妈肯定好么你是 
11 届勒嗯嗯我 13 班诶你哪班诶嘛 26 勒你叫乃名字嘛我认到你们班王 y 84 | 0 同学们我们来总结一下高中的教过你的老师吧高一高二高三 ing 语文唐远霞数学马静英语袁秦物理邓战军化学梁光斐生物廖慧历史杨秋萍政治杨耘地理冷咏松哈哈哈哈我们生物历史政治都一样诶好喜欢啦们哦廖慧就是太瘦哦身体不好有点造孽最喜欢上秋萍姐诶课每次就像时装表演一样杨耘是我高一高二班主任蛮好诶是勒不晓得廖慧现在生崽没得杨姨妈肯定好么你是 11 届勒嗯嗯我 13 班诶你哪班诶嘛嗯嗯其实对于我来说无所谓但我征求了目前住的几位 gg 的意见他们不太愿意哟真的很抱歉希望你们能找到合适的房子 85 | 1 3 16 更新 wow 小白经典事件直播贴这是谁的一铲子这帖子时不时就有人挖上来的多大仇系列我一直以为你是从 2x 版本开始玩的我不觉得有什么诶挖坟人的动机我都没所谓了因为谁都是从小白过来的啊看看这个我反而觉得自己魔兽没白玩我是 3 13 后期的玩家了遇到了比较好的基友和团队所以现在能打打还看得过去的进度 fk 驾鹤西去删号自焚的只能在网吧里摆摆老资格嘲讽下 90后小朋友了哈哈哈哈哈真是妥艳你知道战士盾反要切姿态吗我们那时候惩戒打架要用一级命令生存射击猎远程射死布衣近战砍死战士贼的年代你不知道吧真正高端贼能用剥皮小刀戳死 r14 战士真是尽现骨灰级玩家的丑恶嘴脸啊哈哈哈哈哈哈哈小朋友们那崇拜的小眼神〜那种对老玩家发自灵魂深处很不得当场脱裤献菊的向往啊〜 86 | 0 3 16 更新 wow 小白经典事件直播贴这是谁的一铲子这帖子时不时就有人挖上来的多大仇系列我一直以为你是从 2x 版本开始玩的我不觉得有什么诶挖坟人的动机我都没所谓了因为谁都是从小白过来的啊看看这个我反而觉得自己魔兽没白玩我是 3 13 后期的玩家了遇到了比较好的基友和团队所以现在能打打还看得过去的进度 fk 驾鹤西去删号自焚的只能在网吧里摆摆老资格嘲讽下 90后小朋友了哈哈哈哈哈真是妥艳晚安晚安 87 | 1 dele 中级考试心得 2 考来真没什么用怎么讲 2 拿了 90 左右后来去准备 selectividad 还是要没日没夜地学要真心想学好一门语言 c 等级以上才算是敲门砖吧累感不爱不过谢谢是这样的除非十分喜欢学的这门语言不然学起来真的很累不用客气 88 | 0 dele 中级考试心得 2 考来真没什么用怎么讲 2 拿了 90 左右后来去准备 selectividad 还是要没日没夜地学要真心想学好一门语言 c 等级以上才算是敲门砖吧累感不爱不过谢谢逐年递增销售额为什么就不能是别人不知道的来买为什么非要是回头客你这什么逻辑 89 | 1 认知疗法重建你的睡眠想法必看组长你有句话说到我心里去了我白天的糟糕感觉并不仅是因为我的失眠很大程度上源自我糟糕的负面想法以前我是因为为不会如何睡觉现在我越来越清楚应该如何睡了睡眠质量也越来好可以很快入睡即使醒了也很快再次睡着可以睡到早上但是就在我这两天觉得我可以和失眠再见可以正常生活的时候我又莫名的出现很多对睡觉怪异的想法比如我怎么就睡着了闭眼睁眼这么长时间就过去了之类这种无里头的想法以至于这想问题晚上又整夜无法入睡不知道是因为想弄明白这些无聊的问题还是因为以后都纠结在这些问题上放不下而睡不着或有这种想法而不敢入睡我对自己真的很无语国庆在家的时候和妈妈一起睡特别好像是有种动力感觉像是她睡了我也要快点睡有点像是较劲我要睡的好所以那 7 天每天都是晚上 9 10 点睡可以立马睡着中途醒了也不再意可以再睡可以睡到早上 6 7 点可是现在一个人睡就会胡思乱想我到底是怎么了真的很烦不推荐这篇文章组长我只是希望你帮我解惑下一边期待睡觉一边对睡觉又带有恐惧这是不是就是害怕失眠的一种表现害怕失眠本身就是一种表现我们讨论的问题如果不脱离表现和各种表象就不会对你有任何帮助 90 | 0 认知疗法重建你的睡眠想法必看组长你有句话说到我心里去了我白天的糟糕感觉并不仅是因为我的失眠很大程度上源自我糟糕的负面想法以前我是因为为不会如何睡觉现在我越来越清楚应该如何睡了睡眠质量也越来好可以很快入睡即使醒了也很快再次睡着可以睡到早上但是就在我这两天觉得我可以和失眠再见可以正常生活的时候我又莫名的出现很多对睡觉怪异的想法比如我怎么就睡着了闭眼睁眼这么长时间就过去了之类这种无里头的想法以至于这想问题晚上又整夜无法入睡不知道是因为想弄明白这些无聊的问题还是因为以后都纠结在这些问题上放不下而睡不着或有这种想法而不敢入睡我对自己真的很无语国庆在家的时候和妈妈一起睡特别好像是有种动力感觉像是她睡了我也要快点睡有点像是较劲我要睡的好所以那 7 天每天都是晚上 9 10 点睡可以立马睡着中途醒了也不再意可以再睡可以睡到早上 6 7 
点可是现在一个人睡就会胡思乱想我到底是怎么了真的很烦不推荐这篇文章组长我只是希望你帮我解惑下一边期待睡觉一边对睡觉又带有恐惧这是不是就是害怕失眠的一种表现他最近的表现的确反常很多时候感觉心思都不在节目里 91 | 1 求助 tactical planning 面试前需要准备什么战术策划数字敏感些高数最好好点 excel 好两年前入行前得帖子结果没去做 tp 去做了策划 p 到变成 planning 我当初面试的时候那人认为我更适合做策划就推荐我去了策划组前后面了三次请问前辈在哪家公司高就能够如此甄选人才 92 | 0 求助 tactical planning 面试前需要准备什么战术策划数字敏感些高数最好好点 excel 好两年前入行前得帖子结果没去做 tp 去做了策划 p 到变成 planning 我当初面试的时候那人认为我更适合做策划就推荐我去了策划组前后面了三次慢慢学吧 93 | 1 大家觉得海贼里头哪些歌好听路飞神曲在空岛我也喜欢他那个小调可爱瞎了我会唱 94 | 0 大家觉得海贼里头哪些歌好听路飞神曲在空岛我也喜欢他那个小调可爱瞎了任何的爱好不良嗜好都有独特的乐趣我安慰我自己 95 | 1 邯郸有哪几个不错的书店 2005年 8月29日 lz 发了这个贴今天是 2011年 8月10日如果他发帖那天和某个姑娘上了床没做好保护那今天他们的小孩儿都要上小学了他妈的幸不辱命组长难的现身啊 96 | 0 邯郸有哪几个不错的书店 2005年 8月29日 lz 发了这个贴今天是 2011年 8月10日如果他发帖那天和某个姑娘上了床没做好保护那今天他们的小孩儿都要上小学了他妈的幸不辱命哦 thx anyway 97 | 1 个人关于直觉型人学外语的感想我觉得一切都应该是我执行力不够影响的英语背单词什么的是这样我也觉得很痛苦但是难受的读完一篇生僻文章再把每个单词查出来什么意思效果很好虽然耗时久 · · · 但是一个个背我根本不可能坚持 · · · 同感强 n 型的人拿着单词本背单词是种折磨我更喜欢阅读先不查生单词全部读下来再把所有的生单词查出来然后把文章再看一两遍理解意思仍然不记单词然后再读另一篇文章每天这样坚持会发现很多生单词是重复出现的即高频词汇或者说是你以前查出来过有印象但是不记得意思的单词反复出现的单词重点记忆这时候就是记忆强化和重复的过程坚持每天阅读下去甚至不用刻意记单词会发现需要查出来的生单词越来越少词汇自然就积累了阅读文章实际上是生单词在语境中的应用理解性记忆远好于死记硬背我最近逼自己用拓词机械重复但是效果还不错双管齐下吧其实如果我现在小学不着急的话应该会好好疼爱我的英英词典柯林斯很不错恩顺便打个广告自从有道词典添加了柯林斯词典后基本告别其他所有词典用有道柯林斯词典很方便例句和解释都非常实用且易于理解长期用有道词典手机版的路过 98 | 0 个人关于直觉型人学外语的感想我觉得一切都应该是我执行力不够影响的英语背单词什么的是这样我也觉得很痛苦但是难受的读完一篇生僻文章再把每个单词查出来什么意思效果很好虽然耗时久 · · · 但是一个个背我根本不可能坚持 · · · 同感强 n 型的人拿着单词本背单词是种折磨我更喜欢阅读先不查生单词全部读下来再把所有的生单词查出来然后把文章再看一两遍理解意思仍然不记单词然后再读另一篇文章每天这样坚持会发现很多生单词是重复出现的即高频词汇或者说是你以前查出来过有印象但是不记得意思的单词反复出现的单词重点记忆这时候就是记忆强化和重复的过程坚持每天阅读下去甚至不用刻意记单词会发现需要查出来的生单词越来越少词汇自然就积累了阅读文章实际上是生单词在语境中的应用理解性记忆远好于死记硬背我最近逼自己用拓词机械重复但是效果还不错双管齐下吧其实如果我现在小学不着急的话应该会好好疼爱我的英英词典柯林斯很不错对滴转租合同 99 | 1 羊男蝎女修成正果了可怜的我正在分手期不过我觉得他是想冷静冷静还有回转的余地他说他很忙但是分手 6 天了好像还是每晚都有通话态度也很客气然后有一天我忍不住想复合他说现在答应复合怕以后又反复不答应又觉得没给两个人机会想 51 放假的时候想想我最近极度痛苦挣扎中在找回自己的生活一个人在异国他乡好崩溃啊沾 lz 的喜气多多多多传给我点吧谢谢楼主好希望我们能复合然后修成正果啊我今天早晨给他发了条信息说早晨好心情愉快然后他说妹子嘴很甜嘛好好加油哦昨晚还说今晚打电话分手那天很奇怪就是他没回我电话我生气了猛打他电话结果他说做朋友吧有人给分析不现在发展如何了愿一切都好爱嘛并不是要分出个输赢的坚持坦诚和包容才能让爱得以延续和好了不过长路漫漫继续坚持为了爱坚持也是值得的但要有自己的原则祝好运 100 | 0 羊男蝎女修成正果了可怜的我正在分手期不过我觉得他是想冷静冷静还有回转的余地他说他很忙但是分手 6 
天了好像还是每晚都有通话态度也很客气然后有一天我忍不住想复合他说现在答应复合怕以后又反复不答应又觉得没给两个人机会想 51 放假的时候想想我最近极度痛苦挣扎中在找回自己的生活一个人在异国他乡好崩溃啊沾 lz 的喜气多多多多传给我点吧谢谢楼主好希望我们能复合然后修成正果啊我今天早晨给他发了条信息说早晨好心情愉快然后他说妹子嘴很甜嘛好好加油哦昨晚还说今晚打电话分手那天很奇怪就是他没回我电话我生气了猛打他电话结果他说做朋友吧有人给分析不现在发展如何了愿一切都好爱嘛并不是要分出个输赢的坚持坦诚和包容才能让爱得以延续和好了不过长路漫漫继续坚持好谢谢 101 | 1 羊男蝎女修成正果了我现在也这么想前段时间该折腾的挽回失望拉黑断联可是我还是放不下现在准备继续努力如果特别喜欢就别放手退回朋友的位置重新开始对他好不要给他压力不要对他提出要求去温暖他羊羊很心软的你对他好他会记得的 ps 我们准备结婚了希望带给你点喜气恭喜 102 | 0 羊男蝎女修成正果了我现在也这么想前段时间该折腾的挽回失望拉黑断联可是我还是放不下现在准备继续努力如果特别喜欢就别放手退回朋友的位置重新开始对他好不要给他压力不要对他提出要求去温暖他羊羊很心软的你对他好他会记得的 ps 我们准备结婚了希望带给你点喜气周末睡觉觉得就是浪费时间啊 103 | 1 有养秋田的嘛萨摩耶金毛拉布拉多松狮敢在俗气点吗我养秋田我爱秋田我觉得最俗的是泰迪人人养就像那最炫民族风一样一大众就俗还有泰迪那疯闹的小性格烦没错泰迪巨爱叫跟吉娃娃有一拼了是啊小狗都凶可家里没条件养大个的秋田太大啦忠犬八公人家在院子有自己的房子的我就整天看画的饼解解馋你可以养一只小柴啊大小跟雪纳瑞差不多我觉得柴犬像是小版的秋田田是的我也发现了打算养个小柴犬不过我真的不喜欢凶闹的狗狗我喜欢不言不语的不知这个柴犬个头小了是不是也比秋田要闹呢秋田不闹跟人特别友好而且也不爱叫很聪明但是特别好斗我家秋田是小男孩爱跟别的公狗打架母狗怎么欺负他都没事不知道柴犬是不是也如此 104 | 0 有养秋田的嘛萨摩耶金毛拉布拉多松狮敢在俗气点吗我养秋田我爱秋田我觉得最俗的是泰迪人人养就像那最炫民族风一样一大众就俗还有泰迪那疯闹的小性格烦没错泰迪巨爱叫跟吉娃娃有一拼了是啊小狗都凶可家里没条件养大个的秋田太大啦忠犬八公人家在院子有自己的房子的我就整天看画的饼解解馋你可以养一只小柴啊大小跟雪纳瑞差不多我觉得柴犬像是小版的秋田田是的我也发现了打算养个小柴犬不过我真的不喜欢凶闹的狗狗我喜欢不言不语的不知这个柴犬个头小了是不是也比秋田要闹呢嘿嘿多交流 105 | 1 下巴长痘的同志们来这里集合了原因经验方法西医终于暂时压制住了痘子但是起痘的原因没有消除 lz 的痘子还是此起彼伏包括下巴 lz 决定调整生活习惯来慢慢的调理身体不在操之过急 1 早睡每天 10 点半之前雷打不动必须躺在床上早睡自然能早起上班迟到次数锐减哈哈哈 2 健康饮食饮不喝灌装饮料冰激凌咖啡上火酒精多喝开水花草茶红绿茶食营养均衡三餐合理搭配经常吃应季水果少吃甚至不吃垃圾零食辛辣油腻等重口味食品根据体质尽量搭配一些调理食物 3 运动不用多说了 4 冬季注意保暖减少寒性体质的诱因泡脚 5 合理护肤针对皮肤的状态选择合适的护肤品不盲目相信网络推荐产品坚持一段时间后身体一定会有变化 lz 你现在脸上已经不长了麼我也是体寒怕冷的很连夏天都很少出汗也是抗痘很久很久吃了半年的中药脸上其他地方的痘痘都不长了唯独下巴啊下巴啊我看别人吃了月见草胶囊管用我吃不管用啊我 ms 特别容易推迟有时候甚至推迟 2 礼拜你现在下巴还有痘痘吗 106 | 0 下巴长痘的同志们来这里集合了原因经验方法西医终于暂时压制住了痘子但是起痘的原因没有消除 lz 的痘子还是此起彼伏包括下巴 lz 决定调整生活习惯来慢慢的调理身体不在操之过急 1 早睡每天 10 点半之前雷打不动必须躺在床上早睡自然能早起上班迟到次数锐减哈哈哈 2 健康饮食饮不喝灌装饮料冰激凌咖啡上火酒精多喝开水花草茶红绿茶食营养均衡三餐合理搭配经常吃应季水果少吃甚至不吃垃圾零食辛辣油腻等重口味食品根据体质尽量搭配一些调理食物 3 运动不用多说了 4 冬季注意保暖减少寒性体质的诱因泡脚 5 合理护肤针对皮肤的状态选择合适的护肤品不盲目相信网络推荐产品坚持一段时间后身体一定会有变化 lz 你现在脸上已经不长了麼我也是体寒怕冷的很连夏天都很少出汗也是抗痘很久很久吃了半年的中药脸上其他地方的痘痘都不长了唯独下巴啊下巴啊我看别人吃了月见草胶囊管用我吃不管用啊我 ms 特别容易推迟有时候甚至推迟 2 礼拜同学 logo 哪能用 ps 做啊要用矢量的好么好吧就算你用 ps 做那个不是等高线做出来的 107 | 1 
十月份适合到哪里去旅游九寨九寨我是打算坐飞机去的但是一看机票钱好多好多银子我也是算了路费一再犹豫坐大巴我实在是受不了那个罪我可以但时间不允许我身在武汉啊要先到成都再转车折腾都得 12 天我在郑州可以坐动车到西安然后直飞黄龙所以我这次就不去九寨了我回郑州囧 108 | 0 十月份适合到哪里去旅游九寨九寨我是打算坐飞机去的但是一看机票钱好多好多银子我也是算了路费一再犹豫坐大巴我实在是受不了那个罪我可以但时间不允许我身在武汉啊要先到成都再转车折腾都得 12 天我在郑州可以坐动车到西安然后直飞黄龙那不是我负责的事情了 109 | 1 寻找无名指上有痣的 girl 我有痣的位置说出来好低俗好想知道在哪里你是故意的吗那个谁我不是故意的我是有意的我就是问问 · · · · 怎么地 · · · · 这位朋友请到拐角来我展示给你看 110 | 0 寻找无名指上有痣的 girl 我有痣的位置说出来好低俗好想知道在哪里你是故意的吗那个谁我不是故意的我是有意的我就是问问 · · · · 怎么地 · · · · 可以阿亲你豆油我要的东西我给亲算优惠价 111 | 1 愤怒把一个男人捣碎成很多男孩巴列霍作者的意思是一般人的愤怒是负情绪对于穷人来讲是一种正面力量他是想说愤怒的无力吧百度了一下说他一生穷困潦倒又思想激进大概有很多无力的时候原来如此 112 | 0 愤怒把一个男人捣碎成很多男孩巴列霍作者的意思是一般人的愤怒是负情绪对于穷人来讲是一种正面力量他是想说愤怒的无力吧百度了一下说他一生穷困潦倒又思想激进大概有很多无力的时候呵呵呵 113 | 1 最近买的几支试管跟哪家买的试管装 1 有地址吗豆油下 thx 微笑家豆豆家博物馆的猫咪家啥的 114 | 0 最近买的几支试管跟哪家买的试管装 1 有地址吗豆油下 thx 这个真心不愿意啊不是很喜欢锻炼但是身体还可以紫金山一口气爬上去摸问题虽然也不高 --------------------------------------------------------------------------------
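The rows of `train.sample` above come in pairs: a line labelled `1` and a line labelled `0` that share the same conversation context and differ only in the final candidate response. A minimal sketch of reading one such row follows. It assumes the fields are tab-separated, with the label first and the candidate response last; the delimiter is not visible in this listing, so treat that as an assumption. `parse_sample_line` is a hypothetical helper, not part of the repository.

```python
# -*- coding: utf-8 -*-
# Hypothetical helper, not part of the repository. Assumes each row is
# "label<TAB>utterance_1<TAB>...<TAB>utterance_n<TAB>response".


def parse_sample_line(line, sep="\t"):
    """Split one row into (label, context_utterances, response)."""
    fields = line.rstrip("\n").split(sep)
    if len(fields) < 3:
        raise ValueError("expected a label, at least one context utterance, and a response")
    label = int(fields[0])      # 1 = correct response, 0 = negative sample
    context = fields[1:-1]      # every utterance before the candidate response
    response = fields[-1]       # the candidate response being scored
    return label, context, response


if __name__ == "__main__":
    example = "1\tfirst utterance\tsecond utterance\tcandidate reply"
    label, context, response = parse_sample_line(example)
    print("label=%d, turns=%d, response=%s" % (label, len(context), response))
```

If the actual file uses a different delimiter, pass it through the `sep` argument.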