└── README.md /README.md: -------------------------------------------------------------------------------- 1 | ### Startups 2 | - [机器学习、深度学习、计算机视觉、大数据创业公司 - Startups in AI](https://github.com/lipiji/AIStartups) 3 | 4 | ## Deep Reinforcement Learning 5 | - David Silver. "[Tutorial: Deep Reinforcement Learning](http://icml.cc/2016/tutorials/deep_rl_tutorial.pdf)." ICML 2016. 6 | - David Silver’s course. "[Reinforcement Learning](http://www0.cs.ucl.ac.uk/staff/D.Silver/web/Teaching.html)". 2015. 7 | - Bahdanau, Dzmitry, Philemon Brakel, Kelvin Xu, Anirudh Goyal, Ryan Lowe, Joelle Pineau, Aaron Courville, and Yoshua Bengio. "[An Actor-Critic Algorithm for Sequence Prediction](http://arxiv.org/abs/1607.07086)." arXiv preprint arXiv:1607.07086 (2016). 8 | - Li, Jiwei, Will Monroe, Alan Ritter, and Dan Jurafsky. "[Deep Reinforcement Learning for Dialogue Generation](http://arxiv.org/abs/1606.01541)." arXiv preprint arXiv:1606.01541 (2016). 9 | - Pathak, Deepak, Pulkit Agrawal, Alexei A. Efros, and Trevor Darrell. "[Curiosity-driven Exploration by Self-supervised Prediction](https://arxiv.org/abs/1705.05363)." arXiv preprint arXiv:1705.05363 (2017). 10 | - Keneshloo, Yaser, Tian Shi, Chandan K. Reddy, and Naren Ramakrishnan. "[Deep Reinforcement Learning For Sequence to Sequence Models](https://arxiv.org/abs/1805.09461)." arXiv preprint arXiv:1805.09461 (2018). 11 | 12 | ## Dialogue System 13 | - Jiang, Shaojie, and Maarten de Rijke. "[Why are Sequence-to-Sequence Models So Dull?](https://staff.fnwi.uva.nl/m.derijke/wp-content/papercite-data/pdf/jiang-why-2018.pdf)." report, 2018. 14 | - Eric Chu, Prashanth Vijayaraghavan, Deb Roy. "[Learning Personas from Dialogue with Attentive Memory Networks](https://arxiv.org/abs/1810.08717)." EMNLP (2018). 15 | - Ruizhe Li, Chenghua Lin, Matthew Collinson, Xiao Li, Guanyi Chen. "[A Dual-Attention Hierarchical Recurrent Neural Network for Dialogue Act Classification](https://arxiv.org/abs/1810.09154)." arXiv:1810.09154 (2018). 16 | 17 | #### Task-Oriented Dialogue 18 | - Wen, Tsung-Hsien, David Vandyke, Nikola Mrksic, Milica Gasic, Lina M. Rojas-Barahona, Pei-Hao Su, Stefan Ultes, and Steve Young. "[A network-based end-to-end trainable task-oriented dialogue system](https://arxiv.org/abs/1604.04562)." arXiv preprint arXiv:1604.04562 (2016). 19 | - Li, Xiujun, Yun-Nung Chen, Lihong Li, Jianfeng Gao, and Asli Celikyilmaz. "[End-to-end task-completion neural dialogue systems](https://arxiv.org/abs/1703.01008)." arXiv preprint arXiv:1703.01008 (2017). 20 | - Li, Xiujun, Zachary C. Lipton, Bhuwan Dhingra, Lihong Li, Jianfeng Gao, and Yun-Nung Chen. "[A user simulator for task-completion dialogues](https://arxiv.org/abs/1612.05688)." arXiv preprint arXiv:1612.05688 (2016). 21 | - Yan, Zhao, Nan Duan, Peng Chen, Ming Zhou, Jianshe Zhou, and Zhoujun Li. "[Building Task-Oriented Dialogue Systems for Online Shopping](http://www.aaai.org/ocs/index.php/AAAI/AAAI17/paper/viewPaper/14261)." In AAAI, pp. 4618-4626. 2017. 22 | - Peng, Baolin, Xiujun Li, Jianfeng Gao, Jingjing Liu, and Kam-Fai Wong. "[Deep Dyna-Q: Integrating Planning for Task-Completion Dialogue Policy Learning](http://www.aclweb.org/anthology/P18-1203)." ACL, vol. 1, pp. 2182-2192. 2018. 23 | - Janarthanan Rajendran, Jatin Ganhotra, Satinder Singh, Lazaros Polymenakos. "[Learning End-to-End Goal-Oriented Dialog with Multiple Answers](https://arxiv.org/abs/1808.09996)." arXiv preprint arXiv:1808.09996 (2018). 24 | 25 | ## Text Generation 26 | - Rennie, Steven J., Etienne Marcheret, Youssef Mroueh, Jarret Ross, and Vaibhava Goel. "[Self-critical sequence training for image captioning](https://arxiv.org/abs/1612.00563)." arXiv preprint arXiv:1612.00563 (2016). 27 | - Lin, Kevin, Dianqi Li, Xiaodong He, Zhengyou Zhang, and Ming-Ting Sun. "[Adversarial Ranking for Language Generation](https://arxiv.org/pdf/1705.11001.pdf)." arXiv preprint arXiv:1705.11001 (2017). 28 | - Zhang, Li, Flood Sung, Feng Liu, Tao Xiang, Shaogang Gong, Yongxin Yang, and Timothy M. Hospedales. "[Actor-Critic Sequence Training for Image Captioning](https://arxiv.org/abs/1706.09601)." arXiv preprint arXiv:1706.09601 (2017). 29 | - Wiseman, Sam, Stuart M. Shieber, and Alexander M. Rush. "[Challenges in Data-to-Document Generation](https://arxiv.org/abs/1707.08052)." arXiv preprint arXiv:1707.08052 (2017). 30 | - Lebret, Rémi, David Grangier, and Michael Auli. "[Neural text generation from structured data with application to the biography domain](https://arxiv.org/abs/1603.07771)." arXiv preprint arXiv:1603.07771 (2016). 31 | - Chisholm, Andrew, Will Radford, and Ben Hachey. "[Learning to generate one-sentence biographies from Wikidata](https://arxiv.org/abs/1702.06235)." arXiv preprint arXiv:1702.06235 (2017). 32 | - Sha, Lei, Lili Mou, Tianyu Liu, Pascal Poupart, Sujian Li, Baobao Chang, and Zhifang Sui. "[Order-Planning Neural Text Generation From Structured Data](https://arxiv.org/abs/1709.00155)." arXiv preprint arXiv:1709.00155 (2017). 33 | - Jiaxian Guo, Sidi Lu, Han Cai, Weinan Zhang, Yong Yu, Jun Wang. "[Long Text Generation via Adversarial Training with Leaked Information](https://arxiv.org/abs/1709.08624)." arXiv preprint arXiv:1709.08624 (2017). 34 | - Guu, Kelvin, Tatsunori B. Hashimoto, Yonatan Oren, and Percy Liang. "[Generating Sentences by Editing Prototypes](https://arxiv.org/abs/1709.08878)." arXiv preprint arXiv:1709.08878 (2017). 35 | - Tianyu Liu, Kexiang Wang, Lei Sha, Baobao Chang, Zhifang Sui. "[Table-to-text Generation by Structure-aware Seq2seq Learnings](https://arxiv.org/abs/1711.09724)." arXiv preprint arXiv:1711.09724 (2017). 36 | - Kahou, Samira Ebrahimi, Adam Atkinson, Vincent Michalski, Akos Kadar, Adam Trischler, and Yoshua Bengio. "[FigureQA: An Annotated Figure Dataset for Visual Reasoning](https://arxiv.org/abs/1710.07300)." arXiv preprint arXiv:1710.07300 (2017). 37 | - Murakami, Soichiro, Akihiko Watanabe, Akira Miyazawa, Keiichi Goshima, Toshihiko Yanase, Hiroya Takamura, and Yusuke Miyao. "[Learning to Generate Market Comments from Stock Prices](http://www.aclweb.org/anthology/P17-1126)." In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), vol. 1, pp. 1374-1384. 2017. 38 | - Mueller, Jonas, David Gifford, and Tommi Jaakkola. "[Sequence to better sequence: continuous revision of combinatorial structures](http://proceedings.mlr.press/v70/mueller17a.html)." In International Conference on Machine Learning, pp. 2536-2544. 2017. 39 | - Peter J. Liu, Mohammad Saleh, Etienne Pot, Ben Goodrich, Ryan Sepassi, Lukasz Kaiser, Noam Shazeer. "[Generating Wikipedia by Summarizing Long Sequences](https://arxiv.org/abs/1801.10198)." ICLR 2018. 40 | - Clark, Elizabeth, Anne Spencer Ross, Chenhao Tan, Yangfeng Ji, and Noah A. Smith. "[Creative Writing with a Machine in the Loop: Case Studies on Slogans and Stories](https://homes.cs.washington.edu/~ansross/papers/iui2018-creativewriting.pdf)." (2018). 41 | - Gehrmann, Sebastian, S. E. A. S. Harvard, Falcon Z. Dai, Henry Elder, and Alexander M. Rush. "[End-to-End Content and Plan Selection for Natural Language Generation](https://scholar.harvard.edu/files/gehrmann/files/e2e-harvardnlp.pdf)." 42 | - Juncen Li, Robin Jia, He He, Percy Liang. "[Delete, Retrieve, Generate: A Simple Approach to Sentiment and Style Transfer](https://arxiv.org/abs/1804.06437)." arXiv:1804.06437 2018. 43 | - Yi Liao, Lidong Bing, Piji Li, Shuming Shi, Wai Lam, Tong Zhang. "[Incorporating Pseudo-Parallel Data for Quantifiable Sequence Editing](https://arxiv.org/abs/1804.07007)." arXiv:1804.07007 2018. 44 | - Xin Wang, Wenhu Chen, Yuan-Fang Wang, William Yang Wang. "[No Metrics Are Perfect: Adversarial Reward Learning for Visual Storytelling](https://arxiv.org/abs/1804.09160)." arXiv:1804.09160 2018. 45 | - Sam Wiseman, Stuart M. Shieber, Alexander M. Rush. "[Learning Neural Templates for Text Generation 46 | ](https://arxiv.org/abs/1808.10122)." arXiv:1808.10122 2018. 47 | 48 | 49 | ## Text Summarization 50 | - Ryang, Seonggi, and Takeshi Abekawa. "[Framework of automatic text summarization using reinforcement learning](http://dl.acm.org/citation.cfm?id=2390980)." In Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, pp. 256-265. Association for Computational Linguistics, 2012. [not neural-based methods] 51 | - King, Ben, Rahul Jha, Tyler Johnson, Vaishnavi Sundararajan, and Clayton Scott. "[Experiments in Automatic Text Summarization Using Deep Neural Networks](http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.459.8775&rep=rep1&type=pdf)." Machine Learning (2011). 52 | - Liu, Yan, Sheng-hua Zhong, and Wenjie Li. "[Query-Oriented Multi-Document Summarization via Unsupervised Deep Learning](http://www.aaai.org/ocs/index.php/AAAI/AAAI12/paper/view/5058/5322)." AAAI. 2012. 53 | - Rioux, Cody, Sadid A. Hasan, and Yllias Chali. "[Fear the REAPER: A System for Automatic Multi-Document Summarization with Reinforcement Learning](http://emnlp2014.org/papers/pdf/EMNLP2014075.pdf)." In EMNLP, pp. 681-690. 2014.[not neural-based methods] 54 | - PadmaPriya, G., and K. Duraiswamy. "[An Approach For Text Summarization Using Deep Learning Algorithm](http://thescipub.com/PDF/jcssp.2014.1.9.pdf)." Journal of Computer Science 10, no. 1 (2013): 1-9. 55 | - Denil, Misha, Alban Demiraj, and Nando de Freitas. "[Extraction of Salient Sentences from Labelled Documents](http://arxiv.org/abs/1412.6815)." arXiv preprint arXiv:1412.6815 (2014). 56 | - Kågebäck, Mikael, et al. "[Extractive summarization using continuous vector space models](http://www.aclweb.org/anthology/W14-1504)." Proceedings of the 2nd Workshop on Continuous Vector Space Models and their Compositionality (CVSC)@ EACL. 2014. 57 | - Denil, Misha, Alban Demiraj, Nal Kalchbrenner, Phil Blunsom, and Nando de Freitas. "[Modelling, Visualising and Summarising Documents with a Single Convolutional Neural Network](http://arxiv.org/abs/1406.3830)." arXiv preprint arXiv:1406.3830 (2014). 58 | - Cao, Ziqiang, Furu Wei, Li Dong, Sujian Li, and Ming Zhou. "[Ranking with Recursive Neural Networks and Its Application to Multi-document Summarization](http://gana.nlsde.buaa.edu.cn/~lidong/aaai15-rec_sentence_ranking.pdf)." (AAAI'2015). 59 | - Fei Liu, Jeffrey Flanigan, Sam Thomson, Norman Sadeh, and Noah A. Smith. "[Toward Abstractive Summarization Using Semantic Representations](http://www.cs.cmu.edu/~nasmith/papers/liu+flanigan+thomson+sadeh+smith.naacl15.pdf)." NAACL 2015 60 | - Wenpeng Yin, Yulong Pei. "Optimizing Sentence Modeling and Selection for Document Summarization." IJCAI 2015 61 | - He, Zhanying, Chun Chen, Jiajun Bu, Can Wang, Lijun Zhang, Deng Cai, and Xiaofei He. "[Document Summarization Based on Data Reconstruction](http://cs.nju.edu.cn/zlj/pdf/AAAI-2012-He.pdf)." In AAAI. 2012. 62 | - Liu, He, Hongliang Yu, and Zhi-Hong Deng. "[Multi-Document Summarization Based on Two-Level Sparse Representation Model](http://www.cis.pku.edu.cn/faculty/system/dengzhihong/papers/AAAI%202015_Multi-Document%20Summarization%20Based%20on%20Two-Level%20Sparse%20Representation%20Model.pdf)." In Twenty-Ninth AAAI Conference on Artificial Intelligence. 2015. 63 | - Jin-ge Yao, Xiaojun Wan, Jianguo Xiao. "[Compressive Document Summarization via Sparse Optimization](http://ijcai.org/Proceedings/15/Papers/198.pdf)." IJCAI 2015 64 | - Piji Li, Lidong Bing, Wai Lam, Hang Li, and Yi Liao. "[Reader-Aware Multi-Document Summarization via Sparse Coding](http://arxiv.org/abs/1504.07324)." IJCAI 2015. 65 | - Lopyrev, Konstantin. "[Generating News Headlines with Recurrent Neural Networks](http://arxiv.org/abs/1512.01712)." arXiv preprint arXiv:1512.01712 (2015). [The first paragraph as document.] 66 | - Alexander M. Rush, Sumit Chopra, Jason Weston. "[A Neural Attention Model for Abstractive Sentence Summarization](http://arxiv.org/abs/1509.00685)." EMNLP 2015. [sentence compression] 67 | - Hu, Baotian, Qingcai Chen, and Fangze Zhu. "[LCSTS: a large scale chinese short text summarization dataset](http://arxiv.org/abs/1506.05865)." arXiv preprint arXiv:1506.05865 (2015). 68 | - Gulcehre, Caglar, Sungjin Ahn, Ramesh Nallapati, Bowen Zhou, and Yoshua Bengio. "[Pointing the Unknown Words](http://arxiv.org/abs/1603.08148)." arXiv preprint arXiv:1603.08148 (2016). 69 | - Nallapati, Ramesh, Bing Xiang, and Bowen Zhou. "[Abstractive Text Summarization Using Sequence-to-Sequence RNNs and Beyond](http://arxiv.org/abs/1602.06023)." arXiv preprint arXiv:1602.06023 (2016). [sentence compression] 70 | - Sumit Chopra, Alexander M. Rush and Michael Auli. "[Abstractive Sentence Summarization with Attentive Recurrent Neural Networks](http://harvardnlp.github.io/papers/naacl16_summary.pdf)" NAACL 2016. 71 | - Jiatao Gu, Zhengdong Lu, Hang Li, Victor O.K. Li. "[Incorporating Copying Mechanism in Sequence-to-Sequence Learning](http://arxiv.org/abs/1603.06393)." ACL. (2016) 72 | - Jianpeng Cheng, Mirella Lapata. "[Neural Summarization by Extracting Sentences and Words](http://arxiv.org/abs/1603.07252)". ACL. (2016) 73 | - Zhang, Jianmin, Jin-ge Yao, and Xiaojun Wan. "[Toward constructing sports news from live text commentary](http://www.icst.pku.edu.cn/lcwm/wanxj/files/acl16_sports.pdf)." In Proceedings of ACL. 2016. 74 | - Ziqiang Cao, Wenjie Li, Sujian Li, Furu Wei. "[AttSum: Joint Learning of Focusing and Summarization with Neural Attention](http://arxiv.org/abs/1604.00125)". arXiv:1604.00125 (2016) 75 | - Ayana, Shiqi Shen, Zhiyuan Liu, Maosong Sun. "[Neural Headline Generation with Sentence-wise Optimization](http://arxiv.org/abs/1604.01904)". arXiv:1604.01904 (2016) 76 | - Kikuchi, Yuta, Graham Neubig, Ryohei Sasano, Hiroya Takamura, and Manabu Okumura. "[Controlling Output Length in Neural Encoder-Decoders](https://arxiv.org/abs/1609.09552)." arXiv preprint arXiv:1609.09552 (2016). 77 | - Qian Chen, Xiaodan Zhu, Zhenhua Ling, Si Wei and Hui Jiang. "[Distraction-Based Neural Networks for Document Summarization](https://arxiv.org/abs/1610.08462)." IJCAI 2016. 78 | - Wang, Lu, and Wang Ling. "[Neural Network-Based Abstract Generation for Opinions and Arguments](http://www.ccs.neu.edu/home/luwang/papers/NAACL2016.pdf)." NAACL 2016. 79 | - Yishu Miao, Phil Blunsom. "[Language as a Latent Variable: Discrete Generative Models for Sentence Compression](http://arxiv.org/abs/1609.07317)." EMNLP 2016. 80 | - Takase, Sho, Jun Suzuki, Naoaki Okazaki, Tsutomu Hirao, and Masaaki Nagata. "[Neural headline generation on abstract meaning representation](https://www.aclweb.org/anthology/D/D16/D16-1112.pdf)." EMNLP, pp. 1054-1059. 2016. 81 | - Hongya Song, Zhaochun Ren, Piji Li, Shangsong Liang, Jun Ma, and Maarten de Rijke. [Summarizing Answers in Non-Factoid Community Question-Answering](http://dl.acm.org/citation.cfm?id=3018704). In WSDM 2017: The 10th International Conference on Web Search and Data Mining, 2017. 82 | - Wenyuan Zeng, Wenjie Luo, Sanja Fidler, Raquel Urtasun. "[Efficient Summarization with Read-Again and Copy Mechanism](https://arxiv.org/abs/1611.03382)." arXiv preprint arXiv:1611.03382 (2016). 83 | - Piji Li, Zihao Wang, Wai Lam, Zhaochun Ren, Lidong Bing. "[Salience Estimation via Variational Auto-Encoders for Multi-Document Summarization](https://aaai.org/ocs/index.php/AAAI/AAAI17/paper/view/14613)". In AAAI, 2017. 84 | - Ramesh Nallapati, Feifei Zhai, Bowen Zhou. [SummaRuNNer: A Recurrent Neural Network based Sequence Model for Extractive Summarization of Documents](https://arxiv.org/abs/1611.04230). In AAAI, 2017. 85 | - Ramesh Nallapati, Bowen Zhou, Mingbo Ma. "[Classify or Select: Neural Architectures for Extractive Document Summarization](https://arxiv.org/abs/1611.04244)." arXiv preprint arXiv:1611.04244 (2016). 86 | - Suzuki, Jun, and Masaaki Nagata. "[Cutting-off Redundant Repeating Generations for Neural Abstractive Summarization](http://www.aclweb.org/anthology/E17-2047)." EACL 2017 (2017): 291. 87 | - Jiwei Tan and Xiaojun Wan. [Abstractive Document Summarization with a Graph-Based Attentional Neural Model](). ACL, 2017. 88 | - Preksha Nema, Mitesh M. Khapra, Balaraman Ravindran and Anirban Laha. [Diversity driven attention model for query-based abstractive summarization](). ACL,2017 89 | - Abigail See, Peter J. Liu and Christopher D. Manning. [Get To The Point: Summarization with Pointer-Generator Networks](https://arxiv.org/abs/1704.04368). ACL, 2017. 90 | - Qingyu Zhou, Nan Yang, Furu Wei and Ming Zhou. [Selective Encoding for Abstractive Sentence Summarization](https://arxiv.org/abs/1704.07073). ACL, 2017 91 | - Maxime Peyrard and Judith Eckle-Kohler. [Supervised Learning of Automatic Pyramid for Optimization-Based Multi-Document Summarization](). ACL, 2017. 92 | - Shashi Narayan, Nikos Papasarantopoulos, Mirella Lapata, Shay B. Cohen. "[Neural Extractive Summarization with Side Information](https://arxiv.org/abs/1704.04530)." arXiv preprint arXiv:1704.04530 (2017). 93 | - Romain Paulus, Caiming Xiong, Richard Socher. "[A Deep Reinforced Model for Abstractive Summarization](https://metamind.io/static/pdf/deep-reinforced-model-arxiv-v1.pdf)." (2017). 94 | - Shibhansh Dohare, Harish Karnick. "[Text Summarization using Abstract Meaning Representation](https://arxiv.org/abs/1706.01678)." arXiv:1706.01678 (2017). 95 | - Michihiro Yasunaga, Rui Zhang, Kshitijh Meelu, Ayush Pareek, Krishnan Srinivasan, Dragomir Radev. "[Graph-based Neural Multi-Document Summarization](https://arxiv.org/abs/1706.06681)." arXiv:1706.06681 (2017). 96 | - Piji Li, Wai Lam, Lidong Bing, and Zihao Wang. [Deep Recurrent Generative Decoder for Abstractive Text Summarization](http://lipiji.com/). Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP'17). Sep 2017. 97 | - Piji Li, Wai Lam, Lidong Bing, Weiwei Guo, and Hang Li. [Cascaded Attention based Unsupervised Information Distillation for Compressive Summarization](http://lipiji.com/). Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP'17). Sep 2017. 98 | - Piji Li, Lidong Bing, Wai Lam. [Reader-Aware Multi-Document Summarization: An Enhanced Model and The First Dataset](http://www1.se.cuhk.edu.hk/~textmine/dataset/ra-mds/). Proceedings of the EMNLP 2017 Workshop on New Frontiers in Summarization (EMNLP-NewSum'17). Sep 2017. 99 | - Tan, Jiwei, Xiaojun Wan, and Jianguo Xiao. "[From Neural Sentence Summarization to Headline Generation: A Coarse-to-Fine Approach](http://static.ijcai.org/proceedings-2017/0574.pdf)." IJCAI 2017. 100 | - Ling, Jeffrey, and Alexander M. Rush. "[Coarse-to-Fine Attention Models for Document Summarization](http://www.aclweb.org/anthology/W/W17/W17-4505.pdf)." EMNLP 2017 (2017): 33. 101 | - Ziqiang Cao, Furu Wei, Wenjie Li, Sujian Li. "[Faithful to the Original: Fact Aware Neural Abstractive Summarization](https://arxiv.org/abs/1711.04434)." arXiv:1711.04434 (2017). 102 | - Angela Fan, David Grangier, Michael Auli. "[Controllable Abstractive Summarization](https://arxiv.org/abs/1711.05217)." arXiv:1711.05217 (2017). 103 | - Liu, Linqing, Yao Lu, Min Yang, Qiang Qu, Jia Zhu, and Hongyan Li. "[Generative Adversarial Network for Abstractive Text Summarization](https://arxiv.org/pdf/1711.09357.pdf)." arXiv preprint arXiv:1711.09357 (2017). 104 | - Narayan, Shashi, Shay B. Cohen, and Mirella Lapata. "[Ranking Sentences for Extractive Summarization with Reinforcement Learning](https://arxiv.org/abs/1802.08636)." arXiv preprint arXiv:1802.08636 (2018). 105 | - Asli Celikyilmaz, Antoine Bosselut, Xiaodong He, Yejin Choi. "[Deep Communicating Agents for Abstractive Summarization](https://arxiv.org/abs/1803.10357)." NAACL (2018). 106 | - Chen, Wenhu, Guanlin Li, Shuo Ren, Shujie Liu, Zhirui Zhang, Mu Li, and Ming Zhou. "[Generative Bridging Network in Neural Sequence Prediction](https://arxiv.org/abs/1706.09152)." NAACL (2018). 107 | - Li, Piji, Lidong Bing, and Wai Lam. "[Actor-Critic based Training Framework for Abstractive Summarization](https://arxiv.org/abs/1803.11070)." arXiv preprint arXiv:1803.11070 (2018). 108 | - Arman Cohan, Franck Dernoncourt, Doo Soon Kim, Trung Bui, Seokhwan Kim, Walter Chang, Nazli Goharian. "[ 109 | A Discourse-Aware Attention Model for Abstractive Summarization of Long Documents](https://arxiv.org/abs/1804.05685)". NAACL, 2018. 110 | - Yuxiang Wu, Baotian Hu. "[Learning to Extract Coherent Summary via Deep Reinforcement Learning](https://arxiv.org/abs/1804.07036)." AAAI (2018). 111 | - Jianmin Zhang, Jiwei Tan, Xiaojun Wan. "[Towards a Neural Network Approach to Abstractive Multi-Document Summarization](https://arxiv.org/abs/1804.09010)." arXiv:1804.09010 (2018). 112 | - Li Wang, Junlin Yao, Yunzhe Tao, Li Zhong, Wei Liu, Qiang Du. "[A Reinforced Topic-Aware Convolutional Sequence-to-Sequence Model for Abstractive Text Summarization](https://arxiv.org/abs/1805.03616)." IJCAI-ECAI (2018). 113 | - Yen-Chun Chen, Mohit Bansal. "[Fast Abstractive Summarization with Reinforce-Selected Sentence Rewriting 114 | ](https://arxiv.org/abs/1805.11080)." arXiv:1805.11080 (2018). 115 | - Song, Kaiqiang, Lin Zhao, and Fei Liu. "[Structure-Infused Copy Mechanisms for Abstractive Summarization](http://www.cs.ucf.edu/~feiliu/papers/COLING2018_StructSumm.pdf)." COLING, 2018. 116 | - Keneshloo, Yaser, Tian Shi, Chandan K. Reddy, and Naren Ramakrishnan. "[Deep Reinforcement Learning For Sequence to Sequence Models](https://arxiv.org/abs/1805.09461)." arXiv preprint arXiv:1805.09461 (2018). 117 | - Qingyu Zhou, Nan Yang, Furu Wei, Ming Zhou. "[Sequential Copying Networks](https://arxiv.org/abs/1807.02301)." AAAI (2018). 118 | - Qingyu Zhou, Nan Yang, Furu Wei, Shaohan Huang, Ming Zhou, Tiejun Zhao. "[Neural Document Summarization by Jointly Learning to Score and Select Sentences](https://arxiv.org/abs/1807.02305)." ACL (2018). 119 | - Lin, Junyang, Xu Sun, Shuming Ma, and Qi Su. "[Global Encoding for Abstractive Summarization](https://arxiv.org/abs/1805.03989)." arXiv preprint arXiv:1805.03989 (2018). 120 | - Khatri, Chandra, Gyanit Singh, and Nish Parikh. "[Abstractive and Extractive Text Summarization using Document Context Vector and Recurrent Neural Networks](https://arxiv.org/abs/1807.08000)." arXiv preprint arXiv:1807.08000 (2018). 121 | - Hsu, Wan-Ting, Chieh-Kai Lin, Ming-Ying Lee, Kerui Min, Jing Tang, and Min Sun. "[A Unified Model for Extractive and Abstractive Summarization using Inconsistency Loss](https://arxiv.org/abs/1805.06266)." arXiv preprint arXiv:1805.06266 (2018). 122 | - Sun, Fei, Peng Jiang, Hanxiao Sun, Changhua Pei, Wenwu Ou, and Xiaobo Wang. "[Multi-Source Pointer Network for Product Title Summarization](https://arxiv.org/abs/1808.06885)." arXiv preprint arXiv:1808.06885 (2018). 123 | - Wojciech Kryściński, Romain Paulus, Caiming Xiong, Richard Socher. "[Improving Abstraction in Text Summarization 124 | ](https://arxiv.org/abs/1808.07913)." arXiv preprint arXiv:1808.07913 (2018). 125 | - Zhang, Xingxing, Mirella Lapata, Furu Wei, and Ming Zhou. "[Neural Latent Extractive Document Summarization](https://arxiv.org/abs/1808.07187)." arXiv preprint arXiv:1808.07187 (2018). 126 | - Sebastian Gehrmann, Yuntian Deng, Alexander M. Rush. "[Bottom-Up Abstractive Summarization](https://arxiv.org/abs/1808.10792)." arXiv preprint arXiv:1808.10792 (2018). 127 | - Yichen Jiang, Mohit Bansal. "[Closed-Book Training to Improve Summarization Encoder Memory](https://arxiv.org/abs/1809.04585)." arXiv preprint arXiv:1809.04585 (2018). 128 | - Kamal Al-Sabahi, Zhang Zuping, Yang Kang. "[Bidirectional Attentional Encoder-Decoder Model and Bidirectional Beam Search for Abstractive Summarization](https://arxiv.org/abs/1809.06662)." arXiv preprint arXiv:1809.06662 (2018). 129 | - Raphael Schumann. "[Unsupervised Abstractive Sentence Summarization using Length Controlled Variational Autoencoder](https://arxiv.org/abs/1809.05233)." arXiv preprint arXiv:1809.05233 (2018). 130 | - Krishna, Kundan, and Balaji Vasan Srinivasan. "[Generating Topic-Oriented Summaries Using Neural Attention](http://www.aclweb.org/anthology/N18-1153)." NAACL 2018. 131 | - Lisa Fan, Dong Yu, Lu Wang. "[Robust Neural Abstractive Summarization Systems and Evaluation against Adversarial Information](https://arxiv.org/abs/1810.06065)." arXiv preprint arXiv:1810.06065 (2018). 132 | - Eric Chu, Peter J. Liu. "[Unsupervised Neural Multi-document Abstractive Summarization](https://arxiv.org/abs/1810.05739)." arXiv preprint arXiv:1810.05739 (2018). 133 | - Yaser Keneshloo, Naren Ramakrishnan, Chandan K. Reddy. "[Deep Transfer Reinforcement Learning for Text Summarization](https://arxiv.org/abs/1810.06667)." arXiv preprint arXiv:1810.06667 (2018). 134 | - Mahnaz Koupaee, William Yang Wang. "[WikiHow: A Large Scale Text Summarization Dataset 135 | ](https://arxiv.org/abs/1810.09305)." arXiv preprint arXiv:1810.09305 (2018). 136 | - Li Dong, Nan Yang, Wenhui Wang, Furu Wei, Xiaodong Liu, Yu Wang, Jianfeng Gao, Ming Zhou, Hsiao-Wuen Hon. "[Unified Language Model Pre-training for Natural Language Understanding and Generation](https://arxiv.org/abs/1905.03197)." arXiv preprint arXiv:1905.03197 (2019). 137 | 138 | ### Opinion Summarization 139 | - Wu, Haibing, Yiwei Gu, Shangdi Sun, and Xiaodong Gu. "[Aspect-based Opinion Summarization with Convolutional Neural Networks](http://arxiv.org/abs/1511.09128)." arXiv preprint arXiv:1511.09128 (2015). 140 | - Irsoy, Ozan, and Claire Cardie. "[Opinion Mining with Deep Recurrent Neural Networks](http://anthology.aclweb.org/D/D14/D14-1080.pdf)." In EMNLP, pp. 720-728. 2014. 141 | - Piji Li, Zihao Wang, Zhaochun Ren, Lidong Bing, Wai Lam. "[Neural Rating Regression with Abstractive Tips Generation for Recommendation](https://arxiv.org/abs/1708.00154).". In SIGIR, 2017. 142 |   143 | ### Video Summarization 144 | - Zhou, Kaiyang, and Yu Qiao. "[Deep Reinforcement Learning for Unsupervised Video Summarization with Diversity-Representativeness Reward](https://arxiv.org/abs/1801.00054)." arXiv preprint arXiv:1801.00054 (2017). 145 | - Mahasseni, Behrooz, Michael Lam, and Sinisa Todorovic. "[Unsupervised video summarization with adversarial lstm networks](http://openaccess.thecvf.com/content_cvpr_2017/papers/Mahasseni_Unsupervised_Video_Summarization_CVPR_2017_paper.pdf)." In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 2017. 146 | 147 | ### Reading Comprehension 148 | - Hermann, Karl Moritz, Tomas Kocisky, Edward Grefenstette, Lasse Espeholt, Will Kay, Mustafa Suleyman, and Phil Blunsom. "[Teaching machines to read and comprehend](http://papers.nips.cc/paper/5945-teaching-machines-to-read-and-comprehend)." In Advances in Neural Information Processing Systems, pp. 1693-1701. 2015. 149 | - Hill, Felix, Antoine Bordes, Sumit Chopra, and Jason Weston. "[The Goldilocks Principle: Reading Children's Books with Explicit Memory Representations](http://arxiv.org/abs/1511.02301)." arXiv preprint arXiv:1511.02301 (2015). 150 | - Kadlec, Rudolf, Martin Schmid, Ondrej Bajgar, and Jan Kleindienst. "[Text Understanding with the Attention Sum Reader Network](http://arxiv.org/abs/1603.01547)." arXiv preprint arXiv:1603.01547 (2016). 151 | - Chen, Danqi, Jason Bolton, and Christopher D. Manning. "[A thorough examination of the cnn/daily mail reading comprehension task](http://arxiv.org/abs/1606.02858)." arXiv preprint arXiv:1606.02858 (2016). 152 | - Dhingra, Bhuwan, Hanxiao Liu, William W. Cohen, and Ruslan Salakhutdinov. "[Gated-Attention Readers for Text Comprehension](http://arxiv.org/abs/1606.01549)." arXiv preprint arXiv:1606.01549 (2016). 153 | - Sordoni, Alessandro, Phillip Bachman, and Yoshua Bengio. "[Iterative Alternating Neural Attention for Machine Reading](http://arxiv.org/abs/1606.02245)." arXiv preprint arXiv:1606.02245 (2016). 154 | - Trischler, Adam, Zheng Ye, Xingdi Yuan, and Kaheer Suleman. "[Natural Language Comprehension with the EpiReader](http://arxiv.org/abs/1606.02270)." arXiv preprint arXiv:1606.02270 (2016). 155 | - Yiming Cui, Zhipeng Chen, Si Wei, Shijin Wang, Ting Liu, Guoping Hu. "[Attention-over-Attention Neural Networks for Reading Comprehension](http://arxiv.org/abs/1607.04423)." arXiv preprint arXiv:1607.04423 (2016). 156 | - Yiming Cui, Ting Liu, Zhipeng Chen, Shijin Wang, Guoping Hu. "[Consensus Attention-based Neural Networks for Chinese Reading Comprehension](https://arxiv.org/abs/1607.02250)." arXiv preprint arXiv:1607.02250 (2016). 157 | - Daniel Hewlett, Alexandre Lacoste, Llion Jones, Illia Polosukhin, Andrew Fandrianto, Jay Han, Matthew Kelcey and David Berthelot. "[WIKIREADING: A Novel Large-scale Language Understanding Task over Wikipedia](http://www.aclweb.org/anthology/P/P16/P16-1145.pdf)." ACL (2016). pp. 1535-1545. 158 | - Minghao Hu, Yuxing Peng, Xipeng Qiu. "[Mnemonic Reader for Machine Comprehension](https://arxiv.org/abs/1705.02798)." arXiv:1705.02798 (2017). 159 | - Wenhui Wang, Nan Yang, Furu Wei, Baobao Chang and Ming Zhou. "[R-NET: Machine Reading Comprehension with Self-matching Networks](https://www.microsoft.com/en-us/research/publication/mcr/)." ACL (2017). 160 | 161 | 162 | ### Sentence Modelling 163 | - Kalchbrenner, Nal, Edward Grefenstette, and Phil Blunsom. "[A convolutional neural network for modelling sentences](http://arxiv.org/abs/1404.2188)." arXiv preprint arXiv:1404.2188 (2014). 164 | - Kim, Yoon. "[Convolutional neural networks for sentence classification](http://arxiv.org/abs/1408.5882)." arXiv preprint arXiv:1408.5882 (2014). 165 | - Le, Quoc V., and Tomas Mikolov. "[Distributed representations of sentences and documents](http://arxiv.org/abs/1405.4053)." arXiv preprint arXiv:1405.4053 (2014). 166 | - Yang, Zichao, Diyi Yang, Chris Dyer, Xiaodong He, Alex Smola, and Eduard Hovy. "[Hierarchical Attention Networks for Document Classification](http://www.cs.cmu.edu/~diyiy/docs/naacl16.pdf)." In Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. 2016. 167 | 168 | ### Reasoning 169 | - Peng, Baolin, Zhengdong Lu, Hang Li, and Kam-Fai Wong. "[Towards Neural Network-based Reasoning](http://arxiv.org/abs/1508.05508)." arXiv preprint arXiv:1508.05508 (2015). 170 | 171 | ### Knowledge Engine 172 | - Bordes, Antoine, Nicolas Usunier, Alberto Garcia-Duran, Jason Weston, and Oksana Yakhnenko. "[Translating embeddings for modeling multi-relational data](http://papers.nips.cc/paper/5071-translating-embeddings-for-modeling-multi-relational-data)." In Advances in Neural Information Processing Systems, pp. 2787-2795. 2013. TransE 173 | - Lin, Yankai, Shiqi Shen, Zhiyuan Liu, Huanbo Luan, and Maosong Sun. "[Neural Relation Extraction with Selective Attention over Instances](http://nlp.csai.tsinghua.edu.cn/~lzy/publications/acl2016_nre.pdf)." ACL (2016) 174 | - TransXXX 175 | 176 | ### Memory Networks 177 | - Graves, Alex, Greg Wayne, and Ivo Danihelka. "[Neural turing machines](http://arxiv.org/abs/1410.5401)." arXiv preprint arXiv:1410.5401 (2014). 178 | - Weston, Jason, Sumit Chopra, and Antoine Bordes. "[Memory networks](http://arxiv.org/abs/1410.3916)." ICLR (2014). 179 | - Sukhbaatar, Sainbayar, Jason Weston, and Rob Fergus. "[End-to-end memory networks](http://papers.nips.cc/paper/5846-end-to-end-memory-networks)." In Advances in neural information processing systems, pp. 2440-2448. 2015. 180 | - Weston, Jason, Antoine Bordes, Sumit Chopra, Alexander M. Rush, Bart van Merriënboer, Armand Joulin, and Tomas Mikolov. "[Towards ai-complete question answering: A set of prerequisite toy tasks](http://arxiv.org/abs/1502.05698)." arXiv preprint arXiv:1502.05698 (2015). 181 | - Bordes, Antoine, Nicolas Usunier, Sumit Chopra, and Jason Weston. "[Large-scale simple question answering with memory networks](http://arxiv.org/abs/1506.02075)." arXiv preprint arXiv:1506.02075 (2015). 182 | - Kumar, Ankit, Ozan Irsoy, Jonathan Su, James Bradbury, Robert English, Brian Pierce, Peter Ondruska, Ishaan Gulrajani, and Richard Socher. "[Ask me anything: Dynamic memory networks for natural language processing](http://arxiv.org/abs/1506.07285)." arXiv preprint arXiv:1506.07285 (2015). 183 | - Dodge, Jesse, Andreea Gane, Xiang Zhang, Antoine Bordes, Sumit Chopra, Alexander Miller, Arthur Szlam, and Jason Weston. "[Evaluating prerequisite qualities for learning end-to-end dialog systems](http://arxiv.org/abs/1511.06931)." arXiv preprint arXiv:1511.06931 (2015). 184 | - Hill, Felix, Antoine Bordes, Sumit Chopra, and Jason Weston. "[The Goldilocks Principle: Reading Children's Books with Explicit Memory Representations](http://arxiv.org/abs/1511.02301)." arXiv preprint arXiv:1511.02301 (2015). 185 | - Weston, Jason. "[Dialog-based Language Learning](http://arxiv.org/abs/1604.06045)." arXiv preprint arXiv:1604.06045 (2016). 186 | - Bordes, Antoine, and Jason Weston. "[Learning End-to-End Goal-Oriented Dialog](http://arxiv.org/abs/1605.07683)." arXiv preprint arXiv:1605.07683 (2016). 187 | - Chandar, Sarath, Sungjin Ahn, Hugo Larochelle, Pascal Vincent, Gerald Tesauro, and Yoshua Bengio. "[Hierarchical Memory Networks](https://arxiv.org/abs/1605.07427)." arXiv preprint arXiv:1605.07427 (2016). 188 | - Jason Weston."[Memory Networks for Language Understanding](http://www.thespermwhale.com/jaseweston/icml2016/)." ICML Tutorial 2016 189 | - Tang, Yaohua, Fandong Meng, Zhengdong Lu, Hang Li, and Philip LH Yu. "[Neural Machine Translation with External Phrase Memory](http://arxiv.org/abs/1606.01792)." arXiv preprint arXiv:1606.01792 (2016). 190 | - Wang, Mingxuan, Zhengdong Lu, Hang Li, and Qun Liu. "[Memory-enhanced Decoder for Neural Machine Translation](http://arxiv.org/abs/1606.02003)." arXiv preprint arXiv:1606.02003 (2016). 191 | - Xiong, Caiming, Stephen Merity, and Richard Socher. "[Dynamic memory networks for visual and textual question answering](https://arxiv.org/abs/1603.01417)." arXiv preprint arXiv:1603.01417 (2016). 192 | 193 | ### Neural Structures 194 | - Srivastava, Rupesh Kumar, Klaus Greff, and Jürgen Schmidhuber. "[Highway networks](http://arxiv.org/abs/1505.00387)." arXiv preprint arXiv:1505.00387 (2015). 195 | - Srivastava, Rupesh K., Klaus Greff, and Jürgen Schmidhuber. "[Training very deep networks](http://arxiv.org/abs/1507.06228)." In Advances in Neural Information Processing Systems, pp. 2368-2376. 2015. 196 | - Vinyals, Oriol, Meire Fortunato, and Navdeep Jaitly. "[Pointer networks](https://arxiv.org/abs/1506.03134)." In Advances in Neural Information Processing Systems, pp. 2692-2700. 2015. 197 | - Rasmus, Antti, Mathias Berglund, Mikko Honkala, Harri Valpola, and Tapani Raiko. "[Semi-supervised learning with ladder networks](http://arxiv.org/abs/1507.02672)." In Advances in Neural Information Processing Systems, pp. 3546-3554. 2015. 198 | - Bengio, Samy, Oriol Vinyals, Navdeep Jaitly, and Noam Shazeer. "[Scheduled sampling for sequence prediction with recurrent neural networks](https://arxiv.org/abs/1506.03099)." In Advances in Neural Information Processing Systems, pp. 1171-1179. 2015. 199 | - He, Kaiming, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. "[Deep Residual Learning for Image Recognition](http://arxiv.org/abs/1512.03385)." arXiv preprint arXiv:1512.03385 (2015). 200 | - He, Kaiming. "[Tutorial: Deep Residual Networks: Deep Learning Gets Way Deeper](http://icml.cc/2016/tutorials/icml2016_tutorial_deep_residual_networks_kaiminghe.pdf)." ICML 2016 tutorial. 201 | - Courbariaux, Matthieu, and Yoshua Bengio. "[Binarynet: Training deep neural networks with weights and activations constrained to+ 1 or-1](http://arxiv.org/abs/1602.02830)." arXiv preprint arXiv:1602.02830 (2016). 202 | - Jiatao Gu, Zhengdong Lu, Hang Li, Victor O.K. Li. "[Incorporating Copying Mechanism in Sequence-to-Sequence Learning](http://arxiv.org/abs/1603.06393)." ACL (2016) 203 | - Gulcehre, Caglar, Sungjin Ahn, Ramesh Nallapati, Bowen Zhou, and Yoshua Bengio. "[Pointing the Unknown Words](http://arxiv.org/abs/1603.08148)." arXiv preprint arXiv:1603.08148 (2016). 204 | - Andreas, Jacob, Marcus Rohrbach, Trevor Darrell, and Dan Klein. "[Learning to compose neural networks for question answering](http://arxiv.org/abs/1601.01705)." NAACL 2016. 205 | - Julian Georg Zilly, Rupesh Kumar Srivastava, Jan Koutník, Jürgen Schmidhuber. "[Recurrent Highway Networks](http://arxiv.org/abs/1607.03474)." arXiv preprint arXiv:1607.03474 (2016). 206 | - Zhilin Yang, Ye Yuan, Yuexin Wu, Ruslan Salakhutdinov, William W. Cohen. "[Review Networks for Caption Generation](https://arxiv.org/abs/1605.07912)." arXiv preprint arXiv:1605.07912 (2016). 207 | - Xiang Li, Tao Qin, Jian Yang, Tie-Yan Liu. "[LightRNN: Memory and Computation-Efficient Recurrent Neural Networks](https://arxiv.org/abs/1610.09893)." arXiv preprint arXiv:1610.09893 (2016). 208 | - Zhaopeng Tu, Yang Liu, Lifeng Shang, Xiaohua Liu, Hang Li. "[Neural Machine Translation with Reconstruction](https://arxiv.org/abs/1611.01874)." arXiv preprint arXiv:1611.01874 (2016). 209 | - Yingce Xia, Di He, Tao Qin, Liwei Wang, Nenghai Yu, Tie-Yan Liu, Wei-Ying Ma. "[Dual Learning for Machine Translation](https://arxiv.org/abs/1611.00179)." arXiv preprint arXiv:1611.00179 (2016). 210 | - Bahdanau, Dzmitry, Philemon Brakel, Kelvin Xu, Anirudh Goyal, Ryan Lowe, Joelle Pineau, Aaron Courville, and Yoshua Bengio. "[An actor-critic algorithm for sequence prediction](https://arxiv.org/abs/1607.07086)." arXiv preprint arXiv:1607.07086 (2016). 211 | - Kannan, Anjuli, and Oriol Vinyals. "[Adversarial evaluation of dialogue models](https://arxiv.org/abs/1701.08198)." arXiv preprint arXiv:1701.08198 (2017). 212 | - Kawthekar, Prasad, Raunaq Rewari, and Suvrat Bhooshan. "[Evaluating Generative Models for Text Generation](https://web.stanford.edu/class/cs224n/reports/2737434.pdf)." 213 | - Li, Jiwei, Will Monroe, Tianlin Shi, Alan Ritter, and Dan Jurafsky. "[Adversarial Learning for Neural Dialogue Generation](https://arxiv.org/abs/1701.06547)." arXiv preprint arXiv:1701.06547 (2017). 214 | - Yang, Zhen, Wei Chen, Feng Wang, and Bo Xu. "[Improving Neural Machine Translation with Conditional Sequence Generative Adversarial Nets](https://arxiv.org/abs/1703.04887)." arXiv preprint arXiv:1703.04887 (2017). 215 | - Lijun Wu, Yingce Xia, Li Zhao, Fei Tian, Tao Qin, Jianhuang Lai, Tie-Yan Liu. "[Adversarial Neural Machine Translation](https://arxiv.org/abs/1704.06933)." IJCAI (2017). 216 | - Liu, Pengfei, Xipeng Qiu, and Xuanjing Huang. "[Adversarial Multi-task Learning for Text Classification](https://arxiv.org/abs/1704.05742)." arXiv preprint arXiv:1704.05742 (2017). 217 | - Jonas Gehring, Michael Auli, David Grangier, Denis Yarats, Yann N. Dauphin. "[Convolutional Sequence to Sequence Learning (https://arxiv.org/abs/1705.03122)." arXiv:1705.03122 (2017). 218 | - Lamb, Alex M., Anirudh Goyal ALIAS PARTH GOYAL, Ying Zhang, Saizheng Zhang, Aaron C. Courville, and Yoshua Bengio. "[Professor forcing: A new algorithm for training recurrent networks](https://arxiv.org/abs/1610.09038)." In Advances In Neural Information Processing Systems, pp. 4601-4609. 2016. 219 | - Rezende, Danilo Jimenez, Shakir Mohamed, and Daan Wierstra. "[Stochastic backpropagation and approximate inference in deep generative models](http://arxiv.org/abs/1401.4082)." arXiv preprint arXiv:1401.4082 (2014). 220 | - Kingma, Diederik P., and Max Welling. "[Auto-encoding variational bayes](http://arxiv.org/abs/1312.6114)." arXiv preprint arXiv:1312.6114 (2013). 221 | - Fabius, Otto, and Joost R. van Amersfoort. "[Variational recurrent auto-encoders](https://arxiv.org/abs/1412.6581)." arXiv preprint arXiv:1412.6581 (2014). 222 | - Bayer, Justin, and Christian Osendorfer. "[Learning stochastic recurrent networks](http://arxiv.org/abs/1411.7610)." arXiv preprint arXiv:1411.7610 (2014). 223 | - Bowman, Samuel R., Luke Vilnis, Oriol Vinyals, Andrew M. Dai, Rafal Jozefowicz, and Samy Bengio. "[Generating sentences from a continuous space](https://arxiv.org/abs/1511.06349)." arXiv preprint arXiv:1511.06349 (2015). 224 | - Gregor, Karol, Ivo Danihelka, Alex Graves, Danilo Jimenez Rezende, and Daan Wierstra. "[DRAW: A recurrent neural network for image generation](http://arxiv.org/abs/1502.04623)." arXiv preprint arXiv:1502.04623 (2015). 225 | - Makhzani, Alireza, Jonathon Shlens, Navdeep Jaitly, and Ian Goodfellow. "[Adversarial autoencoders](http://arxiv.org/abs/1511.05644)." arXiv preprint arXiv:1511.05644 (2015). 226 | - Johnson, Matthew J., David Duvenaud, Alexander B. Wiltschko, Sandeep R. Datta, and Ryan P. Adams. "[Composing graphical models with neural networks for structured representations and fast inference](http://arxiv.org/abs/1603.06277)." arXiv preprint arXiv:1603.06277 (2016). 227 | - Doersch, Carl. "[Tutorial on Variational Autoencoders](https://arxiv.org/abs/1606.05908)." arXiv preprint arXiv:1606.05908 (2016). 228 | - Chung, Junyoung, Kyle Kastner, Laurent Dinh, Kratarth Goel, Aaron C. Courville, and Yoshua Bengio. "[A recurrent latent variable model for sequential data](http://arxiv.org/abs/1506.02216)." In Advances in neural information processing systems, pp. 2980-2988. 2015. 229 | - Eslami, S. M., Nicolas Heess, Theophane Weber, Yuval Tassa, Koray Kavukcuoglu, and Geoffrey E. Hinton. "[Attend, Infer, Repeat: Fast Scene Understanding with Generative Models](https://arxiv.org/abs/1603.08575)." arXiv preprint arXiv:1603.08575 (2016). 230 | - Shengjia Zhao, Jiaming Song, Stefano Ermon. "[InfoVAE: Information Maximizing Variational Autoencoders](https://arxiv.org/abs/1706.02262)." arXiv:1706.02262 (2017). 231 | - Goodfellow, Ian, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, and Yoshua Bengio. "[Generative adversarial nets](http://arxiv.org/abs/1406.2661)." In Advances in Neural Information Processing Systems, pp. 2672-2680. 2014 232 | - Radford, Alec, Luke Metz, and Soumith Chintala. "[Unsupervised representation learning with deep convolutional generative adversarial networks](http://arxiv.org/abs/1511.06434)." arXiv preprint arXiv:1511.06434 (2015). 233 | - Denton, Emily L., Soumith Chintala, and Rob Fergus. "[Deep Generative Image Models using a Laplacian Pyramid of Adversarial Networks](http://arxiv.org/abs/1506.05751)." In Advances in neural information processing systems, pp. 1486-1494. 2015. 234 | - Dosovitskiy, Alexey, Jost Tobias Springenberg, and Thomas Brox. "[Learning to generate chairs with convolutional neural networks](http://arxiv.org/abs/1411.5928)." In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1538-1546. 2015. 235 | - Mathieu, Michael, Camille Couprie, and Yann LeCun. "[Deep multi-scale video prediction beyond mean square error](http://arxiv.org/abs/1511.05440)." arXiv preprint arXiv:1511.05440 (2015). 236 | - Salimans, Tim, Ian Goodfellow, Wojciech Zaremba, Vicki Cheung, Alec Radford, and Xi Chen. "[Improved Techniques for Training GANs](http://arxiv.org/abs/1606.03498)." arXiv preprint arXiv:1606.03498 (2016). 237 | - Chen, Xi, Yan Duan, Rein Houthooft, John Schulman, Ilya Sutskever, and Pieter Abbeel. "[InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets](http://arxiv.org/abs/1606.03657)." arXiv preprint arXiv:1606.03657 (2016). 238 | - Im, Daniel Jiwoong, Chris Dongjoo Kim, Hui Jiang, and Roland Memisevic. "[Generating images with recurrent adversarial networks](http://arxiv.org/abs/1602.05110)." arXiv preprint arXiv:1602.05110 (2016). 239 | - Yu, Lantao, Weinan Zhang, Jun Wang, and Yong Yu. "[SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient](http://arxiv.org/abs/1609.05473)." arXiv preprint arXiv:1609.05473 (2016). 240 | - Augustus Odena, Christopher Olah, Jonathon Shlens. "[Conditional Image Synthesis With Auxiliary Classifier GANs](https://arxiv.org/abs/1610.09585)." arXiv preprint arXiv:1610.09585 (2016). 241 | - Ian Goodfellow. "[NIPS Tutorial: GANs](http://www.iangoodfellow.com/slides/2016-12-04-NIPS.pdf)", NIPS, 2016 242 | - Che, Tong, Yanran Li, Ruixiang Zhang, R. Devon Hjelm, Wenjie Li, Yangqiu Song, and Yoshua Bengio. "[Maximum-Likelihood Augmented Discrete Generative Adversarial Networks](https://arxiv.org/abs/1702.07983)." arXiv preprint arXiv:1702.07983 (2017). 243 | - Junbo (Jake) Zhao, Yoon Kim, Kelly Zhang, Alexander M. Rush, Yann LeCun. "[Adversarially Regularized Autoencoders for Generating Discrete Structures](https://arxiv.org/abs/1706.04223)." arXiv preprint arXiv:1706.04223 (2017). 244 | - Mike Lewis Denis Yarats Yann N. Dauphin Devi Parikh Dhruv Batra . "[ Deal or No Deal? End-to-End Learning for Negotiation Dialogues](http://s3.amazonaws.com/end-to-end-negotiator/end-to-end-negotiator.pdf)." (2017). 245 | - Mihaela Rosca, Balaji Lakshminarayanan, David Warde-Farley, Shakir Mohamed. "[Variational Approaches for Auto-Encoding Generative Adversarial Networks](https://arxiv.org/abs/1706.04987)." arXiv preprint arXiv:1706.04987 (2017). 246 | - Goyal, Prasoon, Zhiting Hu, Xiaodan Liang, Chenyu Wang, and Eric Xing. "[Nonparametric Variational Auto-encoders for Hierarchical Representation Learning](https://arxiv.org/pdf/1703.07027.pdf)." arXiv preprint arXiv:1703.07027 (2017). 247 | - Sabour, Sara, Nicholas Frosst, and Geoffrey Hinton. "[Dynamic Routing between Capsules](https://arxiv.org/abs/1710.09829)." (2017). 248 | - Vaswani, Ashish, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Łukasz Kaiser, and Illia Polosukhin. "[Attention is all you need](http://papers.nips.cc/paper/7181-attention-is-all-you-need)." NIPS. 2017. 249 | 250 | #### Architecture Search 251 | - Frankle, Jonathan, and Michael Carbin. "The lottery ticket hypothesis: Finding sparse, trainable neural networks." arXiv preprint arXiv:1803.03635 (2018). 252 | - Xie, Saining, Alexander Kirillov, Ross Girshick, and Kaiming He. "Exploring Randomly Wired Neural Networks for Image Recognition." arXiv preprint arXiv:1904.01569 (2019). 253 | - So, David R., Chen Liang, and Quoc V. Le. "The Evolved Transformer." arXiv preprint arXiv:1901.11117 (2019). 254 | - Chenguang Wang, Mu Li, Alexander J. Smola. "Language Models with Transformers." arXiv preprint arXiv:1904.09408 (2019). 255 | 256 | ### Recommendation System 257 | - Salakhutdinov, Ruslan, Andriy Mnih, and Geoffrey Hinton. "[Restricted Boltzmann machines for collaborative filtering](http://dl.acm.org/citation.cfm?id=1273596)." In Proceedings of the 24th international conference on Machine learning, pp. 791-798. ACM, 2007. 258 | - Wang, Hao, Xingjian Shi, and Dit-Yan Yeung. "[Relational Stacked Denoising Autoencoder for Tag Recommendation](http://www.wanghao.in/paper/AAAI15_RSDAE.pdf)." In AAAI, pp. 3052-3058. 2015. 259 | - Wang, Hao, Naiyan Wang, and Dit-Yan Yeung. "[Collaborative deep learning for recommender systems](http://dl.acm.org/citation.cfm?id=2783273)." In Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 1235-1244. ACM, 2015. 260 | - Covington, Paul, Jay Adams, and Emre Sargin. "[Deep neural networks for youtube recommendations](http://dl.acm.org/citation.cfm?id=2959190)." In Proceedings of the 10th ACM Conference on Recommender Systems, pp. 191-198. ACM, 2016. 261 | - Devooght, Robin, and Hugues Bersini. "[Collaborative Filtering with Recurrent Neural Networks](https://arxiv.org/abs/1608.07400)." arXiv preprint arXiv:1608.07400 (2016). 262 | - Wang, Hao, S. H. I. Xingjian, and Dit-Yan Yeung. "[Collaborative recurrent autoencoder: Recommend while learning to fill in the blanks](http://papers.nips.cc/paper/6163-collaborative-recurrent-autoencoder-recommend-while-learning-to-fill-in-the-blanks)." In Advances in Neural Information Processing Systems, pp. 415-423. 2016. 263 | - Tang, Jian, Yifan Yang, Sam Carton, Ming Zhang, and Qiaozhu Mei. "[Context-aware Natural Language Generation with Recurrent Neural Networks](https://arxiv.org/abs/1611.09900)." arXiv preprint arXiv:1611.09900 (2016). 264 | - Zhang, Fuzheng, Nicholas Jing Yuan, Defu Lian, Xing Xie, and Wei-Ying Ma. "[Collaborative Knowledge Base Embedding for Recommender Systems](http://www.kdd.org/kdd2016/subtopic/view/collaborative-knowledge-base-embedding-for-recommender-systems)." KDD, 2016. 265 | - Dong, Li, Shaohan Huang, Furu Wei, Mirella Lapata, Ming Zhou, and Ke XuΤ. "[Learning to Generate Product Reviews from Attributes](http://www.aclweb.org/anthology/E/E17/E17-1059.pdf)." EACL, 2017. 266 | - He, Xiangnan. "[Neural Collaborative Filtering](http://www.comp.nus.edu.sg/~xiangnan/papers/ncf.pdf)." WWW, 2017 267 | - Wu, Chao-Yuan, Amr Ahmed, Alex Beutel, Alexander J. Smola, and How Jing. "[Recurrent Recommender Networks](http://alexbeutel.com/papers/rrn_wsdm2017.pdf)." Training 10, no. 2: 10-1.2017 268 | - Radford, Alec, Rafal Jozefowicz, and Ilya Sutskever. "[Learning to generate reviews and discovering sentiment](https://arxiv.org/pdf/1704.01444.pdf)." arXiv preprint arXiv:1704.01444 (2017). 269 | - Piji Li, Zihao Wang, Zhaochun Ren, Lidong Bing, Wai Lam. "[Neural Rating Regression with Abstractive Tips Generation for Recommendation](https://arxiv.org/abs/1708.00154).". In SIGIR, pp xx-xx. 2017. 270 | 271 | ### Network Representation Learning 272 | - [Must-read papers on network representation learning (NRL)/network embedding (NE)](https://github.com/thunlp/NRLPapers) 273 | 274 | ### Music Generation 275 | - [Using machine learning to generate music](http://www.datasciencecentral.com/profiles/blogs/using-machine-learning-to-generate-music) 276 | 277 | ### Computational Biology 278 | - [Awesome DeepBio](https://github.com/gokceneraslan/awesome-deepbio) by Gökçen Eraslan 279 | 280 | ### GO 281 | - Silver, David, Aja Huang, Chris J. Maddison, Arthur Guez, Laurent Sifre, George van den Driessche, Julian Schrittwieser et al. "[Mastering the game of Go with deep neural networks and tree search](http://www.nature.com/nature/journal/v529/n7587/full/nature16961.html)." Nature 529, no. 7587 (2016): 484-489. 282 | - Tian, Yuandong, and Yan Zhu. "[Better Computer Go Player with Neural Network and Long-term Prediction](http://arxiv.org/abs/1511.06410)." arXiv preprint arXiv:1511.06410 (2015). 283 | 284 | ### Stock Prediction 285 | - Xiao Ding, Yue Zhang, Ting Liu, Junwen Duan. "Deep Learning for Event-Driven Stock Prediction". IJCAI 2015. 286 | - Si, Jianfeng, Arjun Mukherjee, Bing Liu, Sinno Jialin Pan, Qing Li, and Huayi Li. "[Exploiting Social Relations and Sentiment for Stock Prediction](http://www.aclweb.org/anthology/D14-1120)." EMNLP 2014. 287 | - Ding, Xiao, Yue Zhang, Ting Liu, and Junwen Duan. "[Using Structured Events to Predict Stock Price Movement: An Empirical Investigation](http://anthology.aclweb.org/D/D14/D14-1148.pdf)." EMNLP 2014. 288 | - Bollen, Johan, Huina Mao, and Xiaojun Zeng. "[Twitter mood predicts the stock market](http://arxiv.org/abs/1010.3003)." Journal of Computational Science 2, no. 1 (2011): 1-8. 289 | - Hengjian Jia. "[Investigation Into The Effectiveness Of Long Short Term Memory Networks For Stock Price Prediction](http://arxiv.org/abs/1603.07893)." arXiv:1603.07893. (2016) 290 | --------------------------------------------------------------------------------