└── README.md


/README.md:
--------------------------------------------------------------------------------
  1 | ### Startups
  2 |   - [Machine Learning, Deep Learning, Computer Vision, and Big Data Startups - Startups in AI](https://github.com/lipiji/AIStartups)
  3 | 
  4 | ## Deep Reinforcement Learning
  5 |  - David Silver. "[Tutorial: Deep Reinforcement Learning](http://icml.cc/2016/tutorials/deep_rl_tutorial.pdf)." ICML 2016.
  6 |  - David Silver’s course. "[Reinforcement Learning](http://www0.cs.ucl.ac.uk/staff/D.Silver/web/Teaching.html)". 2015.
  7 |  - Bahdanau, Dzmitry, Philemon Brakel, Kelvin Xu, Anirudh Goyal, Ryan Lowe, Joelle Pineau, Aaron Courville, and Yoshua Bengio. "[An Actor-Critic Algorithm for Sequence Prediction](http://arxiv.org/abs/1607.07086)." arXiv preprint arXiv:1607.07086 (2016).
  8 |  - Li, Jiwei, Will Monroe, Alan Ritter, and Dan Jurafsky. "[Deep Reinforcement Learning for Dialogue Generation](http://arxiv.org/abs/1606.01541)." arXiv preprint arXiv:1606.01541 (2016).
  9 |  - Pathak, Deepak, Pulkit Agrawal, Alexei A. Efros, and Trevor Darrell. "[Curiosity-driven Exploration by Self-supervised Prediction](https://arxiv.org/abs/1705.05363)." arXiv preprint arXiv:1705.05363 (2017).
 10 |  - Keneshloo, Yaser, Tian Shi, Chandan K. Reddy, and Naren Ramakrishnan. "[Deep Reinforcement Learning For Sequence to Sequence Models](https://arxiv.org/abs/1805.09461)." arXiv preprint arXiv:1805.09461 (2018).
 11 | 
 12 | ## Dialogue System
 13 | - Jiang, Shaojie, and Maarten de Rijke. "[Why are Sequence-to-Sequence Models So Dull?](https://staff.fnwi.uva.nl/m.derijke/wp-content/papercite-data/pdf/jiang-why-2018.pdf)." report, 2018.
 14 | - Eric Chu, Prashanth Vijayaraghavan, Deb Roy. "[Learning Personas from Dialogue with Attentive Memory Networks](https://arxiv.org/abs/1810.08717)." EMNLP (2018).
 15 | - Ruizhe Li, Chenghua Lin, Matthew Collinson, Xiao Li, Guanyi Chen. "[A Dual-Attention Hierarchical Recurrent Neural Network for Dialogue Act Classification](https://arxiv.org/abs/1810.09154)."  arXiv:1810.09154 (2018).
 16 | 
 17 | #### Task-Oriented Dialogue
 18 | - Wen, Tsung-Hsien, David Vandyke, Nikola Mrksic, Milica Gasic, Lina M. Rojas-Barahona, Pei-Hao Su, Stefan Ultes, and Steve Young. "[A network-based end-to-end trainable task-oriented dialogue system](https://arxiv.org/abs/1604.04562)." arXiv preprint arXiv:1604.04562 (2016).
 19 | - Li, Xiujun, Yun-Nung Chen, Lihong Li, Jianfeng Gao, and Asli Celikyilmaz. "[End-to-end task-completion neural dialogue systems](https://arxiv.org/abs/1703.01008)." arXiv preprint arXiv:1703.01008 (2017).
 20 | - Li, Xiujun, Zachary C. Lipton, Bhuwan Dhingra, Lihong Li, Jianfeng Gao, and Yun-Nung Chen. "[A user simulator for task-completion dialogues](https://arxiv.org/abs/1612.05688)." arXiv preprint arXiv:1612.05688 (2016).
 21 | - Yan, Zhao, Nan Duan, Peng Chen, Ming Zhou, Jianshe Zhou, and Zhoujun Li. "[Building Task-Oriented Dialogue Systems for Online Shopping](http://www.aaai.org/ocs/index.php/AAAI/AAAI17/paper/viewPaper/14261)." In AAAI, pp. 4618-4626. 2017.
 22 | - Peng, Baolin, Xiujun Li, Jianfeng Gao, Jingjing Liu, and Kam-Fai Wong. "[Deep Dyna-Q: Integrating Planning for Task-Completion Dialogue Policy Learning](http://www.aclweb.org/anthology/P18-1203)." ACL, vol. 1, pp. 2182-2192. 2018.
 23 | - Janarthanan Rajendran, Jatin Ganhotra, Satinder Singh, Lazaros Polymenakos. "[Learning End-to-End Goal-Oriented Dialog with Multiple Answers](https://arxiv.org/abs/1808.09996)." arXiv preprint arXiv:1808.09996 (2018).
 24 | 
 25 | ## Text Generation
 26 | - Rennie, Steven J., Etienne Marcheret, Youssef Mroueh, Jarret Ross, and Vaibhava Goel. "[Self-critical sequence training for image captioning](https://arxiv.org/abs/1612.00563)." arXiv preprint arXiv:1612.00563 (2016).
 27 | - Lin, Kevin, Dianqi Li, Xiaodong He, Zhengyou Zhang, and Ming-Ting Sun. "[Adversarial Ranking for Language Generation](https://arxiv.org/pdf/1705.11001.pdf)." arXiv preprint arXiv:1705.11001 (2017).
 28 | - Zhang, Li, Flood Sung, Feng Liu, Tao Xiang, Shaogang Gong, Yongxin Yang, and Timothy M. Hospedales. "[Actor-Critic Sequence Training for Image Captioning](https://arxiv.org/abs/1706.09601)." arXiv preprint arXiv:1706.09601 (2017).
 29 | - Wiseman, Sam, Stuart M. Shieber, and Alexander M. Rush. "[Challenges in Data-to-Document Generation](https://arxiv.org/abs/1707.08052)." arXiv preprint arXiv:1707.08052 (2017).
 30 | - Lebret, Rémi, David Grangier, and Michael Auli. "[Neural text generation from structured data with application to the biography domain](https://arxiv.org/abs/1603.07771)." arXiv preprint arXiv:1603.07771 (2016).
 31 | - Chisholm, Andrew, Will Radford, and Ben Hachey. "[Learning to generate one-sentence biographies from Wikidata](https://arxiv.org/abs/1702.06235)." arXiv preprint arXiv:1702.06235 (2017).
 32 | - Sha, Lei, Lili Mou, Tianyu Liu, Pascal Poupart, Sujian Li, Baobao Chang, and Zhifang Sui. "[Order-Planning Neural Text Generation From Structured Data](https://arxiv.org/abs/1709.00155)." arXiv preprint arXiv:1709.00155 (2017).
 33 | - Jiaxian Guo, Sidi Lu, Han Cai, Weinan Zhang, Yong Yu, Jun Wang. "[Long Text Generation via Adversarial Training with Leaked Information](https://arxiv.org/abs/1709.08624)." arXiv preprint  arXiv:1709.08624 (2017).
 34 | - Guu, Kelvin, Tatsunori B. Hashimoto, Yonatan Oren, and Percy Liang. "[Generating Sentences by Editing Prototypes](https://arxiv.org/abs/1709.08878)." arXiv preprint arXiv:1709.08878 (2017).
 35 | - Tianyu Liu, Kexiang Wang, Lei Sha, Baobao Chang, Zhifang Sui. "[Table-to-text Generation by Structure-aware Seq2seq Learnings](https://arxiv.org/abs/1711.09724)." arXiv preprint arXiv:1711.09724 (2017).
 36 | - Kahou, Samira Ebrahimi, Adam Atkinson, Vincent Michalski, Akos Kadar, Adam Trischler, and Yoshua Bengio. "[FigureQA: An Annotated Figure Dataset for Visual Reasoning](https://arxiv.org/abs/1710.07300)." arXiv preprint arXiv:1710.07300 (2017).
 37 | - Murakami, Soichiro, Akihiko Watanabe, Akira Miyazawa, Keiichi Goshima, Toshihiko Yanase, Hiroya Takamura, and Yusuke Miyao. "[Learning to Generate Market Comments from Stock Prices](http://www.aclweb.org/anthology/P17-1126)." In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), vol. 1, pp. 1374-1384. 2017.
 38 | - Mueller, Jonas, David Gifford, and Tommi Jaakkola. "[Sequence to better sequence: continuous revision of combinatorial structures](http://proceedings.mlr.press/v70/mueller17a.html)." In International Conference on Machine Learning, pp. 2536-2544. 2017.
 39 | - Peter J. Liu, Mohammad Saleh, Etienne Pot, Ben Goodrich, Ryan Sepassi, Lukasz Kaiser, Noam Shazeer. "[Generating Wikipedia by Summarizing Long Sequences](https://arxiv.org/abs/1801.10198)." ICLR 2018.
 40 | - Clark, Elizabeth, Anne Spencer Ross, Chenhao Tan, Yangfeng Ji, and Noah A. Smith. "[Creative Writing with a Machine in the Loop: Case Studies on Slogans and Stories](https://homes.cs.washington.edu/~ansross/papers/iui2018-creativewriting.pdf)." (2018).
 41 | - Gehrmann, Sebastian, Falcon Z. Dai, Henry Elder, and Alexander M. Rush. "[End-to-End Content and Plan Selection for Natural Language Generation](https://scholar.harvard.edu/files/gehrmann/files/e2e-harvardnlp.pdf)."
 42 | - Juncen Li, Robin Jia, He He, Percy Liang. "[Delete, Retrieve, Generate: A Simple Approach to Sentiment and Style Transfer](https://arxiv.org/abs/1804.06437)." arXiv:1804.06437 2018.
 43 | - Yi Liao, Lidong Bing, Piji Li, Shuming Shi, Wai Lam, Tong Zhang. "[Incorporating Pseudo-Parallel Data for Quantifiable Sequence Editing](https://arxiv.org/abs/1804.07007)." arXiv:1804.07007 2018.
 44 | - Xin Wang, Wenhu Chen, Yuan-Fang Wang, William Yang Wang. "[No Metrics Are Perfect: Adversarial Reward Learning for Visual Storytelling](https://arxiv.org/abs/1804.09160)." arXiv:1804.09160 2018.
 45 | - Sam Wiseman, Stuart M. Shieber, Alexander M. Rush. "[Learning Neural Templates for Text Generation](https://arxiv.org/abs/1808.10122)." arXiv:1808.10122 2018.
 47 | 
 48 | 
 49 | ## Text Summarization
 50 |   - Ryang, Seonggi, and Takeshi Abekawa. "[Framework of automatic text summarization using reinforcement learning](http://dl.acm.org/citation.cfm?id=2390980)." In Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, pp. 256-265. Association for Computational Linguistics, 2012. [not neural-based methods]
 51 |   - King, Ben, Rahul Jha, Tyler Johnson, Vaishnavi Sundararajan, and Clayton Scott. "[Experiments in Automatic Text Summarization Using Deep Neural Networks](http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.459.8775&rep=rep1&type=pdf)." Machine Learning (2011).
 52 |   - Liu, Yan, Sheng-hua Zhong, and Wenjie Li. "[Query-Oriented Multi-Document Summarization via Unsupervised Deep Learning](http://www.aaai.org/ocs/index.php/AAAI/AAAI12/paper/view/5058/5322)." AAAI. 2012.
 53 |   - Rioux, Cody, Sadid A. Hasan, and Yllias Chali. "[Fear the REAPER: A System for Automatic Multi-Document Summarization with Reinforcement Learning](http://emnlp2014.org/papers/pdf/EMNLP2014075.pdf)." In EMNLP, pp. 681-690. 2014.[not neural-based methods]
 54 |   - PadmaPriya, G., and K. Duraiswamy. "[An Approach For Text Summarization Using Deep Learning Algorithm](http://thescipub.com/PDF/jcssp.2014.1.9.pdf)." Journal of Computer Science 10, no. 1 (2013): 1-9.
 55 |   - Denil, Misha, Alban Demiraj, and Nando de Freitas. "[Extraction of Salient Sentences from Labelled Documents](http://arxiv.org/abs/1412.6815)." arXiv preprint arXiv:1412.6815 (2014).
 56 |   - Kågebäck, Mikael, et al. "[Extractive summarization using continuous vector space models](http://www.aclweb.org/anthology/W14-1504)." Proceedings of the 2nd Workshop on Continuous Vector Space Models and their Compositionality (CVSC)@ EACL. 2014.
 57 |   - Denil, Misha, Alban Demiraj, Nal Kalchbrenner, Phil Blunsom, and Nando de Freitas. "[Modelling, Visualising and Summarising Documents with a Single Convolutional Neural Network](http://arxiv.org/abs/1406.3830)." arXiv preprint arXiv:1406.3830 (2014).
 58 |   - Cao, Ziqiang, Furu Wei, Li Dong, Sujian Li, and Ming Zhou. "[Ranking with Recursive Neural Networks and Its Application to Multi-document Summarization](http://gana.nlsde.buaa.edu.cn/~lidong/aaai15-rec_sentence_ranking.pdf)." (AAAI'2015).
 59 |   - Fei Liu, Jeffrey Flanigan, Sam Thomson, Norman Sadeh, and Noah A. Smith. "[Toward Abstractive Summarization Using Semantic Representations](http://www.cs.cmu.edu/~nasmith/papers/liu+flanigan+thomson+sadeh+smith.naacl15.pdf)." NAACL 2015
 60 |   - Wenpeng Yin, Yulong Pei. "Optimizing Sentence Modeling and Selection for Document Summarization." IJCAI 2015
 61 |   - He, Zhanying, Chun Chen, Jiajun Bu, Can Wang, Lijun Zhang, Deng Cai, and Xiaofei He. "[Document Summarization Based on Data Reconstruction](http://cs.nju.edu.cn/zlj/pdf/AAAI-2012-He.pdf)." In AAAI. 2012.
 62 |   - Liu, He, Hongliang Yu, and Zhi-Hong Deng. "[Multi-Document Summarization Based on Two-Level Sparse Representation Model](http://www.cis.pku.edu.cn/faculty/system/dengzhihong/papers/AAAI%202015_Multi-Document%20Summarization%20Based%20on%20Two-Level%20Sparse%20Representation%20Model.pdf)." In Twenty-Ninth AAAI Conference on Artificial Intelligence. 2015.
 63 |   - Jin-ge Yao, Xiaojun Wan, Jianguo Xiao. "[Compressive Document Summarization via Sparse Optimization](http://ijcai.org/Proceedings/15/Papers/198.pdf)." IJCAI 2015
 64 |   - Piji Li, Lidong Bing, Wai Lam, Hang Li, and Yi Liao. "[Reader-Aware Multi-Document Summarization via Sparse Coding](http://arxiv.org/abs/1504.07324)." IJCAI 2015.
 65 |   - Lopyrev, Konstantin. "[Generating News Headlines with Recurrent Neural Networks](http://arxiv.org/abs/1512.01712)." arXiv preprint arXiv:1512.01712 (2015). [The first paragraph as document.]
 66 |   - Alexander M. Rush, Sumit Chopra, Jason Weston. "[A Neural Attention Model for Abstractive Sentence Summarization](http://arxiv.org/abs/1509.00685)." EMNLP 2015. [sentence compression]
 67 |   - Hu, Baotian, Qingcai Chen, and Fangze Zhu. "[LCSTS: a large scale chinese short text summarization dataset](http://arxiv.org/abs/1506.05865)." arXiv preprint arXiv:1506.05865 (2015).
 68 |   - Gulcehre, Caglar, Sungjin Ahn, Ramesh Nallapati, Bowen Zhou, and Yoshua Bengio. "[Pointing the Unknown Words](http://arxiv.org/abs/1603.08148)." arXiv preprint arXiv:1603.08148 (2016).
 69 |   - Nallapati, Ramesh, Bing Xiang, and Bowen Zhou. "[Abstractive Text Summarization Using Sequence-to-Sequence RNNs and Beyond](http://arxiv.org/abs/1602.06023)." arXiv preprint arXiv:1602.06023 (2016). [sentence compression]
 70 |   - Sumit Chopra, Alexander M. Rush and Michael Auli. "[Abstractive Sentence Summarization with Attentive Recurrent Neural Networks](http://harvardnlp.github.io/papers/naacl16_summary.pdf)" NAACL 2016.
 71 |   - Jiatao Gu, Zhengdong Lu, Hang Li, Victor O.K. Li. "[Incorporating Copying Mechanism in Sequence-to-Sequence Learning](http://arxiv.org/abs/1603.06393)." ACL. (2016)
 72 |   - Jianpeng Cheng, Mirella Lapata. "[Neural Summarization by Extracting Sentences and Words](http://arxiv.org/abs/1603.07252)". ACL. (2016)
 73 |   - Zhang, Jianmin, Jin-ge Yao, and Xiaojun Wan. "[Toward constructing sports news from live text commentary](http://www.icst.pku.edu.cn/lcwm/wanxj/files/acl16_sports.pdf)." In Proceedings of ACL. 2016.
 74 |   - Ziqiang Cao, Wenjie Li, Sujian Li, Furu Wei. "[AttSum: Joint Learning of Focusing and Summarization with Neural Attention](http://arxiv.org/abs/1604.00125)".  arXiv:1604.00125 (2016)
 75 |   - Ayana, Shiqi Shen, Zhiyuan Liu, Maosong Sun. "[Neural Headline Generation with Sentence-wise Optimization](http://arxiv.org/abs/1604.01904)". arXiv:1604.01904 (2016)
 76 |   - Kikuchi, Yuta, Graham Neubig, Ryohei Sasano, Hiroya Takamura, and Manabu Okumura. "[Controlling Output Length in Neural Encoder-Decoders](https://arxiv.org/abs/1609.09552)." arXiv preprint arXiv:1609.09552 (2016).
 77 |   - Qian Chen, Xiaodan Zhu, Zhenhua Ling, Si Wei and Hui Jiang. "[Distraction-Based Neural Networks for Document Summarization](https://arxiv.org/abs/1610.08462)." IJCAI 2016.
 78 |   - Wang, Lu, and Wang Ling. "[Neural Network-Based Abstract Generation for Opinions and Arguments](http://www.ccs.neu.edu/home/luwang/papers/NAACL2016.pdf)." NAACL 2016.
 79 |   - Yishu Miao, Phil Blunsom. "[Language as a Latent Variable: Discrete Generative Models for Sentence Compression](http://arxiv.org/abs/1609.07317)." EMNLP 2016.
 80 |   - Takase, Sho, Jun Suzuki, Naoaki Okazaki, Tsutomu Hirao, and Masaaki Nagata. "[Neural headline generation on abstract meaning representation](https://www.aclweb.org/anthology/D/D16/D16-1112.pdf)." EMNLP, pp. 1054-1059. 2016.
 81 |   - Hongya Song, Zhaochun Ren, Piji Li, Shangsong Liang, Jun Ma, and Maarten de Rijke. [Summarizing Answers in Non-Factoid Community Question-Answering](http://dl.acm.org/citation.cfm?id=3018704). In WSDM 2017: The 10th International Conference on Web Search and Data Mining, 2017.
 82 |   - Wenyuan Zeng, Wenjie Luo, Sanja Fidler, Raquel Urtasun. "[Efficient Summarization with Read-Again and Copy Mechanism](https://arxiv.org/abs/1611.03382)." arXiv preprint arXiv:1611.03382 (2016).
 83 |   - Piji Li, Zihao Wang, Wai Lam, Zhaochun Ren, Lidong Bing. "[Salience Estimation via Variational Auto-Encoders for Multi-Document Summarization](https://aaai.org/ocs/index.php/AAAI/AAAI17/paper/view/14613)". In AAAI, 2017.
 84 |   - Ramesh Nallapati, Feifei Zhai, Bowen Zhou. [SummaRuNNer: A Recurrent Neural Network based Sequence Model for Extractive Summarization of Documents](https://arxiv.org/abs/1611.04230). In AAAI, 2017.
 85 |   - Ramesh Nallapati, Bowen Zhou, Mingbo Ma. "[Classify or Select: Neural Architectures for Extractive Document Summarization](https://arxiv.org/abs/1611.04244)." arXiv preprint arXiv:1611.04244 (2016).
 86 |   - Suzuki, Jun, and Masaaki Nagata. "[Cutting-off Redundant Repeating Generations for Neural Abstractive Summarization](http://www.aclweb.org/anthology/E17-2047)." EACL 2017 (2017): 291.
 87 |   - Jiwei Tan and Xiaojun Wan. "Abstractive Document Summarization with a Graph-Based Attentional Neural Model." ACL, 2017.
 88 |   - Preksha Nema, Mitesh M. Khapra, Balaraman Ravindran and Anirban Laha. "Diversity driven attention model for query-based abstractive summarization." ACL, 2017.
 89 |   - Abigail See, Peter J. Liu and Christopher D. Manning. [Get To The Point: Summarization with Pointer-Generator Networks](https://arxiv.org/abs/1704.04368). ACL, 2017.
 90 |   - Qingyu Zhou, Nan Yang, Furu Wei and Ming Zhou. [Selective Encoding for Abstractive Sentence Summarization](https://arxiv.org/abs/1704.07073). ACL, 2017
 91 |   - Maxime Peyrard and Judith Eckle-Kohler. "Supervised Learning of Automatic Pyramid for Optimization-Based Multi-Document Summarization." ACL, 2017.
 92 |   - Shashi Narayan, Nikos Papasarantopoulos, Mirella Lapata, Shay B. Cohen. "[Neural Extractive Summarization with Side Information](https://arxiv.org/abs/1704.04530)." arXiv preprint arXiv:1704.04530 (2017).
 93 |   - Romain Paulus, Caiming Xiong, Richard Socher. "[A Deep Reinforced Model for Abstractive Summarization](https://metamind.io/static/pdf/deep-reinforced-model-arxiv-v1.pdf)." (2017).
 94 |   - Shibhansh Dohare, Harish Karnick. "[Text Summarization using Abstract Meaning Representation](https://arxiv.org/abs/1706.01678)." arXiv:1706.01678 (2017).
 95 |   - Michihiro Yasunaga, Rui Zhang, Kshitijh Meelu, Ayush Pareek, Krishnan Srinivasan, Dragomir Radev. "[Graph-based Neural Multi-Document Summarization](https://arxiv.org/abs/1706.06681)." arXiv:1706.06681 (2017).
 96 |   - Piji Li, Wai Lam, Lidong Bing, and Zihao Wang. [Deep Recurrent Generative Decoder for Abstractive Text Summarization](http://lipiji.com/). Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP'17). Sep 2017. 
 97 |   - Piji Li, Wai Lam, Lidong Bing, Weiwei Guo, and Hang Li. [Cascaded Attention based Unsupervised Information Distillation for Compressive Summarization](http://lipiji.com/). Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP'17). Sep 2017.
 98 |   - Piji Li, Lidong Bing, Wai Lam. [Reader-Aware Multi-Document Summarization: An Enhanced Model and The First Dataset](http://www1.se.cuhk.edu.hk/~textmine/dataset/ra-mds/). Proceedings of the EMNLP 2017 Workshop on New Frontiers in Summarization (EMNLP-NewSum'17). Sep 2017.
 99 |   - Tan, Jiwei, Xiaojun Wan, and Jianguo Xiao. "[From Neural Sentence Summarization to Headline Generation: A Coarse-to-Fine Approach](http://static.ijcai.org/proceedings-2017/0574.pdf)." IJCAI 2017.
100 |   - Ling, Jeffrey, and Alexander M. Rush. "[Coarse-to-Fine Attention Models for Document Summarization](http://www.aclweb.org/anthology/W/W17/W17-4505.pdf)." EMNLP 2017 (2017): 33.
101 |   - Ziqiang Cao, Furu Wei, Wenjie Li, Sujian Li. "[Faithful to the Original: Fact Aware Neural Abstractive Summarization](https://arxiv.org/abs/1711.04434)." arXiv:1711.04434 (2017).
102 |   - Angela Fan, David Grangier, Michael Auli. "[Controllable Abstractive Summarization](https://arxiv.org/abs/1711.05217)." arXiv:1711.05217 (2017).
103 |   - Liu, Linqing, Yao Lu, Min Yang, Qiang Qu, Jia Zhu, and Hongyan Li. "[Generative Adversarial Network for Abstractive Text Summarization](https://arxiv.org/pdf/1711.09357.pdf)." arXiv preprint arXiv:1711.09357 (2017).
104 |   - Narayan, Shashi, Shay B. Cohen, and Mirella Lapata. "[Ranking Sentences for Extractive Summarization with Reinforcement Learning](https://arxiv.org/abs/1802.08636)." arXiv preprint arXiv:1802.08636 (2018).
105 |   - Asli Celikyilmaz, Antoine Bosselut, Xiaodong He, Yejin Choi. "[Deep Communicating Agents for Abstractive Summarization](https://arxiv.org/abs/1803.10357)." NAACL (2018).
106 |   - Chen, Wenhu, Guanlin Li, Shuo Ren, Shujie Liu, Zhirui Zhang, Mu Li, and Ming Zhou. "[Generative Bridging Network in Neural Sequence Prediction](https://arxiv.org/abs/1706.09152)." NAACL (2018).
107 |   - Li, Piji, Lidong Bing, and Wai Lam. "[Actor-Critic based Training Framework for Abstractive Summarization](https://arxiv.org/abs/1803.11070)." arXiv preprint arXiv:1803.11070 (2018).
108 |   - Arman Cohan, Franck Dernoncourt, Doo Soon Kim, Trung Bui, Seokhwan Kim, Walter Chang, Nazli Goharian. "[A Discourse-Aware Attention Model for Abstractive Summarization of Long Documents](https://arxiv.org/abs/1804.05685)." NAACL, 2018.
110 |   - Yuxiang Wu, Baotian Hu. "[Learning to Extract Coherent Summary via Deep Reinforcement Learning](https://arxiv.org/abs/1804.07036)." AAAI (2018).
111 |   - Jianmin Zhang, Jiwei Tan, Xiaojun Wan. "[Towards a Neural Network Approach to Abstractive Multi-Document Summarization](https://arxiv.org/abs/1804.09010)." arXiv:1804.09010  (2018).
112 |   - Li Wang, Junlin Yao, Yunzhe Tao, Li Zhong, Wei Liu, Qiang Du. "[A Reinforced Topic-Aware Convolutional Sequence-to-Sequence Model for Abstractive Text Summarization](https://arxiv.org/abs/1805.03616)." IJCAI-ECAI  (2018).
113 |   - Yen-Chun Chen, Mohit Bansal. "[Fast Abstractive Summarization with Reinforce-Selected Sentence Rewriting](https://arxiv.org/abs/1805.11080)." arXiv:1805.11080 (2018).
115 |   - Song, Kaiqiang, Lin Zhao, and Fei Liu. "[Structure-Infused Copy Mechanisms for Abstractive Summarization](http://www.cs.ucf.edu/~feiliu/papers/COLING2018_StructSumm.pdf)." COLING, 2018.
116 |   - Keneshloo, Yaser, Tian Shi, Chandan K. Reddy, and Naren Ramakrishnan. "[Deep Reinforcement Learning For Sequence to Sequence Models](https://arxiv.org/abs/1805.09461)." arXiv preprint arXiv:1805.09461 (2018).
117 |   - Qingyu Zhou, Nan Yang, Furu Wei, Ming Zhou. "[Sequential Copying Networks](https://arxiv.org/abs/1807.02301)." AAAI (2018).
118 |   - Qingyu Zhou, Nan Yang, Furu Wei, Shaohan Huang, Ming Zhou, Tiejun Zhao. "[Neural Document Summarization by Jointly Learning to Score and Select Sentences](https://arxiv.org/abs/1807.02305)." ACL (2018).
119 |   - Lin, Junyang, Xu Sun, Shuming Ma, and Qi Su. "[Global Encoding for Abstractive Summarization](https://arxiv.org/abs/1805.03989)." arXiv preprint arXiv:1805.03989 (2018).
120 |   - Khatri, Chandra, Gyanit Singh, and Nish Parikh. "[Abstractive and Extractive Text Summarization using Document Context Vector and Recurrent Neural Networks](https://arxiv.org/abs/1807.08000)." arXiv preprint arXiv:1807.08000 (2018).
121 |   - Hsu, Wan-Ting, Chieh-Kai Lin, Ming-Ying Lee, Kerui Min, Jing Tang, and Min Sun. "[A Unified Model for Extractive and Abstractive Summarization using Inconsistency Loss](https://arxiv.org/abs/1805.06266)." arXiv preprint arXiv:1805.06266 (2018).
122 |   - Sun, Fei, Peng Jiang, Hanxiao Sun, Changhua Pei, Wenwu Ou, and Xiaobo Wang. "[Multi-Source Pointer Network for Product Title Summarization](https://arxiv.org/abs/1808.06885)." arXiv preprint arXiv:1808.06885 (2018).
123 |   - Wojciech Kryściński, Romain Paulus, Caiming Xiong, Richard Socher. "[Improving Abstraction in Text Summarization](https://arxiv.org/abs/1808.07913)." arXiv preprint arXiv:1808.07913 (2018).
125 |   - Zhang, Xingxing, Mirella Lapata, Furu Wei, and Ming Zhou. "[Neural Latent Extractive Document Summarization](https://arxiv.org/abs/1808.07187)." arXiv preprint arXiv:1808.07187 (2018).
126 |   - Sebastian Gehrmann, Yuntian Deng, Alexander M. Rush. "[Bottom-Up Abstractive Summarization](https://arxiv.org/abs/1808.10792)." arXiv preprint arXiv:1808.10792 (2018).
127 |   - Yichen Jiang, Mohit Bansal. "[Closed-Book Training to Improve Summarization Encoder Memory](https://arxiv.org/abs/1809.04585)." arXiv preprint arXiv:1809.04585 (2018).
128 |   - Kamal Al-Sabahi, Zhang Zuping, Yang Kang. "[Bidirectional Attentional Encoder-Decoder Model and Bidirectional Beam Search for Abstractive Summarization](https://arxiv.org/abs/1809.06662)." arXiv preprint arXiv:1809.06662 (2018).
129 |   - Raphael Schumann. "[Unsupervised Abstractive Sentence Summarization using Length Controlled Variational Autoencoder](https://arxiv.org/abs/1809.05233)." arXiv preprint arXiv:1809.05233 (2018).
130 |   - Krishna, Kundan, and Balaji Vasan Srinivasan. "[Generating Topic-Oriented Summaries Using Neural Attention](http://www.aclweb.org/anthology/N18-1153)." NAACL 2018.
131 |   - Lisa Fan, Dong Yu, Lu Wang. "[Robust Neural Abstractive Summarization Systems and Evaluation against Adversarial Information](https://arxiv.org/abs/1810.06065)." arXiv preprint arXiv:1810.06065 (2018).
132 |   - Eric Chu, Peter J. Liu. "[Unsupervised Neural Multi-document Abstractive Summarization](https://arxiv.org/abs/1810.05739)." arXiv preprint arXiv:1810.05739 (2018).
133 |   - Yaser Keneshloo, Naren Ramakrishnan, Chandan K. Reddy. "[Deep Transfer Reinforcement Learning for Text Summarization](https://arxiv.org/abs/1810.06667)." arXiv preprint arXiv:1810.06667 (2018).
134 |   - Mahnaz Koupaee, William Yang Wang. "[WikiHow: A Large Scale Text Summarization Dataset](https://arxiv.org/abs/1810.09305)." arXiv preprint arXiv:1810.09305 (2018).
136 |   - Li Dong, Nan Yang, Wenhui Wang, Furu Wei, Xiaodong Liu, Yu Wang, Jianfeng Gao, Ming Zhou, Hsiao-Wuen Hon. "[Unified Language Model Pre-training for Natural Language Understanding and Generation](https://arxiv.org/abs/1905.03197)." arXiv preprint arXiv:1905.03197 (2019).
137 |  
138 | ### Opinion Summarization
139 |   - Wu, Haibing, Yiwei Gu, Shangdi Sun, and Xiaodong Gu. "[Aspect-based Opinion Summarization with Convolutional Neural Networks](http://arxiv.org/abs/1511.09128)." arXiv preprint arXiv:1511.09128 (2015).
140 |   - Irsoy, Ozan, and Claire Cardie. "[Opinion Mining with Deep Recurrent Neural Networks](http://anthology.aclweb.org/D/D14/D14-1080.pdf)." In EMNLP, pp. 720-728. 2014.
141 |   - Piji Li, Zihao Wang, Zhaochun Ren, Lidong Bing, Wai Lam. "[Neural Rating Regression with Abstractive Tips Generation for Recommendation](https://arxiv.org/abs/1708.00154)." In SIGIR, 2017.
142 |   
143 | ### Video Summarization
144 |   - Zhou, Kaiyang, and Yu Qiao. "[Deep Reinforcement Learning for Unsupervised Video Summarization with Diversity-Representativeness Reward](https://arxiv.org/abs/1801.00054)." arXiv preprint arXiv:1801.00054 (2017). 
145 |   - Mahasseni, Behrooz, Michael Lam, and Sinisa Todorovic. "[Unsupervised video summarization with adversarial lstm networks](http://openaccess.thecvf.com/content_cvpr_2017/papers/Mahasseni_Unsupervised_Video_Summarization_CVPR_2017_paper.pdf)." In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 2017.
146 | 
147 | ### Reading Comprehension
148 |  - Hermann, Karl Moritz, Tomas Kocisky, Edward Grefenstette, Lasse Espeholt, Will Kay, Mustafa Suleyman, and Phil Blunsom. "[Teaching machines to read and comprehend](http://papers.nips.cc/paper/5945-teaching-machines-to-read-and-comprehend)." In Advances in Neural Information Processing Systems, pp. 1693-1701. 2015.
149 |  - Hill, Felix, Antoine Bordes, Sumit Chopra, and Jason Weston. "[The Goldilocks Principle: Reading Children's Books with Explicit Memory Representations](http://arxiv.org/abs/1511.02301)." arXiv preprint arXiv:1511.02301 (2015).
150 |  - Kadlec, Rudolf, Martin Schmid, Ondrej Bajgar, and Jan Kleindienst. "[Text Understanding with the Attention Sum Reader Network](http://arxiv.org/abs/1603.01547)." arXiv preprint arXiv:1603.01547 (2016).
151 |  - Chen, Danqi, Jason Bolton, and Christopher D. Manning. "[A thorough examination of the cnn/daily mail reading comprehension task](http://arxiv.org/abs/1606.02858)." arXiv preprint arXiv:1606.02858 (2016).
152 |  - Dhingra, Bhuwan, Hanxiao Liu, William W. Cohen, and Ruslan Salakhutdinov. "[Gated-Attention Readers for Text Comprehension](http://arxiv.org/abs/1606.01549)." arXiv preprint arXiv:1606.01549 (2016).
153 |  - Sordoni, Alessandro, Phillip Bachman, and Yoshua Bengio. "[Iterative Alternating Neural Attention for Machine Reading](http://arxiv.org/abs/1606.02245)." arXiv preprint arXiv:1606.02245 (2016).
154 |  - Trischler, Adam, Zheng Ye, Xingdi Yuan, and Kaheer Suleman. "[Natural Language Comprehension with the EpiReader](http://arxiv.org/abs/1606.02270)." arXiv preprint arXiv:1606.02270 (2016).
155 |  - Yiming Cui, Zhipeng Chen, Si Wei, Shijin Wang, Ting Liu, Guoping Hu. "[Attention-over-Attention Neural Networks for Reading Comprehension](http://arxiv.org/abs/1607.04423)." arXiv preprint arXiv:1607.04423 (2016).
156 |  - Yiming Cui, Ting Liu, Zhipeng Chen, Shijin Wang, Guoping Hu. "[Consensus Attention-based Neural Networks for Chinese Reading Comprehension](https://arxiv.org/abs/1607.02250)." arXiv preprint arXiv:1607.02250 (2016).
157 |  - Daniel Hewlett, Alexandre Lacoste, Llion Jones, Illia Polosukhin, Andrew Fandrianto, Jay Han, Matthew Kelcey and David Berthelot. "[WIKIREADING: A Novel Large-scale Language Understanding Task over Wikipedia](http://www.aclweb.org/anthology/P/P16/P16-1145.pdf)." ACL (2016). pp. 1535-1545.
158 |   - Minghao Hu, Yuxing Peng, Xipeng Qiu. "[Mnemonic Reader for Machine Comprehension](https://arxiv.org/abs/1705.02798)." arXiv:1705.02798 (2017).
159 |   - Wenhui Wang, Nan Yang, Furu Wei, Baobao Chang and Ming Zhou. "[R-NET: Machine Reading Comprehension with Self-matching Networks](https://www.microsoft.com/en-us/research/publication/mcr/)." ACL (2017).
160 |   
161 | 
162 | ### Sentence Modelling
163 |   - Kalchbrenner, Nal, Edward Grefenstette, and Phil Blunsom. "[A convolutional neural network for modelling sentences](http://arxiv.org/abs/1404.2188)." arXiv preprint arXiv:1404.2188 (2014).
164 |   - Kim, Yoon. "[Convolutional neural networks for sentence classification](http://arxiv.org/abs/1408.5882)." arXiv preprint arXiv:1408.5882 (2014).
165 |   - Le, Quoc V., and Tomas Mikolov. "[Distributed representations of sentences and documents](http://arxiv.org/abs/1405.4053)." arXiv preprint arXiv:1405.4053 (2014).
166 |   - Yang, Zichao, Diyi Yang, Chris Dyer, Xiaodong He, Alex Smola, and Eduard Hovy. "[Hierarchical Attention Networks for Document Classification](http://www.cs.cmu.edu/~diyiy/docs/naacl16.pdf)." In Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. 2016.
167 | 
168 | ### Reasoning
169 |   - Peng, Baolin, Zhengdong Lu, Hang Li, and Kam-Fai Wong. "[Towards Neural Network-based Reasoning](http://arxiv.org/abs/1508.05508)." arXiv preprint arXiv:1508.05508 (2015).
170 |   
171 | ### Knowledge Engine
172 |  - Bordes, Antoine, Nicolas Usunier, Alberto Garcia-Duran, Jason Weston, and Oksana Yakhnenko. "[Translating embeddings for modeling multi-relational data](http://papers.nips.cc/paper/5071-translating-embeddings-for-modeling-multi-relational-data)." In Advances in Neural Information Processing Systems, pp. 2787-2795. 2013. TransE
173 |  - Lin, Yankai, Shiqi Shen, Zhiyuan Liu, Huanbo Luan, and Maosong Sun. "[Neural Relation Extraction with Selective Attention over Instances](http://nlp.csai.tsinghua.edu.cn/~lzy/publications/acl2016_nre.pdf)." ACL (2016)
174 |  - TransXXX
175 | 
176 | ### Memory Networks
177 |  - Graves, Alex, Greg Wayne, and Ivo Danihelka. "[Neural turing machines](http://arxiv.org/abs/1410.5401)." arXiv preprint arXiv:1410.5401 (2014).
178 |  - Weston, Jason, Sumit Chopra, and Antoine Bordes. "[Memory networks](http://arxiv.org/abs/1410.3916)." ICLR (2015).
179 |  - Sukhbaatar, Sainbayar, Jason Weston, and Rob Fergus. "[End-to-end memory networks](http://papers.nips.cc/paper/5846-end-to-end-memory-networks)." In Advances in neural information processing systems, pp. 2440-2448. 2015.
180 |  - Weston, Jason, Antoine Bordes, Sumit Chopra, Alexander M. Rush, Bart van Merriënboer, Armand Joulin, and Tomas Mikolov. "[Towards ai-complete question answering: A set of prerequisite toy tasks](http://arxiv.org/abs/1502.05698)." arXiv preprint arXiv:1502.05698 (2015).
181 |  - Bordes, Antoine, Nicolas Usunier, Sumit Chopra, and Jason Weston. "[Large-scale simple question answering with memory networks](http://arxiv.org/abs/1506.02075)." arXiv preprint arXiv:1506.02075 (2015).
182 |  - Kumar, Ankit, Ozan Irsoy, Jonathan Su, James Bradbury, Robert English, Brian Pierce, Peter Ondruska, Ishaan Gulrajani, and Richard Socher. "[Ask me anything: Dynamic memory networks for natural language processing](http://arxiv.org/abs/1506.07285)." arXiv preprint arXiv:1506.07285 (2015).
183 |  - Dodge, Jesse, Andreea Gane, Xiang Zhang, Antoine Bordes, Sumit Chopra, Alexander Miller, Arthur Szlam, and Jason Weston. "[Evaluating prerequisite qualities for learning end-to-end dialog systems](http://arxiv.org/abs/1511.06931)." arXiv preprint arXiv:1511.06931 (2015).
184 |  - Hill, Felix, Antoine Bordes, Sumit Chopra, and Jason Weston. "[The Goldilocks Principle: Reading Children's Books with Explicit Memory Representations](http://arxiv.org/abs/1511.02301)." arXiv preprint arXiv:1511.02301 (2015).
185 |  - Weston, Jason. "[Dialog-based Language Learning](http://arxiv.org/abs/1604.06045)." arXiv preprint arXiv:1604.06045 (2016).
186 |  - Bordes, Antoine, and Jason Weston. "[Learning End-to-End Goal-Oriented Dialog](http://arxiv.org/abs/1605.07683)." arXiv preprint arXiv:1605.07683 (2016).
187 |  - Chandar, Sarath, Sungjin Ahn, Hugo Larochelle, Pascal Vincent, Gerald Tesauro, and Yoshua Bengio. "[Hierarchical Memory Networks](https://arxiv.org/abs/1605.07427)." arXiv preprint arXiv:1605.07427 (2016).
188 |  - Jason Weston. "[Memory Networks for Language Understanding](http://www.thespermwhale.com/jaseweston/icml2016/)." ICML Tutorial 2016.
189 |  - Tang, Yaohua, Fandong Meng, Zhengdong Lu, Hang Li, and Philip LH Yu. "[Neural Machine Translation with External Phrase Memory](http://arxiv.org/abs/1606.01792)." arXiv preprint arXiv:1606.01792 (2016).
190 |  - Wang, Mingxuan, Zhengdong Lu, Hang Li, and Qun Liu. "[Memory-enhanced Decoder for Neural Machine Translation](http://arxiv.org/abs/1606.02003)." arXiv preprint arXiv:1606.02003 (2016).
191 |  - Xiong, Caiming, Stephen Merity, and Richard Socher. "[Dynamic memory networks for visual and textual question answering](https://arxiv.org/abs/1603.01417)." arXiv preprint arXiv:1603.01417 (2016).
192 | 
193 | ### Neural Structures
194 |  - Srivastava, Rupesh Kumar, Klaus Greff, and Jürgen Schmidhuber. "[Highway networks](http://arxiv.org/abs/1505.00387)." arXiv preprint arXiv:1505.00387 (2015).
195 |  - Srivastava, Rupesh K., Klaus Greff, and Jürgen Schmidhuber. "[Training very deep networks](http://arxiv.org/abs/1507.06228)." In Advances in Neural Information Processing Systems, pp. 2368-2376. 2015.
196 |  - Vinyals, Oriol, Meire Fortunato, and Navdeep Jaitly. "[Pointer networks](https://arxiv.org/abs/1506.03134)." In Advances in Neural Information Processing Systems, pp. 2692-2700. 2015.
197 |  - Rasmus, Antti, Mathias Berglund, Mikko Honkala, Harri Valpola, and Tapani Raiko. "[Semi-supervised learning with ladder networks](http://arxiv.org/abs/1507.02672)." In Advances in Neural Information Processing Systems, pp. 3546-3554. 2015.
198 |  - Bengio, Samy, Oriol Vinyals, Navdeep Jaitly, and Noam Shazeer. "[Scheduled sampling for sequence prediction with recurrent neural networks](https://arxiv.org/abs/1506.03099)." In Advances in Neural Information Processing Systems, pp. 1171-1179. 2015.
199 |  - He, Kaiming, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. "[Deep Residual Learning for Image Recognition](http://arxiv.org/abs/1512.03385)." arXiv preprint arXiv:1512.03385 (2015).
200 |  - He, Kaiming. "[Tutorial: Deep Residual Networks: Deep Learning Gets Way Deeper](http://icml.cc/2016/tutorials/icml2016_tutorial_deep_residual_networks_kaiminghe.pdf)." ICML 2016 tutorial.
201 |  - Courbariaux, Matthieu, and Yoshua Bengio. "[Binarynet: Training deep neural networks with weights and activations constrained to +1 or -1](http://arxiv.org/abs/1602.02830)." arXiv preprint arXiv:1602.02830 (2016).
202 |  - Jiatao Gu, Zhengdong Lu, Hang Li, Victor O.K. Li. "[Incorporating Copying Mechanism in Sequence-to-Sequence Learning](http://arxiv.org/abs/1603.06393)." ACL (2016)
203 |  - Gulcehre, Caglar, Sungjin Ahn, Ramesh Nallapati, Bowen Zhou, and Yoshua Bengio. "[Pointing the Unknown Words](http://arxiv.org/abs/1603.08148)." arXiv preprint arXiv:1603.08148 (2016).
204 |  - Andreas, Jacob, Marcus Rohrbach, Trevor Darrell, and Dan Klein. "[Learning to compose neural networks for question answering](http://arxiv.org/abs/1601.01705)." NAACL 2016.
205 |  - Julian Georg Zilly, Rupesh Kumar Srivastava, Jan Koutník, Jürgen Schmidhuber. "[Recurrent Highway Networks](http://arxiv.org/abs/1607.03474)." arXiv preprint  arXiv:1607.03474 (2016).
206 |  - Zhilin Yang, Ye Yuan, Yuexin Wu, Ruslan Salakhutdinov, William W. Cohen. "[Review Networks for Caption Generation](https://arxiv.org/abs/1605.07912)." arXiv preprint  arXiv:1605.07912 (2016).
207 |  - Xiang Li, Tao Qin, Jian Yang, Tie-Yan Liu. "[LightRNN: Memory and Computation-Efficient Recurrent Neural Networks](https://arxiv.org/abs/1610.09893)." arXiv preprint  arXiv:1610.09893 (2016).
208 |  - Zhaopeng Tu, Yang Liu, Lifeng Shang, Xiaohua Liu, Hang Li. "[Neural Machine Translation with Reconstruction](https://arxiv.org/abs/1611.01874)." arXiv preprint  arXiv:1611.01874 (2016).
209 |  - Yingce Xia, Di He, Tao Qin, Liwei Wang, Nenghai Yu, Tie-Yan Liu, Wei-Ying Ma. "[Dual Learning for Machine Translation](https://arxiv.org/abs/1611.00179)." arXiv preprint  arXiv:1611.00179 (2016).
210 |  - Bahdanau, Dzmitry, Philemon Brakel, Kelvin Xu, Anirudh Goyal, Ryan Lowe, Joelle Pineau, Aaron Courville, and Yoshua Bengio. "[An actor-critic algorithm for sequence prediction](https://arxiv.org/abs/1607.07086)." arXiv preprint arXiv:1607.07086 (2016).
211 |  - Kannan, Anjuli, and Oriol Vinyals. "[Adversarial evaluation of dialogue models](https://arxiv.org/abs/1701.08198)." arXiv preprint arXiv:1701.08198 (2017).
212 |  - Kawthekar, Prasad, Raunaq Rewari, and Suvrat Bhooshan. "[Evaluating Generative Models for Text Generation](https://web.stanford.edu/class/cs224n/reports/2737434.pdf)."
213 |  - Li, Jiwei, Will Monroe, Tianlin Shi, Alan Ritter, and Dan Jurafsky. "[Adversarial Learning for Neural Dialogue Generation](https://arxiv.org/abs/1701.06547)." arXiv preprint arXiv:1701.06547 (2017).
214 |  - Yang, Zhen, Wei Chen, Feng Wang, and Bo Xu. "[Improving Neural Machine Translation with Conditional Sequence Generative Adversarial Nets](https://arxiv.org/abs/1703.04887)." arXiv preprint arXiv:1703.04887 (2017).
215 |  - Lijun Wu, Yingce Xia, Li Zhao, Fei Tian, Tao Qin, Jianhuang Lai, Tie-Yan Liu. "[Adversarial Neural Machine Translation](https://arxiv.org/abs/1704.06933)." IJCAI (2017).
216 |  - Liu, Pengfei, Xipeng Qiu, and Xuanjing Huang. "[Adversarial Multi-task Learning for Text Classification](https://arxiv.org/abs/1704.05742)." arXiv preprint arXiv:1704.05742 (2017).
217 |  - Jonas Gehring, Michael Auli, David Grangier, Denis Yarats, Yann N. Dauphin. "[Convolutional Sequence to Sequence Learning](https://arxiv.org/abs/1705.03122)." arXiv:1705.03122 (2017).
218 |  - Lamb, Alex M., Anirudh Goyal, Ying Zhang, Saizheng Zhang, Aaron C. Courville, and Yoshua Bengio. "[Professor forcing: A new algorithm for training recurrent networks](https://arxiv.org/abs/1610.09038)." In Advances In Neural Information Processing Systems, pp. 4601-4609. 2016.
219 |  - Rezende, Danilo Jimenez, Shakir Mohamed, and Daan Wierstra. "[Stochastic backpropagation and approximate inference in deep generative models](http://arxiv.org/abs/1401.4082)." arXiv preprint arXiv:1401.4082 (2014).
220 |  - Kingma, Diederik P., and Max Welling. "[Auto-encoding variational bayes](http://arxiv.org/abs/1312.6114)." arXiv preprint arXiv:1312.6114 (2013).
221 |  - Fabius, Otto, and Joost R. van Amersfoort. "[Variational recurrent auto-encoders](https://arxiv.org/abs/1412.6581)." arXiv preprint arXiv:1412.6581 (2014).
222 |  - Bayer, Justin, and Christian Osendorfer. "[Learning stochastic recurrent networks](http://arxiv.org/abs/1411.7610)." arXiv preprint arXiv:1411.7610 (2014).
223 |  - Bowman, Samuel R., Luke Vilnis, Oriol Vinyals, Andrew M. Dai, Rafal Jozefowicz, and Samy Bengio. "[Generating sentences from a continuous space](https://arxiv.org/abs/1511.06349)." arXiv preprint arXiv:1511.06349 (2015).
224 |  - Gregor, Karol, Ivo Danihelka, Alex Graves, Danilo Jimenez Rezende, and Daan Wierstra. "[DRAW: A recurrent neural network for image generation](http://arxiv.org/abs/1502.04623)." arXiv preprint arXiv:1502.04623 (2015).
225 |  - Makhzani, Alireza, Jonathon Shlens, Navdeep Jaitly, and Ian Goodfellow. "[Adversarial autoencoders](http://arxiv.org/abs/1511.05644)." arXiv preprint arXiv:1511.05644 (2015).
226 |  - Johnson, Matthew J., David Duvenaud, Alexander B. Wiltschko, Sandeep R. Datta, and Ryan P. Adams. "[Composing graphical models with neural networks for structured representations and fast inference](http://arxiv.org/abs/1603.06277)." arXiv preprint arXiv:1603.06277 (2016).
227 |  - Doersch, Carl. "[Tutorial on Variational Autoencoders](https://arxiv.org/abs/1606.05908)." arXiv preprint arXiv:1606.05908 (2016).
228 |  - Chung, Junyoung, Kyle Kastner, Laurent Dinh, Kratarth Goel, Aaron C. Courville, and Yoshua Bengio. "[A recurrent latent variable model for sequential data](http://arxiv.org/abs/1506.02216)." In Advances in neural information processing systems, pp. 2980-2988. 2015.
229 |  - Eslami, S. M., Nicolas Heess, Theophane Weber, Yuval Tassa, Koray Kavukcuoglu, and Geoffrey E. Hinton. "[Attend, Infer, Repeat: Fast Scene Understanding with Generative Models](https://arxiv.org/abs/1603.08575)." arXiv preprint arXiv:1603.08575 (2016).
230 |  - Shengjia Zhao, Jiaming Song, Stefano Ermon. "[InfoVAE: Information Maximizing Variational Autoencoders](https://arxiv.org/abs/1706.02262)." arXiv:1706.02262 (2017).
231 |  - Goodfellow, Ian, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, and Yoshua Bengio. "[Generative adversarial nets](http://arxiv.org/abs/1406.2661)." In Advances in Neural Information Processing Systems, pp. 2672-2680. 2014.
232 |  - Radford, Alec, Luke Metz, and Soumith Chintala. "[Unsupervised representation learning with deep convolutional generative adversarial networks](http://arxiv.org/abs/1511.06434)." arXiv preprint arXiv:1511.06434 (2015).
233 |  - Denton, Emily L., Soumith Chintala, and Rob Fergus. "[Deep Generative Image Models using a Laplacian Pyramid of Adversarial Networks](http://arxiv.org/abs/1506.05751)." In Advances in neural information processing systems, pp. 1486-1494. 2015.
234 |  - Dosovitskiy, Alexey, Jost Tobias Springenberg, and Thomas Brox. "[Learning to generate chairs with convolutional neural networks](http://arxiv.org/abs/1411.5928)." In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1538-1546. 2015.
235 |  - Mathieu, Michael, Camille Couprie, and Yann LeCun. "[Deep multi-scale video prediction beyond mean square error](http://arxiv.org/abs/1511.05440)." arXiv preprint arXiv:1511.05440 (2015).
236 |  - Salimans, Tim, Ian Goodfellow, Wojciech Zaremba, Vicki Cheung, Alec Radford, and Xi Chen. "[Improved Techniques for Training GANs](http://arxiv.org/abs/1606.03498)." arXiv preprint arXiv:1606.03498 (2016).
237 |  - Chen, Xi, Yan Duan, Rein Houthooft, John Schulman, Ilya Sutskever, and Pieter Abbeel. "[InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets](http://arxiv.org/abs/1606.03657)." arXiv preprint arXiv:1606.03657 (2016).
238 |  - Im, Daniel Jiwoong, Chris Dongjoo Kim, Hui Jiang, and Roland Memisevic. "[Generating images with recurrent adversarial networks](http://arxiv.org/abs/1602.05110)." arXiv preprint arXiv:1602.05110 (2016).
239 |  - Yu, Lantao, Weinan Zhang, Jun Wang, and Yong Yu. "[SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient](http://arxiv.org/abs/1609.05473)." arXiv preprint arXiv:1609.05473 (2016).
240 |  - Augustus Odena, Christopher Olah, Jonathon Shlens. "[Conditional Image Synthesis With Auxiliary Classifier GANs](https://arxiv.org/abs/1610.09585)." arXiv preprint arXiv:1610.09585 (2016).
241 |  - Ian Goodfellow. "[NIPS Tutorial: GANs](http://www.iangoodfellow.com/slides/2016-12-04-NIPS.pdf)", NIPS, 2016
242 |  - Che, Tong, Yanran Li, Ruixiang Zhang, R. Devon Hjelm, Wenjie Li, Yangqiu Song, and Yoshua Bengio. "[Maximum-Likelihood Augmented Discrete Generative Adversarial Networks](https://arxiv.org/abs/1702.07983)." arXiv preprint arXiv:1702.07983 (2017).
243 |  - Junbo (Jake) Zhao, Yoon Kim, Kelly Zhang, Alexander M. Rush, Yann LeCun. "[Adversarially Regularized Autoencoders for Generating Discrete Structures](https://arxiv.org/abs/1706.04223)." arXiv preprint arXiv:1706.04223 (2017).
244 |  - Mike Lewis, Denis Yarats, Yann N. Dauphin, Devi Parikh, Dhruv Batra. "[Deal or No Deal? End-to-End Learning for Negotiation Dialogues](http://s3.amazonaws.com/end-to-end-negotiator/end-to-end-negotiator.pdf)." (2017).
245 |  - Mihaela Rosca, Balaji Lakshminarayanan, David Warde-Farley, Shakir Mohamed. "[Variational Approaches for Auto-Encoding Generative Adversarial Networks](https://arxiv.org/abs/1706.04987)." arXiv preprint arXiv:1706.04987 (2017).
246 |  - Goyal, Prasoon, Zhiting Hu, Xiaodan Liang, Chenyu Wang, and Eric Xing. "[Nonparametric Variational Auto-encoders for Hierarchical Representation Learning](https://arxiv.org/pdf/1703.07027.pdf)." arXiv preprint arXiv:1703.07027 (2017).
247 |  - Sabour, Sara, Nicholas Frosst, and Geoffrey Hinton. "[Dynamic Routing between Capsules](https://arxiv.org/abs/1710.09829)." (2017).
248 |  - Vaswani, Ashish, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Łukasz Kaiser, and Illia Polosukhin. "[Attention is all you need](http://papers.nips.cc/paper/7181-attention-is-all-you-need)." NIPS. 2017.
249 |  
250 | #### Architecture Search
251 | - Frankle, Jonathan, and Michael Carbin. "The lottery ticket hypothesis: Finding sparse, trainable neural networks." arXiv preprint arXiv:1803.03635 (2018).
252 | - Xie, Saining, Alexander Kirillov, Ross Girshick, and Kaiming He. "Exploring Randomly Wired Neural Networks for Image Recognition." arXiv preprint arXiv:1904.01569 (2019).
253 | - So, David R., Chen Liang, and Quoc V. Le. "The Evolved Transformer." arXiv preprint arXiv:1901.11117 (2019).
254 | - Chenguang Wang, Mu Li, Alexander J. Smola. "Language Models with Transformers." arXiv preprint arXiv:1904.09408 (2019).
255 |  
256 | ### Recommendation System
257 | - Salakhutdinov, Ruslan, Andriy Mnih, and Geoffrey Hinton. "[Restricted Boltzmann machines for collaborative filtering](http://dl.acm.org/citation.cfm?id=1273596)." In Proceedings of the 24th international conference on Machine learning, pp. 791-798. ACM, 2007.
258 | - Wang, Hao, Xingjian Shi, and Dit-Yan Yeung. "[Relational Stacked Denoising Autoencoder for Tag Recommendation](http://www.wanghao.in/paper/AAAI15_RSDAE.pdf)." In AAAI, pp. 3052-3058. 2015.
259 | - Wang, Hao, Naiyan Wang, and Dit-Yan Yeung. "[Collaborative deep learning for recommender systems](http://dl.acm.org/citation.cfm?id=2783273)." In Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 1235-1244. ACM, 2015.
260 | - Covington, Paul, Jay Adams, and Emre Sargin. "[Deep neural networks for youtube recommendations](http://dl.acm.org/citation.cfm?id=2959190)." In Proceedings of the 10th ACM Conference on Recommender Systems, pp. 191-198. ACM, 2016.
261 | - Devooght, Robin, and Hugues Bersini. "[Collaborative Filtering with Recurrent Neural Networks](https://arxiv.org/abs/1608.07400)." arXiv preprint arXiv:1608.07400 (2016).
262 | - Wang, Hao, Xingjian Shi, and Dit-Yan Yeung. "[Collaborative recurrent autoencoder: Recommend while learning to fill in the blanks](http://papers.nips.cc/paper/6163-collaborative-recurrent-autoencoder-recommend-while-learning-to-fill-in-the-blanks)." In Advances in Neural Information Processing Systems, pp. 415-423. 2016.
263 | - Tang, Jian, Yifan Yang, Sam Carton, Ming Zhang, and Qiaozhu Mei. "[Context-aware Natural Language Generation with Recurrent Neural Networks](https://arxiv.org/abs/1611.09900)." arXiv preprint arXiv:1611.09900 (2016).
264 | - Zhang, Fuzheng, Nicholas Jing Yuan, Defu Lian, Xing Xie, and Wei-Ying Ma. "[Collaborative Knowledge Base Embedding for Recommender Systems](http://www.kdd.org/kdd2016/subtopic/view/collaborative-knowledge-base-embedding-for-recommender-systems)." KDD, 2016. 
265 | - Dong, Li, Shaohan Huang, Furu Wei, Mirella Lapata, Ming Zhou, and Ke Xu. "[Learning to Generate Product Reviews from Attributes](http://www.aclweb.org/anthology/E/E17/E17-1059.pdf)." EACL, 2017.
266 | - He, Xiangnan. "[Neural Collaborative Filtering](http://www.comp.nus.edu.sg/~xiangnan/papers/ncf.pdf)." WWW, 2017
267 | - Wu, Chao-Yuan, Amr Ahmed, Alex Beutel, Alexander J. Smola, and How Jing. "[Recurrent Recommender Networks](http://alexbeutel.com/papers/rrn_wsdm2017.pdf)." WSDM, 2017.
268 | - Radford, Alec, Rafal Jozefowicz, and Ilya Sutskever. "[Learning to generate reviews and discovering sentiment](https://arxiv.org/pdf/1704.01444.pdf)." arXiv preprint arXiv:1704.01444 (2017).
269 | - Piji Li, Zihao Wang, Zhaochun Ren, Lidong Bing, Wai Lam. "[Neural Rating Regression with Abstractive Tips Generation for Recommendation](https://arxiv.org/abs/1708.00154)." In SIGIR, pp xx-xx. 2017.
270 | 
271 | ### Network Representation Learning
272 |  - [Must-read papers on network representation learning (NRL)/network embedding (NE)](https://github.com/thunlp/NRLPapers) 
273 | 
274 | ### Music Generation
275 |  - [Using machine learning to generate music](http://www.datasciencecentral.com/profiles/blogs/using-machine-learning-to-generate-music)
276 |  
277 | ### Computational Biology
278 |  - [Awesome DeepBio](https://github.com/gokceneraslan/awesome-deepbio) by Gökçen Eraslan
279 | 
280 | ### GO
281 |  - Silver, David, Aja Huang, Chris J. Maddison, Arthur Guez, Laurent Sifre, George van den Driessche, Julian Schrittwieser et al. "[Mastering the game of Go with deep neural networks and tree search](http://www.nature.com/nature/journal/v529/n7587/full/nature16961.html)." Nature 529, no. 7587 (2016): 484-489.
282 |  - Tian, Yuandong, and Yan Zhu. "[Better Computer Go Player with Neural Network and Long-term Prediction](http://arxiv.org/abs/1511.06410)." arXiv preprint arXiv:1511.06410 (2015).
283 |  
284 | ### Stock Prediction
285 |   - Xiao Ding, Yue Zhang, Ting Liu, Junwen Duan. "Deep Learning for Event-Driven Stock Prediction". IJCAI 2015.
286 |   - Si, Jianfeng, Arjun Mukherjee, Bing Liu, Sinno Jialin Pan, Qing Li, and Huayi Li. "[Exploiting Social Relations and Sentiment for Stock Prediction](http://www.aclweb.org/anthology/D14-1120)." EMNLP 2014.
287 |   - Ding, Xiao, Yue Zhang, Ting Liu, and Junwen Duan. "[Using Structured Events to Predict Stock Price Movement: An Empirical Investigation](http://anthology.aclweb.org/D/D14/D14-1148.pdf)." EMNLP 2014.
288 |   - Bollen, Johan, Huina Mao, and Xiaojun Zeng. "[Twitter mood predicts the stock market](http://arxiv.org/abs/1010.3003)." Journal of Computational Science 2, no. 1 (2011): 1-8.
289 |   - Hengjian Jia. "[Investigation Into The Effectiveness Of Long Short Term Memory Networks For Stock Price Prediction](http://arxiv.org/abs/1603.07893)." arXiv:1603.07893. (2016)
290 | 


--------------------------------------------------------------------------------