└── README.md /README.md: -------------------------------------------------------------------------------- 1 | ## Dataset for paper "Towards Explainable NLP: A Generative Explanation Framework for Text Classification" 2 | 3 | ###Datasets 4 | This repository contains two datasets used in the paper "Towards Explainable NLP: A Generative Explanation Framework for Text Classification". You may download both of the two datasets from [here](https://drive.google.com/open?id=1o_r3D-lngEXPK8mtx8UQCbadR65Ycv5a) 5 | 6 | + PCMag Review Dataset 7 | This dataset is crawled from the website [PCMag](https://www.pcmag.com/). It is a website providing reviews for electronic products, like laptops, smartphones, cameras and so on. 8 | Each item in the dataset consists of three parts: a long review text, three short comments, and an overall rating score for the product. 9 | + Three short comments are summaries of the long review respectively from positive, negative, neutral perspectives. 10 | + An overall rating score is a number ranging from 0 to 5, and the possible values that the score could be are {1.0, 1.5, 2.0, ..., 5.0}. 11 | 12 | + Skytrax User Reviews Dataset 13 | This dataset we used in our paper is adapted from the [original version](https://github.com/quankiquanki/skytrax-reviews-dataset). 14 | Each item in this dataset consists of three parts: a review text, five sub-field scores and an overall rating score. 15 | + The five sub-field scores respectively stand for the user’s ratings for seat comfortability, cabin stuff, food, in-flight environment, and ticket value, and each score is an integer between 0 and 5. 16 | + The overall score is an integer between 1 and 10. 17 | 18 | ### Reference 19 | ``` 20 | @inproceedings{liu-etal-2019-towards-explainable, 21 | title = "Towards Explainable {NLP}: A Generative Explanation Framework for Text Classification", 22 | author = "Liu, Hui and 23 | Yin, Qingyu and 24 | Wang, William Yang", 25 | booktitle = "Proceedings of the 57th Conference of the Association for Computational Linguistics", 26 | month = jul, 27 | year = "2019", 28 | address = "Florence, Italy", 29 | publisher = "Association for Computational Linguistics", 30 | url = "https://www.aclweb.org/anthology/P19-1560", 31 | pages = "5570--5581", 32 | } 33 | ``` 34 | 35 | 36 | --------------------------------------------------------------------------------