├── README.md ├── Step1_get_matched_sentences.ipynb ├── Step2_identify_spurious_correlated_words.ipynb ├── Step3_feature_selection.ipynb ├── data ├── kindle_short_sents.pickle ├── kindle_unique_sents.pkl ├── readme.md ├── toxic_short_sents.pickle └── toxic_tweets.csv ├── data_structure.py ├── kindle_random_samples.csv ├── sentiment_lexicon.csv └── spurious_words.py /README.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tapilab/emnlp-2020-spurious/HEAD/README.md -------------------------------------------------------------------------------- /Step1_get_matched_sentences.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tapilab/emnlp-2020-spurious/HEAD/Step1_get_matched_sentences.ipynb -------------------------------------------------------------------------------- /Step2_identify_spurious_correlated_words.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tapilab/emnlp-2020-spurious/HEAD/Step2_identify_spurious_correlated_words.ipynb -------------------------------------------------------------------------------- /Step3_feature_selection.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tapilab/emnlp-2020-spurious/HEAD/Step3_feature_selection.ipynb -------------------------------------------------------------------------------- /data/kindle_short_sents.pickle: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tapilab/emnlp-2020-spurious/HEAD/data/kindle_short_sents.pickle -------------------------------------------------------------------------------- /data/kindle_unique_sents.pkl: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tapilab/emnlp-2020-spurious/HEAD/data/kindle_unique_sents.pkl -------------------------------------------------------------------------------- /data/readme.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tapilab/emnlp-2020-spurious/HEAD/data/readme.md -------------------------------------------------------------------------------- /data/toxic_short_sents.pickle: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tapilab/emnlp-2020-spurious/HEAD/data/toxic_short_sents.pickle -------------------------------------------------------------------------------- /data/toxic_tweets.csv: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tapilab/emnlp-2020-spurious/HEAD/data/toxic_tweets.csv -------------------------------------------------------------------------------- /data_structure.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tapilab/emnlp-2020-spurious/HEAD/data_structure.py -------------------------------------------------------------------------------- /kindle_random_samples.csv: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tapilab/emnlp-2020-spurious/HEAD/kindle_random_samples.csv -------------------------------------------------------------------------------- /sentiment_lexicon.csv: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tapilab/emnlp-2020-spurious/HEAD/sentiment_lexicon.csv -------------------------------------------------------------------------------- /spurious_words.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/tapilab/emnlp-2020-spurious/HEAD/spurious_words.py --------------------------------------------------------------------------------