├── LICENSE ├── README.md ├── needed_files.url.filelist ├── scripts ├── Explore NYTime.ipynb ├── download.sh └── to_jsonl.py ├── test_JPTimes.url.filelist ├── test_NYTimes.url.filelist ├── train.url.filelist ├── train_sample.jsonl ├── valid.url.filelist └── webarchive.url.filelist /LICENSE: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/ygorg/KPTimes/HEAD/LICENSE -------------------------------------------------------------------------------- /README.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/ygorg/KPTimes/HEAD/README.md -------------------------------------------------------------------------------- /needed_files.url.filelist: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/ygorg/KPTimes/HEAD/needed_files.url.filelist -------------------------------------------------------------------------------- /scripts/Explore NYTime.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/ygorg/KPTimes/HEAD/scripts/Explore NYTime.ipynb -------------------------------------------------------------------------------- /scripts/download.sh: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/ygorg/KPTimes/HEAD/scripts/download.sh -------------------------------------------------------------------------------- /scripts/to_jsonl.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/ygorg/KPTimes/HEAD/scripts/to_jsonl.py -------------------------------------------------------------------------------- /test_JPTimes.url.filelist: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/ygorg/KPTimes/HEAD/test_JPTimes.url.filelist -------------------------------------------------------------------------------- /test_NYTimes.url.filelist: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/ygorg/KPTimes/HEAD/test_NYTimes.url.filelist -------------------------------------------------------------------------------- /train.url.filelist: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/ygorg/KPTimes/HEAD/train.url.filelist -------------------------------------------------------------------------------- /train_sample.jsonl: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/ygorg/KPTimes/HEAD/train_sample.jsonl -------------------------------------------------------------------------------- /valid.url.filelist: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/ygorg/KPTimes/HEAD/valid.url.filelist -------------------------------------------------------------------------------- /webarchive.url.filelist: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/ygorg/KPTimes/HEAD/webarchive.url.filelist --------------------------------------------------------------------------------