├── LICENSE ├── README.md ├── annotations ├── bbox │ ├── dev.jsonl │ ├── test.jsonl │ └── train.jsonl └── qa │ ├── dev.jsonl │ ├── test.jsonl │ └── train.jsonl ├── download_slides_slideshare.py ├── evaluate.py ├── example.png ├── extract_ocr_tessearct.py ├── extract_ocr_visionAPI.py └── requirements.txt /LICENSE: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nttmdlab-nlp/SlideVQA/HEAD/LICENSE -------------------------------------------------------------------------------- /README.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nttmdlab-nlp/SlideVQA/HEAD/README.md -------------------------------------------------------------------------------- /annotations/bbox/dev.jsonl: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nttmdlab-nlp/SlideVQA/HEAD/annotations/bbox/dev.jsonl -------------------------------------------------------------------------------- /annotations/bbox/test.jsonl: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nttmdlab-nlp/SlideVQA/HEAD/annotations/bbox/test.jsonl -------------------------------------------------------------------------------- /annotations/bbox/train.jsonl: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nttmdlab-nlp/SlideVQA/HEAD/annotations/bbox/train.jsonl -------------------------------------------------------------------------------- /annotations/qa/dev.jsonl: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nttmdlab-nlp/SlideVQA/HEAD/annotations/qa/dev.jsonl -------------------------------------------------------------------------------- /annotations/qa/test.jsonl: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nttmdlab-nlp/SlideVQA/HEAD/annotations/qa/test.jsonl -------------------------------------------------------------------------------- /annotations/qa/train.jsonl: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nttmdlab-nlp/SlideVQA/HEAD/annotations/qa/train.jsonl -------------------------------------------------------------------------------- /download_slides_slideshare.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nttmdlab-nlp/SlideVQA/HEAD/download_slides_slideshare.py -------------------------------------------------------------------------------- /evaluate.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nttmdlab-nlp/SlideVQA/HEAD/evaluate.py -------------------------------------------------------------------------------- /example.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nttmdlab-nlp/SlideVQA/HEAD/example.png -------------------------------------------------------------------------------- /extract_ocr_tessearct.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nttmdlab-nlp/SlideVQA/HEAD/extract_ocr_tessearct.py -------------------------------------------------------------------------------- /extract_ocr_visionAPI.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/nttmdlab-nlp/SlideVQA/HEAD/extract_ocr_visionAPI.py -------------------------------------------------------------------------------- /requirements.txt: -------------------------------------------------------------------------------- 1 | pytesseract==0.3.8 2 | google-cloud-vision==2.7.3 3 | opencv-python 4 | --------------------------------------------------------------------------------