├── .github └── ISSUE_TEMPLATE │ ├── bug_report.yaml │ ├── config.yaml │ └── feature_request.yaml ├── .gitignore ├── BUILD.md ├── Dockerfile.qwendemo ├── Dockerfile.qwenint4openai ├── Dockerfile.qwenopenai ├── FAQ.md ├── FAQ_ja.md ├── FAQ_ko.md ├── FAQ_zh.md ├── LICENSE ├── NOTICE ├── README.md ├── README_CN.md ├── README_JA.md ├── README_KO.md ├── TUTORIAL.md ├── TUTORIAL_ja.md ├── TUTORIAL_ko.md ├── TUTORIAL_zh.md ├── assets ├── apple.jpeg ├── apple_r.jpeg ├── demo.jpeg ├── demo_highfive.jpg ├── demo_spotting_caption.jpg ├── demo_vl.gif ├── logo.jpg ├── mm_tutorial │ ├── Beijing.jpeg │ ├── Beijing_Small.jpeg │ ├── Chongqing.jpeg │ ├── Chongqing_Small.jpeg │ ├── Hospital.jpg │ ├── Hospital_Small.jpg │ ├── Menu.jpeg │ ├── Rebecca_(1939_poster).jpeg │ ├── Rebecca_(1939_poster)_Small.jpeg │ ├── Shanghai.jpg │ ├── Shanghai_Output.jpg │ ├── Shanghai_Output_Small.jpeg │ ├── Shanghai_Small.jpeg │ └── TUTORIAL.ipynb ├── qwenvl.jpeg ├── radar.png ├── radar_qwenvlplus.jpg ├── touchstone_datasets.jpg ├── touchstone_eval.png ├── touchstone_logo.png └── wechat.png ├── eval_mm ├── EVALUATION.md ├── data ├── evaluate_caption.py ├── evaluate_grounding.py ├── evaluate_multiple_choice.py ├── evaluate_vqa.py ├── infographicsvqa_eval.py ├── mmbench │ ├── MMBENCH.md │ ├── evaluate_multiple_choice_mmbench.py │ ├── mmbench_converter_dev.py │ ├── mmbench_converter_test.py │ ├── mmbench_evaluation.py │ ├── mmbench_evaluation_tricky.py │ └── mmbench_predict_to_submission.py ├── mme │ ├── EVAL_MME.md │ ├── cognition.jpg │ ├── eval.py │ ├── get_images.py │ └── perception.jpg ├── seed_bench │ ├── EVAL_SEED.md │ ├── eval.py │ ├── leaderboard.jpg │ └── trans.py ├── vqa.py └── vqa_eval.py ├── finetune.py ├── finetune ├── ds_config_zero2.json ├── ds_config_zero3.json ├── finetune_ds.sh ├── finetune_lora_ds.sh ├── finetune_lora_single_gpu.sh ├── finetune_qlora_ds.sh └── finetune_qlora_single_gpu.sh ├── openai_api.py ├── requirements.txt ├── requirements_openai_api.txt ├── requirements_web_demo.txt ├── touchstone ├── README.md ├── README_CN.md ├── README_JA.md └── README_KO.md └── web_demo_mm.py /.github/ISSUE_TEMPLATE/bug_report.yaml: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/.github/ISSUE_TEMPLATE/bug_report.yaml -------------------------------------------------------------------------------- /.github/ISSUE_TEMPLATE/config.yaml: -------------------------------------------------------------------------------- 1 | blank_issues_enabled: true 2 | -------------------------------------------------------------------------------- /.github/ISSUE_TEMPLATE/feature_request.yaml: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/.github/ISSUE_TEMPLATE/feature_request.yaml -------------------------------------------------------------------------------- /.gitignore: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/.gitignore -------------------------------------------------------------------------------- /BUILD.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/BUILD.md -------------------------------------------------------------------------------- /Dockerfile.qwendemo: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/Dockerfile.qwendemo -------------------------------------------------------------------------------- /Dockerfile.qwenint4openai: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/Dockerfile.qwenint4openai -------------------------------------------------------------------------------- /Dockerfile.qwenopenai: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/Dockerfile.qwenopenai -------------------------------------------------------------------------------- /FAQ.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/FAQ.md -------------------------------------------------------------------------------- /FAQ_ja.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/FAQ_ja.md -------------------------------------------------------------------------------- /FAQ_ko.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/FAQ_ko.md -------------------------------------------------------------------------------- /FAQ_zh.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/FAQ_zh.md -------------------------------------------------------------------------------- /LICENSE: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/LICENSE -------------------------------------------------------------------------------- /NOTICE: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/NOTICE -------------------------------------------------------------------------------- /README.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/README.md -------------------------------------------------------------------------------- /README_CN.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/README_CN.md -------------------------------------------------------------------------------- /README_JA.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/README_JA.md -------------------------------------------------------------------------------- /README_KO.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/README_KO.md -------------------------------------------------------------------------------- /TUTORIAL.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/TUTORIAL.md -------------------------------------------------------------------------------- /TUTORIAL_ja.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/TUTORIAL_ja.md -------------------------------------------------------------------------------- /TUTORIAL_ko.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/TUTORIAL_ko.md -------------------------------------------------------------------------------- /TUTORIAL_zh.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/TUTORIAL_zh.md -------------------------------------------------------------------------------- /assets/apple.jpeg: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/assets/apple.jpeg -------------------------------------------------------------------------------- /assets/apple_r.jpeg: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/assets/apple_r.jpeg -------------------------------------------------------------------------------- /assets/demo.jpeg: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/assets/demo.jpeg -------------------------------------------------------------------------------- /assets/demo_highfive.jpg: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/assets/demo_highfive.jpg -------------------------------------------------------------------------------- /assets/demo_spotting_caption.jpg: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/assets/demo_spotting_caption.jpg -------------------------------------------------------------------------------- /assets/demo_vl.gif: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/assets/demo_vl.gif -------------------------------------------------------------------------------- /assets/logo.jpg: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/assets/logo.jpg -------------------------------------------------------------------------------- /assets/mm_tutorial/Beijing.jpeg: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/assets/mm_tutorial/Beijing.jpeg -------------------------------------------------------------------------------- /assets/mm_tutorial/Beijing_Small.jpeg: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/assets/mm_tutorial/Beijing_Small.jpeg -------------------------------------------------------------------------------- /assets/mm_tutorial/Chongqing.jpeg: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/assets/mm_tutorial/Chongqing.jpeg -------------------------------------------------------------------------------- /assets/mm_tutorial/Chongqing_Small.jpeg: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/assets/mm_tutorial/Chongqing_Small.jpeg -------------------------------------------------------------------------------- /assets/mm_tutorial/Hospital.jpg: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/assets/mm_tutorial/Hospital.jpg -------------------------------------------------------------------------------- /assets/mm_tutorial/Hospital_Small.jpg: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/assets/mm_tutorial/Hospital_Small.jpg -------------------------------------------------------------------------------- /assets/mm_tutorial/Menu.jpeg: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/assets/mm_tutorial/Menu.jpeg -------------------------------------------------------------------------------- /assets/mm_tutorial/Rebecca_(1939_poster).jpeg: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/assets/mm_tutorial/Rebecca_(1939_poster).jpeg -------------------------------------------------------------------------------- /assets/mm_tutorial/Rebecca_(1939_poster)_Small.jpeg: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/assets/mm_tutorial/Rebecca_(1939_poster)_Small.jpeg -------------------------------------------------------------------------------- /assets/mm_tutorial/Shanghai.jpg: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/assets/mm_tutorial/Shanghai.jpg -------------------------------------------------------------------------------- /assets/mm_tutorial/Shanghai_Output.jpg: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/assets/mm_tutorial/Shanghai_Output.jpg -------------------------------------------------------------------------------- /assets/mm_tutorial/Shanghai_Output_Small.jpeg: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/assets/mm_tutorial/Shanghai_Output_Small.jpeg -------------------------------------------------------------------------------- /assets/mm_tutorial/Shanghai_Small.jpeg: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/assets/mm_tutorial/Shanghai_Small.jpeg -------------------------------------------------------------------------------- /assets/mm_tutorial/TUTORIAL.ipynb: -------------------------------------------------------------------------------- 1 | -------------------------------------------------------------------------------- /assets/qwenvl.jpeg: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/assets/qwenvl.jpeg -------------------------------------------------------------------------------- /assets/radar.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/assets/radar.png -------------------------------------------------------------------------------- /assets/radar_qwenvlplus.jpg: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/assets/radar_qwenvlplus.jpg -------------------------------------------------------------------------------- /assets/touchstone_datasets.jpg: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/assets/touchstone_datasets.jpg -------------------------------------------------------------------------------- /assets/touchstone_eval.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/assets/touchstone_eval.png -------------------------------------------------------------------------------- /assets/touchstone_logo.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/assets/touchstone_logo.png -------------------------------------------------------------------------------- /assets/wechat.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/assets/wechat.png -------------------------------------------------------------------------------- /eval_mm/EVALUATION.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/eval_mm/EVALUATION.md -------------------------------------------------------------------------------- /eval_mm/data: -------------------------------------------------------------------------------- 1 | /cpfs01/shared/public/shusheng.yss/datasets/qwenvl_evaluation -------------------------------------------------------------------------------- /eval_mm/evaluate_caption.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/eval_mm/evaluate_caption.py -------------------------------------------------------------------------------- /eval_mm/evaluate_grounding.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/eval_mm/evaluate_grounding.py -------------------------------------------------------------------------------- /eval_mm/evaluate_multiple_choice.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/eval_mm/evaluate_multiple_choice.py -------------------------------------------------------------------------------- /eval_mm/evaluate_vqa.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/eval_mm/evaluate_vqa.py -------------------------------------------------------------------------------- /eval_mm/infographicsvqa_eval.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/eval_mm/infographicsvqa_eval.py -------------------------------------------------------------------------------- /eval_mm/mmbench/MMBENCH.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/eval_mm/mmbench/MMBENCH.md -------------------------------------------------------------------------------- /eval_mm/mmbench/evaluate_multiple_choice_mmbench.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/eval_mm/mmbench/evaluate_multiple_choice_mmbench.py -------------------------------------------------------------------------------- /eval_mm/mmbench/mmbench_converter_dev.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/eval_mm/mmbench/mmbench_converter_dev.py -------------------------------------------------------------------------------- /eval_mm/mmbench/mmbench_converter_test.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/eval_mm/mmbench/mmbench_converter_test.py -------------------------------------------------------------------------------- /eval_mm/mmbench/mmbench_evaluation.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/eval_mm/mmbench/mmbench_evaluation.py -------------------------------------------------------------------------------- /eval_mm/mmbench/mmbench_evaluation_tricky.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/eval_mm/mmbench/mmbench_evaluation_tricky.py -------------------------------------------------------------------------------- /eval_mm/mmbench/mmbench_predict_to_submission.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/eval_mm/mmbench/mmbench_predict_to_submission.py -------------------------------------------------------------------------------- /eval_mm/mme/EVAL_MME.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/eval_mm/mme/EVAL_MME.md -------------------------------------------------------------------------------- /eval_mm/mme/cognition.jpg: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/eval_mm/mme/cognition.jpg -------------------------------------------------------------------------------- /eval_mm/mme/eval.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/eval_mm/mme/eval.py -------------------------------------------------------------------------------- /eval_mm/mme/get_images.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/eval_mm/mme/get_images.py -------------------------------------------------------------------------------- /eval_mm/mme/perception.jpg: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/eval_mm/mme/perception.jpg -------------------------------------------------------------------------------- /eval_mm/seed_bench/EVAL_SEED.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/eval_mm/seed_bench/EVAL_SEED.md -------------------------------------------------------------------------------- /eval_mm/seed_bench/eval.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/eval_mm/seed_bench/eval.py -------------------------------------------------------------------------------- /eval_mm/seed_bench/leaderboard.jpg: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/eval_mm/seed_bench/leaderboard.jpg -------------------------------------------------------------------------------- /eval_mm/seed_bench/trans.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/eval_mm/seed_bench/trans.py -------------------------------------------------------------------------------- /eval_mm/vqa.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/eval_mm/vqa.py -------------------------------------------------------------------------------- /eval_mm/vqa_eval.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/eval_mm/vqa_eval.py -------------------------------------------------------------------------------- /finetune.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/finetune.py -------------------------------------------------------------------------------- /finetune/ds_config_zero2.json: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/finetune/ds_config_zero2.json -------------------------------------------------------------------------------- /finetune/ds_config_zero3.json: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/finetune/ds_config_zero3.json -------------------------------------------------------------------------------- /finetune/finetune_ds.sh: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/finetune/finetune_ds.sh -------------------------------------------------------------------------------- /finetune/finetune_lora_ds.sh: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/finetune/finetune_lora_ds.sh -------------------------------------------------------------------------------- /finetune/finetune_lora_single_gpu.sh: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/finetune/finetune_lora_single_gpu.sh -------------------------------------------------------------------------------- /finetune/finetune_qlora_ds.sh: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/finetune/finetune_qlora_ds.sh -------------------------------------------------------------------------------- /finetune/finetune_qlora_single_gpu.sh: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/finetune/finetune_qlora_single_gpu.sh -------------------------------------------------------------------------------- /openai_api.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/openai_api.py -------------------------------------------------------------------------------- /requirements.txt: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/requirements.txt -------------------------------------------------------------------------------- /requirements_openai_api.txt: -------------------------------------------------------------------------------- 1 | fastapi 2 | uvicorn 3 | openai 4 | pydantic 5 | sse_starlette 6 | -------------------------------------------------------------------------------- /requirements_web_demo.txt: -------------------------------------------------------------------------------- 1 | gradio 2 | modelscope 3 | -------------------------------------------------------------------------------- /touchstone/README.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/touchstone/README.md -------------------------------------------------------------------------------- /touchstone/README_CN.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/touchstone/README_CN.md -------------------------------------------------------------------------------- /touchstone/README_JA.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/touchstone/README_JA.md -------------------------------------------------------------------------------- /touchstone/README_KO.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/touchstone/README_KO.md -------------------------------------------------------------------------------- /web_demo_mm.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/QwenLM/Qwen-VL/HEAD/web_demo_mm.py --------------------------------------------------------------------------------