├── .gitattributes
├── GPT_eval
│   ├── detailed_evaluation
│   │   ├── difficulty_specific_evaluation.py
│   │   └── domain_specific_evaluation.py
│   ├── examples
│   │   ├── meta_llama_3-1_70b_instruct_gpteval.jsonl
│   │   └── qwen_2_5_MATH_72b_instruct_gpteval.jsonl
│   ├── get_result.py
│   ├── get_result.sh
│   └── gpt_evaluation_template.txt
├── Omni-Judge_eval
│   ├── detailed_evaluation
│   │   ├── difficulty_specific_evaluation.py
│   │   └── domain_specific_evaluation.py
│   ├── examples_infile
│   │   ├── meta_llama_3-1_70b_infile.jsonl
│   │   └── qwen_2_5_MATH_72b_instruct_infile.jsonl
│   ├── get_result.py
│   ├── omni_judge.py
│   ├── omni_judge.sh
│   ├── omni_judge_vllm.py
│   └── omni_judge_vllm.sh
├── Omni-Math.jsonl
├── README.md
└── imgs
    ├── MiniLogo.png
    ├── box_plot.png
    └── head_picture.jpg

/.gitattributes:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/KbsdJames/Omni-MATH/HEAD/.gitattributes
--------------------------------------------------------------------------------
/GPT_eval/detailed_evaluation/difficulty_specific_evaluation.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/KbsdJames/Omni-MATH/HEAD/GPT_eval/detailed_evaluation/difficulty_specific_evaluation.py
--------------------------------------------------------------------------------
/GPT_eval/detailed_evaluation/domain_specific_evaluation.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/KbsdJames/Omni-MATH/HEAD/GPT_eval/detailed_evaluation/domain_specific_evaluation.py
--------------------------------------------------------------------------------
/GPT_eval/examples/meta_llama_3-1_70b_instruct_gpteval.jsonl:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/KbsdJames/Omni-MATH/HEAD/GPT_eval/examples/meta_llama_3-1_70b_instruct_gpteval.jsonl
--------------------------------------------------------------------------------
/GPT_eval/examples/qwen_2_5_MATH_72b_instruct_gpteval.jsonl:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/KbsdJames/Omni-MATH/HEAD/GPT_eval/examples/qwen_2_5_MATH_72b_instruct_gpteval.jsonl
--------------------------------------------------------------------------------
/GPT_eval/get_result.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/KbsdJames/Omni-MATH/HEAD/GPT_eval/get_result.py
--------------------------------------------------------------------------------
/GPT_eval/get_result.sh:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/KbsdJames/Omni-MATH/HEAD/GPT_eval/get_result.sh
--------------------------------------------------------------------------------
/GPT_eval/gpt_evaluation_template.txt:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/KbsdJames/Omni-MATH/HEAD/GPT_eval/gpt_evaluation_template.txt
--------------------------------------------------------------------------------
/Omni-Judge_eval/detailed_evaluation/difficulty_specific_evaluation.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/KbsdJames/Omni-MATH/HEAD/Omni-Judge_eval/detailed_evaluation/difficulty_specific_evaluation.py
--------------------------------------------------------------------------------
/Omni-Judge_eval/detailed_evaluation/domain_specific_evaluation.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/KbsdJames/Omni-MATH/HEAD/Omni-Judge_eval/detailed_evaluation/domain_specific_evaluation.py
--------------------------------------------------------------------------------
/Omni-Judge_eval/examples_infile/meta_llama_3-1_70b_infile.jsonl:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/KbsdJames/Omni-MATH/HEAD/Omni-Judge_eval/examples_infile/meta_llama_3-1_70b_infile.jsonl
--------------------------------------------------------------------------------
/Omni-Judge_eval/examples_infile/qwen_2_5_MATH_72b_instruct_infile.jsonl:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/KbsdJames/Omni-MATH/HEAD/Omni-Judge_eval/examples_infile/qwen_2_5_MATH_72b_instruct_infile.jsonl
--------------------------------------------------------------------------------
/Omni-Judge_eval/get_result.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/KbsdJames/Omni-MATH/HEAD/Omni-Judge_eval/get_result.py
--------------------------------------------------------------------------------
/Omni-Judge_eval/omni_judge.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/KbsdJames/Omni-MATH/HEAD/Omni-Judge_eval/omni_judge.py
--------------------------------------------------------------------------------
/Omni-Judge_eval/omni_judge.sh:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/KbsdJames/Omni-MATH/HEAD/Omni-Judge_eval/omni_judge.sh
--------------------------------------------------------------------------------
/Omni-Judge_eval/omni_judge_vllm.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/KbsdJames/Omni-MATH/HEAD/Omni-Judge_eval/omni_judge_vllm.py
--------------------------------------------------------------------------------
/Omni-Judge_eval/omni_judge_vllm.sh:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/KbsdJames/Omni-MATH/HEAD/Omni-Judge_eval/omni_judge_vllm.sh
--------------------------------------------------------------------------------
/Omni-Math.jsonl:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/KbsdJames/Omni-MATH/HEAD/Omni-Math.jsonl
--------------------------------------------------------------------------------
/README.md:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/KbsdJames/Omni-MATH/HEAD/README.md
--------------------------------------------------------------------------------
/imgs/MiniLogo.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/KbsdJames/Omni-MATH/HEAD/imgs/MiniLogo.png
--------------------------------------------------------------------------------
/imgs/box_plot.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/KbsdJames/Omni-MATH/HEAD/imgs/box_plot.png
--------------------------------------------------------------------------------
/imgs/head_picture.jpg:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/KbsdJames/Omni-MATH/HEAD/imgs/head_picture.jpg
--------------------------------------------------------------------------------