├── LICENSE ├── README.md ├── figures ├── acc_vs_len.png ├── cover_fig.png ├── efficiency.png ├── examples.png ├── leaderboard.png ├── pipeline.png └── stat.png ├── grading.py ├── grading_results ├── result_example.jsonl.log └── result_p_subset_example.jsonl.log ├── model_responses └── example.jsonl ├── requirements.txt └── utils.py /LICENSE: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/meituan-longcat/AMO-Bench/HEAD/LICENSE -------------------------------------------------------------------------------- /README.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/meituan-longcat/AMO-Bench/HEAD/README.md -------------------------------------------------------------------------------- /figures/acc_vs_len.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/meituan-longcat/AMO-Bench/HEAD/figures/acc_vs_len.png -------------------------------------------------------------------------------- /figures/cover_fig.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/meituan-longcat/AMO-Bench/HEAD/figures/cover_fig.png -------------------------------------------------------------------------------- /figures/efficiency.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/meituan-longcat/AMO-Bench/HEAD/figures/efficiency.png -------------------------------------------------------------------------------- /figures/examples.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/meituan-longcat/AMO-Bench/HEAD/figures/examples.png -------------------------------------------------------------------------------- /figures/leaderboard.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/meituan-longcat/AMO-Bench/HEAD/figures/leaderboard.png -------------------------------------------------------------------------------- /figures/pipeline.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/meituan-longcat/AMO-Bench/HEAD/figures/pipeline.png -------------------------------------------------------------------------------- /figures/stat.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/meituan-longcat/AMO-Bench/HEAD/figures/stat.png -------------------------------------------------------------------------------- /grading.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/meituan-longcat/AMO-Bench/HEAD/grading.py -------------------------------------------------------------------------------- /grading_results/result_example.jsonl.log: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/meituan-longcat/AMO-Bench/HEAD/grading_results/result_example.jsonl.log -------------------------------------------------------------------------------- /grading_results/result_p_subset_example.jsonl.log: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/meituan-longcat/AMO-Bench/HEAD/grading_results/result_p_subset_example.jsonl.log -------------------------------------------------------------------------------- /model_responses/example.jsonl: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/meituan-longcat/AMO-Bench/HEAD/model_responses/example.jsonl -------------------------------------------------------------------------------- /requirements.txt: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/meituan-longcat/AMO-Bench/HEAD/requirements.txt -------------------------------------------------------------------------------- /utils.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/meituan-longcat/AMO-Bench/HEAD/utils.py --------------------------------------------------------------------------------