# MLLM-QA-Papers-with-Code

A collection of papers and code repositories that apply Multi-Modal Large Language Models (MLLMs) to quality assessment tasks.

| Title | Pub | Paper Link | Code Link |
|-------|-----|------------|-----------|
| Q-Bench: A Benchmark for General-Purpose Foundation Models on Low-level Vision | ICLR 2024 | [Paper](https://arxiv.org/pdf/2309.14181.pdf) | [Code](https://github.com/Q-Future/Q-Bench) |
| Q-Instruct: Improving Low-level Visual Abilities for Multi-modality Foundation Models | CVPR 2024 | [Paper](https://arxiv.org/pdf/2311.06783.pdf) | [Code](https://github.com/Q-Future/Q-Instruct) |
| Q-Align: Teaching LMMs for Visual Scoring via Discrete Text-Defined Levels | - | [Paper](https://arxiv.org/pdf/2312.17090.pdf) | [Code](https://github.com/Q-Future/Q-Align) |
| Depicting Beyond Scores: Advancing Image Quality Assessment through Multi-modal Language Models | - | [Paper](https://arxiv.org/pdf/2312.08962.pdf) | [Code](https://depictqa.github.io/) |
| Towards Open-ended Visual Quality Comparison | - | [Paper](https://arxiv.org/abs/2402.16641) | [Code](https://github.com/Q-Future/Co-Instruct) |
| VisualCritic: Making LMMs Perceive Visual Quality Like Humans | - | [Paper](https://arxiv.org/pdf/2403.12806v1.pdf) | - |
| AesBench: An Expert Benchmark for Multimodal Large Language Models on Image Aesthetics Perception | - | [Paper](https://arxiv.org/pdf/2401.08276.pdf) | [Code](https://github.com/yipoh/AesBench) |
| 2AFC Prompting of Large Multimodal Models for Image Quality Assessment | - | [Paper](https://arxiv.org/abs/2402.01162) | [Code](https://github.com/h4nwei/2AFC-LMMs) |
| A Comprehensive Study of Multimodal Large Language Models for Image Quality Assessment | - | [Paper](https://arxiv.org/abs/2403.10854) | - |
| UNIAA: A Unified Multi-modal Image Aesthetic Assessment Baseline and Benchmark | - | [Paper](https://arxiv.org/pdf/2404.09619.pdf) | [Code](https://github.com/Uniaa-MLLM/Uniaa) |
| AesExpert: Towards Multi-modality Foundation Model for Image Aesthetics Perception | - | [Paper](https://arxiv.org/pdf/2404.09624.pdf) | [Code](https://github.com/yipoh/AesExpert) |