Achieving accurate and interpretable AI-generated images detection via VLMs' visual reasoning capability.
FakeReasoning conducts forgery reasoning task in structured stages (summary, caption, reasoning and conclusion) and hierarchical steps (low-level and high-level), leading to accurate and interpretable detection.
Constructed with advanced GPT-4o, MMFR-Dataset contains over 100,000 images with over
300,000 reasoning annotations as its training set.
Here we illustrate several images and according annotations from reasoning stage.
Evaluation sets include 20,000 images with over 60,000 reasoning annotations across 10 up-to-data generative models
and are balanced in terms of authenticity.
Here we illustrate several real and fake images from each evaluation set.
@article{gao2025fakereasoning, title={FakeReasoning: Towards Generalizable Forgery Detection and Reasoning}, author={Gao, Yueying and Chang, Dongliang and Yu, Bingyao and Qin, Haotian and Chen, Lei and Liang, Kongming and Ma, Zhanyu}, journal={arXiv preprint arXiv:2503.21210}, year={2025}, url={https://arxiv.org/abs/2503.21210} }