FakeReasoning: Towards Generalizable Forgery Detection and Reasoning

Accurate and interpretable detection of AI-generated images via the visual reasoning capability of VLMs.

Paper

Code

Dataset

Authors

Yueying Gao¹, Dongliang Chang², Bingyao Yu², Haotian Qin¹, Lei Chen², Kongming Liang¹, Zhanyu Ma¹

¹ PRIS, Beijing University of Posts and Telecommunications
² Tsinghua University

Contributions

FakeReasoning: A forgery detection and reasoning framework, providing accurate detection with structured and reliable reasoning on forgery attributes.

MMFR-Dataset: A multi-modal forgery reasoning dataset, containing 100K training images and 20K evaluation images annotated with detailed reasoning on forgery attributes.

Detection and Reasoning Performance of FakeReasoning

FakeReasoning performs the forgery reasoning task in structured stages (summary, caption, reasoning, and conclusion) and in hierarchical steps (low-level and high-level), leading to accurate and interpretable detection.
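To make the staged structure concrete, here is a minimal sketch of how a four-stage response could be split into its parts. The stage names come from the description above. The `<TAG>...</TAG>` wrapping, the `StagedResponse` class, and the `parse_staged_response` helper are assumptions made for illustration and may not match the model's actual output format.

```python
import re
from dataclasses import dataclass

# Stage names follow the text above. The <TAG>...</TAG> wrapping is an
# assumption for this sketch, not a confirmed output format.
STAGES = ("summary", "caption", "reasoning", "conclusion")


@dataclass
class StagedResponse:
    """Hypothetical container for one staged forgery-reasoning response."""
    summary: str
    caption: str
    reasoning: str
    conclusion: str


def parse_staged_response(text: str) -> StagedResponse:
    """Extract each stage from a tagged response; fail if a stage is missing."""
    fields = {}
    for stage in STAGES:
        tag = stage.upper()
        match = re.search(rf"<{tag}>(.*?)</{tag}>", text, flags=re.DOTALL)
        if match is None:
            raise ValueError(f"missing stage: {stage}")
        fields[stage] = match.group(1).strip()
    return StagedResponse(**fields)


# Made-up example response, written only to exercise the parser.
example = (
    "<SUMMARY>Check the image for signs of generation.</SUMMARY>"
    "<CAPTION>A dog sitting on a beach.</CAPTION>"
    "<REASONING>Low-level: repeated fur texture. "
    "High-level: the shadow direction is inconsistent.</REASONING>"
    "<CONCLUSION>fake</CONCLUSION>"
)
parsed = parse_staged_response(example)
print(parsed.conclusion)
```

Keeping the conclusion in its own field means the detection verdict can be scored separately from the free-text reasoning that supports it.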

MMFR-Dataset

Constructed with GPT-4o, the MMFR-Dataset training set contains over 100,000 images with over 300,000 reasoning annotations.
Here we show several images and their corresponding annotations from the reasoning stage.

The evaluation sets include 20,000 images with over 60,000 reasoning annotations across 10 up-to-date generative models, and each is balanced between real and fake images.
Here we illustrate several real and fake images from each evaluation set.

DeepFloyd IF

DALLE-3

Stable Diffusion

Guided

GLIDE

GigaGAN

StyleGAN-XL

StyleGAN2

GauGAN

BigGAN
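Because each evaluation set listed above is balanced between real and fake images, plain accuracy per set is a reasonable way to compare detectors across generators. The sketch below shows one way to compute it. The record layout, the function names, and the toy predictions are all hypothetical and are not taken from the released code or results.

```python
from collections import defaultdict


def per_set_accuracy(records):
    """Compute accuracy per evaluation set.

    `records` is an iterable of (set_name, true_label, predicted_label)
    tuples. This layout is assumed for the sketch.
    """
    correct = defaultdict(int)
    total = defaultdict(int)
    for set_name, true_label, predicted_label in records:
        total[set_name] += 1
        correct[set_name] += int(true_label == predicted_label)
    return {name: correct[name] / total[name] for name in total}


def mean_accuracy(per_set):
    """Average the per-set accuracies so every generator counts equally."""
    return sum(per_set.values()) / len(per_set)


# Toy predictions, invented purely to exercise the functions.
toy_records = [
    ("GLIDE", "fake", "fake"),
    ("GLIDE", "real", "real"),
    ("BigGAN", "fake", "real"),
    ("BigGAN", "real", "real"),
]
scores = per_set_accuracy(toy_records)
print(scores)
print(mean_accuracy(scores))
```

Averaging over sets rather than over all images keeps a generator with more samples from dominating the summary number.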

Cite Our Work

@article{gao2025fakereasoning,
  title={FakeReasoning: Towards Generalizable Forgery Detection and Reasoning},
  author={Gao, Yueying and Chang, Dongliang and Yu, Bingyao and Qin, Haotian and Chen, Lei and Liang, Kongming and Ma, Zhanyu},
  journal={arXiv preprint arXiv:2503.21210},
  year={2025},
  url={https://arxiv.org/abs/2503.21210}
}