Can AI image detectors be 100% accurate?

No. Every published detector has measurable false-positive and false-negative rates. State-of-the-art research detectors plateau between 85% and 95% balanced accuracy on in-distribution data and degrade sharply once images are recompressed, resized, or screenshotted.

Why do two different detectors give different verdicts on the same image?

They look at different signals. One may rely on JPEG block artifacts, another on FFT high-frequency energy, a third on a learned classifier trained on Stable Diffusion 1.5 outputs. Each signal can be present, absent, or misleading on a given image.

Does humanizing an image always defeat detectors?

Not always — but a good humanization pipeline that re-introduces sensor-realistic noise, repacks JPEG, and breaks frequency fingerprints will defeat the majority of public detectors today.

How AI Image Detectors Actually Work — A 2026 Technical Guide

title: "How AI Image Detectors Actually Work — A 2026 Technical Guide" description: "A long, technical look at how modern AI image detectors really work — frequency analysis, PRNU, sensor noise, perplexity-style features, and why they disagree." slug: "how-ai-image-detectors-work" publishedAt: "2026-04-15" updatedAt: "2026-04-21" author: "SynthGuard Team" category: "ai-detection" tags: ["detectors", "forensics", "research", "diffusion"] readingTime: 14 coverImage: "/blog/covers/how-ai-image-detectors-work.webp" featured: true faq:

q: "Can AI image detectors be 100% accurate?" a: "No. Every published detector has measurable false-positive and false-negative rates. State-of-the-art research detectors plateau between 85% and 95% balanced accuracy on in-distribution data and degrade sharply once images are recompressed, resized, or screenshotted."
q: "Why do two different detectors give different verdicts on the same image?" a: "They look at different signals. One may rely on JPEG block artifacts, another on FFT high-frequency energy, a third on a learned classifier trained on Stable Diffusion 1.5 outputs. Each signal can be present, absent, or misleading on a given image."
q: "Does humanizing an image always defeat detectors?" a: "Not always — but a good humanization pipeline that re-introduces sensor-realistic noise, repacks JPEG, and breaks frequency fingerprints will defeat the majority of public detectors today." related: ["prnu-fft-sensor-noise", "humanize-ai-images-without-losing-quality", "is-this-photo-ai", "why-real-photo-flagged-ai"]

AI image detectors look magical from the outside — drop an image, get a percentage, ship the verdict. Inside, they are an assembly of brittle statistical signals stacked on top of each other, each capable of being right, wrong, or actively misleading. This is the long, honest version: what the signals actually are, why they disagree, and where the field is heading in 2026.

The fundamental asymmetry#

Generative models produce images that are statistically plausible but not statistically natural. A photograph carries decades of physical baggage — sensor read noise, demosaicing artifacts, lens vignetting, JPEG quantization done by a specific camera firmware. A diffusion sample carries none of that by default. It carries a different baggage: training-distribution priors, decoder upsampling artifacts, and a uniform high-frequency signature.

Every detector exploits this gap. The interesting question is which part of the gap.

Signal 1 — Frequency-domain fingerprints#

If you take the 2D Fourier transform of a real photograph and compare it to a diffusion output, you can often spot the difference with the naked eye. Real images decay smoothly toward higher frequencies because optics low-pass them, and demosaicing introduces a characteristic cross pattern. Diffusion outputs frequently have a near-uniform high-frequency floor because the decoder upsamples from a low-resolution latent.

This shows up as:

An unnaturally flat radial spectrum above ~0.4 Nyquist
Faint grid artifacts at the latent grid (typically 8× or 16×)
Missing or compressed demosaicing kernels in the corners

A naive FFT detector flags any image with a too-flat radial spectrum. The problem: bicubic upscalers, AI-enhanced "HDR" filters in modern phones, and even aggressive Lightroom sharpening reproduce the same flatness on perfectly real images.

Signal 2 — JPEG re-encoding artifacts#

JPEG quantizes 8×8 DCT blocks. When you re-save a JPEG, the new quantization grid usually does not align with the previous one, leaving a faint "double-quantization" trace. Real photos taken on a phone have one generation of JPEG. Photos that were generated, exported as PNG, and then re-uploaded to a JPEG-only platform have one JPEG. AI images that were generated, edited in Photoshop, and exported as JPEG again often have two or three.

A double-JPEG detector looks at the histogram of the first DCT coefficients of each 8×8 block. A bimodal histogram is a strong tell. This is fast (it runs on the compressed stream without decoding) and surprisingly reliable on social-media downloads — but it tells you nothing about whether the content is AI-generated. It only tells you the file has been re-encoded.

Signal 3 — PRNU and sensor noise#

Photo Response Non-Uniformity (PRNU) is the per-pixel noise pattern unique to each physical sensor. Two iPhone 15 Pros leave subtly different PRNU patterns on every shot. Generated images have no PRNU — they have whatever noise the decoder hallucinated, which is uniform and channel-correlated in ways real sensors are not.

Modern PRNU-aware detectors do not try to identify which camera took an image. They check whether the high-frequency residual is consistent with any plausible sensor model:

Is the green channel noisier than red and blue (Bayer pattern)?
Does noise variance scale with luminance (shot noise)?
Are noise correlations between adjacent pixels close to demosaic kernels?

If the residual is too clean, too uniform, or has the wrong inter-channel correlation, the detector flags it. This is one of the strongest content-agnostic signals available today.

Signal 4 — Learned classifiers (the CNN/ViT layer)#

The classic approach — train a ResNet or ViT on (real, fake) pairs — works beautifully on the distribution it was trained on and falls off a cliff outside it. A classifier trained on Stable Diffusion 1.5 routinely scores below 60% on Midjourney v6 outputs. A classifier trained on 2023 GAN faces is essentially blind to 2025 transformer-based portrait models.

The state of the art uses contrastive backbones (DINOv2, SigLIP) frozen as feature extractors with a small detection head trained on diverse generators. Even those plateau in the high 80s on out-of-distribution data.

Signal 5 — Perplexity-style patch statistics#

Borrowed from text detection: ask whether each local patch of the image is too predictable given its neighbors. Real photos have surprising patches — a sudden rim light, a smudged background, a chromatic aberration on a high-contrast edge. Generated images tend to be too coherent — every patch is exactly what the surrounding context predicts.

This is implemented as either:

Block variance ratios — variance of each 32×32 patch divided by the variance of its surroundings. A flat ratio across the image is suspicious.
Auto-encoder reconstruction residuals — pass the image through a VAE and measure where the reconstruction is suspiciously good. Generated images tend to reconstruct too well because they live near the manifold the VAE was trained on.

Signal 6 — Metadata and provenance#

The least sexy signal and often the most decisive. A fresh shot from a phone carries:

EXIF: camera make, model, ISO, shutter speed, GPS, lens focal length
MakerNote: vendor-specific binary blob with white balance presets, image processor version
Color profile: sRGB vs. Display P3 with embedded ICC profile
Thumbnail: a 160×120 JPEG embedded in the EXIF that should match the main image

Diffusion outputs typically carry none of this — or worse, they carry generator metadata (Software: Stable Diffusion XL 1.0). A metadata-aware detector that finds an image without EXIF, without a thumbnail, and with a perfectly square aspect ratio common to generators is going to flag it as suspect even if every pixel signal is clean.

Signal 7 — Watermarking (the SynthID question)#

Google, Meta and OpenAI have all shipped invisible watermarks that survive most editing. SynthID-Image embeds a multi-bit signal in the latent before decoding, recoverable by a detection network even after JPEG, crop and color shift. Stable Diffusion 3 ships with a similar mark.

Watermark detection is the only signal with near-zero false-positive rate on watermarked images — by construction. It is also useless on images from generators that don't watermark, on watermarks that have been disrupted by intentional perturbation, and on watermarks the detector wasn't told about.

In practice, watermark detectors run alongside the statistical signals as a "fast positive" path: if the watermark is found, return AI immediately; if not, fall through to the rest.

Why detectors disagree#

The 2024 paper "AI-Generated Image Detection: A Survey" measured 14 public detectors on the same 10,000-image benchmark and found pairwise agreement ranging from 41% to 78%. The reasons:

Different signal mix. A frequency-only detector and a learned-classifier detector are looking at different artifacts of the same image.
Different training distributions. A 2023-trained model has never seen 2025 generators.
Different thresholds. Two detectors with identical scoring can return opposite verdicts depending on where the operator drew the line.
Different pre-processing. Resizing to 224×224 vs. 512×512 vs. native resolution materially changes every frequency-domain signal.

This is why services like Originality, Hive, and Sightengine produce divergent verdicts — and why you should never treat any single detector's output as ground truth.

The post-processing problem#

Every signal above degrades when an image is:

Re-encoded as JPEG
Resized
Screenshotted
Color-graded
Posted through Instagram, X, or WhatsApp

The 2025 Hive Moderation transparency report acknowledged that their AI-image detector loses ~22 percentage points of recall on images that have been screenshotted on iOS and re-uploaded. This is not a flaw — it is a fundamental limit. The signals detectors use are exactly the signals that lossy pipelines destroy.

Our own Photo Detector deliberately surfaces this uncertainty — instead of a single percentage, it shows the contributing signals and a confidence band, so you can see why the verdict came out the way it did.

What's coming next#

Three trends to watch in 2026:

Provenance-first standards (C2PA / Content Credentials) shifting the burden from "detect after the fact" to "verify cryptographically signed origin." Adobe, Sony, Leica and Nikon now ship cameras that sign captures at the sensor level.
Foundation-model detectors — large multi-modal models that look at semantics ("does this scene make physical sense?") in addition to pixel statistics.
Adversarially-trained humanizers that explicitly optimize against published detector ensembles. We expect this arms race to dominate the next 18 months.

Bottom line#

AI image detection in 2026 is a probabilistic sport, not a forensic certainty. Every detector is a stack of signals, each with known failure modes, combined into a number that hides more than it reveals. Use them for triage, not for adjudication — and when the stakes are high, look at the signals, not just the verdict.

If you build with detectors, expose the underlying signals to your users. If you write about them, never report a percentage without the detector's documented out-of-distribution accuracy. And if you ship generated images that need to pass detection, a real humanization pipeline does the job — but understand exactly what it's defeating.

How AI Image Detectors Actually Work — A 2026 Technical Guide

The fundamental asymmetry#

Signal 1 — Frequency-domain fingerprints#

Signal 2 — JPEG re-encoding artifacts#

Signal 3 — PRNU and sensor noise#

Signal 4 — Learned classifiers (the CNN/ViT layer)#

Signal 5 — Perplexity-style patch statistics#

Signal 6 — Metadata and provenance#

Signal 7 — Watermarking (the SynthID question)#

Why detectors disagree#

The post-processing problem#

What's coming next#

Bottom line#

Review method, sources and limits

Primary references

Frequently asked questions

Glossary terms in this article

Keep reading

PRNU, FFT & Sensor Noise — The Forensics Behind Image Authenticity

The Complete Guide to Humanizing AI-Generated Images Without Losing Quality

How Do I Know If a Photo Is AI-Generated?