If Hive says 'AI' and Sightengine says 'real', which one is right?

Neither, alone. The two systems weight different signals — Hive leans on a learned classifier trained against generator outputs, Sightengine leans on physical forensic features. Disagreement is informative: it usually means the image has mixed-origin signals (real photo edited by AI, or AI photo passed through a real camera-style pipeline).

Are the confidence scores comparable across the two services?

No. A Hive score of 0.8 and a Sightengine score of 0.8 are calibrated against different reference distributions. Treat them as ordinal within each service, not as a shared scale.

Does running an image through a humanizer affect both detectors equally?

Usually not. A humanizer that targets pixel-level forensics (noise, PRNU, JPEG history) moves Sightengine more than Hive. A humanizer that targets generator-specific fingerprints moves Hive more than Sightengine. Layered pipelines that address both are the only ones that drop both scores simultaneously.

Hive Moderation vs Sightengine — What 2026 Image Detectors Actually Check

title: "Hive Moderation vs Sightengine — What 2026 Image Detectors Actually Check" description: "A side-by-side technical breakdown of how Hive Moderation and Sightengine score AI-generated images in 2026, where they agree, where they disagree, and how to read their confidence scores honestly." slug: "hive-vs-sightengine-2026" publishedAt: "2026-05-20" updatedAt: "2026-05-20" author: "SynthGuard Team" category: "ai-detection" tags: ["hive", "sightengine", "image-detection", "moderation", "forensics"] readingTime: 11 coverImage: "/blog/covers/hive-vs-sightengine-2026.jpg" featured: false faq:

q: "If Hive says 'AI' and Sightengine says 'real', which one is right?" a: "Neither, alone. The two systems weight different signals — Hive leans on a learned classifier trained against generator outputs, Sightengine leans on physical forensic features. Disagreement is informative: it usually means the image has mixed-origin signals (real photo edited by AI, or AI photo passed through a real camera-style pipeline)."
q: "Are the confidence scores comparable across the two services?" a: "No. A Hive score of 0.8 and a Sightengine score of 0.8 are calibrated against different reference distributions. Treat them as ordinal within each service, not as a shared scale."
q: "Does running an image through a humanizer affect both detectors equally?" a: "Usually not. A humanizer that targets pixel-level forensics (noise, PRNU, JPEG history) moves Sightengine more than Hive. A humanizer that targets generator-specific fingerprints moves Hive more than Sightengine. Layered pipelines that address both are the only ones that drop both scores simultaneously." related: ["how-ai-image-detectors-work", "ai-text-detectors-disagree", "prnu-fft-sensor-noise"]

Hive Moderation and Sightengine are the two AI image detectors most likely to be running between a creator and their audience in 2026. They sit inside Discord moderation pipelines, dating-app verification flows, stock-photo intake systems, and most large content platforms' upload paths. They also disagree with each other surprisingly often.

This is a side-by-side technical look at what each one actually checks, why they reach different conclusions on the same image, and how to read their scores honestly.

What Hive actually does#

Hive's AI-image classifier is a learned model trained on a large, evolving corpus of generator outputs paired with real photographs. It does not publish architecture details, but the behavior is consistent with a CNN-plus-transformer ensemble fine-tuned per generator family: a head that specializes in diffusion fingerprints, a head for GAN fingerprints, a head for flow-matching models, and a fusion layer that produces the single score the API returns.

The implication is that Hive's strength tracks coverage of its training set. When a new generator ships and gets adopted, Hive's recall on that generator drops sharply for a few weeks until they retrain. Older generators with extensive training data are caught at 95%+; brand-new outputs in the wild can sit closer to 70% for the first month of the model's public life.

Hive scores are robust to mild post-processing — light JPEG compression, modest color grading, small crops. They degrade meaningfully under aggressive processing: heavy denoising, re-encoding through a real camera-style pipeline, or downscaling below ~512 pixels on the shorter side.

What Sightengine actually does#

Sightengine's AI detection is built on a more classical forensic stack with a learned classifier on top. The signals it weights heavily include:

JPEG quantization history. Real photos almost always have a single, identifiable JPEG encoding pass with quantization tables consistent with a known camera or editor. Generated images often have a flat quantization profile or two stacked passes from the renderer plus a downstream save.
Sensor noise statistics. Real photos have luma-dependent grain with a characteristic spectrum. Generated images, even after noise injection, tend to have over-smooth or implausibly uniform noise.
Chroma subsampling fingerprints. The 4:2:0 boundaries in a generated image often do not align with the way a real camera would have subsampled the same scene.
DCT block-edge artifacts. The 8×8 block grid of JPEG leaves measurable discontinuities in real photos that interact with content edges in specific ways. Generated images have block discontinuities that interact with content differently because the content was never constrained by the block grid.

Sightengine's strength is being generator-agnostic. It does not need to have seen a specific model to detect it; it checks whether the image is consistent with having come from a physical camera. The cost is that high-quality post-processing pipelines that simulate camera output can move Sightengine scores substantially.

Where they agree, where they disagree#

Three regimes show up consistently when you run the same image through both services:

Both confident AI (raw generator output). A freshly exported Midjourney v8 or Imagen 4 image with default save settings will typically score above 0.9 on both. Hive recognizes the generator fingerprint; Sightengine sees the missing camera artifacts. Agreement is high and the verdict is reliable.

Both confident real (camera-original JPEG). An iPhone-original JPEG, untouched, scores below 0.1 on both. Hive sees nothing matching its generator corpus; Sightengine sees a complete, consistent camera fingerprint.

Sharp disagreement (post-processed AI or AI-edited real). This is the interesting regime. An AI image that has been re-encoded through a camera-style pipeline can score 0.3 on Sightengine (the camera-style metadata and noise pass) while still scoring 0.8 on Hive (the generator-fingerprint head still recognizes the diffusion latent structure). A real photo with AI inpainting can produce the inverse pattern: Sightengine flags the inpainted region's noise mismatch, Hive sees mostly real-photo features and underweights the inpaint.

The disagreement is not a bug — it is the most informative output. A reviewer who reads both scores and treats agreement as confirmation and disagreement as a signal to investigate gets meaningfully better calibration than one who trusts either service alone.

What the scores do not mean#

Three common misreadings cause most of the bad decisions made with these tools:

A high score is not proof. Both services are calibrated for high precision at the typical operating threshold (~0.7), but neither is designed for legal-grade attribution. A 0.92 from Hive means "this image is consistent with our generator corpus", not "this image was produced by a specific model".
A low score is not exoneration. Both services can be defeated by aware adversaries, and both have meaningful blind spots — Sightengine on heavily post-processed generations, Hive on novel generators within their training window.
Scores are not comparable across services. Hive 0.8 and Sightengine 0.8 are calibrated against different reference distributions. Operationally, you want to set independent thresholds per service and combine them with explicit logic, not average them.

Implications for creators and platforms#

For platforms, the practical takeaway is that single-vendor detection is structurally fragile. Each service has a different blind spot, and aware adversaries optimize against whichever one they think you are running. A two-source ensemble with explicit disagreement-handling is meaningfully harder to game than either service used alone.

For creators publishing AI work in good faith, the takeaway is the symmetric one: if your goal is to clear both detectors, surface treatment is not enough. Sightengine forces you to address the physical-camera signals (JPEG history, noise statistics, subsampling, block alignment). Hive forces you to address the generator fingerprint at the latent level. A humanizer that only does one of these will move one score and leave the other untouched.

This is why the layered approach matters. A pipeline that injects camera-grade noise, recreates plausible JPEG history, and adds a downstream pass that disrupts diffusion-specific latent structure will move both detectors. Any pipeline that does less will move one and leave the other as the open flank.

If you want to see a layered image humanizer that targets both physical-camera forensics and generator-specific fingerprints, try the Photo Humanizer. Processing runs entirely in your browser, so your originals never leave your device.

Hive Moderation vs Sightengine — What 2026 Image Detectors Actually Check

What Hive actually does#

What Sightengine actually does#

Where they agree, where they disagree#

What the scores do not mean#

Implications for creators and platforms#

Review method, sources and limits

Primary references

Frequently asked questions

Keep reading

How AI Image Detectors Actually Work — A 2026 Technical Guide

AI Text Detectors: Why GPTZero, Originality & Turnitin Disagree

PRNU, FFT & Sensor Noise — The Forensics Behind Image Authenticity