AI Detection Glossary

Last updated: April 2026

Essential terms for understanding AI-generated content detection. These definitions cover the core concepts behind our detection tools and the broader AI content landscape.

📖 32 Terms Defined 🔤 A–Z Coverage 🧠 Technical & Accessible

Terms

Adversarial Attack

A technique where imperceptible perturbations are added to inputs (images, text, audio) to fool AI detection models into producing incorrect classifications. Adversarial attacks are used to bypass deepfake detectors and text classifiers, making robustness testing essential for any detection system.

AI Hallucination

A phenomenon where a generative AI model produces factually incorrect or entirely fabricated information with apparent confidence. Common examples include invented citations, fictitious people, and fake statistics. Hallucinations are a useful detection signal, since human-written text rarely contains verifiably fabricated references.

AI Watermarking

The practice of embedding hidden, machine-readable signals into AI-generated content at creation time to enable later verification of its AI origin. Techniques include altering token selection distributions for text (e.g., Google's SynthID) and frequency-domain signatures for images. Increasingly mandated by AI regulation frameworks such as the EU AI Act.

Artifact (Visual)

An unintended visual imperfection in AI-generated images or video — such as malformed hands, asymmetric features, incoherent text, or unnatural backgrounds. Artifacts are primary signals for both human inspectors and automated classifiers to identify synthetic imagery, though they grow subtler as models improve.

Audio Deepfake

A synthetic voice recording created using AI to mimic a specific person's vocal characteristics. Modern systems require only seconds of reference audio. Detection relies on acoustic analysis of unnatural prosody, spectral artifacts, and statistical anomalies inaudible to humans.

↑ Back to top

Terms

Burstiness

A measure of variation in sentence length and complexity within a text. Human writing naturally alternates between short and long sentences (high burstiness), while AI-generated text tends toward uniform sentence length (low burstiness). A key signal used alongside perplexity in AI text detection tools like GPTZero.

↑ Back to top

Terms

C2PA Coalition for Content Provenance and Authenticity

An open technical standard for embedding cryptographically signed provenance metadata into media files, recording creation history, tools used, and editing chain. Co-founded by Adobe, Microsoft, and others. C2PA credentials are tamper-evident — modifying the file invalidates the signature. Adoption is growing in journalism, social media, and hardware cameras.

Classifier (AI Detection)

A machine learning model trained to categorize content as "human-generated" or "AI-generated." Text classifiers analyze statistical patterns in language; image classifiers examine visual features and frequency signatures. No classifier achieves perfect accuracy — all have false positive and false negative rates — and performance degrades as generative models evolve, requiring continuous retraining.

↑ Back to top

Terms

Deepfake

Synthetic media — typically video, image, or audio — in which a person's likeness or voice has been convincingly replaced or manipulated using deep learning. The term combines "deep learning" and "fake." Types include face swaps, voice cloning, and full synthesis. Detection methods analyze facial boundaries, blinking patterns, lighting, and physiological signals.

Diffusion Model

A generative AI architecture that creates images by learning to reverse a gradual noise-addition process. Starting from pure noise, the model iteratively denoises to produce coherent images guided by text prompts. Used by Stable Diffusion, DALL-E 3, and Midjourney. Diffusion outputs have different artifacts from GAN images, requiring distinct detection strategies.

Disinformation

False information deliberately created and spread to deceive or manipulate. Unlike misinformation (unintentional), disinformation implies malicious intent. AI-generated deepfakes, fake news articles, and fabricated audio have dramatically increased the scale and sophistication of disinformation campaigns.

↑ Back to top

Terms

EXIF Metadata Exchangeable Image File Format

Technical data embedded in image files by recording devices — camera model, GPS, timestamp, settings. Genuine photographs contain rich EXIF data; AI-generated images typically have absent or software-generated metadata. Useful as a screening signal, though easily stripped or forged.

↑ Back to top

Terms

Face Swap

A deepfake technique replacing one person's face with another's in video or image while preserving original body movements. Modern methods require only a single reference image. Detection relies on boundary artifacts, lighting mismatches, and unnatural eye movement at the face-neck edge.

False Positive / False Negative

In AI detection, a false positive occurs when authentic human content is incorrectly flagged as AI-generated. A false negative occurs when AI-generated content is missed and classified as human. Balancing these error rates is a central challenge — aggressive detectors produce more false positives, while conservative ones miss more AI content.

↑ Back to top

Terms

GAN Generative Adversarial Network

A deep learning architecture with two competing networks — a Generator creating synthetic data and a Discriminator distinguishing real from fake. GANs were the dominant technology behind early deepfakes (StyleGAN, DeepFaceLab). Their outputs leave characteristic artifacts — asymmetric features, wavy backgrounds, frequency-domain fingerprints — that remain key detection signals.

Generative AI

AI systems capable of creating new content — text, images, audio, video, or code — rather than merely analyzing existing data. Encompasses LLMs (GPT, Claude), image generators (DALL-E, Midjourney, Stable Diffusion), and audio/video synthesis tools. The rapid improvement of generative AI is the core driver behind the need for AI content detection tools.

GPT Generative Pre-trained Transformer

OpenAI's family of autoregressive language models (GPT-3.5 through GPT-4+). GPT models are the most commonly detected AI text type, forming the core of classifiers like GPTZero. ChatGPT normalized AI text generation for mainstream users, triggering widespread deployment of detection tools.

↑ Back to top

Terms

Hybrid Content

Content combining human-authored and AI-generated components — such as an LLM draft substantially edited by a person. The most challenging category for detection because human editing disrupts statistical AI signals. Studies show light editing can reduce classifier accuracy from above 90% to below 60%.

↑ Back to top

Terms

Inpainting

An AI image editing technique that fills selected regions with plausible content based on surrounding context and optional text prompts. Only part of the image is synthetic, making detection particularly challenging. Forensic methods analyze noise patterns and statistical inconsistencies at region boundaries.

↑ Back to top

Terms

Lexical Diversity

A measure of vocabulary richness in a text — the ratio of unique words to total words. AI-generated text often exhibits lower lexical diversity than human writing, favouring common, high-probability vocabulary. Combined with perplexity and burstiness, lexical diversity analysis helps detection tools identify AI-authored content.

LLM Large Language Model

A neural language model trained on massive text corpora using transformer architecture, capable of generating fluent natural language. Examples: GPT-4, Claude, Gemini, LLaMA. LLMs leave statistical traces in outputs — particularly in token probability distributions — that detection classifiers exploit, though signals diminish as models improve.

↑ Back to top

Terms

Media Literacy

The ability to critically evaluate media content — including AI-generated material — for authenticity, bias, and intent. As automated detection tools have inherent limitations, media literacy education is essential for empowering individuals to question suspicious content, verify sources, and understand the capabilities of generative AI.

Metadata

Structured information embedded in digital files describing properties, origin, and history. Types include EXIF (camera data), XMP, IPTC, and C2PA manifests. Genuine photographs carry rich metadata; AI-generated images often have sparse or software-generated metadata. Easily stripped or modified, limiting standalone reliability.

Misinformation

False information spread without deliberate intent to deceive. AI contributes to misinformation when users unknowingly share hallucinated facts, AI-generated images misrepresenting events, or deepfakes believed authentic. Combating AI-enabled misinformation requires both detection technology and media literacy education.

↑ Back to top

Terms

Neural Network

A computational model of interconnected layers that transform input data through weighted operations to produce predictions. Deep neural networks are the foundation of all modern AI — language models, image generators, and deepfake detectors alike. Key types include CNNs (images), RNNs (sequences), and transformers (parallel sequence processing).

Natural Language Processing NLP

The branch of AI focused on enabling computers to understand, interpret, and generate human language. NLP underpins both AI text generation (LLMs) and AI text detection (classifiers that analyze linguistic patterns). Core NLP tasks relevant to detection include tokenization, sentiment analysis, and statistical language modelling.

↑ Back to top

Terms

Perplexity

A measure of how "surprising" text is to a language model — how uncertain the model is about each next word. Human writing exhibits higher perplexity (more creative word choices) than AI text, which selects high-probability tokens for predictable prose. A primary detection signal in tools like GPTZero, often combined with burstiness analysis.

Prompt Engineering

The practice of designing optimized instructions for AI models to elicit desired outputs. In detection evasion, users prompt LLMs to "write like a human," vary sentence structure, or include deliberate errors to artificially raise burstiness and perplexity. Detection researchers study prompt patterns to improve classifier robustness.

↑ Back to top

Terms

Stable Diffusion

An open-source text-to-image diffusion model by Stability AI. Its open nature made it the most widely deployed image generator, spawning thousands of community fine-tunes. Runs on consumer GPUs without API content policies or watermarking, complicating provenance. Detection requires classifiers trained on diverse community variants.

Synthetic Media

Any digital content — text, image, audio, video — fully or substantially generated by AI rather than directly captured or authored by a human. Encompasses AI articles, generated artwork, deepfake videos, and cloned audio. Not inherently harmful, but potential for deception makes detection and provenance verification critical.

↑ Back to top

Terms

Temperature

A parameter controlling the randomness of an LLM's output. Low temperature (e.g., 0.2) produces predictable, repetitive text; high temperature (e.g., 1.0+) produces more varied and creative output. Temperature affects detectability — low-temperature AI text is easier to detect due to its high predictability, while high-temperature text mimics human variability.

Token

The fundamental unit of text processed by LLMs — typically a word, sub-word, or character sequence. LLMs generate text by predicting probability distributions over tokens sequentially. The distribution of token probabilities — which tokens were predictable versus surprising — is the statistical basis for perplexity-based AI text detection.

Transformer

The neural network architecture introduced in 2017 ("Attention Is All You Need") that replaced recurrent processing with parallelizable self-attention. Foundation of all modern LLMs (GPT, BERT, Claude, LLaMA, Gemini) and many image generation models. Understanding transformers is key to understanding why AI text exhibits its characteristic statistical properties.

↑ Back to top

Terms

Video Deepfake

Synthetic video depicting a real person performing actions or speaking words that never occurred. Types include face swaps, reenactment, body puppeting, and full text-to-video synthesis. Detection methods include temporal consistency analysis, physiological signal detection, audio-visual alignment verification, and model-specific artifact identification.

Voice Cloning

AI synthesis of a realistic imitation of a specific person's voice from a short audio sample — as few as 3–10 seconds in modern systems. Used in entertainment and accessibility but also in voice phishing attacks and political disinformation. Detection relies on spectrogram analysis, prosody irregularities, and liveness verification.

↑ Back to top

Terms

Watermarking (Digital)

The practice of embedding identifying or authenticating information into digital content. AI watermarking embeds provenance signals at content creation time (see AI Watermarking). Robust watermarks survive compression, resizing, and format conversion. Standards are being developed through C2PA and national AI regulation frameworks to enable transparency about AI-generated content at scale.

↑ Back to top

Terms

Zero-Shot Detection

A detection approach where a classifier identifies AI content from models it was never trained on, by generalizing from universal properties of AI generation. Important because the universe of generative models grows too fast for model-specific training. Current zero-shot detectors achieve 70–85% accuracy on unseen models versus 90–97% on known models.

↑ Back to top

A

Attaque adversariale

Modification d'une entrée pour tromper un classifieur IA tout en restant imperceptible pour l'humain. Utilisée pour contourner les détecteurs de deepfakes et de texte IA.

Hallucination IA

Génération par un LLM d'informations fausses présentées avec confiance — citations inventées, statistiques fictives. Signe révélateur du texte IA lors de la vérification des faits.

Filigrane IA (AI Watermarking)

Signaux invisibles intégrés dans le contenu IA à sa création pour permettre une vérification ultérieure. Google SynthID et d'autres systèmes modifient subtilement la distribution des tokens ou des fréquences d'image.

Artefact visuel

Défaut visuel dans une image ou vidéo IA — mains malformées, arrière-plans incohérents, textes illisibles. Signal principal pour identifier l'imagerie synthétique.

Deepfake audio

Enregistrement vocal synthétique imitant une personne réelle à l'aide de l'IA. Les systèmes modernes nécessitent seulement quelques secondes d'audio source.

B–C

Burstiness

Variation de longueur des phrases dans un texte. L'écriture humaine alterne phrases courtes et longues (burstiness élevée) ; le texte IA tend vers une longueur uniforme. Signal de détection clé.

C2PA

Standard technique ouvert pour intégrer des métadonnées de provenance cryptographiquement signées dans les fichiers médias. Co-fondé par Adobe et Microsoft.

Classifieur (détection IA)

Modèle d'apprentissage automatique entraîné pour distinguer le contenu IA du contenu humain. Aucun classifieur n'atteint une précision parfaite ; la performance se dégrade avec l'évolution des modèles génératifs.

D–F

Deepfake

Média synthétique dans lequel l'apparence ou la voix d'une personne a été remplacée ou manipulée par deep learning. Types : échange de visage, clonage vocal, synthèse complète.

Modèle de diffusion

Architecture IA générant des images en inversant un processus de bruitage progressif. Utilisé par Stable Diffusion, DALL-E 3 et Midjourney.

Désinformation

Information fausse diffusée intentionnellement pour tromper. Les deepfakes et le contenu IA ont considérablement augmenté l'échelle des campagnes de désinformation.

Métadonnées EXIF

Données techniques intégrées dans les images par les appareils — modèle de caméra, GPS, horodatage. Les images IA manquent typiquement de métadonnées EXIF authentiques.

Échange de visage (Face Swap)

Technique deepfake remplaçant le visage d'une personne par celui d'une autre. La détection repose sur les artefacts de bordure, les incohérences d'éclairage et les mouvements oculaires non naturels.

Faux positif / Faux négatif

Un faux positif survient quand un contenu humain est identifié à tort comme IA. Un faux négatif survient quand du contenu IA passe inaperçu. Équilibrer ces taux d'erreur est un défi central de la détection.

G–I

GAN (Réseau génératif adversarial)

Architecture avec un générateur et un discriminateur en compétition. Technologie fondamentale des premiers deepfakes. Laisse des artefacts caractéristiques dans les images — signaux de détection clés.

IA générative

Systèmes IA capables de créer du nouveau contenu — texte, images, audio, vidéo. L'amélioration rapide de l'IA générative est le moteur principal du besoin d'outils de détection.

GPT (Generative Pre-trained Transformer)

Famille de modèles de langage d'OpenAI. GPT est le type de texte IA le plus couramment détecté. ChatGPT a normalisé la génération de texte IA pour le grand public.

Contenu hybride

Contenu combinant éléments humains et IA — le plus difficile à détecter car l'édition humaine masque les signaux statistiques de l'IA.

Inpainting

Technique d'édition IA remplissant des régions sélectionnées d'une image. Seule une partie de l'image est synthétique, rendant la détection particulièrement difficile.

L–N

Diversité lexicale

Mesure de la richesse du vocabulaire dans un texte. Le texte IA présente souvent une diversité lexicale plus faible que l'écriture humaine, favorisant un vocabulaire commun et prévisible.

LLM (Grand modèle de langage)

Modèle de réseau de neurones entraîné sur de vastes corpus de texte. GPT-4, Claude, Gemini et LLaMA en sont des exemples. Leur sortie peut être détectée par analyse de perplexité et de burstiness.

Littératie médiatique

Capacité à évaluer critiquement les contenus médiatiques pour leur authenticité et leur intention. Complément essentiel aux outils de détection automatisés.

Métadonnées

Informations structurées décrivant les propriétés et l'origine d'un fichier numérique. Les photographies authentiques contiennent des métadonnées riches ; les images IA en ont souvent peu.

Mésinformation

Information fausse diffusée sans intention de tromper. L'IA y contribue quand des utilisateurs partagent involontairement des faits hallucinés ou des images IA trompeuses.

Réseau de neurones

Système de couches interconnectées transformant des données en prédictions. Fondement de tous les systèmes IA modernes — modèles de langage, générateurs d'images et détecteurs.

Traitement du langage naturel (NLP)

Branche de l'IA permettant aux ordinateurs de comprendre et générer le langage humain. Sous-tend à la fois la génération de texte IA et sa détection.

P–S

Perplexité

Mesure de la prévisibilité d'un texte. Le texte humain a une perplexité élevée (imprévisible) ; le texte IA a une perplexité faible (prévisible). Outil de détection fondamental.

Prompt engineering

Conception d'instructions optimisées pour les modèles IA. En évasion de détection, les utilisateurs demandent aux LLM d'« écrire comme un humain » pour tromper les classifieurs.

Stable Diffusion

Modèle de génération d'images open-source. Sa disponibilité publique complique la provenance car il fonctionne sans politique de contenu API ni filigranage.

Médias synthétiques

Tout contenu numérique généré substantiellement par l'IA — texte, image, audio, vidéo. Comprend les deepfakes, les images générées et les voix clonées.

T–Z

Température

Paramètre contrôlant l'aléatoire de la sortie d'un LLM. Une température basse produit du texte prévisible et facile à détecter ; une température élevée imite la variabilité humaine.

Token

Unité fondamentale de texte traitée par les LLM. La distribution des probabilités de tokens est la base statistique de la détection par perplexité.

Transformer

Architecture de réseau de neurones introduite en 2017, fondamentale pour tous les LLM modernes. Utilise des mécanismes d'auto-attention pour traiter les séquences en parallèle.

Deepfake vidéo

Vidéo synthétique montrant une personne réelle effectuant des actions qui n'ont jamais eu lieu. La détection analyse la cohérence temporelle, les signaux physiologiques et l'alignement audio-visuel.

Clonage vocal

Synthèse IA imitant la voix d'une personne à partir de quelques secondes d'audio. Utilisé dans des fraudes vocales et la désinformation politique.

Filigrane numérique

Intégration d'informations d'authentification dans le contenu numérique. Les filigranes robustes survivent à la compression et au recadrage. Des standards sont développés via le C2PA et les cadres réglementaires IA.

Détection zéro-shot

Détection de contenu IA provenant de modèles jamais vus pendant l'entraînement. Les détecteurs zéro-shot atteignent 70–85 % de précision sur des modèles inconnus contre 90–97 % sur des modèles connus.

Lectures complémentaires

Dernière mise à jour : avril 2026

AI Detection Glossary

A Terms

B Terms

C Terms

D Terms

E Terms

F Terms

G Terms

H Terms

I Terms

L Terms

M Terms

N Terms

P Terms

S Terms

T Terms

V Terms

W Terms

Z Terms

Further Reading

A

B–C

D–F

G–I

L–N

P–S

T–Z

Lectures complémentaires

Terms

Terms

Terms

Terms

Terms

Terms

Terms

Terms

Terms

Terms

Terms

Terms

Terms

Terms

Terms

Terms

Terms

Terms