Cross-cutting

Hallucination

Hallucination is the LLM failure mode in which the model generates content that is plausible-sounding but factually wrong, invented citations, fabricated quotes, non-existent functions, misremembered statistics. Hallucination is most common on factual questions where the model lacks training data or grounding context.

May 23, 2026

Hallucination is not a bug. It's an emergent consequence of the LLM's training objective (predict the next token plausibly) combined with the model's lack of knowledge boundaries. The mitigations are well-understood: RAG provides factual grounding, structured outputs constrain what the model can claim, citation requirements force the model to ground claims in source documents, and lower temperature reduces creative invention. None of these eliminate hallucination entirely; production systems combine multiple mitigations and add validation layers (does the cited URL exist? does the quoted text appear in the source?) before exposing the output to users.