01. What It Is
Hallucination in AI refers to a model generating content that is factually incorrect, logically inconsistent, or unsupported by any real source, while presenting it with apparent confidence. The term comes from psychology, where a hallucination is a perception with no external stimulus. In LLMs, the analogy is a confident statement with no factual grounding.
Two subtypes are commonly distinguished:
Factuality errors: The model states something objectively false. Examples include fabricated citations (the paper exists but says something different, or does not exist at all), wrong dates, incorrect scientific claims, nonexistent legal cases.
Faithfulness errors: The model distorts or misrepresents something it was actually given. A summary that contradicts the source document, or a response that ignores an explicit constraint in the prompt, are faithfulness failures.
Baseline hallucination rates for frontier models on mixed production tasks are typically reported in the 3-20% range, with higher rates in low-resource languages, specialized domains, and tasks that push the model to the edge of its training distribution.