Limits and Trust 5 min read

Why AI Hallucinates

You have probably heard that AI makes things up. It might confidently cite a research paper that does not exist, give you a wrong date for a real event, or invent a quote from a real person.

This is called hallucination. Understanding why it happens helps you know when to trust AI and when to verify.

AI does not know what it knows

Here is the key thing to understand: a language model does not have a fact-checking system built in. It does not know the difference between things it knows reliably and things it is uncertain about.

When you ask a question, it does not search its knowledge and return a verified answer. It generates text that is likely to follow your question based on patterns it learned. Most of the time, those patterns produce accurate text. Sometimes they produce confident-sounding nonsense.

Think of it like a very confident guesser

Imagine someone who has read a huge amount and absorbed a lot. When you ask them a question, they always answer. They never say “I am not sure.” They give you a fluent, plausible-sounding response, even when they are filling in gaps from inference rather than from actual knowledge.

That is the model’s default behaviour. It was trained to produce good-sounding text, not to flag uncertainty.

Why the patterns sometimes lead it astray

Language models learn that certain kinds of sentences follow certain kinds of questions. “Who invented X?” tends to be followed by a name. “What year did Y happen?” tends to be followed by a date.

If the model does not have reliable data for the specific answer, it may still generate a plausible-looking one because the shape of the answer fits the pattern. It is not lying. It is doing what it was trained to do: produce likely-sounding text.

Where hallucination is most likely

  • Obscure or niche facts. Rarely discussed topics have less training data, so patterns are weaker.
  • Recent events. Models have a training cutoff. Anything after that date is unknown territory.
  • Specific numbers, dates, and citations. These need to be precisely correct, and the model has no way to verify precision.
  • Quotes. The model can generate text in someone’s style without it being something they actually said.

What you can do

Verify anything that matters. Treat AI output on factual questions the same way you would treat a tip from a well-read friend: plausible and worth investigating, not automatically correct.

Ask for sources and check them. The model may generate real-looking citations that do not exist. If a source matters, look it up yourself.

Notice confident language. Phrases like “It is well-established that…” or “According to a 2019 study…” are not evidence of accuracy. They are just patterns the model learned from academic writing.

Use AI for what it is reliable at. Drafting, summarising, explaining, brainstorming, writing code. These tasks do not require the same kind of factual precision as researching specific claims.

You now understand this

AI hallucination happens because language models generate text based on patterns, not verified facts. They have no built-in way to distinguish what they know reliably from what they are inferring. The result is confident-sounding output that can be wrong. Verify anything that matters, especially specific facts, dates, numbers, and citations.

Next lesson What Is RAG? →