Glossary / Models & Architecture

Hallucination

When AI confidently makes things up. Every model does it. The question is whether your deployment can survive it.

Models & Architecture

The Technical Definition

Hallucination occurs when an LLM generates confident, plausible-sounding outputs that are factually incorrect or completely fabricated. The model isn’t lying—it’s operating as designed. It predicts the next token based on patterns in its training data. Sometimes that results in something true. Sometimes it results in something that feels true but isn’t. The model can’t distinguish between the two. It just outputs text that matches what it learned.

What This Actually Means for Your Business

Hallucinations are not an edge case. They’re a feature of how language models work. Every LLM hallucinates. The question is whether your use case can tolerate it.

If you’re using AI for customer service, hallucinations can be catastrophic. Your AI agent confidently tells a customer that a product feature exists when it doesn’t. The customer purchases based on that information. Now you have an angry customer and a revenue problem. If you’re using AI for legal research, hallucinations can be worse—the model cites a case that doesn’t exist. Your lawyer uses it in a brief without fact-checking. That’s professional liability.

Not all use cases are equally vulnerable. If you’re using AI to draft internal emails or brainstorm ideas, hallucinations are annoying but not dangerous. Someone reads it, catches the error, and moves on. If you’re using it for financial compliance or medical advice, hallucinations are disqualifying without protective layers.

The insidious part: hallucinations are confident. The model doesn’t flag uncertainty. It delivers wrong information in the same tone as correct information. Your team can’t tell the difference without external verification. This is why so many hallucination problems get discovered in production, not testing. Your testers checked cases they knew the answers to. The hallucination happened on a novel question where nobody caught it until it impacted a real customer.

Model choice matters here too. Some models hallucinate more than others. Newer models generally hallucinate less. Models fine-tuned on specific domains hallucinate less in those domains. But none of them eliminate it entirely. There’s no zero-hallucination model. There’s only “less hallucination than the alternative.”

Companies also discover that hallucination rates vary by task type. The same model might be very accurate on factual recall but much worse at complex reasoning. You need to test your specific use case.

Reality Check

What the vendor says: “We use advanced techniques to reduce hallucinations in our AI system.”

What that means in practice: They use RAG or prompt engineering to ground outputs in verified data. That helps, but it doesn’t eliminate hallucinations. Someone still needs to verify high-stakes outputs.

What Operators Actually Do

Companies managing hallucination effectively build verification layers. For customer-facing use cases, this often means human review. The AI drafts a response. A human verifies it before it goes to the customer. That sounds like it defeats the purpose—why use AI if you still need to check everything? But it doesn’t. The AI still accelerates the human’s work. They verify in minutes instead of drafting from scratch.

Smart teams also use RAG (Retrieval-Augmented Generation) to ground responses in verified sources. Instead of letting the model rely on training data, you feed it specific documents you control. If the model is reasoning about customer refund policies, you feed it the actual policy document. That doesn’t eliminate hallucinations—the model could still misinterpret the policy—but it dramatically reduces them because you’re anchoring to ground truth.

They also build negative tests. “Show me cases where this model gets it wrong.” Find scenarios where hallucination is likely and test explicitly for it. Edge cases, complex reasoning, domain-specific knowledge—these are where hallucinations hide.

The pattern that works depends on stakes. High-stakes customer-facing work gets human verification. Medium-stakes internal work might get spot-check verification. Low-stakes draft work uses AI output directly and accepts hallucination risk as acceptable.

The companies most comfortable with hallucination are those that designed their product around it. They use AI to accelerate research, not replace it. The human still owns the decision and the accountability. The AI speeds up the thinking, but humans decide.

The Questions to Ask

  1. What’s the actual cost of hallucination in this use case? If the AI gets it wrong, what’s the impact? Who catches it? Can we design the workflow so humans verify before impact?

  2. How will you test for hallucination on this specific task? Generic benchmarks don’t tell you much. You need to test on cases your business cares about. What scenarios are highest risk?

  3. If hallucination happens in production, what’s the rollback plan? How quickly can you detect it? Who’s responsible for responding? What’s the mitigation while you fix it?

Get the next Brief

One operator. Every other Wednesday.

Plus the AI Glossary and the Failure Museum.
Real names. Real numbers. Honest analysis.