Reliability

The Anti-Hallucination Stack: 3 Claude Instructions

6 minute readUpdated June 2026Explore more

TL;DR

Hallucination is not a rare edge case - it is a default behaviour. Models are trained to sound fluent and confident whether they know a fact or are guessing. Three plain-text instructions close that gap: one flags uncertainty, one labels where each claim came from, and one tells Claude to stop and ask instead of inventing. Stack all three at the top of your system prompt.

What hallucination actually costs you

Claude, like every large language model, is trained to produce fluent, confident text - and that training does not distinguish between facts it knows and facts it is guessing. The output sounds identical either way. One wrong stat in a proposal can cost a deal; one fabricated quote in a post can dent credibility you spent months building. Most people fix this by checking everything manually, which erases the time savings that made AI worth using. The smarter fix is to change how Claude handles uncertainty before it writes a word.

Instruction 1: flag what it is not sure about

By default Claude states uncertain things as if they are fact. This instruction tells it to label its confidence on any claim that could be wrong. When you see 'I believe' or 'you may want to verify this', you know which lines to check and which to trust - so you spot-check the flagged lines instead of auditing the whole document.

Instruction 1 - uncertainty flagWhenever you are not fully confident in a fact, statistic, date, name or
claim, explicitly flag it. Use phrases like 'I believe', 'I'm not
certain but', or 'you may want to verify this'. Do not present uncertain
information as established fact. If you do not know something, say so
directly rather than guessing.

Instruction 2: make it tell you where it got the information

This forces source transparency. Without it, Claude blends three things into one seamless answer: information from your documents, information from its training data, and its own inferences. With it, every major claim is tagged with its origin, so you know immediately which claims are solid and which need a source check.

Instruction 2 - source transparencyWhen you include facts, statistics, research findings or specific claims,
indicate the basis for each one using one of three labels: (1) 'Based on
the documents you provided:' for my materials, (2) 'From my training
data:' for your knowledge base, or (3) 'This is my inference:' for
conclusions you are drawing. Apply these labels consistently.

Instruction 3: stop rather than make things up

The most important one. It addresses the root cause: Claude's tendency to complete a task even when it lacks the information to do it accurately. Ask for a case study with specific numbers it does not have, and by default it will generate plausible-sounding numbers to be helpful. This instruction gives it explicit permission to stop and ask instead - reframing an incomplete answer as the correct behaviour when real information is missing.

Instruction 3 - refusal over fabricationIf you do not have sufficient accurate information to complete a task or
answer a question, do not guess or fabricate details to fill the gap.
Instead, tell me specifically what information you are missing and ask me
to provide it. An incomplete response that asks the right follow-up
questions is more useful than a complete one built on invented
information. Accuracy is more important than completeness.

How to use all three together

Each works on its own; combined they create a system where Claude flags uncertainty, tells you where every claim came from, and asks for missing information instead of inventing it. Place all three at the top of your system prompt (or in your CLAUDE.md) so they apply to every message automatically - set once, running in the background on every output. They are plain text and work in Claude.ai, in the API, and in any tool built on Claude.

Common questions

  • Do these instructions actually stop hallucination completely?

    They dramatically reduce it, not eliminate it. Instruction 3 is the heavy lifter - it removes most fabricated stats and quotes by giving Claude permission to stop and ask. The other two make whatever remains easy to spot, so your review shrinks to the flagged lines instead of the whole document.

  • Where exactly do I put them?

    At the top of your system prompt, or in your CLAUDE.md if you use Claude Code. That way they apply to every message in the conversation automatically and you never have to re-paste them.

  • Will flagging uncertainty make Claude sound wishy-washy?

    Only on claims that genuinely deserve a flag. Claude still states things it is confident about plainly. The labels appear on the facts and figures you would have had to check anyway - which is the point.

  • Do they work outside Claude Code?

    Yes. They are plain text with no special formatting, so they work in Claude.ai, in API calls, and in any tool built on top of Claude. No technical setup required.

  • Why does Claude fabricate in the first place?

    It is trained to produce fluent, helpful, confident text, and that training does not separate known facts from guesses - the output reads the same either way. Left without instruction, it fills gaps rather than leaving them. These instructions change that default.

  • Which one should I add if I only add one?

    Instruction 3 - refusal over fabrication. It targets the root cause directly by telling Claude to ask for missing data instead of inventing it, which is where the most damaging errors come from.

Want the rest of the reliability stack?

Get the other 1 in the anti-hallucination stack - free, with 5,000+ builders.

Join the Club