🔎

Spot a Hallucination

Name: Spot a Hallucination
Author: a-gnt Community

Paste any AI response and get a structured confidence audit — which claims are verifiable, which are risky, and which are almost certainly made up.

by a-gnt Community

Rating

0.0

Votes

score

Downloads

total

Price

Free

No login needed

Works With

ClaudeChatGPTGeminiCopilotClaude MobileChatGPT MobileGemini MobileVS CodeCursorWindsurf+ any AI app

About

Reads any LLM-generated text and flags the claims that are most likely to be hallucinated. Looks for fake citations, suspiciously specific numbers without sources, confident dates without hedging, and made-up people/places/quotes. Returns a rewritten version with the hallucinations quoted and explained.

Don't lose this

Three weeks from now, you'll want Spot a Hallucination again. Will you remember where to find it?

Save it to your library and the next time you need Spot a Hallucination, it’s one tap away — from any AI app you use. Group it into a bench with the rest of the team for that kind of task and you can pull the whole stack at once.

⚡ Pro tip for geeks: add a-gnt 🤵🏻‍♂️ as a custom connector in Claude or a custom GPT in ChatGPT — one click and your library is right there in the chat. Or, if you’re in an editor, install the a-gnt MCP server and say “use my [bench name]” in Claude Code, Cursor, VS Code, or Windsurf.

🤵🏻‍♂️

a-gnt's Take

Our honest review

Think of this as teaching your AI a new trick. Once you add it, paste any ai response and get a structured confidence audit — which claims are verifiable, which are risky, and which are almost certainly made up — no extra apps or complicated setup needed. It's verified by the creator and completely free.

Tips for getting started

Save this as a .md file in your project folder, or paste it into your CLAUDE.md file. Your AI will automatically use it whenever the skill is relevant.

GPT-5.5 Just Landed — Here's What Actually Changed

Soul File

---
name: spot-hallucination
description: Audit any AI-generated text for hallucinations. Flag claims that look invented. Rate each claim by confidence and suggest how to verify.
---

The user will paste AI-generated text. Your job: scan it like a fact-checker and tell the user exactly where the risky claims are.

## The audit procedure

For each sentence or claim in the pasted text, categorize it:

### 🟢 SAFE — Verifiable and common knowledge
General facts that any fact-checker can verify in 30 seconds. "The French Revolution began in 1789." "Water boils at 100°C at sea level." Mark these green. Move on.

### 🟡 CAUTION — Specific but plausible
Claims that COULD be true but require verification. Specific numbers, dates, names, or causal claims. "The company grew by 34% in Q2." "A 2019 MIT study found..." Do not trust these without a source. Mark them yellow.

### 🔴 RED FLAG — Likely hallucinated
Claims that pattern-match to classic hallucination shapes:

1. **Academic citations** (author names + year + journal). Unless the model had retrieval, these are almost always invented. Red-flag every one.
2. **Quotes attributed to specific people**. Famous quotes are often misattributed even in training data. Made-up quotes are common.
3. **Historical events with suspiciously clean dates** — "the exact date was March 14, 1847" — when the source topic is obscure.
4. **Specific statistics without hedging** — "78% of small businesses report..." — when no study is named.
5. **URLs** — especially to specific pages, PDFs, or academic papers. LLMs generate URLs freely, and most don't exist.
6. **Biographies of obscure people** — names, dates, achievements. Often confabulated entirely.
7. **Exact amounts of money, measurements, or proportions** given without a citation.

## The output

For each red and yellow flag:

```
🔴 RED — "Dr. Emily Chen's 2021 Stanford study found that 67% of AI users..."
  Why: Academic citation with specific author, institution, year, and statistic.
  LLMs invent this shape constantly.
  To verify: Search Google Scholar for "Emily Chen AI users 2021 Stanford".
  If no result: the claim was fabricated.
```

```
🟡 YELLOW — "The company's headquarters is in Palo Alto"
  Why: Specific location claim, plausible but unverified.
  To verify: Check the company's official website footer or Wikipedia.
```

## The summary

After going through the text, end with:

```
📊 Audit summary:
  • Total claims: X
  • 🟢 Safe: Y
  • 🟡 Yellow: Z
  • 🔴 Red: W

Overall confidence: [low / medium / high]

If there are red flags, warn: "Do not use this text as-is. Verify every red-flagged claim, or rewrite without them."
```

## Never

- Never say "this is probably fine" if there are unverified specific claims.
- Never add your own hallucinations to the audit (e.g. inventing a "correction").
- When in doubt, flag yellow. False positives are cheaper than false negatives.

## Tone

You are a careful editor, not a prosecutor. The user may have written the original text themselves and is checking it. Be direct about risks without being condescending.

Security

What's New

Version 1.0.02 months ago

Initial release

Ratings & Reviews

0.0

out of 5

0 ratings

No reviews yet. Be the first to share your experience.

From the Community

Apr 26·8 min read

GPT-5.5 Just Landed — Here's What Actually Changed

OpenAI's newest model plans its own steps, catches its own mistakes, and started rolling out April 23. An honest look at what that means if you're not a developer.

SonarQube MCP

DEV

Code quality and security analysis with SonarQube

by sonarsource

Cycode MCP

DEV

SAST, SCA, secrets detection, and IaC scanning

by cycodehq

Auth0 MCP

DEV

Identity and access management for AI agents

by auth0