BEACON Free Tool

Is your site blocking AI crawlers?

If your robots.txt disallows GPTBot, PerplexityBot, ClaudeBot or Google-Extended, the AI products behind them may be unable to read your site, and are less likely to recommend it. Enter your URL and Beacon fetches your live robots.txt to show exactly which AI crawlers are allowed or blocked. No signup.

GPTBot (ChatGPT): Checks whether OpenAI’s crawler is allowed to read and learn from your pages.
PerplexityBot: Confirms whether Perplexity can access your content to cite you in answers.
ClaudeBot: Checks Anthropic’s crawler access for Claude.
Google-Extended: The toggle that controls whether your content feeds Google’s Gemini and AI features.
Reads your live robots.txt: A real fetch of your actual /robots.txt, not a cached guess or fabricated result.
llms.txt too: Also reports whether you publish an /llms.txt to guide AI engines to your best content.

Why AI-crawler access matters

AI engines can only recommend what they can read. A single overly broad Disallow line in robots.txt can quietly remove you from ChatGPT, Perplexity, Claude and Gemini. It is one of the most common visibility mistakes, and one of the hardest to notice.

  • GPTBot — OpenAI / ChatGPT
  • OAI-SearchBot — ChatGPT search
  • PerplexityBot — Perplexity
  • ClaudeBot — Anthropic / Claude
  • Google-Extended — Google Gemini & AI features

Allowing vs blocking — your choice

Some sites deliberately block AI crawlers to protect content; most want the visibility. Either way, you should know your current state. This tool simply reports what your robots.txt says today, so the decision is informed rather than accidental.

How to unblock AI crawlers

If a crawler is blocked, remove or narrow the relevant Disallow rule for that user-agent in robots.txt, then re-check. After access is fixed, make sure your content is in static HTML (not JavaScript-only) so crawlers actually see it — a free Beacon account measures that render gap across your site.
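
As a rough illustration of the remove-or-narrow step, here is a minimal sketch using Python's standard urllib.robotparser. The BEFORE and AFTER rules, the example.com URLs and the access_report helper are assumptions made up for this example, not Beacon's implementation. Python's parser applies the first matching rule in file order, which can differ from crawlers that prefer the most specific rule, so treat the output as an approximation.

```python
from urllib.robotparser import RobotFileParser

# User-agent tokens named on this page.
AI_AGENTS = ["GPTBot", "OAI-SearchBot", "PerplexityBot", "ClaudeBot", "Google-Extended"]

# Hypothetical robots.txt that blocks every crawler from the whole site.
BEFORE = """\
User-agent: *
Disallow: /
"""

# Hypothetical narrowed version: only /private/ stays off limits.
AFTER = """\
User-agent: *
Disallow: /private/
"""


def access_report(robots_txt: str, url: str) -> dict[str, bool]:
    """Return {agent: allowed?} for one URL under the given robots.txt text."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return {agent: parser.can_fetch(agent, url) for agent in AI_AGENTS}


page = "https://example.com/pricing"
for label, rules in (("before", BEFORE), ("after", AFTER)):
    print(label, access_report(rules, page))
```

Running the same report before and after an edit is a quick way to confirm that narrowing the rule changed what each crawler is allowed to fetch.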

FAQ

How do I know if GPTBot is blocked?

Paste your URL above. Beacon reads your live robots.txt and reports whether GPTBot, PerplexityBot, ClaudeBot and Google-Extended are allowed or disallowed.

Should I block AI crawlers?

It depends on your goals. Blocking asks compliant crawlers not to use your content for training or answers, but it can also remove you from AI recommendations. Most brands seeking visibility should allow them.

Is this result accurate?

Yes — it’s a real fetch of your /robots.txt at check time, parsed for each AI user-agent. Nothing is fabricated.
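
For readers who want to reproduce a check like this themselves, the following is a minimal sketch, assuming Python's standard library only. The helper names (normalize, robots_url, check_live) are hypothetical and this is not Beacon's code. It maps whatever URL you enter to the /robots.txt at the root of that host, fetches it, and asks the parser about each AI user-agent.

```python
from urllib.parse import urlsplit, urlunsplit
from urllib.robotparser import RobotFileParser

AI_AGENTS = ("GPTBot", "OAI-SearchBot", "PerplexityBot", "ClaudeBot", "Google-Extended")


def normalize(site_url: str) -> str:
    """Add a scheme when the input is a bare domain such as 'example.com'."""
    site_url = site_url.strip()
    return site_url if "://" in site_url else "https://" + site_url


def robots_url(site_url: str) -> str:
    """Map any page URL to the /robots.txt at the root of its host."""
    parts = urlsplit(normalize(site_url))
    return urlunsplit((parts.scheme, parts.netloc, "/robots.txt", "", ""))


def check_live(site_url: str) -> dict[str, bool]:
    """Fetch the live robots.txt and report access per AI user-agent."""
    page = normalize(site_url)
    parser = RobotFileParser(robots_url(page))
    # read() performs a real HTTP request. In CPython it treats 401/403 as
    # "everything blocked" and other 4xx responses (such as a missing file)
    # as "everything allowed".
    parser.read()
    return {agent: parser.can_fetch(agent, page) for agent in AI_AGENTS}


# Example call (performs a network request):
# print(check_live("example.com"))
```

The result reflects robots.txt at the moment of the request, so a later edit to the file requires a fresh check.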

What about JavaScript-rendered content?

Allowing crawlers is step one; they also need readable HTML. If your content only appears after JavaScript runs, crawlers that do not execute JavaScript see an empty shell. A free Beacon account measures this render gap.
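
To make the "empty shell" idea concrete, here is a small sketch, assuming Python's standard html.parser. It extracts the text present in the raw HTML, ignoring script and style contents, and checks whether a key phrase is there. The class and function names and the sample pages are invented for illustration, and this is a simplification rather than how Beacon measures the render gap.

```python
from html.parser import HTMLParser


class StaticText(HTMLParser):
    """Collect the text that is present in raw HTML without running scripts."""

    SKIP = {"script", "style"}

    def __init__(self) -> None:
        super().__init__()
        self._skip_depth = 0
        self.chunks: list[str] = []

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIP:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIP and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep only non-empty text that sits outside script/style elements.
        if not self._skip_depth and data.strip():
            self.chunks.append(data.strip())


def static_text(raw_html: str) -> str:
    parser = StaticText()
    parser.feed(raw_html)
    parser.close()
    return " ".join(parser.chunks)


def phrase_in_static_html(raw_html: str, phrase: str) -> bool:
    """True if the phrase appears in the HTML before any JavaScript runs."""
    return phrase.lower() in static_text(raw_html).lower()


# Hypothetical pages: a JavaScript-only shell and a server-rendered page.
SHELL = '<html><body><div id="root"></div><script>render("Pricing plans")</script></body></html>'
STATIC = "<html><body><h1>Pricing plans</h1><p>Starter and Pro tiers</p></body></html>"
```

Feeding both sample pages to phrase_in_static_html with the phrase "Pricing plans" shows the difference: the phrase only exists inside a script in SHELL, while STATIC carries it in the markup itself.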

Check your AI-crawler access now

Free, real, no signup — see who can read your site.

Start free
Run a free check