AI Bot Reference

Known AI chatbot crawlers — user agents & IP addresses

The definitive reference for identifying and managing AI web crawlers. Use this data to configure your robots.txt, firewall rules, or verify bot traffic in your server logs.

24+ AI crawlers documented
18 major entities covered
Data last updated: 2025

Why this matters

What you can do with this data

Whether you want to block AI training, appear in AI search results, or just understand your traffic — this reference helps.

Configure robots.txt

Use our examples to allow or block specific AI bots from crawling your site for training or search indexing.
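As a minimal sketch of this idea, the Python snippet below generates robots.txt rules that block a few training crawlers and explicitly allow a few search crawlers. The user agent tokens are taken from the list in this reference; the function name `build_robots_txt` and the choice of which bots to block or allow are illustrative assumptions, not a recommended policy.

```python
def build_robots_txt(blocked_agents, allowed_agents=()):
    """Return robots.txt text: site-wide Disallow for blocked agents,
    site-wide Allow for allowed agents."""
    lines = []
    for agent in blocked_agents:
        lines += [f"User-agent: {agent}", "Disallow: /", ""]
    for agent in allowed_agents:
        lines += [f"User-agent: {agent}", "Allow: /", ""]
    return "\n".join(lines)


# Hypothetical policy: opt out of training crawlers, keep AI search crawlers.
TRAINING_BOTS = ["GPTBot", "ClaudeBot", "CCBot"]
SEARCH_BOTS = ["OAI-SearchBot", "PerplexityBot"]

robots_txt = build_robots_txt(TRAINING_BOTS, SEARCH_BOTS)
print(robots_txt)
```

The output can be saved as robots.txt at the site root. Crawlers marked "May ignore robots.txt" in this reference may not honor these rules, so network-level controls may still be needed for them.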

Firewall rules

Reference official IP ranges to whitelist or block AI crawlers at the network level.
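A rough sketch of an IP-based check, using Python's standard `ipaddress` module, is shown below. The CIDR blocks are placeholders from the reserved documentation ranges (RFC 5737) and are not real crawler ranges; the mapping `EXAMPLE_RANGES` and the function `match_bot_by_ip` are hypothetical names. In practice the ranges would come from each operator's officially published list.

```python
import ipaddress

# Placeholder CIDR blocks from the RFC 5737 documentation ranges.
# These are NOT real crawler ranges; replace them with each operator's
# officially published list before using this for anything real.
EXAMPLE_RANGES = {
    "GPTBot": ["192.0.2.0/24"],
    "PerplexityBot": ["198.51.100.0/25"],
}


def match_bot_by_ip(ip, ranges=EXAMPLE_RANGES):
    """Return the bot whose listed range contains ip, or None if no range matches."""
    addr = ipaddress.ip_address(ip)
    for bot, cidrs in ranges.items():
        if any(addr in ipaddress.ip_network(cidr) for cidr in cidrs):
            return bot
    return None


print(match_bot_by_ip("192.0.2.10"))
print(match_bot_by_ip("203.0.113.5"))
```

A request that claims a bot's user agent but comes from an address outside that bot's published ranges is a reasonable candidate for blocking or closer inspection.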

Analytics filtering

Identify AI bot traffic in your analytics to get accurate human visitor counts.
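One simple way to do this, sketched below in Python, is to match each request's User-Agent header against known bot tokens and count the remainder as presumed human traffic. The token list is a subset of the tokens in this reference; the sample user agent strings are simplified and made up for illustration, and `detect_ai_bot` and `summarize` are hypothetical helper names.

```python
from collections import Counter

# A subset of user agent tokens from this reference; extend as needed.
AI_BOT_TOKENS = [
    "GPTBot", "OAI-SearchBot", "ChatGPT-User", "PerplexityBot",
    "ClaudeBot", "Bytespider", "CCBot",
]


def detect_ai_bot(user_agent):
    """Return the first bot token found in the user agent, or None."""
    ua = user_agent.lower()
    for token in AI_BOT_TOKENS:
        if token.lower() in ua:
            return token
    return None


def summarize(user_agents):
    """Count requests per AI bot, plus the remaining (presumed human) requests."""
    bot_hits = Counter()
    human_hits = 0
    for ua in user_agents:
        bot = detect_ai_bot(ua)
        if bot:
            bot_hits[bot] += 1
        else:
            human_hits += 1
    return bot_hits, human_hits


# Simplified, made-up user agent strings for illustration only.
SAMPLE = [
    "Mozilla/5.0 (compatible; GPTBot/1.0)",
    "Mozilla/5.0 (compatible; ClaudeBot/1.0)",
    "Mozilla/5.0 (Windows NT 10.0; Win64; x64) Firefox/128.0",
    "Mozilla/5.0 (compatible; GPTBot/1.0)",
]
bot_hits, human_hits = summarize(SAMPLE)
print(dict(bot_hits), human_hits)
```

Because user agent strings can be spoofed, substring matching like this gives an estimate rather than proof of a bot's identity; combining it with an IP range check is more reliable.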

Content licensing

Understand which bots respect opt-out directives for AI training vs. search indexing.
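To illustrate the training versus search distinction, the sketch below uses Python's standard `urllib.robotparser` to evaluate a small, hypothetical robots.txt that opts out of one training crawler while allowing one search crawler. The rules, the `example.com` URL, and the helper `is_allowed` are assumptions for illustration. The check only shows what the file asks for, not whether a given crawler actually complies.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical rules: opt out of training (GPTBot), allow search (OAI-SearchBot).
ROBOTS_TXT = """\
User-agent: GPTBot
Disallow: /

User-agent: OAI-SearchBot
Allow: /
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def is_allowed(agent, url="https://example.com/article"):
    """Return True if the parsed rules permit this agent to fetch url."""
    return parser.can_fetch(agent, url)


for agent in ("GPTBot", "OAI-SearchBot", "Bingbot"):
    print(agent, is_allowed(agent))
```

Agents with no matching rule, such as Bingbot here, are treated as allowed, which is the default behavior when robots.txt says nothing about them.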

Get the data

Download or copy our curated list of AI bot user agents and IP addresses. Perfect for configuring your robots.txt, firewall rules, or analytics filters.

Complete reference

All known AI chatbot crawlers

Each entry lists the operator, whether the crawler respects robots.txt, the crawler's name, its purpose, and its user agent token.

GPTBot (OpenAI)
Purpose: Model training data collection
User agent token: GPTBot
Respects robots.txt

OAI-SearchBot (OpenAI)
Purpose: ChatGPT search indexing and citations
User agent token: OAI-SearchBot
Respects robots.txt

ChatGPT-User (OpenAI)
Purpose: Real-time webpage fetching for user queries
User agent token: ChatGPT-User
May ignore robots.txt

Googlebot (Google)
Purpose: Search indexing and AI model training
User agent token: Googlebot
Respects robots.txt

PerplexityBot (Perplexity AI)
Purpose: Search indexing for AI answers
User agent token: PerplexityBot
Respects robots.txt

Perplexity-User (Perplexity AI)
Purpose: Real-time content fetching for user queries
User agent token: Perplexity-User
May ignore robots.txt

ClaudeBot (Anthropic)
Purpose: Model training data collection
User agent token: ClaudeBot
Respects robots.txt

Claude-User (Anthropic)
Purpose: Real-time content fetching for user queries
User agent token: Claude-User
Respects robots.txt

Claude-SearchBot (Anthropic)
Purpose: Search indexing for Claude answers
User agent token: Claude-SearchBot
Respects robots.txt

Bingbot (Microsoft)
Purpose: Search indexing and AI answer generation
User agent token: Bingbot
Respects robots.txt

ByteSpider (ByteDance)
Purpose: Generative AI training and search indexing
User agent token: Bytespider
Respects robots.txt

Amazonbot (Amazon)
Purpose: Alexa accuracy and LLM training (Bedrock)
User agent token: Amazonbot
Respects robots.txt

AI2Bot (Allen Institute for AI)
Purpose: Scientific and open-source AI research
User agent token: AI2Bot
Respects robots.txt

CCBot (Common Crawl)
Purpose: Open web indexing for public datasets
User agent token: CCBot
Respects robots.txt

LAION Crawlers (LAION)
Purpose: Multimodal and image AI training
User agent token: laion-huggingface-processor
Respects robots.txt

PetalBot (Huawei)
Purpose: Search indexing and Pangu LLM training
User agent token: PetalBot
Respects robots.txt

Meta AI Crawlers (Meta)
Purpose: Llama model training and AI features
User agent token: Meta-ExternalAgent
May ignore robots.txt

Cohere-AI (Cohere)
Purpose: Enterprise-grade LLM training
User agent token: cohere-ai
Respects robots.txt

YouBot (You.com)
Purpose: Search indexing and YouChat answers
User agent token: YouBot
Respects robots.txt

DuckAssistBot (DuckDuckGo)
Purpose: Real-time AI summarization (DuckAssist)
User agent token: DuckAssistBot
Respects robots.txt

PhindBot (Phind)
Purpose: Search indexing and technical AI answers
User agent token: PhindBot
Respects robots.txt

Devin (Cognition Labs)
Purpose: Autonomous task execution and coding
User agent token: Devin
May ignore robots.txt

TavilyBot (Tavily)
Purpose: Real-time AI data retrieval for agents
User agent token: TavilyBot
Respects robots.txt

ShapBot (Parallel.ai)
Purpose: Search indexing for AI APIs
User agent token: ShapBot
Respects robots.txt

Go beyond blocking bots

See what AI is saying about your brand

Managing bot crawlers is just one piece. With Aiso, you can see the actual conversations happening about your brand inside ChatGPT, Claude, and other AI platforms.