OpenAI

GPTBot

OpenAI's primary web crawler for gathering training data to improve AI models like ChatGPT.

Purpose: Model training data collection

Quick Facts

Company
OpenAI
Respects robots.txt
Yes
Last Updated
2025-05
Official Documentation

📊 Popularity & Traffic

#1Ranking among AI crawlers
~30% of AI crawler trafficTraffic share

Rose from 5% to 30% of AI-specific crawl traffic between May 2024 and May 2025, surpassing all other AI bots.

🤖 User Agent Strings

Use these patterns to identify GPTBot in your server logs or configure your robots.txt file.

GPTBot

Respects robots.txt

Primary OpenAI crawler for model training

Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko); compatible; GPTBot/1.3; +https://openai.com/gptbot

🌐 IP Ranges

Source: Official OpenAI JSON file
Official source file

Identified IP Ranges7 Ranges

20.171.206.0/24
Subnet with 256 addresses
20.171.207.0/24
Subnet with 256 addresses
40.84.180.0/24
Subnet with 256 addresses
40.84.181.0/24
Subnet with 256 addresses
52.230.152.0/24
Subnet with 256 addresses
52.233.106.0/24
Subnet with 256 addresses

How to read CIDR notation:

The /28 suffix indicates a block of 16 IP addresses. For example,.112/28 covers all addresses from .112 up to .127. Adding these to your firewall will block the entire range used by GPTBot.

📝 Robots.txt Configuration

Add the following to your robots.txt file to block GPTBot:

User-agent: GPTBot
Disallow: /

💡 Important Notes

  • Respects robots.txt directives including per-directory Allow/Disallow rules
  • Blocking GPTBot excludes your content from future model training
  • Does not affect ChatGPT search feature (that's OAI-SearchBot)
Beyond blocking crawlers

See what AI is saying about your brand

Understanding crawlers is step one. With Aiso, you can see the actual conversations happening about your brand inside ChatGPT, Claude, and Perplexity.