OpenAI
GPTBot
OpenAI's primary web crawler for gathering training data to improve AI models like ChatGPT.
Purpose: Model training data collection
📊 Popularity & Traffic
#1Ranking among AI crawlers
~30% of AI crawler trafficTraffic share
Rose from 5% to 30% of AI-specific crawl traffic between May 2024 and May 2025, surpassing all other AI bots.
🤖 User Agent Strings
Use these patterns to identify GPTBot in your server logs or configure your robots.txt file.
GPTBot
Respects robots.txtPrimary OpenAI crawler for model training
Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko); compatible; GPTBot/1.3; +https://openai.com/gptbot🌐 IP Ranges
Source: Official OpenAI JSON file
Official source fileIdentified IP Ranges7 Ranges
20.171.206.0/24Subnet with 256 addresses
20.171.207.0/24Subnet with 256 addresses
40.84.180.0/24Subnet with 256 addresses
40.84.181.0/24Subnet with 256 addresses
52.230.152.0/24Subnet with 256 addresses
52.233.106.0/24Subnet with 256 addresses
How to read CIDR notation:
The/28 suffix indicates a block of 16 IP addresses. For example,.112/28 covers all addresses from .112 up to .127. Adding these to your firewall will block the entire range used by GPTBot.📝 Robots.txt Configuration
Add the following to your robots.txt file to block GPTBot:
User-agent: GPTBot
Disallow: /💡 Important Notes
- Respects robots.txt directives including per-directory Allow/Disallow rules
- Blocking GPTBot excludes your content from future model training
- Does not affect ChatGPT search feature (that's OAI-SearchBot)
Beyond blocking crawlers
See what AI is saying about your brand
Understanding crawlers is step one. With Aiso, you can see the actual conversations happening about your brand inside ChatGPT, Claude, and Perplexity.