Cohere
Cohere-AI
Crawler for gathering text data to train enterprise-focused language models.
Purpose: Enterprise-grade LLM training
📊 Popularity & Traffic
Smaller volume than OpenAI but highly active in technical and business content niches.
🤖 User Agent Strings
Use these patterns to identify Cohere-AI in your server logs or configure your robots.txt file.
cohere-ai
Respects robots.txtGeneral Cohere AI crawler
cohere-aicohere-training-data-crawler
Respects robots.txtExplicit training data agent
cohere-training-data-crawler🌐 IP Ranges
Source: Varies (typically cloud instances)
No specific IP ranges published. Identify this bot using the User Agent strings above.
📝 Robots.txt Configuration
Add the following to your robots.txt file to block Cohere-AI:
User-agent: cohere-ai
Disallow: /💡 Important Notes
- Focuses on high-quality textual content for business use cases
- Known to integrate with Slack/Enterprise platforms for private data training when authorized
- Typically respects standard Robots.txt directives with no reported issues
Beyond blocking crawlers
See what AI is saying about your brand
Understanding crawlers is step one. With Aiso, you can see the actual conversations happening about your brand inside ChatGPT, Claude, and Perplexity.