Cohere

Cohere-AI

Crawler for gathering text data to train enterprise-focused language models.

Purpose: Enterprise-grade LLM training

Quick Facts

Company
Cohere
Respects robots.txt
Yes
Last Updated
2025-05
Official Documentation

📊 Popularity & Traffic

Smaller volume than OpenAI but highly active in technical and business content niches.

🤖 User Agent Strings

Use these patterns to identify Cohere-AI in your server logs or configure your robots.txt file.

cohere-ai

Respects robots.txt

General Cohere AI crawler

cohere-ai

cohere-training-data-crawler

Respects robots.txt

Explicit training data agent

cohere-training-data-crawler

🌐 IP Ranges

Source: Varies (typically cloud instances)

No specific IP ranges published. Identify this bot using the User Agent strings above.

📝 Robots.txt Configuration

Add the following to your robots.txt file to block Cohere-AI:

User-agent: cohere-ai
Disallow: /

💡 Important Notes

  • Focuses on high-quality textual content for business use cases
  • Known to integrate with Slack/Enterprise platforms for private data training when authorized
  • Typically respects standard Robots.txt directives with no reported issues
Beyond blocking crawlers

See what AI is saying about your brand

Understanding crawlers is step one. With Aiso, you can see the actual conversations happening about your brand inside ChatGPT, Claude, and Perplexity.