PerplexityBot
PerplexityBot is Perplexity's web crawler. It indexes content for Perplexity's AI answer engine.
Why It Matters for GEO
Perplexity cites sources with visible links. Allowing PerplexityBot means potential direct traffic from Perplexity citations.
Unlike many AI crawlers that collect data purely for model training, PerplexityBot crawls the live web to find current, relevant sources for Perplexity's real-time answer engine. Every page it indexes is a candidate for citation. If PerplexityBot cannot access your site, your content is invisible to Perplexity's 15+ million monthly users — many of whom are researchers, professionals, and business buyers.
How to Configure
# robots.txt
User-agent: PerplexityBot
Allow: /
Practical Example
An IT consulting firm discovers their website blocks all non-Google bots via a wildcard rule in their robots.txt — a leftover configuration from five years ago. After updating the file to explicitly allow PerplexityBot, the firm's technical guides start appearing in Perplexity search results. Over the following quarter, they track 180 new referral visits from Perplexity, with an average session duration of four minutes — indicating high-quality, research-intent traffic that converts at twice the rate of general organic visitors.
Common Mistakes
- Wildcard bot blocking: A
User-agent: * Disallow: /rule blocks all bots including PerplexityBot. Always add explicit allow rules for AI crawlers you want to admit. - Blocking by IP only: Some security tools block bots based on IP ranges without updating robots.txt. PerplexityBot may be blocked at the server level even if robots.txt appears permissive. Check both.
- Forgetting to update after a site migration: When moving to a new CMS or host, default robots.txt templates often block all crawlers. AI bot permissions are frequently lost during migrations.
- Allowing the bot but having no crawlable content: If your site is a single-page application with no server-side rendering, PerplexityBot may access your pages but find no readable text. Ensure key content is rendered server-side.