User-agent: *
Allow: /

# Web search + general-purpose crawlers
User-agent: Googlebot
Allow: /

User-agent: Bingbot
Allow: /

User-agent: DuckDuckBot
Allow: /

User-agent: Slurp
Allow: /

User-agent: Yandex
Allow: /

User-agent: Baiduspider
Allow: /

User-agent: Applebot
Allow: /

User-agent: AhrefsBot
Allow: /

User-agent: SemrushBot
Allow: /

User-agent: facebookexternalhit
Allow: /

User-agent: Twitterbot
Allow: /

User-agent: LinkedInBot
Allow: /

# AI / LLM crawlers: broad citation rather than blanket block. Flip any
# block to `Disallow: /` to opt out per-bot.
User-agent: GPTBot
Allow: /

User-agent: ChatGPT-User
Allow: /

User-agent: OAI-SearchBot
Allow: /

User-agent: Google-Extended
Allow: /

User-agent: PerplexityBot
Allow: /

User-agent: Perplexity-User
Allow: /

User-agent: ClaudeBot
Allow: /

User-agent: Claude-Web
Allow: /

User-agent: anthropic-ai
Allow: /

User-agent: Amazonbot
Allow: /

User-agent: AmazonbotCommerce
Allow: /

User-agent: Bytespider
Allow: /

User-agent: cohere-ai
Allow: /

User-agent: CCBot
Allow: /

User-agent: ImagesiftBot
Allow: /

User-agent: Diffbot
Allow: /

User-agent: meta-externalagent
Allow: /

User-agent: meta-externalfetcher
Allow: /

User-agent: facebook-externalhit-llama
Allow: /

User-agent: MistralAI-User
Allow: /

User-agent: YouBot
Allow: /

Sitemap: https://sebastienrousseau.com/sitemap.xml
Sitemap: https://sebastienrousseau.com/news-sitemap.xml
Sitemap: https://sebastienrousseau.com/fr/news-sitemap.xml

# llms.txt: https://sebastienrousseau.com/llms.txt
# llms-full: https://sebastienrousseau.com/llms-full.txt