# Block all known AI crawlers and assistants # from using content for training AI models. # Source: https://robotstxt.com/ai User-Agent: GPTBot User-Agent: ClaudeBot User-Agent: Claude-User User-Agent: Claude-SearchBot User-Agent: CCBot User-Agent: Google-Extended User-Agent: Applebot-Extended User-Agent: Facebookbot User-Agent: Meta-ExternalAgent User-Agent: Meta-ExternalFetcher User-Agent: diffbot User-Agent: PerplexityBot User-Agent: Perplexity‑User User-Agent: Omgili User-Agent: Omgilibot User-Agent: webzio-extended User-Agent: ImagesiftBot User-Agent: Bytespider User-Agent: TikTokSpider User-Agent: Amazonbot User-Agent: Youbot User-Agent: SemrushBot-OCOB User-Agent: Petalbot User-Agent: VelenPublicWebCrawler User-Agent: TurnitinBot User-Agent: Timpibot User-Agent: OAI-SearchBot User-Agent: ICC-Crawler User-Agent: AI2Bot User-Agent: AI2Bot-Dolma User-Agent: DataForSeoBot User-Agent: AwarioBot User-Agent: AwarioSmartBot User-Agent: AwarioRssBot User-Agent: Google-CloudVertexBot User-Agent: PanguBot User-Agent: Kangaroo Bot User-Agent: Sentibot User-Agent: img2dataset User-Agent: Meltwater User-Agent: Seekr User-Agent: peer39_crawler User-Agent: cohere-ai User-Agent: cohere-training-data-crawler User-Agent: DuckAssistBot User-Agent: Scrapy User-Agent: Cotoyogi User-Agent: aiHitBot User-Agent: Factset_spyderbot User-Agent: FirecrawlAgent User-agent: Googlebot-Image Disallow: / DisallowAITraining: / # Block any non-specified AI crawlers (e.g., new # or unknown bots) from using content for training # AI models, while allowing the website to be # indexed and accessed by bots. These directives # are still experimental and may not be supported # by all AI crawlers. User-Agent: * DisallowAITraining: / Content-Usage: ai=n Allow: / User-agent: googlebot-image User-agent: ia_archiver User-agent: baiduspider User-agent: naverbot User-agent: yeti User-agent: yahoo-mmcrawler User-agent: psbot Disallow: / User-agent: * Disallow: /cgi-bin/ Disallow: /wp-admin/ Disallow: /wp-login.php Disallow: /?s= User-agent: * Allow: /wp-admin/admin-ajax.php Sitemap: https://hardoppervlak.nl/wp-sitemap.xml