Crawler type: html_page developers.openai.com↗

OpenAI

stable for 49 days · 1 material event tracked · 2 snapshots in history

Documented user-agents (4)

Each distinct UA this vendor publishes on its docs page, extracted by Haiku from the latest snapshot. New UAs appearing or scope changes here are the high-signal events to watch.

User-agent Purpose Scope / when it fires Opt-out
OAI-SearchBot Surface websites in ChatGPT search results Search indexing for ChatGPT search features; does not crawl automatically Disallow OAI-SearchBot in robots.txt; sites opted out will not appear in ChatGPT search answers
OAI-AdsBot Validate safety of landing pages submitted as ads on ChatGPT Only visits pages submitted as ads; not used for automatic web crawling Data not used for training AI models; applies only to submitted ad landing pages
GPTBot Training data collection for generative AI foundation models Crawls content for training and improving AI models and safety Disallow GPTBot in robots.txt; indicates content should not be used for training AI models
ChatGPT-User User-triggered fetch for live responses in ChatGPT and Custom GPTs Visits pages when users ask questions or interact with GPT Actions; not automatic crawling User-initiated actions; robots.txt rules may not apply to ChatGPT-User requests
Change timeline — diffs over time with insights

Each block is a detected change: the new-vs-prior snapshot diff and the LLM-written insight. Newest first.

2025-10-19 2026-04-19 182 days apart
+29 −0
View diff
Index: gptbot
===================================================================
--- gptbot	2025-10-19
+++ gptbot	2026-04-19
@@ -0,0 +1,29 @@
+OpenAI uses web crawlers (“robots”) and user agents to perform actions for its products, either automatically or triggered by user request. OpenAI uses OAI-SearchBot and GPTBot robots.txt tags to enable webmasters to manage how their sites and content work with AI. Each setting is independent of the others – for example, a webmaster can allow OAI-SearchBot in order to appear in search results while disallowing GPTBot to indicate that crawled content should not be used for training OpenAI’s generative AI foundation models. If your site has allowed both bots, we may use the results from just one crawl for both use cases to avoid duplicative crawling. For search results, please note it can take ~24 hours from a site’s robots.txt update for our systems to adjust.
+User agent
+Description & details
+OAI-SearchBot
+OAI-SearchBot is for search. OAI-SearchBot is used to surface websites in search results in ChatGPT’s search features. Sites that are opted out of OAI-SearchBot will not be shown in ChatGPT search answers, though can still appear as navigational links. To help ensure your site appears in search results, we recommend allowing OAI-SearchBot in your site’s robots.txt file and allowing requests from our published IP ranges below.
+Full user-agent string:
+Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/131.0.0.0 Safari/537.36; compatible; OAI-SearchBot/1.3; +https://openai.com/searchbot
+Published IP addresses:
+https://openai.com/searchbot.json
+OAI-AdsBot
+OAI-AdsBot is used to validate the safety of web pages submitted as ads on ChatGPT. When you submit an ad, OpenAI may visit the landing page to ensure it complies with our policies. We may also use content from the landing page to determine when it’s most relevant to show the ad to users. OAI-AdsBot only visits pages submitted as ads, and the data collected by OAI-AdsBot is not used to train generative AI foundation models.
+Full user-agent string:
+Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko); compatible; OAI-AdsBot/1.0; +https://openai.com/adsbot
+GPTBot
+GPTBot is used to make our generative AI foundation models more useful and safe. It is used to crawl content that may be used in training our generative AI foundation models. Disallowing GPTBot indicates a site’s content should not be used in training generative AI foundation models.
+Full user-agent string:
+Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko); compatible; GPTBot/1.3; +https://openai.com/gptbot
+Published IP addresses:
+https://openai.com/gptbot.json
+ChatGPT-User
+OpenAI also uses ChatGPT-User for certain user actions in ChatGPT and
+Custom GPTs
+. When users ask ChatGPT or a CustomGPT a question, it may visit a web page with a ChatGPT-User agent. ChatGPT users may also interact with external applications via
+GPT Actions
+. ChatGPT-User is not used for crawling the web in an automatic fashion. Because these actions are initiated by a user, robots.txt rules may not apply. ChatGPT-User is not used to determine whether content may appear in Search. Please use OAI-SearchBot in robots.txt for managing Search opt outs and automatic crawl.
+Full user-agent string:
+Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko); compatible; ChatGPT-User/1.0; +https://openai.com/bot
+Published IP addresses:
+https://openai.com/chatgpt-user.json
\ No newline at end of file
Events