←back to thread

770 points ta988 | 2 comments | | HN request time: 0.498s | source
Show context
Ukv ◴[] No.42550989[source]
Are these IPs actually from OpenAI/etc. (https://openai.com/gptbot.json), or is it possibly something else masquerading as these bots? The real GPTBot/Amazonbot/etc. claim to obey robots.txt, and switching to a non-bot UA string seems extra questionable behaviour.
replies(2): >>42551196 #>>42563566 #
equestria ◴[] No.42551196[source]
I exclude all the published LLM User-Agents and have a content honeypot on my website. Google obeys, but ChatGPT and Bing still clearly know the content of the honeypot.
replies(3): >>42551318 #>>42551321 #>>42551783 #
1. Ukv ◴[] No.42551318[source]
Interesting - do you have a link?
replies(1): >>42551698 #
2. equestria ◴[] No.42551698[source]
Of course, but I'd rather not share it for obvious reasons. It is a nonsensical biography of a non-existing person.