Some early takeaways from around 6,000 AI bot requests across 4 domains (see attached tweet for what I'm doing/testing).
1. ChatGPT-User doesn't appear to be able to act agentically (gets the summary, that's it) in a non-user triggered session (i.e. general, real world discovery). Can't be asserted with complete confidence, but there hasn't been a single interaction with the API as yet across all sessions. It can interact with the API if you explicitly ask it to (i.e. fetch this page, then choose a section to read), however, it's unable to construct URLs that aren't in the source, which means that it can't use the search tools.
2. OAI-Searchbot however when it gets served with a summary can either interact with the API itself, or hand over to GPTBot to continue the session (seems to be a common pattern). Both bots (or the LLMs behind them) read the instructions, then search (understanding the HyDE instruction), request sections, full content from the page etc. Note that GPTBot is not intercepted by the worker, so only gets served with the compressed content when it requests it directly (which it does).
3. All of Anthropic's bots (including Claude-User) appear to be able to act agentically in general discovery.
4. There has been no discernible change in requests from the intercepted bots (one domain is slightly up, one is slightly down, but generally it's all looking relatively flat). This is pretty much expected, aligning with my opinion that AI crawlers can read your content just fine, and counter to the messaging from certain services.
5. One other point, which I think is pretty relevant - serving markdown directly broke OpenAI's parsing, so the API returns basic HTML, which OpenAI then turns back into markdown.
Note: since this is a test, all the domains it's active on are relatively low traffic, but there's enough there to start forming some opinions on the capabilities of the various AI bots and whether there's any benefit to this.
Further note: over the next day or so I'll also start serving an Open Knowledge Format version of the (QueryBurst created) knowledge graph for these sites, with individual markdown pages for all entities. Instructions for browsing these pages/an entry point will be added to the initial response.
A common rhetoric in AI search is that AI systems/agents "can't read your site" and that you need to serve an "optimized" version.
In fact there was a (very) large acquisition last week for a platform that offers to do this for you.
So does serving an optimized version of your content increase citations? Or is it a load of GEO nonsense? Could it in fact harm your site?
Well, I have my opinions.
But opinions are like... (you know what), so I spent the last week or so building an "agent friendly" API and am currently serving faithful compressed summaries, with optional additional retrieval to selected user agents on a few test domains.
The "agent" gets instructions on how to use the API to request more content (full document, a specific section, search within the page). It can also search across the full site (semantic or keyword). All agent "journeys" are tracked via sessions.
Soon I'll report if:
a) there was any increase/decrease in citations/AI bot activity
b) whether the "agents" interacted with the API after getting the initial summary (so far, none of them have, but it's been live for less than 24 hours)
c) whether any of them requested the full API instructions
d) lots more... (already have some myths debunked)
On that note, if you have a site you don't mind testing things on (this to my mind isn't without risk) then let me know.
If you're a QueryBurst subscriber you actually have access to it already. But it's heavily caveated with "this is experimental - we recommend running on a test domain". I wouldn't go gung-ho and add it to your main site at the moment...
...although with that being said, for full dogfooding, one of the test domains is QueryBurst's marketing site itself.
Note: while in testing I may experiment A/B test various content formats for the initial served summary (currently it's a highly compressed 200-300 tokens). Although at all times it will be a faithful compression of what's actually on the page with nothing new injected, other than the API instructions.
Technical notes:
1. Requests are intercepted at the edge via a Cloudflare Worker. Known AI user-agents (ChatGPT-User, Claude-User, Perplexity-User etc.) are routed to a backend API that serves a compressed markdown summary (~200-300 tokens) built from the page's existing QueryBurst Site intelligence (key claims, document structure, metadata).
2. The response includes instructions and links for deeper retrieval. Full page content, specific sections by heading, page-scoped search, or site-wide semantic search. Each interaction carries a session ID so we can trace the full agent journey (did it stop at the summary? drill into a section? search for related content?).
3. Static assets are excluded at the Worker level. All responses are served sub-200ms. Human visitors see the normal site, nothing changes for them.
4. The compression is deterministic (no LLM in the serving path). It's pre-computed from our existing Site Intelligence pipeline. Nothing is invented or injected beyond factual assertions already present on the page.