Complete Guide

Observed Outcomes Register: Documented AI Retrieval, Citation and Feature-Extraction Behaviour

The public log of documented AI retrieval, citation, and feature-extraction behaviour observed on SEO Strategy Ltd client work. Each entry records a dated observation: query, AI surface, retrieval pattern, feature extracted, constraint handling, entity preservation, attribution persistence, confounders, interpretation confidence, and the hypothesis generated. Two entries at launch (Olliers Operation Soteria, March 2026 / Daves Taxis Cruise Tour Planner, May 2026). The register’s value compounds with entry count and time span, not with any individual observation.

7 min read 1,467 words Updated May 2026

The observed-outcomes register is the public log of documented AI retrieval, citation, and feature-extraction behaviour observed on SEO Strategy Ltd client work. Each entry records a dated observation of an AI system’s actual output for a specific query — the surface, the retrieval pattern, the entity treatment, the operational differentiator extracted, the confounders, and the interpretation confidence. The register exists because the strongest evidence for any framework is documented behaviour observed in the wild, recorded the same way every time, accumulating over months.

2 Documented entries in the register at launch (May 2026) — Olliers Solicitors Operation Soteria (March 2026, Google AI Overview) and Daves Taxis Cruise Tour Planner (May 2026, Google AI Mode). The register is append-only and updates as observations occur. SEO Strategy Ltd direct observation, March-May 2026

11 Structured fields recorded per entry — date observed, query, AI surface, retrieval pattern, feature extracted, constraint handling observed, entity preservation, attribution persistence, confounders, interpretation confidence, hypothesis generated — ensuring observations are comparable over time and across sectors. SEO Strategy Ltd register methodology, May 2026

8-12 Entry count threshold at which observations begin carrying evidential weight rather than reading as isolated anecdotes. Two entries is a start, not a pattern; the register’s value compounds with entry count and time span, not with any individual observation. SEO Strategy Ltd register design principle, May 2026

The observed-outcomes register is the public log of documented AI retrieval, citation, and feature-extraction behaviour observed on SEO Strategy Ltd client work. Each entry records a dated observation of an AI system’s actual output for a specific query — the surface, the retrieval pattern, the entity treatment, the operational differentiator extracted, the confounders, and the interpretation confidence. The register exists because the strongest evidence for any framework is not the framework’s internal logic. It is documented behaviour observed in the wild, recorded the same way every time, accumulating over months. This page is small at launch. It will compound.

Why this register exists

The AI visibility category is flooded with frameworks, opinions, and predictive claims. Almost no one is publishing structured, timestamped, verifiable observations of actual AI system behaviour. That is the gap this register fills.

Each entry is an observation, not a proof. An observation is a single documented data point from a specific AI surface at a specific time, with the confounders named. A proof would require controlled replication across queries, surfaces, and time windows that no one in this discipline currently has. The honest position is that we have observations consistent with hypotheses about how AI retrieval and recommendation systems behave — and that as observations accumulate, those hypotheses either earn evidential weight or get refuted. Either outcome is useful.

The register is read alongside CITATE (the page-level standard the observed pages were built to) and the AI Discovery Stack (the system model the observations sit inside). The register does not prove either framework. It documents specific outputs in specific conditions and lets readers draw their own conclusions.

How entries are structured

Every entry uses the same fields so that observations are comparable over time:

Date observed — when the output was captured
Query — the exact prompt used
AI surface — which platform and which feature (e.g. Google AI Overview, Google AI Mode, ChatGPT, Perplexity)
Retrieval pattern — how the source surfaced (cited link, named card, embedded summary, branded panel)
Feature extracted — the specific operational differentiator the system surfaced, if any, in whose exact wording
Constraint handling observed — whether the system reasoned about user constraints (timing, location, eligibility) using the source’s structure
Entity preservation — whether the source was named, linked, logo-displayed, or all three
Attribution persistence — whether the attribution carried through multiple turns or follow-up queries
Confounders — what could plausibly explain the result other than the framework hypothesis
Interpretation confidence — low, medium, or high, with the reasoning named
Hypothesis generated — what this observation, if it repeats, would suggest

Entry 1: Olliers Solicitors — Operation Soteria

Date observed: March 2026. Client: Olliers Solicitors. Sector: Criminal defence law.

Query: Operation Soteria-related criminal defence query (specific phrasing captured in screenshot evidence held with the client).

AI surface: Google AI Overview.

Retrieval pattern: Citation with the Olliers Solicitors logo displayed alongside the AI Overview answer. The source was named in the answer and visually anchored by the brand mark in the citation panel.

Feature extracted: The page’s structured explanation of the Operation Soteria framework was the substrate for the AI Overview’s answer. The page had been built to CITATE 6/6 with a standalone opening, explicit definitions of the operational terms, attributed claims, and named entity references throughout.

Constraint handling observed: The AI Overview reasoned about the legal context (which police forces, which charge types, which procedural stage) using the structure on the page rather than generic legal definitions. The page’s sectioning by procedural stage was preserved in the AI’s reasoning sequence.

Entity preservation: Named source, linked source, logo displayed in the citation panel. Strong on all three dimensions.

Attribution persistence: Persisted across follow-up queries within the same session. The system continued to surface Olliers Solicitors when adjacent procedural queries were submitted.

Confounders: Olliers is a recognised criminal defence firm with strong off-page corroboration (Chambers, Legal 500, law society listings). The AI Overview citation may reflect that corroboration rather than purely the CITATE-structured page. Disentangling on-page structure from off-page trust is not possible from a single observation.

Interpretation confidence: Medium-high. The combination of named citation, logo display, and structural preservation of the page’s sectioning is consistent with the hypothesis that pages built to extractable structure with strong entity corroboration become reusable substrates for AI Overview answers. It is not conclusive evidence that CITATE structure alone produced the result.

Hypothesis generated: Pages built to a measurement standard (CITATE 6/6) on entities with strong off-page corroboration appear more likely to be selected as the named substrate for AI Overview answers in regulated sectors where citation accuracy carries reputational stakes for the AI system.

Entry 2: Daves Taxis — Cruise Tour Planner

Date observed: May 2026. Client: Daves Taxis (David Plomer, sole trader, West End Southampton). Sector: Local transport / cruise tour services.

Query: Planning-stage cruise tour query for Southampton arrivals with specific docking and sailing time constraints.

AI surface: Google AI Mode.

Retrieval pattern: Branded card display of Daves Taxis with the business named and visually presented as a distinct entity, not merely listed in a results set.

Feature extracted: The AI Mode answer included, in near-verbatim form, the page’s description of the Cruise Tour Planner: “Features a dedicated ‘Cruise Tour Planner’ that filters destinations based on your exact docking and sailing hours.” The operationally distinct feature — the planner that handled the constraint-handling specific to cruise arrivals — was preserved as a named differentiator in the AI’s response. The system did not paraphrase the feature into generic language; it preserved the named tool and its constraint logic.

Constraint handling observed: The AI Mode response sequenced the planning logic (arrival time, docking duration, sailing time, destination selection) using the page’s structured planner as the reasoning scaffold rather than generating generic tour-planning prose. This is the most operationally interesting aspect of the observation: the AI did not just cite the page; it reused the page’s constraint-handling structure inside its own answer.

Entity preservation: Named source, branded card, business presented as a distinct callable entity. Strong on all three.

Attribution persistence: Daves Taxis persisted across multiple planning-context follow-up queries within the same session, including variations in arrival time and tour duration. The system retained the entity and re-applied it to adjacent planning contexts rather than re-retrieving alternatives each turn.

Confounders: Daves Taxis already ranked third organically for “taxi cruise tours Southampton” before the observation, so organic ranking strength is a plausible contributor. The local sector also has fewer competing entities than the criminal defence sector, lowering the bar for entity preservation. Disentangling AI Mode’s feature-extraction behaviour from underlying organic visibility is not possible from a single observation.

Interpretation confidence: High on the feature-extraction observation specifically. The verbatim preservation of “Cruise Tour Planner” as a named tool and the use of its constraint-handling structure in the AI’s reasoning are documented in the captured response. Medium on the broader implication. The hypothesis it generates needs cross-vertical replication before it carries serious evidential weight.

Hypothesis generated: AI retrieval systems may preferentially preserve operationally distinct, named features (planners, calculators, structured decision tools) when those features handle a constraint the user’s query implies. The feature is not just extractable content; it is reusable operational structure the AI can scaffold its own answer around. Generic prose appears more likely to be paraphrased away; named operational tools appear more likely to be preserved verbatim. If this hypothesis holds across further observations, the implication is that high-value pages should increasingly contain named, constraint-aware operational structures (planners, decision flows, calculators) rather than only descriptive prose.

What this register is not

This is not a results portfolio. It is not a case study collection. It is not a marketing surface. The register exists to document observed AI behaviour with enough structural consistency that the observations can be compared, the hypotheses can be tested, and conclusions can be drawn or refuted as the entry count grows.

Two entries is a start, not a pattern. Eight to twelve entries across multiple sectors would be enough to begin saying which observations recur, which were idiosyncratic, and which hypotheses are earning evidential weight. The register’s value compounds with entry count and time span, not with any individual observation.

Methodology and update cadence

Each entry is captured at the time of observation: the query, the screenshot, the surface, the response. Entries are added in the order observed. Existing entries are not edited retroactively. If a later observation contradicts an earlier one, both are kept and the contradiction is named in a new entry.

The register is updated as observations occur, not on a fixed cadence. Quality of observation beats quantity of entries.

For the framework standards these observations are tested against, see the CITATE framework, the AI Discovery Stack, and the full SEO Strategy Frameworks register. For the commercial engagement that produces this work, see the AI Visibility Audit.

Key Definitions

Observed-outcomes register: A structured, append-only public log of documented AI retrieval, citation, and feature-extraction behaviour observed on client work. Each entry uses the same fields so observations are comparable over time and across sectors. Entries are observations, not proofs — documented data points whose evidential weight comes from accumulation, not from any individual record.
Feature extraction: The behaviour observed when an AI system preserves a named, operationally distinct feature from a source page verbatim in its own response — particularly when that feature handles a constraint the user’s query implies (timing, eligibility, location, sequence). Distinct from paraphrastic summarisation, where the system rewrites the source’s information in generic language. The Daves Taxis Cruise Tour Planner observation is the canonical example.
Attribution persistence: Whether an AI system continues to surface, name, and credit the same source across follow-up queries in the same session, rather than re-retrieving alternatives each turn. High attribution persistence indicates the system has formed a session-level entity preference for the source.

Frequently Asked Questions

What is the observed-outcomes register?

A public log of documented AI retrieval, citation, and feature-extraction behaviour observed on SEO Strategy Ltd client work. Each entry uses the same structured fields so observations are comparable over time. The register exists to generate evidence for or against hypotheses about how AI retrieval and recommendation systems behave — not to prove a framework correct.

Is this a case study collection?

No. Case studies present client results from a marketing angle. The observed-outcomes register documents AI behaviour with structural consistency for cross-comparison, names the confounders, and rates the interpretation confidence. The two formats serve different purposes; this register is for evidence, not promotion.

What does an entry need to qualify for inclusion?

A specific AI surface, a specific query, a documented response with screenshot evidence, and enough information to fill all eleven structured fields including confounders and interpretation confidence. Observations that cannot be honestly rated for confidence or that have unnamed confounders do not get added.

Why only two entries at launch?

Quality of observation beats quantity of entries. Two structured entries are more useful than ten loosely-described ones. The register is append-only and will compound over months — eight to twelve entries across multiple sectors would be enough to begin saying which observations recur and which hypotheses are earning evidential weight.

What is feature extraction and why does it matter?

Feature extraction is the behaviour observed when an AI system preserves a named, operationally distinct feature from a source page verbatim — particularly when the feature handles a constraint the user’s query implies. Distinct from paraphrastic summarisation. If this pattern holds across observations, the implication is that high-value pages should increasingly contain named, constraint-aware operational structures (planners, calculators, decision tools) rather than only descriptive prose.

Founder of SEO Strategy Ltd with 20+ years in SEO, web development and digital marketing. Specialising in healthcare IT, legal services and SaaS — from technical audits to AI-assisted development.

Ready to improve your search visibility?

Book a free 30-minute consultation and let's discuss your SEO strategy.

Get in Touch