Pages that get cited by answer engines aren’t usually “AI optimized” after the draft is done. They’re briefed as source-of-truth assets before writing starts. By the time someone’s adding schema to a finished post, the page either had the bones to get cited or it didn’t.

This guide is the workflow for getting those bones right. It’s organized as a sequence: pick the job, write the brief, structure the content, lay the evidence, configure the technical layer, verify with a prompt panel after publish. Skip a step and the rest of the work compounds in the wrong direction.

The four jobs a citable page can do

Every citable page does one primary job. Trying to do all four at once is the most common failure mode I see — pages that are part glossary, part comparison, part case study, and part product page tend to get cited for none of those purposes.

Page job | User prompt | Page goal
Define | “What is X?” | Give a precise answer with scope
Compare | “X vs Y for Z” | Help the user choose, with criteria
Prove | “Is X effective?” | Show evidence, methodology, results
Instruct | “How do I do X?” | Steps, prerequisites, common mistakes

Pick one before writing. The page can support secondary questions in adjacent sections, but the primary job determines structure, evidence, and schema.

Brief the page around a prompt, not a keyword

This is the editorial change that produces the most downstream lift. Briefs that target keywords produce content that mentions a topic. Briefs that target prompts produce content that answers a question.

A weak brief looks like this:

Target keyword: AI visibility platform
Word count: 2,000
Include: definition, benefits, use cases, comparison

A strong brief looks like this:

Primary prompt:
"How do I know whether my brand is being cited in ChatGPT, 
Perplexity, Gemini, and Google AI Overviews?"

Page job:
Instruct (with secondary compare element)

Source-of-truth claim:
AI visibility should be measured by prompts, mentions, citations, 
cited URLs, and answer accuracy — not by traffic alone.

Evidence required:
- Methodology section with the exact panel structure
- Sample prompt panel (5-10 prompts)
- Example raw export showing log fields
- Limitations of the approach

Anti-goals:
- Don't make this a vendor comparison
- Don't promise specific lift numbers
- Don't claim attribution we can't measure

The strong brief gives the writer a retrieval target. The page exists to answer one specific prompt — and everything in it serves that answer or stays out.

The “anti-goals” section is underrated. Most page bloat comes from scope added mid-draft. Naming what the page is not doing makes those additions easier to refuse.
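
If briefs live in a repo or CMS rather than a doc, the same fields translate into a small structured template. A minimal sketch in Python (the field names are illustrative, not a standard; adapt them to whatever your team already tracks):

brief = {
    "primary_prompt": (
        "How do I know whether my brand is being cited in ChatGPT, "
        "Perplexity, Gemini, and Google AI Overviews?"
    ),
    "page_job": "instruct",  # one of: define, compare, prove, instruct
    "source_of_truth_claim": (
        "AI visibility should be measured by prompts, mentions, "
        "citations, cited URLs, and answer accuracy."
    ),
    "evidence_required": [
        "methodology section with the exact panel structure",
        "sample prompt panel (5-10 prompts)",
        "example raw export showing log fields",
        "limitations of the approach",
    ],
    "anti_goals": [
        "no vendor comparison",
        "no promised lift numbers",
        "no attribution claims we can't measure",
    ],
}

The format matters less than the effect: every field forces a decision the writer would otherwise be making mid-draft.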

What the first screen needs to do

The first screen — what’s visible without scrolling — has to answer four questions: what is this page about, who is it for, what’s the answer, and why should the reader trust it.

This is also the screen retrieval systems weight most heavily. If the specific answer lives in section 4, the chunk retrieved for the user’s query is unlikely to be the one that states it plainly. Lead with the answer.

A working pattern:

# How to track AI visibility across answer engines

Updated: May 7, 2026
Author: Charles Morris

AI visibility should be tracked by prompt, answer engine, brand 
mention, citation URL, and answer accuracy. Traffic alone misses 
cases where a brand is recommended without a click or cited 
through a third-party source.

## tl;dr
- Track mentions and citations separately.
- Build prompts from customer language, keyword data, and 
  semantic fanout.
- Verify on a fixed schedule because answer engines change.

That’s not decorative structure. The retrieval system gets a clean title, a date, a named author entity, a direct answer, and a summary — all in the first chunk. The reader gets the same things, in the same order.

How each section should be structured

Every important section should answer one question and keep its evidence local. A section that requires reading three other sections to make sense is harder for retrieval systems to surface accurately.

The pattern I’d use:

## How often should we re-run an AEO prompt panel?

Re-run important AEO prompts every two weeks, or immediately 
after major page, product, or platform changes. A fixed schedule 
makes answer drift visible without turning every variance into 
an emergency.

Example panel:
- 20 prompts
- 4 answer engines (ChatGPT, Claude, Perplexity, Gemini)
- 1 target page per prompt
- Fields: answer, mentions, citations, cited URLs, notes

Limitation:
This doesn't measure personalized or logged-in answer states, 
which can differ significantly from anonymous queries.

Heading is the question. First sentence is the direct answer. Example is concrete. Limitation is named. The whole section reads correctly when retrieved alone — which is exactly how retrieval systems consume it.

This is the pattern from the chunking piece, applied at section scale. Lead with the answer, keep the supporting context local, name the limitation in the same chunk as the claim.
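
To make the retrieval mechanics concrete, here is a minimal sketch of heading-based chunking, roughly the shape many retrieval pipelines use. The splitting rule and the sample text are illustrative assumptions, not any specific engine’s implementation:

import re

def chunk_by_h2(page_text: str) -> list[str]:
    """Split a page into section-level chunks, one per H2 heading.

    Illustrative assumption: an H2 plus its body approximates one
    retrieval unit. Real pipelines also split long sections by tokens.
    """
    parts = re.split(r"\n(?=## )", page_text)
    return [part.strip() for part in parts if part.strip()]

sample = """## How often should we re-run an AEO prompt panel?
Re-run important AEO prompts every two weeks, or immediately after
major page, product, or platform changes.

## What does the panel log?
Answer, mentions, citations, cited URLs, notes.
"""

for chunk in chunk_by_h2(sample):
    # Each chunk carries its question, its answer, and its evidence together.
    print(chunk.splitlines()[0])

A section written answer-first survives this kind of split intact; a section whose answer depends on text three headings away does not.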

Match evidence to claim type

Different claims need different evidence. A common page failure is using one type of evidence (a customer quote, say) for every type of claim, including ones that need methodology or documentation.

The map I’d use:

Claim type | Evidence to include
Technical behavior | Official docs, reproducible test, source code, standards
Product capability | Product page, docs, screenshots, API reference
Performance lift | Study, analytics extract, methodology, sample size
Comparison | Table with criteria, dates, source links
Process recommendation | Checklist, template, example output

Two principles to apply:

Evidence sits next to the claim it supports. If the claim is in the introduction and the methodology is on a separate page, the chunk containing the claim has nothing to verify it. Move methodology into the section, even at the cost of some duplication across pages.

Vendor evidence isn’t free. A vendor’s own published data is informative but not independent. If the only evidence for a claim is the company’s own marketing material, mark it that way (use phrases like “the vendor reports” or “according to their published case study”) so the claim’s confidence level is honest.

Google’s AI features documentation is a useful constraint here. Google says there are no additional technical requirements for AI Overviews or AI Mode beyond Search eligibility, and recommends making important content textually available with structured data that matches visible content. Translation: evidence and visible content do the heavy lifting. There is no “AI file” that compensates for vague claims and missing evidence.

Schema: less than you’ve been told, in different ways than you’ve been told

The schema landscape has shifted significantly, and most AEO advice on schema is operating on outdated assumptions.

What’s changed:

  • FAQ rich results are restricted to well-known, authoritative government and health sites. Google announced this in August 2023 and tightened it further in their March 2026 core update. Adding FAQ schema to a marketing or comparison page no longer produces rich results for the vast majority of sites.
  • HowTo rich results are deprecated entirely. Google removed HowTo rich results from desktop and mobile and dropped Search Console reporting for it.
  • Schema’s role has shifted from “rich result trigger” to “entity verification signal.” Google’s March 2026 update reportedly increased the weight of schema for AI Mode source selection — schema that accurately describes content increases citation probability even when no rich result displays.

What this means for page planning: don’t add FAQ or HowTo schema as an AEO tactic. The rich results aren’t coming. But do add accurate Article (or a more specific subtype) schema, with author, dates, and entity relationships that match the visible page. That’s the markup that helps AI engines verify what your page is about.

A working Article example:

{
  "@context": "https://schema.org",
  "@type": "Article",
  "headline": "How to track AI visibility across answer engines",
  "datePublished": "2026-05-07",
  "dateModified": "2026-05-07",
  "author": {
    "@type": "Person",
    "name": "Charles Morris",
    "url": "https://optimizeaeo.io/about/"
  },
  "publisher": {
    "@type": "Person",
    "name": "Charles Morris"
  }
}

The principle: schema should describe the visible page, not create a parallel version of the answer. If your schema and your visible content tell different stories, the engine will trust visible content over markup. Schema that lies about the page is worse than no schema at all.
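
A lightweight way to enforce that principle is to diff the schema against the rendered page before publish. A minimal sketch using only the standard library; real validation should go through a proper HTML parser and Google’s Rich Results test, and the URL below is a placeholder:

import json
import re
import urllib.request

def article_schema_matches_h1(url: str) -> bool:
    """Rough check that Article schema describes the visible page.

    Compares the schema headline against the visible H1. Placeholder
    logic only; a production check should also compare dates and author.
    """
    html = urllib.request.urlopen(url).read().decode("utf-8", "ignore")
    h1_match = re.search(r"<h1[^>]*>(.*?)</h1>", html, re.S)
    visible_h1 = re.sub(r"<[^>]+>", "", h1_match.group(1)).strip() if h1_match else ""
    for block in re.findall(
        r'<script[^>]*application/ld\+json[^>]*>(.*?)</script>', html, re.S
    ):
        try:
            data = json.loads(block)
        except ValueError:
            continue  # skip malformed JSON-LD blocks
        if data.get("@type") == "Article":
            return data.get("headline", "").strip() == visible_h1
    return False

print(article_schema_matches_h1("https://example.com/ai-visibility"))  # placeholder URL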

Crawler access is part of shipping, not a separate task

Most teams think of crawler access as an infrastructure concern handled by someone else. For AEO work, that separation is wrong. If the page can’t be fetched by the right systems, the content work is incomplete.

Pre-publish checklist:

[ ] URL returns 200 without authentication
[ ] Important text appears in rendered HTML (not only post-JS)
[ ] Page is internally linked from a relevant hub or pillar
[ ] XML sitemap includes the URL where appropriate
[ ] robots.txt allows the search and retrieval bots you want
    - Googlebot
    - OAI-SearchBot (ChatGPT search)
    - ChatGPT-User (user-triggered retrieval)
    - Claude-SearchBot (Claude search)
    - Claude-User (user-triggered retrieval)
[ ] CDN/WAF rules don't block legitimate AI user agents
[ ] noindex, nosnippet, max-snippet, data-nosnippet rules are intentional

OpenAI and Anthropic both document separate bots for training, search, and user-triggered retrieval. That separation matters at the page level — a comparison page might be valuable enough to allow user-triggered retrieval, even if the broader site blocks training crawls. (For more on this, see the reference piece on robots.txt vs llms.txt.)
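
A minimal robots.txt sketch for that kind of split policy — retrieval allowed, training disallowed. GPTBot and ClaudeBot are the documented training/crawl agents; whether to block them is a policy decision, not a recommendation here:

# Search and user-triggered retrieval: allowed
User-agent: OAI-SearchBot
Allow: /

User-agent: ChatGPT-User
Allow: /

User-agent: Claude-SearchBot
Allow: /

User-agent: Claude-User
Allow: /

# Training crawls: blocked in this example policy
User-agent: GPTBot
Disallow: /

User-agent: ClaudeBot
Disallow: /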

The pre-publish moment is when these decisions cost the least. After publish, fixing access is a re-crawl wait of a day to a few weeks depending on the platform.
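
The access items on that checklist are cheap to script. A minimal pre-publish sketch (the URL and phrase are placeholders; it checks only that the page returns 200 without authentication and that the key answer text is present in raw HTML rather than injected by JavaScript):

import urllib.request

def page_is_fetchable(url: str, must_contain: str) -> bool:
    """Check the two cheapest access conditions before publish."""
    req = urllib.request.Request(url, headers={"User-Agent": "pre-publish-check"})
    with urllib.request.urlopen(req) as resp:
        if resp.status != 200:
            return False
        html = resp.read().decode("utf-8", "ignore")
    return must_contain in html

# Placeholder values
print(page_is_fetchable(
    "https://example.com/ai-visibility",
    "AI visibility should be tracked by prompt",
))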

Verify with a prompt panel after publishing

The work isn’t over at publish. A page can be live, indexed, and still uncited — or cited but described incorrectly. Both states are possible, and both need different remediation. You won’t know which without testing.

A starter post-launch panel:

Prompt 1: What is [topic]?
Prompt 2: How do I [task]?
Prompt 3: [brand] vs [competitor] for [use case]
Prompt 4: Best [category] tools for [audience]
Prompt 5: Does [brand/page] support [specific feature]?

For each prompt, log:

Field | Why it matters
Engine and mode | Answers vary by system and by mode
Exact prompt | Small wording changes produce different answers
Brand mentioned | Mention without click is still influence
Target page cited | Citation confirms the page is being used
Other sources cited | Shows who currently owns the evidence
Accuracy notes | Inaccurate citation can be worse than no citation

Run the panel three times: at publish, after indexing (2-7 days later), and again two weeks after that. Citation behavior often takes a week or two to stabilize, and answer engines change models and retrieval behavior on their own schedule.
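
If the log lives in a spreadsheet or CSV rather than a tool, a minimal record sketch looks like the following. The field names mirror the table above; nothing here calls an engine’s API — the answers are pasted in by whoever runs the panel:

import csv
from dataclasses import asdict, dataclass, fields

@dataclass
class PanelResult:
    run_date: str             # e.g. "2026-05-07"
    engine: str               # e.g. "ChatGPT", "Perplexity"
    mode: str                 # e.g. "search", "default"
    prompt: str               # the exact wording used
    brand_mentioned: bool
    target_page_cited: bool
    other_sources_cited: str  # who currently owns the evidence
    accuracy_notes: str

def append_result(path: str, result: PanelResult) -> None:
    """Append one observation to a CSV log, writing a header on first use."""
    with open(path, "a", newline="") as f:
        writer = csv.DictWriter(f, fieldnames=[fl.name for fl in fields(PanelResult)])
        if f.tell() == 0:
            writer.writeheader()
        writer.writerow(asdict(result))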

If the page isn’t cited after the third run, the diagnostic order is: check access first, check on-page answer clarity second, check competitive evidence third. Most failures are at the access or clarity layer, not at the brand-authority layer.

Common shipping mistakes

Five mistakes I see repeatedly:

1. Writing for a topic instead of a prompt. The page covers the topic broadly. No section is the best answer to any specific question. Cited pages own one prompt; topic pages own none.

2. Separating evidence from claims. The claim is in the introduction. The data is in a PDF. The methodology is on a separate page. The retrieval system can’t assemble trust from fragments scattered across surfaces.

3. Publishing thin comparison pages. A comparison page without criteria, scope, dates, and source links is less citable than a third-party review of the same products. If your comparison page can’t beat G2 or Capterra on substance, the engine will cite them.

4. Hiding the answer below a marketing intro. Three paragraphs of context before the actual answer is a pattern that worked when humans read top-to-bottom. Retrieval systems don’t read top-to-bottom. They retrieve the chunk most relevant to the query, which may not be the chunk you wrote first.

5. Treating schema as a patch. If the visible page is vague, schema will not make it authoritative. Markup amplifies clarity that’s already there. It cannot create clarity from absence.

The final pre-publish checklist

A consolidated version of everything above. Run this before hitting publish on any AEO-targeted page.

PURPOSE
[ ] Primary prompt is written in natural-question form
[ ] Page job is named: define, compare, prove, or instruct
[ ] One source-of-truth claim is identified
[ ] Anti-goals are listed (what this page is NOT)

CONTENT
[ ] First paragraph answers the primary prompt directly
[ ] tl;dr appears near the top
[ ] H2s are questions or direct claims (not generic labels)
[ ] Every H2 starts with a direct answer in the first sentence
[ ] Every major claim has nearby evidence
[ ] Dates, scope, and methodology are visible

TECHNICAL
[ ] Page is crawlable and indexable where intended
[ ] Important content is text, not image-only
[ ] Article schema matches visible page content
[ ] No FAQ or HowTo schema as a tactical add-on
[ ] Internal links connect to related source pages
[ ] robots.txt permits intended AI bots

VERIFICATION
[ ] 5-prompt verification panel is ready
[ ] Citation, mention, and accuracy fields are defined
[ ] Owner is assigned for post-launch fixes
[ ] Re-run schedule is set (publish / +7 days / +14 days)

What to do Monday morning

  1. Pick one upcoming page brief and rewrite the top section: replace “target keyword” with a primary natural-language prompt and add a source-of-truth claim.
  2. Add an evidence-required list and an anti-goals list to the brief. Two sentences each is enough.
  3. On the page itself, move the answer above any setup or marketing intro. The first 60 words after the title should contain the actual claim.
  4. Place evidence next to the claim it supports, not in a separate page or PDF.
  5. Audit your robots.txt for the AI search and user-triggered bots. Decide intentionally rather than inheriting whatever was there before.
  6. Replace FAQ or HowTo schema with accurate Article schema that matches the visible page content.
  7. Build a 5-prompt verification panel. Run it three times: publish day, 7 days later, 14 days later. Log mentions, citations, and accuracy.

The boring takeaway: citable pages are made by planning, not by post-hoc optimization. By the time you’re adding schema or tweaking headings, the page has either earned its citations or it hasn’t. The leverage is in the brief.

Sources