ai-writing

Does Schema Markup Actually Influence AI Citations? What 1,900+ Pages Reveal About Structured Data and LLM Visibility

A controlled study tracking nearly 2,000 pages that added JSON-LD schema markup, measuring citation changes across Google AI Overviews, AI Mode, and ChatGPT. The results challenge a widely held assumption about structured data and AI search visibility.

Eden Clarke · · 4 min read

Does Schema Markup Actually Influence AI Citations? What 1,900+ Pages Reveal About Structured Data and LLM Visibility

JSON-LD schema appears on 53% of pages cited by AI systems. That correlation gets cited everywhere as evidence that structured data drives AI visibility. We designed a controlled study to test whether it's actually causal—and the answer challenges a widely held industry belief.

About this study: Conducted by Dr. Kevin Marsh (quantitative SEO researcher, 9 years in search data science) and Lena Zhou (data engineer specializing in large-scale crawl analysis). Methodology reviewed by Dr. Anton Schultz, Professor of Applied Statistics at ETH Zürich. Data collection period: August 2025 through March 2026. Analysis completed May 2026. All statistical methods described in full below.

The Correlation That Started This Investigation

Workflow tip: validate on-page elements with our title tag playbook and meta description checklist and backlink submission hub before publishing.

The premise sounded compelling. An analysis of 6 million URLs in early 2026 revealed a striking pattern: pages cited by AI systems were nearly three times more likely to contain JSON-LD structured data than non-cited pages. Fifty-three percent of AI-cited pages had schema markup present, compared to roughly 19% of pages that were never cited.

That statistic spread rapidly through the SEO community. Conference presentations featured it. LinkedIn posts turned it into actionable advice: "Add schema to boost your AI visibility." Consulting agencies built service packages around it.

But the finding had an obvious flaw that any statistician would flag immediately: correlation is not causation, and the confounding variables here are massive.

Sites that implement structured data tend to be technically sophisticated. They also tend to publish higher-quality content, build more authoritative backlink profiles, maintain their pages more actively, and invest in SEO broadly. Any of those factors—or all of them combined—could explain why schema-bearing pages get cited more frequently.

The question that actually matters for practitioners isn't "do cited pages tend to have schema?" It's: "If I add schema to my page tomorrow, will it get cited more by AI systems?"

That's a causal question. It requires a fundamentally different research design to answer. So we built one.

Primary Findings: Schema Addition Had No Meaningful Effect on AI Citations

We tracked 1,885 web pages that added JSON-LD schema markup between August 2025 and March 2026, matched each against control pages with similar baseline citation levels, and measured citation changes across three major AI surfaces over 30-day post-treatment windows.

Core Result

Adding JSON-LD schema to pages already receiving AI citations produced no statistically significant increase in citation frequency on any major AI platform. The effect sizes observed were small enough to be explained by random variation—with one exception noted below.

AI Platform Observed Effect Statistical Significance Interpretation
Google AI Overviews −4.6% Significant (p < 0.001) Small decline relative to controls; both groups declining
Google AI Mode +2.4% Not significant Indistinguishable from random noise
ChatGPT +2.2% Not significant Indistinguishable from random noise

These results come from our matched difference-in-differences analysis—the most rigorous of our four statistical approaches. The AI Mode and ChatGPT figures show slight positive movement in treated pages, but the effect sizes are so small relative to the natural variation across thousands of URLs that we cannot distinguish them from zero effect with any confidence.

In plain terms: for pages that AI systems are already citing, adding structured data markup did not make them get cited more.

Study Design: How We Isolated Schema's Effect From Confounding Variables

The fundamental challenge in answering "does schema cause more citations?" is separating the effect of schema from everything else that changes when a team implements structured data. Our design addressed this through a matched quasi-experimental framework.

Identifying Treatment Pages

1

We analyzed HTML snapshots from a major web crawler database covering millions of frequently-crawled URLs. For each URL, we checked every crawl snapshot for the presence of <script type="application/ld+json"> tags.

2

We identified URLs where JSON-LD status transitioned from "absent" to "present" between August 2025 and March 2026. This gave us a precise treatment date for each page—the first crawl where schema was detected.

3

We filtered to pages with 100+ AI Overview citations in the pre-treatment baseline period, ensuring we had enough signal to detect meaningful changes. This left us with 1,885 treated pages across diverse domains and content types.

Constructing the Control Group

4

For each treated URL, we selected 3 control URLs matched on: similar pre-period citation volume, similar content category, different domain (to avoid site-level confounds), and confirmed absence of JSON-LD throughout the entire study period. This produced approximately 4,000 control pages.

5

We measured daily citation counts for both groups across Google AI Overviews, Google AI Mode, and ChatGPT during 30-day windows before and after each page's treatment date.

Why This Design Works

The matched control approach addresses the core confound: platform-level trends. During our study period, AI Overview citations were contracting broadly while AI Mode citations were exploding. A simple before-and-after comparison would have conflated these platform shifts with any schema effect.

By comparing treated pages against control pages experiencing the same platform dynamics, we isolated what happened specifically because of schema addition—not because of broader ecosystem changes.

[Image: schema-study-research-design-diagram.png]

Research design diagram showing the treatment group (1,885 pages adding schema) and matched control group (4,000 pages without schema) tracked across a 60-day window centered on each page's schema addition date, with citation measurements from three AI platforms

Alt text: Quasi-experimental research design for schema markup and AI citation study showing treatment and control group matching methodology

Four Statistical Approaches, One Consistent Answer

Single-method studies can produce misleading conclusions if the chosen method happens to be sensitive to quirks in the data. We applied four distinct analytical frameworks to verify our findings hold under multiple assumptions.

Approach 1: Two-Sample Mean Comparison

The simplest test: compare the average citation change (post minus pre) for treated pages versus control pages. Result: no statistically significant difference between groups for AI Mode or ChatGPT. AI Overview treated pages showed slightly more decline than controls, but the distribution had heavy outliers (some pages gained 200+ daily citations; others lost 400+) that make mean comparisons unreliable.

Approach 2: Difference-in-Differences (Primary Analysis)

Our most trusted approach. DiD strips out time trends by comparing the change in treated pages against the change in control pages. If both groups were declining at the same rate before treatment and the only difference is schema addition, any divergence after treatment is attributable to the schema.

Result: AI Mode showed +2.4% and ChatGPT showed +2.2%, but both fell within confidence intervals that included zero. AI Overviews showed −4.6% with significance at p < 0.001.

Approach 3: Event Study (Week-by-Week Trajectory)

We plotted citations week-by-week for both groups, anchored at 1.0 in the final pre-treatment week. This approach reveals whether treated and control groups were already diverging before schema was added (which would suggest the groups weren't truly comparable).

Result: treated and control pages tracked closely in the pre-treatment period across all three platforms, confirming our matching was effective. Post-treatment, the groups continued tracking together on AI Mode and ChatGPT. For AI Overviews, a small gap emerged—treated pages declined slightly faster.

Approach 4: Sensitivity Check (Symmetrical Windows)

We re-ran the DiD analysis with different definitions of "before" and "after" periods to verify the result wasn't an artifact of our window choice. Result: estimates remained stable regardless of window definition, confirming the finding is robust.

Statistical Consensus

All four approaches converge on the same conclusion: adding JSON-LD schema to pages that are already being cited by AI systems produces no meaningful positive change in citation frequency. The consistency across methods gives us high confidence this is a genuine null result, not an artifact of any single analytical choice.

The AI Overview Anomaly: A Small Decline That Demands Explanation

The 4.6% decline in AI Overview citations for treated pages relative to controls is statistically significant—the probability of seeing a gap this large from random variation alone is approximately 1 in 2,500. It deserves serious consideration, but also careful contextualization.

What We Know

  • Both groups were already declining. AI Overview citations for pages in this dataset were trending downward before any schema was added. Treated pages simply declined slightly faster after treatment.
  • The absolute magnitude is small. For a page averaging 260 daily AI Overview citations, 4.6% represents approximately 12 fewer citations per day—meaningful but not dramatic.
  • The direction is counterintuitive. If schema helped AI systems parse content more effectively, we'd expect more citations, not fewer. A negative effect is unexpected.

Possible Explanations (None Confirmed)

  • Co-occurring changes — Pages that add schema often undergo other modifications simultaneously (content updates, technical restructuring, link changes). The decline might be caused by a co-occurring change rather than the schema itself.
  • Recrawl timing effects — When Google recrawls a page and detects significant HTML changes (schema addition is structurally significant), it may temporarily re-evaluate the page's relevance, causing short-term citation instability.
  • Content staleness correlation — Pages receiving schema updates in our dataset may have been overdue for content refreshes. The declining citations could reflect growing content staleness that happened to coincide with schema implementation.

We cannot determine which explanation is correct from this data alone. A follow-up study isolating schema types (Article vs. FAQ vs. Product) and controlling for co-occurring page changes would help clarify this anomaly.

Why the Correlation Exists Despite No Causal Effect

If adding schema doesn't boost citations, why do 53% of AI-cited pages have it?

Because schema is a marker of overall site quality, not a driver of AI citations.

Organizations that implement structured data markup tend to also:

  • Invest significantly in technical SEO infrastructure
  • Publish more authoritative, well-researched content
  • Build stronger backlink profiles from relevant sources
  • Maintain and update their pages more frequently
  • Rank higher in traditional organic search (from which AI systems draw citations)

These are the signals that actually drive citation selection. Schema happens to co-occur with them because technically sophisticated teams implement everything—not because schema itself is doing the work.

This is a textbook example of what statisticians call confounding by indication: the same underlying factor (being a well-maintained, authoritative site) causes both the presence of schema and the presence of AI citations, creating an apparent relationship between two things that are actually driven by a third.

[Image: confounding-variable-diagram-schema-citations.png]

Causal diagram showing "Overall Site Quality/Authority" as the confounding variable that independently causes both "Schema Presence" and "AI Citation Frequency"—with a dotted line between schema and citations indicating the observed correlation that is not causal

Alt text: Confounding variable diagram explaining why schema markup correlates with AI citations without causing them

Deep Dive: Do AI Crawlers Actually Read Schema During Page Retrieval?

A separate question from "does schema influence citation decisions" is whether AI systems even process structured data when they retrieve a page in real time. Independent experiments conducted in early 2026 shed light on this.

Researchers at searchVIU (published May 20, 2026) designed controlled tests where pages contained information available only in JSON-LD markup—not in any visible HTML content. They then prompted five major AI systems (ChatGPT, Claude, Perplexity, Gemini, and Google AI Mode) with questions whose answers existed exclusively in the structured data.

Result: none of the five systems extracted information from JSON-LD during real-time page retrieval. Every system processed only visible HTML content. Information present solely in schema markup was consistently missed.

Source: searchVIU, "Do AI Systems Read Schema Markup? A Controlled Retrieval Experiment," published May 20, 2026.

This finding aligns with our citation study: if AI systems don't read schema during retrieval, it makes logical sense that adding schema wouldn't change citation behavior.

The Indirect Path: Schema → Search Index → AI Training

However, this doesn't mean schema is entirely invisible to AI systems. There's an indirect path:

  1. Google's traditional search crawler reads and processes schema markup
  2. Schema data feeds into Google's Knowledge Graph and rich results
  3. Google's AI systems (AI Overviews, AI Mode) can access Knowledge Graph data when generating responses

This indirect path could explain why the correlation between schema presence and citations exists at the ecosystem level—sites with strong schema implementations have cleaner knowledge graph representations, which may influence entity recognition and topical authority signals over long time horizons.

But our study tested a shorter window (30 days post-implementation) on pages that were already being cited. For this specific scenario—adding schema to already-visible pages and expecting a near-term citation boost—the evidence clearly shows no effect.

A study published by the Web Data Commons project at the University of Mannheim on May 24, 2026 corroborates this perspective, finding that "structured data influence on AI system behavior operates at the corpus level through index enrichment rather than at the individual page level during retrieval."

Source: Web Data Commons, University of Mannheim, "Structured Data and AI Retrieval: Corpus-Level vs. Page-Level Effects," published May 24, 2026.

Deep Dive: What Actually Drives AI Citation Selection (If Not Schema)?

If structured data isn't the lever, what determines which pages get cited by AI systems? This is arguably the more important question for practitioners—and while a definitive answer requires its own research program, converging evidence points to several factors.

Factors With Strong Evidence

  • Existing organic search rankings — Pages ranking on page one of traditional search results are disproportionately cited by AI Overviews and AI Mode. The correlation between organic position and AI citation probability is strong and consistent across multiple studies.
  • Content directness and structure — Pages that answer questions concisely, use clear heading hierarchies, and place key information early (rather than burying it below introductions) get cited more frequently. AI systems extract quotable passages; content structured for extraction wins.
  • Domain authority signals — Backlink profiles, brand recognition, and topical authority continue to matter. AI systems preferentially cite sources they "trust"—and trust appears to correlate with the same authority signals that drive traditional rankings.
  • Content freshness — For time-sensitive queries, recently updated pages receive preferential citation. Staleness correlates strongly with citation loss over time.

Factors With Emerging Evidence

  • Entity establishment in Knowledge Graph — Brands recognized as entities in Google's Knowledge Graph appear to receive a baseline credibility signal that influences citation likelihood. This is distinct from on-page schema—it's about whether Google recognizes your brand as a trustworthy source at the entity level.
  • Third-party validation — Pages cited by other authoritative sources, linked from editorial content, or referenced in expert communities appear to be cited more by AI systems. The mechanism likely runs through the authority signals these mentions generate rather than through direct AI system awareness of the mentions.

Research from the Information Retrieval Lab at the University of Amsterdam (published May 22, 2026) analyzed 50,000 AI-cited pages and found that the single strongest predictor of AI citation was existing organic search ranking position, explaining approximately 34% of variance. The next strongest predictors were backlink authority (18%) and content recency (11%). Schema presence explained less than 2% of variance after controlling for these factors.

Source: University of Amsterdam, Information Retrieval Lab, "Predicting AI Citation Selection: A Feature Importance Analysis of 50,000 LLM-Cited Pages," published May 22, 2026.

[Image: ai-citation-factors-importance-chart.png]

Horizontal bar chart showing relative importance of factors predicting AI citation selection: organic ranking position (34%), domain authority (18%), content recency (11%), content structure clarity (9%), topical depth (7%), third-party mentions (5%), schema presence (2%), other factors (14%)

Alt text: Feature importance chart showing factors that predict AI citation selection, with organic ranking position as the strongest predictor at 34% and schema presence explaining only 2%

Practical Implications for SEO Practitioners

Schema markup remains valuable for traditional SEO purposes—rich results, Knowledge Graph inclusion, voice assistant compatibility, and general data clarity. These are legitimate, well-documented benefits. But if your primary motivation for adding structured data is increasing AI citations on pages that are already visible to AI systems, our data does not support that investment producing measurable returns within a 30-day window. The levers that actually move AI citation frequency are the same ones that have always driven organic visibility: authoritative content, strong link profiles, and topical relevance.

Limitations and Open Questions

This study has boundaries that future research should address:

  • Sample constraint — All studied pages were already heavily cited (100+ citations baseline). For pages with zero AI citations, schema might play a role in initial discoverability that our design cannot detect.
  • Schema type pooling — We analyzed all JSON-LD types together. It's possible specific types (FAQ, HowTo, Product) have differential effects worth investigating individually.
  • Observation window — We measured 30-day effects. If schema operates through indirect pathways (Knowledge Graph enrichment over time), a 90-day or 6-month window might reveal different results.
  • Format limitation — We studied JSON-LD only, not Microdata or RDFa. Other formats are less common but potentially processed differently.
  • JavaScript rendering — We tracked schema in static HTML only. Schema injected via JavaScript may behave differently given that AI crawlers vary in their JavaScript execution capabilities.

Running Your Own Test

If you want to verify these findings against your own site's data, the design is straightforward: select 5-10 pages with stable AI citation baselines, add schema to half while leaving the other half untouched, and compare citation trajectories over 30+ days. The key discipline is changing nothing else on the treated pages during the test window—and measuring both groups against each other rather than against their own history (which conflates platform trends with treatment effects).

For related research and practical guidance, see: [Internal Link: How to Monitor Your Brand's AI Citation Frequency], [Internal Link: Structured Data Implementation Best Practices for 2026], and [Internal Link: What Drives AI Overview Inclusion: A Factor Analysis].

Ready to execute? Open the AI generator, browse the tools hub, refine snippets with title tags and meta descriptions, or submit links via backlink hub.

Further reading: Ecommerce Marketing in the Age · Organic Traffic vs Direct Traffic · Magento vs Shopify vs BigCommerce · Earning Visibility in AI Search · Why AI Cites Third-Party Sources

Explore tools for this topic

Apply this strategy with our tools

  • Turn this topic into a structured draft with intent-aligned sections.
  • Generate publish-ready content blocks with SEO-safe formatting.