How to Optimize for AI Searches - Schema Markup Is Not Optional Anymore (Part 2 of 7)
Strategy GEO
Stop guessing whether AI engines understand your pages. Zeover analyzes your site against 100+ GEO metrics including structured data coverage, benchmarks your visibility across ChatGPT, Claude, Gemini, and Grok, and generates optimized content. See how AI sees your brand.
This is part two of our series on how to optimize for AI searches. Part one covered starting with your llms.txt - the map AI engines use to navigate your site. This part covers what happens when they arrive at individual pages: do they understand what they’re reading?
AI engines don’t look at your pages the way humans do. They don’t read your hero image, appreciate your layout, or respond to visual hierarchy. They parse HTML, metadata, and structured data. If your structured data is missing, incomplete, or outdated, they skip your content in favor of a competitor who took the time to make their pages machine-readable.
TL;DR
- Content with proper schema markup has a 2.5x higher chance of appearing in AI-generated answers.
- Pages with FAQPage schema are 3.2x more likely to appear in Google AI Overviews.
- Microsoft confirmed in March 2025 that schema markup helps LLMs understand content for Copilot answers.
- JSON-LD is the preferred format. Google’s May 2025 guidance recommends it specifically.
- Five schema types matter most: Organization, FAQPage, HowTo, Product, and Author.
Schema Markup, Explained in One Paragraph
Schema markup is structured data embedded in your HTML that describes what a page is about. Instead of leaving AI engines to infer from your text that a page describes a product priced at $49/month with a 4.7-star rating, schema says so explicitly in a format built for machines. The standard is Schema.org, a shared vocabulary maintained by Google, Microsoft, Yahoo, and Yandex. The preferred syntax is JSON-LD, a small block of JavaScript-flavored data placed in your page’s <head> or before the closing </body> tag.
Schema has been around for a decade, but AI engines made it non-negotiable. Traditional search engines would rank your page based on content, links, and authority even without schema. AI engines evaluate structured data as a primary input when deciding what to cite.
The Impact on AI Visibility
Content with proper schema markup has a 2.5x higher chance of appearing in AI-generated answers, according to a 2025 analysis. Microsoft’s principal product manager confirmed in March 2025 that schema markup helps LLMs understand content for Copilot answers. Google’s official May 2025 guidance recommends JSON-LD specifically for AI-optimized content.
For specific schema types, the effect is sharper. Pages with FAQPage schema markup are 3.2x more likely to appear in Google AI Overviews and show 28% higher citation rates across major AI platforms. That isn’t a marginal improvement - it’s a category change in how AI engines evaluate the page.
The mechanism is straightforward. When a user asks ChatGPT “what does Acme’s pricing look like,” the AI needs to find a reliable source that clearly states Acme’s pricing. A page with Product schema declaring the price, currency, and availability gives it an unambiguous answer to cite. A page with the same information buried in prose forces the AI to parse natural language and risk getting it wrong. Given the choice, AI engines pick the unambiguous source.
The Five Schema Types That Matter
Not all schema is equally useful for AI visibility. Focus on these five.
1. Organization
Every site should have Organization schema on its homepage (or an @id reference from other schema). It tells AI engines your legal name, logo, URL, social profiles, contact information, and company details. This is the anchor that AI engines use to connect content across your domain to a single entity.
Key properties: name, url, logo, sameAs (social profile URLs), description, foundingDate, address.
2. FAQPage
The highest-impact schema type for AI visibility. Pages with FAQPage schema are 3.2x more likely to appear in AI Overviews because the format mirrors how users ask questions and how AI engines construct answers.
Add FAQ sections to your homepage, key service pages, and long-form blog posts. Structure each entry with a Question and an acceptedAnswer of type Answer. Write the questions as your customers actually ask them. “How much does a kitchen remodel cost in Denver?” works better than “Pricing information.”
3. HowTo
For tutorial content, how-to guides, and step-by-step explanations. HowTo schema declares each step explicitly, which helps AI engines understand the sequence and extract it in response to procedural questions.
If you publish content that teaches users how to accomplish something, HowTo schema is the highest-leverage addition you can make to those pages.
4. Product
If you sell anything on your site, Product schema is essential. It declares price, availability, ratings, reviews, SKU, brand, and more. AI engines cite products confidently when structured data makes the facts explicit.
Include aggregateRating if you have customer reviews. Pages with visible ratings get cited at higher rates than pages with similar content but no rating data.
5. Author
Author schema signals to AI engines who wrote a piece of content. This matters because AI engines increasingly evaluate E-E-A-T (Experience, Expertise, Authoritativeness, Trust) when choosing what to cite. A page written by a named expert with a linked profile, credentials, and jobTitle is more citable than an anonymous post.
Add Person schema for each author, link it from articles via the author property, and include image, jobTitle, worksFor, and sameAs for their public profiles.
Common Mistakes
Using Microdata or RDFa instead of JSON-LD. All three are valid, but JSON-LD is preferred by Google and easier to maintain because it lives separately from your HTML. Pick JSON-LD and stick with it.
Schema that contradicts page content. If your schema says price: "$49" but your page displays $59, you’ve created an inconsistency that makes AI engines discount the entire page. Audit regularly to keep schema and visible content in sync.
Missing required properties. Each schema type has required properties. FAQPage needs at least one Question/Answer pair. Product needs at least a name and offer. Missing required properties can invalidate the schema entirely.
Over-schemaing irrelevant content. Don’t add Product schema to a blog post. Don’t add HowTo schema to content that isn’t a how-to. Schema that misrepresents content is worse than no schema.
Not testing. Google’s Rich Results Test and the Schema Markup Validator both detect errors in seconds. Use them before deploying schema changes.
How to Roll This Out
Start with Organization schema on your homepage. That’s the one piece of structured data every site needs, and it’s the easiest to implement. From there, move to FAQPage on your top three to five pages - the ones that already handle the most traffic or answer the most customer questions.
Next, add Product schema to everything you sell. Then work through your blog and resource content, adding Author schema to every article and HowTo where it fits.
The process takes a week or two if you’re doing it manually. Zeover automates it by crawling your site, identifying pages missing schema or using it incorrectly, and scoring each issue with clear remediation steps. You don’t need to know the difference between offers and aggregateOffer - the platform tells you what’s broken and how to fix it.
Where This Fits in the Series
Schema markup makes individual pages machine-readable. But structured data alone isn’t enough if the underlying content isn’t written for AI retrieval, so the series continues with that next. Each part builds on the last.
Previously in this series: Part 1 - Start with your llms.txt.


