Structured data impact on visibility in AI search engines

18/09/2025

As artificial intelligence continues to evolve, the conversation around its effective training methods becomes increasingly critical. One area of debate is the role of structured data in enhancing AI search visibility. While many enthusiasts advocate for its use, recent experiments suggest that the reality might be quite different. This article delves into the findings of these tests and explores the implications for AI developers and content creators alike.

Content Index

Understanding structured data and its purpose

Structured data, often represented through schemas, is a standardized format used to organize and label information in a way that machines can easily interpret. This data is crucial for search engines as it helps them understand the context of the content, thus improving the visibility of web pages in search results.

Common types of structured data include:

  • Schema.org Markup: A collection of schemas that webmasters can use to structure their data.
  • JSON-LD: A lightweight data interchange format that is easy for humans to read and write.
  • Microdata: An HTML specification used to nest metadata within existing content on web pages.

By implementing structured data, website owners aim to enhance how their content appears in search engines, potentially improving click-through rates and visibility.

Does AI require structured data?

There is a common belief that structured data is essential for AI systems, particularly in terms of enhancing search capabilities. However, the effectiveness of structured data in this context is currently under scrutiny.

Recent tests have shown that AI models, particularly those based on large language models (LLMs), do not necessarily benefit from structured data as expected. This raises questions about how AI processes data and what it means for future developments in AI training.

Advantages and limitations of structured data in AI

While structured data can provide certain advantages, such as improved clarity and organization of information, its limitations are becoming more apparent in AI applications. Here are some potential advantages and limitations:

  • Advantage: Facilitates better understanding of context for traditional search engines.
  • Limitation: LLMs may not utilize structured data effectively during training.
  • Advantage: Can enhance rich snippets and enhance visibility.
  • Limitation: The breakdown of schema during tokenization may render it ineffective.

Why unstructured data is challenging for AI training

Unstructured data, which includes text, images, and videos, poses a significant challenge for AI training. Unlike structured data, unstructured data lacks a predefined format, making it difficult for AI models to extract meaningful insights. This can lead to inefficiencies in training and may hinder the overall performance of the model.

The primary issues with unstructured data include:

  • Lack of organization: Unstructured data is often chaotic and difficult to analyze.
  • Inconsistent formats: Variability in how information is presented complicates data processing.
  • Noise and ambiguity: Unstructured data can contain irrelevant or misleading information, leading to inaccuracies.

How AI interacts with unstructured data

Despite the challenges, AI has developed methods to process unstructured data effectively. Techniques such as natural language processing (NLP) and computer vision have made it possible for AI models to interpret and classify unstructured information.

Some key methodologies include:

  • Tokenization: The process of breaking down text into smaller units (tokens) for analysis.
  • Sentiment analysis: The use of NLP to determine the emotional tone behind a series of words.
  • Feature extraction: Identifying key characteristics from unstructured data to improve model performance.

Recent experiments challenging the effectiveness of structured data

Two notable experiments have shed light on the limitations of structured data in the context of AI visibility. The first was conducted by Mark Williams-Cook, who highlighted how LLMs process content.

In his findings, Williams-Cook explained that when LLMs tokenize content, they inadvertently "destroy" the schema markup. This means that structured data is broken into discrete tokens, making it indistinguishable from regular words. Consequently, any potential advantage from structured data in AI training is diminished.

In a similar vein, Julio C. Guevara conducted an experiment involving two product pages for an imaginary product. One page featured traditional content, while the other contained only structured data. The outcome revealed that LLMs could extract information from the page with visible text but struggled to interpret the structured data alone.

These experiments collectively suggest that while structured data may enhance traditional search visibility, its current role in AI training is limited.

The future of structured data in AI

While the current evidence points to a lack of effectiveness in using structured data for AI search visibility, this landscape may change as AI technology continues to evolve. Developers are exploring innovative ways to enhance AI's ability to understand and utilize structured data.

Potential future developments could include:

  • Improved tokenization methods: Techniques that preserve structured data during the tokenization process.
  • Advanced AI models: The creation of models specifically designed to utilize structured data more effectively.
  • Integration of multimodal data: Combining structured data with other forms of information (e.g., images, audio) for comprehensive analysis.

In summary, while structured data has its advantages in traditional search engines, its role in AI search visibility remains uncertain. Ongoing research and experimentation will be critical to understanding how best to leverage structured data in the future of AI development.

If you want to explore more stories like Structured data impact on visibility in AI search engines, you can browse the SEO Theory section.

Avatar photo

James Wirral

I am James Wirral, an SEO and SEM specialist for all major search engines, and my story began not in an office but behind the counter of my family's small bookshop. Watching local customers discover the titles they needed made me realise how powerful the right words and the right place could be. I taught myself the mechanics of search — from technical audits and schema to user intent and paid media — often late into the night, turning curiosity into craft. Over the years I have guided independent businesses and growing brands to consistent, measurable success, delivering double-digit organic growth and improving return on ad spend through honest, data-driven strategies. My work is grounded in evidence: careful testing, transparent reporting and a focus on long-term value rather than short-term tricks.What drives me is people. I remember a bakery owner who regained her customer base after a local search optimisation we carried out together, and a charity that reached donors they never knew existed thanks to a refocused content strategy. Those outcomes taught me that technical skills matter, but empathy and integrity make the difference. I publish practical guides, speak at industry events and mentor junior marketers so knowledge spreads beyond one campaign. Above all, I treat SEO and SEM as a promise to users and clients alike: to respect privacy, to prioritise relevance, and to build sustainable visibility that helps real people find what they need.

Related News

Leave a Reply

Your email address will not be published. Required fields are marked *

Your score: Useful

Go up

By clicking “Accept”, you agree Seo Wirral can store cookies on your device and disclose information in accordance with our Cookie Policy