Friday, March 13, 2026

Retargeting 101: A Beginner’s...

Retargeting is a form of online advertising that allows you to show ads...

From Blog to Business:...

Creating a strong content strategy is crucial for any business, especially those that...

Google Updates Robots Meta...

Google Updates Search Central Documentation for AI Mode Document Updates for AI Mode Google has...

Starting a Blog? Here...

Starting a blog can be an exciting venture, but it can also be...
HomeSEOUser Data Is...

User Data Is Important In Google’s Ranking Systems. What We Learned From Liz Reid’s Appeal Declaration

Introduction to Google’s Trial Documents

Google has appealed the ruling that says they need to give proprietary information to competitors. The latest document in the DOJ vs. Google trial reveals some interesting things about how Google’s search engine works.

Key Takeaways

  • Google has been ordered to give information to competitors so as not to be an illegal monopoly. Google does not want to give its extensive user-side data away.
  • Google’s data on page quality and freshness is proprietary. They don’t want to give it away.
  • Pages that are indexed are marked up with annotations, including signals that identify spam pages.
  • If spammers got hold of those spam signals, it would make stopping spam difficult.
  • User data is important to Google’s Glue system that stores info on every query searched, what the user saw, and how they interacted with the search results.
  • User data is important for training RankEmbed BERT – one of the deep learning systems behind Search.

Google’s Proprietary Page Quality and Freshness Signals

This really isn’t a surprise. Freshness signals are at the heart of Google’s proprietary secrets. Every page in Google’s index is marked up with annotations to help it understand the page. These include signals to identify spam and duplicate pages.

Pages Marked Up with Proprietary Page Understanding Annotations

Every page in Google’s index is marked up with annotations to help it understand the page. These include signals to identify spam and duplicate pages. Google argues that giving competitors a list of indexed URLs will enable them to “forgo crawling and analyzing the larger web, and to instead focus their efforts on crawling only the fraction of pages Google has included in its index.”

- Advertisement -

The Role of User Data in Google’s Ranking Systems

User data is used to train, build, and operate RankEmbed models. Google Glue is a huge table of user activity. It collects the text of the queries searched, the user’s language, location and device type, and information on what appeared on the SERP, what the user clicked on or hovered over, how long they stayed on a SERP, and more. RankEmbed BERT is one of the deep learning systems that underpins Search. RankEmbed BERT is used in reranking the results returned by traditional ranking systems. RankEmbed BERT is trained on click and query data from actual users.

User Interactions and Google’s Success

The AI systems behind search are continually learning to improve upon presenting searchers with satisfying results. Google looks at what they are clicking on and whether they return to the SERPs or not. Google also runs live experiments that look at what searchers choose to click on and stay on. Those actions help train RankEmbed BERT. The take-home point is that user satisfaction is by far the most important thing we should be optimizing for.

Conclusion

In conclusion, Google’s trial documents reveal a lot about how their search engine works, including the importance of user data and proprietary page quality and freshness signals. Google’s use of user data is crucial to their success, and they are not willing to share this information with their competitors. As Google continues to evolve and improve their search engine, it’s likely that user data will play an even bigger role in the future.

- Advertisement -

Latest Articles

- Advertisement -

Continue reading

Google Answers Questions About Search Console’s Branded Queries Filter

Introduction to Google Search Console's Branded Queries Filter Google Search Central recently announced that the branded queries filter in Search Console is now available to all eligible sites. This update has led to many questions from SEOs, which Google's John...

ChatGPT’s Default & Premium Models Search The Web Differently

Introduction to ChatGPT Models Ask ChatGPT's default and premium models the same question, and they'll cite almost entirely different sources. A Writesonic analysis found that GPT-5.4 Thinking, ChatGPT's premium model, sent 56% of its citations to brand websites, while GPT-5.3...

WordPress Gutenberg 22.7 Lays Groundwork For AI Publishing

New Updates in Gutenberg 22.7 Introduction to New Features Gutenberg 22.7 has introduced several exciting new features that make it easier for users to work with the platform. One of the key updates is the live preview for style variation transforms,...

WordPress Releases AI Plugins For Anthropic Claude, Google Gemini, And OpenAI

Introduction to WordPress AI Plugins WordPress has created three new plugins that make it easy to add OpenAI, Google Gemini, or Anthropic Claude integration for the PHP AI Client SDK. These plugins enable text, image, function calling, and web search...