Friday, May 8, 2026

From Free to Premium:...

WordPress is a fantastic platform for bloggers, offering a wide range of customizable...

Content Invisible Without JavaScript

Introduction to JavaScript and SEO This week, a question was raised about the impact...

From Short-Form Videos to...

As a creator, you're likely no stranger to the world of short-form videos,...

WP Engine Vs Automattic...

Introduction to the Case WP Engine has filed a Second Amended Complaint against Automattic...
HomeSEOGooglebot Tops AI...

Googlebot Tops AI Crawler Traffic

Introduction to Cloudflare’s Year in Review

Cloudflare has released its sixth annual Year in Review, providing a comprehensive look at Internet traffic, security, and AI crawler activity across 2025. The report draws on data from Cloudflare’s network, which spans more than 330 cities across 125 countries and handles over 81 million HTTP requests per second on average.

AI Crawler Findings

The AI crawler findings stand out in the report. Googlebot crawled far more web pages than any other AI bot, reflecting Google’s dual-purpose approach to crawling for both search indexing and AI training. Cloudflare analyzed successful requests for HTML content from leading AI crawlers during October and November 2025. The results showed Googlebot reached 11.6% of unique web pages in the sample, more than 3 times the pages seen by OpenAI’s GPTBot at 3.6%. It’s nearly 200 times more than PerplexityBot, which crawled just 0.06% of pages.

Googlebot’s Dominance

Bingbot came in third at 2.6%, followed by Meta-ExternalAgent and ClaudeBot at 2.4% each. The report noted that because Googlebot crawls for both search indexing and AI model training, web publishers face a difficult choice. Blocking Googlebot’s AI training means risking search discoverability. Cloudflare wrote: “Because Googlebot is used to crawl content for both search indexing and AI model training, and because of Google’s long-established dominance in search, Web site operators are essentially unable to block Googlebot’s AI training without risking search discoverability.”

- Advertisement -

AI Bots and HTML Requests

Throughout 2025, AI bots (excluding Googlebot) averaged 4.2% of HTML requests across Cloudflare’s customer base. The share fluctuated between 2.4% in early April and 6.4% in late June. Googlebot alone accounted for 4.5% of HTML requests, slightly more than all other AI bots combined. The share of human-generated HTML traffic started 2025 at seven percentage points below non-AI bot traffic. By September, human traffic began exceeding non-AI bot traffic on some days. As of December 2, humans generated 47% of HTML requests while non-AI bots generated 44%.

Crawl-to-Refer Ratios

Cloudflare tracks how often AI and search platforms send traffic to sites relative to how often they crawl. A high ratio means heavy crawling without sending users back to source sites. Anthropic had the highest ratios among AI platforms, ranging from approximately 25,000:1 to 100,000:1 during the second half of the year after stabilizing from earlier volatility. OpenAI’s ratios reached as high as 3,700:1 in March. Perplexity maintained the lowest ratios among leading AI platforms, generally below 400:1 and under 200:1 from September onward.

User-Action Crawling

Not all AI crawling is for model training. “User action” crawling occurs when bots visit sites in response to user questions posed to chatbots. This category saw the fastest growth in 2025. User-action crawling volume increased more than 15 times from January through early December. The trend closely matched the traffic pattern for OpenAI’s ChatGPT-User bot, which visits pages when users ask ChatGPT questions.

AI Crawlers and Robots.txt

Cloudflare analyzed robots.txt files across nearly 3,900 of the top 10,000 domains. AI crawlers were the most frequently blocked user agents. GPTBot, ClaudeBot, and CCBot had the highest number of full disallow directives. These directives tell crawlers to stay away from entire sites. Googlebot and Bingbot showed a different pattern. Their disallow directives leaned heavily toward partial blocks, likely focused on login endpoints and non-content areas rather than full site blocking.

Civil Society Under Attack

For the first time, organizations in the “People and Society” vertical were the most targeted by attacks. This category includes religious institutions, nonprofits, civic organizations, and libraries. The sector received 4.4% of global mitigated traffic, up from under 2% at the start of the year. Attack share jumped to over 17% in late March and peaked at 23.2% in early July.

Other Key Findings

Cloudflare’s report included several additional findings across traffic, security, and connectivity. Global Internet traffic grew 19% year-over-year. Growth stayed relatively flat through mid-April, then accelerated after mid-August. Post-quantum encryption now secures 52% of human traffic to Cloudflare, nearly double the 29% share at the start of the year. ChatGPT remained the top generative AI service globally. Google Gemini, Windsurf AI, Grok/xAI, and DeepSeek were new entrants to the top 10.

Why This Matters

The AI crawler data should affect how you think about bot access and traffic. Google’s dual-purpose crawler creates a competitive advantage. You can block other AI crawlers while keeping Googlebot access for search visibility, but you can’t separate Google’s search crawling from its AI training crawling. The crawl-to-refer ratios help quantify what publishers already suspected. AI platforms crawl heavily but send little traffic back. The gap between crawling and referring varies widely by platform.

Looking Ahead

Cloudflare expects AI metrics to change as the space continues to evolve. The company added several new AI-related datasets to this year’s report that weren’t available in previous editions. The crawl-to-refer ratios may change as AI platforms adjust their search features and referral behavior. OpenAI’s ratios already showed some decline through the year as ChatGPT search usage grew.

Conclusion

In conclusion, Cloudflare’s Year in Review provides valuable insights into the state of the Internet, AI crawlers, and security. The report highlights the dominance of Googlebot, the growth of user-action crawling, and the increasing attacks on civil society organizations. As the AI landscape continues to evolve, it’s essential to stay informed about the latest trends and developments. By understanding the implications of AI crawlers and their impact on traffic and security, individuals and organizations can make informed decisions about their online presence and protection.

- Advertisement -

Latest Articles

- Advertisement -

Continue reading

Bing Team Describes How Grounding Differs From Search Indexing

Introduction to Microsoft's New Framework Microsoft's Bing team has published a framework that describes how indexing requirements change when the goal is to support AI answers rather than to rank search results. This framework identifies five measurement areas where the...

GoDaddy Transferred A Domain By Mistake And Refused To Fix It

Introduction to the Problem GoDaddy, a well-known domain registrar, allegedly transferred a domain name without the authorization of its longtime registrant. This unauthorized transfer occurred without the necessary documentation, leaving the victim in a difficult situation. After spending nearly ten...

Google Tests AI Headlines, Rolls Out Spam Update – SEO Pulse

Introduction to Google's Latest Updates Google has been making significant changes to how content appears in its search results. This week's updates affect how headlines appear in search, how spam enforcement is handled, and how AI-generated content is labeled. These...

Google Answers Questions About Search Console’s Branded Queries Filter

Introduction to Google Search Console's Branded Queries Filter Google Search Central recently announced that the branded queries filter in Search Console is now available to all eligible sites. This update has led to many questions from SEOs, which Google's John...