SEO

Ahrefs' AI Search Benchmark Report: SEO Splits Into Three Jobs

Ahrefs studied AI Overviews, AI Mode, ChatGPT citations, brand mentions, YouTube signals, click loss, and source churn. The practical read: SEO now splits into three jobs: be retrievable, be citeable, and be the brand a model will name or link.

Francisco Leon de Vivero
Ahrefs' AI Search Benchmark Report: SEO Splits Into Three Jobs

Ahrefs' AI Search Benchmark Report: SEO Splits Into Three Jobs

TL;DR: Ahrefs analyzed 146 million SERPs, 730,000 AI responses, and roughly a million citations for its Q4 2025 to Q1 2026 AI Search Benchmark Report. The headline is not that search died. It is that AI search pulls the old SEO job apart into three separate ones: be retrievable, be citeable, and be the brand a model trusts enough to name or link. Google still sent 190 times more visitors than ChatGPT across 76,000 sites, so the blue-link economy is not gone. But AI Overviews already cut clicks to top content by 58%, and 28.3% of ChatGPT's most-cited pages had zero organic keyword visibility. The skills that win answers now overlap with classic SEO without being identical to it.

Read this as a benchmark, not a verdict. The sample is large and the work is careful, but it is one vendor's slice of a moving target. Use the numbers to set direction, then check the same patterns against your own rankings, logs, prompts, analytics, and brand mentions before you reorganize a roadmap around them.

21% Share of keywords showing an AI Overview across Ahrefs' 146 million SERP dataset, concentrated on informational intent.
-58% Drop in clicks to top-ranking content when an AI Overview appears. Position 10 still lost 19.4%.
28.3% Of ChatGPT's top 1,000 cited pages had zero organic keyword visibility in Ahrefs' data.
0.737 Correlation between YouTube mentions and AI brand visibility, the strongest link Ahrefs measured.
190x More visitors Google sent than ChatGPT across 76,000 measured sites, even with AI search rising.
28% Of brand mentions across six AI indexes actually included a link. Perplexity linked far more than AI Overviews.
13.7% URL overlap between AI Mode and AI Overviews across matched response pairs, even though their answers usually agreed.

Every quarter brings another study that claims to settle AI search. Most of them measure one tool, one prompt set, or one industry, then stretch the result into a law. Ahrefs' benchmark is more useful than that because the sample is big enough to show structure instead of random scatter, and because it reports where its own numbers disagree. I am going to walk through what the data supports, where it should make you change something, and where it does not prove what people will say it proves.

The short version is the angle I keep coming back to with clients. The Ahrefs benchmark does not say SEO is over. It says one job became three. You now have to be retrievable so a model can find your page, citeable so the model will quote it, and trusted enough as a brand that the model names you or links you by choice. Classic SEO already covers part of all three. It does not cover all of any of them.

What Ahrefs Actually Measured

Scale is the reason to take this report seriously. Ahrefs looked at 146 million SERPs to study AI Overview behavior, 730,000 paired responses to compare AI Mode against AI Overviews, 863,000 search result pages and 4 million AI Overview URLs to study source selection, around a thousand top ChatGPT citations, and 31,000 brand mentions to study linking. Its brand-signal work also used 75,000 brands, 174,000 cited pages, 26,000 source URLs, and 108 million queries. Numbers that size smooth out the randomness that wrecks smaller studies.

Scale is also the reason to stay careful. Coverage skews by geography and language. Indonesia led AI Overview coverage at 37.2% while the United States sat at 20.5%, and Spanish made up 11.48% of AI Overviews by language. If your market is North American and English-first, the global averages already describe a different population than yours. Treat each figure as a baseline to test, not a result to copy. That posture matters more here than in classic rank studies, because AI answers change between observations and vary by account, location, and session.

How to read every number below: Ahrefs reports correlations and population shares, not guarantees for your site. A correlation tells you where to look. It does not tell you that copying the pattern will move your own visibility. Validate against your analytics and your prompts before you commit budget.

AI Overviews Reward Long, Informational Queries

AI Overviews appeared for 21% of keywords in the dataset, and they cluster on informational intent. Query length is the cleaner predictor. One-word queries triggered an AI Overview 9.5% of the time. Queries of seven words or more triggered one 46.4% of the time, almost five times as often.

That maps to how people use assistants. Short head terms usually carry navigational or commercial intent, and Google still answers many of those with links, products, and local packs. Long descriptive questions are where a generated answer earns its place, and where your informational pages are most exposed to being summarized instead of clicked.

The practical read: pull your long, question-shaped informational queries and check which ones already return an AI Overview. Those pages need answer-ready structure, not more padding. Your short commercial and branded terms are safer for now, which is one more reason to keep service and product pages strong while you adapt the blog. Google's own guidance points the same direction, as I covered in Google's AI search documentation.

SEO read: Do not audit AI Overview risk only on head terms. Start with high-intent informational long-tail queries. That is where Google has the most room to answer before the click.

AI Mode and AI Overviews Agree on the Answer, Not the URLs

Here is the finding that should change how you set targets. Across 730,000 response pairs, AI Mode and AI Overviews shared only 13.7% of their cited URLs, yet they reached the same conclusion 86% of the time. Two systems read the web, picked mostly different sources, and still told the user the same thing.

Source stability looks similar inside a single surface. Ahrefs found AI Overview content had about a 70% chance of changing between observations, while the meaning of the answer stayed close to identical, with a 0.95 cosine similarity score. The wording and the cited pages move. The conclusion holds.

So chasing one exact citation slot is the wrong goal. The durable goal is to shape the consensus answer in your category. If the model's conclusion is stable while its sources rotate, you want your facts, framing, and proof to match what the consensus already says, so any source the model grabs still carries your position. I unpacked this volatility in more detail in AI citation drift and source stability, and the benchmark backs it up at far larger scale.

This is why platform-specific tracking matters. You need separate monitoring for Google AI Overviews, AI Mode, ChatGPT, Perplexity, Gemini, Copilot, and Claude. A single "AI visibility" score can hide the surface that is actually broken.

Citation Does Not Require Ranking

This is the most uncomfortable number for traditional reporting. In ChatGPT's top 1,000 cited pages, 28.3% had zero organic keyword visibility. More than a quarter of the pages ChatGPT leaned on would not show up in a normal rank report at all.

AI Overviews behave less radically but still break the old link. Of AI Overview citations, 37.9% came from pages ranking in the top 10, 31.2% from pages ranking 11 to 100, and 31% from pages that did not rank in the top 100. Ranking helps. It is far from the whole story.

Two jobs are coming apart here. Being retrievable, meaning a model can crawl, parse, and ground itself in your page, is no longer the same as ranking for the query. A page can be invisible in Google and still feed an answer, and a page can rank well and still get skipped. If your only scoreboard is position tracking, you will miss both halves of this. I dug into the source-selection mechanics in how ChatGPT picks citations and the crawl-versus-citation gap in the ghost citation problem.

The fix is a measurement split. Keep rank tracking for the queries that still drive clicks and conversions. Add citation tracking as its own report: which prompts mention you, which cite a page, and which page they cite. The two reports will not agree, and that disagreement is the point.

AI search measurement map separating rank, retrieval, citation, mention, and referral visits
AI search reporting needs separate scoreboards for rank, retrieval, citation, mention, and click behavior.

Key takeaway

Classic ranking is now a retrieval input, not the whole game. Build pages that answer related sub-questions clearly, use descriptive headings, and make each section useful when separated from the rest of the article.

If retrievable and citeable are jobs one and two, being a named, trusted brand is job three, and it does not run on backlinks the way SEO teams expect. Across six AI indexes, Ahrefs found a brand mention included a link only 28% of the time. Perplexity was far more willing to attach a link than AI Overviews. Most of the time, the model talks about a brand without sending a clickable path to it.

That changes what you count. An unlinked mention still shapes whether a model associates your brand with a category and a prompt, even though it never shows in a backlink tool or a referral report. The teams winning AI visibility are tracking linked and unlinked mentions as two different signals, which is the same attribution gap I described in the GEO attribution crisis.

Signal What it proves What it does not prove
Brand mention The model associates the brand with the answer. That users can click through.
Linked mention The answer gives users a route to a site. That users will take the click.
Referral visit The AI surface produced measurable traffic. That the AI answer was the original decision driver.

The strongest correlation in the report is the one most SEO teams under-invest in. YouTube mentions correlated with AI brand visibility at about 0.737 across ChatGPT, AI Mode, and AI Overviews, higher than the other signals Ahrefs tested. Correlation is not proof that posting videos lifts citations, and I want to be honest about that. But the signal is consistent across three surfaces, and it lines up with a second Ahrefs study I covered earlier in why YouTube mentions track AI visibility. At minimum, video and editorial mentions belong in the distribution plan next to links, not after them.

Practical test: Search your top category terms on YouTube. Are you named in titles, transcripts, descriptions, and third-party videos? If the answer is no, you probably have an AI visibility gap that a link report will not show.

Length Does Not Buy Citations

The word-count myth takes a clean hit here. Ahrefs found near-zero correlation between word count and the odds of an AI Overview citation. More than half of cited pages, 53.4%, were under 1,000 words. Long does not mean quotable.

What does show up is format. For top-of-funnel queries, "best X" style blog lists made up 43.83% of ChatGPT's source URLs. Models reach for pages that already organize options into a clean, comparable answer, because that shape is easy to extract and summarize.

So write for extraction, not length. Put the answer near the heading that asks the question. Use comparison tables when the query implies a choice. Keep passages self-contained so a model can lift one without losing meaning. The "best X" pattern is a real citation supply chain you can supply on purpose, with honest, current, well-sourced lists. It is also the exact pattern that low-quality operators flood with spam, so the bar is editorial quality, not volume. My piece on query augmentation and agentic search explains why extractable sub-answers get pulled into these responses, and what actually gets you cited ranks the levers that matter.

The Click Economy Is Smaller, Not Gone

It is easy to read the citation data and panic. The traffic data argues for calm. AI Overviews cut clicks to top-ranking content by 58% in Ahrefs' measurement, and even position 10 lost 19.4% of its clicks. That is a real tax on informational rankings, and it is already here.

Editorial chart showing AI Overview click loss and Google traffic still exceeding ChatGPT traffic
AI Overviews shrink informational clicks, but Google remains the larger traffic source by a wide margin.

At the same time, the scale of AI referral is still small next to Google. Under Ahrefs' classification, ChatGPT may represent about 12% of Google search volume, which is large enough to plan around. But across 76,000 measured sites, Google still sent 190 times more visitors than ChatGPT. Both numbers are true. AI answers are eating informational clicks, and Google search is still the larger source of human visits by a wide margin.

The sane plan treats this as a reallocation, not a collapse. Defend the commercial and branded queries that still click. Expect informational clicks to shrink and convert that content into citation and brand-association value instead. I laid out which businesses handle that change well in zero-click survival, and the recent referral movement in ChatGPT referral traffic and the clicky no-click future.

Trust and the Misinformation Gap

Job three has a defensive side. Ahrefs ran a misinformation test by seeding false claims and watching which models repeated them. Gemini and Perplexity echoed the seeded false claims in 37% to 39% of answers. ChatGPT 4 and 5 stayed below 7%. Claude avoided the false claims, but it also did not surface the official site, which is its own kind of failure if you are the brand that should have been named.

Read that carefully before you act on it. This is one controlled test on one set of seeded claims. It does not rank these models for general accuracy, and it does not predict how they will treat your specific brand. What it does show is that models vary a lot in how easily a wrong claim sticks, and that being correct is not the same as being present.

The defensive move is to own the canonical story about your brand on your own site. Maintain clear pages that state what you do, who you serve, and the facts a model should repeat. Keep FAQs current so the right answer is easy to find and quote. When the web disagrees about you, the model needs a clean, well-structured source to prefer, and that source should be yours. Google retiring FAQ rich results did not retire that need, as I argued in the FAQ schema AI pivot.

The Three Jobs, On One Page

Pull the report together and it resolves into three jobs that used to be one. Each has a different goal and a different scoreboard.

Three-part AI search SEO framework showing retrievable, citeable, and named signals
AI search visibility separates retrieval, citation, and brand mention measurement into different jobs.
The job What it means What to measure
Be retrievable A model can crawl, parse, and ground itself in your page, whether or not it ranks. 28.3% of ChatGPT's cited pages had no organic visibility. Crawler access in logs, render and parse health, which pages get pulled into answers.
Be citeable The page answers in an extractable shape the model will quote. Format beat length, and "best X" lists supplied 43.83% of ChatGPT top-of-funnel sources. Prompt-level citation tracking, which page gets cited, presence in the consensus answer.
Be named or linked The model trusts the brand enough to name it, and sometimes link it. Only 28% of mentions carried a link, and YouTube mentions led the visibility signals. Linked and unlinked mentions tracked separately, share of voice by prompt, mention sources.

Classic SEO already touches all three. Crawlability and content quality help you get retrieved. Clear structure and authority help you get cited. Digital PR and brand search help you get named. The gap is that none of those classic tactics, on their own, finish any of the three jobs in an AI answer. You have to manage them as separate outcomes now.

Free AI Search Toolkit: I turned this framework into seven browser-only tools you can use without an API key or signup. Start with the AI Search Split Scorecard, then use the Citation-Ready Content Grader, Rank vs Citation Gap Mapper, Best-X Citation Opportunity Builder, Brand Story Risk Tester, Consensus Answer Tracker, and AI Crawler Access Audit to split rank, retrieval, citation, mention, and click risk into separate workflows.

A Practitioner Action Plan

Here is the order I would run this in for a real site, not a slide.

AI search SEO action plan organized by weekly, monthly, and quarterly priorities
The practical roadmap starts with exposure, citation, and crawler checks before expanding distribution.

Critical this week

  • Audit AI Overview exposure. Pull priority informational queries and check which ones trigger AI Overviews. Do this for long-tail queries, not only head terms.
  • Separate rank from citation. Track organic rank, AI mention, linked AI mention, cited URL, and referral visit as different metrics.
  • Check AI crawler access. Review robots.txt, important templates, blocked resources, and server logs. A page that cannot be fetched cannot be cited.

Important this month

  • Build YouTube and editorial mention distribution. Given the 0.737 correlation, treat video and earned mentions as a first-class channel, not a post-publication extra.
  • Map cited third-party lists. Identify which "best X" pages AI systems mention in your category, fix inaccurate data, and pursue legitimate inclusion where the list is editorially real.
  • Rewrite top pages for extraction. Use direct headings, answer-first paragraphs, comparison tables, specific entity names, and short sections that still make sense on their own.
  • Publish canonical brand answers. Create official pages for pricing, policies, product limits, support, comparisons, and "how it works" questions.

Strategic this quarter

  • Create a multi-surface visibility report. Combine Google rankings, AI citations, YouTube mentions, branded search, referral traffic, and third-party coverage.
  • Refresh pages with citation value. Ahrefs found many top ChatGPT-cited pages were recently updated. Add visible update dates and keep statistics current.
  • Measure brand story risk. Run prompts around your brand, products, alternatives, pricing, complaints, and category. Track false, outdated, and missing answers.

How to Read This Benchmark Without Overreacting

One vendor produced this, with its own crawl, its own classification of what counts as an AI Overview or a citation, and a sample that skews toward certain countries and languages. Those choices are reasonable, and the scale is real. They still shape the output. Your market may behave differently, and AI surfaces change month to month, so a figure from this quarter may drift by the next one.

That is not a reason to dismiss it. It is a reason to use it the way you would use a competitor's market study: as a strong prior that tells you where to point your own measurement. The teams that win the next year of AI search will be the ones who treat reports like this as a starting hypothesis and then prove or disprove it with their own data, prompts, and logs.

The SEOFrancisco Takeaway

The Ahrefs benchmark is the clearest large-sample case yet that AI search did not kill SEO. It pulled the job apart. Retrievable, citeable, named. Three outcomes, three scoreboards, one brand trying to win all three at once.

If you only act on one line from the report, make it this. Stop reporting AI search as a single ranking number. Track whether a model can reach your page, whether it will quote your page, and whether it will say your name. Those are the three questions the data keeps circling, and they are the three questions your next reporting cycle should answer.

Rankings get you into the retrieval pool. Clear content gets you extracted. Brand distribution gets you trusted. Official answers protect you when the model is wrong. That is the 2026 search job.

Need an AI search visibility audit? Francisco Leon helps SEO and content teams measure AI citations, crawler access, brand mentions, and query-level AI Overview risk. Book a consultation.

FAQ

How often do AI Overviews appear, according to Ahrefs?

For 21% of keywords across a 146 million SERP dataset, concentrated on informational intent. Long queries triggered them far more often, 46.4% for seven-plus-word queries versus 9.5% for one-word queries.

Do you have to rank in Google to get cited by AI?

No. In ChatGPT's top 1,000 cited pages, 28.3% had zero organic keyword visibility. For AI Overviews, 31% of citations came from pages outside the top 100. Ranking helps but does not gate citation.

Does longer content get cited more in AI search?

The data says no. Ahrefs found near-zero correlation between word count and AI Overview citation, and 53.4% of cited pages were under 1,000 words. Extractable format matters more than length.

Why does the report emphasize YouTube?

YouTube mentions had the strongest correlation with AI brand visibility that Ahrefs measured, about 0.737 across ChatGPT, AI Mode, and AI Overviews. It is a correlation, not proof of cause, but it is consistent enough to act on.

Is AI search replacing Google traffic yet?

Not in raw volume. AI Overviews cut clicks to top content by 58%, but across 76,000 sites Google still sent 190 times more visitors than ChatGPT. The change is real and uneven, not a clean replacement.

Related Articles

About the Author

Francisco Leon de Vivero at an industry conference

About the author

Francisco Leon de Vivero

Francisco Leon de Vivero is VP of Growth at Growing Search and a senior SEO strategist with 15+ years of experience across enterprise, ecommerce, and international search. He focuses on technical SEO, AI search visibility, content systems, and the operating layer that turns search data into action.

Next step

Turn this background reading into a more current SEO plan.

Use the most relevant current page below if this topic is still on your roadmap, then review the proof and contact paths if you want direct support.

Current service page

AI SEO

The service combines brand visibility audits, content adjustments, authority signals, competitive comparisons, and tracking through tools like StakeView and BrandLens so teams can see how AI search is reshaping discovery.

Explore this service