Why “get cited by AI” is too blunt a goal
Almost every GEO article frames the goal as “get cited by AI.” But being cited isn’t the value — who cites you, whether that platform sends you traffic, whether its users are your customers, and whether those people pay, is the value.
Here’s the most counter-intuitive example: Claude barely surfaces clickable outbound links in chat. Claude can use you as the basis of an answer and the user still never “clicks” to your site — nothing shows in GA referral. Looks wasted — except Claude’s users include 70% of the Fortune 100, and its paying rate and revenue-per-user are the highest of the field. It doesn’t give you traffic; it influences high-value decision-makers. For a B2B or developer-tools brand, “being Claude’s basis for an answer” can be worth more than a thousand clicks from Perplexity.
So this piece has two halves: first how you get cited (mechanism), then what a citation converts into (channel economics) — the half most articles skip, and the one that decides which platform you should bet on.
Part 1: The four gates a citation must clear (mechanism)
All five are RAG, but RAG is a four-stage pipeline — clear each in order; fail one and the rest is wasted:
| Stage | What it does | Why you get dropped |
|---|---|---|
| ① Candidate pool | Pulls candidates from the index it can reach | You’re simply not in that index |
| ② Rerank | Orders/filters by relevance / authority / freshness | Indexed, but not in the top band |
| ③ Passage grounding | Cites passages, not whole pages | No self-contained “answerable” passage |
| ④ Citation decision | Picks 3–8 sources to attach when composing | Passage read, but not chosen as a named source |
The biggest difference is at stage ①’s “gate”: ChatGPT → Bing index + own (OAI-SearchBot governs citation eligibility; GPTBot is training-only); Gemini → live Google Search (a classifier, ~0.7 threshold, decides whether to search); Perplexity → own index (re-fetches high-citation pages every 24–72h) + Bing fallback + a 3-layer reranker; Claude → Brave Search (cited URLs overlap heavily with Brave organic); Grok → live web + direct X, extreme freshness weighting.
Mechanism decides whether you can get in. What decides whether it’s worth getting in is Part 2.
Part 2: Each platform is a different channel (conversion)
Figures below synthesize Sensor Tower’s State of AI 2026 and third-party traffic analyses (Similarweb etc.) — reported estimates that shift as platforms tune; read the relative character, not the absolute numbers.
First, the pie: these 5 ≈ the entire AI-assistant market — and it’s still in flux
Market-share source: Sensor Tower State of AI 2026 · True Audience · May 2026 (25 markets); growth figures are approximate.
| Platform | Exact share | User growth (approx.) | MAU |
|---|---|---|---|
| ChatGPT | 46.4% | slowing (share fell below 50%) | ~1.1B |
| Gemini | 27.7% | ~+114% YoY | 662M |
| Claude | 10.3% | +452% YoY (fastest) | 245M |
| Grok | 3.3% | exploding (small base) | — |
| Perplexity | 2.8% | ~+184% YoY | — |
| Subtotal (the 5 analyzed here) | 90.5% |
Others (not deep-dived here): DeepSeek 3.2%, Meta AI 2.5%, Microsoft Copilot 1.6% — the top 8 total ~97.8%, with ~2% long tail.
Always label the metric: the table above is Sensor Tower’s True Audience (unique reach) / May 2026 / 25 markets. Other definitions give very different numbers — e.g. StatCounter’s “AI chatbot” share puts ChatGPT at ~77%, because it measures referral / browser-network traffic, not app reach. Always cite share with source + metric + date, or someone with a different dataset will out-argue you.
The growth column matters more: ChatGPT is slowing and dropped below 50% for the first time while every challenger surges (Claude +452%, Perplexity +184%, Gemini +114%) — the market is multipolar and unsettled, so don’t bet GEO on “whoever’s biggest right now.”
⚠️ And this table basically excludes China
ChatGPT / Gemini / Claude are walled out of mainland China, so the table above is essentially the “ex-China” pie. Inside the wall is a parallel world (third-party estimates, approximate): Doubao (ByteDance) leads at ~200M+ MAU, Ernie (Baidu) ~220M, Quark (Alibaba) ~180M, Yuanbao (Tencent) ~150M, DeepSeek, Kimi (Moonshot) ~90M … 900M+ MAU combined. Note DeepSeek is just 3.2% globally but a top-tier giant inside China (different geography and metric).
Verdict: platform priority depends entirely on where your audience is. Targeting Taiwan / the West → bet on the 5 above; targeting mainland China → scrap this table and play Doubao / Ernie / Quark instead. Don’t treat one “global (actually ex-China)” table as universal.
Claude — no traffic, but it influences the most expensive people
- Traffic behavior: chat barely gives clickable outbound links (browsing + citations are recent and rarely triggered). So being cited ≠ referral; true influence is estimated ~2–3× higher than GA shows (dark funnel).
- Audience: developers, researchers, professionals/enterprise; 70% of the Fortune 100 use it, 1,000+ customers pay >$1M/yr, ~70% of enterprise deals won vs OpenAI.
- Paying power: revenue-per-user ~$2.76, ~1.5× ChatGPT’s ~$1.74; paying rate 13% vs ChatGPT’s 8%. Global audience grew >4× YoY.
- Verdict: if you’re B2B / SaaS / dev-tools / high-ticket, this is the one to fight for first — but never measure it by GA referral. It sends no traffic, only influence, and click-only marketers will write off the single most valuable engine as “no effect.” Measure brand-search volume, customer self-report, proposal win rates instead.
ChatGPT — the actual traffic pool
- Traffic behavior: Search mode gives links, and it accounts for ~87% of all AI referral traffic — for “clicks from AI,” it is the pool. But pure-answer contexts are often no-click.
- Audience/scale: 1B MAU, fastest ever — the broadest, most mainstream.
- Verdict: the non-negotiable base — biggest reach, ~87% of AI traffic. But don’t fantasize it drives traffic like old SEO; it often ends the moment it answers. First get into the Bing index + allow
OAI-SearchBot(where most sites die), then write extractable 40–60-word answer passages.
Perplexity — small, but the best at driving traffic, highest intent
- Traffic behavior: highest citation density (~8 sources per answer, 94% with clickable inline numbered links) — structurally built to drive measurable traffic.
- Conversion: Perplexity sessions convert ~3.1× non-branded Google organic, with ~4.7× session duration — high-intent, research-driven.
- Scale: but only ~2.8% of AI referral traffic — small in absolute terms.
- Verdict: negligible by total traffic, but it’s the only platform where you can show a hard “GEO drove conversions” number. Need to prove GEO works to a boss or client? Start here — nowhere else proves it. Comparison and research content win most.
Google Gemini / AI Overviews — the zero-click double-edge
- Traffic behavior: ~83% of AI Overview searches are zero-click (AI Mode higher); when an AIO appears, top-page CTR drops ~58%. But cited brands see ~+35% CTR — not cited = traffic eaten; cited = you keep the high-intent users who clicked through after reading.
- Content type decides survival: “how-to” queries are ~99.9% cannibalized; “buy X / best X / price” transactional queries far less.
- Scale: Google-scale + AI Mode already 100M+ MAU — widest reach.
- Verdict: it gives links but ~83% are zero-click, its sources are mostly an extension of existing Google rankings, and its users pay less than Claude’s — so for most brands it isn’t a traffic source, it’s a defensive front: not being cited means getting eaten by the summary. Don’t expect informational content to drive traffic; only transactional/comparison queries are worth fighting for here.
Grok — niche, but strong on the right topics
- Traffic/audience: live web + direct X; audience skews male, 25–34, and ~4× more likely than average to be crypto traders (the widest skew in the field); extreme freshness weighting.
- Verdict: unless you’re in crypto, current events or tech, skip it for now and save resources for the top three. Fierce for the right industry, wasted on the wrong one — don’t chase it just because it’s loud in the news.
One table: the channel character of all five
| Platform | Gate | Clickable links? | Audience | Paying power / scale | Conversion shape | GEO priority for |
|---|---|---|---|---|---|---|
| Claude | Brave | barely | devs / enterprise / pro | top rev/user, enterprise-strong | influence (not traffic) | B2B / SaaS / dev / high-ticket |
| ChatGPT | Bing + own | yes (Search mode) | most mainstream | 1B MAU, ~87% of AI referral | breadth traffic | almost every brand |
| Perplexity | own + Bing | dense & clickable | research, high-intent | small but converts 3.1× | measurable traffic | ROI proof, comparison content |
| Gemini/AIO | yes but mostly zero-click | widest | hundreds of millions | cited-to-keep-CTR | info = presence, transactional = traffic | |
| Grok | web + X | yes | male / crypto-skew | growing, niche | timely influence | crypto / news / tech |
Mapping to GEO: don’t chase “being cited,” chase “right platform × right conversion”
Boil it down to an executable priority:
- Start from your business model, then pick the platform. B2B / high-ticket / dev-tools → Claude’s “influence” first (accept that it won’t send traffic); want measurable traffic + ROI → Perplexity; mass retail / broad reach → ChatGPT + transactional AI-Overview queries. Allocate the same effort by “platform audience ≟ your customer,” not evenly across five.
- Track “cited” and “drove traffic” separately. Claude / AI Overviews are dark-funnel / zero-click — GA referral will understate them. GEO measurement must include brand-search volume, direct-traffic shifts, customer self-report, and proposal win rates — or you’ll misjudge your most valuable Claude influence as “no effect” and cut it.
- Match content type to platform. Informational (how-to) is near zero-click in AI Overviews → aim to “be cited for brand presence”; transactional (buy/best/price) is barely hit by zero-click → worth driving traffic everywhere; research/comparison → Perplexity’s upside is largest.
- The four mechanism gates are still the ticket. Right channel or not, fail stage ① (not in that index) and nothing else matters — get into Bing/Google/Brave, allow the right crawlers, write extractable answer passages: the shared precondition for every platform.
Why this has to be watched continuously — and not just in GA
The gates, the link behavior, the audiences, the paying power, the growth rates — all differ and all shift monthly: ChatGPT’s share has dropped below 50%, Claude grew 4× YoY, the AI-Overview zero-click rate is still rising. To simultaneously track “am I in each candidate pool, which passage got cited, and did that platform hand value back to me” is impossible to do weekly by hand — and the two most valuable (Claude, AI Overviews) are invisible in GA by nature.
That’s why we run cross-engine continuous monitoring + dark-funnel attribution: automating the engine-by-engine, gate-by-gate, channel-by-channel check so you can see “am I the basis in Claude,” “Perplexity’s ROI number,” and “how much AI Overviews ate vs. saved back via citation” — none of which Google Analytics will ever show you.
Bottom line: three things to remember
- These 5 ≈ the whole market (~85–90%), but each is a different channel — don’t apply one “get cited” goal to all five.
- Clear the four mechanism gates (get into the index) = the ticket; then allocate by “platform audience ≟ your customer” — B2B/high-ticket → Claude’s influence; measurable ROI → Perplexity; breadth → ChatGPT; informational → “cited-to-keep-CTR” in AI Overviews.
- Measure beyond GA — the two most valuable (Claude, AI Overviews) are invisible in referral by nature; Google Analytics alone will misjudge them as “no effect.”
If you must rank them: with budget for only three, pick Claude (influence), Perplexity (ROI numbers), ChatGPT (the base); play defense on Gemini/AIO, and judge Grok by your industry.
In one line: GEO isn’t “getting cited everywhere” — it’s “being trusted by the right people, on the right platform, in the right way” — and the market reshuffles monthly, so it has to be watched continuously, not done once.
Honesty note: none of the engines fully document their rerank/selection layers; the mechanisms above are an informed synthesis of vendor docs and observable behavior, and the figures are Sensor Tower / third-party estimates that shift as platforms tune. Audience profiles are statistical tendencies, not absolutes. Which is exactly why this needs continuous tracking, not a one-time read.