You cannot improve an AI visibility problem you cannot see. If ChatGPT, Perplexity, and Google's AI Overviews answer questions in your category, the first job is to learn whether they mention you, how often, and against which competitors. Most of that you can measure for free.
What "AI Visibility" Means
AI visibility is how often, and how prominently, AI engines mention or cite your brand when they answer questions in your space. It is the AI-era version of a ranking: instead of where you sit on a results page, it is whether you appear in the synthesized answer at all, and whether you are named alongside or ahead of competitors.
Two signals sit underneath it, and they are not the same. A mention is the model naming your brand in its text. A citation is the model linking to your page as a source. The balance differs by engine: ChatGPT tends to mention brands more often than it cites them, while Google's AI Overviews lean the other way. Either way, track both, because they tell you different things: mentions measure awareness, citations measure whether your content is doing the work.
If the underlying idea is new to you, our guide to generative engine optimization covers how engines decide what to mention. This article is about measuring the result.
The Metrics That Matter
Five metrics cover the job. You do not need all five to start, but you should know what each one tells you.
| Metric | What it tells you | How to get it | The catch |
|---|---|---|---|
| Share of voice | How often you appear versus competitors across a set of prompts | Run prompts, divide your mentions by the total | The prompts you choose decide the number |
| Mentions vs citations | Awareness versus whether your page is the cited source | Log both for each answer | A mention with no citation sends no traffic |
| Sentiment | Whether you are described positively, neutrally, or wrong | Read the answer, not just the name | A wrong mention is worse than none |
| AI referral traffic | Visits that actually arrive from AI tools | GA4 channel plus referrer rules | Heavily undercounted (see below) |
| Branded search lift | People searching your name after seeing you in AI | Branded queries in Search Console | Indirect, but the most defensible |
That is the scoreboard. The rest of this guide is how to populate it without overpaying, and why the numbers are shakier than they look.
A Reality Check: Why AI Visibility Is Hard to Measure
Before you trust any number, understand why the numbers wobble. Two facts make AI visibility genuinely harder to measure than rankings.
First, the answers are non-deterministic. Ask the same model the same question twice and you can get a different set of brands. SparkToro ran 2,961 prompts across ChatGPT, Claude, and Google's AI and found less than a 1-in-100 chance of getting the same list of brands in any two responses, and closer to 1-in-1,000 for the same order. The consequence is blunt: a one-off check tells you almost nothing, and any tool selling you an "AI ranking position" is selling a number that does not really exist. Measure a distribution across many runs, not a single snapshot.
Second, the traffic is hard to attribute. When someone reads about you in an AI answer and later visits your site, the visit often arrives with no referrer and lands in the "Direct" bucket in your analytics. So the AI traffic you can see is a fraction of the real influence, and the gap is wide: Bain estimates around 60% of searches now end without a click to any website, which is influence that never reaches your analytics at all.
Neither problem makes measurement pointless. It means you measure the right way: track direction over weeks, report ranges instead of false-precision decimals, and never bet a decision on a single reading.
Before You Measure: Can AI Even See Your Pages?
There is a prerequisite that sits above every metric: an engine cannot mention or cite a page it cannot reach. If your most important pages are blocked to AI crawlers, your visibility is not low, it is unmeasurable, and no tracker will tell you that is the reason.
The usual culprits are a robots.txt rule that disallows crawlers like GPTBot, ClaudeBot, or PerplexityBot, a firewall that challenges non-browser traffic, or content that only renders with JavaScript the crawler does not run. Google's own guidance is that appearing in AI features takes no special markup, just helpful content engines can reach, and "can reach" is exactly what most sites get wrong for AI bots. Check reachability before you read too much into a poor citation rate, because a blocked crawler makes every other measurement misleading. Our AI search playbook covers the checks, and geotoolbox's free Content Analyzer runs them across the major AI crawlers in under a minute.
The Free Way to Track It
You can build a usable AI visibility picture with a spreadsheet and an hour a month. The trick is consistency, not volume.
- Pick your prompts. Choose 10 to 20 questions a real customer would ask, the kind that should surface your brand. Ground them in your actual Search Console queries rather than guessing, so you track demand that exists.
- Use a clean session. Run each prompt in a logged-out or incognito window. If you are signed in, the model's memory of your past chats biases the answer toward your own brand and ruins the reading.
- Run each prompt across engines, more than once. Put every prompt through ChatGPT, Perplexity, and Google's AI Overview, two or three times each, because a single run is noise.
- Log it in a sheet. One row per answer, with columns for prompt, engine, mentioned (yes/no), position (first, second, third or lower), citation link (yes/no), sentiment, and which competitors appeared. That last column is the one most people skip and the most useful.
- Compute share of voice and repeat. Divide your mentions by the total across all brands for a rough share of voice, then re-run the whole set on a fixed schedule. Monthly is plenty to start. The number on any given day barely matters; the direction over three months is the signal.
That is the entire method. It will not scale to thousands of prompts, but for most brands the first real insights come from exactly this.
Tracking AI Referral Traffic and Branded Search
Prompts tell you about mentions. Two more signals tell you about impact.
The first is AI referral traffic. In GA4, visits from assistants show up as referrals from domains like chatgpt.com, perplexity.ai, and gemini.google.com, so you can build a segment that isolates them. Expect it to undercount badly: when a model strips the referrer, the visit lands silently in "Direct," so treat the number as a floor, not a full count.
The second is branded search lift, and it is the most defensible signal you have. When an AI names your brand, a share of those people go search for you directly afterward. A rise in branded queries in Search Console, especially while your non-branded clicks stay flat or fall, is the fingerprint of AI-driven demand, and it is the number a finance team will accept. Watch it alongside your prompt tracking.
One thing to monitor throughout: accuracy. Models routinely state confident, wrong facts about brands, and a mention that misrepresents you is worse than none. Our guide to AI hallucinations about your brand covers spotting and correcting them.
When Is a Paid Tool Worth It?
Every visibility score on the market, paid or free, is a modeled estimate. No tool sees real user prompts, so all of them sample controlled questions and infer your standing. That is fine, as long as you remember you are buying scale and consistency, not access to ground truth.
The DIY method above works until it does not. Stay manual if you are tracking a small prompt set, reporting monthly, and can spare a couple of hours. Move to a paid tracker when you cross into real scale: hundreds of prompts across five or more engines, daily rather than monthly readings, automated share of voice against several named competitors, or simply the point where the staff time to do it by hand costs more than the subscription. Agencies and multi-brand teams hit that line fastest.
For the actual options, from free graders to enterprise trackers, our rundown of the best AI visibility tools compares them by what they do and what they cost. Whichever you choose, the score it returns is only as good as the prompts behind it, so start by getting those right.
Frequently Asked Questions
What is AI visibility? AI visibility is how often and how prominently AI engines like ChatGPT, Perplexity, and Google's AI Overviews mention or cite your brand when they answer questions in your category. It is the AI-era equivalent of a search ranking.
How do I check if ChatGPT mentions my brand? Ask it. Run the questions your customers would ask, in a logged-out or incognito session so your chat history does not bias the answer, and note whether and how your brand comes up. Repeat across engines and over time, because a single check is unreliable.
Can I measure AI visibility for free? Yes. A spreadsheet, a set of real customer prompts, and a monthly re-run gets you share of voice, sentiment, and competitor comparison at no cost. Paid tools add scale and automation, not access to data you fundamentally cannot get yourself.
Why do the results change every time? Because AI answers are non-deterministic. SparkToro found less than a 1-in-100 chance of getting the same brand list twice. Treat any single result as noise and track the direction across many runs instead.
Do I need a paid AI visibility tool? Only at scale. If you track a small prompt set monthly, do it by hand. Move to a paid tracker when you need hundreds of prompts across many engines, daily data, or automated competitor benchmarking.
Where to Start
You do not need a budget to start measuring AI visibility, you need a consistent method. Pick ten real prompts, run them across the major engines in a clean session, log what you see, and repeat monthly. Watch the direction, not the daily number.
Before any of that, rule out the silent failure: a page AI crawlers cannot reach scores zero for a reason no tracker will show you. geotoolbox's free Content Analyzer checks whether the major AI crawlers can fetch your page and grades how citable it is from 0 to 100, in under a minute. Start there, then build your prompt-tracking sheet.
Sources
- AIs are highly inconsistent when recommending brands or products - SparkToro, 2025 (2,961-prompt study)
- Goodbye Clicks, Hello AI: Zero-Click Search Redefines Marketing - Bain & Company, 2025
- AI features and your website - Google Search Central