Skip to main content
Whitepaper · GEO / AI Search

How Small Businesses Get Cited by ChatGPT, Claude, and Perplexity (and Why It Matters in 2026)

A research-backed guide to how AI answer engines decide which businesses to name — and the practical, honest playbook for earning those citations.

Ozvor Research13 min read

Your potential customer in Lisboa types into ChatGPT: “best independent dentist for cosmetic work in city centre.” The AI answers with three names. One of them has posted on LinkedIn every week for the past three months with specific, useful content. The other two have not posted anywhere in over a year. Guess which name appears first. The question for your business is not whether this is happening. It is whether your name is in the answer when it does.

Key takeaways

  • Buyers increasingly ask AI before they search — and AI answers with a short list of names. If you’re not on it, you’re invisible.
  • AI engines cite sources they can find, parse, and trust — drawn from training data and live retrieval from places like Reddit, Wikipedia, LinkedIn, and your own site.
  • Citation-worthy content shares five traits: it’s specific, sourced, answer-shaped, statistic-rich, and consistently published.
  • No tool can guarantee a citation — AI is non-deterministic — but you can measurably increase the probability, and track it.

The numbers behind the shift

The way people find businesses is changing at a pace most small business owners have not caught up with yet.

Gartner predicted in February 2024 that traditional search engine volume would drop 25% by 2026 as customers shift to AI tools for recommendations and answers. That shift is well underway. ChatGPT reached 200 million weekly active users by August 2024 — double its count from November 2023. Perplexity, which launched in 2022, crossed 10 million monthly active users by January 2024 and reached 30 million monthly active users by early 2025.

Sources: Gartner press release, February 2024; OpenAI / Axios, August 2024; Backlinko Perplexity statistics, citing Perplexity data.

When those users ask a question that overlaps with your business — “best accountant for freelancers in Berlin,” “which social media tool should I use for my restaurant,” “reliable plumber in Lisbon city centre” — ChatGPT, Claude, Perplexity, and Gemini generate an answer with names and sources. They do not show ten blue links. They give a recommendation.

Marketers are paying attention. A BrightEdge survey of more than 750 search, content, and digital marketing professionals, conducted in June 2025, found that 68% of organisations are actively changing their strategies to account for AI search, and that more than half have tasked their SEO or digital marketing teams with leading those efforts.

Source: BrightEdge press release, “BrightEdge Survey Reveals 68% of Marketers Are Embracing AI Search Shift,” brightedge.com.

For a scenario-driven look at why most businesses are falling behind on social media consistency, read: Why Small Businesses Stop Posting on Social Media.


What is Generative Engine Optimization (GEO)?

Generative Engine Optimization, or GEO, is the practice of structuring your content so that large language models are more likely to cite it when answering relevant questions.

The term was formally defined in a research paper by academics at Princeton University, Georgia Tech, the Allen Institute for AI, and IIT Delhi, published at KDD 2024 — one of the most competitive academic conferences in data science. The paper introduced a benchmark of 10,000 diverse user queries and tested nine content optimization methods against AI search systems to measure what drives citation visibility.

Source: Aggarwal et al., “GEO: Generative Engine Optimization,” Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024, arxiv.org/abs/2311.09735, doi.org/10.1145/3637528.3671900.

Traditional SEO is about ranking in Google’s index. GEO is about being cited in AI-generated answers. The mechanisms overlap but are not the same.

DimensionTraditional SEOGenerative Engine Optimization (GEO)
GoalRank on page 1 of GoogleBe cited in AI-generated answers
Primary signalBacklinks + keyword relevanceStructured, authoritative, specific content
Key platformsGoogle, BingChatGPT, Claude, Perplexity, Gemini
Content formatKeyword-optimised pagesSpecific, data-backed posts and articles
Timescale to impactMonths to yearsWeeks to months (retrieval-based systems)
Citation mechanismBlue link in SERPNamed source embedded in AI response

Neither replaces the other. Businesses that want to be visible in 2026 benefit from doing both. But GEO is where most small businesses have done nothing yet — which means the field is still open.


How LLMs decide what to cite

To understand GEO, you need to understand how different AI systems actually pull in content when answering questions. There are two distinct mechanisms at work.

Training data

When a language model is trained, it processes a large slice of the internet. Content that appears frequently, from sources that appear frequently, gets encoded into the model’s weights. This is why Wikipedia, Reddit, and LinkedIn are heavily represented in AI answers — they have been indexed at scale. For small businesses, this path to visibility is slow: content you publish today will not appear in a model’s training data until the next major training run, which can take a year or more.

Live retrieval (real-time search)

This is where the faster opportunity lies. ChatGPT has offered Browse with Bing since 2023, allowing the model to fetch and synthesise live web content in response to queries. Perplexity was built from the ground up as a retrieval-augmented AI — every answer it generates includes real-time web search, with clickable citations for each source. Claude (Anthropic) has offered web search capability since 2024. Gemini is deeply integrated with Google’s live index.

For small businesses, the practical implication is this: content you publish on LinkedIn today can be retrieved and cited by Perplexity or ChatGPT Search within days or weeks — not after a model retraining cycle. The path to AI citation is faster than most people assume, if the content is structured correctly.

The caveat is phrasing. Retrieval-based systems match your content to queries by semantic relevance. If a customer asks “best dentist for invisalign in city centre” and your LinkedIn post says “we offer Invisalign consultations at our city-centre clinic,” those words need to be there. Vague posts (“we love helping our clients smile”) do not get retrieved because they do not match any specific query.


Where citations actually come from

Not all content is cited equally. Semrush conducted a three-month study of the most-cited domains across AI search systems. Reddit leads with a citation frequency of 40.1%. Wikipedia follows. LinkedIn ranks as the second most-cited domain overall, and the top-cited domain for professional queries.

Source: Semrush, “The Most-Cited Domains in AI: A 3-Month Study,” semrush.com.

In a separate study of 89,000 LinkedIn URLs cited by ChatGPT Search, Google AI Mode, and Perplexity — analysed across 325,000 unique prompts — LinkedIn appeared in 11% of AI responses on average.

Source: Semrush, “We Analyzed 89K LinkedIn URLs Cited in AI Search: Here’s What Drives Visibility,” semrush.com.

For B2B businesses and professional services, LinkedIn is the clearest opportunity. For consumer-facing local businesses, Reddit and Quora threads about your niche, as well as blog posts on your own domain, are the most reachable citation sources.

The practical hierarchy for most small businesses:

  1. LinkedIn — highest practical impact for professional services and B2B
  2. Reddit and Quora — effective for businesses in consumer niches with active communities
  3. Blog posts on your own domain — slower to build authority but owned by you
  4. Industry directories and press — high trust, harder to produce at volume

What content gets cited

The Princeton GEO paper did not just name the problem. It tested solutions. The researchers evaluated nine optimization methods and measured their effect on citation visibility across AI search systems. The two strongest individual techniques were statistics addition (adding specific numerical data to content) and quotation addition (adding authoritative quotes or attributed statements). These methods improved position-adjusted citation visibility by up to 40–41% relative to baseline.

Source: Aggarwal et al., KDD 2024, arxiv.org/abs/2311.09735. Specific figures: 41% for quotation addition, ~40% for statistics addition on Position-Adjusted Word Count metric.

The Semrush LinkedIn study adds granular data on what gets cited in practice. LinkedIn articles dominate citations across all three AI platforms, accounting for 50–66% of cited LinkedIn content. Feed posts account for 15–28%. Critically, the median cited LinkedIn post has only 15–25 reactions — AI search does not reward the most-liked posts, it rewards the most relevant answers.

Source: Semrush, “We Analyzed 89K LinkedIn URLs Cited in AI Search,” semrush.com.

Five traits of citation-worthy posts

Research points to five consistent traits in content that AI systems cite, across both the GEO academic literature and observed citation patterns in practice:

  1. Specific over general. A post about 'three things to check before signing a commercial lease in Portugal' will be cited for a narrow query. A post about 'the importance of understanding your lease' will not match any query specifically enough.
  2. Data-backed. Content that includes specific numbers, statistics, or attributed findings performs significantly better in AI citation studies than content with only qualitative claims. Even approximate figures help.
  3. Opinionated, not bland. AI systems drawing on search results prefer content with a clear stance — not inflammatory, but not hedged into meaninglessness. 'We recommend X over Y for small retailers because of Z' is more useful than 'there are many options to consider.'
  4. One clear idea per post. Content that tries to cover everything gets cited for nothing. A post that thoroughly addresses one question gets retrieved when that question is asked.
  5. Consistent cadence. A single excellent post is not a citation strategy. AI systems that retrieve live content see freshness as a relevance signal. Posting consistently — not daily, but regularly — builds the content base that retrieval systems can draw on.

The anatomy of a citation-worthy LinkedIn post

The difference between content that gets cited and content that does not often comes down to a few structural choices. Here are three examples.

Example 1 — The accountant

Not citation-worthy

Tax season is here. Make sure you stay on top of your accounts. We're here to help.

Citation-worthy

Three expenses most freelancers in Portugal miss at tax time: (1) home office deduction — up to €250 per year if you work from home consistently; (2) professional training costs — courses directly related to your activity are fully deductible; (3) equipment depreciation — laptops and software over €300 can be depreciated rather than expensed fully in year one. Each one requires documented receipts. Most of our clients recover between €400–€900 by catching these three.

The second version will be retrieved when someone asks ChatGPT 'what can freelancers deduct in Portugal.' The first will not be retrieved for any query.

Example 2 — The dentist

Not citation-worthy

We offer a wide range of dental services for the whole family. Book your appointment today.

Citation-worthy

Cosmetic consultations in our Lisbon city-centre clinic: Invisalign assessments take 20 minutes. We assess bite alignment, spacing, and the likely treatment length — typically 6–18 months depending on complexity. We see 3–4 new cosmetic patients per week; most found us through a recommendation. If you have been told by another clinic that your case is complex, it is usually worth a second opinion.

The second version is specific enough to be retrieved for 'invisalign consultation Lisbon' or 'cosmetic dentist city centre.'

Example 3 — The social media agency

Not citation-worthy

We help businesses grow their online presence. Let us tell your story.

Citation-worthy

The LinkedIn posts that get the most organic reach from our clients' accounts: (1) a single specific insight, not a list of seven; (2) first line that states an outcome or counter-intuitive fact; (3) no more than one link, placed in the first comment. Accounts that post twice per week consistently outperform accounts that post five times in a burst and then nothing. We have seen this pattern across 40+ client accounts over 18 months.

The third version earns citations because it answers the specific question 'what makes LinkedIn posts perform well' with attributed, testable claims.


Where Ozvor fits

The research is clear on what builds GEO visibility: consistent posting of specific, structured, useful content. The difficulty for most small business owners is not understanding that — it is doing it, week after week, without a dedicated marketing hire.

Ozvor is built to handle the consistency problem. You tell it what you want to say — a topic, a link, a brief — and Anthropic Claude Sonnet drafts a post shaped for the platform you are posting to and structured for the specific, direct style that AI citation research identifies as most effective. You review the draft, edit if needed, and approve it. Nothing posts without your sign-off. The AI disclosure badge on every draft is visible and non-dismissable — you always know what is AI-generated before it goes to your audience.

Under Anthropic’s API terms, your content is not used to train any AI model and is not retained beyond what is needed to return your result. We never sell or share your content. We disclose every AI sub-processor we use, and EU-region inference is on our roadmap.

The product does not guarantee AI citations. No tool can. What it gives you is the consistent posting cadence and the structured, specific content that the research identifies as the best-documented inputs to citation visibility.


Start building your GEO presence

The businesses that show up in AI answers in 2026 and 2027 are the ones posting consistently now. Not every week is perfect. Not every post gets cited. But the content base accumulates, and AI retrieval systems have something to find and cite when a relevant query comes in.

Ozvor is live. Run the free AI Visibility Test and see your TrustIndex Score in 60 seconds — no credit card required. Founding members (first 100 annual subscribers) get a 30% founder discount plus a 30-day money-back guarantee.


Sources used in this article

  • Gartner, "Gartner Predicts Search Engine Volume Will Drop 25% by 2026," February 2024
  • OpenAI / Axios, ChatGPT 200 million weekly active users announcement, August 2024
  • Backlinko, Perplexity AI statistics, citing Perplexity data through 2025
  • BrightEdge, "BrightEdge Survey Reveals 68% of Marketers Are Embracing AI Search Shift," June 2025
  • Aggarwal et al., "GEO: Generative Engine Optimization," KDD 2024 (arxiv.org/abs/2311.09735)
  • Semrush, "The Most-Cited Domains in AI: A 3-Month Study," semrush.com
  • Semrush, "We Analyzed 89K LinkedIn URLs Cited in AI Search: Here's What Drives Visibility," semrush.com

Start building AI visibility today

Run the free test now and see your TrustIndex Score in 60 seconds.

Run free AI test — freeStart Growth — $99/mo
How Small Businesses Get Cited by ChatGPT, Claude, and Perplexity | Ozvor | Ozvor