Exploiting AI’s data preferences
Not everyone is sold on the promise of AI-powered search. Rapyd.AI’s Zwingmann, who has tested ChatGPT’s new search capabilities, thinks that the feature is too niche to succeed with mass audiences, especially when search giant Google is already as reliable as it is. That AI platforms prioritize criteria other than reliability when surfacing search results means that consumers are likely to be burned, which could ultimately erode trust in the technology, Zwingmann said.
But this very issue is one that savvy advertisers can use to their advantage should AI search continue to gain popularity. According to Zwingmann, when AI models are scraping the internet to return results for a search, what they’re looking for is more about content density than content authority. That is, content that appears with high frequency and semantic overlap (or consistency of meaning) are more likely to be surfaced, even if their sources are less reliable. ChatGPT’s search capabilities, for example, are vastly underperforming Google’s with respect to short internet queries, in which meaning and intent is vague, according to TechCrunch.
This means that one way brands can increase their chances of being surfaced by these platforms is by simply increasing their presence on the internet—or, “playing the numbers game,” Zwingmann said. Providing more information on one’s website, for example, as well as posting more on social media could have an impact on how the AI sees that brand, and its likelihood to mention the brand in search results.
The SEO-related incentives for this tinkering is quite clear. In a basic test conducted by Ad Age and Greg Asman, founder and managing principal of marketing consultancy firm The Asman Group, a search query for “Dove” on ChatGPT first surfaced results for the bird, while on Google the same query first surfaced results for the Unilever-owned brand. And even when ChatGPT did surface results for the brand, only a link to its Wikipedia page was included and not a link to its actual website. Dove declined to comment.
Providing more information about one’s brand may not only improve visibility when AI performs real-time data searches, but may also do so with respect to its underlying training data foundation, Zwingmann said. AI systems are always being trained on more data in order to generate better responses, but even the size of the internet is no match for their appetites. According to a study published this summer by the Data Provenance Institute, the amount of potential training data available to these systems is drying up, in part because publishers and other online entities are restricting their data from being used. Some platforms, such as Perplexity, have been accused of continuing to hoover data without consent.
In this data-scarce environment, brands could find advantage by showing up on the web in a larger way. Another path to do this is by increasing one’s advertising. While traditional formats, such as display ads, are ignored as training data, brands can apply workarounds by publishing more sponsored content, which AI systems will gladly train on, Zwingmann said.
This approach may feel dubious for web advertising, but it is not without precedent. Zwingmann compared it to the common tactic of placing sponsored ads alongside similarly looking non-sponsored material, such as in Google search results, Instagram feeds and Amazon listings.
With AI search, “the lines [between advertising and non-advertising] are getting blurred more and more,” Zwingmann said.