What question did this study set out to answer?

The aim is to analyze AI-driven brand recommendation sequences to understand measurement reliability.

April 3, 2026Open Access

When AI Is A Hot Mess about Your Brand

Key Points

The aim is to analyze AI-driven brand recommendation sequences to understand measurement reliability.
Analyzed 7,000+ purchase recommendation sequences from various AI models.
Utilized AIVO Optimize to track recommendations throughout four-turn sequences.
Compared results across different AI platforms including ChatGPT, Perplexity, Gemini, and Claude.
Found that the majority of brand recommendations at the final turn are unreliable noise.
Indicated that existing measurement methods often do not capture a stable signal.
Revealed the need for brands to rethink how they assess AI-driven recommendations.

Abstract

Anthropic's researchers just published evidence that AI failures become increasingly incoherent as reasoning gets longer. We ran the same analysis on 7,000+ purchase recommendation sequences. The findings should change how every brand thinks about AI measurement. AIVO Optimize tracks brand recommendations across four-turn buying sequences — the progression from initial awareness query through to a direct purchase recommendation. We run these sequences repeatedly on identical prompts, across ChatGPT, Perplexity, Gemini, and Claude, and record whether a brand wins or loses the recommendation at each turn. Anthropic's paper is about the fundamental behaviour of large language models under extended reasoning. Ours is about what a consumer sees when they ask AI what moisturiser to buy. The level of abstraction is different. The underlying phenomenon is the same. The question it forces for anyone measuring brand performance in AI — including us — is whether your measurement is capturing a stable signal or averaging over noise and calling it insight. At turn four of a purchase sequence, for most brands in most categories, the honest answer is: mostly noise. That's not a comfortable finding. It's a useful one. You can't fix a problem you haven't measured.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

AIVO Optimize

Actions

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

When AI Is A Hot Mess about Your Brand

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Actions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study