What question did this study set out to answer?

This research aims to assess user perception of musical taste profiles generated by large language models and identify biases influencing these profiles.

June 7, 2026

A Study of Biases in LLM-Generated Musical Taste Profiles for Recommendation

Key Points

This research aims to assess user perception of musical taste profiles generated by large language models and identify biases influencing these profiles.
Conducted a user study evaluating natural language profiles based on listening histories.
Analyzed bias across user attributes (mainstreamness, taste diversity) and item features (genre, country of origin).
Assessed usefulness of profiles in a recommendation task via shared embedding space analysis.
Identified systematic differences in profile accuracy based on user attributes and item features.
Highlighted that generated profiles varied in quality across different user groups and models.
Demonstrated both potential benefits and limitations of using LLM-based profiles in personalized recommendation systems.

Abstract

Large Language Models (LLMs) offer a promising approach to recommendation by enabling the generation of user profiles in Natural Language (NL) form. When used as summarization devices, LLMs can produce interpretable and editable alternatives to opaque collaborative filtering representations, potentially increasing transparency and user control. However, it remains unclear whether users perceive these profiles as accurate representations of their preferences, which is key for trust and usability. Moreover, because LLMs inherit societal and data-driven biases, profile quality may systematically vary across user and item characteristics. In this paper, we investigate these issues in the context of music streaming, where personalization is challenged by large and culturally diverse catalogs. We conduct a user study in which participants evaluate NL profiles generated from their own listening histories. We analyze whether user identification with these profiles is biased by user attributes, such as mainstreamness and taste diversity, and by item features, including genre and country of origin. We further assess the usefulness of the generated profiles in a downstream recommendation task by analyzing their representations in a shared embedding space. Our results reveal systematic differences across models and user groups, highlighting both the potential and the limitations of scrutable, LLM-based profiling for personalized systems.

AI에게 질문

Bookmark

AI에게 질문

Bookmark

A Study of Biases in LLM-Generated Musical Taste Profiles for Recommendation

Key Points

Abstract

Cite This Study