LLMs Reproduce Human Purchase Intent via Semantic Similarity Elicitation of Likert Ratings
This paper introduces Semantic Similarity Rating (SSR), a new method allowing large language models (LLMs) to accurately simulate human purchase intent by having them generate free-text responses, which are then mapped to Likert scales based on semantic similarity. The method, tested on 57 personal care product surveys, achieved 90% human test-retest reliability and produced realistic response distributions, outperforming direct numerical rating requests. It also generated rich qualitative feedback, though the reference statements were manually optimized for this dataset and not all demographics were replicated consistently.