Limited Emotional Spectrum and Baseline Comparisons
The study uses only seven basic emotional states, potentially missing out on human emotional complexity. Baselines are limited to fixed and neutral strategies, lacking comparison with random emotion sequences. This limits understanding of EvoEmo's true effectiveness.
Scenario Dependence and Generalization
The study tested on 20 daily commercial scenarios which limits its ability to generalize to other scenarios. This raises questions about potential bias and how well the model performs under pressure or during high-stakes interactions.
Interpretability of Emotional Strategies
Due to the black-box nature of LLM and evolutionary optimization, it remains unclear why particular emotional strategies are effective. This lack of transparency makes it difficult to analyze and understand the decision-making process of the agent.
Simulation to Reality Gap
The study uses simulations to evaluate the model, making it unclear how it performs in real-world situations. LLM simulations are often limited in their ability to capture the complexities of human behavior.