Instability and Reproducibility of Topic Models
The study acknowledges that the choice of topic model can greatly influence the results, especially for BERTopic where repeated modeling leads to different outcomes due to the stochastic nature of the model. This introduces a level of instability and makes reproducibility challenging.
Limited Generalizability Across Social Media Platforms
The research focuses solely on Twitter data. While the authors argue that the methodology should be transferable, the specific characteristics of Twitter (character limits, hashtags, etc.) may not fully represent the diversity and complexity of other social media platforms.
Exclusion of Newer Language Models
While the study evaluates four different models, it doesn't explore other emerging models like GPT-3 or WuDao 2.0. This limits the scope of the comparison and potentially overlooks more powerful techniques.
Subjectivity of Interpretation
The study emphasizes the role of human interpretation in making sense of topic modeling results. However, this reliance on subjective judgment can introduce bias and make comparisons across different researchers less reliable.
Influence of Keyword Selection
The choice of keywords for the term search function in Top2Vec and BERTopic significantly influences the results. The study does not fully address how researchers should select appropriate keywords and mitigate potential biases introduced by this selection process.