Lack of Control for Variability Between Species
The datasets used in the study, while diverse, were not well-controlled for variability between species, as they were sampled from animals in various conditions both in the wild and in the laboratory.
Not a Replacement for Traditional Analyses
While the unsupervised latent models offer advantages in certain situations, they are not meant to replace traditional analyses using known behaviorally-relevant feature spaces when such knowledge is available.
Dependence on Large Datasets
Contemporary machine learning algorithms often require large datasets, and the utility of methods like UMAP decreases with smaller datasets where few exemplars are available for each vocalization.
Choice of Distance Metric
The choice of distance metric used to build the graph in UMAP can influence the results, and while the Euclidean distance used in the study is easy to compute, other metrics might be more appropriate depending on the specific animal's perceptual representations.
The structure found using graph-based dimensionality reduction algorithms can be affected by the parameterization used, and although the default parameters are generally a good starting point, exploring the persistence of structure across parameterizations is important.