Show simple item record

Files in this item


Item metadata

dc.contributor.authorKershenbaum, Arik
dc.contributor.authorGarland, Ellen Clare
dc.identifier.citationKershenbaum , A & Garland , E C 2015 , ' Quantifying similarity in animal vocal sequences : which metric performs best? ' , Methods in Ecology and Evolution , vol. 6 , no. 12 , pp. 1452-1461 .
dc.identifier.otherORCID: /0000-0002-8240-1267/work/49580217
dc.descriptionE.C.G is supported by a Newton International Fellowship. Part of this work was conducted while E.C.G. was supported by a National Research Council (National Academy of Sciences) Postdoctoral Fellowship at the National Marine Mammal Laboratory, AFSC, NMFS, NOAA.en
dc.description.abstract1. Many animals communicate using sequences of discrete acoustic elements which can be complex, vary in their degree of stereotypy, and are potentially open-ended. Variation in sequences can provide important ecological, behavioural, or evolutionary information about the structure and connectivity of populations, mechanisms for vocal cultural evolution, and the underlying drivers responsible for these processes. Various mathematical techniques have been used to form a realistic approximation of sequence similarity for such tasks. 2. Here, we use both simulated and empirical datasets from animal vocal sequences (rock hyrax, Procavia capensis; humpback whale, Megaptera novaeangliae; bottlenose dolphin, Tursiops truncatus; and Carolina chickadee, Poecile carolinensis) to test which of eight sequence analysis metrics are more likely to reconstruct the information encoded in the sequences, and to test the fidelity of estimation of model parameters, when the sequences are assumed to conform to particular statistical models. 3. Results from the simulated data indicated that multiple metrics were equally successful in reconstructing the information encoded in the sequences of simulated individuals (Markov chains, n-gram models, repeat distribution, and edit distance), and data generated by different stochastic processes (entropy rate and n-grams). However, the string edit (Levenshtein) distance performed consistently and significantly better than all other tested metrics (including entropy, Markov chains, n-grams, mutual information) for all empirical datasets, despite being less commonly used in the field of animal acoustic communication. 4. The Levenshtein distance metric provides a robust analytical approach that should be considered in the comparison of animal acoustic sequences in preference to other commonly employed techniques (such as Markov chains, hidden Markov models, or Shannon entropy). The recent discovery that non-Markovian vocal sequences may be more common in animal communication than previously thought, provides a rich area for future research that requires non-Markovian based analysis techniques to investigate animal grammars and potentially the origin of human language.
dc.relation.ispartofMethods in Ecology and Evolutionen
dc.subjectAnimal communicationen
dc.subjectEdit distanceen
dc.subjectStochastic processesen
dc.subjectQH301 Biologyen
dc.titleQuantifying similarity in animal vocal sequences : which metric performs best?en
dc.typeJournal articleen
dc.contributor.sponsorThe Royal Societyen
dc.contributor.institutionUniversity of St Andrews. School of Biologyen
dc.contributor.institutionUniversity of St Andrews. Sea Mammal Research Uniten
dc.contributor.institutionUniversity of St Andrews. Centre for Social Learning & Cognitive Evolutionen
dc.contributor.institutionUniversity of St Andrews. Centre for Biological Diversityen
dc.description.statusPeer revieweden

This item appears in the following Collection(s)

Show simple item record