Show simple item record

Files in this item

Thumbnail

Item metadata

dc.contributor.authorZhang, Huizi
dc.contributor.authorSwallow, Ben
dc.contributor.authorGupta, Mayetri
dc.date.accessioned2023-01-04T12:30:05Z
dc.date.available2023-01-04T12:30:05Z
dc.date.issued2022-08-01
dc.identifier281140223
dc.identifier58f54f28-dd70-4de4-8d66-af0dafd9fd36
dc.identifier000834508100012
dc.identifier.citationZhang , H , Swallow , B & Gupta , M 2022 , ' Bayesian hierarchical mixture models for detecting non-normal clusters applied to noisy genomic and environmental datasets ' , Australian and New Zealand Journal of Statistics , vol. 64 , no. 2 , pp. 313-337 . https://doi.org/10.1111/anzs.12370en
dc.identifier.issn1369-1473
dc.identifier.otherORCID: /0000-0002-0227-2160/work/118411961
dc.identifier.urihttps://hdl.handle.net/10023/26663
dc.description.abstractClustering to find subgroups with common features is often a necessary first step in the statistical modelling and analysis of large and complex datasets. Although follow-up analyses often make use of complex statistical models that are appropriate for the specific application, most popular clustering approaches are either nonparametric, or based on Gaussian mixture models and their variants, often for reasons of computational efficiency. Certain characteristics in the data, such as the presence of outliers, or non-ellipsoidal cluster shapes, that are common in modern scientific datasets, often lead these methods to fail to detect the cluster components accurately. In this article, we present two efficient and robust Bayesian clustering approaches that seek to overcome these limitations-a model-based 'tight' clustering approach to cluster points in the presence of outliers, and a hierarchical Laplace mixture-based approach to cluster heavy-tailed and otherwise non-normal cluster components-and illustrate their power and accuracy in detecting meaningful clusters in datasets from genomics, imaging and the environmental sciences.
dc.format.extent25
dc.format.extent2555234
dc.language.isoeng
dc.relation.ispartofAustralian and New Zealand Journal of Statisticsen
dc.subjectData augmentationen
dc.subjectGibbs samplingen
dc.subjectLatent variable modelsen
dc.subjectMarkov Chain Monte Carloen
dc.subjectNon-Gaussian clustersen
dc.subjectSNP genotypingen
dc.subjectQA Mathematicsen
dc.subjectQH301 Biologyen
dc.subjectQH426 Geneticsen
dc.subject3rd-DASen
dc.subjectACen
dc.subjectMCCen
dc.subject.lccQAen
dc.subject.lccQH301en
dc.subject.lccQH426en
dc.titleBayesian hierarchical mixture models for detecting non-normal clusters applied to noisy genomic and environmental datasetsen
dc.typeJournal articleen
dc.contributor.institutionUniversity of St Andrews. School of Mathematics and Statisticsen
dc.contributor.institutionUniversity of St Andrews. Centre for Research into Ecological & Environmental Modellingen
dc.identifier.doi10.1111/anzs.12370
dc.description.statusPeer revieweden


This item appears in the following Collection(s)

Show simple item record