Files in this item
PFClust: an optimised implementation of a parameter-free clustering algorithm
Item metadata
dc.contributor.author | Musayeva, Khadija | |
dc.contributor.author | Henderson, Tristan | |
dc.contributor.author | Mitchell, John B. O. | |
dc.contributor.author | Mavridis, Lazaros | |
dc.date.accessioned | 2014-03-07T17:31:01Z | |
dc.date.available | 2014-03-07T17:31:01Z | |
dc.date.issued | 2014-02 | |
dc.identifier.citation | Musayeva , K , Henderson , T , Mitchell , J B O & Mavridis , L 2014 , ' PFClust: an optimised implementation of a parameter-free clustering algorithm ' , Source Code for Biology and Medicine , vol. 9 , no. 5 . https://doi.org/10.1186/1751-0473-9-5 | en |
dc.identifier.issn | 1751-0473 | |
dc.identifier.other | PURE: 95083033 | |
dc.identifier.other | PURE UUID: aecc218f-8405-4fc0-aa16-fd2148dc98c9 | |
dc.identifier.other | Bibtex: urn:68778f010ac5a1f4c961a557986f9f0d | |
dc.identifier.other | Scopus: 84893200342 | |
dc.identifier.other | ORCID: /0000-0002-0379-6097/work/34033394 | |
dc.identifier.uri | https://hdl.handle.net/10023/4491 | |
dc.description | This work was supported by the World Anti-Doping Agency and the Scottish Universities Life Sciences Alliance. | en |
dc.description.abstract | Background: A well-known problem in cluster analysis is finding an optimal number of clusters reflecting the inherent structure of the data. PFClust is a partitioning-based clustering algorithm capable, unlike many widely-used clustering algorithms, of automatically proposing an optimal number of clusters for the data. Results: The results of tests on various types of data showed that PFClust can discover clusters of arbitrary shapes, sizes and densities. The previous implementation of the algorithm had already been successfully used to cluster large macromolecular structures and small druglike compounds. We have greatly improved the algorithm by a more efficient implementation, which enables PFClust to process large data sets acceptably fast. Conclusions: In this paper we present a new optimized implementation of the PFClust algorithm that runs considerably faster than the original. | |
dc.language.iso | eng | |
dc.relation.ispartof | Source Code for Biology and Medicine | en |
dc.rights | © 2014 Musayeva et al.; licensee BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. | en |
dc.subject | Clustering | en |
dc.subject | Cluster analysis | en |
dc.subject | Number of clusters | en |
dc.title | PFClust: an optimised implementation of a parameter-free clustering algorithm | en |
dc.type | Journal article | en |
dc.description.version | Publisher PDF | en |
dc.contributor.institution | University of St Andrews. School of Computer Science | en |
dc.contributor.institution | University of St Andrews. School of Chemistry | en |
dc.contributor.institution | University of St Andrews. Biomedical Sciences Research Complex | en |
dc.contributor.institution | University of St Andrews. EaSTCHEM | en |
dc.identifier.doi | https://doi.org/10.1186/1751-0473-9-5 | |
dc.description.status | Peer reviewed | en |
This item appears in the following Collection(s)
Items in the St Andrews Research Repository are protected by copyright, with all rights reserved, unless otherwise indicated.