Show simple item record

Files in this item

Thumbnail

Item metadata

dc.contributor.authorDearle, Alan
dc.contributor.authorConnor, Richard
dc.date.accessioned2021-02-04T00:39:11Z
dc.date.available2021-02-04T00:39:11Z
dc.date.issued2021-01
dc.identifier265772781
dc.identifier15b68e8a-226e-4019-a43f-5098fdc9e91c
dc.identifier85080067822
dc.identifier000581494100002
dc.identifier.citationDearle , A & Connor , R 2021 , ' BitPart : exact metric search in high(er) dimensions ' , Information Systems , vol. 95 , 101493 . https://doi.org/10.1016/j.is.2020.101493en
dc.identifier.issn0306-4379
dc.identifier.urihttps://hdl.handle.net/10023/21368
dc.description.abstractWe define BitPart (Bitwise representations of binary Partitions), a novel exact search mechanism intended for use in high-dimensional spaces. In outline, a fixed set of reference objects is used to define a large set of regions within the original space, and each data item is characterised according to its containment within these regions. In contrast with other mechanisms only a subset of this information is selected, according to the query, before a search within the re-cast space is performed. Partial data representations are accessed only if they are known to be potentially useful towards the calculation of the exact query solution. Our mechanism requires Ω(N log N ) space to evaluate a query, where N is the cardinality of the data, and therefore does not scale as well as previously defined mechanisms with low-dimensional data. However it has recently been shown that, for a nearest neighbour search in high dimensions, a sequential scan of the data is essentially unavoidable. This result has been suspected for a long time, and has been referred to as the curse of dimensionality in this context. In the light of this result, the compromise achieved by this work is to make the best possible use of the available fast memory, and to offer great potential for parallel query evaluation. To our knowledge, it gives the best compromise currently known for performing exact search over data whose dimensionality is too high to allow the useful application of metric indexing, yet is still sufficiently low to give at least some traction from the metric and supermetric properties.
dc.format.extent14
dc.format.extent1848874
dc.language.isoeng
dc.relation.ispartofInformation Systemsen
dc.subjectSimilarity searchen
dc.subjectMetric spaceen
dc.subjectMetric indexingen
dc.subjectMetric searchen
dc.subjectFour-point propertyen
dc.subjectQA75 Electronic computers. Computer scienceen
dc.subject3rd-DASen
dc.subjectBDCen
dc.subjectR2Cen
dc.subject~DC~en
dc.subject.lccQA75en
dc.titleBitPart : exact metric search in high(er) dimensionsen
dc.typeJournal articleen
dc.contributor.institutionUniversity of St Andrews. School of Computer Scienceen
dc.identifier.doi10.1016/j.is.2020.101493
dc.description.statusPeer revieweden
dc.date.embargoedUntil2021-02-04
dc.identifier.urlhttps://www.sciencedirect.com/science/article/pii/S0306437920300041en


This item appears in the following Collection(s)

Show simple item record