Sampled angles in high-dimensional spaces
Abstract
Similarity search using metric indexing techniques is largely a solved problem in low-dimensional spaces. However, techniques based only on the triangle inequality property start to fail as dimensionality increases. Since proper metric spaces allow a finite projection of any three objects into a 2D Euclidean space, the notion of angle can be validly applied among any three (but no more) objects. High dimensionality is known to have interesting effects on angles in vector spaces, but to our knowledge this has not been studied in more general metric spaces. Here, we consider the use of angles among objects in combination with distances. We show that, as dimensionality increases, the variance in sampled angles reduces. Furthermore, sampled angles also become correlated with inter-object distances, giving different distributions between query solutions and non-solutions. We show the theoretical underpinnings of this observation in unbounded high-dimensional Euclidean spaces, and then examine how the pure property is reflected in some real-world high-dimensional spaces. Our experiments on both generated and real-world datasets demonstrate that these observations can have an important impact on the tractability of search as dimensionality increases.
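The claim that the variance of sampled angles shrinks with dimensionality can be illustrated with a minimal sketch. It is not the authors' experimental setup: it assumes points drawn independently from a standard Gaussian in Euclidean space, and the helper names `angle_from_distances` and `sampled_angles` are hypothetical. The angle is computed from the three pairwise distances alone, via the law of cosines, which is the sense in which an angle is defined among any three objects of a proper metric space.

```python
import math
import random
import statistics


def angle_from_distances(d_ab, d_ac, d_bc):
    """Angle at object a in the triangle (a, b, c), using only the
    three pairwise distances (law of cosines)."""
    cos_theta = (d_ab ** 2 + d_ac ** 2 - d_bc ** 2) / (2 * d_ab * d_ac)
    # Clamp to [-1, 1] to guard against floating-point rounding.
    return math.acos(max(-1.0, min(1.0, cos_theta)))


def sampled_angles(dim, n_samples, rng):
    """Sample triples of points and return the angle at the first point.

    Assumption: points are i.i.d. standard Gaussian vectors; real
    datasets may behave differently.
    """
    angles = []
    for _ in range(n_samples):
        a, b, c = ([rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(3))
        angles.append(
            angle_from_distances(math.dist(a, b), math.dist(a, c), math.dist(b, c))
        )
    return angles


rng = random.Random(0)  # fixed seed so the run is repeatable
spread = {}
mean_angle = {}
for dim in (2, 10, 100):
    angles = sampled_angles(dim, 2000, rng)
    spread[dim] = statistics.pstdev(angles)
    mean_angle[dim] = statistics.fmean(angles)
    print(f"dim={dim:>3}  mean={mean_angle[dim]:.3f} rad  stdev={spread[dim]:.3f} rad")
```

Under these assumptions the standard deviation of the sampled angle falls as `dim` grows, and the mean angle approaches π/3, since the three pairwise distances become nearly equal and the triangle nearly equilateral.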
Citation
Connor, R & Dearle, A 2020, Sampled angles in high-dimensional spaces. in S Satoh, L Vadicamo, A Zimek, F Carrara, I Bartolini, M Aumüller, B Þ Jónsson & R Pagh (eds), Similarity Search and Applications: 13th International Conference, SISAP 2020, Copenhagen, Denmark, September 30–October 2, 2020, Proceedings. Lecture Notes in Computer Science (Information Systems and Applications, incl. Internet/Web, and HCI), vol. 12440, Springer, Cham, pp. 233-247, 13th International Conference on Similarity Search and Applications, SISAP 2020, 30/09/20. https://doi.org/10.1007/978-3-030-60936-8_18
Publication
Similarity Search and Applications
ISSN
0302-9743
Type
Conference item