St Andrews Research Repository

St Andrews University Home
View Item 
  •   St Andrews Research Repository
  • University of St Andrews Research
  • University of St Andrews Research
  • University of St Andrews Research
  • View Item
  •   St Andrews Research Repository
  • University of St Andrews Research
  • University of St Andrews Research
  • University of St Andrews Research
  • View Item
  •   St Andrews Research Repository
  • University of St Andrews Research
  • University of St Andrews Research
  • University of St Andrews Research
  • View Item
  • Login
JavaScript is disabled for your browser. Some features of this site may not work without it.

Modelling string structure in vector spaces

Thumbnail
View/Open
paper_45.pdf (2.906Mb)
Date
09/07/2019
Author
Connor, Richard
Dearle, Al
Vadicamo, Lucia
Keywords
Metric mapping
n-Simplex projection
Pivoted embedding
String
Jensen-Shannon distance
Levenshtein distance
QA75 Electronic computers. Computer science
DAS
Metadata
Show full item record
Altmetrics Handle Statistics
Abstract
Searching for similar strings is an important and frequent database task both in terms of human interactions and in absolute world-wide CPU utilisation. A wealth of metric functions for string comparison exist. However, with respect to the wide range of classification and other techniques known within vector spaces, such metrics allow only a very restricted range of techniques. To counter this restriction, various strategies have been used for mapping string spaces into vector spaces, approximating the string distances within the mapped space and therefore allowing vector space techniques to be used. In previous work we have developed a novel technique for mapping metric spaces into vector spaces, which can therefore be applied for this purpose. In this paper we evaluate this technique in the context of string spaces, and compare it to other published techniques for mapping strings to vectors. We use a publicly available English lexicon as our experimental data set, and test two different string metrics over it for each vector mapping. We find that our novel technique considerably outperforms previously used technique in preserving the actual distance.
Citation
Connor , R , Dearle , A & Vadicamo , L 2019 , Modelling string structure in vector spaces . in M Mecella , G Amato & C Gennaro (eds) , Proceedings of the 27th Italian Symposium on Advanced Database Systems : Castiglione della Pescaia (Grosseto), Italy, June 16th to 19th, 2019 . , 45 , CEUR Workshop Proceedings , vol. 2400 , Sun SITE Central Europe , SEBD 2019 27th Italian Symposium on Advanced Database Systems , Castiglione della Pescaia , Italy , 17/06/19 . < http://ceur-ws.org/Vol-2400/paper-45.pdf >
 
workshop
 
Publication
Proceedings of the 27th Italian Symposium on Advanced Database Systems
ISSN
1613-0073
Type
Conference item
Rights
© 2019, the Author(s). This work has been made available online in accordance with the publisher's policies. This is the final published version of the work, which was originally published at http://ceur-ws.org/Vol-2400/
Collections
  • University of St Andrews Research
URL
http://ceur-ws.org/Vol-2400/paper-45.pdf
URI
http://hdl.handle.net/10023/18082

Items in the St Andrews Research Repository are protected by copyright, with all rights reserved, unless otherwise indicated.

Advanced Search

Browse

All of RepositoryCommunities & CollectionsBy Issue DateNamesTitlesSubjectsClassificationTypeFunderThis CollectionBy Issue DateNamesTitlesSubjectsClassificationTypeFunder

My Account

Login

Open Access

To find out how you can benefit from open access to research, see our library web pages and Open Access blog. For open access help contact: openaccess@st-andrews.ac.uk.

Accessibility

Read our Accessibility statement.

How to submit research papers

The full text of research papers can be submitted to the repository via Pure, the University's research information system. For help see our guide: How to deposit in Pure.

Electronic thesis deposit

Help with deposit.

Repository help

For repository help contact: Digital-Repository@st-andrews.ac.uk.

Give Feedback

Cookie policy

This site may use cookies. Please see Terms and Conditions.

Usage statistics

COUNTER-compliant statistics on downloads from the repository are available from the IRUS-UK Service. Contact us for information.

© University of St Andrews Library

University of St Andrews is a charity registered in Scotland, No SC013532.

  • Facebook
  • Twitter