Files in this item
VisuaLizations As Intermediate Representations (VLAIR) : an approach for applying deep learning-based computer vision to non-image-based data
Item metadata
dc.contributor.author | Jiang, Ai | |
dc.contributor.author | Nacenta, Miguel A. | |
dc.contributor.author | Ye, Juan | |
dc.date.accessioned | 2022-10-11T10:30:02Z | |
dc.date.available | 2022-10-11T10:30:02Z | |
dc.date.issued | 2022-09-24 | |
dc.identifier | 279836022 | |
dc.identifier | 4cdf82f7-c432-4fb0-bc3e-19e248d183b3 | |
dc.identifier | 85144345067 | |
dc.identifier | 000888899400004 | |
dc.identifier.citation | Jiang , A , Nacenta , M A & Ye , J 2022 , ' VisuaLizations As Intermediate Representations (VLAIR) : an approach for applying deep learning-based computer vision to non-image-based data ' , Visual Informatics , vol. 6 , no. 3 , pp. 35-50 . https://doi.org/10.1016/j.visinf.2022.05.001 | en |
dc.identifier.issn | 2468-502X | |
dc.identifier.other | RIS: urn:353DE926F80691DA9AB5C1A6127AE563 | |
dc.identifier.other | ORCID: /0000-0002-2838-6836/work/120849895 | |
dc.identifier.uri | https://hdl.handle.net/10023/26172 | |
dc.description | Funding: We thank the China Scholarship Council (CSC) for financially supporting my PhD study at University of St Andrews, UK, and NSERC Discovery Grant 2020-04401 (Miguel Nacenta). | en |
dc.description.abstract | Deep learning algorithms increasingly support automated systems in areas such as human activity recognition and purchase recommendation. We identify a current trend in which data is transformed first into abstract visualizations and then processed by a computer vision deep learning pipeline. We call this VisuaLization As Intermediate Representation (VLAIR) and believe that it can be instrumental to support accurate recognition in a number of fields while also enhancing humans’ ability to interpret deep learning models for debugging purposes or in personal use. In this paper we describe the potential advantages of this approach and explore various visualization mappings and deep learning architectures. We evaluate several VLAIR alternatives for a specific problem (human activity recognition in an apartment) and show that VLAIR attains classification accuracy above classical machine learning algorithms and several other non-image-based deep learning algorithms with several data representations. | |
dc.format.extent | 16 | |
dc.format.extent | 3075265 | |
dc.language.iso | eng | |
dc.relation.ispartof | Visual Informatics | en |
dc.subject | Information visualization | en |
dc.subject | Convolutional neural networks | en |
dc.subject | Human activity recognition | en |
dc.subject | Smart homes | en |
dc.subject | Data representation | en |
dc.subject | Intermediate representations | en |
dc.subject | Interpretability | en |
dc.subject | Machine learning | en |
dc.subject | Deep learning | en |
dc.subject | QA75 Electronic computers. Computer science | en |
dc.subject | 3rd-NDAS | en |
dc.subject | MCC | en |
dc.subject.lcc | QA75 | en |
dc.title | VisuaLizations As Intermediate Representations (VLAIR) : an approach for applying deep learning-based computer vision to non-image-based data | en |
dc.type | Journal article | en |
dc.contributor.institution | University of St Andrews. School of Computer Science | en |
dc.identifier.doi | 10.1016/j.visinf.2022.05.001 | |
dc.description.status | Peer reviewed | en |
This item appears in the following Collection(s)
Items in the St Andrews Research Repository are protected by copyright, with all rights reserved, unless otherwise indicated.