VisuaLizations As Intermediate Representations (VLAIR) : an approach for applying deep learning-based computer vision to non-image-based data

Jiang, Ai; Nacenta, Miguel A.; Ye, Juan

Show simple item record

Files in this item

Name:: Jiang_2022_VI_VLAIR_CC.pdf
Size:: 2.932Mb
Format:: PDF

View/Open

Item metadata

dc.contributor.author	Jiang, Ai
dc.contributor.author	Nacenta, Miguel A.
dc.contributor.author	Ye, Juan
dc.date.accessioned	2022-10-11T10:30:02Z
dc.date.available	2022-10-11T10:30:02Z
dc.date.issued	2022-09-24
dc.identifier	279836022
dc.identifier	4cdf82f7-c432-4fb0-bc3e-19e248d183b3
dc.identifier	85144345067
dc.identifier	000888899400004
dc.identifier.citation	Jiang , A , Nacenta , M A & Ye , J 2022 , ' VisuaLizations As Intermediate Representations (VLAIR) : an approach for applying deep learning-based computer vision to non-image-based data ' , Visual Informatics , vol. 6 , no. 3 , pp. 35-50 . https://doi.org/10.1016/j.visinf.2022.05.001	en
dc.identifier.issn	2468-502X
dc.identifier.other	RIS: urn:353DE926F80691DA9AB5C1A6127AE563
dc.identifier.other	ORCID: /0000-0002-2838-6836/work/120849895
dc.identifier.uri	https://hdl.handle.net/10023/26172
dc.description	Funding: We thank the China Scholarship Council (CSC) for financially supporting my PhD study at University of St Andrews, UK, and NSERC Discovery Grant 2020-04401 (Miguel Nacenta).	en
dc.description.abstract	Deep learning algorithms increasingly support automated systems in areas such as human activity recognition and purchase recommendation. We identify a current trend in which data is transformed first into abstract visualizations and then processed by a computer vision deep learning pipeline. We call this VisuaLization As Intermediate Representation (VLAIR) and believe that it can be instrumental to support accurate recognition in a number of fields while also enhancing humans’ ability to interpret deep learning models for debugging purposes or in personal use. In this paper we describe the potential advantages of this approach and explore various visualization mappings and deep learning architectures. We evaluate several VLAIR alternatives for a specific problem (human activity recognition in an apartment) and show that VLAIR attains classification accuracy above classical machine learning algorithms and several other non-image-based deep learning algorithms with several data representations.
dc.format.extent	16
dc.format.extent	3075265
dc.language.iso	eng
dc.relation.ispartof	Visual Informatics	en
dc.subject	Information visualization	en
dc.subject	Convolutional neural networks	en
dc.subject	Human activity recognition	en
dc.subject	Smart homes	en
dc.subject	Data representation	en
dc.subject	Intermediate representations	en
dc.subject	Interpretability	en
dc.subject	Machine learning	en
dc.subject	Deep learning	en
dc.subject	QA75 Electronic computers. Computer science	en
dc.subject	3rd-NDAS	en
dc.subject	MCC	en
dc.subject.lcc	QA75	en
dc.title	VisuaLizations As Intermediate Representations (VLAIR) : an approach for applying deep learning-based computer vision to non-image-based data	en
dc.type	Journal article	en
dc.contributor.institution	University of St Andrews. School of Computer Science	en
dc.identifier.doi	https://doi.org/10.1016/j.visinf.2022.05.001
dc.description.status	Peer reviewed	en

This item appears in the following Collection(s)

University of St Andrews Research

Show simple item record