Investigating multisensory integration in emotion recognition through bio-inspired computational models

Mansouri Benssassi, Esma; Ye, Juan

Show simple item record

Files in this item

Name:: Mansouri_Benssassi_2021_Investigating_multisensory_IEEETAC_ESMA_2020_AAM.pdf
Size:: 2.457Mb
Format:: PDF

View/Open

Item metadata

dc.contributor.author	Mansouri Benssassi, Esma
dc.contributor.author	Ye, Juan
dc.date.accessioned	2021-09-24T15:30:08Z
dc.date.available	2021-09-24T15:30:08Z
dc.date.issued	2021-08-19
dc.identifier	275505623
dc.identifier	6077ae2a-456e-43c8-aa10-ea550f1f0144
dc.identifier	85113329778
dc.identifier.citation	Mansouri Benssassi , E & Ye , J 2021 , ' Investigating multisensory integration in emotion recognition through bio-inspired computational models ' , IEEE Transactions on Affective Computing , vol. Early Access . https://doi.org/10.1109/TAFFC.2021.3106254	en
dc.identifier.issn	1949-3045
dc.identifier.other	ORCID: /0000-0002-2838-6836/work/100549554
dc.identifier.uri	https://hdl.handle.net/10023/24022
dc.description.abstract	Emotion understanding represents a core aspect of human communication. Our social behaviours are closely linked to expressing our emotions and understanding others emotional and mental states through social signals. The majority of the existing work proceeds by extracting meaningful features from each modality and applying fusion techniques either at a feature level or decision level. However, these techniques are incapable of translating the constant talk and feedback between different modalities. Such constant talk is particularly important in continuous emotion recognition, where one modality can predict, enhance and complement the other. This paper proposes three multisensory integration models, based on different pathways of multisensory integration in the brain; that is, integration by convergence, early cross-modal enhancement, and integration through neural synchrony. The proposed models are designed and implemented using third-generation neural networks, Spiking Neural Networks (SNN). The models are evaluated using widely adopted, third-party datasets and compared to state-of-the-art multimodal fusion techniques, such as early, late and deep learning fusion. Evaluation results show that the three proposed models have achieved comparable results to the state-of-the-art supervised learning techniques. More importantly, this paper demonstrates plausible ways to translate constant talk between modalities during the training phase, which also brings advantages in generalisation and robustness to noise.
dc.format.extent	13
dc.format.extent	2576405
dc.language.iso	eng
dc.relation.ispartof	IEEE Transactions on Affective Computing	en
dc.subject	Spiking neural network	en
dc.subject	Multisensory integration	en
dc.subject	Emotion recognition	en
dc.subject	Neural synchrony	en
dc.subject	Graph neural network	en
dc.subject	QA75 Electronic computers. Computer science	en
dc.subject	QH301 Biology	en
dc.subject	3rd-DAS	en
dc.subject.lcc	QA75	en
dc.subject.lcc	QH301	en
dc.title	Investigating multisensory integration in emotion recognition through bio-inspired computational models	en
dc.type	Journal article	en
dc.contributor.institution	University of St Andrews. School of Computer Science	en
dc.identifier.doi	https://doi.org/10.1109/TAFFC.2021.3106254
dc.description.status	Peer reviewed	en

This item appears in the following Collection(s)

University of St Andrews Research

Show simple item record