Show simple item record

Files in this item

Thumbnail

Item metadata

dc.contributor.authorZhang, Liangfei
dc.contributor.authorHong, Xiaopeng
dc.contributor.authorArandjelovic, Ognjen
dc.contributor.authorZhao, Guoying
dc.date.accessioned2022-10-31T13:30:10Z
dc.date.available2022-10-31T13:30:10Z
dc.date.issued2022-10-10
dc.identifier281847986
dc.identifiere02f4ca1-1d10-4914-94d9-253f72cf15a7
dc.identifier85139817909
dc.identifier000892948500022
dc.identifier.citationZhang , L , Hong , X , Arandjelovic , O & Zhao , G 2022 , ' Short and long range relation based spatio-temporal transformer for micro-expression recognition ' , IEEE Transactions on Affective Computing , vol. 13 , no. 4 , pp. 1973-1985 . https://doi.org/10.1109/TAFFC.2022.3213509en
dc.identifier.issn1949-3045
dc.identifier.urihttps://hdl.handle.net/10023/26276
dc.descriptionFunding: The authors would like to thank the China Scholarship Council – University of St Andrews Scholarships (No.201908060250) funds L. Zhang for her PhD. This work is funded by the National Key Research and Development Project of China under Grant No. 2019YFB1312000, the National Natural Science Foundation of China under Grant No. 62076195, and the Fundamental Research Funds for the Central Universities under Grant No. AUGA5710011522.en
dc.description.abstractBeing spontaneous, micro-expressions are useful in the inference of a person's true emotions even if an attempt is made to conceal them. Due to their short duration and low intensity, the recognition of micro-expressions is a difficult task in affective computing. The early work based on handcrafted spatio-temporal features which showed some promise, has recently been superseded by different deep learning approaches which now compete for the state of the art performance. Nevertheless, the problem of capturing both local and global spatio-temporal patterns remains challenging. To this end, herein we propose a novel spatio-temporal transformer architecture – to the best of our knowledge, the first purely transformer based approach (i.e. void of any convolutional network use) for micro-expression recognition. The architecture comprises a spatial encoder which learns spatial patterns, a temporal aggregator for temporal dimension analysis, and a classification head. A comprehensive evaluation on three widely used spontaneous micro-expression data sets, namely SMIC-HS, CASME II and SAMM, shows that the proposed approach consistently outperforms the state of the art, and is the first framework in the published literature on micro-expression recognition to achieve the unweighted F1-score greater than 0.9 on any of the aforementioned data sets.
dc.format.extent13
dc.format.extent3817479
dc.language.isoeng
dc.relation.ispartofIEEE Transactions on Affective Computingen
dc.subjectEmotion recognitionen
dc.subjectLong-term optical flowen
dc.subjectSelf-attention mechanismen
dc.subjectTemporal aggregatoren
dc.subjectQA75 Electronic computers. Computer scienceen
dc.subjectHuman-Computer Interactionen
dc.subjectSoftwareen
dc.subject3rd-DASen
dc.subjectACen
dc.subjectMCCen
dc.subjectNCADen
dc.subject.lccQA75en
dc.titleShort and long range relation based spatio-temporal transformer for micro-expression recognitionen
dc.typeJournal articleen
dc.contributor.institutionUniversity of St Andrews. School of Computer Scienceen
dc.contributor.institutionUniversity of St Andrews. School of Chemistryen
dc.identifier.doi10.1109/TAFFC.2022.3213509
dc.description.statusPeer revieweden


This item appears in the following Collection(s)

Show simple item record