Show simple item record

Files in this item

Thumbnail

Item metadata

dc.contributor.authorArandelovic, Ognjen
dc.date.accessioned2016-10-15T23:34:32Z
dc.date.available2016-10-15T23:34:32Z
dc.date.issued2016-10-01
dc.identifier.citationArandelovic , O 2016 , Weighted linear fusion of multimodal data - a reasonable baseline? in Proceedings of the 2016 ACM on Multimedia Conference . ACM , New York , pp. 851-857 , 24th ACM International Conference on Multimedia (MM) , Amsterdam , Netherlands , 15/10/16 . https://doi.org/10.1145/2964284.2964304en
dc.identifier.citationconferenceen
dc.identifier.isbn9781450336031
dc.identifier.otherPURE: 245213827
dc.identifier.otherPURE UUID: 98e9011a-374a-4381-97c6-ddfb73310318
dc.identifier.otherScopus: 84994634206
dc.identifier.otherWOS: 000387733800086
dc.identifier.urihttps://hdl.handle.net/10023/9669
dc.description.abstractThe ever-increasing demand for reliable inference capable of handling unpredictable challenges of practical application in the real world, has made research on information fusion of major importance. There are few fields of application and research where this is more evident than in the sphere of multimedia which by its very nature inherently involves the use of multiple modalities, be it for learning, prediction, or human-computer interaction, say. In the development of the most common type, score-level fusion algorithms,it is virtually without an exception desirable to have as a reference starting point a simple and universally sound baseline benchmark which newly developed approaches can be compared to. One of the most pervasively used methods is that of weighted linear fusion.It has cemented itself as the default off-the-shelf baseline owing to its simplicity of implementation, interpretability, and surprisingly competitive performance across a wide range of application domains and information source types. In this paper I argue that despite this track record, weighted linear fusion is not a good baseline on the grounds that there is an equally simple and interpretable alternative – namely quadratic mean-based fusion – which is theoretically more principled and which is more successful in practice. I argue the former from first principles and demonstrate the latter using a series of experiments on a diverse set of fusion problems: computer vision-based object recognition, arrhythmia detection, and fatality prediction in motor vehicle accidents.
dc.language.isoeng
dc.publisherACM
dc.relation.ispartofProceedings of the 2016 ACM on Multimedia Conferenceen
dc.rights© 2016, the owner/author(s). Publication rights licensed to ACM. This work is made available online in accordance with the publisher’s policies. This is the author created, accepted version manuscript following peer review and may differ slightly from the final published version. The final published version of this work is available at dl.acm.org / https://dx.doi.org/10.1145/2964284.2964304en
dc.subjectArrhythmiaen
dc.subjectObject recognitionen
dc.subjectComputer visionen
dc.subjectCar accidenten
dc.subjectQA75 Electronic computers. Computer scienceen
dc.subjectRA0421 Public health. Hygiene. Preventive Medicineen
dc.subjectA General Worksen
dc.subjectNDASen
dc.subjectSDG 3 - Good Health and Well-beingen
dc.subject.lccQA75en
dc.subject.lccRA0421en
dc.subject.lccAen
dc.titleWeighted linear fusion of multimodal data - a reasonable baseline?en
dc.typeConference itemen
dc.description.versionPostprinten
dc.contributor.institutionUniversity of St Andrews. School of Computer Scienceen
dc.identifier.doihttps://doi.org/10.1145/2964284.2964304
dc.date.embargoedUntil2016-10-15


This item appears in the following Collection(s)

Show simple item record