Files in this item
Language transfer for early warning of epidemics from social media
Item metadata
dc.contributor.author | Appelgren, Mattias | |
dc.contributor.author | Schrempf, Patrick | |
dc.contributor.author | Falis, Matúš | |
dc.contributor.author | Ikeda, Satoshi | |
dc.contributor.author | O'Neil, Alison Q | |
dc.date.accessioned | 2019-12-20T11:30:05Z | |
dc.date.available | 2019-12-20T11:30:05Z | |
dc.date.issued | 2019-12-13 | |
dc.identifier.citation | Appelgren , M , Schrempf , P , Falis , M , Ikeda , S & O'Neil , A Q 2019 , ' Language transfer for early warning of epidemics from social media ' , Paper presented at Artificial Intelligence for Humanitarian Assistance and Disaster Response , Vancouver , Canada , 13/12/19 . < https://arxiv.org/abs/1910.04519 > | en |
dc.identifier.citation | workshop | en |
dc.identifier.other | PURE: 264577925 | |
dc.identifier.other | PURE UUID: 7ae25ad5-ac81-496c-851f-2c10239b29cb | |
dc.identifier.other | ArXiv: http://arxiv.org/abs/1910.04519v1 | |
dc.identifier.other | ORCID: /0000-0003-2484-6855/work/66398460 | |
dc.identifier.uri | https://hdl.handle.net/10023/19177 | |
dc.description.abstract | Statements on social media can be analysed to identify individuals who are experiencing red flag medical symptoms, allowing early detection of the spread of disease such as influenza. Since disease does not respect cultural borders and may spread between populations speaking different languages, we would like to build multilingual models. However, the data required to train models for every language may be difficult, expensive and time-consuming to obtain, particularly for low-resource languages. Taking Japanese as our target language, we explore methods by which data in one language might be used to build models for a different language. We evaluate strategies of training on machine translated data and of zero-shot transfer through the use of multilingual models. We find that the choice of source language impacts the performance, with Chinese-Japanese being a better language pair than English-Japanese. Training on machine translated data shows promise, especially when used in conjunction with a small amount of target language data. | |
dc.format.extent | 6 | |
dc.language.iso | eng | |
dc.rights | Copyright © 2019 the Author(s). This work has been made available online in accordance with publisher policies or with permission. Permission for further reuse of this content should be sought from the publisher or the rights holder. This is the author created accepted manuscript following peer review and may differ slightly from the final published version. The final published version of this work is available at https://arxiv.org/abs/1910.04519 | en |
dc.subject | QA75 Electronic computers. Computer science | en |
dc.subject | PI Oriental languages and literatures | en |
dc.subject | RA0421 Public health. Hygiene. Preventive Medicine | en |
dc.subject | SDG 3 - Good Health and Well-being | en |
dc.subject.lcc | QA75 | en |
dc.subject.lcc | PI | en |
dc.subject.lcc | RA0421 | en |
dc.title | Language transfer for early warning of epidemics from social media | en |
dc.type | Conference paper | en |
dc.description.version | Postprint | en |
dc.contributor.institution | University of St Andrews. School of Computer Science | en |
dc.description.status | Peer reviewed | en |
dc.identifier.url | https://arxiv.org/abs/1910.04519 | en |
This item appears in the following Collection(s)
Items in the St Andrews Research Repository are protected by copyright, with all rights reserved, unless otherwise indicated.