Show simple item record

Files in this item


Item metadata

dc.contributor.authorBeykikhoshk, Adham
dc.contributor.authorPhung, Dinh
dc.contributor.authorArandelovic, Ognjen
dc.contributor.authorVenkatesh, Svetha
dc.identifier.citationBeykikhoshk , A , Phung , D , Arandelovic , O & Venkatesh , S 2016 , Analysing the history of autism spectrum disorder using topic models . in 2016 IEEE International Conference on Data Science and Advanced Analytics (DSAA'2016) . , 7796964 , IEEE , pp. 762-771 , 3rd IEEE International Conference on Data Science and Analytics , Montreal , Canada , 17/10/16 .
dc.identifier.otherPURE: 247575259
dc.identifier.otherPURE UUID: 8cb15654-c201-4f97-a5d4-c315a3b60ecd
dc.identifier.otherScopus: 85011290105
dc.identifier.otherWOS: 000391583800080
dc.description.abstractWe describe a novel framework for the discovery of underlying topics of a longitudinal collection of scholarly data,and the tracking of their lifetime and popularity over time. Unlike the social media or news data, as the topic nuances in science result in new scientific directions to emerge, a new approach to model the longitudinal literature data is using topics which remain identifiable over the course of time. Current studies either disregard the time dimension or treat it as an exchangeable covariate when they fix the topics over time or do not share the topics over epochs when they model the time naturally. We address these issues by adopting a non-parametric Bayesian approach. We assume the data is partially exchangeable and divided it into consecutive epochs. Then, by fixing the topics in a recurrent Chinese restaurant franchise, we impose a static topical structure on the corpus such that the they are shared across epochs and the documents within epochs. We demonstrate the effectiveness of the proposed framework on a collection of medical literature related to autism spectrum disorder. We collect a large corpus of publications and carefully examining two important research issues of the domain as case studies. Moreover, we make the results of our experiment and the source code of the model, freely available to aid other researchers by analysing the results or applying the model to their data collections.
dc.relation.ispartof2016 IEEE International Conference on Data Science and Advanced Analytics (DSAA'2016)en
dc.rights© 2016, IEEE. This work has been made available online in accordance with the publisher’s policies. This is the author created, accepted version manuscript following peer review and may differ slightly from the final published version. The final published version of this work is available at /
dc.subjectBayesian nonparametricsen
dc.subjectData miningen
dc.subjectAutism spectrum disorderen
dc.subjectQA75 Electronic computers. Computer scienceen
dc.subjectQH301 Biologyen
dc.subjectRC0321 Neuroscience. Biological psychiatry. Neuropsychiatryen
dc.titleAnalysing the history of autism spectrum disorder using topic modelsen
dc.typeConference itemen
dc.contributor.institutionUniversity of St Andrews. School of Computer Scienceen

This item appears in the following Collection(s)

Show simple item record