Analysing the history of autism spectrum disorder using topic models

Beykikhoshk, Adham; Phung, Dinh; Arandelovic, Ognjen; Venkatesh, Svetha

Files in this item

Name: Beykikhoshk_ASD_DSAA2016_AAM.pdf
Size: 2.510Mb
Format: PDF

Item metadata

dc.contributor.author	Beykikhoshk, Adham
dc.contributor.author	Phung, Dinh
dc.contributor.author	Arandelovic, Ognjen
dc.contributor.author	Venkatesh, Svetha
dc.date.accessioned	2016-11-21T10:30:16Z
dc.date.available	2016-11-21T10:30:16Z
dc.date.issued	2016-10-17
dc.identifier	247575259
dc.identifier	8cb15654-c201-4f97-a5d4-c315a3b60ecd
dc.identifier	85011290105
dc.identifier	000391583800080
dc.identifier.citation	Beykikhoshk, A, Phung, D, Arandelovic, O & Venkatesh, S 2016, Analysing the history of autism spectrum disorder using topic models. in 2016 IEEE International Conference on Data Science and Advanced Analytics (DSAA'2016), 7796964, IEEE, pp. 762-771, 3rd IEEE International Conference on Data Science and Analytics, Montreal, Canada, 17/10/16. https://doi.org/10.1109/DSAA.2016.65	en
dc.identifier.citation	conference	en
dc.identifier.isbn	9781509052066
dc.identifier.uri	https://hdl.handle.net/10023/9855
dc.description.abstract	We describe a novel framework for the discovery of the underlying topics of a longitudinal collection of scholarly data, and the tracking of their lifetime and popularity over time. Unlike social media or news data, scientific literature evolves through topic nuances that give rise to new scientific directions, so modelling longitudinal literature data calls for topics which remain identifiable over the course of time. Current studies either fix the topics over time, in which case they disregard the time dimension or treat it as an exchangeable covariate, or model time naturally but do not share the topics across epochs. We address these issues by adopting a non-parametric Bayesian approach. We assume the data is partially exchangeable and divide it into consecutive epochs. Then, by fixing the topics in a recurrent Chinese restaurant franchise, we impose a static topical structure on the corpus such that the topics are shared across epochs and across the documents within epochs. We demonstrate the effectiveness of the proposed framework on a collection of medical literature related to autism spectrum disorder. We collect a large corpus of publications and carefully examine two important research issues of the domain as case studies. Moreover, we make the results of our experiment and the source code of the model freely available to aid other researchers in analysing the results or applying the model to their own data collections.
dc.format.extent	2632950
dc.language.iso	eng
dc.publisher	IEEE
dc.relation.ispartof	2016 IEEE International Conference on Data Science and Advanced Analytics (DSAA'2016)	en
dc.subject	Bayesian nonparametrics	en
dc.subject	Data mining	en
dc.subject	Autism spectrum disorder	en
dc.subject	QA75 Electronic computers. Computer science	en
dc.subject	QH301 Biology	en
dc.subject	RC0321 Neuroscience. Biological psychiatry. Neuropsychiatry	en
dc.subject	NDAS	en
dc.subject.lcc	QA75	en
dc.subject.lcc	QH301	en
dc.subject.lcc	RC0321	en
dc.title	Analysing the history of autism spectrum disorder using topic models	en
dc.type	Conference item	en
dc.contributor.institution	University of St Andrews. School of Computer Science	en
dc.identifier.doi	https://doi.org/10.1109/DSAA.2016.65

This item appears in the following Collection(s)

University of St Andrews Research
