Show simple item record

Files in this item

Thumbnail

Item metadata

dc.contributor.authorKuster Filipe Bowles, Juliana
dc.contributor.authorSilvina, Agastya
dc.contributor.authorBin, Eyal
dc.contributor.authorVinov, Michael
dc.contributor.editorGutiérrez Basulto, Victor
dc.contributor.editorKliegr, Tomáš
dc.contributor.editorSoylu, Ahmet
dc.contributor.editorGiese, Martin
dc.contributor.editorRoman, Dumitru
dc.date.accessioned2020-08-24T10:30:05Z
dc.date.available2020-08-24T10:30:05Z
dc.date.issued2020
dc.identifier268501077
dc.identifier2c293f3d-3502-4aca-9df1-a6d8f765dc52
dc.identifier85090100248
dc.identifier000723845700013
dc.identifier.citationKuster Filipe Bowles , J , Silvina , A , Bin , E & Vinov , M 2020 , On defining rules for cancer data fabrication . in V Gutiérrez Basulto , T Kliegr , A Soylu , M Giese & D Roman (eds) , Rules and Reasoning : 4th International Joint Conference, RuleML+RR 2020, Oslo, Norway, June 29–July 1, 2020, Proceedings . Lecture Notes in Computer Science (Programming and Software Engineering) , vol. 12173 LNCS , Springer , Cham , pp. 168-176 , 4th International Joint Conference on Rules and Reasoning (RCUL+RR 2020) , Oslo , Norway , 29/06/20 . https://doi.org/10.1007/978-3-030-57977-7_13en
dc.identifier.citationconferenceen
dc.identifier.isbn9783030579760
dc.identifier.isbn9783030579777
dc.identifier.issn0302-9743
dc.identifier.otherORCID: /0000-0002-5918-9114/work/79565092
dc.identifier.urihttps://hdl.handle.net/10023/20505
dc.descriptionFunding: This research is partially funded by the Data Lab, and the EU H2020 project Serums: Securing Medical Data in Smart Patient-Centric Healthcare Systems (grant 826278).en
dc.description.abstractData is essential for machine learning projects, and data accuracy is crucial for being able to trust the results obtained from the associated machine learning models. Previously, we have developed machine learning models for predicting the treatment outcome for breast cancer patients that have undergone chemotherapy, and developed a monitoring system for their treatment timeline showing interactively the options and associated predictions. Available cancer datasets, such as the one used earlier, are often too small to obtain significant results, and make it difficult to explore ways to improve the predictive capability of the models further. In this paper, we explore an alternative to enhance our datasets through synthetic data generation. From our original dataset, we extract rules to generate fabricated data that capture the different characteristics inherent in the dataset. Additional rules can be used to capture general medical knowledge. We show how to formulate rules for our cancer treatment data, and use the IBM solver to obtain a corresponding synthetic dataset. We discuss challenges for future work.
dc.format.extent9
dc.format.extent198999
dc.language.isoeng
dc.publisherSpringer
dc.relation.ispartofRules and Reasoningen
dc.relation.ispartofseriesLecture Notes in Computer Science (Programming and Software Engineering)en
dc.subjectCancer dataen
dc.subjectSynthetic dataen
dc.subjectConstraint solversen
dc.subjectFabrication rulesen
dc.subjectQA75 Electronic computers. Computer scienceen
dc.subjectRC0254 Neoplasms. Tumors. Oncology (including Cancer)en
dc.subject3rd-DASen
dc.subjectSDG 3 - Good Health and Well-beingen
dc.subject.lccQA75en
dc.subject.lccRC0254en
dc.titleOn defining rules for cancer data fabricationen
dc.typeConference itemen
dc.contributor.sponsorEuropean Commissionen
dc.contributor.institutionUniversity of St Andrews. School of Computer Scienceen
dc.identifier.doi10.1007/978-3-030-57977-7_13
dc.date.embargoedUntil2020-08-19
dc.identifier.grantnumberSEP-210512424en


This item appears in the following Collection(s)

Show simple item record