Files in this item
frances : cloud-based historical text mining with deep learning and parallel processing
Item metadata
dc.contributor.author | Yu, Lilin | |
dc.contributor.author | Charlton, Ash | |
dc.contributor.author | Askins, Wilfrid | |
dc.contributor.author | Terras, Melissa | |
dc.contributor.author | Filgueira, Rosa | |
dc.contributor.editor | Papadopoulos, George Angelos | |
dc.contributor.editor | Filgueira, Rosa | |
dc.contributor.editor | Da Silva, Rafael Ferreira | |
dc.date.accessioned | 2023-11-09T17:30:01Z | |
dc.date.available | 2023-11-09T17:30:01Z | |
dc.date.issued | 2023-09-25 | |
dc.identifier | 290676299 | |
dc.identifier | ff68aa51-20f4-4e62-ac65-de6c1b8df20a | |
dc.identifier | 85174221936 | |
dc.identifier.citation | Yu , L , Charlton , A , Askins , W , Terras , M & Filgueira , R 2023 , frances : cloud-based historical text mining with deep learning and parallel processing . in G A Papadopoulos , R Filgueira & R F Da Silva (eds) , Proceedings : 2023 IEEE 19th international conference on e-science (e-science) . IEEE international conference on e-science , IEEE , Piscataway, NJ , 19th IEEE International Conference on eScience , Limassol , Cyprus , 9/10/23 . https://doi.org/10.1109/e-Science58273.2023.10254798 | en |
dc.identifier.citation | conference | en |
dc.identifier.isbn | 9798350322248 | |
dc.identifier.isbn | 9798350322231 | |
dc.identifier.issn | 2325-372X | |
dc.identifier.uri | https://hdl.handle.net/10023/28651 | |
dc.description.abstract | frances is an advanced cloud-based text mining digital platform that leverages information extraction, knowledge graphs, natural language processing (NLP), deep learning, and parallel processing techniques. It has been specifically designed to unlock the full potential of historical digital textual collections, such as those from the National Library of Scotland, offering cloud-based capabilities and extended support for complex NLP analyses and data visualizations. frances enables realtime recurrent operational text mining and provides robust capabilities for temporal analysis, accompanied by automatic visualizations for easy result inspection. In this paper, we present the motivation behind the development of frances, emphasizing its innovative design and novel implementation aspects. We also outline future development directions. Additionally, we evaluate the platform through two comprehensive case studies in history and publishing history. Feedback from participants in these studies demonstrates that frances accelerates their work and facilitates rapid testing and dissemination of ideas. | |
dc.format.extent | 10 | |
dc.format.extent | 3657425 | |
dc.language.iso | eng | |
dc.publisher | IEEE | |
dc.relation.ispartof | Proceedings | en |
dc.relation.ispartofseries | IEEE international conference on e-science | en |
dc.subject | Digitised historical collections | en |
dc.subject | Information extraction | en |
dc.subject | Apache Spark | en |
dc.subject | Parallel processing | en |
dc.subject | Text mining | en |
dc.subject | Cloud-based platform | en |
dc.subject | Knowledge graphs | en |
dc.subject | Natural language processing | en |
dc.subject | QA75 Electronic computers. Computer science | en |
dc.subject | QA76 Computer software | en |
dc.subject | ZA4050 Electronic information resources | en |
dc.subject | 3rd-DAS | en |
dc.subject | MCC | en |
dc.subject.lcc | QA75 | en |
dc.subject.lcc | QA76 | en |
dc.subject.lcc | ZA4050 | en |
dc.title | frances : cloud-based historical text mining with deep learning and parallel processing | en |
dc.type | Conference item | en |
dc.contributor.institution | University of St Andrews. School of Computer Science | en |
dc.identifier.doi | https://doi.org/10.1109/e-Science58273.2023.10254798 | |
dc.date.embargoedUntil | 2023-09-25 | |
dc.identifier.url | https://doi.org/10.1109/e-Science58273.2023 | en |
This item appears in the following Collection(s)
Items in the St Andrews Research Repository are protected by copyright, with all rights reserved, unless otherwise indicated.