Show simple item record

Files in this item

Thumbnail

Item metadata

dc.contributor.authorLiang, Liang
dc.contributor.authorFilgueira, Rosa
dc.contributor.authorYan, Yan
dc.contributor.authorHeinis, Thomas
dc.date.accessioned2022-10-07T23:39:16Z
dc.date.available2022-10-07T23:39:16Z
dc.date.issued2022-03
dc.identifier277807121
dc.identifier2140963d-057b-441c-b7a2-3f48e5b5dd31
dc.identifier85118733552
dc.identifier.citationLiang , L , Filgueira , R , Yan , Y & Heinis , T 2022 , ' Scalable adaptive optimizations for stream-based workflows in multi-HPC-clusters and cloud infrastructures ' , Future Generation Computer Systems , vol. 128 , pp. 102-116 . https://doi.org/10.1016/j.future.2021.09.036en
dc.identifier.issn0167-739X
dc.identifier.otherBibtex: 73b0b90895dc48fea5d91646e69bc2d8
dc.identifier.urihttps://hdl.handle.net/10023/26161
dc.descriptionFunding: This work is partially supported by the EU H2020 project DARE, No. 777413; and by Google Cloud Platform research credits program.en
dc.description.abstractThis work presents three new adaptive optimization techniques to maximize the performance of dispel4py workflows. dispel4py is a parallel Python-based stream-oriented dataflow framework that acts as a bridge to existing parallel programming frameworks like MPI or Python multiprocessing. When a user runs a dispel4py workflow, the original framework performs a fixed workload distribution among the processes available for the run. This allocation does not take into account the features of the workflows, which can cause scalability issues, especially for data-intensive scientific workflows. Our aim, therefore, is to improve the performance of dispel4py workflows by testing different workload strategies that automatically adapt to workflows at runtime. For achieving this objective, we have implemented three new techniques, called Naive Assignment, Staging and Dynamic Scheduling. We have evaluated our proposal with several workflows from different domains and across different computing resources. The results show that our proposed techniques have significantly (up to 10X) improved the performance of the original dispel4py framework.
dc.format.extent15
dc.format.extent3059433
dc.language.isoeng
dc.relation.ispartofFuture Generation Computer Systemsen
dc.subjectScientific workflowen
dc.subjectStream-based workflowen
dc.subjectWorkflow optimizationen
dc.subjectdispel4pyen
dc.subjectDistributed systemsen
dc.subjectQA75 Electronic computers. Computer scienceen
dc.subjectQA76 Computer softwareen
dc.subjectIen
dc.subjectACen
dc.subject.lccQA75en
dc.subject.lccQA76en
dc.titleScalable adaptive optimizations for stream-based workflows in multi-HPC-clusters and cloud infrastructuresen
dc.typeJournal articleen
dc.contributor.institutionUniversity of St Andrews. School of Computer Scienceen
dc.identifier.doihttps://doi.org/10.1016/j.future.2021.09.036
dc.description.statusPeer revieweden
dc.date.embargoedUntil2022-10-08
dc.identifier.urlhttps://researchportal.hw.ac.uk/en/publications/scalable-adaptive-optimizations-for-stream-based-workflows-in-mulen


This item appears in the following Collection(s)

Show simple item record