Optimization towards efficiency and stateful of dispel4py
Abstract
Scientific workflows bridge scientific challenges with computational resources. While dispel4py, a stream-based workflow system, offers mappings to parallel enactment engines like MPI or Multiprocessing, its optimization primarily focuses on dynamic process-to-task allocation for improved performance. An efficiency gap persists, particularly with the growing emphasis on conserving computing resources. Moreover, the existing dynamic optimization lacks support for stateful applications and grouping operations. To address these issues, our work introduces a novel hybrid approach for handling stateful operations and groupings within workflows, leveraging a new Redis mapping. We also propose an auto-scaling mechanism integrated into dispel4py’s dynamic optimization. Our experiments showcase the effectiveness of auto-scaling optimization, achieving efficiency while upholding performance. In the best case, auto-scaling reduces dispel4py’s runtime to 87% compared to the baseline, using only 76% of process resources. Importantly, our optimized stateful dispel4py demonstrates a remarkable speedup, utilizing just 32% of the runtime compared to the contender. To address these issues, our work introduces a novel hybrid approach for handling stateful operations and groupings within workflows, leveraging a new Redis mapping. We also propose an auto-scaling mechanism integrated into dispel4py’s dynamic optimization. Our experiments showcase the effectiveness of autoscaling optimization, achieving efficiency while upholding performance. In the best case, auto-scaling reduces dispel4py’s runtime to 87% compared to the baseline, using only 76% of process resources. Importantly, our optimized stateful dispel4py demonstrates a remarkable speedup, utilizing just 32% of the runtime compared to the contender.
Citation
Liang , L , Zhang , H , Yang , G , Heinis , T & Filgueira , R 2023 , Optimization towards efficiency and stateful of dispel4py . in Proceedings of the SC '23 workshops of the international conference on high performance computing, network, storage, and analysis (SC-W '23) : Nov 12-17, 2023 | Denver, CO . ACM , pp. 2021–2032 , 18th Workshop on Workflows in Support of Large-Scale Science (WORKS 2023) , Denver , Colorado , United States , 12/11/23 . https://doi.org/10.1145/3624062.3624281 conference
Publication
Proceedings of the SC '23 workshops of the international conference on high performance computing, network, storage, and analysis (SC-W '23)
Type
Conference item
Collections
Items in the St Andrews Research Repository are protected by copyright, with all rights reserved, unless otherwise indicated.