This site requires Cookies enabled in your browser for login.
Updating ...
WaterNet Home
WaterNet
for
pour le
Canada
Menu
WaterNet
Home
GWFO
Home
Master
List
Data
Centre
Collections
X
Defaults
Select All
Websites
X
Global Water Futures Observatories (GWFO) Global Water Futures (GWF) Global Institute for Water Security (GIWS) International Network of Alpine Research Catchment Hydrology
Legacy Research Programs
X
Changing Cold Regions Network (CCRN) Drought Research Initiative (DRI) International Network of Alpine Research Catchment Hydrology (Legacy Site) Improving Processes & Parameterization for Prediction in Cold Regions Hydrology (IP3) The Mackenzie Global Energy and Water Cycle Experiment (GEWEX) Study (MAGS)
Legacy sites
Map
Utilities
X
Account Settings Metadata Editor Record List Alias List Editor
Data Centre
Data Type Editor
. . .
X
Clear
Select All
Advanced Search
Go to Top⇡
Related items loading ...
Fetching Chart ...
Publication Additional Information Download
Publication Type
Journal Article
Authorship
Klenk, K., Spiteri, R.J.
Title
Improving resource utilization and fault tolerance in large simulations via actors
Year
2024
Publication Outlet
Cluster Computing
DOI
https://doi.org/10.1007/s10586-024-04318-5
Citation
Klenk, K., Spiteri, R.J. (2024) Improving resource utilization and fault tolerance in large simulations via actors, Cluster Computing, https://doi.org/10.1007/s10586-024-04318-5
Abstract
Large simulations with many independent sub-simulations are common in scientific computing. There are numerous challenges, however, associated with performing such simulations in shared computing environments. For example, sub-simulations may have wildly varying completion times or not complete at all, leading to unpredictable runtimes as well as unbalanced and inefficient use of human and computational resources. In this study, we use the actor model of concurrent computation to improve both the resource utilization and fault tolerance for large-scale scientific computing simulations. More specifically, we use actors in the SUMMA model to manage a large-scale hydrological simulation over the North American continent with over 500,000 independent sub-simulations. We find that the actors implementation outperforms a standard array job submission as well as the job submission tool GNU Parallel by better balancing the computational load across processors. The actors implementation also improves fault tolerance and can eliminate the user intervention required to detect and re-submit failed jobs.
Program Affiliations
GWF: Global Water Futures
Project Affiliations
GWF-CS: Computer Science
Publication Stage
Published
© 2026 - WaterNet Version 2026-06-10
Global Water Futures Observatories
Powered by
G W F Net
T-2024-09-25-X1JbTMvqA4UODgSDOG4X1NKw Publication 1.0