Community Workflows to Advance Reproducibility in Hydrologic Modeling: Separating Model-Agnostic and Model-Specific Configuration Steps in Applications of Large-Domain Hydrologic Models
Section 1: Publication
Publication Type
Journal Article
Authorship
Knoben W.J.M., Clark M.P., Bales J., Bennett A., Gharari S., Marsh C.B., Nijssen B., Pietroniro A., Spiteri R.J., Tang G., Tarboton D.G., Wood A.W.
Title
Community Workflows to Advance Reproducibility in Hydrologic Modeling: Separating Model-Agnostic and Model-Specific Configuration Steps in Applications of Large-Domain Hydrologic Models
Year
2022
Publication Outlet
Water Resources Research, 58(11), e2021WR031753
DOI
ISBN
ISSN
Citation
Knoben W.J.M., Clark M.P., Bales J., Bennett A., Gharari S., Marsh C.B., Nijssen B., Pietroniro A., Spiteri R.J., Tang G., Tarboton D.G., Wood A.W. (2022). Community Workflows to Advance Reproducibility in Hydrologic Modeling: Separating Model-Agnostic and Model-Specific Configuration Steps in Applications of Large-Domain Hydrologic Models. Water Resources Research, 58(11), e2021WR031753
https://doi.org/10.1029/2021WR031753
Abstract
Despite the proliferation of computer-based research on hydrology and water resources, such research is typically poorly reproducible. Published studies have low reproducibility due to incomplete availability of data and computer code, and a lack of documentation of workflow processes. This leads to a lack of transparency and efficiency because existing code can neither be quality controlled nor reused. Given the commonalities between existing process-based hydrologic models in terms of their required input data and preprocessing steps, open sharing of code can lead to large efficiency gains for the modeling community. Here, we present a model configuration workflow that provides full reproducibility of the resulting model instantiations in a way that separates the model-agnostic preprocessing of specific data sets from the model-specific requirements that models impose on their input files. We use this workflow to create large-domain (global and continental) and local configurations of the Structure for Unifying Multiple Modeling Alternatives (SUMMA) hydrologic model connected to the mizuRoute routing model. These examples show how a relatively complex model setup over a large domain can be organized in a reproducible and structured way that has the potential to accelerate advances in hydrologic modeling for the community as a whole. We provide a tentative blueprint of how community modeling initiatives can be built on top of workflows such as this. We term our workflow the “Community Workflows to Advance Reproducibility in Hydrologic Modeling” (CWARHM; pronounced “swarm”).
Plain Language Summary
Key Points
-Reproducible, transparent modeling increases confidence in model simulations and requires careful tracking of all model configuration steps
-We show an example of model configuration code applied globally that is traced and shared through a version control system
-Standardizing file formats and sharing of code can increase efficiency and reproducibility of modeling studies