5
The Current State of Modeling by Randall J. Hunt 1 and Chunmiao Zheng 2,3 1 Corresponding author: U.S. Geological Survey, Wisconsin Water Science Center, Middleton, WI 53562; [email protected] 2 Department of Geological Sciences, University of Alabama, Tuscaloosa, AL 35487. 3 Center for Water Research, Peking University, Beijing, China. Any philosophy that in its quest for certainty ignores the reality of the uncertain in the ongoing processes of nature denies the conditions out of which it arises.—John Dewey, The Quest for Certainty, 1929 With great power comes great responsibility.—Voltaire Today’s groundwater modeling tools are character- ized by power, capability, and sophistication that was unthinkable even a few years ago. Boundary conditions (lakes, streams, wetlands) have advanced modules that allow more realistic depictions of the natural world. Mod- els can simulate multiple scales conjunctively (Mehl and Hill 2005; Hunt 2006), or combine different formula- tions of the underlying mathematical governing equations (Haitjema et al. 2010). The unsaturated zone can be rigor- ously included (Jones and Woodward 2001; Radcliffe and ˇ Sim˚ unek 2010), even on a large watershed scale using kinematic wave approximations (Niswonger et al. 2006). The ability to efficiently simulate unsaturated zone flow has been a critical advance to allow models to explic- itly simulate groundwater and surface water interactions at the watershed scale (Therrien et al. 2006; Markstrom et al. 2008). At the same time groundwater flow modeling has attained an advanced state, contaminant transport models have continued to grow to provide improved simulation capabilities for complex field conditions. These include simulation of transport processes in highly heterogeneous media based on the dual-domain mass transfer formula- tion (Zheng et al. 2011), accommodation of a full suite of biogeochemical reactions through fully coupled multi- component geochemical-transport models (Prommer et al. 2009; Ma et al. 2010; Parkhurst et al. 2010), and consid- eration of multiphase and multimedia processes (Pruess et al. 2011). Additionally, numerous efforts are ongoing to develop new or improved mathematical formulations that may deal more effectively with non-ideal contaminant transport behaviors arising from strong aquifer hetero- geneity (Berkowitz et al. 2002; Zhang and Benson 2008). One example of such heterogeneity by Bianchi et al. (2011) shows decimeter-scale preferential flow paths with a varying degree of connectivity within a small aquifer cube (Figure 1). Groundwater flow and contaminant transport model- ing have been greatly facilitated by rapid technological advances on two fronts: (1) the development of compact and large-memory field sensors that have yielded rich datasets for model calibration, (2) the exponential growth in computing power, which has substantially strengthened NGWA.org Vol. 50, No. 3 – GROUND WATER – May-June 2012 329

The Current State of Modeling

Embed Size (px)

Citation preview

Page 1: The Current State of Modeling

The Current State of Modeling

by Randall J. Hunt1 and Chunmiao Zheng2,3

1Corresponding author: U.S. Geological Survey, WisconsinWater Science Center, Middleton, WI 53562; [email protected] of Geological Sciences, University of Alabama,Tuscaloosa, AL 35487.3Center for Water Research, Peking University, Beijing, China.

Any philosophy that in its quest for certainty ignoresthe reality of the uncertain in the ongoing processesof nature denies the conditions out of which itarises.—John Dewey, The Quest for Certainty, 1929

With great power comes great responsibility.—Voltaire

Today’s groundwater modeling tools are character-ized by power, capability, and sophistication that wasunthinkable even a few years ago. Boundary conditions(lakes, streams, wetlands) have advanced modules thatallow more realistic depictions of the natural world. Mod-els can simulate multiple scales conjunctively (Mehl andHill 2005; Hunt 2006), or combine different formula-tions of the underlying mathematical governing equations(Haitjema et al. 2010). The unsaturated zone can be rigor-ously included (Jones and Woodward 2001; Radcliffe andSimunek 2010), even on a large watershed scale usingkinematic wave approximations (Niswonger et al. 2006).The ability to efficiently simulate unsaturated zone flowhas been a critical advance to allow models to explic-itly simulate groundwater and surface water interactionsat the watershed scale (Therrien et al. 2006; Markstromet al. 2008).

At the same time groundwater flow modeling hasattained an advanced state, contaminant transport modelshave continued to grow to provide improved simulationcapabilities for complex field conditions. These includesimulation of transport processes in highly heterogeneousmedia based on the dual-domain mass transfer formula-tion (Zheng et al. 2011), accommodation of a full suiteof biogeochemical reactions through fully coupled multi-component geochemical-transport models (Prommer et al.2009; Ma et al. 2010; Parkhurst et al. 2010), and consid-eration of multiphase and multimedia processes (Pruesset al. 2011). Additionally, numerous efforts are ongoingto develop new or improved mathematical formulationsthat may deal more effectively with non-ideal contaminanttransport behaviors arising from strong aquifer hetero-geneity (Berkowitz et al. 2002; Zhang and Benson 2008).One example of such heterogeneity by Bianchi et al.(2011) shows decimeter-scale preferential flow paths witha varying degree of connectivity within a small aquifercube (Figure 1).

Groundwater flow and contaminant transport model-ing have been greatly facilitated by rapid technologicaladvances on two fronts: (1) the development of compactand large-memory field sensors that have yielded richdatasets for model calibration, (2) the exponential growthin computing power, which has substantially strengthened

NGWA.org Vol. 50, No. 3–GROUND WATER–May-June 2012 329

Page 2: The Current State of Modeling

X

YZ

1.00.0

-1.0-2.0-3.0-4.0-5.0-6.0

TPROGSLnK (cm/s)

X(m)

-9-8

-7-6

-5-4

Y(m)

154

155

156157

158159

9

8

7

6

5

4

Dep

th(m

)

Figure 1. One realization of the hydraulic conductivity fieldfrom the MADE site described by Zheng et al. (2011). Thefield is generated using the geostatistical program TPROGSconditioned to hydraulic conductivity values estimated fromgrain-size data at 1741 points. Such detail in the subsurface isoften critical for transport model predictions (from Bianchiet al. 2011). This figure is reproduced/modified by permissionof the American Geophysical Union.

our ability to simulate and understand hydrologic pro-cesses and water quality evolution at unprecedented res-olution and detail across multiple spatial and tempo-ral scales. For example, Hammond and Lichtner (2010)

discussed the application of a massively parallelized codethat employed more than 130,000 CPU cores to simulatethe fate and transport of a uranium plume. Kollet et al.(2010) presented the results of a proof-of-concept studyof regional scale hydrologic simulations based on a max-imum number of approximately 8 × 109 grid cells. Thesestudies demonstrated that regional scale hydrologic andgeochemical simulations on the order of 103 km2 are fea-sible at fine spatial resolution (∼100 to 101 m laterally,10−2 to 10−1 m vertically) and an hourly time interval.

Calibration of flow and contaminant transport modelscan include thousands of parameters and hundreds of thou-sands of observations. As a result, parameter estimationapproaches have become sophisticated enough that multi-day conferences can be held on a single companion code(for example, http:/ /www.lulu.com / product / ebook / pest-conference-2009-proceedings/17376714). These largeparameter estimation problems can be solved using par-allel computing, either locally or distributed to satelliteoffices, or even using cloud computing (Hunt et al. 2010).Moreover, freely available software suites include tradi-tional nonlinear regression as well as advanced mathemat-ical formulations and global methods (e.g., Shuffle Com-plex Evolution, Covariance Matrix Adaptation Scheme).For example, technologies ported from other industriessuch as singular value decomposition (Figure 2) are allow-ing more robust and efficient mathematical formulationsof groundwater models. Graphical user interfaces now

Figure 2. An example of singular value decomposition of a photograph image. When the matrix is perfectly known (as is thecase with pixels in the original image), the highest resolution for a given number of singular values can be shown visually.For reference, the image with 20 singular values represents less than 10% of information contained in the original imagein the upper left, yet it contains enough information that the subject matter can be easily identified. Although informationof groundwater systems is not as well-known as that in this image, a similar concept applies: if too few singular values areselected, a needlessly coarse and blurry representation of the groundwater system results. Moreover, when the informationcontent of the calibration dataset is increased, a larger number of data-supported singular values can be included, resulting ina sharper “picture” of the groundwater system (from Doherty and Hunt, 2010; image from and SVD processing by MichaelN. Fienen, USGS).

330 Vol. 50, No. 3–GROUND WATER–May-June 2012 NGWA.org

Page 3: The Current State of Modeling

(a) (b)

(d)(c)

Figure 3. Groundwater model-derived recharge and discharge patterns visualized using a GIS post-processor. The figureshows: (a) raw model output; (b) results processed using a normalization algorithm; (c) a two-dimensional moving averagealgorithm; and (d) a sophisticated iterative low-pass filter algorithm (from Lin et al. 2009).

routinely include these advances, and with GeographicInformation Systems, a full-featured modeling environ-ment is available where a modeler can harness thisextensive capability using common tools and approaches(Figure 3). This, in turn, reduces learning curves and typo-graphic input errors. In short, the advances of the past haveprovided today’s modeler with a vast array of powerfulenvironmental simulators.

At the same time, philosophical limitations of modelshave become more widely understood. Early approachesof “as simple as possible” were supplanted by those focus-ing on true parsimony— not only as simple as possible,but also including the often overlooked corollary “butno simpler” (Hunt et al. 2007). Simply including thou-sands of parameters in a model does not mean that theresulting calibrated model must be highly heterogeneous(Fienen et al. 2009). Moreover, if a model is oversimpli-fied, salient details are omitted. These shortcomings colorand in some cases confound a model’s ability to representthe natural world, and they can even make estimations ofuncertainty uncertain (Moore and Doherty 2005). But, thenatural world is more complex than can ever be includedin a model, even if it were possible to characterize theentirety of that complexity in the field. Uncertainty ofthe model’s ability to capture natural world complexityimportant for a model prediction, as well as imperfectknowledge of what future salient stress regimes will be,are key drivers of future prediction.

Therefore, models can never be considered crystalballs predicting the future, no matter how well constructed(Doherty 2009). That does not mean that models do nothave utility today, however. The key is to recognizethat models are created for a specific objective or setof objectives, thus all models have a societal decision-making context that can elucidate what is appropriatesimplicity. That is, a model need not reflect all aspectsof the natural world perfectly; rather, it must encapsulate

enough of the natural world to be useful for thespecific question being asked or the specific purposefor which the model was designed (NRC 2007). In thisway model expectations have evolved from constructing“true” models of the incomprehensibly complex naturalworld to “adequate” models, a view that reflects theunknowable aspects inherent to models as well as the factthat modeling endeavors do not have unlimited funding(Haitjema 2011).

In many ways, current modeling has moved beyonda focus on computer codes and enhancing code tobetter reflect the natural world (indeed, unlike the past,a master’s level graduate student will likely never becalled on to write their own groundwater modeling code).Models are instead now better thought of as heuristicscience-based tools to assess what has happened, what isin the realm of possible for the future, and how uncertainthese conjectures are. Heuristic (to find or discover) refersto experience-based techniques for problem solving andlearning. Where an all-encompassing search is impossible,heuristic methods are used to speed up the process offinding a satisfactory solution. In this way, a model is notthe natural world captured in the computer, rather it is aform of “computer-aided thinking” (Molz et al. 2011), atype of calculator that allows exploration of a problem andthe bounds of what might happen using the quantitativeframework and physics that approximate our world. In thiscontext, one who builds a model is always expected to bemindful of the larger world where the model is to be used.Therefore, today’s modeler can be thought of as part of alarger team encompassing many aspects outside the realmof pure hydrology, such as ecology, economics, policy,regulation, and decision-support information technology.

With this in mind, it is worth restating the possibleroles of models in today’s decision making. As Doherty(2011) points out, “a decision often must address the factthat something bad may happen. We may be willing to

NGWA.org Vol. 50, No. 3–GROUND WATER–May-June 2012 331

Page 4: The Current State of Modeling

pay a price to reduce the likelihood of its occurrence.How much we are prepared to pay depends on the cost ofits occurrence, and on the amount by which its likelihoodcan be reduced through preemptive management.” In thisview, the current role of modeling is to provide decisionmakers with a science-based estimate of likelihood thatsomething might happen, not predicting the future. Rather,modeling is better thought of as showing what we donot know, as well as what we do. Or put another way,although modeling cannot guarantee the right answerregardless of the advances we make, it can relate wherewithin the uncertainty limits the answer should lie givenexpert knowledge and observations from the site. Acaution for current models is, therefore, that the powerof running sophisticated models using parallel computingon many CPUs can in fact result in simply getting badanswers faster.

This line of reasoning suggests that groundwatermodel uncertainty also interacts with the decision-makingprocess in less obvious ways (Hunt and Welter 2010).As noted by many, the model-construction processorganizes and formalizes potential conceptual models ofa groundwater system. This in and of itself has utilitybecause it raises the discussion of contrasting professionalopinions to discussion of reasonableness of process andranges of model inputs. Even models that fail to reproduceobservations may provide useful insight into the utility orfutility of the conceptual model—an insight that oftenis more useful than partially successful model results.Moreover, quantitative hydrologic models often feed lessquantitative ecological, economic, and societal elementsof decision making. What else is there to use in theabsence of admittedly heuristic insights gained fromthese groundwater models? Although we cannot calculatehard and fast error bounds around a completely accuratemodel prediction, attempts at such calculations can stillhave value—for example, subtracting the predictionuncertainty for various scenarios of future monitoringand data collection. This type of model output facilitatesfocusing on efficient use of current and future monitoringresources—something valued by many decision makersregardless of background, system managed, and socialcontext.

Thus, despite the plethora of sophisticated tools andcapabilities, a good modeler of today is no different thanone of the past. All present, past, and future modelingproblems can be considered mathematically ill-posed andunder-constrained by observations from the field. There-fore, although we have the ability to create very complexand realistic looking models, there remains a great mod-eling responsibility to distill the complexity of the siteto what is truly important. That is, the ability to decom-pose a societal problem to its component parts is still ofthe utmost importance now as it was in the past whenthese advanced capabilities did not exist. Or put anotherway, it does not matter how “real” a model looks if salientprocesses are missing, confounded, or misrepresented; theonly thing real is the model is not good.

As in the past, the quality of current modelingdepends more on hydrosense than horsepower! Thishydrosense—the intuitive feel of what is and what isnot reasonable for a system that is difficult to observedirectly—is critical because that is still the primary mech-anism to assess if the model is a reasonable simulator,or is suffering from GIGO (garbage in, garbage out).Moreover, knowing the cost-benefit trade-offs of modelpower/complexity is needed to avoid unnecessary, andoften expensive, modeling ornamentation—that whichmakes a model look real but adds cost without benefitin the best case, or in the worst case, so obfuscates modelerrors that the model is not even an adequate simulator forthe question asked. Developing a modelers’ hydrosense isnot an exact science, unfortunately. Clearly, the ability toconceptually frame real problems into mathematical con-structs such as back-of-the-envelope calculations (Hait-jema 2006) is more than an academic exercise. The bestmodelers today are still those that can intuit fundamentalprocesses and concepts (such as mass balance, water flows“downhill” from higher to lower total head, transport timescale), and they then relate these to complex model con-struction and critical assessments of model output. Use ofhand calculations, analytical solutions, and simpler mod-eling techniques such as analytic element methods (forexample, see Ground Water special issue volume 44, num-ber 1) allow a modeler to develop insight without undueamounts of time and funding. Hydrosense pays many div-idends, as this critical thinking is necessary to constructand vet all the powerful modeling tools of today, as wellas the modeling advances of tomorrow.

ReferencesBerkowitz, B., J. Klafter, R. Metzler, and H. Scher. 2002.

Physical pictures of transport in heterogeneous media:Advection-dispersion, random-walk, and fractional deriva-tive, formulations. Water Resources Research 38, no. 10:1191. DOI: 10.1029/2001WR001030

Bianchi, M., C. Zheng, C. Wilson, G.R. Tick, G. Liu, andS.M. Gorelick. 2011. Spatial connectivity in a highlyheterogeneous aquifer: From cores to preferential flowpaths. Water Resources Research 47: W05524. DOI:10.1029/2009WR008966.

Doherty, J. 2011. Modeling: Picture perfect or abstract art?Ground Water 49: 455. DOI: 10.1111/j.1745-6584.2011.00812.x.

Doherty, J. 2009. Model-based decision-making: Some reflec-tions. In Proceedings of the 1st PEST Conference, November2-4, 2009, Potomac, Maryland, ed. M.J. Tonkin. Bethesda,Maryland: S.S. Papadopulos & Associates, Inc., 283–293.

Doherty, J., and R.J. Hunt. 2010. Approaches to highlyparameterized inversion: A guide to using PEST forgroundwater-model calibration. U.S. Geological SurveyScientific Investigations Report 2010–5169. Reston, Vir-ginia: USGS, 60 p.

Fienen, M.N., R.J. Hunt, D.P. Krabbenhoft, and T. Clemo.2009. Obtaining parsimonious hydraulic conductivity fieldsusing head and transport observations: A Bayesian geo-statistical parameter estimation approach. Water ResourcesResearch 45: W08405. DOI: 10.1029/2008WR007431.

Haitjema, H. 2011. Model complexity: A cost-benefit issue. InGeological Society of America Annual Meeting, October9–12, 2011, Minneapolis, Minnesota. http://gsa.confex.com/

332 Vol. 50, No. 3–GROUND WATER–May-June 2012 NGWA.org

Page 5: The Current State of Modeling

gsa/2011AM/finalprogram/abstract_197453.htm (accessedFebruary 15, 2012).

Haitjema, H. 2006. The role of hand calculations in ground waterflow modeling. Ground Water 44, no. 6: 786–791. DOI:10.1111/j.1745-6584.2006.00189.x.

Haitjema, H.M., D.T. Feinstein, R.J. Hunt, and M. Gusyev.2010. A hybrid finite difference and analytic elementgroundwater model. Ground Water 48, no. 4: 538–548.DOI: 10.1111/j.1745-6584.2009.00672.

Hammond, G.E., and P.C. Lichtner. 2010. Field-scale modelfor the natural attenuation of uranium at the Hanford 300Area using high-performance computing. Water ResourcesResearch 46: W09527. DOI: 10.1029/2009WR008819.

Hunt, R.J. 2006. Ground water modeling applications using theanalytic element method. Ground Water 44, no. 1: 5–15.DOI: 10.1111/j.1745-6584.2005.00143.x.

Hunt, R.J., and D.E. Welter. 2010. Taking account of “unknownunknowns”. Ground Water 48, no. 4: 477. DOI: 10.1111/j.1745-6584.2010.00681.x.

Hunt, R.J., J. Luchette, W.A. Schreuder, J.O. Rumbaugh,J. Doherty, M.J. Tonkin, and D.B. Rumbaugh. 2010. Usinga cloud to replenish parched groundwater modeling efforts.Ground Water 48, no. 3: 360–365. DOI: 10.1111/j.1745-6584.2010.00699.x.

Hunt, R.J., J. Doherty, and M.J. Tonkin. 2007. Are modelstoo simple? Arguments for increased parameterization.Ground Water 45, no. 3: 254–262. DOI: 10.1111/j.1745-6584.2007.00316.x.

Jones J.E., and C.S. Woodward. 2001. Newton-krylov-multigridsolvers for large-scale, highly heterogeneous, variablysaturated flow problems. Advances in Water Resources 24:763–774.

Kollet, S.J., R.M. Maxwell, C.S. Woodward, S. Smith,J. Vanderborght, H. Vereecken, and C. Simmer. 2010.Proof of concept of regional scale hydrologic simulations athydrologic resolution utilizing massively parallel computerresources, Water Resources Research 46: W04201. DOI:10.1029/2009WR008730.

Lin, Y.-F., J. Wang, and A.J. Valocchi. 2009. PRO-GRADE:GIS toolkits for ground water recharge and dischargeestimation. Ground Water 47, no. 1: 122–128. DOI:10.1111/j.1745-6584.2008.00503.x.

Ma, R., C. Zheng, H. Prommer, J. Greskowiak, C. Liu,J. Zachara, and M. Rockhold. 2010. A Field-scale reac-tive transport model for U(VI) migration influenced bycoupled multi-rate mass transfer and surface complexationreactions, Water Resources Research 46: W05509. DOI:10.1029/2009WR008168.

Markstrom, S.L., R.G. Niswonger, R.S. Regan, D.E. Prudic, andP.M. Barlow. 2008. GSFLOW—Coupled ground-waterand surface water flow model based on the integration ofthe Precipitation-Runoff Modeling System (PRMS) and theModular Ground-Water Flow model (MODFLOW-2005).U.S. Geological Survey techniques and methods 6-D1.Reston, Virginia: USGS, 240 p.

Mehl, S., and M.C. Hill. 2005. MODFLOW-2005, the U.S.Geological Survey Modular Ground-Water Model: Docu-mentation of shared node local grid refinement (LGR) andthe boundary flow and head (BFH) package. U.S. Geolog-ical Survey techniques and methods 6-A12. Reston, Vir-ginia: USGS, 78 p.

Molz, F., D. Demirkanli, S. Thompson, and D.I. Kaplan. 2011.Mathematical models, computer-aided thinking and the sci-entific method. In GSA Annual Meeting, October 9–12,2011, Minneapolis, Minnesota. http://gsa.confex.com/gsa/2011AM/finalprogram/abstract_192897.htm (accessedFebruary 15, 2012).

Moore, C., and J. Doherty. 2005. The role of the calibrationprocess in reducing model predictive error. Water ResourcesResearch 41, no. 5: W05050.

National Research Council (Committee on Models in the Regu-latory Decision Process). 2007. Models in EnvironmentalRegulatory Decision Making. Washington, DC: NationalAcademies Press, 287 p.

Niswonger, R.G., D.E. Prudic, and R.S. Regan. 2006. Documen-tation of the Unsaturated-Zone Flow (UZF1) package formodeling unsaturated flow between the land surface andthe water table with MODFLOW-2005. U.S. GeologicalSurvey techniques and methods 6-A19. Reston, Virginia:USGS, 74 p.

Parkhurst, D.L., K.L. Kipp, and S.R. Charlton. 2010. PHASTVersion 2—A program for simulating groundwater flow,solute transport, and multicomponent geochemical reac-tions. U.S. Geological Survey techniques and methods 6-A35. Reston, Virginia: USGS, 235 p.

Prommer, H., B. Anneser, M. Rolle, F. Einsiedl, and C. Griebler.2009. Biogeochemical and isotopic gradients in a BTEX/PAH contaminant plume: model-based interpretation of ahigh-resolution field data set. Environmental Science andTechnology 43, no. 21: 8206–8212.

Pruess, K., C. Oldenburg, and G. Moridis. 2011. TOUGH2User’s Guide, Version 2.1, Report LBNL-43134 (revised).Berkeley, California: Lawrence Berkeley National Labora-tory.

Radcliffe, D., and J. Simunek. 2010. Soil Physics with HYDRUS:Modeling and Applications. Boca Raton, Florida: CRCPress, Taylor & Francis Group, 373 p. ISBN-10:142007380X, ISBN-13:9781420073805.

Therrien, R., R.G. McLaren, E.A. Sudicky, and S.M Panday.2006. Hydrogeosphere. Waterloo, Canada: GroundwaterSimulations Group, University of Waterloo.

Zhang, Y., and D. Benson. 2008. Lagrangian simulation ofmultidimensional anomalous transport at the MADE site.Geophysical Research Letters 35: L07403. DOI: 10.1029/2008GL033222.

Zheng, C., M. Bianchi, and S.M. Gorelick. 2011. Lessonslearned from 25 years of research at the MADE site.Ground Water 49, no. 5: 649–662. DOI: 10.1111/j.1745-6584.2010.00753.x.

NGWA.org Vol. 50, No. 3–GROUND WATER–May-June 2012 333