Upload
aenumaelish1
View
259
Download
0
Embed Size (px)
Citation preview
8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis
1/67
Data Analysis
John Bing-CanarFIELDS Group
U.S. EPA, Region 5, Superfund
8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis
2/67
Data Analysis topics: overview data requirements data processing statistical analysis secondary sampling cleanup objectives
software and resources case studies
Outline
8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis
3/67
Stating the obvious:
Overview
8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis
4/67
Why this topic? data, data issues, in-field devices, clean-up
estimation
available softwares and where to get them case studies to highlight concepts
Overview
8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis
5/67
Basic data needs
contaminate concentration(s) locational data (X and Y)
true coordinates (e.g., UTM)
latitutde/longitude local coordinates (e.g., northing and easting from one
corner of site)
date/time collected
additional dataelevation (Z)
Data Requirements
8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis
6/67
Exploratory Data Analysis (EDA)
data error detectioncontaminate, X, Y, Z, date valuesunits (ppm vs. ppb)
data error detection techniquesdescriptive statistics such as maximum/minimum,
graphing (boxplots, histograms) negative values, values outside of range (% oxygen > 100)
mapping (spatial max/min values) are locations far from majority of sample locations?
outlier tests?not a good idea; high contamination values usually
indicate a source not an outlier
Data Processing
8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis
7/67
Exploratory Data Analysis (EDA)
LOD treatment1/2 LOD?
Duplicates and splits treatments take maximum? take average? take median?
Summary
choose a method and document it (transparency)
Data Processing
8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis
8/67
Calibration of field methods with lab methodsGoal: calibration equation, e.g.,
y = (m)(x) + badjusted value = (slope)(lab value) + intercept
For example, InnovX XRFadjusted lead = (1.54)(lab value) 14.13Natural log adjusted lead = (1.00)(Natural log of lab
value) + 0.37
if lab value = 100ppm, then adjusted lead value = 149.75
Statistical Analysis
8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis
9/67
Calibration of field methods with lab methods problems with a calibration equation, y = (m)(x) + b
non-linearity
linear
curvilinear
Statistical Analysis
8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis
10/67
Calibration of field methods with lab methods problems with a calibration equation, y = (m)(x) + b
outliers
Statistical Analysis
8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis
11/67
Calibration of field methods with lab methods problems with a calibration equation, y = (m)(x) + b heteroscedasticity of residuals
Statistical Analysis
8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis
12/67
Calibration of field methods with lab methods problems with a calibration equation, y = (m)(x) + b
heteroscedasticity of residuals
Statistical Analysis
8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis
13/67
Calibration of field methods with lab methods factors to consider when comparing field to lab
methods:differences in field equipment
Niton XLp712 with Americium 241 radioactive source versusInnovX 4000 with an X-ray source. (apples to oranges)
differences in weather conditions weather (dry soil versus wet soil)
differences in material tested slag or ground slag versus soil residential soil versus cattail marsh (huge differences inorganic matter, i.e., roots) (XRF measures metals in soil notvegetation)
Statistical Analysis
8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis
14/67
Calibration of field methods with lab methods
basic guide to use when using in-field devices:send a range of samples values to the lab (especially
want higher values). DO NOT randomly send samplesto the lab, or send every 5th sample to the lab
try to send at least 20 samples to the lab. (20 labvalues will give the minimum number of values to usefor calibration of field values to lab values.)Method 6200 (SW-846) calls for a minimum of 5% of
samples sent to the lab.
Statistical Analysis
8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis
15/67
Upper confidence limits use upper confidence limits
(UCLs) to determine if sitehas met cleanup goals
EPA-funded software:ProUCL
Statistical Analysis
8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis
16/67
Upper confidence limits
Statistical Analysis
8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis
17/67
Secondary or phased sampling
given interpretation of previous sampling event,where to continue sampling?
Secondary Sampling
8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis
18/67
Adaptive fill sampling user-specified number of samples in the most poorly
sampled areas
Initial locationsSecondary locations
Kalamazoo River
Secondary Sampling
8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis
19/67
Initial locationsRadial locations
Radial sampling to meet the need to adequately describe spatial
variation over distance
Kalamazoo River
Secondary Sampling
8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis
20/67
Initial locations
Radial locations
Radial sampling to meet the need to adequately describe spatial
variation over distance
Kalamazoo River
Secondary Sampling
8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis
21/67
Four components:
2D and 3D data spatial estimation (interpolation)
identify cleanup areas
estimate mass/volume
Cleanup Objectives
8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis
22/67
2D and 3D data
2D data (e.g., mercury levels in sediment surface) 3D data (e.g., downhole gamma logging)
Cleanup Objectives
8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis
23/67
Spatial estimation (interpolation)
interpolation generates estimates at unsampledlocations
interpolation
Scio Pottery
Cleanup Objectives
8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis
24/67
Spatial estimation (interpolation)
why interpolate?hot spot delineationsmass/volume calculationsdetermine remediation areas
interpolation
Cleanup Objectives
8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis
25/67
Identify cleanup areas
goal:average concentration; orany area above a specified value
Deer Lake
Cleanup Objectives
8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis
26/67
Estimate mass and volume
what is expected volume of soil (sediment) andmass of contaminant to meet a cleanup goal?
Deer Lake
Cleanup Objectives
8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis
27/67
Estimate mass and volume
Deer Lake
Cleanup Objectives
8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis
28/67
Software
Freeware FIELDS Tools for ArcGIS (http://epa.instepsoftware.com/FIELDS/) F/S Plus [stand-alone] (http://epa.instepsoftware.com/FIELDS/) SADA [stand-alone] (http://www.tiem.utk.edu/~sada/index.shtml)
GMS [stand-alone] (http://www.ems-i.com/)
Proprietary Software WinGslib (www.statios.com)
EVS/MVS (www.ctech.com) SAGE2001 (www.isaaks.com) earthVision (www.dgi.com)
Software and Resources
8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis
29/67
SoftwareFreeware FORMS II Lite (http://www.epa.gov/superfund/programs/clp/f2lite.htm)
Generates sample labels, bottle tags, and Chain of Custody (COC) forms; Tracks samples from the field to the laboratory; Facilitates electronic capture of sample information into databases; and
Exports data electronically as .xml, .dbf or .txt files. MARSSIM (http://www.epa.gov/radiation/marssim/index.html)
The Multi-Agency Radiation Surveys and Site Investigation Manual(MARSSIM) provides detailed guidance for planning, implementing, andevaluating environmental and facility radiological surveys conducted to
demonstrate compliance with a dose- or risk-based regulation ProUCL (http://www.epa.gov/esd/tsc/software.htm)
Estimates Upper Confidence Limit (UCL) of the mean using variousparametric and nonparametric methods Statistical methods that can be used to verify the attainment of cleanupstandards
Software and Resources
8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis
30/67
Software
Freeware VSP (http://dqo.pnl.gov/index.htm)
Statiscally-based sample design software Supported and financed by the USEPA to meet the DQ) (Data QualityObjectives)
Proprietary Software TerraSeer (http://www.terraseer.com/index.php)
Space-Time Intelligence System (STIS) A GIS software that allows for space and time analyses (e.g., clusteranalysis)
Software and Resources
8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis
31/67
End of Talk
8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis
32/67
Site Info Management: Decision MakingTools ArcGIS
Internet Tools Google Earth ArcGIS Explorer
MS Virtual Earth
IMAAC
8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis
33/67
How Robust are these tools? ArcGIS
Capable of sophisticated Spatial and Statistical Analysis On the fly Complex Queries Requires extensive knowledge of GIS
Internet Tools (Google Earth, ArcGIS Explorer, MSVirtual Earth)
More for Presentation Purposes and general use Less functionality for on the fly or detailed data analysis Designed for non-GIS personnel More extensive analysis would require customized
programming
IMAAC Capable of short term, quick response plume mapping More detailed modeling can be conducted
8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis
34/67
GIS = Geographic Information Systems
It is a collection of computer hardware, software, and geographic data for capturing,
managing, analyzing, and displaying all forms of geographically referenced
information.
How does a full-blown GIS software like ArcGIS differ from Google Earth or other
Internet-based free software?
ArcGIS can answer the following questions:
Query and display:Find and display all samples collected in the last week
Find and display all H2S levels above a criterion
Find and display the highest Cadmium levels in the top 2 feet of soil
8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis
35/67
ArcGIS can answer the following questions:
Quantification of changes that have occurred over space or time:
What is the area (yds2) of a proposed clean up?
How much has the elevation of a landfill decreased (subsided) overtime?
Where have PCB concentrations increased in the sediment surface from
last summer?
What is the mass of Mercury in a proposed removal area?
Statistical analysis:Is there a statistically significant relationship between residential soil levels for Lead?
Is there a pattern of Arsenic levels in soil from an airborne release?
8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis
36/67
Case StudyNicor Gas Company Response, Park Ridge, IL
In June 2007, Nicor Gas company (Nicor) discovered PCBs in gas
meters at four homes in Park Ridge, IL. Nicor cleaned up thehomes and contacted U.S. EPA, whose inspectors then performedfollow-up testing of indoor air, soil, and hard surfaces. In July andAugust, further sampling of other homes in Park Ridge wasconducted in order to determine the extent of contaminated gasmeters.
In July and August, further sampling of other homes in Park Ridgewas conducted in order to determine the extent of contaminatedgas meters. In total, more than 140 homes were sampled. Nicor,U.S. EPA Region 5 Land and Chemicals Division (LCD), U.S. EPARegion 5 Superfund Emergency Response Branch (ERB), and
Illinois Environmental Protection Agency (IEPA) contributed to thesampling effort.
8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis
37/67
8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis
38/67
Following the production of thesemaps, new sample data was
received and added to thedatabase. In preparation foranother site meeting, additionaldata queries were created usingArcMap. For example, querieswere generated that showed theresults data and locations of theoriginal four sampled homes, allcommercial sampled properties,
and properties at which detectedPCBs were found concurrently inmeter, air, and wipe samples.
PCB was detected in source samples. Sourcesamples were from gas meters, regulators, and driplegs
Nicor Gas Company Response
8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis
39/67
Decision Making Tools: Internet Tools
Google Earth
ArcGIS Explorer
MS Virtual Earth
8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis
40/67
Google Earth Utilizes Google search (address) with satellite imagery, maps, and terrain. Free and license versions with various functionality
Google Earth (free) Earth Plus ($20)
Import GPS data
Earth Pro ($400)-EPA OSCs all have a 1 year license Increased functionality and import/export capabilities Allows GIS coverages to be read directly
Enterprise
Limitations Requires Internet Aerials cant be downloaded and used somewhere else. Limited cartographic features Limited GIS functionality
Advantages Simple Fast Maintained
8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis
41/67
Ohio River Tools
Sample Designs and navigation Map generation with save
Data saved as ArcView shapefile, DBF,and AutoCAD DXF
8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis
42/67
8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis
43/67
Query Excel Spreadsheet
Queried data can be saved and exported to
a csv and kml format
8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis
44/67
Google KML Display
8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis
45/67
Outfall Point Link
8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis
46/67
Kerr-McGee
8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis
47/67
ArcGIS Explorer
Layers Search (address) Imagery Topographic maps historical maps street maps
Free Pros
Simple Fast Maintained
Connect to local data and servicesfrom ArcGIS Server
Cons Requires Internet Limited GIS functionality
8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis
48/67
MW FloodEOC staffed for a week on Midwest
Floods including GIS:1. Incident coordinates and information reported
and brought into ARCGIS Explorer
2. Used ArcGIS Explorer 9.2 and added incidentlabels and coordinates rapidly along with SFsites, flood plains, etc.
8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis
49/67
NRC Locations of Interest NPL Locations of Interest
8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis
50/67
FEMA 500 Year Flood Plain FEMA 100 Year Flood Plain
8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis
51/67
FEMA 100 Year Flood Plain
8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis
52/67
8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis
53/67
MS Virtual Earth
Build/Develop web page on top of their platform Free next 2 years for EPA Flexibility Pros
Flexibility and customizability Access all on the web
Cons Not an out of the box tool Web Security In order to make dynamic need to setup/maintain
database
8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis
54/67
Virtual Earth at EPA
8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis
55/67
Virtual Earth at EPA
8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis
56/67
The IMAAC (Interagency Modeling andAtmospheric Assessment Center) providesatmospheric hazards predictions in support ofFederal agencies responding to incidents ofairborne releases with national significance.
The IMAAC leverages existing Federalcapabilities and is responsible for providing
accurate, reliable estimates of predicted hazardareas, with associated concentrations, thatserve as the foundation for decisions by theauthorized emergency managers.
What is IMAAC?
8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis
57/67
The NARAC (National Atmospheric Release Advisory
Center), located at the University of CaliforniasLawrence Livermore National Laboratory, has beendesignated as the primary interim provider of IMAACcapabilities and is currently supporting hundreds ofDepartment of Homeland Security stakeholders inaddition to its traditional suite of customers and users
NARAC is a distributed system, providing modeling andgeographical information tools for deployment to an end
user's computer system as well as real-time access toglobal meteorological and geographical databases andadvanced three-dimensional model predictions from thenational center
What is NARAC?
8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis
58/67
NARAC Features NARAC provides a suite of multi-scale (local-, regional-,
continental- and global-scale) atmospheric flow and
dispersion models for a wide range of hazards. Some ofthe key features of this modeling system are: Automated, and validated real-time 3-D centralized
modeling system at NARAC that simulates complex
wind flows, detailed particle dispersion, wet and drydeposition on multiple spatial scales: Local-scale and regional-scale meteorological forecast and
dispersion models Long-range meteorological forecast and dispersion models
Models re-locatable anywhere in the world in real-time Nuclear explosion fallout model Fast-running, deployable local-scale dispersion model
8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis
59/67
The IMAAC/NARAC supports customers
through several channels: Direct telephone calls to expert operations
staff
Internet interface (NARAC Web) to a high-performance computing center
Internet-based remote access softwareinstalled on a customer's local computer.
Types of NARAC Support
8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis
60/67
8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis
61/67
8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis
62/67
8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis
63/67
8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis
64/67
8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis
65/67
Ardent SentryDatabase Query based on NARAC Plume
8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis
66/67
8/9/2019 05 OSCReadiness2008 FIELDS Data Analysis
67/67
Contact Information
NARAC - http://narac.llnl.govCustomer SupportEmergency Only: 925-424-6465 (24x7)Non-emergency: 935-424-2722 (daytime)e-mail: [email protected] request account:
https://naracwebx2.llnl.gov/NaracWeb/jsp/RequestAccount.jsp
IMAAC Web
http://imaacweb.llnl.gov925-422-9159 7:30 am - 4:15 pm (PT), M-F925-422-7627
IMAAC emergency contact dispatcher 925-422-9100 (24x7)