16
Computation Time Analysis - Climate Reanalysis Data Dipanwita Dasgupta University of Notre Dame Graduate Operating Systems

Computation Time Analysis - Climate Reanalysis Data Dipanwita Dasgupta University of Notre Dame Graduate Operating Systems

Embed Size (px)

DESCRIPTION

Dataset National Centre for Environemental Prediction / National Centre for Atmospheric Research (NCEP/NCAR) Reanalysis Dataset  Composed of data at 17 pressure levels  Total of approximately grid points  Factors affecting climate Slide 3

Citation preview

Page 1: Computation Time Analysis - Climate Reanalysis Data Dipanwita Dasgupta University of Notre Dame Graduate Operating Systems

Computation Time Analysis - Climate Reanalysis Data

Dipanwita Dasgupta University of Notre Dame Graduate Operating Systems

Page 2: Computation Time Analysis - Climate Reanalysis Data Dipanwita Dasgupta University of Notre Dame Graduate Operating Systems

Motivation

• Climate Analysis : Why it is important? Increase in occurrence of climate

hazards

• Climate Reanalysis Data Data Centric Approach Climate Network

Slide 2

Page 3: Computation Time Analysis - Climate Reanalysis Data Dipanwita Dasgupta University of Notre Dame Graduate Operating Systems

Dataset• National Centre for

Environemental Prediction / National Centre for Atmospheric Research (NCEP/NCAR) Reanalysis Dataset Composed of data at 17

pressure levels Total of approximately

10000 grid points Factors affecting climate

Slide 3

Page 4: Computation Time Analysis - Climate Reanalysis Data Dipanwita Dasgupta University of Notre Dame Graduate Operating Systems

Background

• Climate Network Model Limited to use 7 factors affecting climate Affects the predictive modeling Computation Time

Slide 4 out of x

Page 5: Computation Time Analysis - Climate Reanalysis Data Dipanwita Dasgupta University of Notre Dame Graduate Operating Systems

Problem

• Computation has 3 steps1. Reading the data from file2. Calculation at each level 3. Combining the results

• Step 2 – highly computation intensive • The present code can only handle 20 units of data at a

time

Slide 5

Page 6: Computation Time Analysis - Climate Reanalysis Data Dipanwita Dasgupta University of Notre Dame Graduate Operating Systems

Slide 6

Page 7: Computation Time Analysis - Climate Reanalysis Data Dipanwita Dasgupta University of Notre Dame Graduate Operating Systems

Actual Work

• Analyzed time taken to run on a single machine

• Distributed Framework Steps 1 and 2 mentioned in previous slide for

each level are independent of each other Ran in a distributed fashion Used the CRC SGE Machine

Slide 7

Page 8: Computation Time Analysis - Climate Reanalysis Data Dipanwita Dasgupta University of Notre Dame Graduate Operating Systems

Assumptions

• Used only one parameter– Geopotential Height

• Only one measure of dispersion– Euclidean Distance

• Processing is similar for other parameters as well as for measures of dispersion

Slide 8

Page 9: Computation Time Analysis - Climate Reanalysis Data Dipanwita Dasgupta University of Notre Dame Graduate Operating Systems

Experimental Set-up

• NCEP Reanalysis Dataset• 20 units of longitude

• Sequential Execution Used the school workstation desktop

• Distributed Framework Used opteron.crc.nd.edu

Slide 9

Page 10: Computation Time Analysis - Climate Reanalysis Data Dipanwita Dasgupta University of Notre Dame Graduate Operating Systems

Distributed Framework: Setup

• opteron.crc.nd.edu• Submitted Bash script• Ran 10 simulations per level• Took the average

Slide 10

Page 11: Computation Time Analysis - Climate Reanalysis Data Dipanwita Dasgupta University of Notre Dame Graduate Operating Systems

Slide 11

Page 12: Computation Time Analysis - Climate Reanalysis Data Dipanwita Dasgupta University of Notre Dame Graduate Operating Systems

Slide 12

Page 13: Computation Time Analysis - Climate Reanalysis Data Dipanwita Dasgupta University of Notre Dame Graduate Operating Systems

Speedup

Slide 13

Page 14: Computation Time Analysis - Climate Reanalysis Data Dipanwita Dasgupta University of Notre Dame Graduate Operating Systems

Results Analysis

• Distributed Framework works better than Sequential Execution

• Expected Speed-Up not achieved Reading data from the file took more time than

expected Reduced time for the other steps

Slide 14

Page 15: Computation Time Analysis - Climate Reanalysis Data Dipanwita Dasgupta University of Notre Dame Graduate Operating Systems

Future Work

• Optimization of reading data from file• Use various file systems – NFS/AFS• Include more measures of dispersion• Increase the number of parameters

Slide 15

Page 16: Computation Time Analysis - Climate Reanalysis Data Dipanwita Dasgupta University of Notre Dame Graduate Operating Systems

Questions??

Slide 16