16
Computation Time Analysis - Climate Reanalysis Data Dipanwita Dasgupta University of Notre Dame Graduate Operating Systems

Computation Time Analysis - Climate Reanalysis Data

  • Upload
    alexia

  • View
    48

  • Download
    0

Embed Size (px)

DESCRIPTION

Computation Time Analysis - Climate Reanalysis Data. Dipanwita Dasgupta University of Notre Dame Graduate Operating Systems. Motivation. Climate Analysis : Why it is important? - PowerPoint PPT Presentation

Citation preview

Page 1: Computation Time Analysis - Climate Reanalysis Data

Computation Time Analysis - Climate Reanalysis Data

Dipanwita Dasgupta University of Notre Dame Graduate Operating Systems

Page 2: Computation Time Analysis - Climate Reanalysis Data

Motivation

• Climate Analysis : Why it is important? Increase in occurrence of climate

hazards

• Climate Reanalysis Data Data Centric Approach Climate Network

Slide 2

Page 3: Computation Time Analysis - Climate Reanalysis Data

Dataset• National Centre for

Environemental Prediction / National Centre for Atmospheric Research (NCEP/NCAR) Reanalysis Dataset Composed of data at 17

pressure levels Total of approximately

10000 grid points Factors affecting climate

Slide 3

Page 4: Computation Time Analysis - Climate Reanalysis Data

Background

• Climate Network Model Limited to use 7 factors affecting climate Affects the predictive modeling Computation Time

Slide 4 out of x

Page 5: Computation Time Analysis - Climate Reanalysis Data

Problem

• Computation has 3 steps1. Reading the data from file2. Calculation at each level 3. Combining the results

• Step 2 – highly computation intensive • The present code can only handle 20 units of data at a

time

Slide 5

Page 6: Computation Time Analysis - Climate Reanalysis Data

Slide 6

Page 7: Computation Time Analysis - Climate Reanalysis Data

Actual Work

• Analyzed time taken to run on a single machine

• Distributed Framework Steps 1 and 2 mentioned in previous slide for

each level are independent of each other Ran in a distributed fashion Used the CRC SGE Machine

Slide 7

Page 8: Computation Time Analysis - Climate Reanalysis Data

Assumptions

• Used only one parameter– Geopotential Height

• Only one measure of dispersion– Euclidean Distance

• Processing is similar for other parameters as well as for measures of dispersion

Slide 8

Page 9: Computation Time Analysis - Climate Reanalysis Data

Experimental Set-up

• NCEP Reanalysis Dataset• 20 units of longitude

• Sequential Execution Used the school workstation desktop

• Distributed Framework Used opteron.crc.nd.edu

Slide 9

Page 10: Computation Time Analysis - Climate Reanalysis Data

Distributed Framework: Setup

• opteron.crc.nd.edu• Submitted Bash script• Ran 10 simulations per level• Took the average

Slide 10

Page 11: Computation Time Analysis - Climate Reanalysis Data

Slide 11

Page 12: Computation Time Analysis - Climate Reanalysis Data

Slide 12

Page 13: Computation Time Analysis - Climate Reanalysis Data

Speedup

Slide 13

Page 14: Computation Time Analysis - Climate Reanalysis Data

Results Analysis

• Distributed Framework works better than Sequential Execution

• Expected Speed-Up not achieved Reading data from the file took more time than

expected Reduced time for the other steps

Slide 14

Page 15: Computation Time Analysis - Climate Reanalysis Data

Future Work

• Optimization of reading data from file• Use various file systems – NFS/AFS• Include more measures of dispersion• Increase the number of parameters

Slide 15

Page 16: Computation Time Analysis - Climate Reanalysis Data

Questions??

Slide 16