8
Weekly Report By: Devin Trejo Week of June 21, 2015-> June 28, 2015

Weekly Report By: Devin Trejo Week of June 21, 2015-> June 28, 2015

Embed Size (px)

Citation preview

Page 1: Weekly Report By: Devin Trejo Week of June 21, 2015-> June 28, 2015

Weekly ReportBy: Devin Trejo

Week of June 21, 2015-> June 28, 2015

Page 2: Weekly Report By: Devin Trejo Week of June 21, 2015-> June 28, 2015

Previous Goals

• Get the webserver accessible on the network. • Get it's security approved by Dave Hart (Temple Network

Administrator)• Get exp22 to run successfully on the cluster.• Pick out/price a hardware configuration for the cluster based on the

results from the test job. • NEDC Corpus

Page 3: Weekly Report By: Devin Trejo Week of June 21, 2015-> June 28, 2015

Accomplishments

• The website has web access over its own static IP address• I synced the web hosted isip website with the local disk. I also made a

local backup of the external drive.• Contacted Dave Hart about getting the security on the box checked. I

emailed him on Tuesday and received no response. I'll follow up with him on Monday.

Page 4: Weekly Report By: Devin Trejo Week of June 21, 2015-> June 28, 2015

Accomplishments (cont.)• Cluster is Alive! • I worked on getting gen_feats operating on 1000 EEGs running on the

cluster. I queued 50 instances of the job to run on the cluster. I also ran one instance of the job on owl's nest for comparison. Results next slide.

Page 5: Weekly Report By: Devin Trejo Week of June 21, 2015-> June 28, 2015

Accomplishments (cont.)NEDC Test Cluster:user=devin group=devin jobname=feats_049exec_host=n004.nedccluster.com/0Exit_status=0resources_used.cput=05:53:00 resources_used.mem=1848284kb resources_used.vmem=3491816kbresources_used.walltime=06:00:55

Owl's Nest:PBS Job Id: 524936.owlsnest1.nfsJob Name: feats_000Exec host: w066/0Execution terminatedExit_status=0resources_used.cput=05:14:17resources_used.mem=3453572kbresources_used.vmem=3690128kbresources_used.walltime=05:19:17

Page 6: Weekly Report By: Devin Trejo Week of June 21, 2015-> June 28, 2015

Accomplishments (cont.)• Of note the NEDC Cluster completed

the job at 6hrs flat compared to the 5 hrs 19 mins on the Owl's Nest.

• Finished with errors, while Owl's Nest finished with no errors.

• We suspect an IO issue.

• 3.5GB of RAM on Owl's Nest. • I'm rerunning the experiment on the

cluster on node w/ 6GB to see if the usage will result in similar results to what was seen on Owl's Nest.

Page 7: Weekly Report By: Devin Trejo Week of June 21, 2015-> June 28, 2015

Accomplishments (cont.)

NFS SELECTION SUMMARYPROCESSOR 2 x Quad-Core Intel® Xeon® Processor E5-2623 v3

3.0GHz 10MB Cache (105W)MOTHERBOARD Intel® C612 Chipset- Dual Intel® 10-Gigabit Ethernet

(RJ45) - 10x SATA3 - IPMI 2.0 with LANMEMORY 8 x 4GB PC4-17000 2133MHz DDR4 ECC Registered

DIMMCHASSIS Thinkmate® STX-4324 4U Chassis - 24x Hot-Swap

3.5" SATA/SAS - 12Gb/s Single Expander - 920W Redundant Power

BOOT DRIVE 2 x 64GB SATA 6.0Gb/s Disk on Module (MLC) (Vertical)

STORAGE DRIVE 12 x 3.0TB SATA 6.0Gb/s 7200RPM - 3.5" - Western Digital RE WD3000FYYZ

CONTROLLER CARD LSI SAS 9300-8i SAS 12Gb/s PCIe 3.0 8-Port Host Bus Adapter

$7,705.00LINK: http://www.thinkmate.com/system/stx-nl-xe24-2460v3-sas3/99671

COMPUTE NODES (INTEL) SELECTION SUMMARY

BAREBONESupermicro SuperServer 2028TP-HTR - 2U TwinPro2 - 24x SATA - Dual Gigabit Ethernet - 2000W Redundant

PROCESSOR8 x Ten-Core Intel® Xeon® Processor E5-2650 v3 2.30GHz 25MB Cache (105W)

MEMORY32 x 8GB PC4-17000 2133MHz DDR4 ECC Registered DIMM

BOOT DRIVE 4 x No SATA Disk on Module for boot

HARD DRIVE4 x 120GB Micron M500DC 2.5" SATA 6.0Gb/s Solid State Drive

COMPUTE NODES (AMD) SELECTION SUMMARY

BAREBONESupermicro A+ Server 2022TG-HTRF - 2U Twin2 - 1400W Redundant

PROCESSOR8 x Sixteen-Core AMD Opteron™ Model 6378 - 2.4GHz 16MB Cache (115W TDP)

MEMORY32 x 16GB PC3-14900 1866MHz DDR3 ECC Registered DIMM

HARD DRIVE4 x 120GB Micron M500DC 2.5" SATA 6.0Gb/s Solid State Drive

Page 8: Weekly Report By: Devin Trejo Week of June 21, 2015-> June 28, 2015

New Goals

• Get isip2.piconepress.com responding to http requests.• Finalize a hardware configuration for the cluster

• Resolve python dependencies and run exp22 on the cluster to test hardware resource draw for training.

• Budget Management• NEDC Corpus