Weekly Report
By: Devin Trejo
Week of June 21, 2015 to June 28, 2015
Previous Goals
• Get the webserver accessible on the network.
• Get its security approved by Dave Hart (Temple Network Administrator).
• Get exp22 to run successfully on the cluster.
• Pick out and price a hardware configuration for the cluster based on the results from the test job.
• NEDC Corpus
Accomplishments
• The website has web access over its own static IP address.
• I synced the web-hosted isip website with the local disk. I also made a local backup of the external drive.
• Contacted Dave Hart about getting the security on the box checked. I emailed him on Tuesday and received no response. I'll follow up with him on Monday.
Accomplishments (cont.)
• Cluster is alive!
• I worked on getting gen_feats, operating on 1000 EEGs, running on the cluster. I queued 50 instances of the job on the cluster and also ran one instance on Owl's Nest for comparison. Results are on the next slide.
Accomplishments (cont.)
NEDC Test Cluster:
user=devin group=devin jobname=feats_049
exec_host=n004.nedccluster.com/0
Exit_status=0
resources_used.cput=05:53:00
resources_used.mem=1848284kb
resources_used.vmem=3491816kb
resources_used.walltime=06:00:55
Owl's Nest:
PBS Job Id: 524936.owlsnest1.nfs
Job Name: feats_000
Exec host: w066/0
Execution terminated
Exit_status=0
resources_used.cput=05:14:17
resources_used.mem=3453572kb
resources_used.vmem=3690128kb
resources_used.walltime=05:19:17
Accomplishments (cont.)
• Of note, the NEDC cluster completed the job in about 6 hours (06:00:55 wall time), compared to 5 hours 19 minutes on Owl's Nest.
• The NEDC cluster run finished with errors, while the Owl's Nest run finished with no errors.
• We suspect an I/O issue.
• The job used about 3.5 GB of RAM on Owl's Nest.
• I'm rerunning the experiment on a cluster node with 6 GB of RAM to see whether the memory usage and results match what was seen on Owl's Nest.
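To make the comparison between the two runs easier to repeat, here is a minimal sketch that recomputes the wall-time gap, the CPU-time-to-wall-time ratio, and the memory use from the `resources_used` values reported above. The helper names (`parse_hms`, `kb_to_gb`, `summarize`) are made up for this sketch, and it assumes that PBS "kb" means 1024 bytes. A ratio close to 1 only means the job spent most of its wall time on the CPU; it is one rough indicator and does not settle the I/O question by itself.

```python
from datetime import timedelta

# Values copied from the two PBS job reports above.
JOBS = {
    "NEDC Test Cluster": {"cput": "05:53:00", "mem": "1848284kb", "walltime": "06:00:55"},
    "Owl's Nest": {"cput": "05:14:17", "mem": "3453572kb", "walltime": "05:19:17"},
}


def parse_hms(text: str) -> timedelta:
    """Convert an HH:MM:SS string into a timedelta."""
    hours, minutes, seconds = (int(part) for part in text.split(":"))
    return timedelta(hours=hours, minutes=minutes, seconds=seconds)


def kb_to_gb(text: str) -> float:
    """Convert a value like '1848284kb' to decimal GB.

    Assumption: one PBS 'kb' is 1024 bytes.
    """
    if not text.endswith("kb"):
        raise ValueError(f"unexpected memory format: {text!r}")
    return int(text[:-2]) * 1024 / 1e9


def summarize(job: dict) -> dict:
    """Derive wall time, CPU fraction, and memory in GB for one job."""
    wall = parse_hms(job["walltime"])
    cpu = parse_hms(job["cput"])
    return {
        "wall_s": wall.total_seconds(),
        "cpu_fraction": cpu / wall,  # timedelta / timedelta gives a float
        "mem_gb": kb_to_gb(job["mem"]),
    }


nedc = summarize(JOBS["NEDC Test Cluster"])
owl = summarize(JOBS["Owl's Nest"])

extra = timedelta(seconds=nedc["wall_s"] - owl["wall_s"])
slowdown = nedc["wall_s"] / owl["wall_s"]

print(f"Extra wall time on NEDC cluster: {extra}")
print(f"Wall-time ratio (NEDC / Owl's Nest): {slowdown:.2f}")
for name, stats in (("NEDC", nedc), ("Owl's Nest", owl)):
    print(f"{name}: cpu/wall = {stats['cpu_fraction']:.2f}, mem = {stats['mem_gb']:.2f} GB")
```

The same script can be pointed at the rerun on the 6 GB node by adding a third entry to `JOBS`.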
Accomplishments (cont.)
NFS SELECTION SUMMARY
PROCESSOR: 2 x Quad-Core Intel® Xeon® Processor E5-2623 v3 3.0GHz 10MB Cache (105W)
MOTHERBOARD: Intel® C612 Chipset - Dual Intel® 10-Gigabit Ethernet (RJ45) - 10x SATA3 - IPMI 2.0 with LAN
MEMORY: 8 x 4GB PC4-17000 2133MHz DDR4 ECC Registered DIMM
CHASSIS: Thinkmate® STX-4324 4U Chassis - 24x Hot-Swap 3.5" SATA/SAS - 12Gb/s Single Expander - 920W Redundant Power
BOOT DRIVE: 2 x 64GB SATA 6.0Gb/s Disk on Module (MLC) (Vertical)
STORAGE DRIVE: 12 x 3.0TB SATA 6.0Gb/s 7200RPM - 3.5" - Western Digital RE WD3000FYYZ
CONTROLLER CARD: LSI SAS 9300-8i SAS 12Gb/s PCIe 3.0 8-Port Host Bus Adapter
PRICE: $7,705.00
LINK: http://www.thinkmate.com/system/stx-nl-xe24-2460v3-sas3/99671
COMPUTE NODES (INTEL) SELECTION SUMMARY
BAREBONE: Supermicro SuperServer 2028TP-HTR - 2U TwinPro2 - 24x SATA - Dual Gigabit Ethernet - 2000W Redundant
PROCESSOR: 8 x Ten-Core Intel® Xeon® Processor E5-2650 v3 2.30GHz 25MB Cache (105W)
MEMORY: 32 x 8GB PC4-17000 2133MHz DDR4 ECC Registered DIMM
BOOT DRIVE: 4 x No SATA Disk on Module for boot
HARD DRIVE: 4 x 120GB Micron M500DC 2.5" SATA 6.0Gb/s Solid State Drive
COMPUTE NODES (AMD) SELECTION SUMMARY
BAREBONE: Supermicro A+ Server 2022TG-HTRF - 2U Twin2 - 1400W Redundant
PROCESSOR: 8 x Sixteen-Core AMD Opteron™ Model 6378 - 2.4GHz 16MB Cache (115W TDP)
MEMORY: 32 x 16GB PC3-14900 1866MHz DDR3 ECC Registered DIMM
HARD DRIVE: 4 x 120GB Micron M500DC 2.5" SATA 6.0Gb/s Solid State Drive
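As a rough aid for comparing the two compute options, the sketch below totals the cores and RAM from the quantities listed in the selection summaries and derives RAM per core, which may matter given that the test job used about 3.5 GB on Owl's Nest. The `ComputeOption` class and its field names are hypothetical, the totals are for each listed configuration as a whole (not per node), and prices are left out because the summaries above do not give them for the compute options.

```python
from dataclasses import dataclass


@dataclass
class ComputeOption:
    """One compute configuration, using quantities from the summaries above."""
    name: str
    cpus: int
    cores_per_cpu: int
    dimms: int
    gb_per_dimm: int

    @property
    def total_cores(self) -> int:
        return self.cpus * self.cores_per_cpu

    @property
    def total_ram_gb(self) -> int:
        return self.dimms * self.gb_per_dimm

    @property
    def ram_per_core_gb(self) -> float:
        return self.total_ram_gb / self.total_cores


# 8 x ten-core Xeon E5-2650 v3, 32 x 8GB DDR4
intel = ComputeOption("Intel", cpus=8, cores_per_cpu=10, dimms=32, gb_per_dimm=8)
# 8 x sixteen-core Opteron 6378, 32 x 16GB DDR3
amd = ComputeOption("AMD", cpus=8, cores_per_cpu=16, dimms=32, gb_per_dimm=16)

# Raw (pre-RAID) capacity of the NFS storage drives: 12 x 3.0TB
nfs_raw_tb = 12 * 3.0

for option in (intel, amd):
    print(
        f"{option.name}: {option.total_cores} cores, "
        f"{option.total_ram_gb} GB RAM, "
        f"{option.ram_per_core_gb:.1f} GB per core"
    )
print(f"NFS raw storage: {nfs_raw_tb:.0f} TB")
```

Core counts alone do not compare the two options fairly, since the processors differ in architecture and clock speed; the test job results are still the better guide.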
New Goals
• Get isip2.piconepress.com responding to HTTP requests.
• Finalize a hardware configuration for the cluster.
• Resolve Python dependencies and run exp22 on the cluster to test hardware resource draw for training.
• Budget Management
• NEDC Corpus