Hi tune sharing

HiTune sharing

Xiao Zhu1/29/2013

HiTune is...– a Hadoop performance analyzer– developed by Intel– based on Chukwa– https://github.com/intel-hadoop/HiTune– Contact: jason.dai@intel.com jie.huang@intel.com.– Has 3 parts:– 1) Tracker – 2) Aggregation Engine– 3) Analysis Engine

Example of HiTune Output

Chukwa is...– an open source data collection system

for monitoring large distributed systems.– based on HDFS and Map/Reduce

framework.– http://incubator.apache.org/chukwa/

– Has many parts, including:– 1) Agent– 2) Collector– 3) DemuxManager – 4) Other processes for logging and

archive

HiTune is based on ChukwaTracker

Aggregation Engine

Analysis Engine

Collector

Demux Manager

is partly based on

is based on

is partly based on

We tend to call those parts by the right side names, and when we refer toHiTune, we are considering HiTune and Chukwa together

Some of them are simply built upon Chukwa componentsbut others are implemented by modifying Chukwa or add new components.

You will find Chukwa patches and patched Chukwa binary in HiTune release.So when you are going to deploy HiTune, I do not suggest deploy Chukwafirst manually (though you can), for HiTune has already included it.

HiTune is based on ChukwaTracker

Aggregation Engine

Analysis Engine

Collector

Demux Manager

is partly based on

is based on

is partly based on

The tracker includes HiTune java agent part and Chukwa agent part.The analysis engines includes HiTune script part and Chukwa Demux part.

See following data flow for explanations on those parts.

HiTune/Chukwa System Basic StructureHiTune/Chukwa itself needs to set up on a standalone hadoop cluster. We name it as ‘Chukwa Cluster’, and the target cluster is named ‘Hadoop Cluster’.

HiTune Agents

Workload

Map/ReduceHDFS

Collectors Map/Reduce

Hadoop Cluster Chukwa Cluster

User’s ComputerExcel

HiTune/Chukwa Process and Data Flow

1. HiTune agents (java agent part) will be invoked by JVM when the workload starts on every node in hadoop cluster. This part will get system status and hadoop logs and save them on local storage.

HiTune Agents

Workload

Map/ReduceHDFS

2. Agent (Chukwa agent part) process will check java agent output periodically and send new data to (one of) the Collector(s).

HiTune Agents

Workload

Map/ReduceHDFS

3. Collector(s) put data to HDFS on Chukwa Cluster, When it has received 64MB data or a given time interval has passed, it pack received data to data packages (.done)

HiTune Agents

Workload

Map/ReduceHDFS

4. Demux Manager check data packages in Collector output dir on HDFS every 20 seconds. If it find .done files, it start Map/Reduce procedure to analyze it (May cost a long time to finish).

HiTune Agents

Workload

Map/ReduceHDFS

4. (Cont.) After Demux finishes, a HiTune script is required to run by the user. This script will run Map/Reduce to get final output (.csv files) (May cost a long time to finish, but faster than 3).

HiTune Agents

Workload

Map/ReduceHDFS

5. User get final output from hdfs://.JOBS/ manually. Then apply the output (.csv files) to HiTune Excel template to see the result. Graphics, Summaries and etc. will be computed by Excel.

HiTune Agents

Workload

Map/ReduceHDFS

HiTune/Chukwa Process and Data Flow• Yes if you want you can deploy Chukwa on Hadoop cluster.

• Doing so will add difficulties to management and maintenance, but this is theoretically feasible.

Why such structure?• Using Hadoop for MapReduce processing of

logs is somewhat troublesome.• Logs are generated incrementally across many

machines, but Hadoop MapReduce works best on a small number of large files.

• HDFS doesn't currently support appends, making it difficult to keep the distributed copy fresh.

Why such structure?• Chukwa is devoted to bridging that gap

between logs and MapReduce. • Chukwa is a scalable distributed monitoring

and analysis system, particularly logs from Hadoop and other large systems.

• Though process of agents and collectors, large, appended, distributed logs are transformed into large data chunks, which are suitable for Map/Reduce.

Why such structure?• The overhead is mainly caused by agents,

since only agents run on Hadoop Cluster.• According to the HiTune paper, the overhead

is less than 2%• See those papers:• Dai, Jinquan, et al. "Hitune: Dataflow-based performance analysis for big data

cloud." Proc. of the 2011 USENIX ATC (2011): 87-100. (Available on HiTune Github https://github.com/intel-hadoop/HiTune)

• Boulon, Jerome, et al. "Chukwa, a large-scale monitoring system." Proceedings of CCA. Vol. 8. 2008.

current HiTune version: 0.9• Support Hadoop 0.2 best• Based on Chukwa 0.4• Can support Hadoop 0.2+ , some options need

to be changed, and some metrics will be missing. (Current IDH is using Hadoop 1.0+)

• Usually require a long time to complete aggregating and analyzing. Better deploy it on a fast cluster.

Questions?

Backup

HiTune trouble shooting• Trouble shooting on HiTune is usually painful.• Need to check those logs: Hadoop cluster logs (task

tracker logs, job tracker logs, namenode logs, datanode logs), (most important!)Chukwa logs (agent logs, collector logs, demux logs), HiTune logs(script outputs).

• If there is no error or warning in logs, check outputs on disk and HDFS

• HiTuneStatusCheck.sh is not reliable. Check the logs yourself.

6. Later, Chukwa will group and archive data used on Chukwa Cluster HDFS to save space, but we will not discuss it here.

HiTune Agents

Workload

Map/ReduceHDFS

Hi tune sharing

Technology

Freddy’s Tune

Tune Protect Travel - AirAsiaGo Protect Travel... · Tune Protect Malaysia (Tune Insurance Malaysia Berhad 30686-K) Level 9, Wisma Tune, No 19 Lorong Dungun, Damansara Heights, 50490,

FOSTERING INFRASTRUCTURE SHARING IN THE ...documents1.worldbank.org/curated/en/936201561361920622/...FOSTERING INFRASTRUCTURE SHARING IN THE WESTERN BALKANS: B˜lk˜ns Di˚it˜l Hi˚hw˜˛

PolyTune™ · PDF fileGuitar - In tune 4 string bass - In tune 5 string bass - In tune. 11 6 string bass - out of tune 6 string bass - In tune Mode switching

Boiler Tune

A Different Tune

HI-506, HI-507, HI-508, HI-509 Datasheet...2009/07/08 · HI-506, HI-507, HI-508, HI-509 FN3142 Rev 10.00 Page 6 of 25 Jun 14, 2016 Functional Diagrams HI-506 HI-507 HI-508 HI-509

Tune hadoop

CURRENT AFFAIRS GROUP - BBCnews.bbc.co.uk/2/shared/bsp/hi/pdfs/10_11_15_fo4_aninsidejob.pdfillegal immigration is an inside job. SIGNATURE TUNE ACTUALITY IN CAR DEITH: It was a few

HI-3110, HI-3111, HI-3112, HI-3113

Tune Protect Inbound Plan - Amazon S3 · 2017-10-09 · Tune Protect Malaysia (Tune Insurance Malaysia Berhad 30686-K) Level 9, Wisma Tune, No 19 Lorong Dungun, Damansara Heights,

HI-8450, HI-8451, HI-8454, HI-8455

February Tune-up SpecialsFeb 02, 2020 · Jr. Ski Full Tune Snowboard Full Tune Sharpen Edges -Plane Base -Microstone Base -Bevel Edges - Hot Wax -Minor Ptex Adult Full Tune $49.95

Tune In Volume 14 Number 1 | January 2020 | Page Tune In

Handplane Tune Up

Micro HI-FI · Micro HI-FI ComponentSystem 2-663-704-12(1) Owner's Record ... good reception, aud theu set up the auteuna. Keep the anteunas away t?x)m the speaker cords ... 3 Tune

Tune In Volume 15 Number 5 | October 2021 | Page Tune In

Tune In Volume 13 Number 1 | March 2019 | Page Tune In

SPECTRA TUNE LAB - Ledmotive€¦ · IoT and spectral sharing platform • µWAVE Software© with the SPECTRA TUNE LAB basic operation control • Optional: RESTful API • Optional:

To Tune or not to Tune? To Tune or not to Tune? A Lightweight Physical Design Alerter Costa Jean-Denis Le Yaouanc Aurélie Mécanismes de SGBD 2007