Keep your Hadoop cluster at its best!

Chris Nauroth, Sheetal Dolas
Hadoop Summit, San Jose, 2016

About Us

Chris Nauroth

⬢ Principal Engineer @ Hortonworks
⬢ Committer and PMC, Apache Hadoop
– Key contributor to HDFS ACLs, Windows compatibility, and operability improvements
⬢ Hadoop user since 2010
– Experience deploying, maintaining and using Hadoop clusters

cnauroth@hortonworks.com cnauroth

About Us

Sheetal Dolas

⬢ SmartSense Engineering Lead @ Hortonworks
⬢ Most of career spent in the field, solving real-life business problems
⬢ Last 6+ years in Big Data
⬢ Committer and PMC, Apache Metron

sheetal@hortonworks.com sheetal_dolas

Agenda

⬢ Days in the life of Hadoop users: real war stories!
⬢ Hadoop operational challenges
⬢ Winning and avoiding the wars
⬢ Q & A

Days in the life of Hadoop users: real war stories!

Story I: Unstable NameNode, Frequent Fail Overs
⬢ NameNode periodically becomes unresponsive
⬢ In an HA scenario, fails over to standby
⬢ In a short time, fails back again
⬢ Very frequent fail overs and fail backs

It was the garbage collection!

Story II: Very high CPU usage but low throughput
⬢ Unusually high system CPU usage
⬢ Jobs slowed down
⬢ Reduced data IO

[Charts: System CPU, User CPU, N/W IO]

Transparent Huge Pages (THP) was turned on!

Story III: Cascading impact and cluster melt down
⬢ HDFS upgraded
⬢ HDFS utilization kept increasing even after large data deletion
⬢ Rebalancing made the situation worse
⬢ Eventually HDFS became unresponsive

[Diagram: HDFS Upgrade, HDFS Space, Job Performance, Cluster Stability]

Un-finalized HDFS upgrade had a cascading impact on the cluster!

Story IV: Overloaded cluster

⬢ Jobs run slower
⬢ Always waiting containers and jobs; all YARN queues are fully utilized
⬢ Some jobs had to wait for hours to get container slots

Suboptimally configured container sizes!

[Chart: Requested Memory, Used Memory]

Story V: Accidental deletion of critical datasets

⬢ User accidentally executed hdfs dfs -rm -R on a root directory
⬢ Delete is issued in parallel, Ctrl+C did not help
⬢ In panic, user shuts down HDFS immediately (fortunately)
⬢ Restarts later to check trash, loses all data
⬢ It’s nearly impossible to recover blocks from the local file system

This is a more common mistake than one may think!

Story VI: Hive query returning random results

⬢ A Hive query returns different results every time
⬢ Results are usually accurate during office hours
⬢ After office hours, results keep changing randomly on every execution

-- QUERY: WHAT IS TODAY’S TOTAL SALE AS OF NOW?
SELECT SUM(amount) FROM sales WHERE sale_date = TO_DATE(FROM_UNIXTIME(UNIX_TIMESTAMP()))

One of the hosts had a different time zone!
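Why the time zone matters can be shown with a small sketch. It is not the Hive implementation; the helper name `local_date` and the two UTC offsets are assumptions chosen for illustration. It converts one and the same instant to a calendar date under two different offsets, which is what happens when "today" is evaluated on hosts with different time zones.

```python
from datetime import datetime, timedelta, timezone


def local_date(epoch_seconds, utc_offset_hours):
    """Calendar date of an instant as seen by a host at the given UTC offset.

    Hypothetical helper: fixed offsets stand in for real host time zones.
    """
    tz = timezone(timedelta(hours=utc_offset_hours))
    return datetime.fromtimestamp(epoch_seconds, tz).date().isoformat()


# One instant: 02:00 UTC, which is still the previous evening at UTC-7.
instant = int(datetime(2016, 6, 29, 2, 0, tzinfo=timezone.utc).timestamp())

print(local_date(instant, 0))    # host configured for UTC
print(local_date(instant, -7))   # host configured for UTC-7
```

During the hours when both offsets fall on the same calendar day the two hosts agree, so the query looks correct. Once one host has crossed midnight, the filter on sale_date depends on which host evaluates it.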

and the stories continue…

Hadoop operational challenges

Hadoop has lots of configurations

⬢ So many configurations! Overwhelming for many users
⬢ Best practices are evolving and change across versions

Many configurations are cluster and workload specific

⬢ A configuration good for one cluster may not be suitable for another cluster
⬢ Optimally configured clusters may become suboptimal tomorrow as they grow

Large clusters add to the complexities

⬢ Managing, updating and keeping nodes in sync becomes challenging

⬢ Nodes going down miss the maintenance cycles and get out of sync

⬢ Newly added nodes may have different standards (Java version, OS, user configurations, etc.)

⬢ Clusters start having heterogeneous hardware over time

Winning and avoiding the wars with SmartSense

SmartSense

⬢ Proactive support & personalized cluster insights by
– Enabling faster case resolution
– Applying industry best practices
– Providing proactive analysis
⬢ SmartSense is a collection of tools and services
– Evaluates the cluster’s current configuration and runtime environment against a rich set of rules
– Rules are dynamic, reacting to thresholds tailored to the specific cluster and its workloads
– Continuously evolving and improving rule sets, developed by or in close consultation with active committers, support engineers and field engineers

SmartSense Architecture

[Diagram: agents on worker nodes, Ambari, server, bundle, gateway, landing zone, SmartSense Analytics]

Addressing: Unstable NameNode, Frequent Fail Overs

Daunting Questions
⬢ What is the right heap size for my NN?
⬢ What should the new gen size be?
⬢ Which GC should I use?
⬢ What GC options should be configured?
⬢ What if my cluster grows?

SmartSense Answer
⬢ Rule: hdfs_nn_jvm_opts
⬢ Calculates heap size based on
– Current heap usage
– Total number of objects in the file system
– Best practices
⬢ Recalculates dependent JVM options based on heap size
⬢ Validates existing JVM opts
⬢ Provides continuous validations and proactive recommendations

NameNode JVM Opts

⬢ Heap size
– 200 bytes per HDFS object (files, directories, blocks)
– 25% buffer
⬢ -Xms should be the same as -Xmx
⬢ New generation size should be 1/8th of -Xmx (capped at 8G)
⬢ Use Concurrent Mark Sweep (CMS) garbage collection
– -XX:+UseConcMarkSweepGC
– -XX:CMSInitiatingOccupancyFraction=70
– -XX:+UseCMSInitiatingOccupancyOnly
– -XX:ParallelGCThreads=8
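The sizing rules on this slide can be turned into a quick calculation. The following is a minimal sketch, not the actual hdfs_nn_jvm_opts rule: the function name, the rounding up to whole gigabytes and the use of -XX:NewSize / -XX:MaxNewSize to set the new generation are assumptions. It ignores current heap usage, which the real rule also considers.

```python
import math

BYTES_PER_OBJECT = 200        # per file, directory or block (from the slide)
BUFFER = 0.25                 # 25% headroom (from the slide)
NEW_GEN_CAP_MB = 8 * 1024     # new generation capped at 8G (from the slide)
GIB = 1024 ** 3


def suggest_nn_jvm_opts(total_objects):
    """Suggest NameNode JVM options from the HDFS object count.

    Hypothetical helper; rounding to whole gigabytes is an assumption.
    """
    needed_bytes = total_objects * BYTES_PER_OBJECT * (1 + BUFFER)
    heap_gb = max(1, math.ceil(needed_bytes / GIB))
    # New generation: 1/8th of the heap, but never more than the cap.
    new_gen_mb = min(heap_gb * 1024 // 8, NEW_GEN_CAP_MB)
    return [
        f"-Xms{heap_gb}g",
        f"-Xmx{heap_gb}g",
        f"-XX:NewSize={new_gen_mb}m",
        f"-XX:MaxNewSize={new_gen_mb}m",
        "-XX:+UseConcMarkSweepGC",
        "-XX:CMSInitiatingOccupancyFraction=70",
        "-XX:+UseCMSInitiatingOccupancyOnly",
        "-XX:ParallelGCThreads=8",
    ]


if __name__ == "__main__":
    # Example: a namespace with 100 million files, directories and blocks.
    print(" ".join(suggest_nn_jvm_opts(100_000_000)))
```

Re-running the calculation as the object count grows is the point: a heap that was adequate at 100 million objects is not adequate at a billion.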

Addressing: Very high CPU usage but low throughput

Daunting Questions
⬢ Is THP applicable to my OS version?
⬢ Is it disabled? Completely disabled?
⬢ How do I make sure it is disabled on newly added nodes too?
⬢ How do I make these configurations person independent?

SmartSense Answer
⬢ Rule: os_thp
⬢ Checks if THP is completely disabled
⬢ Provides OS-specific disabling instructions
⬢ Continuous evaluation that validates newly added nodes and re-commissioned nodes

Disable THP

⬢ For RedHat & CentOS
echo "never" > /sys/kernel/mm/redhat_transparent_hugepage/enabled

⬢ For Debian, Ubuntu & SUSE
echo "never" > /sys/kernel/mm/transparent_hugepage/enabled
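To verify the setting on a node, the same sysfs files can be read back. This is a minimal sketch and not the os_thp rule itself; the helper names are made up for illustration. It relies on the kernel listing all modes in that file and marking the active one in square brackets, for example `always madvise [never]`. It checks only the `enabled` file, so it does not prove THP is completely disabled.

```python
import re
from pathlib import Path

# The two locations named on the slide.
THP_PATHS = [
    "/sys/kernel/mm/redhat_transparent_hugepage/enabled",
    "/sys/kernel/mm/transparent_hugepage/enabled",
]


def active_thp_mode(content):
    """Return the bracketed (active) mode from a THP 'enabled' file, or None."""
    match = re.search(r"\[(\w+)\]", content)
    return match.group(1) if match else None


def thp_disabled_on_this_host():
    """True/False if a THP file was found, None if neither path exists."""
    for path in THP_PATHS:
        candidate = Path(path)
        if candidate.exists():
            return active_thp_mode(candidate.read_text()) == "never"
    return None


if __name__ == "__main__":
    print("THP disabled:", thp_disabled_on_this_host())
```

Running a check like this on every node, including newly added and re-commissioned ones, addresses the "how do I make sure" questions above without depending on one person remembering to do it.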

[Charts: System CPU, User CPU, N/W IO]

Addressing: Cascading impact and cluster melt down

Daunting Questions
⬢ Should I finalize the upgrade?
⬢ What is the right time to finalize?
⬢ How do I make sure it does not fall through the cracks?

SmartSense Answer
⬢ Rule: hdfs_nn_finalize_upgrade
⬢ Checks HDFS health after upgrade
⬢ Evaluates how long HDFS has been running in an un-finalized state
⬢ Reminds until it is finalized

HDFS Upgrade finalization

⬢ Check NN UI / JMX for upgrade status
⬢ Do not finalize HDFS upgrade until
– All files and blocks have been verified after upgrade
– Critical jobs have been executed at least once after upgrade
⬢ Finalize between 2 and 7 days after upgrade
hdfs dfsadmin -finalizeUpgrade
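The finalization guidance amounts to a small decision rule. The sketch below is illustrative only; the function name, its arguments and the returned messages are assumptions, and the two preconditions from the slide are collapsed into a single `checks_passed` flag.

```python
from datetime import date

MIN_DAYS, MAX_DAYS = 2, 7   # finalization window suggested on the slide


def finalize_advice(upgraded_on, today, checks_passed):
    """Advise whether to finalize an HDFS upgrade (hypothetical helper).

    checks_passed: files and blocks verified and critical jobs run at least once.
    """
    days = (today - upgraded_on).days
    if not checks_passed:
        return "wait: verify files, blocks and critical jobs first"
    if days < MIN_DAYS:
        return "wait: too early to finalize"
    if days <= MAX_DAYS:
        return "finalize: run hdfs dfsadmin -finalizeUpgrade"
    return "overdue: finalize as soon as possible"


if __name__ == "__main__":
    print(finalize_advice(date(2016, 6, 1), date(2016, 6, 4), True))
```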

Addressing: Overloaded cluster

Daunting Questions
⬢ What is the right container size for my cluster?
⬢ If I add additional components (HBase, Storm), how does the container size change?
⬢ How do container sizes change when I add new types of nodes to the cluster?
⬢ What is the impact on container sizes if I add SSDs to the nodes?

SmartSense Answer
⬢ Rules: yarn_container_size, mr_container_size, tez_container_size
⬢ Evaluates resources available on each individual host (CPU, Memory, Disks, Running Services etc.)
⬢ Calculates technology-specific container sizes (MR, Tez, Hive)
⬢ Continuously evaluates as the cluster dynamics change

Container sizing

⬢ Identify resources (CPU, Memory, Disks) available on each node
⬢ Keep aside resources required for other processes (OS, DN, NM, HBase RS)
⬢ Calculate max possible containers for each resource (CPU, Memory, Disks)
– CPU Containers: 4x cores
– Disk Containers: (3x HDD + 10x SSD)
– Memory Containers: (Available RAM / 2)
⬢ Number of containers = Min (CPU Containers, Disk Containers, Memory Containers)
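The container-count formula can be written out directly. This is a minimal sketch of the arithmetic on the slide, not the SmartSense rules; the function name is hypothetical, and it assumes "Available RAM" is given in GB after setting aside memory for the OS and other services.

```python
def max_containers(cores, hdds, ssds, available_ram_gb):
    """Return (container count, limiting resource) for one node.

    Hypothetical helper. available_ram_gb is assumed to be whole GB left
    over after reserving memory for the OS, DN, NM, HBase RS and so on.
    """
    limits = {
        "cpu": 4 * cores,
        "disk": 3 * hdds + 10 * ssds,
        "memory": available_ram_gb // 2,
    }
    bottleneck = min(limits, key=limits.get)
    return limits[bottleneck], bottleneck


if __name__ == "__main__":
    # Example node: 16 cores, 12 HDDs, no SSDs, 96 GB available.
    print(max_containers(16, 12, 0, 96))
```

Returning the limiting resource alongside the count shows why the answer changes when hardware changes: adding SSDs only helps a node whose bottleneck is disk.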

Addressing: Accidental deletion of critical datasets

Daunting Questions
⬢ Is HDFS trash enabled?
⬢ What is a safe trash interval?
⬢ How to prevent accidental deletion of critical data?

SmartSense Answer
⬢ Rule: hdfs_trash_interval
– Checks if trash is enabled
– Validates that the trash interval is within reasonable limits
⬢ Rule: hdfs_nn_protect_imp_dirs
– New feature available in Hadoop 2.8
– Lets you mark critical directories such as “/”, “/user”, “/user/apps/hive”, “/user/apps/hbase” etc. as delete protected

HDFS Trash interval and directory protection

fs.trash.interval specifies the number of minutes after which trashed data gets deleted
– 0 means trash is disabled (data gets deleted immediately)
– Keep it in the range 1440 (1 day) to 10080 (7 days)
– Recommended: 4320 (3 days)

fs.protected.directories specifies directories that will be delete protected
– Available from Hadoop 2.8
– List all key directories there ("/", "/user", "/user/apps", "/user/apps/hive", "/user/apps/hbase", "/user/apps/hbase/data", "/mapred", "/mapred/system", "/tmp" etc.)

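The trash-interval guidance can be expressed as a simple validation. The sketch below is not the hdfs_trash_interval rule; the function name and the returned labels are assumptions, and only the thresholds come from the slide.

```python
MIN_MINUTES = 1440           # 1 day
MAX_MINUTES = 10080          # 7 days
RECOMMENDED_MINUTES = 4320   # 3 days


def check_trash_interval(minutes):
    """Classify an fs.trash.interval value (hypothetical helper)."""
    if minutes < 0:
        raise ValueError("fs.trash.interval must not be negative")
    if minutes == 0:
        return "disabled"      # deletes bypass trash entirely
    if minutes < MIN_MINUTES:
        return "too short"
    if minutes > MAX_MINUTES:
        return "too long"
    return "ok"


if __name__ == "__main__":
    for value in (0, 360, RECOMMENDED_MINUTES, 43200):
        print(value, check_trash_interval(value))
```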
Addressing: Hive query returning random results

Daunting Questions
⬢ Is my cluster configured consistently?
⬢ How do I prevent such hard-to-analyze issues?
⬢ How do I make sure newly added nodes do not bring these types of issues?
⬢ How do I make these setups person independent?

SmartSense Answer
⬢ Rule: os_time_zone
⬢ Checks if all hosts have the same time zone
⬢ Rule: os_service_ntpd_on
⬢ Makes sure all host times are in sync
⬢ Continuous evaluation that validates newly added nodes and re-commissioned nodes

There are 250+ more such rules

Operations
hdfs_dn_volume_tolerance
hdfs_dn_xceivers
hdfs_nn_handler_count
…
yarn_zk_quorum
yarn_nm_recovery
…
os_hostname_reverse_lookup
os_ssd_tuning
…
hive_mr_strict_mode
hive_datanucleus_cache
…
tez_am_heap
tez_shuffle_buffer
…

Performance
ams_mc_distributed_configs
ams_mc_write_path
...
hbase_jvm_opts
hbase_rs_open_region_threads
hbase_tcp_nodelay
...
hdfs_dn_jvm_opts
hdfs_mount_options
hdfs_nn_dn_staleness_interval
...
hive_auto_convert_join
hive_disable_caching
hive_enable_cbo
...

Security
hdfs_dn_volume_tolerance
hdfs_audit_log
hdfs_block_access_token
hdfs_enable_security_check
hdfs_nn_super_user_group
hdfs_zkfc_ha_acl
...
ranger_policy_refresh_interval
smartsense_2_way_ssl_enabled
...
yarn_ats_security
yarn_enable_acl
...

There is more than just configurations

⬢ How do I show back/charge back my tenants?
⬢ Who are the top users of my platform?
⬢ What type of workloads are running on my cluster?
⬢ Which jobs have significant impact on my cluster?
⬢ How do I improve performance of key jobs?
⬢ What is a good time for maintenance?

Activity Analysis

Summary

⬢ There are many things involved in managing a Hadoop cluster
⬢ Best practices evolve and change across versions
⬢ What is optimal today may not be optimal tomorrow
⬢ Changing cluster dynamics and workload characteristics need continuous re-evaluation and configuration adjustments
⬢ SmartSense can significantly help avoid common mistakes, issues and pitfalls, and simplify Hadoop operations

Let's keep your Hadoop cluster at its best!

Thank You!

Appendix

More Resources

⬢ https://docs.hortonworks.com/index.html
⬢ http://hortonworks.com/products/subscriptions/smartsense/
⬢ http://hortonworks.com/info/smartsense/
⬢ http://hortonworks.com/blog/introducing-hortonworks-smartsense/
⬢ https://www.youtube.com/watch?v=IKulo9c8PjE
⬢ https://community.hortonworks.com/topics/smartsense.html

SmartSense Bundle Security

⬢ All Bundles are Anonymized and Encrypted

⬢ Multiple built-in security measures
– Ambari clear text passwords are not collected
– Hive and Oozie database properties are not collected
– All IP addresses and host names are anonymized

⬢ Extensible security rules
– Exclude properties within specific Hadoop configuration files
– Global REGEX replacements across all configuration, metrics, and logs

SmartSense Stack Support

SmartSense 1.x

⬢ HDP: 2.4, 2.3, 2.2, 2.1, 2.0
⬢ Ambari: 2.2 (built-in), 2.1 (plug-in), 2.0 (plug-in), 1.7, 1.6
