Apache Hive: What to Expect in the Next Release. Carl Steinbach. Real-Time Big Data Meetup, March 2013. Speaker Bio: Carl Steinbach. Currently: Engineer @ Citus Data; PMC Chair, Committer -- Apache Hive Project. Formerly: Cloudera, Informatica, NetApp, Oracle.
Real-Time Big Data Meetup, March 2013
Apache Hive: What to Expect in the Next Release
Carl Steinbach
2
Speaker Bio: Carl Steinbach
Currently:
Engineer @ Citus Data
PMC Chair, Committer -- Apache Hive Project
Formerly: Cloudera, Informatica, NetApp, Oracle
Contact:
Twitter: @cwsteinbach
LinkedIn: carlsteinbach
3
What is Apache Hive?
SQL to MapReduce (OLAP, not OLTP)
MetaStore
Format Handlers
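To make "SQL to MapReduce" concrete, here is a minimal Python sketch of how a GROUP BY aggregate could be split into map, shuffle, and reduce steps. The table `page_views`, its rows, and the function names are hypothetical, and this is only an illustration of the idea, not Hive's actual query compiler.

```python
from collections import defaultdict

# Hypothetical input rows for a table page_views(country, views).
rows = [("US", 3), ("DE", 1), ("US", 2), ("DE", 4), ("JP", 5)]

def map_phase(rows):
    """Emit one (group key, value) pair per input row."""
    for country, views in rows:
        yield country, views

def shuffle(pairs):
    """Collect values by key, as happens between the map and reduce steps."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups

def reduce_phase(groups):
    """Apply the aggregate (here SUM) to each group of values."""
    return {key: sum(values) for key, values in groups.items()}

# Roughly: SELECT country, SUM(views) FROM page_views GROUP BY country
totals = reduce_phase(shuffle(map_phase(rows)))
print(totals)
```

The scan-and-aggregate shape of this job is what makes the approach a fit for OLAP rather than OLTP: it reads the whole input instead of looking up individual records.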
4
What’s New?
HiveServer2
- Committed earlier today…
5
What’s New?
HCatalog
- Is Merging into Hive…
6
What’s New?
Columnar Formats
- Optimized Row Columnar Format (ORC)
- Parquet
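As a rough illustration of why columnar formats help analytic queries, the sketch below stores the same hypothetical table in a row-oriented and a column-oriented layout and counts how many stored values a single-column aggregate has to read. It models the layout idea only; it says nothing about the actual ORC or Parquet file formats.

```python
# Hypothetical table with three columns: (user_id, country, views).
rows = [(1, "US", 3), (2, "DE", 1), (3, "US", 2)]

# Row-oriented layout: the values of one record are stored together.
row_layout = [value for row in rows for value in row]

# Column-oriented layout: the values of one column are stored together.
column_layout = {
    "user_id": [r[0] for r in rows],
    "country": [r[1] for r in rows],
    "views": [r[2] for r in rows],
}

def sum_views_row_layout(flat, width=3, col=2):
    """Model a reader that scans every stored value to pick out 'views'."""
    values_read = len(flat)
    total = sum(flat[i] for i in range(col, len(flat), width))
    return total, values_read

def sum_views_column_layout(columns):
    """Read only the 'views' column and ignore the others."""
    values = columns["views"]
    return sum(values), len(values)

row_total, row_read = sum_views_row_layout(row_layout)
col_total, col_read = sum_views_column_layout(column_layout)
print(row_total, row_read)
print(col_total, col_read)
```

Both layouts give the same answer; the columnar one reads a third of the values in this three-column example.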
7
What’s New?
Analytic SQL
- Work in progress on feature branch
- HIVE-896
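The slide names only the ticket, so the following is a sketch under an assumption: that "Analytic SQL" covers windowing functions such as LAG over a partition. The rows and the function name `lag_over_partition` are hypothetical; the code shows what such a function computes, not how Hive implements it.

```python
from itertools import groupby
from operator import itemgetter

# Hypothetical rows: (country, day, views).
rows = [("US", 1, 3), ("DE", 1, 1), ("US", 2, 5), ("DE", 2, 4), ("US", 3, 2)]

def lag_over_partition(rows):
    """Roughly: LAG(views) OVER (PARTITION BY country ORDER BY day)."""
    result = []
    # Sort by partition key, then by the ordering column.
    ordered = sorted(rows, key=itemgetter(0, 1))
    for _, partition in groupby(ordered, key=itemgetter(0)):
        previous = None  # The first row of each partition has no predecessor.
        for country, day, views in partition:
            result.append((country, day, views, previous))
            previous = views
    return result

for row in lag_over_partition(rows):
    print(row)
```

Unlike a GROUP BY, a window function keeps one output row per input row and adds a value computed from neighbouring rows in the same partition.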
8
What’s New?
Better Query Plans
HIVE-3784, HIVE-2340, HIVE-3952, HIVE-3562, HIVE-3972, HIVE-3841, HIVE-948, HIVE-3891, …
9
What’s New?
Smarter Query Compiler
MapJoin hint inferred automatically in most cases (HIVE-3784, HIVE-3403)
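A minimal sketch of the idea behind inferring a map join: if one side of the join is small enough, build an in-memory hash table from it and stream the large side past it, avoiding a shuffle. The threshold `SMALL_TABLE_LIMIT`, the function names, and the sample tables are hypothetical; Hive's real decision logic is not reproduced here.

```python
SMALL_TABLE_LIMIT = 1000  # Hypothetical threshold, in rows.

def choose_join(left_size, right_size, limit=SMALL_TABLE_LIMIT):
    """Pick a map join when the smaller input fits under the limit."""
    return "map_join" if min(left_size, right_size) <= limit else "shuffle_join"

def map_join(big, small):
    """Inner join on the first field: hash the small side, stream the big side."""
    table = {}
    for key, value in small:
        table.setdefault(key, []).append(value)
    return [
        (key, big_value, small_value)
        for key, big_value in big
        for small_value in table.get(key, [])
    ]

# Hypothetical inputs: a small dimension table and a larger fact table.
countries = [("US", "United States"), ("DE", "Germany")]
views = [("US", 3), ("DE", 1), ("US", 2), ("FR", 7)]

strategy = choose_join(len(views), len(countries))
joined = map_join(views, countries) if strategy == "map_join" else None
print(strategy, joined)
```

Making this choice automatically means the query author no longer has to annotate the query with a hint in the common case.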
10
What’s on the Horizon?
New Runtime Framework
Apache Tez…
11
What’s on the Horizon?
Vectorized Query Execution
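As an illustration of the general technique, the sketch below contrasts row-at-a-time processing with processing a column in batches, so that per-call overhead is paid once per batch rather than once per row. The batch size and function names are hypothetical, and plain Python will not show the speedup a real engine gets from tight loops over primitive arrays.

```python
BATCH_SIZE = 4  # Hypothetical; chosen small so the example is easy to trace.

def filter_sum_row_at_a_time(views, threshold):
    """Evaluate the filter and the sum one row at a time."""
    total = 0
    for value in views:
        if value > threshold:
            total += value
    return total

def batches(column, size=BATCH_SIZE):
    """Split a column into consecutive fixed-size batches."""
    for start in range(0, len(column), size):
        yield column[start:start + size]

def filter_sum_vectorized(views, threshold):
    """Apply the filter and the sum to a whole batch per step."""
    total = 0
    for batch in batches(views):
        selected = [v for v in batch if v > threshold]
        total += sum(selected)
    return total

# Roughly: SELECT SUM(views) FROM t WHERE views > 4
views = [3, 1, 5, 4, 2, 8, 7, 6, 9]
print(filter_sum_row_at_a_time(views, 4))
print(filter_sum_vectorized(views, 4))
```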
12
Real-time SQL on Hadoop
CitusDB, Impala, Apache Drill, …
What matters:
- Data locality
- Block-aware query planner
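A block-aware planner can be sketched as follows: given which nodes hold a replica of each block, assign each block's task to a node that already stores it, so the scan reads local data. The block map, node names, and the least-loaded tie-break are all hypothetical; none of the systems named above is being described specifically.

```python
# Hypothetical block placement: block id -> nodes holding a replica.
block_locations = {
    "b1": ["node1", "node2"],
    "b2": ["node2", "node3"],
    "b3": ["node1", "node3"],
    "b4": ["node3"],
}

def plan_tasks(block_locations):
    """Assign each block to the least-loaded node that stores a replica."""
    load = {}
    plan = {}
    for block in sorted(block_locations):
        replicas = block_locations[block]
        # Break ties on node name so the plan is deterministic.
        node = min(replicas, key=lambda n: (load.get(n, 0), n))
        plan[block] = node
        load[node] = load.get(node, 0) + 1
    return plan

plan = plan_tasks(block_locations)
print(plan)
```

Every task in the resulting plan runs on a node that holds its block, so no block has to be shipped across the network before it is scanned.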
13
Monthly Hive Meetups in the Bay Area
Hive User Group Meetup
Hive Contributors Group Meetup