14
Real-Time Big Data Meetup, March 2013 Apache Hive What to Expect in the Next Release Carl Steinbach

Real-Time Big Data Meetup , March 2013

Embed Size (px)

DESCRIPTION

Apache Hive What to Expect in the Next Release Carl Steinbach. Real-Time Big Data Meetup , March 2013. Speaker Bio: Carl Steinbach. Currently: Engineer @ Citus Data PMC Chair, Committer -- Apache Hive Project Formerly: Cloudera, Informatica, NetApp, Oracle - PowerPoint PPT Presentation

Citation preview

Page 1: Real-Time Big Data  Meetup ,  March  2013

Real-Time Big Data Meetup, March 2013

Apache HiveWhat to Expect in the Next Release

Carl Steinbach

Page 2: Real-Time Big Data  Meetup ,  March  2013

2

Speaker Bio: Carl Steinbach

Currently:Engineer @ Citus DataPMC Chair, Committer -- Apache Hive

Project

Formerly:Cloudera, Informatica, NetApp, Oracle

Contact:Twitter: @cwsteinbach

LinkedIn: carlsteinbach

Page 3: Real-Time Big Data  Meetup ,  March  2013

3

What is Apache Hive?

SQL to MapReduce(OLAP, not OLTP)

MetaStore

Format Handlers

Page 4: Real-Time Big Data  Meetup ,  March  2013

4

What’s New?

HiveServer2- Committed earlier today…

Page 5: Real-Time Big Data  Meetup ,  March  2013

5

What’s New?

HCatalog- Is Merging into Hive…

Page 6: Real-Time Big Data  Meetup ,  March  2013

6

What’s New?

Columnar Formats

- Optimized Row Columnar Format (ORC)- Parquet

Page 7: Real-Time Big Data  Meetup ,  March  2013

7

What’s New?

Analytic SQL- Work in progress on feature branch- HIVE-896

Page 8: Real-Time Big Data  Meetup ,  March  2013

8

What’s New?

Better Query Plans

HIVE-3784, HIVE-2340, HIVE-3952, HIVE-HIVE-3562, HIVE-3972, HIVE-3841, HIVE-948, HIVE-2340, HIVE-3891, …

Page 9: Real-Time Big Data  Meetup ,  March  2013

9

What’s New?

Smarter Query Compiler

MapJoin hint inferred automatically in most cases (HIVE-3784, HIVE-3403)

Page 10: Real-Time Big Data  Meetup ,  March  2013

10

What’s on the Horizon?

New Runtime Framework

Apache Tez…

Page 11: Real-Time Big Data  Meetup ,  March  2013

11

What’s on the Horizon?

Vectorized Query Execution

Page 12: Real-Time Big Data  Meetup ,  March  2013

12

Real-time SQL on Hadoop

CitusDB, Impala, Apache Drill, …

What matters:Data LocalityBlock aware query planner

Page 13: Real-Time Big Data  Meetup ,  March  2013

13

Monthly Hive Meetups in the Bay Area

Hive User Group Meetup

Hive Contributors Group Meetup

Page 14: Real-Time Big Data  Meetup ,  March  2013

14

We’re Hiring

citusdata.com/job