Objectives
Hadoop: Some Data Access/Processing Options
Component   Purpose
Hive        Puts a partial SQL interface in front of Hadoop. Includes
            a metadata “repository” called the Metastore.
Pig         A SQL-like scripting language on top of Java, for
            MapReduce programming
HBase       Applies a partial columnar scheme on top of Hadoop
Impala      A database-like SQL layer on top of Hadoop
Cloudera Impala
Cloudera Impala: Key Features
Cloudera Impala: Programming Interfaces
How Impala Fits Into the Hadoop Ecosystem
How Impala Works
How Impala Works with HDFS and HBase
• HDFS
– Impala’s primary storage mechanism
– Data stored as data files
• HBase
– Alternative to HDFS for storing Impala data
– Impala table definition can be mapped to HBase tables
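
The two storage options above can be illustrated with a small sketch that builds the kind of table definitions involved. This is a minimal illustration, not Cloudera's reference procedure: the helper names (hdfs_table_ddl, hbase_table_ddl), the table and column names, and the HDFS path are all hypothetical. The HBase mapping uses Hive's HBaseStorageHandler; such a mapping is typically created through the Hive shell and then made visible to Impala through the shared Metastore, which is an assumption about the deployment rather than something stated on the slide.

```python
def hdfs_table_ddl(table, columns, location):
    """Build DDL for an external table over delimited data files in HDFS.

    columns is a list of (name, sql_type) pairs; location is an HDFS directory.
    """
    cols = ", ".join(f"{name} {ctype}" for name, ctype in columns)
    return (
        f"CREATE EXTERNAL TABLE {table} ({cols}) "
        "ROW FORMAT DELIMITED FIELDS TERMINATED BY ',' "
        f"STORED AS TEXTFILE LOCATION '{location}'"
    )


def hbase_table_ddl(table, key_column, mapped_columns, hbase_table):
    """Build DDL that maps a table definition onto an existing HBase table.

    key_column is a (name, sql_type) pair mapped to the HBase row key.
    mapped_columns is a list of (name, sql_type, "family:qualifier") triples.
    """
    key_name, key_type = key_column
    cols = ", ".join(
        [f"{key_name} {key_type}"]
        + [f"{name} {ctype}" for name, ctype, _ in mapped_columns]
    )
    # ":key" binds the first SQL column to the HBase row key; the remaining
    # entries bind each SQL column to a column family and qualifier, in order.
    mapping = ",".join([":key"] + [hb for _, _, hb in mapped_columns])
    return (
        f"CREATE EXTERNAL TABLE {table} ({cols}) "
        "STORED BY 'org.apache.hadoop.hive.hbase.HBaseStorageHandler' "
        f'WITH SERDEPROPERTIES ("hbase.columns.mapping" = "{mapping}") '
        f'TBLPROPERTIES ("hbase.table.name" = "{hbase_table}")'
    )


if __name__ == "__main__":
    # Hypothetical example tables, for illustration only.
    print(hdfs_table_ddl(
        "sales", [("id", "INT"), ("amount", "DOUBLE")], "/user/demo/sales"
    ))
    print(hbase_table_ddl(
        "customers",
        ("id", "STRING"),
        [("name", "STRING", "info:name"), ("city", "STRING", "info:city")],
        "customers_hbase",
    ))
```

The first statement points a table definition at a directory of data files in HDFS; the second leaves the data in HBase and only records how SQL columns correspond to the row key and column families.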
Summary of Cloudera Impala Benefits