Documente Academic
Documente Profesional
Documente Cultură
Rick Pandya
Oracle Database Product Management
Catalog/
Call Web
Center
Retail
Decisions
Social
Search
Looking ahead
Networks
“FUTURE”
“I think” “I want”
Copyright © 2014, Oracle and/or its affiliates. All rights reserved. |
Big Data: Challenge to Opportunity
Harness Big Data to Increase Business Value
Business → Deep Analytics
Value Big Data → High Agility
Platform → Massive Scalability
→ Real Time
Tomorrow
→ High Variety
→ High Volume
Challenges → High Complexity Big Data
→ High Velocity
Today
Time
Copyright © 2014, Oracle and/or its affiliates. All rights reserved. |
Big Data: Infrastructure Requirements
SQL
Schema DBMS DBMS Advanced
ETL Trusted
(OLTP) (DW) Analytics
Secure
Administered
• Time to Build?
• Required Optimizations?
• Cost and Difficulty Maintaining?
Copyright © 2014, Oracle and/or its affiliates. All rights reserved. |
Oracle Big Data Appliance
Hardware Overview
HC
Starter
Full
Multi-Rack
• Start with 6 BDA Servers and all switches
- Add BDA HC Nodes as needed
• Can expand older machines with new generation servers
Copyright © 2014, Oracle and/or its affiliates. All rights reserved. | 16
How BDA Elastic Configurations Work
• Start with a BDA Starter Rack
– 6 BDA Servers
– All Switching Included (Leafs, Spine and Management)
• Add BDA HC Node
– Single node increments
– No need for a 6-node In-Rack Expansion upgrade
– Up to a Maximum of 18 BDA HC Nodes in a Rack
• Assembled with requested number of servers
by Oracle
HC – Option: Add servers later at customer site
– Can add BDA HC Node to older (x2 to x4) machines
• Standard Configurations remain available
– Starter Rack, Full Rack
Operational Simplicity
Operational Simplicity
• Consistently High Performance
• Remove Bottlenecks
• Full Stack Install and Upgrades
• Simplified Operations / Management
• Cluster Growth
• Node Migration
• Always Highly Available
• Always Secure
• Latest Hardware Technologies for Hadoop and Spark
./mammoth –i rck_1
RCK_1
Day 1
RCK_1
N Example Service:
N Hadoop Name
Nodes
Copyright © 2014, Oracle and/or its affiliates. All rights reserved. | 21
Successful Big Data Systems Grow
From Cluster Install with HA to Large Clusters to Dealing with Operational Issues
Add 12 New Nodes across two Racks
Day 90
mammoth –e newhost1,…,newhostn
RCK_1 RCK_2
N
N
mammoth –e newhost1,…,newhostn
RCK_1 RCK_2
This expansion automatically optimizes HA
setup across multiple racks
RCK_1 RCK_2
N
N
N
N
* Separately licensed
Copyright © 2014, Oracle software, can
and/or its affiliates. be reserved.
All rights pre-installed
| and configured on BDA
Why Cloudera CDH on BDA?
• Managed and Tested by Cloudera
– Open Source Distribution
– Most Popular Distribution in the Market
– Rich management and configuration GUI tool
• Fast evolution in critical features
– Built by the Hadoop experts in the community
– Practical instead of esoteric
– Focus on what is needed for large clusters
• Proven at very large scale
– In production at all the large consumers of Hadoop
– Extremely stable in those environments
ASR
Manager ASR Service
Product's auto-diagnosis facility sends Service Request (SR)
SNMP trap to ASR Manager Fault telemetry securely created
transmitted to Oracle
Database Repository
Database (MySQL, • Stores configuration and monitoring
PostgreSQL) information about cluster hosts and
daemons
Authorize
access to data with fine grained controls
Audit
activity and access with Oracle Audit Vault and Database
Firewall
Encrypt
data as it flows thru the system
1 Change identity on local machine to Hadoop cluster user or group that owns the
file/directory
Delete sensitive data on cluster
2
LDAP
• Strong authentication for
Key Distribution
– Key Hadoop services Center
– Oracle Big Data Connectors Authenticate / Hadoop Service
Get Service Ticket Registration
• Ensure users are who they
claim to be
services LDAP
Key Distribution
Center (Optional)
First
Kerberos and Sentry enabled Hadoop
Appliance
Admin Group
• Superuser Role
HR Database • All privileges on BDA-HQ
Employee
• Name
• Manager Company Group
• Salary
• Viewer Role
Reviews
• Name
• View Employee table’s Name
• Date & Manager columns
• Comments
Manager Group
• Supervisor Role
Server: BDA-HQ • Select from Employee &
Reviews Tables
One
Consolidated, secure repository for all
audit data
Hadoop Audit Vault Operating Systems Centralized platform for audit reporting,
Non-Relational Data alerting and policy management
Databases
Relational Data