Sunteți pe pagina 1din 15

Solutions for Big Data

Data Management Solutions


from SAP for Big Data
2017 SAP SE or an SAP affiliate company. All rights reserved.

1/14
Table of Contents

3 Bridging the Big Data Value Gap

6 Data Management Solutions


from SAP for Big Data

10 In-Memory Computing with SAP HANA

14 Summary

2/14

2017 SAP SE or an SAP affiliate company. All rights reserved.


Bridging the Big Data Value Gap
Big Data for the business enterprise has been a work in progress. Analysts looking for business insights
want access to both enterprise data and the massive data sets from the Internet of Things, social
media, the Web, and operational logs. Until now, their projects have faced three daunting challenges:
where to keep the data; how to process diverse, unstructured data types; and how to unify enterprise
information architecture with Big Data environments. SAP has addressed all three challenges with its
data management solutions for Big Data.

As organizations move from collecting terabytes While data collection is occurring on a massive
of data to collecting petabytes and exabytes, the scale in the digital economy, there is a serious value
challenge of Big Data analytics has crystallized. gap between the promise and the reality of business
Whats needed now is a way to unify and process intelligence and analytics. On average, 60% to 73%
the diverse data types to make them collectively of all data within an enterprise goes unused for
useful. Without this ability, all the data that is filling these activities.1 Why? Because the technology
siloed repositories has limited value to businesses. for Big Data has had to catch up to the massive
With this ability, the promised insights from scale and diversity of data that businesses are
360-degree views of customer behavior, operational collecting with increased digitalization.
behavior, market behavior, and other facets of the
digital business can be more thoroughly analyzed
and understood. Experiments can be proposed.
Innovations can be introduced with greater confi-
dence. The promised returns from Big Data can
be realized.

1. The Forrester Wave: Big Data Hadoop Distributions, Q1 2016, Mike Gualtieri and Noel Yuhanna, Forrester Research Inc.,
January 19, 2016.

3/14

2017 SAP SE or an SAP affiliate company. All rights reserved.


Phase 1 of the journey to make Big Data useful to business
involved solving the storage challenge (see Figure 1). Today, data
lakes, data warehouses, and tiered, low-cost storage are avail-
able. They can scale to accommodate spiraling volumes of data.

Phase 2 is the need for a viable approach for processing


vastly different types of data structured, unstructured, semi-
structured, graph, time-series, JavaScript Object Notation
(JSON), and so on. Making this data useful and connecting it
to applications simplifies the handling of Big Data. It increases
the use of analytics tools by a broader array of users. Features
that address Phase 2 of the Big Data journey are now available
from SAP.

Last, Phase 3 involves unifying the data landscape. This means


merging processing engines for different data types into one
unified data pipeline across storage, compute, and analytics
solutions. Products from SAP provide this service as well.

Once all of these phases in the Big Data journey have been
addressed, the gap between the promise and reality of Big Data
analytics can be eliminated. Both IT and line-of-business
objectives can be addressed, and an end-to-end operational
model for Big Data can become a reality.

Figure 1: Phases of the Big Data Journey

Phase 1 Phase 2 Phase 3

Business
transformation

Processing

Storage

4/14
Operationalizing Big Data Analytics Addressing IT and Business Objectives

IT Objectives Solution
Utilize the broad range of different Unify data sets from corporate, transactional, customer, and
data types across the enterprise business process applications, and from Hadoop and other
data lakes
Do it more quickly Accelerate processing of data to production pipelines
Ensure ongoing performance and Solve the Big Data operational problem while balancing
cost efficiencies the costs of adding more and more data sources
Establish overall data governance Forge cooperation between diverse data owners who dont see
and management value in changing processes, and gain support from senior
management who dont understand Big Data or see the ROI

Business Objectives Solution


Create insights that lead to Combine data exploration and enterprise analytics to gain
meaningful business impacts more expansive insights from Big Data in a business context
Get Big Data analytics into Enable simplified access to data scientists, data engineers,
the hands of more people and business analytics who have worked with traditional
BI solutions
Realize digital transformation Use data to transform the business with predictive analytics,
automation, machine learning, data science, and other tools

Big Data solutions from SAP allow you to marshal


and store all data across your enterprise, quickly
go from in-production analytics to insight, and
take swift action to create business results.

5/14

2017 SAP SE or an SAP affiliate company. All rights reserved.


Data Management Solutions
from SAP for Big Data
Meet SAPs complete set of Big Data solutions. Features in the data management solutions from
These offerings allow you to marshal and store SAP for Big Data include:
all data across your enterprise, quickly go from Enterprise-ready, secure, scalable solutions
in-production analytics to insight, and take swift Tiered data storage and processing
action to create business results (see Figure 2). In-production performance and governance
In-memory processing for interactive decision
making and action
Integration with your business process applications
Cloud, on-premise, and hybrid deployment models

Figure 2: Big Data Portfolio Products from SAP


Enriching Digital Business with Frictionless Big Data: Database and Data Management Solutions from SAP
Enterprise Enterprise Analytical Custom and Data pipelines Data mining and
business business and IoT third-party Data science tools exploration Business value
applications networks applications applications
Make data universal to all
On premise and cloud analytics and applications

SAP HANA: computing services SAP Vora: distributed computing engines

Spatial Graph Machine Search OLAP Time Unified


Graph Doc store
learning series Compute and processing

Database and data management solutions from SAP


Text Streaming Series Business
analytics analytics data functions

SAP HANA: database services


Data lakes

SAP Cloud Platform


Big Data Services
Columnar Multicore and Advanced Multitenancy
OLTP+OLAP parallelization compression Elastic
Storage and persistence

S3

Multitier Data Administration High availability


storage modeling and security and disaster
recovery

SAP HANA: integration and quality services SAP solutions for EIM

Data ELT and Apache Hadoop All data


Data quality MDM Data integration
virtualization replications and open source Integrate and ingest
integration

Remote Data Data discovery and Information lifecycle


data sync quality architecture management

IoT: Internet of Things | OLAP: Online analytical processing | OLTP: Online transaction processing
ELT: Extract, load, transform | MDM: Master data management | EIM: Enterprise information management

6/14

2017 SAP SE or an SAP affiliate company. All rights reserved.


SAP VORA All operations in SAP Vora can be accessed, executed,
SAP Vora is a distributed computing solution that and extended through SAP HANA. So, now SAP
is deployed on Apache Hadoop and Spark clusters customers have a foundation for extending enter-
and provides integration with the SAP HANA prise data to distributed Big Data environments with
platform so that you can run combined analytics data sets in the petabyte range through existing
across enterprise and Hadoop data. Now you can deployments of SAP HANA. To the user, SAP HANA
effectively use your Big Data and Internet of Things and SAP Vora act as a single system, with joint query
insights in the context of business processes. optimization and automatic storage decisions.
SAP Vora runs on Big Data clusters no proprietary
hardware is required. But it is a computing solution
separate from your Hadoop data store.

A major focus of SAP Vora is to provide specialized


analytical processing for different data formats.
Multiple in-memory distributed computing engines
built into a single solution eliminate the need to
physically move data between systems for analysis.

7/14

2017 SAP SE or an SAP affiliate company. All rights reserved.


Unifying Different Data Formats As an integrated, production-ready solution with
A major focus of SAP Vora is to provide specialized enterprise-ready features, SAP Vora eliminates
analytical processing for different data formats. the need to spend time and resources stitching
Multiple in-memory distributed computing engines together multiple systems. If youre looking for
built into one single solution eliminate the need a way to simplify your IT landscape and reduce
to physically move data between systems for the complexity of working with Big Data, look no
computations and analysis. The in-memory dis- further (see Figure 3).
tributed computing engines are designed to run
high-performance sophisticated analytics against
relational, time series, graph, and JSON data.
SAP Vora empowers your organization to make
decisions in near-real time based on your entire
set of data, including different formats and from
different sources.

Figure 3: SAP Vora

Distributed computing cluster

SAP Vora SAP Vora SAP Vora

Spark Spark Spark

Files Files Files


Hadoop

Data science Predictive Business intelligence Visualization apps

SAP Vora

Data modeler

Relational Time series Graph Doc store

Disk-to-memory accelerator

Distributed transaction log

Apache Spark

Apache Hadoop

8/14

2017 SAP SE or an SAP affiliate company. All rights reserved.


Apache Spark Features User-Friendly Web Interface
With its tight integration to Apache Spark, SAP Vora SAP Vora comes with an intuitive Web interface
can use any open source application and library that provides Structured Query Language (SQL)
that runs on Spark so users dont have to switch access to Hadoop data. It makes self-service Big
to new tools to use SAP Vora. Furthermore, the Data computing possible for more users in your
extended capabilities provided by SAP Vora enhance organization, including data scientists, Hadoop
Spark with new enterprise features such as hier- developers, and even business analysts.
archies, enterprise-ready calculations, currency
conversion, and support for units. Business analysts can access a powerful yet simple-
to-use Web interface to rapidly turn Big Data on
Storage Features Hadoop into valuable insights. Drag-and-drop func-
A disk engine is used to optimize access to external tionality makes it easy to rapidly create data models,
storage service and persistency and to process both simple and complex. With one SQL entry point
queries too large for handling in main memory. for interacting with specialized computing engines,
The engines optimization features contribute to regardless of which SAP Vora computing engine is
a significant increase in query execution perfor- used to analyze data, business analysts can always
mance compared to accessing an external storage use the familiar SQL programming language to
back end for each query. interact with the data. This shortens learning curves
dramatically and helps to accelerate adoption.
Distributed Transaction Log
A flexible joint transaction management system Data scientists, software developers, and others
provides transactional consistency for data across with more sophisticated programming skills can
all engines. A high-speed persistency service take advantage of extensive programming support
called the distributed transaction log allows high- for SQL, Python, Scala, C++, and Java. SAP Vora
throughput ingestion and low-latency transactions. supports data stored in CSV, Parquet, Optimized
Metadata persistence is also enabled by the distri Row Columnar (ORC), JSON, and other formats
buted transaction log that can be recovered when and a variety of storage options such as HDFS, S3,
needed. and local le systems. This enables data scientists
and developers to design and deliver mashups from
Security multiple information sources.
Enterprise-grade security is provided through
support for Kerberos-enabled Hadoop distributions
and native Hadoop Distributed File System (HDFS)
permissions that are automatically propagated
to tables when data is loaded.

9/14

2017 SAP SE or an SAP affiliate company. All rights reserved.


In-Memory Computing with SAP HANA
SAP HANA has been considered a revolutionary The in-memory architecture also features columnar
product by the industry since it was first introduced, delta merge to speed processing. Online transaction
ushering in a new class of real-time analytics. Its processing (OLTP) and online analytical processing
in-memory-database design lets you analyze data (OLAP) workload support enables single data
within microseconds of updates to applications copying for lower total cost of ownership (TCO).
without complex layers of data management and And the multiplatform architecture lets you deploy
storage. You get answers to questions instantly. You SAP HANA on premise, on a desktop, or in the
can run high-performance applications that make cloud (see Figure 4).
business processes faster and leaner all without
data preparation, preaggregation, or tuning.

Figure 4: SAP HANA Functions

SAP HANA platform


On premise | Cloud | Hybrid

Application services Processing services Integration and quality services

</>
Data Extract, load,
Web server JavaScript Spatial Graph Predictive Search
virtualization transform and
replication
ALM

SAP Fiori Graphic Application Text Streaming Series Business Data Apache Hadoop Remote
user modeler lifecycle analytics analytics data functions quality and Spark data sync
experience management integration
Database services

Columnar Multicore and Advanced Multitenancy Multitier Data Openness Administration High availability
OLTP and parallelization compression storage modeling and security and disaster
OLAP recovery

OLTP: Online transaction processing | OLAP: Online analytical processing

10/14

2017 SAP SE or an SAP affiliate company. All rights reserved.


ADVANCED ANALYTICS SAP CLOUD PLATFORM BIG DATA SERVICES
The solutions integrated, multimodal advanced SAP Cloud Platform Big Data Services offer a
analytics lets you converge multiple complex Hadoop-as-a-service computing solution in the
analytic processing tasks without data movement cloud. For enterprises that dont have the staff,
or duplication. Native multimodal SQL supports budget, or desire to deploy and manage their own
machine learning, search, text, spatial, graph, Hadoop environment, SAP Cloud Platform Big Data
series, and streaming for better decision making Services provide a scalable, cloud-based Hadoop
with in-the-moment advanced analytics and root data store, Spark complement, and processing
cause analysis. engine for SAP Vora. SAP Vora is also available from
Amazon Web Services (AWS). It can be ordered
from the AWS Marketplace and is available for use
with community support. Use Hadoop in the cloud
to process much larger volumes of data at a lower
price, handle structured and unstructured data
storage and processing, and scale volumes quickly
and easily in a cloud environment (see Figure 5).

Figure 5: SAP Cloud Platform Big Data Services

Workbench

Search Data science Custom


Business analytics Data exploration
and discovery and modeling applications
Data transfer

Proactive help desk

SAP Vora

Apache Hadoop Apache Spark


Portal

Unified control plane

Automated operations center

Data centers optimized for Hadoop

11/14

2017 SAP SE or an SAP affiliate company. All rights reserved.


Performance and Scale Setup in Hours Instead of Months
Unlike Big Data services provided by public cloud SAP Cloud Platform Big Data Services boast
providers that sell infrastructure as a service (IaaS), a speedy setup as well. Need a terabyte-scale
SAP Cloud Platform Big Data Services have Hadoop- Hadoop and Spark platform? Well have it ready
and Spark-optimized data centers, and full opera- to go in hours instead of the many months it
tional services are included with every subscription. would take you to purchase network hardware,
These features have been shown to deliver perfor- integrate software, and hire a Hadoop team.
mance that is 10 times faster than popular IaaS
services. We want SAP customers to be able to focus Integration into the SAP Software Landscape
on driving Big Data results instead of dealing with An integral part of SAP Cloud Platform Big Data
challenging Hadoop operations that require a large, Services, SAP Vora provides distributed calculation
highly skilled, dedicated team. capabilities such as relational processing, graph,
time series, and document processing. Through
A Big Data platform in the cloud plus Hadoop opera- SAP Vora you get easy consumption and integration
tions services lowers the cost of running Hadoop of SAP landscapes, tools, and data in SAP HANA.
on premise or using an IaaS provider due to faster
job processing, lower job failure rates, and consis- SAP SOLUTIONS FOR ENTERPRISE
tent operational performance. Automated compute INFORMATION MANAGEMENT
bursting ensures that your implementation will SAP solutions for enterprise information man-
automatically scale to meet sudden increases agement (EIM) include a variety of sophisticated
in compute needs. products that support data integration, manage-
ment, association, archiving, and other activities.
Supported functions are shown in Figure 6.

Figure 6: Support from SAP for Comprehensive Enterprise Information Management

SE AND MONIT
EAN OR
CL

Before After
E

A
M
AT

AN

D B
GR

Data quality
AG
INTE

C
E

Master data
management
Data integration

GOVERN
P
UN D

TE

Data X
CIA

discovery and
ER

Content
architecture
management
ST

SO
AN

AS

Information lifecycle
D

management

A R CH IV E

12/14

2017 SAP SE or an SAP affiliate company. All rights reserved.


SAP Information Steward and SAP PowerDesigner The SAP Master Data Governance application and
software help you understand what data exists SAP Master Data Governance, enterprise asset man-
within the enterprise, and the relationship of this agement extension by Utopia, help you create
data to data in other systems. SAP Data Services a single version of company records.
software, the SAP Agile Data Preparation applica-
tion, SAP HANA smart data integration software, The SAP Extended Enterprise Content Management
SAP Replication Server, SAP Event Stream Pro- application by OpenText and SAP Invoice Manage-
cessor, SAP Landscape Transformation software, ment application by OpenText help you manage all
and the SAP Advanced Data Migration application of your structured and unstructured content.
by BackOffice Associates all help you integrate
data from different sources. As part of the data The SAP Information Lifecycle Management com-
management solutions from SAP for Big Data, ponent helps you archive your data into a backup
SAP Data Services is an option for data ingestion into environment in a systematic and organized manner
Hadoop enterprise data lakes; and the integration to free capacity in your enterprise data warehouse
of existing flows in SAP Data Services is part of or your live system.
the data flow pipeline in SAP Vora. This allows for
the flexible combination of various data pipes
with managed data flows.

SAP Data Services, SAP Agile Data Preparation,


SAP HANA smart data quality software, and the
SAP Information Steward Accelerator application
by BackOffice Associates help you clean your data
and monitor its health.

Data management solutions from SAP for Big Data


are designed to help you simplify your IT landscape,
reduce the complexity of working with Big Data, and
make a Hadoop and Spark Big Data analytics environment
easier and more accessible as a cloud service.

13/14

2017 SAP SE or an SAP affiliate company. All rights reserved.


Summary
Data management solutions from SAP for Big Data Data management solutions from SAP for Big Data
are designed to help you simplify your IT landscape, are designed to provide an array of features with an
reduce the complexity of working with Big Data, and emphasis on data replication and data governance
make a Hadoop and Spark Big Data analytics envi- capabilities. They allow you to load, cleanse, and
ronment easier and more accessible as a cloud transform a wide variety of data types for use in
service. It includes tightly integrated products and SAP HANA, SAP Vora, and SAP Cloud Platform
an optional Big Data cloud infrastructure that let Big Data Services. Within SAP solutions for EIM,
businesses like yours take advantage of distributed SAP Data Services is an option for data ingestion.
storage and the processing of very large data sets.
The Big Data journey has now reached its desti-
SAP Vora, SAP HANA, and SAP Cloud Platform nation with a comprehensive operational solution for
Big Data Services complement each other. They data gathering, storage, processing, and networking
provide an in-memory computing solution for the from SAP. The products deliver tremendous
enterprise plus a scalable Hadoop data store as benefits:
a cloud service that doubles as a Spark complement Processing engines for diverse data types to
and a processing engine for SAP Vora. make data in different formats useful for
analysis
The computing engines in SAP Vora process multi- Unification of enterprise application data and
ple data types from a wide variety of sources and Big Data into one data pipeline to feed analytics
different formats within your business, including solutions
SAP HANA and SAP software landscapes, and make Much faster analytical processing at a lower
the data available to run directly on Hadoop. Building TCO than public cloud provider IaaS offerings
on the functionality of Apache Spark, all of your Massive scalability and petabyte capacity
data can now be included in interactive Big Data
analytics applications. Are you ready to go from Big Data gathering to
actionable insights to business results? Let us
SAP Cloud Platform Big Data Services make show you how.
available the Hadoop environment from SAP as
a Hadoop-as-a-service option in the cloud. This FIND OUT MORE
is an attractive choice for businesses that dont For more information about data management
want to build and support a Hadoop infrastructure solutions from SAP for Big Data, contact your SAP
themselves. The service includes a Big Data representative or visit us at www.sap.com/vora
processing platform with Spark. SAP Vora provides or https://www.sap.com/products/technology
the SQL analytical layer on top, with access to the -platforms/big-data-hadoop.html.
SAP world of landscapes, applications, tools, and
data in SAP HANA.

14/14

2017 SAP SE or an SAP affiliate company. All rights reserved.


www.sap.com/contactsap

Studio SAP | 50959enUS (17/05)

2017 SAP SE or an SAP affiliate company. All rights reserved.

No part of this publication may be reproduced or transmitted in any


form or for any purpose without the express permission of SAP SE or
an SAP affiliate company.

The information contained herein may be changed without prior notice.


Some software products marketed by SAP SE and its distributors
contain proprietary software components of other software vendors.
National product specifications may vary.

These materials are provided by SAP SE or an SAP affiliate company for


informational purposes only, without representation or warranty of any
kind, and SAP or its affiliated companies shall not be liable for errors or
omissions with respect to the materials. The only warranties for SAP or
SAP affiliate company products and services are those that are set forth
in the express warranty statements accompanying such products and
services, if any. Nothing herein should be construed as constituting an
additional warranty.

In particular, SAP SE or its affiliated companies have no obligation to


pursue any course of business outlined in this document or any related
presentation, or to develop or release any functionality mentioned therein.
This document, or any related presentation, and SAP SEs or its affiliated
companies strategy and possible future developments, products, and/or
platform directions and functionality are all subject to change and may be
changed by SAP SE or its affiliated companies at any time for any reason
without notice. The information in this document is not a commitment,
promise, or legal obligation to deliver any material, code, or functionality.
All forward-looking statements are subject to various risks and
uncertainties that could cause actual results to differ materially from
expectations. Readers are cautioned not to place undue reliance on these
forward-looking statements, and they should not be relied upon in making
purchasing decisions.

SAP and other SAP products and services mentioned herein as well as
their respective logos are trademarks or registered trademarks of SAP SE
(or an SAP affiliate company) in Germany and other countries. All other
product and service names mentioned are the trademarks of their
respective companies.

See http://www.sap.com/corporate-en/legal/copyright/index.epx for


additional trademark information and notices.

S-ar putea să vă placă și