Sunteți pe pagina 1din 7

Actian

PAGE 12
OPERATIONAL ANALYTICS
AT THE SPEED OF BUSINESS

Denodo
PAGE 13
THE LOGICAL DATA
WAREHOUSE AS THE
NEW STANDARD FOR
DATA ANALYTICS

SlamData
PAGE 14
THE MISSING LINK IN ETL

Wipro
PAGE 15
NEXT-GEN DATA
WAREHOUSES TO POWER
INTELLIGENT ENTERPRISES

THE
FUTURE
OF DATA
WAREHOUSING

Best Practices Series


10 APRIL/MAY 2019 | DBTA

RETHINKING
THE FUTURE
OF DATA
WAREHOUSING

Best Practices Series

Does the data warehouse still have about everywhere. Most larger enterprises Brain, has said. This requirement runs deep
a place in today’s fast-moving, real-time still maintain data warehouses, and small through every digital engagement.
digital enterprise? Many pundits, analysts, to medium-businesses are also finding data “Every single company I’ve worked with
and vendors have proclaimed the impend- warehouses a cost-effective option, thanks and talked to has the same problem without
ing demise of data warehousing, suggest- to the cloud. a single exception so far—poor data qual-
ing that it has become too slow, isolated, Now, data warehouses are poised to play ity, especially tracking data,” according to
and cumbersome to deliver insights at the a leading role in next-generation initiatives, Ruslan Belkin, vice president of engineer-
push of a button. However, the data ware- from AI to machine learning to the Internet ing for Salesforce.
house has proven the doomsayers wrong, of Things. While data warehouses do not For further proof of the continuing
with evidence that it is evolving into an appear frequently in marketing literature or importance of data warehousing, look to
integral and essential piece of the big data analyst reports on these emerging technolo- skills demand in the IT workforce. Tellingly,
landscape. gies, data warehousing will remain a critical data warehouse engineer has ranked sixth
For decades, companies have invested cornerstone of the foundation of the digital of the top 10 jobs in demand, a recent anal-
millions of dollars designing, implement- era ahead. ysis by Indeed found.
ing, and updating enterprise data ware- “If you can’t build a data warehouse, you There has been pressure on today’s leg-
houses as the foundation of their business shouldn’t do AI,” Andrew Ng, noted com- acy data warehouses to evolve—both archi-
intelligence systems. And they are still just puter scientist and co-creator of Google tecturally and technologically—to deliver
APRIL/MAY 2019 | DBTA 11

the agility, scalability, and flexibility that


business need to thrive in today’s data- Data warehouses are poised to play
driven economy. Alongside new architec-
tural approaches, a variety of technologies
a leading role in next-generation
have emerged as key ingredients of modern
data warehousing, from data virtualization
initiatives, from AI to machine
and cloud services to Hadoop and Spark learning to the Internet of Things.
and machine learning and automation.
Here’s the shape of the future of data
warehousing:
Data warehousing is going to be
cloud-based. What was unimaginable
just a decade ago is no longer the work-
ing reality today—enterprises are turning point where they are one and the same. with many other elements of the data envi-
to cloud to power and store their data Data warehouses, for all intents and pur- ronment, data warehouses have increasingly
warehouses. It will be versatile, providing poses, are data analytics platforms. Compa- become autonomous. These environments
both real-time and historical insight. The nies recognize that data analytical power is were originally designed to be run with as
data warehouse will work in unison with crucial to every aspect of their operations little DBA time as possible.
other components of the environment. and products, and data warehouse technol- Data warehousing is going to support
Information from data warehouses will ogy is already delivering this power. AI and machine learning to deliver results.
increasingly be the source of insights for Data warehousing is going to empower Not only will data warehouses be the foun-
both real-time and analytical actions to users like never before. The key advantage to dation of datasets for AI, but AI will also
provide customer service at the time it’s data warehouse environments is the empha- enhance the operations and capabilities of
needed, while also serving as a repository sis on self-service. Business end users have data warehouses. For example, Google has
for historical data. There has been rapid long had the capability to build queries or ask incorporated machine learning into its
growth and excitement in recent months questions of their data that had never been BigQuery data warehouse.
and years in cloud data warehouses hosted asked before, due to the limitations of data Data warehousing is still going to
by leading internet companies such as silos. Data environments are only growing occupy a central place in delivering the
Google and Amazon, which is essentially more diverse and complex, and budgets for customer experience. The heritage of the
putting a stamp of approval on the concept IT staffing are getting tighter. The platform data warehouse is built on understanding
of data warehouses in the cloud. In addi- data warehouses provide for building queries the customer in new and profound ways.
tion, traditional cloud providers also offer is proving invaluable at a time when decision No other environment maintains data that
their capabilities as a cloud service, along makers can’t afford to wait on their IT or data is so vital to CX. Data warehouses have long
with their traditional on-premise products. management departments for answers. been the established repositories for not
Data warehousing is being extended Data warehousing is going to feed into only historical customer data and demo-
into modern analytics ecosystems through data lakes, Hadoop, and Spark—as well graphics, but also can be blended with real-
the use of data virtualization. By federating as the other way around. There has been time data streams to provide on-the-spot
multiple data warehouses, data virtualization a great deal of discussion about the future services and responses to customers.
can augment traditional ETL and data rep- of data warehouses in a world increasingly The data warehouse—as a system, as a
lication processes by acting as a virtual data served by data lakes and about how tra- concept, and as a way to delivery insights
source while also isolating applications from ditional that extract, transform, and load about customers, markets, and operations—
the complexity of disparate and changing environments are encumbrances when data isn’t going away anytime soon. Data ware-
underlying data sources. needs to tapped on-the-fly for any and all houses are increasingly becoming an even
Data warehousing is going to be ana- applications. more critical part of the digital world. n
lytical. The data warehouse world has Data warehousing is going to require 
blended with the analytics world to the fewer people to populate and operate. As  —Joe McKendrick
12 APRIL/MAY 2019 | DBTA Sponsored Content

Operational Analytics at the


Speed of Business
Accelerating analytics to and accurate data, but also current informed, tactical decisions, these
operate in-the-moment. From data, so they can respond to changes in employees need accurate and real-time
strategic decision-making to low-level the moment. data insights.
operations and customer experience,
your entire company must have up-to- MANAGEMENT NEEDS REAL- CUSTOMERS EXPECT
date information and insights to keep TIME INSIGHTS TO ACHIEVE REAL-TIME INSIGHTS AS
pace with the speed of business. It isn’t PRODUCTIVITY, PROFITABILITY A PART OF THE MODERN
okay for your business to be waiting on AND QUALITY GOALS CUSTOMER EXPERIENCE
daily batch updates. Sales, customer service, HR, finance, Employees and company leaders
manufacturing and logistics—almost aren’t the only people who have a need
LEADERS NEED REAL- every business process in modern for real-time data insights. Modern
TIME INSIGHTS TO MAKE companies is technology-enabled. This customer experiences are highly
INFORMED DECISIONS can be good if the systems and people automated, and customers expect
Technology innovations, customer involved in operations are working the data they view on the company’s
preference, global economics and smoothly together and everything is Website to be current. Product
market changes are causing the going well. Managers depend on data- availability, order status, shipping
environments in which companies driven insights about these business data and returns processing are where
operate to change quickly and processes to understand operational real-time operational data drive digital
dramatically. Business agility is performance, process quality and cost customer experiences. If there is a
a necessity to survive and thrive drivers, enabling them to see where change, then customers expect to see
in modern commerce. Market problems exist that require attention. the change reflected immediately—they
opportunities are short-lived, and have little tolerance for waiting until
threats are more impactful than EMPLOYEES NEED REAL- the next day for data to be refreshed.
ever. For leaders to be effective TIME INSIGHTS TO DO
in recognizing changes in the THEIR JOBS EFFECTIVELY Businesses evolve quickly, in big
environment and make informed Modern businesses are complex, strategic ways and in small tactical
decisions that lead to favorable with operations spread across teams, IT ways. Real-time data and information
outcomes, they need not only complete systems and often geographic locations. insights are what enable all parts of
For employees to be effective your business to identify, understand
in their individual roles, they and respond to changes quickly and
must understand what is decisively. Actian Avalanche—Cloud
occurring in the other parts Data Warehouse Service that enables
of the company with which you to collect and harvest data insights
they interact. Manufacturing in near real-time and at enterprise
employees and planners need scale. This can help you accelerate your
visibility of the sales-and- business-process execution, monitor
order-management pipeline. and better respond to opportunities
Sales teams need visibility and threats and provide employees and
to delivery schedules and customers with the data they need to be
logistics. Customer-service informed and effective. n
agents need visibility of
customers’ orders. To manage ACTIAN
this complexity and make www.actian.com
Sponsored Content APRIL/MAY 2019 | DBTA 13

The Logical Data Warehouse as the


New Standard for Data Analytics
Data warehouses are a great tool
to consolidate data from a variety of
operational systems to become the
reference for corporate reporting. They
are specifically shaped for analytics and
run on specialized hardware.
However, especially in the last few
years, some of its core principles have
been challenged:
• Th
 e rise of data driven decision making
required storage of vast amounts of raw
data. Traditional EDW appliances, with
an elevated cost per stored byte, were too
expensive. Cheaper distributed storage
solutions (HDFS, S3, etc.) took the lead.
• Th
 e star/snowflake schema of an EDW is hidden from the end user. Security, gover- combines and aggregates them together in
not the best way to store data for certain nance and auditing are again centralized. the virtual layer. Optimization techniques,
problems. Key-Value pair, graphs, and Data virtualization software like although similar to those in relational
other NoSQL systems are designed to Denodo follows the ideas of relational engines, have evolved differently to deal
address specific challenges. databases. It provides a metadata catalog with the nature of this problem. Techniques
•C loud vendors dominate the market. and an execution engine. It allows for like complex query rewriting, on-the-fly
Specialized Software as a Service the definition of derived views and data data movement between sources, and MPP
applications are the reference in many models. But unlike a database, it does capabilities provide the processing muscle
sectors and cloud mega-vendors are not provide storage. Instead, connections to perform efficiently.
driving infrastructure to the cloud. to different systems will feed the data The value propositions for these logical
Although these factors provided huge models in execution time. A virtual layer is architectures is simple:
advantages they also broke the premises focused on data delivery, not on storage. • Th
 ere is one place to get data. Data
of the data warehouse. The data landscape How does execution work in a system exploration and “time to data” are greatly
is fragmented, not just in location, but in like this? Underlying databases usually simplified
shape and processing paradigms. provide an execution engine, therefore, the •R eplication needs are significantly
Physical re-consolidation, although pos- virtualization engine takes advantage of reduced, which reduces HW and
sible, is less attractive than before. Volumes them . This is called query push-down. It operation costs
are too high, and replication to multiple sys- serves a double purpose: reduces processing •D ata governance is improved. Data is
tems creates brittle point-to-point connec- in the virtual layer and network traffic. If all logically consolidated, traced to the
tions. Out-of-synch data and uncontrolled the data required for a query is in a single source, and secured
replication leads to “data swamp” scenarios. system, the virtual layer does the SQL dia- As you can imagine, the benefits of a
End users pay the cost of a fragmented lect conversions and completely delegates logical data layer go beyond warehousing
landscape in the form of extended time to the query to the source. and reporting, and can be applied to
market (or, more accurately, “time to data”). However, when data comes from mul- other scenarios like Logical Data Lakes
Thus, it seems that a logical approach is tiple sources, the optimizer needs to come to feed data scientists. n
more feasible: a logical layer that connects up with a multi-source execution plan. The
different systems and exposes them as one. plan is split into multiple branches that DENODO
The complexity of the back-end systems is bring partial results from each source, and www.denodo.com
14 APRIL/MAY 2019 | DBTA Sponsored Content

The Missing Link in ETL


ETL (extract, transform, load) data model. And the majority of Web •A  ny data (JSON,CSV,XML),
has been around for decades. Its APIs provide a JSON payload. JSON regardless of complexity
primary purpose is moving data from data is unlike traditional relational data • More agile and accurate than cus-
source locations to data warehouses in many ways, including non-fixed tom coding
so analytics and data science teams variable schema, variable data types, • Fast high-performance streaming
can perform analysis across a range and the ability to have “nested” data engine for large amounts of data
of critical data sources with standard structures. This presents a major hurdle • Adjusts automatically to changes
tools. More recently, with the rise of when companies need to access this in data
low-cost cloud object storage, like AWS data for analytics purposes.
S3, Azure Blob storage, and others, this Analytics tools expect the data to be WHO CAN USE
process has morphed into ELT (extract, in a fixed tabular form (think spread- SLAMDATA REFORM?
load, transform). In this process, the sheet) of rows and columns. In order to Data Integration Engineers—Makes
data transformations are pushed further do this, the data needs to be transformed their job easier, less coding, ability to
down the pipeline which somewhat from the JSON model to the tabular respond to users’ needs faster
streamlines the problems and also model. Traditional ETL/ELT solutions Data Architects—Makes their job
lowers overall costs. ETL/ELT tools cannot handle complex JSON well, if at easier, less coding, ability to respond to
have flourished in the last decade as all. To be clear, they all claim that they users’ needs faster
the volume and variety of data sources handle JSON, but what they really mean Business Analysts—Lets them have
that enterprises need to handle has is that they can do some very simple REAL self-service against complex
exploded. However, there is one obvious things, and then require engineers to JSON data (nobody else can really say
gap in the solution space, complex write complicated code to solve the this)
JSON data, which coincidentally is also rest. Most companies don’t even bother Data Scientists—Lets them have
one of the most popular and rapidly trying to use commercial ETL/ELT REAL self-service against complex
growing kinds of data we see in the software and simply have highly paid JSON data (nobody else can really say
market. Unlike traditional relational data integration engineers write custom this)
or tabular data, JSON does not have a code to transform their JSON data. This
one-size-fits-all data model. In fact, it approach is slow, complicated, expensive, FAST AND EASY TO INSTALL
can range from very simple to unbeliev- and not self-service in any way. SlamData REFORM is a software
ably complex depending on the whims tool (so no added SaaS compliance or
of the developers building the applica- FINALLY, A SOLUTION FOR security issues) and is also available
tions that create the JSON data. When TRANSFORMING JSON in the AWS Marketplace. Users can
existing ETL/ELT vendors say that they SlamData REFORM is a revolutionary install the solution within their existing
support JSON they mean they support solution lets ANY user visually prepare infrastructure and use it as they need,
VERY simple flat JSON. As soon as analytics-ready tables directly on the securely.
complexity goes up, they go down, and JSON data, regardless of complexity. This REFORM supports JSON data stored
revert to the familiar approach: Start means ZERO CODING and no waiting in AWS S3, Azure Blob Storage, Wasabi,
writing custom code! Some vendors on Data Integration Engineers. Users can and MongoDB. We can add a new con-
don’t even try to avoid code; they curate out custom data sets in minutes, nector to any JSON data source quickly
actually build a coding engine in their and then iterate over them at any time as with our advanced Lightweight Connec-
platform to handle complex JSON. So, their data needs change. These tables can tor Technology (LWC).
the harsh reality is that complex JSON be streamed into all popular data ware- Learn more about SlamData
is the last ETL problem to be solved. houses, including Redshift, Snowflake, REFORM at http://slamdata.com or see
and Teradata, or pushed to any other it in action in this informative video. n
BUT IS JSON DATA DIFFERENT? destination you choose.
JSON data is some of the most •Z ero coding solution
common data created today. Virtually •A ny users, not just engineers
all SaaS applications, Mobile applica- can make complex JSON data SLAMDATA
tions, and IoT have JSON as the default analytics-ready https://slamdata.com/
Sponsored Content APRIL/MAY 2019 | DBTA 15

Next-Gen Data Warehouses


to Power Intelligent Enterprises
In today’s digital era, consumers around to massive volumes of data in terabyte/ infrastructure, and services. Additionally,
the world are driving organizations to trans- petabytes. Such volumes facilitate built-in resiliency, enterprise-grade security,
form themselves into intelligent enterprises in-depth analysis and computing on a and protected data-sharing capabilities
by embracing technological innovation in large scale to build various forecasting are making them intelligent enough to
artificial intelligence (AI), cloud, and Internet models, empowering businesses with empower users for generating insights in
of Things (IoT). These innovations can actionable insight. Harvard Business a self-service consumption model. With
radically impact businesses with adoption of Review Analytic Services recently the advent of AWS, MS Azure, and Google
right strategy to harness the power of data published a report on the advantages Cloud, immense business benefits can be
and analytics to aid digital transformation. real-time data and analytics can bring realized that include:
The need of the hour is to move from a to an enterprise, helping to build a truly • Creation of a data-driven customer
“system of records” to ”actionable insights” data-driven intelligent enterprise. journey, resulting in increased
through successful delivery of intelligent data Next-generation data warehouses customer satisfaction
platforms that can aid real-time analytics, are on-demand, secure, and scalable • Enhanced business agility and faster
providing the right data, on demand. The self-service data centers that fully auto- time-to-market, enabling improved
foundation of a successful, intelligent enter- mate the provisioning, administration, and faster decision making
prise will be next-generation data warehouse tuning, backup, and recovery of data. • Reduced infrastructure, maintenance,
platforms, which can enable any kind of data This accelerates analytics and actionable and admin overhead costs, resulting in
provisioning in a digitally disrupted world. insights while minimizing administra- improved ROI
Traditional data warehouses served the tion requirements. Next-generation data • Anytime/anywhere access, enabling
need of descriptive analytics on core trans- warehouses also provide real-time, com- self-service BI capabilities
actional systems capturing only 20-25% plete access from surface-level analytics • Automation based on AI/ML
of all enterprise data. These warehouses components to the core in-memory With the tremendous growth that
cannot keep pace with business disruption platform. This allows businesses to ingest analysts are predicting in analytical database
and are a big impediment to agile business and store structured and unstructured management over the next three years, the
analytics and digital computing. data, and also transform raw data assets. next-generation data warehouse market will
Some fundamental limitations to the A complete portfolio of data exploration, be shaped by the following forces:
traditional data warehouses include: reporting, analytics, machine learning, • The emergence of data warehouses in
1. I ncreased operational risk and and visualization tools can be enabled on the cloud or data warehousing-as-a-
threat of data breach the data for accelerated analytics without service (DWaaS)
2. L ack of scalability, affecting business replicating data. With next-generation • The need for data warehouse
agility and time-to-market data warehouses, organizations do not infrastructure to support big data
3. I ncreased latency issues as data need an innovation-limiting, pre-defined • Increasing demands for low latency
volumes grow with complexity schema that limits their ability to harness and high-speed analytics
4. L ack of accuracy in ROI insights from available information. • The increased role of business intelli-
quantification gence in enterprise management
5. T ightly coupled platform and inte- THE ADVANTAGES OF • The commoditization of data ware-
gration affecting agility NEXT-GENERATION house software and hardware
6. Provisioning for structured data only DATA WAREHOUSES With the evolution of data warehouses
Today, data processing has become Cloud is the cornerstone for next- in the cloud, it is time to take away the
more evolved and complex with mobile, generation data warehouses, given the complexity traditionally associated with
social media, cloud, machine, and sensor advantages in cost, scalability, performance, business intelligence infrastructure and
data integration. These new data sources anytime/anywhere access, security, and democratize data. Next-generation data
have tremendous business value to be ease of administration. Many enterprises warehouses have the ability to truly enable
unearthed and monetized. Business need have started their data-to-decision a big leap forward in enterprises, allow-
has evolved from descriptive/diagnostic transformational journey enabled by ing on-demand access to make informed
to predictive/prescriptive analysis. This hybrid, public, and private clouds. With business decisions. n
change in analysis is possible only when the advantage of hybrid and cloud-
data is captured in its most native form native platforms, next-generation data WIPRO LTD.
through streaming, in near real-time, and warehouses are becoming smarter in all Visit: https://www.wipro.com/analytics/
merged with historical data amounting three dimensions—storage, computing Email: ask.analytics@wipro.com

S-ar putea să vă placă și