Virtual Workshop
Unified Data Analytics
Agenda
2:10 - 2:25 PM Unifying Data Science, Business Analytics and Data Engineering
But most organizations fail to unlock value due to data, technology and people
[Diagram labels: Data; Machine Learning; Business Analytics; Production Deployment; Azure Machine Learning; Notebooks & IDEs; Model Management; Data Validation; Tables; Key/Value Store; Data Warehouse; Reprocessing; ETL Jobs; Stream; Batch; Update and Merge; ETL]
Data is messy, siloed and slow
ML is hard, production is harder
BI is limited to a fraction of data
Lack of enterprise readiness
• Fragmented security
• Poor reliability
• Disjointed governance
[Graphic: binary data streams, labelled "Lakes"]
Make all your data ready for BI and ML
ML is hard, production is harder
Enable BI directly on all your source data
1000s of users
Data sources:
• Operational databases
• Logs (unstructured)
• Files (unstructured)
• Business/custom apps (structured)
Unifying Data, AI and People
One platform for data science, ML, and analytics
[Diagram: AI, Data, People]
Customer Use Case
Strategic Partnering with Databricks
Professional Services
• Consulting, IP and accelerators to deliver projects successfully and reduce time to value
Strategic Support
• Dedicated support engineers with use case and context awareness
• Direct engineer access
Training & Certification
• Public and private training and access to Databricks Training Academy
Customer Success Engineer
• Customer backlog & case control
• Escalation route
• Mgt cadence/QBRs
Product Alignment
• Roadmap acceleration and influence
DB Evangelisation
• Lunch & learns
• Hackathons (use case or feature)
• POC support
Resident Solution Architect
• Architectural direction, design authority
• Trusted advisor
• ML & AI authority
[Diagram segments: Services, Support, Training, Customer Success, Product, Partners]
Data-driven innovations across industries
Rob Saker, Global Industry Lead for Retail & Consumer Goods
Customers want what they want,
when and where they want it
Consumer Behavior is Changing Supply Networks
Adobe says $3B of the $9.4B in Cyber Monday 2019 sales was mobile
Consumer Behavior is Changing Supply Networks
RESPONSIVE FULFILMENT
• Real-time On Shelf Availability
• Freight and Logistics Optimization
• Last mile delivery
Aggregate Level Analyses are Problematic
• Traditional analysis tools build aggregate-level analyses (for example by week, promo group or market area)
and then allocate demand to stores, SKUs and days using basic weighting allocations.
• This is fundamentally flawed. It assumes that the demand curve for each store, SKU and day
resembles the aggregate analysis, with only the quantity changing.
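The allocation problem can be illustrated with a small sketch. The store names and unit counts below are made up for illustration and do not come from the workshop. One fixed weight per store is only one simple reading of "basic weighting allocations".

```python
# Hypothetical daily unit sales for one SKU in two stores over one week
# (illustrative numbers only).
actual = {
    "store_A": [10, 10, 10, 10, 10, 40, 50],  # weekend-heavy store
    "store_B": [30, 30, 30, 30, 30, 5, 5],    # weekday-heavy store
}

def allocate(aggregate_by_day, store_share):
    """Spread an aggregate daily curve to a store using one fixed weight."""
    return [units * store_share for units in aggregate_by_day]

# Aggregate curve: total units per day across both stores.
aggregate = [sum(day) for day in zip(*actual.values())]
grand_total = sum(aggregate)

# Basic weighting: each store's share of the total weekly volume.
shares = {store: sum(days) / grand_total for store, days in actual.items()}
allocated = {store: allocate(aggregate, share) for store, share in shares.items()}

def total_abs_error(predicted, observed):
    """Sum of absolute daily differences between two curves."""
    return sum(abs(p - o) for p, o in zip(predicted, observed))

errors = {store: total_abs_error(allocated[store], actual[store]) for store in actual}
print(errors)
```

Every allocated curve has the same shape as the aggregate, so each store's weekly total is preserved while its day-by-day pattern is lost. That is the flaw the slide describes.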
Why can’t traditional tools perform fine-grained forecasts?
[Figure: Current tools vs. Azure Databricks]
Fine-Grained Analysis Enables Higher Accuracy
[Figure labels: Promo Group, Market Area, Week]
• With fine-grained forecasting, we identify the demand and depletion curve by day,
store and SKU.
• The difference between allocation and fine-grained forecasting can lead to a 10%+
improvement in forecast accuracy.
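As a rough sketch of what forecasting at the day, store and SKU grain means, the example below fits one estimate per (store, SKU, weekday) combination. The records are hypothetical. The per-group mean is a stand-in chosen for brevity, because the slides do not name a forecasting model.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical sales history: (store, sku, weekday, units).
history = [
    ("store_A", "sku_1", "Sat", 40), ("store_A", "sku_1", "Sat", 44),
    ("store_A", "sku_1", "Mon", 10), ("store_A", "sku_1", "Mon", 12),
    ("store_B", "sku_1", "Sat", 5),  ("store_B", "sku_1", "Sat", 7),
    ("store_B", "sku_1", "Mon", 30), ("store_B", "sku_1", "Mon", 28),
]

def fit_fine_grained(records):
    """Average units for every (store, sku, weekday) combination."""
    groups = defaultdict(list)
    for store, sku, weekday, units in records:
        groups[(store, sku, weekday)].append(units)
    return {key: mean(values) for key, values in groups.items()}

forecast = fit_fine_grained(history)
print(forecast[("store_A", "sku_1", "Sat")])
print(forecast[("store_B", "sku_1", "Sat")])
```

Each store keeps its own demand curve here: the two stores get very different Saturday estimates for the same SKU, which a single aggregate curve scaled by volume could not express.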
Emerging Data is Driving Consumer-Driven Businesses
[Diagram: data types shown are streaming, mobile, geospatial, shipment, inventory, POS, video, weather, competitor, SKU and batch POS data]
Azure Databricks Advantages
Capability | Traditional Analysis Suites | Databricks
Fine-grained forecasting | Aggregate level | Day or hourly, store & SKU
Multi-modal data for training | No | Structured, unstructured, image, video, sensor data
LIMITED REAL-TIME AND CAUSAL DATA
Omnichannel engagement with consumers is making real-time mobile, IoT, and other causal data more available and important.
LARGE VOLUMES OF RAPIDLY CHANGING DATA
Retail and manufacturing data is constantly shifting, being restated and changing, e.g. revised data to account for returns.
FORECASTING NOT ABLE TO SCALE TO FINE GRAIN
Companies are making tradeoffs with traditional EDW-based tools as they’re unable to complete detailed analysis at an atomic level.
NOT EASY TO GET TO ACTIONABLE INSIGHTS
Store/distribution managers receive summarized data from data warehouses the next day, making many time-sensitive insights unactionable.
Azure Databricks Unified Data Analytics Solves These Challenges
USE REAL-TIME AND CAUSAL DATA
Single streamlined pipeline for real-time and streaming data with Delta Lake and Apache Spark™
KEEP UP WITH CHANGING DATA
Azure Delta enables companies to seamlessly manage rapidly changing data with ACID compliance and full integration with Azure security…while greatly improving query performance
DO GRANULAR AND ACCURATE FORECASTS
Use and track 100s of ML models to forecast demand by day/store/SKU using MLflow
ACTIONABLE AND EASY INSIGHTS FOR MANAGERS
Power BI natively integrates with Azure Delta, enabling front-line users to access analytic data as it is generated. It turns Power BI into a next-generation real-time analytic powerhouse.
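To make "rapidly changing data" concrete, here is a minimal pure-Python sketch of update-and-merge (upsert) semantics for restated sales rows, such as figures revised to account for returns. It is not the Delta Lake API and gives no ACID guarantees. The function name `upsert`, the key fields and the sample rows are all assumptions made for illustration.

```python
def upsert(table, updates, key_fields=("store", "sku", "day")):
    """Return a new table in which rows from `updates` replace rows with the
    same key, and rows with unseen keys are appended (merge semantics)."""
    merged = {tuple(row[f] for f in key_fields): row for row in table}
    for row in updates:
        merged[tuple(row[f] for f in key_fields)] = row
    return list(merged.values())

# Hypothetical sales rows already loaded.
sales = [
    {"store": "A", "sku": "sku_1", "day": "2019-12-02", "units": 40},
    {"store": "B", "sku": "sku_1", "day": "2019-12-02", "units": 30},
]

# Restatement: two units returned at store A, plus one late-arriving row.
restated = [
    {"store": "A", "sku": "sku_1", "day": "2019-12-02", "units": 38},
    {"store": "A", "sku": "sku_1", "day": "2019-12-03", "units": 12},
]

current = upsert(sales, restated)
for row in current:
    print(row)
```

The restated row overwrites the earlier figure for the same store, SKU and day, and the new row is added. A transactional table format would additionally ensure that readers never see a half-applied batch.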
What You Need for Consumer Driven Supply Chain
[Diagram inputs: inventory, consumer, geo-location, IoT, competitor, POS, pricing, mobile app, shipment and web traffic data]
[Diagram outcomes: Forecast Demand; Single View of Supply Chain; Competitive Fulfillment]
Use Case Maturity Model: Consumer-Driven Supply Chain
[Diagram axis: Innovation and Business Value, from Starter Use Cases to Advanced Use Cases]
Typical Data Sets
Streaming Data: Social, Location/IP, Clickstream
Batch Data: POS, Inventory, Marketing, Billing & Payment, Third Party Research, Product Catalog
Use cases, from starter to advanced:
• Safety Stock: Plan how much inventory you want at each location by SKU by day
• Ordering Plan: Understand exactly how much of each SKU needs to be ordered and when
• Shipment Plan: Create a plan for the fastest, most optimal way to get product into customer hands
• Promotion Plan: Optimize inventory and customer satisfaction further with clear forecasts