Sunteți pe pagina 1din 19

Incorta

Update for Change


Healthcare
Overview

• Incorta Advanced Data Management

• Incorta UI Widget Improvements

• Sizing Hardware

• Open Issues Status


Incorta Advanced Data
Management
Load Billions of Records in Parquet File

Use Incorta Extract to load


data from numerous data
sources to Staging Area
(Parquet file)
Aggregate and Filter Data using Incorta MV SQL

Extract billions of records


in Parquet File and write
Incorta MV SQL to
aggregate and filter data
from Parquet file and load
to Incorta Snapshot to
analyze data
Functions Supported in Incorta MV SQL
Example: Replace Function in Incorta MV SQL
Example: Transpose using Incorta MV SQL
Transpose data shown in table on left to table on right using Incorta MV SQL shown below
Data Transformation

Ability to do any kind of Data


Transformation and Data
Wrangling using Incorta MV
Python. Transform and load to
Incorta Snapshots to analyze
data in Incorta Dashboards.
Replace and Parse Function to Clean Data

Clean data using


Replace and
parseDouble
Function. Many
more functions are
available to clean
data.
Incorta UI Widget Improvements
Improvement: Line stops for no Value

Line is stopping
here for NO value
Improvement : Average/Linear/SMA Line Type

Lot more options for


specifying Line Type
Notes for Insight (Coming Soon)

Clicking On “i” icon gives


developer ability to describe
insight. Consumer user can view
details by clicking on “i” icon.
Sizing Hardware
Sizing Hardware
• Incorta with Incorta MV SQL requires 32G memory
• Massive amount (billions of records) of data can be loaded in Parquet
file using Incorta Extract
• Using Incorta MV SQL and Incorta MV Python you can control how
much data you want to load in Incorta Snapshots to analyze data
• If aggregated and filtered data for each division is 2G, then you need
32G + 5 times 2G = 42g Memory for 5 divisions
Sizing Hardware

Example:
Aggregating data
from 919k records
to 132 records
using Incorta MV
SQL can reduce
memory footprint
from 13 MB to
11.65 KB
Status of Issues Reported During
POC
Status of Issues Reported During POC
# Issue/Enhancement Status Priority
1 SQL Server Connector - Provide Domain Parameter Available High
2 Load data only to Parquet file Available High
3 Filter data from Parquet file and load to Incorta Snapshot Available High
4 Aggregate and Filter data from Parquet File using SQL and Load to Incorta Snapshot Available High
5 Ability to use Replace function in Spark SQL Available High
6 Status/Log details, Record Count when loading data Available High
7 777 permission for all files after install (security issue) Available High
8 Ability to upload tab delimited file Available High
9 Support for uploading Pipe separated text file Existing Functionality High
10 Moving avg line - no control over line type Available Medium
11 Line chart shows 0 for no data in 2016 Dec month for YOY comparison Available High
12 Transpose data usinf SQL Available High
13 Replace and Parse functions to clean data Available High
14 Duration of load is not showing correctly Available Medium
15 Row level count details are not shown for MV Load Open Low
16 Show full value in chart. Shows 5M instead of 5000000 Open Low
17 Notes for insight (annotations). Example - why dip is there in the chart Open Low
18 Glossary. Example - Explain formula used in the chart Open Low
19 Support for uploading .txt file with tab separated data Open Low

Note: Open Issues shown here are low priority and easy to fix

S-ar putea să vă placă și