data,
information technology,
statistical analysis,
Business context.
Technology.
TYPES OF BA
Descriptive analytics.
Predictive analytics.
Prescriptive analytics.
Diagnostic (or detective) analytics.
IMPORTANCE OF BA:
- profitability of businesses
- revenue of businesses
- shareholder return
Data analysts collect, process and perform statistical analyses of data. Their
skills may not be as advanced as those of data scientists (e.g. they may not be
able to create new algorithms), but their goals are the same: to discover how data
can be used to answer questions and solve problems.
Data Analyst vs. Data Scientist - Differences
The job role of a data scientist requires strong business acumen and data visualization skills to
convert insight into a business story, whereas a data analyst is not expected to possess
business acumen and advanced data visualization skills.
A data scientist explores and examines data from multiple disconnected sources, whereas
a data analyst usually looks at data from a single source such as the CRM system.
A data analyst answers the questions posed by the business, while a data scientist
formulates questions whose solutions are likely to benefit the business.
In many scenarios, data analysts are not expected to have hands-on machine learning
experience or to build statistical models, but the core responsibility of a data scientist is to build
statistical models and be well versed in machine learning.
Data quality is important because we need:
- accurate and timely information to manage services and accountability
- good information to manage service effectiveness
- a sound basis to prioritise and ensure the best use of resources
R Programming.
Tableau Public.
Python.
SAS.
Apache Spark.
Excel.
RapidMiner.
KNIME.
UNIT 2: DBMS
What is SQL?
Comparison Chart
BASIS FOR COMPARISON | OLTP | OLAP
database modification.
database.
transactions.
Data | OLTP and its transactions are ... | Different OLTPs database becomes ...
Normalization | Tables in OLTP database are ... | Tables in OLAP database are not ...
Integrity | OLTP database must maintain ... | OLAP database does not get ...
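As a rough illustration of the contrast the chart draws, the sketch below runs an OLTP-style short transaction that modifies the database and an OLAP-style query that only reads and aggregates. It uses Python's built-in sqlite3 module purely for convenience; the orders table, its columns, and the function names are hypothetical and do not come from these notes.

```python
import sqlite3

# In-memory database with a hypothetical "orders" table (illustration only).
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE orders ("
    " id INTEGER PRIMARY KEY,"
    " region TEXT NOT NULL,"
    " amount REAL NOT NULL CHECK (amount >= 0))"  # integrity constraint
)


def place_order(region, amount):
    """OLTP-style work: one short transaction that modifies the database."""
    with conn:  # commits on success, rolls back if an exception is raised
        conn.execute(
            "INSERT INTO orders (region, amount) VALUES (?, ?)",
            (region, amount),
        )


def revenue_by_region():
    """OLAP-style work: a read-only query that aggregates many rows."""
    rows = conn.execute(
        "SELECT region, SUM(amount) FROM orders GROUP BY region ORDER BY region"
    )
    return dict(rows.fetchall())


place_order("north", 120.0)
place_order("north", 80.0)
place_order("south", 50.0)
summary = revenue_by_region()
print(summary)
```

In practice the analytical query would normally run against a separate warehouse fed from the transactional systems, not against the live OLTP tables; a single database is used here only to keep the sketch short.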
Definition of OLTP
Definition of OLAP
ETL is short for extract, transform, load, three database functions that are combined into one tool to pull data out of
Transform is the process of converting the extracted data from its previous form into the form it needs to be
in so that it can be placed into another database. Transformation occurs by using rules or lookup tables or by
combining the data with other data.
Load is the process of writing the data into the target database.
volumes of data and multiple source systems, the data is consolidated. ETL is used to migrate data from one
database to another, and is often the specific process required to load data to and from data marts and data
warehouses, but is a process that is also used to convert (transform) large databases from one format or type to
another.
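The three ETL steps can be sketched in a few lines. This is a minimal illustration, not a production pipeline: it assumes two in-memory SQLite databases standing in for the source and target systems, and the table names, sample rows, and COUNTRY_NAMES lookup table are all hypothetical.

```python
import sqlite3

# Hypothetical source system with a few untidy rows.
source = sqlite3.connect(":memory:")
source.execute("CREATE TABLE customers (name TEXT, country_code TEXT)")
source.executemany(
    "INSERT INTO customers VALUES (?, ?)",
    [(" alice ", "ro"), ("BOB", "us"), ("carol", "xx")],
)

# Lookup table used by the transform step (assumed mapping).
COUNTRY_NAMES = {"RO": "Romania", "US": "United States"}


def extract(conn):
    """Extract: read the raw rows out of the source database."""
    query = "SELECT name, country_code FROM customers ORDER BY rowid"
    return conn.execute(query).fetchall()


def transform(rows):
    """Transform: apply cleaning rules and the lookup table."""
    result = []
    for name, code in rows:
        clean_name = name.strip().title()
        country = COUNTRY_NAMES.get(code.upper(), "Unknown")
        result.append((clean_name, country))
    return result


def load(conn, rows):
    """Load: write the transformed rows into the target database."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS dim_customer (name TEXT, country TEXT)"
    )
    with conn:  # single transaction for the whole batch
        conn.executemany("INSERT INTO dim_customer VALUES (?, ?)", rows)


target = sqlite3.connect(":memory:")
load(target, transform(extract(source)))
loaded = target.execute(
    "SELECT name, country FROM dim_customer ORDER BY rowid"
).fetchall()
print(loaded)
```

Keeping each step as a separate function mirrors the definitions above: the transform rules can change without touching how data is read from the source or written to the target.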
Data mining systems also have several disadvantages, which are as follows.
Data mining collects information about people using market-based techniques
and information technology, and the process involves a number of factors.
In handling those factors, a data mining system can violate the privacy of
its users, which is why it falls short on user safety and security.
Eventually, this can create miscommunication between people.
The main function of a data mining system is to create a relevant space for
beneficial information.
The main problem with this information collection is that the process can be
a little overwhelming for everyone involved.
3. Misuse of information:
As explained earlier, safety and security measures in a data mining system
are minimal, so some people can misuse the collected information to harm
others.
The data mining system therefore needs to change the way it works so that
misuse of information through the mining process is reduced.
4. Accuracy of data:
In the past, collecting information about particular subjects usually meant
asking clients for help. Mining technology and its methods have since made
information collection much easier.
However, one of the main limitations of a data mining system is that the
accuracy of the data it provides is limited.
Single-tier architecture
The objective of a single layer is to minimize the amount of data
stored. The goal is to remove data redundancy. This architecture is
not frequently used in practice.
Two-tier architecture
Three-tier architecture
3. Top-Tier: The top tier is a front-end client layer. It consists of the
tools and APIs that you use to connect to the data warehouse and get
data out of it. These can be query tools, reporting tools, managed
query tools, analysis tools and data mining tools.
Database Schema
A database schema is the skeleton structure that represents the logical view
of the entire database. It defines how the data is organized and how the
relations among the data are associated. It formulates all the constraints
that are to be applied to the data.
Logical Database Schema − This schema defines all the logical constraints
that need to be applied on the data stored. It defines tables, views, and
integrity constraints.
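A logical schema of this kind can be sketched as follows. The example is only illustrative: it uses SQLite through Python's sqlite3 module, and the department and employee tables, the employee_directory view, and the violates_constraint helper are hypothetical names chosen for the sketch.

```python
import sqlite3

# Hypothetical logical schema: two tables, one view, several constraints.
SCHEMA = """
CREATE TABLE department (
    id   INTEGER PRIMARY KEY,
    name TEXT NOT NULL UNIQUE
);
CREATE TABLE employee (
    id            INTEGER PRIMARY KEY,
    name          TEXT NOT NULL,
    salary        REAL NOT NULL CHECK (salary > 0),
    department_id INTEGER NOT NULL REFERENCES department(id)
);
CREATE VIEW employee_directory AS
    SELECT e.name AS employee, d.name AS department
    FROM employee AS e
    JOIN department AS d ON d.id = e.department_id;
"""

conn = sqlite3.connect(":memory:")
# SQLite enforces foreign keys only when this pragma is switched on.
conn.execute("PRAGMA foreign_keys = ON")
conn.executescript(SCHEMA)

conn.execute("INSERT INTO department (id, name) VALUES (1, 'Analytics')")
conn.execute(
    "INSERT INTO employee (name, salary, department_id) VALUES ('Ana', 5000, 1)"
)
directory = conn.execute(
    "SELECT employee, department FROM employee_directory"
).fetchall()


def violates_constraint(sql):
    """Return True if the statement is rejected by an integrity constraint."""
    try:
        conn.execute(sql)
    except sqlite3.IntegrityError:
        return True
    return False


print(directory)
```

With this schema, an insert that references a missing department or supplies a non-positive salary is rejected by the database itself, which is the sense in which the logical schema "defines" the integrity constraints.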