Documente Academic
Documente Profesional
Documente Cultură
The sources may involve multiple databases, data cubes, or flat files. One of the most customary
implementations of data integration is building an enterprise data warehouse.
TIGHT: In this approach data from different sources are integrated into a single physical location
by the process of ETL – Extraction, Transformation, and Loading.
Loose: data remains in the original source databases. A combination which provides scope to
take queries from the user and transforms them in a format the source database can understand
and then sends the query directly to the source databases to obtain the result.
It is usually done from the composition of a source system into the required composition of a
new destination system. The process fundamentally involves converting documents, but data
conversions sometimes involve the transformation of a program from one computer language to
another to authorise the program to run on a different platform. The purpose of this data passage
is the adoption of a new system that’s totally different from the previous one.
Data discretization is defined as a process of converting continuous data attribute values into a
finite set of intervals and associating with each interval some specific data value.
Top-down discretisation: In the top-down discretisation process, one or a few points found first
and are used (called split points or cut points) to split the entire attribute range and then repeats
this loop on the resulting intervals.
Bottom-up discretisation: In the bottom-up discretisation, the process starts by acknowledging all
of the continuous values as possible split-points, removes some by merging neighbourhood
values to form intervals.
In a multidimensional model, data is systematically arranged into multiple dimensions, and each
dimension has multiple levels of abstraction defined by concept hierarchies. This provides users
with the adaptability to observe data from different perspectives.