Documente Academic
Documente Profesional
Documente Cultură
SDBMS
A SDBMS is a software module that
can work with an underlying DBMS
supports spatial data models, spatial abstract data types (ADTs) and a query language
from which these ADTs are callable
supports spatial indexing, efficient algorithms for processing spatial operations, and
domain specific rules for query optimization
Example: Oracle Spatial data cartridge, ESRI SDE
can work with Oracle 8i DBMS
Has spatial data types (e.g. polygon), operations (e.g. overlap) callable from SQL3 query
language
Has spatial indices, e.g. R-trees
SDBMS Example
Post-relational DBMS
Support user defined abstract data types
Spatial data types (e.g. polygon) can be added
Choice of post-relational DBMS
Object oriented (OO) DBMS
Object relational (OR) DBMS
A spatial database is a collection of spatial data types, operators, indices, processing strategies,
etc. and can work with many post-relational DBMS as well as programming languages like Java,
Visual Basic etc.
MULTIMEDIA DBMS
A multimedia database management system (MM-DBMS) is a framework that manages different types of
data potentially represented in a wide diversity of formats on a wide array of media sources.
Like the traditional DBMS, MM-DBMS should address requirements:
Integration
• Data items do not need to be duplicated for different programs
Data independence
• Separate the database and the management from the application programs
Concurrency control
• allows concurrent transactions
Requirements of Multimedia DBMS
Persistence
• Data objects can be saved and re-used by different transactions and program
invocations
Privacy
• Access and authorization control
Integrity control
• Ensures database consistency between transactions
Recovery
• Failures of transactions should not affect the persistent data storage
Query support
• Allows easy querying of multimedia data
MOBILE DATABASE
A mobile database is either a stationary database that can be connected to by a mobile
computing device (e.g., smart phones and PDAs) over a mobile network, or a database which is actually
stored by the mobile device. This could be a list of contacts, price information, distance travelled, or any
other information
– communicate with centralized database server through modes such as wireless or Internet
access;
• Smart client applications have emerged as the architecture of choice over browser-based wireless
Internet applications, as they enable access to data while the mobile user is disconnected from the
network—wireless or otherwise. This capability is best implemented by incorporating persistent
data storage using a mobile database in your application.
• The main advantage of using a mobile database in your application is offline access to data—in
other words, the ability to read and update data without a network connection. This helps avoid
problems such as dropped connections, low bandwidth, and high latency that are typical on
wireless networks today.
Microsoft Backed
Growing rapidly
Integrates MS-Windows applications
Focus was more broad (datawarehouse, etc.)
Popular to MS-Windows User
Mobile DB Environments
Database Front-End
C, C++
Java
Visual Studios (C++, VB, C#, J#)
Appforge – Mobile VB
Database Backend
Sybase’s Ultralite
Oracle Lite
MS-Pocket Access
MS-SQL Server CE
Pointbase
Sybase
Very Powerful
supports 100% Java development (through JDBC drivers and the database's native support for
embedded SQLJ and Java stored procedures)
Supports programming from any development tool that supports ODBC (Visual Basic, C++,
Delphi, and so on).
WindowsCE (Pocket) and PalmOS (Palm).
Includes Mobile SQL that is the mobile equivalent of Oracle's SQL*Plus tool.
Only Oracle DBMS significantly different.
Oracle Lite Architecture
Microsoft’s Mobile Databases
Choosing a Mobile DB
Basics of WWW
• Existing Java programs that connect to DBMS can be extended to generate dynamic HTML using
CGI
• CGI = Common Gateway Interface
• CGI is generic and can be used with
– Java, C and other programming languages
– Unix scripts and other scripting languages
• Low-level DB access exploits DB interface libraries such as JDBC
The time horizon for the data warehouse is significantly longer than that of operational systems.
Operational database: current value data.
Data warehouse data: provide information from a historical perspective (e.g., past 5-10
years)
Every key structure in the data warehouse
Contains an element of time, explicitly or implicitly
But the key of operational data may or may not contain “time element”.
Data Warehouse—Non-Volatile
Enterprise warehouse
collects all of the information about subjects spanning the entire organization
Data Mart
a subset of corporate-wide data that is of value to a specific groups of users. Its scope is
confined to specific, selected groups, such as marketing data mart
Independent vs. dependent (directly from warehouse) data mart
Virtual warehouse
A set of views over operational databases
Only some of the possible summary views may be materialized
Data extraction:
get data from multiple, heterogeneous, and external sources
Data cleaning:
detect errors in the data and rectify them when possible
Data transformation:
convert data from legacy or host format to warehouse format
Load:
sort, summarize, consolidate, compute views, check integrity, and build indicies and
partitions
Refresh
propagate the updates from the data sources to the warehouse
Data Warehouse Usage
Data Mining
Data mining (knowledge discovery in databases):
Extraction of interesting (non-trivial, implicit, previously unknown and potentially
useful) information or patterns from data in large databases
Alternative names and their “inside stories”:
Data mining: a misnomer?
Knowledge discovery(mining) in databases (KDD), knowledge extraction, data/pattern
analysis, data archeology, data dredging, information harvesting, business intelligence,
etc.
Data Mining — Potential Applications
• Database analysis and decision support
– Market analysis and management
• target marketing, customer relation management, market basket analysis, cross
selling, market segmentation
– Risk analysis and management
• Forecasting, customer retention, improved underwriting, quality control,
competitive analysis
– Fraud detection and management
• Other Applications
– Text mining (news group, email, documents) and Web analysis.
– Intelligent query answering
Steps of a KDD Process
• Outlier analysis
– Outlier: a data object that does not comply with the general behavior of the data
– It can be considered as noise or exception but is quite useful in fraud detection, rare
events analysis
• Trend and evolution analysis
– Trend and deviation: regression analysis
– Sequential pattern mining, periodicity analysis
– Similarity-based analysis
• Other pattern-directed or statistical analyses
Data Mining: Classification Schemes
• General functionality
– Descriptive data mining
– Predictive data mining
• Different views, different classifications
– Kinds of databases to be mined
– Kinds of knowledge to be discovered
– Kinds of techniques utilized
– Kinds of applications adapted
• Databases to be mined
– Relational, transactional, object-oriented, object-relational, active, spatial, time-series,
text, multi-media, heterogeneous, legacy, WWW, etc.
• Knowledge to be mined
– Characterization, discrimination, association, classification, clustering, trend, deviation
and outlier analysis, etc.
– Multiple/integrated functions and mining at multiple levels
• Techniques utilized
– Database-oriented, data warehouse (OLAP), machine learning, statistics, visualization,
neural network, etc.
• Applications adapted
– Retail, telecommunication, banking, fraud analysis, DNA mining, stock market analysis,
Web mining, Weblog analysis, etc.
Data Mart