Documente Academic
Documente Profesional
Documente Cultură
Making Accurate Predictions with Data Mining Although the literature contains statements such as data mining will allow us to predict who will buy a particular product, that is against human nature. In situations where data mining is used to predict response to a marketing campaign, only about 5% of the people selected as likely respondents actually do respond.
Making Accurate Predictions with Data Mining (cont.) Although the accuracy of predicting individual behavior is not so good, it is better than it seems, since direct marketing efforts often have hit rates of only about 1% without data mining.
Multidimensional view Transparent to user Accessible Consistent reporting Client-server architecture Generic dimensionality
Dynamic sparse matrix handling Multiuser support Cross-dimensional ops Intuitive manipulation Flexible reporting Unlimited dimension and aggregation
OLAP as Implemented
To date, it does not appear that any implementation exists that satisfies all 12 rules. Some people argue it might not even be possible to attain all of them. More recently, the term OLAP has come to represent the broad category of software technology that enables multidimensional analysis of enterprise data.
0.7
0.6
Sales
0.5
0.4 4 0.3 1 2 3
Product
Region
2 3
Data Mining Technologies (cont.) Decision trees these technologies are conceptually simple and have gained in popularity as better tree growing software was introduced. Because of the way they are used, they are perhaps better called classification trees.
the business problem and obtain the data to study it. Use data mining software to model the problem. Mine the data to search for patterns of interest.
the mining results and refine them by respecifying the model. Once validated, make the model available to other users of the DW.
Web mining
Web mining is a special case of text mining where the mining occurs over a website. It enhances the website with intelligent behavior, such as suggesting related links or recommending new products. It allows you to unobtrusively learn the interests of the visitors and modify their user profiles in real time. They also allow you to match resources to the interests of the visitor.
Visual Presentation
For any kind of high dimensional data set, displaying predictive relationships is a challenge. Shading is used to represent relative degrees of thunderstorm activity, with the darkest regions the heaviest activity.
A Bit of History
An early effort used sequences of twodimensional graphs to add depth. Current virtual reality programs allow the user to step through a data set. Try going to a realtors website and taking a tour of a house up for sale.
Note: On the live map, clicking on an area allows the user to drill down and see results for smaller areas.