Sunteți pe pagina 1din 5

Data Mining Important Questions

Q1. What are the characteristics of data in data ware house? Q2.Explain data warehouse cycle? Q3.What are the uses of data ware house? Q4.What is data architecture of data warehouse operations? Q5. What is a data warehouse? How does it differ from a database? Q6.What are the steps involved in the acquisition of data for a data ware house? Q7.What are the difficulties in implementing a data warehouse? Q8.What is a multidimensional data model? How is it used in data warehouse? Q9.Define the terms : (a) OLAP (b)ROLAP (c)MOLAP (d)DSS (e)Data marts. Q10. Describe the characteristics of data warehouse. How is the concept of relational view related to Data Warehouse? Q11.What is data mining? In your answer address the following: (a) Is it another type? (b) Is it a simple transformation of technology developed from database, statistics and machine learning? (c) Explain how the evolution database technologies lead to data mining? (d) Describe the steps involved in the data mining when viewed as a process of knowledge discovery. Present an example where data mining is crucial to success of business. What data mining functions does this business need? Can they be performed alternatively by data query processing or simple statistics analysis? Q12.How is a data warehouse differing from a database? How are they similar to each other? Describe different challenges regarding data mining methodologies and user interactions. Q13.In both data mining and data warehousing, it is important to have some hierarchical information associated with each dimension. If such a hierarchy is not given, discuss how to generate such hierarchy automatically for the first case of dimension containing only numeric data and also for the second case of a dimension containing only categorical data. Q14. What do you mean by data mining? Differentiate between data mining techniques and data mining strategy. Q15. Define the term Data Cleaning with example.

Q16.Write short notes on the following: (a) Data mining metrices (b) Social implications of data mining Q17. Define KDD. Identify and describe the phases in the KDD process. Q18.Differentiate between the following: (a) Data warehouse and operational databases (b) Intrinsic and actual value Q19.Write short notes on dimensionality reduction. Q20. Explain data mining process with neat diagram. Q21. Explain clustering and regression with example. Q22. What is Z-Score normalization? Q23. Distinguish between dimensionality reduction and numerosity reduction. Q24. Explain Histogram. The following are a list of prices of commonly sold items at a company. The number have been stored 1,1,5,5,5,8,8,10,10,15,15,15,15,20,20,,20,20.Make a histogram for price using singleton buckets. Q25.Describe the structure of data warehouse with the help of a diagram Q26.Describe the benefits and drawbacks of a source-driven architecture for gathering of data at a datawarehouse as compared to a destination-driven architecture. Q27.What are the typical functionalities of a data warehouse. Q28. How would you differentiate between Data warehouse and Views? Q29.What are the differences between three main types of data ware house usage: information processing, analytical processing and data mining? Discuss the motivation behind OLAP Mining. Q30.Propose an algorithm, in pseudo code or in your favourite language you know, the automatic generation of a concept hierarchy for numerical database on the equi-depth partitioning. Q31.If your dataset contains missing value, discuss the basic analysis and the corresponding decisions you will take in the preprocessing phase of the data mining process. Develop a software tool for the detection of outliers if the data for preprocessing are given in the form of a flat file with n-dimensional samples. Q32. Define the term data generalization and analytical characterization with examples. Q33. (a)Describe mining association rules in large databases. (b)Data quality can be assessed in the terms of accuracy, completeness and consistency. two other dimensions of the data quality. Propose

Q34.Describe the following: (a) Mining single dimensional Boolean association rule from transactional databases. (b) The Aproiri Algorithm: Finding frequent item sets using candidate generation. Q35. What do you understand by the terms data characterization in the content to concept description? Q36. With the help of an example explain data discriminations in brief. Q37.List out the reasons why we perform attribute relevance analysis? Q38.What are the main purposes of statistics used in data mining? Q39. What do you understand by outliers? Q40.What do you mean by association rules, for what purpose it is being used? Explain with example. Q41.State 12 guidelines/rules for evaluating OLAP products developed by E.F.Codd. Q42. Describe the capabilities of OLAP. Q43.What is the role of Artificial Intelligence in Data Mining. Q44.Suppose that university course database for UPTU contains the following attributes: name, address, status, major of each student and their cumulative grade point average(GPA),propose a concept hierarchy for the attributes status, major GPA and address. Q45. Describe various issues regarding Classification and Prediction. Q46. Explain Decision tree? Give the algorithm for Decision Tree Induction. Q47. Write short notes on: (a) Bayesian Classification (b) Back Propagation Algorithm Q48. What do you mean by clustering? Explain Data Types in Clustering Q49. Write short note on Divisive hierarchical clustering. Q50. What do you understand by neural network? Explain multilayer Feed Forward Neural network. Differentiate between Feed-forward and feed-backward system. Q51.Discuss the most commonly used techniques in data mining. Q52. Discuss the advantage and disadvantage of data mining. Q53. Give examples of main task that are solved by a data mining system. Q54. State the goals and tasks of data mining. Q55. Explain the concept of data cube and where it is used for visualization of large data sets. Q56. Discuss the key features of Data warehouse with example.

Q57. Describe the following with example. (i) Concept Hierarchy (ii) 3-tier architecture Q58. What is multidimensional data model? How we convert tables and spreadsheets to data Convert 2-D tables into 3-D data cubes. cubes?

Q59. Define data warehousing with suitable example, why we need a separate data warehouse? Differentiate between OLAP and OLTP. Q60. Explain Star, Snow Flake and Fact Constellation schemas. Q61.Write short note on: (a) Decision Tree (b) Genetic algorithm Q62.What is clustering and how is it different from classification? Q63. What do you mean by aggregation? Explain in brief, how the OLAP handles aggregation? Write the differences between MOLAP and HOLAP. Q64. Explain OLAP functions and tools in brief .What are the main features of OLAP servers. Q65.Write short note on: (i) Testing data warehouse (ii) Backup and Recovery (iii) Data mining interfaces (iv) Neural Networks (v) OLAP Queries

!! ALL THE BEST !!

S-ar putea să vă placă și