Documente Academic
Documente Profesional
Documente Cultură
2. What is the difference between logical data model and physical data model in Erwin? 3. Match the following in the context of data flow diagram : i. Circle a. Source of data ii. Square b. File or data base iii. Arrow c. Conversion process iv. Parallel line d. Data flow 1. L-a, ll-b,lll-c, lv-d 2. L-c, ll-a, lll-d, lv-b 3. L-c,ll-d, lll-a, lv-b 4. L-b, ll-a, lll-d, iv-c 4. Managers salary details are hidden from employees This is 1. Conceptual level of data hiding 2. Physical level of data hiding 3. External level of data hiding 4. Either 1 or 2 5. Data Modeling software tools 6. In which normal form is the dimension table and fact table in the schema? 7. Conceptual Models 8. Why recursive relationships are bad? How do you resolve them? 9. What is the difference between star flake and snow flake schema? 10. When should you consider denormalization? 11. Describe the third normal form? 12. What is second normal form ? 13. What is First Normal Form ? 14. Generally speaking, for a weak entity set to be meaningful it must be part of a 1. One-to-one relationship 2. One-to-many relationship 3. Many-to-many relationship 4. Depends on a particular situation 15. Data modeling is the process of constructing 1. An orderly arrangement of data elements 2. A graphic representation of data contained in an information system 3. Physical elements of the information system 4. A verbal description of the data need 16. What is ERD? 17. What is the difference between hashed file stage and sequential file stage in relates to datastage Server? 18. Is this statement TRUE or FALSE all databases must be in third normal form? 19. What is data sparsity and how it effect on aggregation? 20. What is an artificial (derived) primary key? When should it be used?
Data Modeling
Data Warehousing
1. 2. 3. 4. 5. 6. 7. 8. 9. 10. 11. 12. 13. 14. 15. 16. 17. 18. 19. 20. 21. 22. 23. 24. 25. 26. 27. 28. 29. 30. 31. 32. 33. 34. 35. 36. 37. 38. 39. 40. 41. 42. 43. 44. 45. 46. 47. 48. 49. 50. 51. What is Data Warehousing? What is Virtual Data Warehousing? Explain in brief various fundamental stages of Data Warehousing. What is active data warehousing? List down differences between dependent data warehouse and independent data warehouse. What is data modeling and data mining? What is this used for? Difference between ER Modeling and Dimensional Modeling. What is the difference between data warehousing and business intelligence? Describe dimensional Modeling. What is snapshot with reference to data warehouse? List out types of dimension tables. What is degenerate dimension table? What is Data Mart? Define Fact table. Define Dimension table. What is the difference between metadata and data dictionary? What is ETL? Describe the various methods of loading Dimension tables. What is OLTP? What is the difference between OLAP and data warehouse? What is ODS? What is OLAP? List out the difference between OLTP and OLAP. Describe the foreign key columns in fact table and dimension table. Explain in brief Data Mining. Difference between view and materialized view. Explain in brief ER Diagram. What is VLDB? What is cube grouping? Define the term slowly changing dimensions (SCD). Differences between star and snowflake schema. What is a Star Schema? Why fact table is in normal form despite the fact that de-normalized is improves data warehouse processes? Explain the use lookup tables and Aggregate tables. What is a Cube and Linked Cube with reference to data warehouse? What is real time data-warehousing? What is conformed dimensions use for? What is conformed fact? What Snow Flake Schema? How do you load the time dimension? What is junk dimension? What is a level of Granularity of a fact table? Define non-additive facts. Explain the use of factless facts table. What is hybrid slowly changing dimension? Define BUS Schema. List out difference between SAS tool and other tools. Why is SAS so popular? What is data cleaning? How can we do that? Explain in brief critical column. What is data cube technology used for?
What is PDAP?
Answer A data cube stores data in a summarized version which helps in a faster analysis of data. The data is stored in such a way that it allows reporting easily. E.g. using a data cube A user may want to analyze weekly, monthly performance of an employee. Here, month and week could be considered as the dimensions of the cube.
What is XMLA?
Answer XMLA is XML for Analysis which can be considered as a standard for accessing data in OLAP, data mining or data sources on the internet. It is Simple Object Access Protocol. XMLA uses discover and Execute methods. Discover fetched information from the internet while Execute allows the applications to execute against the data sources.
They have a highly compressed structure, making them fast to read. Their structure makes it possible for the system to combine multiple indexes together so that they can access the underlying table faster.
Data transformation The extracted data may undergo transformation with possible addition of metadata before they are exported to another large storage area. In transformation phase, various functions related to business needs, requirements, rules and policies are applied on them. During this process some values even get translated and encoded. Care is also taken to avoid redundancy of data. Data Cleansing In data cleansing, scrutinizing of the incorrect or corrupted data is done and those inaccuracies are removed. Thus data consistency is ensured in Data cleansing. It involves activities like - removing typographical errors and inconsistencies - comparing and validating data entries against a list of entities Data transformation This is the last process of Bidirectional Extracts. The cleansed, transformed extracted source data is then loaded into the data warehouse. Advantages - Updates and data loading become very fast due to bidirectional extracting. - As timely updates are received in a useful pattern companies can make good use of this data to launch new products and formulate market strategies. Disadvantage - More investment on advance and faster IT infrastructure. - Not being able to come up with fault tolerance may mean unexpected stoppage of operations when the system breaks. - Skilled data administrator needs to be hired to manage the complex process.
When the warehouse is at integrated stage, It not only updates data as and when a transaction is performed but also generates transactions which are passed back to the source online data.
What is data modeling and data mining? What is this used for?
Data Modeling is a technique used to define and analyze the requirements of data that supports organizations business process. In simple terms, it is used for the analysis of data objects in order to identify the relationships among these data objects in any business. Data Mining is a technique used to analyze datasets to derive useful insights/information. It is mainly used in retail, consumer goods, telecommunication and financial organizations that have a strong consumer orientation in order to determine the impact on sales, customer satisfaction and profitability. Data Mining is very helpful in determining the relationships among different business attributes.