1.a) Explain OLAP storage models?
b) How do data warehousing and data mining work?
2. Assume that the data for analysis includes the age attribute. The age values for the data are of increasing order13.
A) How can you determine the outliers in the data?
B) What other methods exist for data smoothing?
3. List and describe the primitives for the data mining task?
4. Why perform attribute relevance analysis? Explain the various methods of your?
5. a) How is the association of large databases extracted?
b) Describe the different mining classifications of associated rules?
6. How will you solve a classification problem using decision trees?
7. a) What are the fields in which grouping techniques are used?
b) What are the main requirements of cluster analysis?
8. Write short notes about:
I) Discrimination of different classes
(Ii) Statistical measures in large databases.
9. Briefly compare and explain by taking an example of your point (s).
a) Snowflake scheme, constellation of facts
b) Data cleaning, data transformation.
10. a) Speak about various topics in data integration?
b) Explain the generation of conceptual hierarchies for categorical data?
11. a) Why is it important to have a query language for data mining?
b) Define the scheme and hierarchies derived from the operation?
12. Describe an incremental algorithm based on data cubes for comparisons of mining analytical classes.
13. List and explain the five techniques for improving the a priori efficiency algorithm?
14. What is backpropagation? Explain classification by retrotransmission?
15. Why is atypical mining important?
a) Discuss different approaches to detection of atypical values?
b)Do you briefly discuss two methods of hierarchical grouping with appropriate examples?
16. Write short notes about:
I) Spatial data mining databases
Ii) Mining of the World Wide Web.
17. a) Differentiate between OLAP and OLTP?
b) Draw and explain the star schema for the data store?
18. What is data compression?
a) How are data compressed using Principal Component Analysis (PCA)?
19. List and describe the various types of conceptual hierarchies?
20. List statistical measures for the characterization of data dispersion, and discuss how they can be computed efficiently in large databases?