Assignment 2
Due: March 04 (Monday) at 12 midnight
a. For Apriori, generate rules and itemsets for (i) default parameter values, (ii)
rules = 50, (iii) confidence = 0.7 and rules = 50, (iv) minimum support = 0.1.
Summarize the results and discuss/interpret them with respect to the income of
individuals and their information.
b. For FP-growth, generate itemsets for (i) default parameter values, (ii)
minimum support = 0.1, (iii) "find min number of itemsets" unchecked, and
(iv) "find min number of itemsets" unchecked with minimum support = 0.1.
Summarize and interpret the interesting results.
c. From the results in (a), separate out all strong classification rules, i.e., rules that
contain the class attribute (income) on the right-hand side.
d. Provide a summary of the results.
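For orientation, parts (a) and (c) can be sketched in plain Python. This is only an illustrative, level-wise (Apriori-style) search on a toy dataset, not the tool or settings the assignment expects. The transactions, the attribute=value item names, and the helper names (frequent_itemsets, association_rules, classification_rules) are all hypothetical.

```python
from itertools import combinations

# Hypothetical toy transactions: each row is a set of attribute=value items.
transactions = [
    {"education=Bachelors", "hours=full-time", "income=>50K"},
    {"education=Bachelors", "hours=full-time", "income=>50K"},
    {"education=HS-grad", "hours=full-time", "income=<=50K"},
    {"education=HS-grad", "hours=part-time", "income=<=50K"},
    {"education=Bachelors", "hours=part-time", "income=<=50K"},
]


def support(itemset, rows):
    """Fraction of rows that contain every item of the itemset."""
    return sum(itemset <= row for row in rows) / len(rows)


def frequent_itemsets(rows, min_support):
    """Level-wise search: (k+1)-candidates are built only from frequent k-itemsets."""
    items = sorted({item for row in rows for item in row})
    level = [frozenset([i]) for i in items]
    level = [s for s in level if support(s, rows) >= min_support]
    frequent = {}
    while level:
        for s in level:
            frequent[s] = support(s, rows)
        # Join step: unions of two frequent k-itemsets that have size k + 1.
        candidates = {a | b for a in level for b in level if len(a | b) == len(a) + 1}
        # Prune step: keep a candidate only if all of its k-subsets are frequent.
        candidates = {
            c for c in candidates
            if all(frozenset(sub) in frequent for sub in combinations(c, len(c) - 1))
        }
        level = [c for c in candidates if support(c, rows) >= min_support]
    return frequent


def association_rules(frequent, min_confidence, max_rules=None):
    """Rules (lhs, rhs, support, confidence), sorted by confidence, highest first."""
    rules = []
    for itemset, supp in frequent.items():
        for k in range(1, len(itemset)):
            for lhs in combinations(sorted(itemset), k):
                lhs = frozenset(lhs)
                confidence = supp / frequent[lhs]  # lhs is frequent by downward closure
                if confidence >= min_confidence:
                    rules.append((lhs, itemset - lhs, supp, confidence))
    rules.sort(key=lambda r: -r[3])
    return rules if max_rules is None else rules[:max_rules]


def classification_rules(rules, class_prefix="income="):
    """Keep only rules whose right-hand side contains the class attribute."""
    return [r for r in rules if any(item.startswith(class_prefix) for item in r[1])]


freq = frequent_itemsets(transactions, min_support=0.4)
all_rules = association_rules(freq, min_confidence=0.7, max_rules=50)
for lhs, rhs, supp, conf in classification_rules(all_rules):
    print(sorted(lhs), "->", sorted(rhs), f"support={supp:.2f} confidence={conf:.2f}")
```

Lowering min_support or min_confidence in this sketch enlarges the rule set, which mirrors the effect of the parameter variations in (a) and (b).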
Note: You can find the dataset description details at the link below.
a. Divide the dataset into 4 equal bins and find the correlated attributes from
each bin. Compare the results from each bin.
b. Apply dimensionality reduction to reduce computations. Report the results
from each part separately after dimensionality reduction. You can use
any techniques of your choice for data preprocessing and
dimensionality reduction. Please describe your technique in the report; you
will be evaluated based on the findings you present there.
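As a rough illustration of part (a) above, the sketch below splits a dataset into four equal bins and lists the attribute pairs whose Pearson correlation exceeds a threshold in each bin. It assumes that "bins" means four equal-sized, contiguous groups of rows and that the attributes are numeric. The synthetic data, the attribute names x1 to x3, the 0.8 threshold, and the helper names are all hypothetical.

```python
from itertools import combinations
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length numeric lists."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0  # constant column: correlation is undefined, reported here as 0
    return cov / sqrt(var_x * var_y)


def split_into_bins(rows, n_bins=4):
    """Split rows into n_bins contiguous, equal-sized bins (leftover rows are dropped)."""
    size = len(rows) // n_bins
    return [rows[i * size:(i + 1) * size] for i in range(n_bins)]


def correlated_pairs(rows, names, threshold=0.8):
    """Attribute pairs whose absolute correlation within rows reaches the threshold."""
    columns = {name: [row[i] for row in rows] for i, name in enumerate(names)}
    result = {}
    for a, b in combinations(names, 2):
        r = pearson(columns[a], columns[b])
        if abs(r) >= threshold:
            result[(a, b)] = r
    return result


# Synthetic data: x2 is a linear function of x1, x3 follows an unrelated pattern.
names = ["x1", "x2", "x3"]
rows = [[i, 2 * i + 1, (i * 7) % 5] for i in range(12)]

for index, bin_rows in enumerate(split_into_bins(rows, 4)):
    pairs = correlated_pairs(bin_rows, names)
    print(f"bin {index}:", {pair: round(r, 2) for pair, r in pairs.items()})
```

In this toy data the pair (x1, x2) is correlated in every bin, while x3 appears correlated only in the first bin, which is the kind of difference the bin-by-bin comparison is meant to surface. Keeping one attribute from each pair that is strongly correlated across all bins is one simple option for the dimensionality reduction in part (b).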