6 May 2018
Class Notes:
Home Work
1. Can I get the memory location where a variable is stored?
a. A function similar to id() in Python
2. What is && in R, and how does it differ from & and the bitwise functions (bitwAnd, bitwOr)?
3. Read the R file which I have given you.
Interview Questions
1. paste0()
2. %in%
3. Manipulating vectors of unequal length
4. Working with data frames with different numbers of rows
NOTE: Next class we will have a test, based on the R file only. If you can't code, then you
have to teach.
12 May 2018
Home Work:
1. Linear Regression
a. http://www.learnbymarketing.com/tutorials/linear-regression-by-hand-in-excel/
2. What is the hypothesis of Linear regression?
3. Read about the BETA0 and BETA1 values expressed in terms of "r" (coefficient of correlation)
4. Can we understand the following degrees of freedom?
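For homework item 3, one way to see the link between the coefficients and "r" is the textbook identity BETA1 = r * (sd of y / sd of x) and BETA0 = mean(y) - BETA1 * mean(x). The sketch below is in Python for illustration only; the function name fit_simple_lr and the sample data are made up for this example and are not from the class R file.

```python
import math

def fit_simple_lr(x, y):
    """Simple linear regression coefficients computed through the correlation r."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    syy = sum((yi - mean_y) ** 2 for yi in y)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))

    r = sxy / math.sqrt(sxx * syy)      # Pearson correlation coefficient
    sd_x = math.sqrt(sxx / (n - 1))     # sample standard deviations
    sd_y = math.sqrt(syy / (n - 1))

    beta1 = r * sd_y / sd_x             # slope written in terms of r
    beta0 = mean_y - beta1 * mean_x     # intercept: line passes through the means
    return beta0, beta1, r

# Hypothetical data, roughly y = 2x
x = [1, 2, 3, 4, 5]
y = [2.1, 3.9, 6.2, 7.8, 10.1]
beta0, beta1, r = fit_simple_lr(x, y)
print(beta0, beta1, r)
```

The slope obtained this way is the same as the usual least-squares slope sxy / sxx, because the standard deviation terms cancel.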
Regression Analysis
1. MAPE
2. RMSE
3. Log RMSE
4. MSE
5. MAE
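The error measures listed above can be sketched in a few lines. This is a minimal Python illustration, not the course's R code. It assumes that "Log RMSE" means the root mean squared error taken on log(1 + value), which is one common reading; the function name regression_errors and the sample numbers are hypothetical.

```python
import math

def regression_errors(actual, predicted):
    """Return MSE, RMSE, MAE, MAPE (in percent) and log RMSE for two equal-length lists."""
    n = len(actual)
    errors = [a - p for a, p in zip(actual, predicted)]

    mse = sum(e * e for e in errors) / n
    rmse = math.sqrt(mse)
    mae = sum(abs(e) for e in errors) / n
    # MAPE is undefined when an actual value is 0
    mape = 100 * sum(abs(e / a) for e, a in zip(errors, actual)) / n
    # Assumption: "Log RMSE" = RMSE computed on log(1 + value)
    log_rmse = math.sqrt(
        sum((math.log1p(a) - math.log1p(p)) ** 2 for a, p in zip(actual, predicted)) / n
    )
    return {"MSE": mse, "RMSE": rmse, "MAE": mae, "MAPE": mape, "LogRMSE": log_rmse}

# Hypothetical values
actual = [100, 200, 300, 400]
predicted = [110, 190, 310, 380]
metrics = regression_errors(actual, predicted)
print(metrics)
```

MSE and RMSE penalize large errors more heavily than MAE, while MAPE expresses the error relative to the actual value.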
Interview Questions
Home Work :
Linear Regression:
Interview Questions
Logistic Regression
Interview Questions:
1. What are cutoffs in logistic regression?
a. P = .5 : Balanced Data
b. KS Cutoff
2. How to choose a different cutoff
a. ROC AUC Curve
b. Business
3. Model evaluation measures in logistic regression
a. AUC
b. ROC
c. Gini
d. F1 Score
e. Confusion Matrix
f. Accuracy
g. Recall / Precision
h. Concordance Ratio
i. Hosmer-Lemeshow test
j. McFadden's R-squared
4. How to handle imbalanced data in logistic regression?
a. Analytics Vidhya
b. https://www.analyticsvidhya.com/blog/2016/03/practical-guide-deal-imbalanced-classification-problems/
c. https://www.analyticsvidhya.com/blog/2016/09/this-machine-learning-project-on-imbalanced-data-can-add-value-to-your-resume/
5. How is the ROC curve created?
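For question 5, a rough picture: sort the predicted probabilities, treat each distinct value as a cutoff, and record the false positive rate and true positive rate at that cutoff. The Python sketch below (illustration only, with made-up labels and scores) builds those points, estimates AUC with the trapezoid rule, derives Gini as 2 * AUC - 1, and picks the KS cutoff as the threshold where TPR - FPR is largest. The function names are hypothetical.

```python
def roc_points(labels, scores):
    """Return (threshold, fpr, tpr) for each distinct score, from highest to lowest."""
    positives = sum(labels)
    negatives = len(labels) - positives
    points = []
    for threshold in sorted(set(scores), reverse=True):
        # Everything scoring at or above the threshold is predicted positive
        tp = sum(1 for y, s in zip(labels, scores) if s >= threshold and y == 1)
        fp = sum(1 for y, s in zip(labels, scores) if s >= threshold and y == 0)
        points.append((threshold, fp / negatives, tp / positives))
    return points

def auc_trapezoid(points):
    """Area under the ROC curve, starting from the origin (0, 0)."""
    xs = [0.0] + [fpr for _, fpr, _ in points]
    ys = [0.0] + [tpr for _, _, tpr in points]
    return sum((xs[i] - xs[i - 1]) * (ys[i] + ys[i - 1]) / 2 for i in range(1, len(xs)))

def ks_cutoff(points):
    """Threshold with the largest gap between TPR and FPR (first one on ties)."""
    return max(points, key=lambda p: p[2] - p[1])[0]

# Hypothetical labels (1 = event) and predicted probabilities
labels = [1, 1, 0, 1, 0, 0, 1, 0]
scores = [0.9, 0.8, 0.7, 0.6, 0.55, 0.4, 0.3, 0.2]

points = roc_points(labels, scores)
auc = auc_trapezoid(points)
gini = 2 * auc - 1
ks = ks_cutoff(points)
print(auc, gini, ks)
```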
Trees :
Date : 26 / 27 May
Homework
1. What is reduction in variance in a tree?
2. What is deviance in a tree?
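For homework item 1, reduction in variance is the criterion a regression tree can use to score a candidate split: the variance of the target in the parent node minus the size-weighted variance of the two child nodes. A minimal Python sketch with made-up numbers (the function names are hypothetical):

```python
def variance(values):
    """Population variance of a list of numbers."""
    mean = sum(values) / len(values)
    return sum((v - mean) ** 2 for v in values) / len(values)

def variance_reduction(parent, left, right):
    """Parent variance minus the size-weighted variance of the two children."""
    n = len(parent)
    weighted_child_variance = (
        len(left) / n * variance(left) + len(right) / n * variance(right)
    )
    return variance(parent) - weighted_child_variance

# Hypothetical target values in a node, and one candidate split
parent = [10, 12, 11, 30, 32, 31]
left = [10, 12, 11]
right = [30, 32, 31]
reduction = variance_reduction(parent, left, right)
print(reduction)
```

The split with the largest reduction is the one the tree would prefer at that node.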
Interview Questions
1. How are the trees different in a random forest?
2. What is the difference between bagging and random forest?
3. What is OOB in Random forest?
4. What are the methods on which trees are built?
a. Gini, Chi Square , Entropy
5. How are nodes split in trees?
6. When to use Linear Regression over trees?
7. How to use random forest results as input to LR?
a. Use the important variables from RF in LR
b. We get a variable importance plot in R
c. Variable Importance Plot
i. Mean Decrease Gini
8. When to use trees over linear regression?
9. How do you prune a tree?
https://www.hackerearth.com/practice/machine-learning/machine-learning-algorithms/tutorial-random-forest-parameter-tuning-r/tutorial/
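For interview questions 4 and 5 above, the Gini and entropy criteria both measure how mixed the classes in a node are, and a split is scored by how much it lowers that impurity. The Python sketch below is an illustration only, with made-up labels and hypothetical function names; the chi-square criterion is not covered here.

```python
import math
from collections import Counter

def gini(labels):
    """Gini impurity: 1 minus the sum of squared class proportions."""
    n = len(labels)
    return 1 - sum((count / n) ** 2 for count in Counter(labels).values())

def entropy(labels):
    """Entropy in bits: minus the sum of p * log2(p) over the classes."""
    n = len(labels)
    return -sum((count / n) * math.log2(count / n) for count in Counter(labels).values())

def split_gain(parent, children, impurity):
    """Parent impurity minus the size-weighted impurity of the child nodes."""
    n = len(parent)
    return impurity(parent) - sum(len(child) / n * impurity(child) for child in children)

# Hypothetical node with 5 "yes" and 5 "no", and one candidate split
parent = ["yes"] * 5 + ["no"] * 5
left = ["yes"] * 4 + ["no"]
right = ["yes"] + ["no"] * 4

gini_gain = split_gain(parent, [left, right], gini)
info_gain = split_gain(parent, [left, right], entropy)
print(gini_gain, info_gain)
```

Among the candidate splits at a node, the one with the largest gain is chosen.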
Homework
https://www.youtube.com/watch?v=XJ3194AmH40&t=181s
Clustering:
Hierarchical Clustering
1. Type
2. Merge Method
a. Complete, Single, Average, Centroid, Ward
3. Dendrogram: how is it created?
4. Why is this not as good as KMeans?
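The merge methods in item 2 differ only in how the distance between two clusters is defined. A minimal Python sketch (requires Python 3.8+ for math.dist) for four of them, using made-up 2-D points; Ward's method, which merges the pair giving the smallest increase in within-cluster variance, is left out to keep the example short. The function names are hypothetical.

```python
import math
from itertools import product

def centroid(points):
    """Coordinate-wise mean of a list of points."""
    return tuple(sum(coords) / len(points) for coords in zip(*points))

def linkage_distance(a, b, method):
    """Distance between clusters a and b under the given merge method."""
    pairwise = [math.dist(p, q) for p, q in product(a, b)]
    if method == "single":      # closest pair of points
        return min(pairwise)
    if method == "complete":    # farthest pair of points
        return max(pairwise)
    if method == "average":     # mean over all pairs
        return sum(pairwise) / len(pairwise)
    if method == "centroid":    # distance between cluster means
        return math.dist(centroid(a), centroid(b))
    raise ValueError(f"unknown method: {method}")

# Hypothetical clusters
cluster_a = [(0, 0), (0, 2)]
cluster_b = [(3, 0), (3, 2)]
distances = {
    m: linkage_distance(cluster_a, cluster_b, m)
    for m in ("single", "complete", "average", "centroid")
}
print(distances)
```

Hierarchical clustering repeatedly merges the two clusters with the smallest such distance, and the dendrogram records the distance at which each merge happened.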
Kmeans