Data Science Curriculum

---- Month 1 ----


I. Introduction to Data Science
☑ Data Science A-Z: Real-Life Data Science Exercise – Udemy
☑ An Introduction to Statistical Learning – eBook
II. Statistics
☑ Statistical Thinking for Data Science and Analytics – edX

---- Month 2 ----


1. Probability
☐ Introduction to Probability – The Science of Uncertainty – MITx: 6.041x
2. Calculus
☑ Multivariable Calculus – Khan Academy
☑ Mathematics for Machine Learning: Multivariate Calculus – Coursera

---- Month 3 ----


1. Linear Algebra
☑ Linear Algebra – Khan Academy
☑ Essence of Linear Algebra – 3Blue1Brown
☑ Mathematics for Machine Learning: Linear Algebra – Coursera

---- Month 4 ----


1. Python
☑ Complete Python Bootcamp: Go from Zero to Hero in Python – Udemy
☐ Introduction to Computer Science and Programming Using Python – MITx: 6.00.1x
☐ Introduction to Computational Thinking and Data Science – MITx: 6.00.2x
2. Python Data Science Libraries
☐ Python for Data Science and Machine Learning Bootcamp (Sections 1–13) – Udemy
☐ Python for Data Analysis – eBook

© 2018 BENJAMIN LAU ALL RIGHTS RESERVED


---- Month 5 ----
1. Continue from Month 4
2. Tableau
☐ Tableau 10 A-Z – Udemy

---- Month 6 ----


1. Machine Learning
☐ Machine Learning A-Z: Hands-On Python & R in Data Science – Udemy
☐ Machine Learning by Columbia University – edX
☐ Machine Learning by Stanford University – Coursera

---- Month 7 ----


Continue from Month 6

---- Month 8 ----


1. Deep Learning
☐ Deep Learning A-Z: Hands-On Artificial Neural Networks – Udemy
☐ Deeplearning.ai – Coursera
☐ Course.fast.ai – Web-based learning
☐ Deep Learning by Ian Goodfellow et al. – eBook

---- Month 9 ----


Continue from Month 8

---- Month 10 ----


1. Artificial Intelligence
☐ Introduction to Computer Vision by Georgia Institute of Technology – Udacity
☐ Convolutional Neural Networks for Visual Recognition – Stanford CS231n
☐ Natural Language Processing with Deep Learning – Stanford CS224n

---- Month 11 ----


Continue from Month 10

---- Month 12 ----


1. Kaggle Competition
2. GitHub

Appendix


Maths Fundamentals
I. Statistics
i. Descriptive Statistics
ii. Inferential Statistics
iii. Bayesian Statistics
II. Probability
III. Calculus
i. Multivariable Calculus
ii. Optimization
IV. Linear Algebra
i. Vector/Matrix Operations
ii. Linear Transformations/Linear Equations
iii. Coordinate Systems
iv. Eigenvectors, Eigenvalues
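
As a small illustration of the last linear algebra topic, the sketch below computes eigenvalues and eigenvectors with NumPy and checks the defining relation A v = λ v. The matrix is an arbitrary symmetric example chosen for this sketch, not something taken from the courses listed above.

```python
import numpy as np

# Arbitrary symmetric 2x2 matrix, chosen only for illustration.
A = np.array([[2.0, 1.0],
              [1.0, 2.0]])

# eigh is intended for symmetric (Hermitian) matrices and returns
# eigenvalues in ascending order, with eigenvectors as columns.
eigenvalues, eigenvectors = np.linalg.eigh(A)

# Each column v satisfies A @ v == lambda * v (up to rounding).
for lam, v in zip(eigenvalues, eigenvectors.T):
    assert np.allclose(A @ v, lam * v)

print(eigenvalues)
```

For a general (non-symmetric) matrix, `np.linalg.eig` would be the usual choice instead of `eigh`.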

Essential Python Libraries


I. NumPy
II. Pandas
III. SciPy
IV. Matplotlib
V. Seaborn
VI. Scikit-Learn
VII. Keras
VIII. TensorFlow
IX. NLTK
X. Statsmodels

Data Science Process


I. Data Collection
II. Data Pre-processing
i. Missing Data
ii. Categorical Data
iii. Data Transformation
iv. Feature Selection
v. Training and Testing Sets
III. Data Visualisation/Exploration
i. Python
o Matplotlib
o Seaborn
ii. Tableau
iii. Exploratory Analysis
IV. Model Development
i. Model Selection
ii. Model Training
iii. Model Validation
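
The process outlined above might look roughly like the following with pandas and scikit-learn. This is a minimal sketch under stated assumptions: the data is synthetic, the column names (`age`, `city`, `bought`) are hypothetical, and logistic regression is just one possible model. Feature selection and visualisation are omitted to keep it short.

```python
import numpy as np
import pandas as pd
from sklearn.compose import ColumnTransformer
from sklearn.impute import SimpleImputer
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import OneHotEncoder, StandardScaler

# Data collection: a synthetic table stands in for a real source (assumption).
rng = np.random.default_rng(0)
n = 200
df = pd.DataFrame({
    "age": rng.normal(40, 10, n),
    "city": rng.choice(["A", "B", "C"], n),
})
df["bought"] = (df["age"] + rng.normal(0, 5, n) > 40).astype(int)
df.loc[df.index[::20], "age"] = np.nan  # inject some missing values

# Training and testing sets.
X, y = df[["age", "city"]], df["bought"]
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0, stratify=y
)

# Pre-processing: impute and scale the numeric column, one-hot encode the categorical one.
numeric = Pipeline([
    ("impute", SimpleImputer(strategy="median")),
    ("scale", StandardScaler()),
])
preprocess = ColumnTransformer([
    ("num", numeric, ["age"]),
    ("cat", OneHotEncoder(handle_unknown="ignore"), ["city"]),
])

# Model training and validation on the held-out test set.
model = Pipeline([("prep", preprocess), ("clf", LogisticRegression())])
model.fit(X_train, y_train)
accuracy = model.score(X_test, y_test)
print(f"test accuracy: {accuracy:.2f}")
```

Wrapping the pre-processing and the model in one `Pipeline` means the imputer and scaler are fitted on the training set only, which avoids leaking test-set statistics into training.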

Machine Learning
Supervised Learning:
I. Linear Regression
II. Logistic Regression
III. Support Vector Machine
IV. Decision Tree
V. Random Forest
VI. K-Nearest Neighbours
VII. Kernel
VIII. Naïve Bayes
Unsupervised Learning:
I. K-Means Clustering
II. Hierarchical Clustering
Model Selection:
I. Evaluating Model Performance
II. Cross-validation
III. Dimensionality Reduction
- Principal Component Analysis (PCA)
- Kernel PCA
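
To make the model selection topics concrete, the sketch below combines cross-validation with PCA in scikit-learn. The Iris dataset, the two retained components, and the logistic regression classifier are all choices made for this illustration, not prescribed by the curriculum.

```python
from sklearn.datasets import load_iris
from sklearn.decomposition import PCA
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# Small built-in dataset: 150 samples, 4 features, 3 classes.
X, y = load_iris(return_X_y=True)

# Standardise, project onto 2 principal components, then classify.
model = make_pipeline(
    StandardScaler(),
    PCA(n_components=2),
    LogisticRegression(max_iter=1000),
)

# 5-fold cross-validation: each fold is held out once for scoring.
scores = cross_val_score(model, X, y, cv=5)
print(scores.mean())
```

Swapping `PCA` for `KernelPCA` (also in `sklearn.decomposition`) would give the kernel variant listed above.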
Deep Learning
I. Artificial Neural Networks
II. Convolutional Neural Networks
III. Recurrent Neural Networks
Artificial Intelligence
I. Natural Language Processing
II. Computer Vision
