
Last updated 1/24/2020

To download: Toolbar > File > Download > (Desired File Type)
Stage & Steps

Business & Data Understanding
☐ Understand the data, your questions, and your goals
• Are you simply exploring the data?

I. Import Data & Libraries
☐ Download the data and make it available in your coding environment

II. Exploratory Data Analysis
☐ Check for duplicates
☐ Separate Data Types (take an inventory of what data types you have)
☐ Initial Data Cleaning
• Clean anything that would prevent you from exploring the data
☐ Visualize & Understand
• Understand how your data is distributed (numerical & categorical)
☐ Assess Missing Values (don't fill/impute yet!)
• The goal here is to figure out your strategy for dealing with missing values

III. Train/Test Split
☐ Set aside some data for testing
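The EDA and split steps above can be sketched with pandas and scikit-learn; the toy DataFrame and its column names are invented purely for illustration:

```python
import pandas as pd
from sklearn.model_selection import train_test_split

# Tiny stand-in dataset; real data would come from a file or database
df = pd.DataFrame({
    "age": [25, 32, 25, 47, None, 52],
    "city": ["NY", "SF", "NY", "LA", "SF", "LA"],
    "target": [0, 1, 0, 1, 0, 1],
})

# II. Exploratory Data Analysis
df = df.drop_duplicates()                        # pure duplicates add no information
numerical = df.select_dtypes(["float", "int"])   # inventory of numeric columns
categorical = df.select_dtypes(["object"])       # inventory of categorical columns
missing = df.isna().any()                        # which columns have missing values?

# III. Train/Test Split (hold data out BEFORE imputing, to mimic real conditions)
train, test = train_test_split(df, test_size=0.2, random_state=42)
```

With five rows left after dropping the one duplicate, `test_size=0.2` holds out a single row for testing.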

IV. Prepare for ML
☐ Deal with Missing Data (many options)
• Mean/Median/Mode
☐ Feature Engineering
• What columns/features can you make to add value & information to your model?
☐ Transform Data
• Numerical
☐ Feature Selection
• Numerical: Correlation (Pearson or Spearman) or ANOVA
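Step IV might look like this in scikit-learn; the mean strategy and the toy columns are illustrative choices, not the only options:

```python
import numpy as np
import pandas as pd
from sklearn.impute import SimpleImputer
from sklearn.preprocessing import StandardScaler

train = pd.DataFrame({"age": [25.0, np.nan, 47.0, 52.0],
                      "income": [40.0, 55.0, 80.0, 90.0]})

# Deal with missing data: fill numeric gaps with the column mean (one of many options)
imputer = SimpleImputer(strategy="mean")
filled = pd.DataFrame(imputer.fit_transform(train), columns=train.columns)

# Transform data: put numeric features on a common scale
scaled = pd.DataFrame(StandardScaler().fit_transform(filled), columns=filled.columns)

# Feature selection: absolute Pearson correlation between numeric features
corr = filled.corr().abs()
```

Fitting the imputer and scaler on training data only (then reusing them on the test set) keeps the test set untouched, in line with splitting before imputing.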
V. Pick Your Models
• Some regression examples:
- Linear Regression

VI. Model Selection
☐ Pick one algorithm via some form of Cross-Validation

VII. Model Tuning
☐ Tune model hyperparameters
• Ideally use Cross-Validation again to choose your hyperparameters

VIII. Pick the Best Model
☐ Pick the model that performed the best, and you're done!
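Steps VI and VII can be sketched as below; the two candidate models and the `C` grid are arbitrary examples on synthetic data:

```python
from sklearn.datasets import make_classification  # synthetic stand-in data
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import GridSearchCV, cross_val_score
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=200, random_state=0)

# VI. Model Selection: compare algorithms via cross-validation
candidates = {"logreg": LogisticRegression(max_iter=1000),
              "tree": DecisionTreeClassifier(random_state=0)}
scores = {name: cross_val_score(model, X, y, cv=5).mean()
          for name, model in candidates.items()}

# VII. Model Tuning: grid-search hyperparameters, again with cross-validation
grid = GridSearchCV(LogisticRegression(max_iter=1000),
                    {"C": [0.01, 0.1, 1.0, 10.0]}, cv=5)
grid.fit(X, y)
best_model = grid.best_estimator_
```

`RandomizedSearchCV` (listed under Useful Functions below) is a drop-in alternative to the grid search when the hyperparameter space is large.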
ADDITIONAL INFO

• Business & Data Understanding: Get a Data Dictionary or schema if possible, and understand what rows represent in your data.
• Import Data & Libraries: Import important libraries (pandas, numpy, matplotlib, seaborn, datetime), then import others as needed.
• Check for duplicates: We don't need to keep any rows that are pure duplicates of each other.
• Separate Data Types: Numerical (e.g., Discrete)
• Initial Data Cleaning: Examples of things to consider: are there categorical columns that should be numerical?
• Visualize & Understand: Some ideas for numerical data: Histograms & Scatter Plots.
• Assess Missing Values: Things to consider: how many missing values per row?
• Train/Test Split: Depending on the size of your data, the training share can be anywhere between 80-90%.
• Deal with Missing Data: We deal with missing data after we've split because we want to simulate real-world conditions when we test, as much as we can.
• Feature Engineering: Some ideas: Aggregations (across groups or dates).
• Transform Data: Considerations: Numerical
• Feature Selection: Reducing the dimensionality of your data can not only improve runtime but also the quality of your predictions. Highly correlated or low-variance features might work against you.
• Pick Your Models: Go wild.
• Model Selection: Cross-validation is a great way to estimate how your models will perform out in the wild.
• Model Tuning: Some examples you can use: Grid Search.
• Pick the Best Model: Woohoo!
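The "Aggregations (across groups or dates)" feature-engineering idea can be sketched with a pandas groupby; the store/revenue columns are invented for illustration:

```python
import pandas as pd

sales = pd.DataFrame({"store": ["A", "A", "B", "B", "B"],
                      "revenue": [10.0, 14.0, 20.0, 22.0, 18.0]})

# Aggregate across a group, then broadcast the result back as a new feature
store_mean = sales.groupby("store")["revenue"].transform("mean")
sales["store_mean_revenue"] = store_mean
sales["vs_store_mean"] = sales["revenue"] - store_mean
```

Each row now carries its store's average revenue and its deviation from that average, both usable as model features.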
Created by Patrick de Guzman
http://patrickdeguzman.me/
Useful Functions/Methods

• pd.concat
• pd.merge
• df.drop_duplicates()
• df.select_dtypes(['object', 'bool'])
• df.select_dtypes(['float', 'int'])
• pd.Series.str.replace()
• pd.Series.astype()
• pd.Series.value_counts()
• seaborn.distplot()
• df.isna().any()
• df.drop()
• sklearn.model_selection.train_test_split
• sklearn.model_selection.StratifiedShuffleSplit
• sklearn.impute.SimpleImputer
• sklearn.impute.IterativeImputer
• sum
• mean
• sklearn.preprocessing.StandardScaler
• sklearn.preprocessing.MinMaxScaler
• df.corr().abs()
• sklearn.feature_selection.VarianceThreshold

• sklearn.model_selection.train_test_split
• sklearn.model_selection.KFold
• sklearn.model_selection.GridSearchCV
• sklearn.model_selection.RandomizedSearchCV
