Sunteți pe pagina 1din 13

Whatsapp Now

LEARNBAY

JOB ORIENTED DATA SCIENCE


CERTIFICATION COURSE

Real Time Project Experience And


Placement Assistance Program
Course Features:
Live Classroom data science training in Bangalore by industry experts.
Classes with 40% theory and 60% hands on
Trainers having more than 10+ years of experience in multiple domains like
finance,Healthcare ,Retails and graduate from premier institute like IIT/NIT/BITS
Practical Approach With Real Time Projects And Case studies.
Job Assistance And Placement Support After end of Course.
Classroom And instructor led online Training - Both Options Available

Watch Sample Class Recordings

Who Should Attend:


Those who want to become master in data science and Data Analytics.
Business Analysts who want to learn machine learning
Data Analysts who wants to improve their skills.
Developers aspiring to become data scientist.
Freshers/Experienced Professional,Managers,IT professional.

Course Overview:
Python for Data Science
R Programming
Introduction to SQL
Advance Statistics
Apache Spark/Hadoop
Machine Learning With Real Time Project
Deep Learning And Tensor flow
Real Time Project And Resume Prep Session
Interview Guidance And Job Assistance

***************************************************************************
Duration Of Course - 24 Weeks Course(Around 6 months)
-Weekend Only
(4 Hours Saturday And Sunday) - 200 Hours including Projects
****************************************************************************
Course Fee: RS. 48,000/- (0% EMI Available on Major Credit Cards)

Whatsapp Now For Applicable Discount Coupon


Course Designed For Working Professional
Certification Program to Start Your Career in
Data Science Domain
Real Time Project

Work On Real Time


Projects From Multiple
Domain With Industry
Expert

Download List Of Projects

Click To Watch Project Session Recordings

Job Assistance
Resume Prep Session
Mock Interview And Get
Referral in Companies For
Data Science Roles

Click To Read Google Reviews 


How Job Assistance Works

For Any Queries About Placement Assistance or Real Time Project ,Feel
Free to Chat on Whatsapp Now or Schedule Telephonic Counselling
Session 

Click Here To Read Course FAQ

Schedule A Telephonic Session  Live Chat On Whatsapp


Table Of Contents With Duration

INTRODUCTION TO DATA SCIENCE:


What is data Science? - Introduction.
Importance of Data Science.
Demand for Data Science Professional.
Brief Introduction to Big data and Data Analytics.
Lifecycle of data science.
Tools and Technologies used in data Science.
Business Intelligence vs Data Science.
Role of a data scientist.

PART A – R PROGRAMMING BASICS


(3 Weeks : 24 hours)

1. R Basics, background
2. Getting familiar with basics
Comprehensive R Archive Network
Operators in R – Arithmetic,Relational,Logical
Demo of Installing R On windows from CRAN
and Assignment Operators
Website
Variables,Types Of Variables,Using variables
Installing R Studios on Windows OS
Conditional statements,ifelse(),switch
Setting Up R Workspace.
Loops: For Loops,While Loops,Using Break
Getting Help for R-How to use help system
statement,Switch
Installing Packages – Loading And Unloading
Packages

3. The R Programming Language- Data 4. Functions And Importing data into R


Types Function Overview – Naming Guidelines
creating data objects from the keyword. Arguments Matching,Function with Multiple
How to make different type of data objects. Arguments
Types of data structures in R Additional Arguments using Ellipsis,Lazy Evaluation
Arrays And Lists- Create Access the elements Multiple Return Values
Vectors – Create Vectors,Vectorized Function as Objects,Anonymous Functions
Operations,Power of Vectorized Operations Importing and exporting Data into R- importing
Matrices- Building the first matrices,Matrix from files like excel,csv and minitab.
Operations,Subsetting,visualising subset Import from URL and excel Files
Data Frames- create and filter data Import from database.
frames,Building And Merging data frames.

5. Data Descriptive 6. Graphics in R – Types of graphics


Statistics,Tabulation,Distribution Bar Chart,Pie Chart,Histograms- Create and edit.
Summary Statistics for Matrix Objects. apply() Box Plots- Basics of Boxplots- Create and Edit
Command. Converting an Object into a Table Visualisation in R using ggplot2.
Histograms, Stem and Leaf Plot, Density More About Graphs: Adding Legends to Graphs,
Function.Normal Distribution Adding Text to Graphs, Orienting the Axis Label.
PART B – INTRODUCTION TO SQL (1 Week - 8 hours)

1. Introduction to SQL Server and RDBMS 2. SQL Operations


Covers an overview of using relational databases. Single Table Queries - SELECT,WHERE,ORDER
You’ll learn basic terminology used in future BY,Distinct,And ,OR
modules,SQL Server Management Studio is the Multiple Table Queries: INNER, SELF, CROSS,
primary tool used to create queries and and OUTER,oin, Left Join, Right Join, Full Join,
manage objects in SQL Server databases Union and MANY MORE…..

3. SQL Advance -Operations


Data Aggregations and summarizing the data
Ranking Functions: Top-N Analysis
Advanced SQL Queries for Analytics

PART C- PYTHON FOR DATA SCIENCE


(4 Weeks : 32 hours)

1. Python Programming Basics 2. Making Decisions And Loop Control


Installing Jupyter Notebooks Simple if Statement,if-else Statement
Python Overview if-elif Statement.
Python 2.7 vs Python 3 Introduction To while Loops.
Python Identifiers Introduction To for Loops,Using continue
Various Operators and Operators Precedence and break,
Getting input from User,Comments,Multi line
Comments.

4. Functions And Modules


3. Python Data Types: List,Tuples,Dictionaries
Introduction To Functions – Why
Python Lists,Tuples,Dictionaries
Defining Functions
Accessing Values
Calling Functions
Basic Operations
Functions With Multiple Arguments.
Indexing, Slicing, and Matrixes
Anonymous Functions - Lambda
Built-in Functions & Methods
Using Built-In Modules,User-Defined
Exercises on List,Tuples And Dictionary
Modules,Module Namespaces,
Iterators And Generators

5. File I/O And Exceptional Handling


6. Numpy
Opening and Closing Files
Introduction to Numpy. Array
open Function,file Object Attributes
Creation,Printing Arrays
close() Method ,Read,write,seek.Exception
Basic Operations- Indexing, Slicing and
Handling,the try-finally Clause
Iterating
Raising an Exceptions,User-Defined Exceptions
Shape Manipulation - Changing
Regular Expression- Search and Replace
shape,stacking and spliting of array
Regular Expression Modifiers
Vector stacking
Regular Expression Patterns,re module
7. Pandas And Matplotlib 7. Case Studies Using Numpy,Pandas
Introduction to Pandas
Importing data into Python 3 Case Studies on Numpy,Pandas And
Pandas Data Frames,Indexing Data Matplotlib
Frames ,Basic Operations With Data
frame,Renaming Columns,Subletting and
filtering a data frame.
Matplotlib - Introduction,plot(),Controlling
Line Properties,Working with Multiple
Figures,Histograms

PART D- INTRODUCTION TO STATISTICS


(4 Week - 32 hours)

1. Fundamentals of Math and Probability 2 Descriptive Statistics


Basic understanding of linear algebra, Matrics, Describe or sumarise a set of data
vectors Measure of central tendency and measure of
Addition and Multimplication of matrics dispersion.
Fundamentals of Probability The mean,median,mode, curtosis and skewness
Probability distributed function and cumulative Computing Standard deviation and Variance.
distributed function. Types of distribution.
Class Hand-on Class Handson:
Problem solving using R for vector manupulation 5 Point summary BoxPlot
Problem solving for probability assignments Histogram and Bar Chart
Exploratory analytics R Methods

3. Inferential Statistics
What is inferential statistics 4. Hypothesis Testing
Different types of Sampling techniques Hypothesis Testing
Central Limit Theorem Basics of Hypothesis Testing
Point estimate and Interval estimate Type of test and Rejection Region
Creating confidence interval for population parameter Type o errors-Type 1 Errors,Type 2 Errors
Characteristics of Z-distribution and T-Distribution P value method,Z score Method
Basics of Hypothesis Testing
Type of test and rejection region
Type of errors in Hypothesis resting, Type-l error and
Type-ll errors
P-Value and Z-Score Method
T-Test, Analysis of variance(ANOVA) and Analysis of
Co variance(ANCOVA)
Regression analysis in ANOVA
Class Hands-on:
Problem solving for C.L.T
Problem solving Hypothesis Testing
Problem solving for T-test, Z-score test
Case study and model run for ANOVA, ANCOVA
PART E – UNDERSTANDING AND IMPLEMENTING MACHINE LEARNING
(6 Week - 48 hours)

1. Introduction To Machine Learning 2. Linear Regression


What is Machine Learning? Introduction to Linear Regression
What is the Challenge? Linear Regression with Multiple Variables
Introduction to Supervised Learning,Unsupervised Disadvantage of Linear Models
Learning Interpretation of Model Outputs
What is Reinforcement Learning? Understanding Covariance and Colinearity
Understanding Heteroscedasticity
Case Study – Application of Linear
Regression for Housing Price Prediction

4. Decision Trees And Supervised Learning


3. Logistic Regression
Decision Tree – data set
Introduction to Logistic Regression.– Why Logistic
How to build decision tree?
Regression .
Understanding Kart Model
Introduce the notion of classification
Classification Rules- Overfitting Problem
Cost function for logistic regression
Stopping Criteria And Pruning
Application of logistic regression to multi-class
How to Find final size of Trees?
classification.
Model A decision Tree.
Confusion Matrix, Odd's Ratio And ROC Curve
Naive Bayes
Advantages And Disadvantages of Logistic
Random Forests and Support Vector Machines
Regression.
Interpretation of Model Outputs
Case Study:To classify an email as spam or not
Case Study:
spam using logistic Regression.
1 Business Case Study for Kart Model
2 Business Case Study for Random Forest
3 Business Case Study for SVM

5. Unsupervised Learning
6. Introduction to Deep Learning
Hierarchical Clustering
INeural Network
k-Means algorithm for clustering – groupings
Understaing Neural Network Model
of unlabeled data points.
Understanding Tuning of Neural Network
Principal Component Analysis(PCA)- Data
Independent components analysis(ICA)
Case Study:
Anomaly Detection
Case study using Neural Network
Recommender System-collaborative filtering
algorithm
Case Study– Recommendation Engine for e-
commerce/retail chain
7. Natural language Processing 8. Apache Spark Analytics
Introduction to natural Language What is Spark
Processing(NLP). Introduction to Spark RDD
Word Frequency Algorithms for NLP Introduction to Spark SQL and Dataframes
Sentiment Analysis Using R-Spark for machine learning
Case Study : Hands-on:
Twitter data analysis using NLP installation and configuration of Spark
Hands on Spark RDD programming
Hands on of Spark SQL and Dataframe programming
Using R-Spark for machine learning programming

9. Introduction to Tableau/Spotfire
Connecting to data source
Creating dashboard pages
How to create calculated columns
Different charts
Hands-on:
Hands on on connecting data source and
data clensing
Hands on verious charts
Hands on deployment of Predictive model in
visualisation

PART F – Tensor Flow And Deep Learning


( 2 Week : 20 hours)
1. Introduction to Deep Learning And Tensor
2.Convolutional Neural Network(CNN)
Flow
Convolutional Layer Motivation
Neural Network
Convolutional Layer Application
Understaing Neural Network Model
Architecture of a CNN
Installing TensorFlow
Pooling Layer Application
Simple Computation ,Contants And Variables
Deep CNN
Types of file formats in TensorFlow
Understanding and Visualizing a CNN.
Creatting A Graph – Graph Visualization
Creating a Model – Logistic Regression Model
Building using tensor flow
TensorFlow Classification Examples

3.Understanding Of TFLearn APIs


Getting Started With TFLearn
High-Level API usage -Layers,
Built-in Operations,Training and Evaluatiion-Customizing the
Training Process,Visualization APIs
Sequential And Functional Composition
Fine tuning,
Using TensorBoard with TFLearn
Projects And Case Studies
Building a CNN for Image Classification
PART G - Real Time Projects List
Projects From Retail ,Banking ,Finance ,Insurance ,Sales,Marketing ,Healthcare
,Manufacturing .
You Are Supposed to work at-least on 2 Realtime project

Project 1 : Marketing Domain


Project 2 : Banking Domain
Customer Conversion / Segmentation
Credit Risk Analytics
Problem: A bank Facing Challenges With
Problem: efficiently build or validate in-
Lead Conversion
house models for credit risk management.
Description:
Description:
Identify the leads' segments having a
Create a classifier that leverages financial
higher conversion ratio (lead to buying a
information from bank accounts to
product) so that organisation can
estimate customer risk.
specifically target these potential
customers through additional channels
and re-marketing

Project 3 : Project on Natural Project 4 : Price Analytics


Language Procession Description:
Problem : training a machine learning Creating auto calculating pricing model
model that classifies a given line of text as Problem: build an algorithm that
belonging to one of the books/Articles. automatically suggests the right product
developing a machine learning model prices
(deep learning preferred) for the same.

Project 5 : Classifying Loan Project 6 : Identify And Predict


Application Customer churn in telecom industry
Problem : Work With credit dataset using Description:Understand the customer
classification techniques like Decision behavior and reasons for churn.Apply
Tree, Neural Networks etc to classify loan multiple classification models to predict
applications the customer churn in telecom industry

Project 7 :Retail Domain


Project 8 : Predict Credit Default
Coupon Purchase Prediction Project
Description:predict borrowers chance of
Description:Understand Retail
defaulting on credit loans by building a
Transactional Data set And Using past
credit score prediction model.Develop a
purchase and browsing behavior of
good prediction model for a bank so that
customers ,create a machine learning
they can provide maximum credit to
model which Predict which coupons a
individual without exceeding the risk
customer will buy in a given period of
threshold.
time.

Project 9 : Manufacturing And Production Project 10 : Insurance Purchase


Predict Internal Failures Using Production Prediction
Line Dataset Description:Predicting which insurance
Description:Understanding about option the customer will choose.Building
Manufacturing domain and its failures. machine learning models and Using a
Use production line dataset to predict internal customer’s shopping history, can you
failures using thousands of measurements/tests predict what policy they will end up
made for each component along the assembly choosing?
line
Job Readiness Program (15 Hours)

Resume Preparation Session (4 Hours)


Expert guidance for writing a
resume for data scientist Role

Preparing Project For interviews( 4 Hours)


Will help you to prepare and writing project
description in your resume

Interview Guidance And Prep Session(6 hours)


6 hours of interview readiness session to help you to
prepare for interviews

One on One Mock Interviews(1 Hour)

Still have Queries/Concerns regarding the course,Read FAQ or Contact with


our Course Manager/Counsellor

Read Course FAQ Live Chat on Whasapp

S-ar putea să vă placă și