
Principal Component Analysis
An introduction to dimensionality reduction.

-Sahil Imani
Some prerequisites before getting into PCA
 Origins of PCA
 Importance of variance in data and information entropy
 What do we mean by dimensions?
 Why do we need to reduce dimensions?
 The logic behind PCA and a visual explanation
PCA: Origins
 Comes from statistics, as part of factor analysis and dimensionality reduction (feature extraction).
 Is NOT a machine learning technique by itself.
 The goal of data analysis is generally to make "sense" of the data.
 It is done in three iterative steps (Clean, Reduce, Transform), repeated until we reach an acceptable level.
Video taken from the Computerphile YouTube channel.
Importance of variance in data and information entropy
 Information entropy essentially tells us the rate of information generation from a stochastic process.
 It gives us a relation between information gain and uncertainty.
 The greater the uncertainty, the more information is transferred/gained when the outcome is observed (see the formula sketch below).
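As a minimal sketch, here is the standard Shannon entropy formula (stated for completeness, not from the original slides): for a discrete random variable X with outcome probabilities p(x),

$$
H(X) = -\sum_{x} p(x)\,\log_2 p(x)
$$

A fair coin (p = 0.5 for each side) has H = 1 bit, the maximum for two outcomes; a heavily biased coin has lower entropy because its outcome is less uncertain.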
Dimensionality
 In data analysis, the number of attributes or features that determine the final output of a data-driven decision is known as its dimensionality.
 The more attributes we use to describe something, the more "dimensions" it has.
 For more than three dimensions, however, it becomes impossible to visualize the data on a 2D plane, which is why we need to reduce/project it to a lower dimension while retaining most of the information.
Need to reduce dimensionality
 Helps with data visualization.
 Makes calculations faster; the subsequent machine learning stage needs less data to work with for the same amount of information.
 Reduces the data set so we can start drawing conclusions.
 Optimizes the data for use in actual machine learning or statistical modelling.
PCA: The logic behind it and a visual explanation
 There are common examples of dimensionality reduction in everyday life.
 Some dimensions/factors contain much more information than others.
 If we can find the principal, or "important", dimensions, we can discard the ones that don't contribute much, as well as some highly correlated dimensions.
 This is the logical basis for PCA.
 Visually (in two dimensions), we can see it as trying to fit a line along the direction of maximum variance.
 That line will be a linear combination of both dimensions (made precise in "The Math Powering It" below).
A 2D visualization of a data
set having two attributes.
The Math Powering It
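A minimal sketch of the standard formulation, in generic notation (not taken from the original slides): given a mean-centered data matrix $X \in \mathbb{R}^{n \times d}$ (n samples, d attributes), the covariance matrix is

$$
C = \frac{1}{n-1} X^\top X
$$

The first principal component is the unit vector that maximizes the variance of the projected data,

$$
w_1 = \arg\max_{\|w\| = 1} \; w^\top C w,
$$

which turns out to be the eigenvector of $C$ with the largest eigenvalue. The remaining components are the other eigenvectors, ordered by decreasing eigenvalue, and each eigenvalue equals the variance captured along its component.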
Programming Implementation
 The basic flow is as follows (a NumPy sketch appears after this list):
 Find the eigenvalues and eigenvectors of the covariance matrix of the attributes.
 Sort the eigenvectors by their eigenvalues, from largest to smallest.
 Discard trailing principal components as long as we stay within the amount of information (variance) we need to retain.
 Reproject the data using the reduced dimensions.
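A minimal NumPy sketch of this flow (the function name, variable names, and the 95% variance threshold are illustrative choices, not from the original slides):

```python
import numpy as np

def pca(X, var_to_keep=0.95):
    """Reduce X (n_samples x n_features), keeping var_to_keep of the variance."""
    # Center the data so the covariance matrix is meaningful.
    X_centered = X - X.mean(axis=0)

    # Covariance matrix of the attributes (features).
    cov = np.cov(X_centered, rowvar=False)

    # Eigenvalues/eigenvectors; eigh suits symmetric matrices like cov.
    eigvals, eigvecs = np.linalg.eigh(cov)

    # Sort from largest to smallest eigenvalue.
    order = np.argsort(eigvals)[::-1]
    eigvals, eigvecs = eigvals[order], eigvecs[:, order]

    # Keep just enough components to retain the requested variance.
    explained = np.cumsum(eigvals) / eigvals.sum()
    k = np.searchsorted(explained, var_to_keep) + 1

    # Reproject the data onto the top-k principal components.
    return X_centered @ eigvecs[:, :k]

# Example: 200 strongly correlated 2D points collapse to 1 dimension.
rng = np.random.default_rng(0)
x = rng.normal(size=200)
X = np.column_stack([x, 2 * x + rng.normal(scale=0.1, size=200)])
print(pca(X).shape)  # (200, 1) -- one component captures nearly all the variance
```

In the example, the two attributes are almost perfectly correlated, so the first eigenvector alone exceeds the 95% variance threshold and the second component is discarded.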
Thank You
