Bine ați venit la Scribd!

Q530 KK

Încărcat de

0% au considerat acest document util (0 voturi)

40 vizualizări10 pagini

K-Means the principle is to minimize the sum of squares of distances between data and the corresponding cluster centroids. K Nearest Neighbors The classification is using majority vote.

Descriere originală:

Titlu original

Q530_kk

Drepturi de autor

Formate disponibile

PDF, TXT sau citiți online pe Scribd

Partajați acest document

Partajați sau inserați document

Opțiuni de partajare

Vi se pare util acest document?

Este necorespunzător acest conținut?

Raportați acest document

K-Means the principle is to minimize the sum of squares of distances between data and the corresponding cluster centroids. K Nearest Neighbors The classification is using majority vote.

Drepturi de autor:

Attribution Non-Commercial (BY-NC)

Formate disponibile

Descărcați ca PDF, TXT sau citiți online pe Scribd

Indicator pentru conținut neadecvat

0% au considerat acest document util (0 voturi)

40 vizualizări10 pagini

Q530 KK

Încărcat de

Sumedh Kakde

K-Means the principle is to minimize the sum of squares of distances between data and the corresponding cluster centroids. K Nearest Neighbors The classification is using majority vote.

Drepturi de autor:

Attribution Non-Commercial (BY-NC)

Formate disponibile

Descărcați ca PDF, TXT sau citiți online pe Scribd

Indicator pentru conținut neadecvat

Salt la pagina

Sunteți pe pagina 1din 10

Căutați în document

K-Means & K Nearest Neighbors

Chen Yu Indiana University

K-Means

K-Means
The algorithm can group your data into k number of categories. The principle is to minimize the sum of squares of distances between data and the corresponding cluster centroids.

Example
arousal(-5 to 5) valance 3 3 -1 -4 2 3 0 -5 We know that those data belong to two clusters. The question is how to determine which data points belong to cluster 1 and which belong to the other one.

Example

K-Mean Algorithm
Repeat the following three steps until convergence (stable): Step 1: determine the centroid coordinates Step 2: determine the distances of each data point to the centroids. Step 3: group the data points based on minimum distance.

Example
Initialize the first two centroids c1=(3,3) c2=(2,3) 3 3 -1 -4 2 3 0 -5

Measuring distances
Calculate the distance between two data items. Euclidean distance

Iteration 1: calculate distances

3 -1 2 0 3 -4 3 -5 c1=(3,3) 0 8.1 1 8.5 c2=(2,3) 1 7.6 0 8.2

Iteration 1: assign clusters

3 -1 2 0 3 -4 3 -5 c1=(3,3) 0 8.1 1 8.5 c2=(2,3) 1 7.6 0 8.2

Iteration 1: compute new centroids

c1=(3,3) c2=(0.3,-2)
3 -1 2 0 3 -4 3 -5 0 8.1 1 8.5 1 7.6 0 8.2

Iteration 2: calculate distances

3 -1 2 0 3 -4 3 -5 c1=(3,3) 0 8.1 1 8.5 c2=(0.3,-2) 5.7 2.4 5.3 3.0

Iteration 2: assign clusters

3 -1 2 0 3 -4 3 -5 c1=(3,3) 0 8.1 1 8.5 c2=(0.3,-2) 5.7 2.4 5.3 3.0

Iteration 2: compute new centroids

3 -1 2 0 3 -4 3 -5 c1=(2.5,3) 0 8.1 1 8.5 c2=(-0.5,-4.5) 5.7 2.4 5.3 3.0

Iteration 3
3 -1 2 0 3 -4 3 -5 c1=(2.5,3) 0.5 7.8 0.5 8.3 c2=(-0.5,-4.5) 8.2 0.7 7.9 0.7

The new centroids will be the same!

function c = kmean(k,data) [ndat ndim] = size(data); % initilize the centra point r = randperm(ndat); c(1:k,:) = data(r(1:k),:);

Matlab Example

ctemp = zeros(size(c)); cluster = zeros(size(data,1)); it = 0; while (c ~= ctemp) it = it + 1; fprintf(1,'iteration %d: (%6.3f,%6.3f),(%6.3f,%6.3f),(%6.3f,%6.3f)\n', it, c(1,1),c(1,2),c(2,1),c(2,2),c(3,1),c(3,2)); plot(data(:,1),data(:,2),'.r',c(:,1),c(:,2),'*b') pause; ctemp = c; dist = distance(data',c'); [non cluster] = min(dist,[],2); for i = 1 : k c(i,:) = mean(data(find(cluster == i),:)); end; end;

Classification
Supervised: given training samples, classify a new instance. Features: attributes of an object that can be represented a multidimensional space. e.g. object categorization, face recognition, emotion recognition, etc.

K Nearest Neighbors
The classification is using majority vote. The classifiers do not use any model to fit the data and only based on memory. The KNN uses neighborhood classification as the predication value of the new query instance.

Example
Two features: arousal(-5 to 5) valance emotion 3 3 happy -1 -4 sad 2 3 happy 0 -5 sad Now given a new instance (3,4), what category is this new data point in?

How KNN works?

The general principle is to find the k training samples to determine the knearest neighbors based on a distance measure. Next, the majority of those k nearest neighbors decide the category of the next instance.

Example

Finding K-nearest neighbors

Sort the distances of all training samples to the new instance and determine the k-th minimum distance.

Classification
Find the majority of the category of k nearest neighbors.

Summary
Step 1: determine k Step 2: calculate the distances between the new input and all the training data Step 3: sort the distance and determine k nearest neighbors based on the k-th minimum distance. Step 4: gather the categories of those neighbors. Step 5: determine the category based on majority vote.

Example
arousal(-5 to 5) 3 -1 2 0 (3,4) = ? valance 3 -4 3 -5 emotion happy sad happy sad

Example
Step 1: k = 3 Step 2: 3 3 -1 -4 to (3,4) 2 3 0 -5 distance 1 8.9 1.4 9.4

Step 3: ranking
3 2 -1 0 3 3 -4 -5 rank 1 2 3 4 neighbors Y Y Y N

Step 4: categorization
3 2 -1 0 3 3 -4 -5 rank 1 2 3 4 neighbors Y Y Y N category happy happy sad sad

S-ar putea să vă placă și

Fear: Trump in the White House
De la Everand
Fear: Trump in the White House
Bob Woodward
Evaluare: 3.5 din 5 stele
3.5/5 (738)
A Man Called Ove: A Novel
De la Everand
A Man Called Ove: A Novel
Fredrik Backman
Evaluare: 4.5 din 5 stele
4.5/5 (4610)
A Heartbreaking Work Of Staggering Genius: A Memoir Based on a True Story
De la Everand
A Heartbreaking Work Of Staggering Genius: A Memoir Based on a True Story
Dave Eggers
Evaluare: 3.5 din 5 stele
3.5/5 (231)
The Sympathizer: A Novel (Pulitzer Prize for Fiction)
De la Everand
The Sympathizer: A Novel (Pulitzer Prize for Fiction)
Viet Thanh Nguyen
Evaluare: 4.5 din 5 stele
4.5/5 (121)
Grit: The Power of Passion and Perseverance
De la Everand
Grit: The Power of Passion and Perseverance
Angela Duckworth
Evaluare: 4 din 5 stele
4/5 (588)
Never Split the Difference: Negotiating As If Your Life Depended On It
De la Everand
Never Split the Difference: Negotiating As If Your Life Depended On It
Chris Voss
Evaluare: 4.5 din 5 stele
4.5/5 (838)
The Little Book of Hygge: Danish Secrets to Happy Living
De la Everand
The Little Book of Hygge: Danish Secrets to Happy Living
Meik Wiking
Evaluare: 3.5 din 5 stele
3.5/5 (400)
Devil in the Grove: Thurgood Marshall, the Groveland Boys, and the Dawn of a New America
De la Everand
Devil in the Grove: Thurgood Marshall, the Groveland Boys, and the Dawn of a New America
Gilbert King
Evaluare: 4.5 din 5 stele
4.5/5 (266)
The Subtle Art of Not Giving a F*ck: A Counterintuitive Approach to Living a Good Life
De la Everand
The Subtle Art of Not Giving a F*ck: A Counterintuitive Approach to Living a Good Life
Mark Manson
Evaluare: 4 din 5 stele
4/5 (5794)
Rise of ISIS: A Threat We Can't Ignore
De la Everand
Rise of ISIS: A Threat We Can't Ignore
Jay Sekulow
Evaluare: 3.5 din 5 stele
3.5/5 (137)
Her Body and Other Parties: Stories
De la Everand
Her Body and Other Parties: Stories
Carmen Maria Machado
Evaluare: 4 din 5 stele
4/5 (821)
A Tree Grows in Brooklyn
De la Everand
A Tree Grows in Brooklyn
Betty Smith
Evaluare: 4.5 din 5 stele
4.5/5 (1929)
The Gifts of Imperfection: Let Go of Who You Think You're Supposed to Be and Embrace Who You Are
De la Everand
The Gifts of Imperfection: Let Go of Who You Think You're Supposed to Be and Embrace Who You Are
Brené Brown
Evaluare: 4 din 5 stele
4/5 (1090)
The World Is Flat 3.0: A Brief History of the Twenty-first Century
De la Everand
The World Is Flat 3.0: A Brief History of the Twenty-first Century
Thomas L. Friedman
Evaluare: 3.5 din 5 stele
3.5/5 (2259)
The Hard Thing About Hard Things: Building a Business When There Are No Easy Answers
De la Everand
The Hard Thing About Hard Things: Building a Business When There Are No Easy Answers
Ben Horowitz
Evaluare: 4.5 din 5 stele
4.5/5 (345)
Shoe Dog: A Memoir by the Creator of Nike
De la Everand
Shoe Dog: A Memoir by the Creator of Nike
Phil Knight
Evaluare: 4.5 din 5 stele
4.5/5 (537)
The Emperor of All Maladies: A Biography of Cancer
De la Everand
The Emperor of All Maladies: A Biography of Cancer
Siddhartha Mukherjee
Evaluare: 4.5 din 5 stele
4.5/5 (271)
The Glass Castle: A Memoir
De la Everand
The Glass Castle: A Memoir
Jeannette Walls
Evaluare: 4.5 din 5 stele
4.5/5 (1713)
Team of Rivals: The Political Genius of Abraham Lincoln
De la Everand
Team of Rivals: The Political Genius of Abraham Lincoln
Doris Kearns Goodwin
Evaluare: 4.5 din 5 stele
4.5/5 (234)
John Adams
De la Everand
John Adams
David McCullough
Evaluare: 4.5 din 5 stele
4.5/5 (2409)
Principles: Life and Work
De la Everand
Principles: Life and Work
Ray Dalio
Evaluare: 4 din 5 stele
4/5 (599)
Yes Please
De la Everand
Yes Please
Amy Poehler
Evaluare: 4 din 5 stele
4/5 (1891)
The Light Between Oceans: A Novel
De la Everand
The Light Between Oceans: A Novel
M.L. Stedman
Evaluare: 4.5 din 5 stele
4.5/5 (789)
Hidden Figures: The American Dream and the Untold Story of the Black Women Mathematicians Who Helped Win the Space Race
De la Everand
Hidden Figures: The American Dream and the Untold Story of the Black Women Mathematicians Who Helped Win the Space Race
Margot Lee Shetterly
Evaluare: 4 din 5 stele
4/5 (895)
Wolf Hall: A Novel
De la Everand
Wolf Hall: A Novel
Hilary Mantel
Evaluare: 4 din 5 stele
4/5 (3811)
The Perks of Being a Wallflower
De la Everand
The Perks of Being a Wallflower
Stephen Chbosky
Evaluare: 4.5 din 5 stele
4.5/5 (2104)
The Woman in Cabin 10
De la Everand
The Woman in Cabin 10
Ruth Ware
Evaluare: 3.5 din 5 stele
3.5/5 (2322)
Elon Musk: Tesla, SpaceX, and the Quest for a Fantastic Future
De la Everand
Elon Musk: Tesla, SpaceX, and the Quest for a Fantastic Future
Ashlee Vance
Evaluare: 4.5 din 5 stele
4.5/5 (474)
Sing, Unburied, Sing: A Novel
De la Everand
Sing, Unburied, Sing: A Novel
Jesmyn Ward
Evaluare: 4 din 5 stele
4/5 (1103)
The Art of Racing in the Rain: A Novel
De la Everand
The Art of Racing in the Rain: A Novel
Garth Stein
Evaluare: 4 din 5 stele
4/5 (4200)
Angela's Ashes: A Memoir
De la Everand
Angela's Ashes: A Memoir
Frank McCourt
Evaluare: 4.5 din 5 stele
4.5/5 (440)
The Constant Gardener: A Novel
De la Everand
The Constant Gardener: A Novel
John le Carré
Evaluare: 3.5 din 5 stele
3.5/5 (104)
The Outsider: A Novel
De la Everand
The Outsider: A Novel
Stephen King
Evaluare: 4 din 5 stele
4/5 (1839)
On Fire: The (Burning) Case for a Green New Deal
De la Everand
On Fire: The (Burning) Case for a Green New Deal
Naomi Klein
Evaluare: 4 din 5 stele
4/5 (74)
Brooklyn: A Novel
De la Everand
Brooklyn: A Novel
Colm Tóibín
Evaluare: 3.5 din 5 stele
3.5/5 (1937)
The Yellow House: A Memoir (2019 National Book Award Winner)
De la Everand
The Yellow House: A Memoir (2019 National Book Award Winner)
Sarah M. Broom
Evaluare: 4 din 5 stele
4/5 (98)
The Unwinding: An Inner History of the New America
De la Everand
The Unwinding: An Inner History of the New America
George Packer
Evaluare: 4 din 5 stele
4/5 (45)
Little Women
De la Everand
Little Women
Louisa May Alcott
Evaluare: 4 din 5 stele
4/5 (104)
Bad Feminist: Essays
De la Everand
Bad Feminist: Essays
Roxane Gay
Evaluare: 4 din 5 stele
4/5 (1016)
Steve Jobs
De la Everand
Steve Jobs
Walter Isaacson
Evaluare: 4.5 din 5 stele
4.5/5 (806)
Manhattan Beach: A Novel
De la Everand
Manhattan Beach: A Novel
Jennifer Egan
Evaluare: 3.5 din 5 stele
3.5/5 (792)
Full Download Ebook Ebook PDF Longitudinal Data Analysis by Donald Hedeker PDF
Document41 pagini
Full Download Ebook Ebook PDF Longitudinal Data Analysis by Donald Hedeker PDF
sarah.steir690
100% (42)
Blended-Learning-Plan-Form (Mathematics 7)
Document2 pagini
Blended-Learning-Plan-Form (Mathematics 7)
Karlz
Încă nu există evaluări
Gann Time Price Square
Document5 pagini
Gann Time Price Square
anudora
0% (1)
Math8 - q1 - Mod12 - Finding The Equations of A Line - 08092020
Document17 pagini
Math8 - q1 - Mod12 - Finding The Equations of A Line - 08092020
Cassandra Nicole Francisco
Încă nu există evaluări
Job Crafting
Document10 pagini
Job Crafting
Sumedh Kakde
Încă nu există evaluări
(U E in - Ppa) Notes
Document28 pagini
(U E in - Ppa) Notes
Sumedh Kakde
100% (1)
Critical Summary - It's Time To Split HR Ram Charan
Document2 pagini
Critical Summary - It's Time To Split HR Ram Charan
Sumedh Kakde
Încă nu există evaluări
Iggy's Bread of The World
Document7 pagini
Iggy's Bread of The World
Sumedh Kakde
Încă nu există evaluări
G16 Sehore
Document11 pagini
G16 Sehore
Sumedh Kakde
Încă nu există evaluări
Expectation of An Indian Consumer From Company Owned / Company Branded Stores
Document6 pagini
Expectation of An Indian Consumer From Company Owned / Company Branded Stores
Sumedh Kakde
Încă nu există evaluări
Chapter 5 - Quiz
Document3 pagini
Chapter 5 - Quiz
Sumedh Kakde
Încă nu există evaluări
Prob16.3 CVP
Document1 pagină
Prob16.3 CVP
Sumedh Kakde
Încă nu există evaluări
Organizational Theory, Design, and Change: Organizing in A Changing Global Environment
Document43 pagini
Organizational Theory, Design, and Change: Organizing in A Changing Global Environment
Sumedh Kakde
Încă nu există evaluări
(Type Here) (Type Here) (Type Here)
Document6 pagini
(Type Here) (Type Here) (Type Here)
Sumedh Kakde
Încă nu există evaluări
Activity Based Costing
Document17 pagini
Activity Based Costing
Sumedh Kakde
Încă nu există evaluări
Assignment 1
Document5 pagini
Assignment 1
Sumedh Kakde
Încă nu există evaluări
CVP Caselet
Document2 pagini
CVP Caselet
Sumedh Kakde
Încă nu există evaluări
Commercial Banking - Marketing
Document1 pagină
Commercial Banking - Marketing
Sumedh Kakde
Încă nu există evaluări
Phys 31 Module 4
Document42 pagini
Phys 31 Module 4
Coyzz de Guzman
Încă nu există evaluări
Chap 1 A - Propositions
Document44 pagini
Chap 1 A - Propositions
عمار الدلال
100% (1)
Unit 8 Answers
Document8 pagini
Unit 8 Answers
pyropop3
Încă nu există evaluări
Comparative Study of RP-HPLC and UV Spectrophotometric Techniques For The Simultaneous Determination of Amoxicillin and Cloxacillin in Capsules
Document6 pagini
Comparative Study of RP-HPLC and UV Spectrophotometric Techniques For The Simultaneous Determination of Amoxicillin and Cloxacillin in Capsules
iabureid7460
Încă nu există evaluări
Trigonometri: Jadi, π radian = 180° Jadi, 1° = 0,02 radian Jadi, 1° radian = 57,296° atau 57,3°
Document8 pagini
Trigonometri: Jadi, π radian = 180° Jadi, 1° = 0,02 radian Jadi, 1° radian = 57,296° atau 57,3°
dhika
Încă nu există evaluări
Phet Alpha Decay
Document2 pagini
Phet Alpha Decay
Andika Sanjaya
100% (1)
Resolução Do Capítulo 4 - Equilíbrio de Corpos Rígidos PDF
Document231 pagini
Resolução Do Capítulo 4 - Equilíbrio de Corpos Rígidos PDF
Malik Passos
Încă nu există evaluări
How To Solve Fractions: Solving Quadratic Equations
Document7 pagini
How To Solve Fractions: Solving Quadratic Equations
api-126876773
Încă nu există evaluări
Development of The 3rd Generation Balanced Scorecard - Par
Document11 pagini
Development of The 3rd Generation Balanced Scorecard - Par
hshafeeq
Încă nu există evaluări
8 The Chinese Postman Problem
Document5 pagini
8 The Chinese Postman Problem
Wantei Kupar Warjri
Încă nu există evaluări
A Case Study in Ow Assurance of A Pipeline-Riser System Using OLGA
Document10 pagini
A Case Study in Ow Assurance of A Pipeline-Riser System Using OLGA
Ahmed Bedhief
Încă nu există evaluări
Unit 4 - Continuous Random Variables
Document35 pagini
Unit 4 - Continuous Random Variables
Shouq Abdullah
Încă nu există evaluări
Study of Oscillation
Document4 pagini
Study of Oscillation
PHƯỚC DƯƠNG THANH
Încă nu există evaluări
N6 Control Systems August 2018
Document14 pagini
N6 Control Systems August 2018
lechutnm
Încă nu există evaluări
Numerical Methods PPT AYUSH MISHRA
Document11 pagini
Numerical Methods PPT AYUSH MISHRA
subscribe.us100
Încă nu există evaluări
Interpreting Validity Indexes For Diagnostic Tests
Document10 pagini
Interpreting Validity Indexes For Diagnostic Tests
Judson Borges
Încă nu există evaluări
Quantum Chemical Descriptors in QSAR QSPR Studies
Document17 pagini
Quantum Chemical Descriptors in QSAR QSPR Studies
Carlos Alberto Bayona López
Încă nu există evaluări
Agglomeration
Document16 pagini
Agglomeration
Musanje Martin
Încă nu există evaluări
Trig Book
Document174 pagini
Trig Book
José Maurício Freire
Încă nu există evaluări
Data Science Linear Regression
Document105 pagini
Data Science Linear Regression
Jaime Prades Košćina
Încă nu există evaluări
Second Order Hold Based Discretization Method of
Document5 pagini
Second Order Hold Based Discretization Method of
vitinrj
Încă nu există evaluări
Rounding PDF
Document1 pagină
Rounding PDF
fajar081
Încă nu există evaluări
AEPSHEP2012
Document300 pagini
AEPSHEP2012
Vitaly Vorobyev
Încă nu există evaluări
Mathematics Teaching Methods
Document134 pagini
Mathematics Teaching Methods
DAVID KOIYA
Încă nu există evaluări
Hydra 325 Laboratory Experiment No.1
Document2 pagini
Hydra 325 Laboratory Experiment No.1
lalguina
Încă nu există evaluări
The Mechanism of Activated Digusion Through Silica Glass
Document9 pagini
The Mechanism of Activated Digusion Through Silica Glass
Elena
Încă nu există evaluări