
# EXPEDIA HOTEL RECOMMENDATION
Bing Zhang, Qingxuan Li
Department of Electrical and Computer Engineering
University of California, Davis

PROJECT STATEMENT
Use different methods to predict a user's hotel selection from their existing data features.
The Expedia dataset provides around 20 feature components for each prediction.
First, apply feature selection to reduce the input features to a smaller size:
- Backward elimination
- PCA dimension reduction
Then implement different learning methods:
- Softmax
- KNN
- K-means
- Classification Tree (with k-fold cross validation)
Finally, compare the models and find the best one for Expedia to predict its users' future hotel assignments.

FEATURE SELECTION
Backward elimination
This algorithm is part of stepwise regression: start with all candidate variables,
then delete one element at a time, keeping the deletion that improves the model the
most. Here w is the weight vector, with one weight per element. To decide w, we use
the Moore-Penrose pseudoinverse (assuming linearly independent columns):
w = X⁺y = (XᵀX)⁻¹Xᵀy
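The project's code is in Matlab; as an illustrative sketch, the pseudoinverse fit and one round of backward elimination can be written in NumPy like this (the synthetic data, where features 1 and 3 carry zero weight, is my own example):

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 5))            # 200 samples, 5 candidate features
# true weights: features 1 and 3 are useless (weight 0)
y = X @ np.array([2.0, 0.0, -1.0, 0.0, 3.0]) + 0.01 * rng.normal(size=200)

def fit_error(X, y):
    """Least-squares fit via the Moore-Penrose pseudoinverse, w = X+ y."""
    w = np.linalg.pinv(X) @ y
    return np.mean((X @ w - y) ** 2)

# One round of backward elimination: drop the feature whose removal
# increases the training error the least.
errors = [fit_error(np.delete(X, j, axis=1), y) for j in range(X.shape[1])]
worst = int(np.argmin(errors))           # least useful feature: 1 or 3
print("drop feature", worst)
```

Repeating this loop until the error degrades noticeably yields the reduced feature set.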

## PCA dimension reduction

A statistical procedure that uses an orthogonal transformation T = XW to convert a
set of observations of possibly correlated variables into a set of values of
linearly uncorrelated variables (the principal components).
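A minimal NumPy sketch of this transformation (via SVD of the centred data, rather than any particular Matlab routine) shows that the resulting components are uncorrelated:

```python
import numpy as np

rng = np.random.default_rng(1)
X = rng.normal(size=(100, 4))
X[:, 3] = X[:, 0] + X[:, 1]              # a correlated (redundant) column

Xc = X - X.mean(axis=0)                  # centre the data first
# Orthogonal transformation T = X W, with W the right singular vectors of Xc
U, S, Vt = np.linalg.svd(Xc, full_matrices=False)
W = Vt.T
T = Xc @ W                               # principal-component scores

# The components are uncorrelated: the covariance of T is diagonal
cov = T.T @ T / (len(T) - 1)
off_diag = cov - np.diag(np.diag(cov))
print(np.max(np.abs(off_diag)))          # ~ 0
```

Keeping only the columns of T with the largest singular values gives the reduced representation.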

IMPLEMENTATION METHODS
K-nearest neighbor
In the classification phase, k is a user-defined constant, and an unlabeled vector
(a query or test point) is classified by assigning the label that is most frequent
among the k training samples nearest to that query point.
To compute the distance metric, Matlab's pdist2 function calculates the Euclidean
distances. We then sort the result and, based on it, assign each query vector the
most common label among its nearest training vectors.
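As a sketch of the same procedure in NumPy (computing the pairwise Euclidean distances by broadcasting, in place of Matlab's pdist2; the toy data is my own):

```python
import numpy as np
from collections import Counter

def knn_predict(Xtrain, ytrain, Xtest, k):
    # pairwise Euclidean distances between every test and training point
    d = np.sqrt(((Xtest[:, None, :] - Xtrain[None, :, :]) ** 2).sum(-1))
    idx = np.argsort(d, axis=1)[:, :k]        # indices of the k nearest neighbours
    # majority vote among the k nearest labels
    return np.array([Counter(ytrain[row]).most_common(1)[0][0] for row in idx])

Xtrain = np.array([[0.0, 0], [0, 1], [5, 5], [6, 5]])
ytrain = np.array([0, 0, 1, 1])
pred = knn_predict(Xtrain, ytrain, np.array([[0.2, 0.4], [5.5, 5.0]]), k=3)
print(pred)   # [0 1]
```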

K-means cluster
Using the K-means cluster method in this project is not ideal, but it can still be
used. During training, we separate the labels from the dataset and use the K-means
algorithm package in Matlab to obtain the clusters. We then assign each testing
vector to the cluster whose centroid is nearest,
c(x) = argmin_k ‖x − μ_k‖²,
and label it with that cluster's label.
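The assignment step above (nearest centroid by squared Euclidean distance) can be sketched in NumPy as follows; the centroids here are a made-up example rather than output of the project's Matlab clustering:

```python
import numpy as np

def assign_to_clusters(X, centroids):
    """Assign each vector to the nearest centroid: c(x) = argmin_k ||x - mu_k||^2."""
    d = ((X[:, None, :] - centroids[None, :, :]) ** 2).sum(-1)
    return np.argmin(d, axis=1)

centroids = np.array([[0.0, 0.0], [10.0, 10.0]])   # hypothetical cluster centres
labels = assign_to_clusters(np.array([[1.0, 1.0], [9.0, 11.0]]), centroids)
print(labels)   # [0 1]
```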

Classification Tree
- Visually represents decision-making based on all of the input features.
- Divides each feature into many different sections.
- Easy to handle Expedia's input feature data (features differ by orders of magnitude).

## Classification Tree with k-fold cross validation

- Generate a tree for each of the k folds and identify the most accurate decision tree.
- Find the result for each test input across all available trees.
- Take the mode of the decisions; if no mode exists, use the most accurate decision tree's result.
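The mode-voting step described above can be sketched as follows (the tie-breaking details and the toy predictions are my own assumptions; any per-fold classifier can supply the inputs):

```python
import numpy as np
from collections import Counter

def mode_decision(fold_predictions, accuracies):
    """Combine per-fold tree predictions: take the mode across folds;
    where no unique mode exists, fall back to the most accurate fold's tree."""
    fold_predictions = np.asarray(fold_predictions)   # shape (k, n_test)
    best = int(np.argmax(accuracies))                 # most accurate tree
    out = []
    for col in fold_predictions.T:                    # one test point at a time
        counts = Counter(col)
        label, count = counts.most_common(1)[0]
        tied = [l for l, c in counts.items() if c == count]
        out.append(label if len(tied) == 1 else col[best])
    return np.array(out)

preds = [[0, 1, 2], [0, 1, 0], [1, 1, 2]]   # 3 folds, 3 test points
acc   = [0.50, 0.70, 0.60]                  # per-fold validation accuracy
print(mode_decision(preds, acc))            # [0 1 2]
```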

Accuracy by dataset size (data#) and number of folds (#fold):

| #fold | 20,000 (18,000 train / 2,000 test) | 100,000 (90,000 train / 10,000 test) | 200,000 (180,000 train / 20,000 test) |
|---|---|---|---|
|  | 10.55% | 16.77% | 21.04% |
|  | 10.85% | 18.4% | 21.82% |
|  | 10.75% | 18.54% | 22.31% |
| 18 | 10.95% | 19.15% | 22.56% |

RESULTS

Table 1: Without feature selection

| Method | Dataset | Accuracy |
|---|---|---|
| KNN, K = 100 | 100 classes; 20,000 | 4% |
| KNN with confusion matrix, K = 1 | 100 classes; 20,000 | 24.49% |
| K-means cluster | 100 classes; 200,000 | 16.59% |
| Classification Tree | 100 classes; 20,000 | 10.55% |
| Classification Tree | 100 classes; 200,000 | 21.04% |
| Classification Tree with k-fold cross validation (k = 18) | 100 classes; 200,000 | 22.38% |
Table 2: With feature selection (backward elimination and PCA)

| Method | Dataset | Accuracy (backward elimination) | Accuracy (PCA selection) |
|---|---|---|---|
| KNN, K = 100 | 100 classes; 20,000 | 9.25% | 13.5% |
| KNN with confusion matrix, K = 1 | 100 classes; 20,000 | 24.69% | 40.95% |
| Softmax regression | 10 classes; 20,000 |  | 14.1% |
| K-means cluster | 100 classes; 200,000 | 21.22% | 31.05% |
| Classification Tree | 100 classes; 200,000 | 20.845% | 21.98% |
| Classification Tree with k-fold cross validation (k = 18) | 100 classes; 200,000 | 22.25% | 23.97% |