Bine ați venit la Scribd!

Săriți peste schemele de tip carusel

FinalProject Rubila

Încărcat de

Rubila Dwi Adawiyah

0% au considerat acest document util (0 voturi)

17 vizualizări3 pagini

big data

Titlu original

FinalProject-Rubila

Drepturi de autor

Formate disponibile

PDF, TXT sau citiți online pe Scribd

Partajați acest document

Partajați sau inserați document

Opțiuni de partajare

Vi se pare util acest document?

Este necorespunzător acest conținut?

Raportați acest document

big data

Drepturi de autor:

Formate disponibile

Descărcați ca PDF, TXT sau citiți online pe Scribd

Indicator pentru conținut neadecvat

0% au considerat acest document util (0 voturi)

17 vizualizări3 pagini

FinalProject Rubila

Încărcat de

Rubila Dwi Adawiyah

big data

Drepturi de autor:

Formate disponibile

Descărcați ca PDF, TXT sau citiți online pe Scribd

Indicator pentru conținut neadecvat

Salt la pagina

Sunteți pe pagina 1din 3

Căutați în document

Rubila Dwi Adawiyah (14/360054/PA/15759)

Natasha Christabelle (14/360017/PA/15753)

M. Nurlazuardi Ajipawenang (14/360029/PA/15754)

MOVIE RECOMMENDATION USING MAHOUTS RECOMMENDATIO

ENGINE
Background

Predicting what user wants is the common use of big data. For example, Google show you relevant
ads, Amazon recommend relevant products, and Netflix recommend movies that you might like
Recommendation involves the prediction of what new items a user would like or dislike based
on preferences of or associations to previous items.

For example, a user, Kevin Benedict, likes the following books which are mostly classic books
(items):
A Tale of Two Cities
The Great Gatsby
For Whom the Bell Tolls

Recommendations will predict which new books (items), Kevin Benedict, will like:
Jane Eyre
The Adventures of Tom Sawyer

In this project, we will use Mahout. Mahout is a machine learning application programming
interface (API) built on Hadoop.

Goals

The goal of this project is to show the movie recommendations for each user.

Tools

Java (to run hadoop)

Hadoop (used by Mahout)
Mahout
Python (use to show the result)

Data set

The dataset we use is The GroupLens Movie DataSet which provides the rating of movies in this
format. This data set contains 943 users, 1,682 movies and 100,000 ratings.
This archive contains:

u.data: contains several tuples(user_id, movie_id, rating, timestamp)

u.user: contains several tuples(user_id, age, gender, occupation, zip_code)
u.item: contains several tuples(movie_id, title, release_date, video_release_data,
imdb_url, cat_unknown, cat_action, cat_adventure, cat_animation, cat_children,
cat_comedy, cat_crime, cat_documentary, cat_drama, cat_fantasy, cat_film_noir,
cat_horror, cat_musical, cat_mystery, cat_romance, cat_sci_fi, cat_thriller, cat_war,
cat_western)

Methods

Mahouts recommendation engines work can be simply described in following way.

X =

S U R

S is the similarity matrix between items

U is the users preferences for items
R is the predicted recommendations

Therefore, to simplify the steps, we use the dataset whose format supports this matrix
multiplication. The dataset itself from MovieLens supports this format. It contains a set of lines
with the userId, the itemId and a preference value separated by a tab. The userId and itemId
are integers and the preference value is a double.

Then, after hadoop and mahout are successfully installed, we simply run Mahout
Recommenders command:
hadoop jar <MAHOUT DIRECTORY>/mahout-core-0.7-job.jar
org.apache.mahout.cf.taste.hadoop.item.RecommenderJob -s
SIMILARITY_COOCCURRENCE --input u.data --output output

Argument -s SIMILARITY_COOCURRENCE tells recommender which item similarity formula to

use. It is said that two items (movies) are very similar if they often appear together in users
rating. So to find the movies to recommend to a user, we need to find the 10 movies which are
most similar to the movies the user has rated. For example, if a user A gives a good rating on
movie X and other users gives a good rating on movie X and movie Y, then we can recommend
the movie Y to the user A.

Mahout computes the recommendations by running several Hadoop mapreduce jobs.

After 30-50 minutes, the jobs are finished and each user will have the 10 movies that he/she
might mostly like based on the co-occurrence of each movie in users reviews.

The recommendation result is not easily read. So, we use a small python program to show for a
given user, the movies she/he has rated and the movies we recommend him. The python
program uses the file u.data for the list of rated movies, the file u.item to get the movie titles
and output.txt to get the list of recommended movies for the user.

S-ar putea să vă placă și

Written by Ruby
Document1 pagină
Written by Ruby
Rubila Dwi Adawiyah
Încă nu există evaluări
Harvesting and Analyzing Tweets Using R
Document23 pagini
Harvesting and Analyzing Tweets Using R
Rubila Dwi Adawiyah
Încă nu există evaluări
Solr Elasticsearch
Document10 pagini
Solr Elasticsearch
Rubila Dwi Adawiyah
Încă nu există evaluări
CHAOS THEORY IN CRYPTOGRAPHY
Document17 pagini
CHAOS THEORY IN CRYPTOGRAPHY
Rubila Dwi Adawiyah
Încă nu există evaluări
Week7 ITIL Slides
Document9 pagini
Week7 ITIL Slides
Rubila Dwi Adawiyah
Încă nu există evaluări
Lucene Solr
Document52 pagini
Lucene Solr
Rubila Dwi Adawiyah
Încă nu există evaluări
What Is Data Science OReilly PDF
Document12 pagini
What Is Data Science OReilly PDF
Rubila Dwi Adawiyah
Încă nu există evaluări
PTI 20122 07 Distributed System TBH
Document31 pagini
PTI 20122 07 Distributed System TBH
Rubila Dwi Adawiyah
Încă nu există evaluări
Indiana University
Document2 pagini
Indiana University
Khusna Indah Wijayanti
Încă nu există evaluări
ES - ATC and ATM (Distributed System)
Document13 pagini
ES - ATC and ATM (Distributed System)
Rubila Dwi Adawiyah
Încă nu există evaluări
Programming Concept-Sebesta
Document31 pagini
Programming Concept-Sebesta
Rubila Dwi Adawiyah
Încă nu există evaluări
Sebesta CPL 9e FigureSlides
Document74 pagini
Sebesta CPL 9e FigureSlides
Rubila Dwi Adawiyah
Încă nu există evaluări
Session B
Document5 pagini
Session B
Rubila Dwi Adawiyah
Încă nu există evaluări
The Crying Beach
Document2 pagini
The Crying Beach
Rubila Dwi Adawiyah
Încă nu există evaluări
The Crying Beach
Document2 pagini
The Crying Beach
Rubila Dwi Adawiyah
Încă nu există evaluări
The Subtle Art of Not Giving a F*ck: A Counterintuitive Approach to Living a Good Life
De la Everand
The Subtle Art of Not Giving a F*ck: A Counterintuitive Approach to Living a Good Life
Mark Manson
Evaluare: 4 din 5 stele
4/5 (5783)
The Yellow House: A Memoir (2019 National Book Award Winner)
De la Everand
The Yellow House: A Memoir (2019 National Book Award Winner)
Sarah M. Broom
Evaluare: 4 din 5 stele
4/5 (98)
Never Split the Difference: Negotiating As If Your Life Depended On It
De la Everand
Never Split the Difference: Negotiating As If Your Life Depended On It
Chris Voss
Evaluare: 4.5 din 5 stele
4.5/5 (838)
Shoe Dog: A Memoir by the Creator of Nike
De la Everand
Shoe Dog: A Memoir by the Creator of Nike
Phil Knight
Evaluare: 4.5 din 5 stele
4.5/5 (537)
The Emperor of All Maladies: A Biography of Cancer
De la Everand
The Emperor of All Maladies: A Biography of Cancer
Siddhartha Mukherjee
Evaluare: 4.5 din 5 stele
4.5/5 (271)
Fear: Trump in the White House
De la Everand
Fear: Trump in the White House
Bob Woodward
Evaluare: 3.5 din 5 stele
3.5/5 (738)
Hidden Figures: The American Dream and the Untold Story of the Black Women Mathematicians Who Helped Win the Space Race
De la Everand
Hidden Figures: The American Dream and the Untold Story of the Black Women Mathematicians Who Helped Win the Space Race
Margot Lee Shetterly
Evaluare: 4 din 5 stele
4/5 (890)
The Little Book of Hygge: Danish Secrets to Happy Living
De la Everand
The Little Book of Hygge: Danish Secrets to Happy Living
Meik Wiking
Evaluare: 3.5 din 5 stele
3.5/5 (399)
Team of Rivals: The Political Genius of Abraham Lincoln
De la Everand
Team of Rivals: The Political Genius of Abraham Lincoln
Doris Kearns Goodwin
Evaluare: 4.5 din 5 stele
4.5/5 (234)
Yes Please
De la Everand
Yes Please
Amy Poehler
Evaluare: 4 din 5 stele
4/5 (1888)
Grit: The Power of Passion and Perseverance
De la Everand
Grit: The Power of Passion and Perseverance
Angela Duckworth
Evaluare: 4 din 5 stele
4/5 (587)
Devil in the Grove: Thurgood Marshall, the Groveland Boys, and the Dawn of a New America
De la Everand
Devil in the Grove: Thurgood Marshall, the Groveland Boys, and the Dawn of a New America
Gilbert King
Evaluare: 4.5 din 5 stele
4.5/5 (265)
A Heartbreaking Work Of Staggering Genius: A Memoir Based on a True Story
De la Everand
A Heartbreaking Work Of Staggering Genius: A Memoir Based on a True Story
Dave Eggers
Evaluare: 3.5 din 5 stele
3.5/5 (231)
On Fire: The (Burning) Case for a Green New Deal
De la Everand
On Fire: The (Burning) Case for a Green New Deal
Naomi Klein
Evaluare: 4 din 5 stele
4/5 (72)
Elon Musk: Tesla, SpaceX, and the Quest for a Fantastic Future
De la Everand
Elon Musk: Tesla, SpaceX, and the Quest for a Fantastic Future
Ashlee Vance
Evaluare: 4.5 din 5 stele
4.5/5 (474)
Principles: Life and Work
De la Everand
Principles: Life and Work
Ray Dalio
Evaluare: 4 din 5 stele
4/5 (599)
Rise of ISIS: A Threat We Can't Ignore
De la Everand
Rise of ISIS: A Threat We Can't Ignore
Jay Sekulow
Evaluare: 3.5 din 5 stele
3.5/5 (137)
The Hard Thing About Hard Things: Building a Business When There Are No Easy Answers
De la Everand
The Hard Thing About Hard Things: Building a Business When There Are No Easy Answers
Ben Horowitz
Evaluare: 4.5 din 5 stele
4.5/5 (344)
The Unwinding: An Inner History of the New America
De la Everand
The Unwinding: An Inner History of the New America
George Packer
Evaluare: 4 din 5 stele
4/5 (45)
Steve Jobs
De la Everand
Steve Jobs
Walter Isaacson
Evaluare: 4.5 din 5 stele
4.5/5 (806)
The World Is Flat 3.0: A Brief History of the Twenty-first Century
De la Everand
The World Is Flat 3.0: A Brief History of the Twenty-first Century
Thomas L. Friedman
Evaluare: 3.5 din 5 stele
3.5/5 (2219)
Angela's Ashes: A Memoir
De la Everand
Angela's Ashes: A Memoir
Frank McCourt
Evaluare: 4.5 din 5 stele
4.5/5 (440)
The Gifts of Imperfection: Let Go of Who You Think You're Supposed to Be and Embrace Who You Are
De la Everand
The Gifts of Imperfection: Let Go of Who You Think You're Supposed to Be and Embrace Who You Are
Brené Brown
Evaluare: 4 din 5 stele
4/5 (1090)
John Adams
De la Everand
John Adams
David McCullough
Evaluare: 4.5 din 5 stele
4.5/5 (2409)
Bad Feminist: Essays
De la Everand
Bad Feminist: Essays
Roxane Gay
Evaluare: 4 din 5 stele
4/5 (1015)
The Glass Castle: A Memoir
De la Everand
The Glass Castle: A Memoir
Jeannette Walls
Evaluare: 4.5 din 5 stele
4.5/5 (1711)
The Outsider: A Novel
De la Everand
The Outsider: A Novel
Stephen King
Evaluare: 4 din 5 stele
4/5 (1800)
The Woman in Cabin 10
De la Everand
The Woman in Cabin 10
Ruth Ware
Evaluare: 3.5 din 5 stele
3.5/5 (2322)
A Man Called Ove: A Novel
De la Everand
A Man Called Ove: A Novel
Fredrik Backman
Evaluare: 4.5 din 5 stele
4.5/5 (4609)
The Sympathizer: A Novel (Pulitzer Prize for Fiction)
De la Everand
The Sympathizer: A Novel (Pulitzer Prize for Fiction)
Viet Thanh Nguyen
Evaluare: 4.5 din 5 stele
4.5/5 (119)
The Light Between Oceans: A Novel
De la Everand
The Light Between Oceans: A Novel
M.L. Stedman
Evaluare: 4.5 din 5 stele
4.5/5 (789)
Brooklyn: A Novel
De la Everand
Brooklyn: A Novel
Colm Tóibín
Evaluare: 3.5 din 5 stele
3.5/5 (1937)
Wolf Hall: A Novel
De la Everand
Wolf Hall: A Novel
Hilary Mantel
Evaluare: 4 din 5 stele
4/5 (3811)
Manhattan Beach: A Novel
De la Everand
Manhattan Beach: A Novel
Jennifer Egan
Evaluare: 3.5 din 5 stele
3.5/5 (791)
Little Women
De la Everand
Little Women
Louisa May Alcott
Evaluare: 4 din 5 stele
4/5 (104)
The Perks of Being a Wallflower
De la Everand
The Perks of Being a Wallflower
Stephen Chbosky
Evaluare: 4.5 din 5 stele
4.5/5 (2099)
The Art of Racing in the Rain: A Novel
De la Everand
The Art of Racing in the Rain: A Novel
Garth Stein
Evaluare: 4 din 5 stele
4/5 (4193)
A Tree Grows in Brooklyn
De la Everand
A Tree Grows in Brooklyn
Betty Smith
Evaluare: 4.5 din 5 stele
4.5/5 (1929)
Her Body and Other Parties: Stories
De la Everand
Her Body and Other Parties: Stories
Carmen Maria Machado
Evaluare: 4 din 5 stele
4/5 (821)
Sing, Unburied, Sing: A Novel
De la Everand
Sing, Unburied, Sing: A Novel
Jesmyn Ward
Evaluare: 4 din 5 stele
4/5 (1103)
The Constant Gardener: A Novel
De la Everand
The Constant Gardener: A Novel
John le Carré
Evaluare: 3.5 din 5 stele
3.5/5 (104)
Postman Quick Reference Guide
Document22 pagini
Postman Quick Reference Guide
DanBoRu
100% (4)
Required SAP Notes For SAP Document and Reporting Compliance Outbound Nota Fiscal (As of Technical Note 2020.006)
Document4 pagini
Required SAP Notes For SAP Document and Reporting Compliance Outbound Nota Fiscal (As of Technical Note 2020.006)
shashidhar thumma
Încă nu există evaluări
Julia by Example
Document44 pagini
Julia by Example
gastromono
Încă nu există evaluări
Wss-Source Book: Department OF Computer Science and Engineering
Document181 pagini
Wss-Source Book: Department OF Computer Science and Engineering
PREM KUMAR M
Încă nu există evaluări
Api Example 1: Vulnerability Type: Api Request's Response Leakage. Screenshot of Component
Document49 pagini
Api Example 1: Vulnerability Type: Api Request's Response Leakage. Screenshot of Component
Ashlee Gregory
Încă nu există evaluări
Advance Java Prgramming
Document5 pagini
Advance Java Prgramming
Rahul Abhay Bhagtani
Încă nu există evaluări
Aws Innovate Q4T3S3
Document33 pagini
Aws Innovate Q4T3S3
man.sinh.lee
Încă nu există evaluări
Python Introduction
Document29 pagini
Python Introduction
NAVYA Tadisetty
Încă nu există evaluări
Untitled
Document20 pagini
Untitled
rishikumar
Încă nu există evaluări
Syllabus of Class XI Half Yearly (1) SCIM N
Document6 pagini
Syllabus of Class XI Half Yearly (1) SCIM N
SHREYANK JOHRI 10-A
Încă nu există evaluări
2016 KATE Roms - Manual
Document177 pagini
2016 KATE Roms - Manual
Los Rog
Încă nu există evaluări
Swapnil Kamble-IT Security - Resume
Document3 pagini
Swapnil Kamble-IT Security - Resume
mahesh
Încă nu există evaluări
(D.E. Stevenson) Programming Language Fundamentals PDF
Document218 pagini
(D.E. Stevenson) Programming Language Fundamentals PDF
Mil
Încă nu există evaluări
SLD Migration and GW - Acl - Mode - SAP Blogs
Document18 pagini
SLD Migration and GW - Acl - Mode - SAP Blogs
Jagadish Babu
Încă nu există evaluări
demoblaze test report
Document19 pagini
demoblaze test report
ryu putra
Încă nu există evaluări
Mic Project
Document15 pagini
Mic Project
sudarshan sonawane
Încă nu există evaluări
Assignment 1 With Solution
Document5 pagini
Assignment 1 With Solution
Alva Ro
Încă nu există evaluări
Algo - Lec3 - Verifying Correctness of Algorithm PDF
Document126 pagini
Algo - Lec3 - Verifying Correctness of Algorithm PDF
Hamza Bhatti
Încă nu există evaluări
IBM Training: Front Cover
Document432 pagini
IBM Training: Front Cover
macribas
Încă nu există evaluări
Big Data Analytics Framework Hands On
Document4 pagini
Big Data Analytics Framework Hands On
Pushpinder Singh
Încă nu există evaluări
CSC Computer Education, M.K.B.Nagar
Document37 pagini
CSC Computer Education, M.K.B.Nagar
Ganesh Kumar
Încă nu există evaluări
Simulink Tutorial
Document15 pagini
Simulink Tutorial
sukhbir24
Încă nu există evaluări
BSC6800 Performance Management (BSC6800V100R007)
Document17 pagini
BSC6800 Performance Management (BSC6800V100R007)
yuriy_batalin
Încă nu există evaluări
Resume Rickyhuget
Document1 pagină
Resume Rickyhuget
api-720960601
Încă nu există evaluări
Reed - Mark DevOps - The Ultimate Beginners Guide To Learn DevOps Step by Step - 2020 - Publishing Facto
Document87 pagini
Reed - Mark DevOps - The Ultimate Beginners Guide To Learn DevOps Step by Step - 2020 - Publishing Facto
shankar vn
100% (1)
CSC 5309 Course Outline Web Development Technologies-Fall-2014-15
Document5 pagini
CSC 5309 Course Outline Web Development Technologies-Fall-2014-15
soulidentities
Încă nu există evaluări
Create A Calculator App That Can Perform Basic
Document7 pagini
Create A Calculator App That Can Perform Basic
KAVIN PARITHI S
Încă nu există evaluări
Git Quick Reference
Document1 pagină
Git Quick Reference
Juan Pablo Justiniano
100% (1)
Integrating BMC Remedy Action Request System With Single Sign-On (SSO) and Other Client-Side Login Intercept Technologies
Document24 pagini
Integrating BMC Remedy Action Request System With Single Sign-On (SSO) and Other Client-Side Login Intercept Technologies
Ashish Kumar
Încă nu există evaluări
Assignment - Shell Programming: Learning Outcome A
Document2 pagini
Assignment - Shell Programming: Learning Outcome A
Dinh Nhat Duy (K16HL)
Încă nu există evaluări