Bine ați venit la Scribd!

Data Analytics Project

Încărcat de

0% au considerat acest document util (0 voturi)

18 vizualizări5 pagini

This document provides an introduction, objectives, and methodology for a movie recommendation data analytics project. The project uses association rule mining, an unsupervised learning technique, to analyze transaction data and identify frequent itemsets and relationships between movies. The Apriori algorithm is employed to generate frequent itemsets and rules with sufficient support and confidence thresholds.

Descriere originală:

movie recommendation using association rule mining

Drepturi de autor

Formate disponibile

DOCX, PDF, TXT sau citiți online pe Scribd

Partajați acest document

Partajați sau inserați document

Opțiuni de partajare

Vi se pare util acest document?

Este necorespunzător acest conținut?

Raportați acest document

Drepturi de autor:

Formate disponibile

Descărcați ca DOCX, PDF, TXT sau citiți online pe Scribd

Indicator pentru conținut neadecvat

0% au considerat acest document util (0 voturi)

18 vizualizări5 pagini

Data Analytics Project

Încărcat de

swar

Drepturi de autor:

Formate disponibile

Descărcați ca DOCX, PDF, TXT sau citiți online pe Scribd

Indicator pentru conținut neadecvat

Salt la pagina

Sunteți pe pagina 1din 5

Căutați în document

Movies

Recommendation
Data Analytics Project
Report
FAS Members :

Avirup Banerjee (18S714)

Movies
Data Analytics Project
Swaroop Singamsetty()
Vipras Ladu Morye()
Report
Recommendation
Yagneshwar Chowdary Bandlamudi()
Anshul Sharma (18S712)
CONTENTS

Introduction......................................................1

Project Objective..........................2

Methodology 3

Results............................................4

Summary...............................................5

References 6
Introduction
Project Objective
Methodology

In this project we have used Association Rule Mining Technique which is a unsupervised learning
methodology.
Association Rule Mining technique is used when one wants to figure out associations between
different objects in a set, within in a transaction database find frequent patterns, or search for patterns
within relational databases or any other information repository. The applications of Association Rule
Mining are found in Marketing, Basket Data Analysis (or Market Basket Analysis) in retailing,
clustering and classification
We will consider an example,

So, in the above transactions numbered 1 to 5 we can see diapers are bought with beer in 3 occasions.
Similarly bread is bought with milk in 3 transactions, making them both ‘frequent transactions’.

To understand the mechanics of the methodology we need to know a few things here,

Itemsets: Collection of one or more items, in the above example transaction 2 is a 4-item-set simply
because it’s a set of 4 items.

Support: Fraction of transactions that contain item-set ‘I’ , i.e. support(I) = [frequency(I)] / N
where, N = no. of total transactions

Confidence: In confidence we have to understand the concept of antecedent and consequent.

Antecedent is as the name suggests the transaction that occurs previously and consequent is the
transaction that occurs as a reason of the antecedent.
Moreover, confidence compares the co-occurrence of the antecedent and consequent itemsets in the
database to the occurrence of only antecedent item-sets.
confidence = (no. of transactions where both antecedent and consequent occurs) / (no. of transactions
with antecedent transactions)
like for bread and milk this confidence ratio would be simply put, (¾) = .75 and in probability terms
this turns up to 75% ; for every association rule implemented there always has to be a minimum
confidence level.

Lift ratio: This is defined as the comparison of the confidence ratio with the benchmark confidence
value where, benchmark confidence = (no. of transactions with consequent dataset)/(no. of transactions
in the database)

and so, Lift Ratio = (confidence) / (benchmark confidence). A lift ratio greater than 1 suggests that
there is some usefulness to the rule, in other words the level of association between the antecedent and
consequent itemsets is higher than would be expected if they were independent, the larger the ratio, the
greater the strength of the association.

The algorithm we used here is Apriori. In this algorithm, Association Rule Mining is used as a two-
step-approach ,
i. Frequent item-set generation(where support >= pre determined min-support)
ii. Rule Generation: Calculate support and confidence for all rules and discard rules that fail min-
support and min-confidence thresholds.

For frequent item-set generation full database scan is required, so this turns out to be most costly in
terms of computation. Behind the algorithm, there is a concept of lattice creation. Like for ‘n’ number
of items the size of the lattice will become 2n .
Check the below example, when one start moving upwards subsets get created till the null set. And
also infrequent item-sets get deleted one by one after the full lattice is created.

R has packages to be used to implement Apriori algorithm, the most important being ‘arules’ etc. And
we used these same to implement here in our project.

S-ar putea să vă placă și

Exam Topic Breakdown Exam Topic Number of Questions
Document93 pagini
Exam Topic Breakdown Exam Topic Number of Questions
José Maldonado
100% (3)
(1987) A New Definition of The Rainflow Cycle Counting Method
Document3 pagini
(1987) A New Definition of The Rainflow Cycle Counting Method
Marcell Enzweiler
Încă nu există evaluări
Mining Frequent Itemsets Using Apriori Algorithm
Document5 pagini
Mining Frequent Itemsets Using Apriori Algorithm
seventhsensegroup
Încă nu există evaluări
Unit 3 - DM FULL
Document46 pagini
Unit 3 - DM FULL
minto
Încă nu există evaluări
Efficient Frequent Itemset Mining Mechanism Using Support Count
Document7 pagini
Efficient Frequent Itemset Mining Mechanism Using Support Count
International Journal of Application or Innovation in Engineering & Management
Încă nu există evaluări
Arules Viz
Document24 pagini
Arules Viz
Dhio Muhammad
Încă nu există evaluări
Arules Viz
Document26 pagini
Arules Viz
Muneesh Bajpai
Încă nu există evaluări
Visualizing Association Rules: Introduction To The R-Extension Package Arulesviz
Document24 pagini
Visualizing Association Rules: Introduction To The R-Extension Package Arulesviz
DevendraReddyPoreddy
Încă nu există evaluări
Visualizing Association Rules: Introduction To The R-Extension Package Arulesviz
Document24 pagini
Visualizing Association Rules: Introduction To The R-Extension Package Arulesviz
DevendraReddyPoreddy
Încă nu există evaluări
Contents
Document59 pagini
Contents
anchal
Încă nu există evaluări
Research Journal of Pharmaceutical, Biological and Chemical Sciences
Document7 pagini
Research Journal of Pharmaceutical, Biological and Chemical Sciences
Susliana Esra M Silaban
Încă nu există evaluări
Data Analysis Using Apriori Algorithm & Neural Netwok: Ashutosh Padhi
Document27 pagini
Data Analysis Using Apriori Algorithm & Neural Netwok: Ashutosh Padhi
MILAN
Încă nu există evaluări
Online Course Assignments
Document8 pagini
Online Course Assignments
msc cs
Încă nu există evaluări
Data Mining Unit-2
Document10 pagini
Data Mining Unit-2
19Q91A1231 NALDEEGA SAKETHA CHARY
Încă nu există evaluări
University Institute of Engineering and Technology, Chandigarh
Document23 pagini
University Institute of Engineering and Technology, Chandigarh
dhruv kumar
Încă nu există evaluări
1 Explain Apriori Algorithm With Example or Finding Frequent Item Sets Using With Candidate Generation
Document21 pagini
1 Explain Apriori Algorithm With Example or Finding Frequent Item Sets Using With Candidate Generation
kambala dhanush
Încă nu există evaluări
Report of 2nd Defence
Document6 pagini
Report of 2nd Defence
Sachin Dhingra
Încă nu există evaluări
Association Rule Mining: Applications in Various Areas: Akash Rajak and Mahendra Kumar Gupta
Document5 pagini
Association Rule Mining: Applications in Various Areas: Akash Rajak and Mahendra Kumar Gupta
Nylyam Dela Cruz Santos
Încă nu există evaluări
Mining: Association Rules
Document54 pagini
Mining: Association Rules
anon_947471502
Încă nu există evaluări
Discover Frequent Items in Small Stationary
Document16 pagini
Discover Frequent Items in Small Stationary
Lens New
Încă nu există evaluări
Ijcs 2016 0303008 PDF
Document16 pagini
Ijcs 2016 0303008 PDF
editorinchiefijcs
Încă nu există evaluări
Association Analysis: Unit-V
Document12 pagini
Association Analysis: Unit-V
Pradeepkumar 05
Încă nu există evaluări
Lab8 Apriori
Document9 pagini
Lab8 Apriori
giulio141091
Încă nu există evaluări
Experiment No - 08: AIM: Implementation of Association Rule Mining in WEKA. Theory
Document11 pagini
Experiment No - 08: AIM: Implementation of Association Rule Mining in WEKA. Theory
MIHIR PATEL
Încă nu există evaluări
Association Rule Mining by Using New Approach of Propositional Logic
Document5 pagini
Association Rule Mining by Using New Approach of Propositional Logic
International Journal of computational Engineering research (IJCER)
Încă nu există evaluări
R PPT 35
Document38 pagini
R PPT 35
bernatin T
Încă nu există evaluări
UNIT 3: Association Rules and Regression: I) Apriori Algorithm
Document18 pagini
UNIT 3: Association Rules and Regression: I) Apriori Algorithm
Vrushali Vilas Borle
Încă nu există evaluări
Usage of Apriori Algorithm of Data Mining As An Application To Grievous Crimes Against Women
Document6 pagini
Usage of Apriori Algorithm of Data Mining As An Application To Grievous Crimes Against Women
seventhsensegroup
Încă nu există evaluări
Unit 4 - Data Mining - WWW - Rgpvnotes.in
Document12 pagini
Unit 4 - Data Mining - WWW - Rgpvnotes.in
Vijendra Singh Rathore
Încă nu există evaluări
"Fast Algorithms For Mining Association Rules" by Rakesh Agarwal Ramakrishnan Srikant
Document5 pagini
"Fast Algorithms For Mining Association Rules" by Rakesh Agarwal Ramakrishnan Srikant
Tushar Bhonsle
Încă nu există evaluări
DM Unit-II
Document80 pagini
DM Unit-II
Laxmi
Încă nu există evaluări
13 + Temporal Optimal-HUIS Data Streams
Document5 pagini
13 + Temporal Optimal-HUIS Data Streams
Jatin Gera
Încă nu există evaluări
Association Rule - Data Mining
Document131 pagini
Association Rule - Data Mining
ajemla213
100% (1)
Data Mining Unit 4 (1) PDF PDF
Document11 pagini
Data Mining Unit 4 (1) PDF PDF
naman gujarathi
Încă nu există evaluări
Interesting Measures For Mining Association Rules: FAST-NUCES, Lahore
Document4 pagini
Interesting Measures For Mining Association Rules: FAST-NUCES, Lahore
Jean Sorel
Încă nu există evaluări
Assignment ON Data Mining: Submitted by Name: Manjula.T
Document11 pagini
Assignment ON Data Mining: Submitted by Name: Manjula.T
Lõvey Dôvey Ãnanth
Încă nu există evaluări
Differencial Link Analysis in Health Care Using Data Mining
Document27 pagini
Differencial Link Analysis in Health Care Using Data Mining
veerabalaj
Încă nu există evaluări
Market Basket Analysis For A Supermarket
Document9 pagini
Market Basket Analysis For A Supermarket
abhilashponnam@gmail.com
Încă nu există evaluări
Mining The Most K-Frequent Itemsets With Ts-Tree: Savo Tomović and Predrag Stanišić
Document8 pagini
Mining The Most K-Frequent Itemsets With Ts-Tree: Savo Tomović and Predrag Stanišić
Khánh Phụng
Încă nu există evaluări
Unit 4 - Association Analysis
Document12 pagini
Unit 4 - Association Analysis
Anand Kumar Bhagat
Încă nu există evaluări
5615ijdkp06 PDF
Document8 pagini
5615ijdkp06 PDF
Amaranatha Reddy P
Încă nu există evaluări
Data Warhouse
Document5 pagini
Data Warhouse
Sayan Mondal
Încă nu există evaluări
DM Unit 3
Document22 pagini
DM Unit 3
ajayagupta1101
Încă nu există evaluări
UNIT-5 DWDM (Data Warehousing and Data Mining) Association Analysis
Document7 pagini
UNIT-5 DWDM (Data Warehousing and Data Mining) Association Analysis
Vee Beat
Încă nu există evaluări
Association Rule
Document27 pagini
Association Rule
Pradeep Keshwani
Încă nu există evaluări
Apriori Algorithm
Document23 pagini
Apriori Algorithm
Arun Mozhi
Încă nu există evaluări
Unit 4 - DA - Frequent Itemsets and Associations
Document31 pagini
Unit 4 - DA - Frequent Itemsets and Associations
MASTER PIECE
Încă nu există evaluări
Ariori Introduction and Concept
Document37 pagini
Ariori Introduction and Concept
Abdul Khan
Încă nu există evaluări
Association Analysis: Basic Concepts and Algorithms: Problem Definition
Document15 pagini
Association Analysis: Basic Concepts and Algorithms: Problem Definition
GODDU NAVVEN BABU
Încă nu există evaluări
Data Mining Nostos - Resp
Document39 pagini
Data Mining Nostos - Resp
IgorJales
Încă nu există evaluări
Unit 5 Mining Frequent Patterns and Cluster Analysis
Document63 pagini
Unit 5 Mining Frequent Patterns and Cluster Analysis
Ruchira
Încă nu există evaluări
Chapter 3
Document27 pagini
Chapter 3
Bikila Seketa
Încă nu există evaluări
Lesson 8 Association Rules
Document58 pagini
Lesson 8 Association Rules
John Quinia
Încă nu există evaluări
Data Analytics Unit 4
Document22 pagini
Data Analytics Unit 4
Aditi Jaiswal
Încă nu există evaluări
ch14 Min Assoc Rules
Document12 pagini
ch14 Min Assoc Rules
rohitmultani153
Încă nu există evaluări
cs6659 AI 3rd Unit
Document4 pagini
cs6659 AI 3rd Unit
Rahul Dawn
Încă nu există evaluări
Bread, Milk Bread, Diapers, Beer, Eggs Bread, Diapers, Beer, Cola Bread, Milk, Diapers, Beer Bread, Milk, Diapers, Cola
Document4 pagini
Bread, Milk Bread, Diapers, Beer, Eggs Bread, Diapers, Beer, Cola Bread, Milk, Diapers, Beer Bread, Milk, Diapers, Cola
Sddr
Încă nu există evaluări
Association Rule Mining Using Modified Bpso: Amit Kumar Chandanan, Kavita & M K Shukla
Document8 pagini
Association Rule Mining Using Modified Bpso: Amit Kumar Chandanan, Kavita & M K Shukla
TJPRC Publications
Încă nu există evaluări
Bootstrap - Take 2 - Data Mining Bias, Code and Using Geometric Mean - Au - Tra.Sy Blog - Automated Trading System
Document9 pagini
Bootstrap - Take 2 - Data Mining Bias, Code and Using Geometric Mean - Au - Tra.Sy Blog - Automated Trading System
rlindsey
Încă nu există evaluări
A Framework To Discover Association Rules Using Frequent Pattern Mining
Document5 pagini
A Framework To Discover Association Rules Using Frequent Pattern Mining
IIR india
Încă nu există evaluări
Data Analytics
De la Everand
Data Analytics
Jeffery Short
Evaluare: 1 din 5 stele
1/5 (1)
PCI Compliance: Understand and Implement Effective PCI Data Security Standard Compliance
De la Everand
PCI Compliance: Understand and Implement Effective PCI Data Security Standard Compliance
Anton Chuvakin
Evaluare: 4.5 din 5 stele
4.5/5 (2)
(7.1) Dda (Digital Differential Analyzer) Line Algorithm
Document7 pagini
(7.1) Dda (Digital Differential Analyzer) Line Algorithm
Gnaneswaran Narayanan
Încă nu există evaluări
Chap3 HW
Document2 pagini
Chap3 HW
Samantha Sinnerine
Încă nu există evaluări
Modular Assessment Grade 11: Statistics and Probability Mr. Antonio E. Soto JR
Document4 pagini
Modular Assessment Grade 11: Statistics and Probability Mr. Antonio E. Soto JR
rheena espiritu
Încă nu există evaluări
Fcteg 02 632417
Document5 pagini
Fcteg 02 632417
mda mps
Încă nu există evaluări
Calculating Lyapunov
Document5 pagini
Calculating Lyapunov
Neelima Sharma
Încă nu există evaluări
CSE Artificial Intelligence Report
Document16 pagini
CSE Artificial Intelligence Report
Anonymous 22GBLsme1
Încă nu există evaluări
BARON Solver Algorithm
Document26 pagini
BARON Solver Algorithm
DamdaePark
Încă nu există evaluări
Andy Klise 3x3x3 Speedcubing Guide v4 PDF
Document2 pagini
Andy Klise 3x3x3 Speedcubing Guide v4 PDF
ritik
Încă nu există evaluări
Lessons 22-45
Document179 pagini
Lessons 22-45
Shakeel Nawaz
Încă nu există evaluări
Chapter 7 Numerical Solution of Ordinary Differential Equations
Document5 pagini
Chapter 7 Numerical Solution of Ordinary Differential Equations
trfuawlachew
Încă nu există evaluări
Control Systems: University of Engineering & Technology Lahore
Document8 pagini
Control Systems: University of Engineering & Technology Lahore
AYESHA FAHEEM
Încă nu există evaluări
Solutions Manual To Accompany Statistics For Business Decision Making and Analysis 0321123913
Document7 pagini
Solutions Manual To Accompany Statistics For Business Decision Making and Analysis 0321123913
StephenShawiwgd
100% (45)
Control-Oriented Modeling Approach For
Document6 pagini
Control-Oriented Modeling Approach For
Yair Sarabia Noriega
Încă nu există evaluări
Analysis and Design of Algorithm Practical File
Document21 pagini
Analysis and Design of Algorithm Practical File
megha
Încă nu există evaluări
Saponara Game Theory Practice 2
Document3 pagini
Saponara Game Theory Practice 2
vuduyduc
Încă nu există evaluări
Error Propagation Lab Recent222
Document8 pagini
Error Propagation Lab Recent222
Mas Im -
Încă nu există evaluări
4.2. Cryptographic Coding (Part 2)
Document27 pagini
4.2. Cryptographic Coding (Part 2)
Reach
Încă nu există evaluări
ClassLectures Numerical Methods
Document84 pagini
ClassLectures Numerical Methods
Faisal Baig
50% (2)
Program For Shortest Job First
Document2 pagini
Program For Shortest Job First
Akansha Tyagi
Încă nu există evaluări
Stochastic HW
Document3 pagini
Stochastic HW
HaniyaAngel
Încă nu există evaluări
BLAST Glossary With Highlights
Document9 pagini
BLAST Glossary With Highlights
imran47
Încă nu există evaluări
Presentation On Speech Recognition
Document11 pagini
Presentation On Speech Recognition
aditya_4_sharma
Încă nu există evaluări
Properties of Fourier Transform
Document10 pagini
Properties of Fourier Transform
Mark Ali
Încă nu există evaluări
Symmetry Properties of Linear Algebraic Systems With Non-Canonical Scalar Multiplication
Document8 pagini
Symmetry Properties of Linear Algebraic Systems With Non-Canonical Scalar Multiplication
api-679458257
Încă nu există evaluări
A Three Party Authentication For Key Distributed Protocol Using Classical and Quantum Cryptography
Document6 pagini
A Three Party Authentication For Key Distributed Protocol Using Classical and Quantum Cryptography
leks_2007
Încă nu există evaluări
Lecture Note 8 - Generalized Method of Moments Estimation
Document33 pagini
Lecture Note 8 - Generalized Method of Moments Estimation
Faizus Saquib Chowdhury
Încă nu există evaluări
Master of Science (Integrated-Information Technology) : Gujarat Technological University
Document2 pagini
Master of Science (Integrated-Information Technology) : Gujarat Technological University
bhagy finaviya
Încă nu există evaluări