Bine ați venit la Scribd!

Ga1 Deguzman Delto Regodon

Încărcat de

0% au considerat acest document util (0 voturi)

103 vizualizări5 pagini

This document contains a group activity submission for a data mining exercises class. It includes responses to 6 questions that involve identifying supervised vs unsupervised learning tasks, describing the roles of validation and test data partitions, commenting on sample data, and indicating the next step given a randomly selected training data set. Key details summarized include that validation is used to compare models while test assesses final model, the sample in one question was not random as every 8th observation was selected, and the next step given a random training set would be to validate and test the data sets.

Descriere originală:

sada

Drepturi de autor

Formate disponibile

DOCX, PDF, TXT sau citiți online pe Scribd

Partajați acest document

Partajați sau inserați document

Opțiuni de partajare

Vi se pare util acest document?

Este necorespunzător acest conținut?

Raportați acest document

Drepturi de autor:

Formate disponibile

Descărcați ca DOCX, PDF, TXT sau citiți online pe Scribd

Indicator pentru conținut neadecvat

0% au considerat acest document util (0 voturi)

103 vizualizări5 pagini

Ga1 Deguzman Delto Regodon

Încărcat de

Kirby Helmuth

Drepturi de autor:

Formate disponibile

Descărcați ca DOCX, PDF, TXT sau citiți online pe Scribd

Indicator pentru conținut neadecvat

Salt la pagina

Sunteți pe pagina 1din 5

Căutați în document

ITBSPEC3:

FUNDAMENTALS OF ANALTYICS MODELING

Professor: Maria Vicky S. Solomo

GROUP ACTIVITY 1:
[Data Mining Exercises]
Submitted by:

B32

December 14, 2017

1. Assuming that data mining techniques are to be used in the following cases, identify whether
the task required is supervised or unsupervised learning:
(a) Deciding whether to issue a loan to an applicant, based on demographic and financial
data (with reference to a database of similar data on prior customers).
-Supervised Learning, because the database contains whether the loan was
approved or not.

(b) In an online bookstore, making recommendations to customers concerning additional

items to buy, based on the buying patterns in prior transactions.
-Because there is no clear outcome, it is Unsupervised Learning.

(c) Identifying a network data packet as dangerous (virus, hacker attack), based on
comparison to other packets whose threat status is known.
-This is Supervised Learning because the status is known for the other packets.

(d) Identifying segments of similar customers.

-Unsupervised Learning, because the outcome is not known.

(e) Predicting whether a company will go bankrupt, based on comparing its financial data
to similar bankrupt and non-bankrupt firms.
-Supervised Learning, because the company kept track on its financial data.

(f) Estimating the required repair time for an aircraft based on a trouble ticket.
-Supervised Learning because there is a historic repair times.

(g) Automated sorting of mail by zip code scanning.

-This is Supervised Learning because there is a knowledge if the sorting was
correct.

(h) Printing of custom discount coupons at the conclusion of a grocery store checkout,
based on what you just bought and what others have bought previously.
-Unsupervised Learning, because we dont know what items will be purchased in
the succeeding days.

2. In Figure below, locate the plot of the crime rate vs. median value, and interpret it.
-The question is irrelevant to the plot.

3. Describe the difference in roles assumed by the validation partition and the test partition.
-Validation partition is used to assess the performance of each supervised learning model so
that we can compare models and pick the best one while the test partition is used for assessing the
performance of the final chosen model on new data. The test data are not used to compare models, or
to further tweak the model or improve its fit.

4. Consider the sample from a database of credit applicants in Figure 2.12. Comment on the
likelihood that it was sampled randomly, and whether it is likely to be a useful sample.
-This sample is not selected randomly as we can see from OBS#, that there is a pattern
in the observations chosen for the sample. In particular, every 8th observation from the database
was selected for the sample.

5. Consider the sample from a bank database shown in Figure 2.13; it was selected randomly
from a larger database to be the training set. Personal loan" indicates whether a solicitation for
a personal loan was accepted and is the response variable. A campaign is planned for a similar
solicitation in the future and the bank is looking for a model that will identify likely responders.
Examine the data carefully and indicate what your next step would be.
-Since the random sampling from the larger database is considered to be the training set,
it is already in the data reduction in data mining steps. Thus, the next step would be to validate
and test the datasets

S-ar putea să vă placă și

CISA Exam-Testing Concept-Classification of Information Assets (Domain-5)
De la Everand
CISA Exam-Testing Concept-Classification of Information Assets (Domain-5)
Hemang Doshi
Evaluare: 3 din 5 stele
3/5 (2)
Data Mining Review Questions / Data Mining Labs: Classification or Prediction (Textbook Reference - 2.1)
Document2 pagini
Data Mining Review Questions / Data Mining Labs: Classification or Prediction (Textbook Reference - 2.1)
Tam Tran
Încă nu există evaluări
6720 Labs Chapter 2
Document3 pagini
6720 Labs Chapter 2
sweetie05
Încă nu există evaluări
Data Mining
Document26 pagini
Data Mining
ralturk
Încă nu există evaluări
Data Mining
Document40 pagini
Data Mining
porpy_carol
100% (1)
Assignment Solution 074
Document8 pagini
Assignment Solution 074
Atharv Sharma
Încă nu există evaluări
Domain 6
Document70 pagini
Domain 6
Nagendra Krishnamurthy
Încă nu există evaluări
Abhay Seminar Papers
Document16 pagini
Abhay Seminar Papers
HARRY POTTER
Încă nu există evaluări
Data Mining Questions
Document7 pagini
Data Mining Questions
Pritam Saha
Încă nu există evaluări
Assignmen1 Radziatul Syikin - 2019893594
Document5 pagini
Assignmen1 Radziatul Syikin - 2019893594
muhd fadhli
Încă nu există evaluări
Predictive Modeling: Types, Benefits, and Algorithms
Document4 pagini
Predictive Modeling: Types, Benefits, and Algorithms
morph ling
Încă nu există evaluări
Lecture 3
Document10 pagini
Lecture 3
mohammed.riad.bi.2020
Încă nu există evaluări
DWM Lab Manual
Document92 pagini
DWM Lab Manual
Hemamalini
Încă nu există evaluări
Data Mining Notes
Document75 pagini
Data Mining Notes
Aravind Rossi
Încă nu există evaluări
Bank LoanPredic-WPS Office
Document8 pagini
Bank LoanPredic-WPS Office
CHARAN SAI MUSHAM
Încă nu există evaluări
DATAMINING For Prepare
Document20 pagini
DATAMINING For Prepare
RAJASEKHAR777
Încă nu există evaluări
Literature Survey
Document3 pagini
Literature Survey
Ezhilarasi
Încă nu există evaluări
DMDW Lecture Notes
Document24 pagini
DMDW Lecture Notes
sunandan Rana
Încă nu există evaluări
An Exhaustive Investigation On Loan Prediction in Banks Using LRD
Document15 pagini
An Exhaustive Investigation On Loan Prediction in Banks Using LRD
International Journal of Innovative Science and Research Technology
Încă nu există evaluări
9.customer Loan Prediction
Document1 pagină
9.customer Loan Prediction
Venky Naidu Balineni
Încă nu există evaluări
CUSTOMER SEGMENTATION ANALYSIS OF DATA AND MACHINE LEARNING APPROACH With Plugorism
Document6 pagini
CUSTOMER SEGMENTATION ANALYSIS OF DATA AND MACHINE LEARNING APPROACH With Plugorism
hyd java
Încă nu există evaluări
Lecture 3 Data Mining
Document36 pagini
Lecture 3 Data Mining
su kh
Încă nu există evaluări
Bank Loan Approval Prediction Using Data Science Technique (ML)
Document10 pagini
Bank Loan Approval Prediction Using Data Science Technique (ML)
IJRASETPublications
Încă nu există evaluări
Data Mining Unit 1
Document24 pagini
Data Mining Unit 1
Anurag sharma
Încă nu există evaluări
Synopsis
Document9 pagini
Synopsis
mahesh0070
Încă nu există evaluări
1.1what Is Data Mining?: Gallop
Document64 pagini
1.1what Is Data Mining?: Gallop
Nelli Harshitha
Încă nu există evaluări
CS-505 Introduction To Data Mining Exercises: Page 1 of 4
Document4 pagini
CS-505 Introduction To Data Mining Exercises: Page 1 of 4
Amy Aung
Încă nu există evaluări
MIS Assignement-1 - Swostik
Document7 pagini
MIS Assignement-1 - Swostik
Swostik Rout
Încă nu există evaluări
Data Mining and Data Warehousing
Document9 pagini
Data Mining and Data Warehousing
apurva manchanda 7010
Încă nu există evaluări
Week Two Assignment A
Document1 pagină
Week Two Assignment A
Ravalika
Încă nu există evaluări
Vragen Case Studies - 2
Document22 pagini
Vragen Case Studies - 2
ur fb
Încă nu există evaluări
Vragen Case Studies - 1
Document19 pagini
Vragen Case Studies - 1
ur fb
Încă nu există evaluări
Unit 3
Document34 pagini
Unit 3
varsha.j2177
Încă nu există evaluări
Data Mining and Predictive Analytics
Document7 pagini
Data Mining and Predictive Analytics
Armand Cristobal
Încă nu există evaluări
Practical Econometrics Data Collection Analysis and Application 1st Edition Hilmer Test Bank
Document5 pagini
Practical Econometrics Data Collection Analysis and Application 1st Edition Hilmer Test Bank
LaurenBatesnqkim
100% (13)
Vol 11 3
Document5 pagini
Vol 11 3
bhuvanesh.cse23
Încă nu există evaluări
Credit Rating by Hybrid 4 Tekniği Vermiş PDF
Document7 pagini
Credit Rating by Hybrid 4 Tekniği Vermiş PDF
Eda Çevik
Încă nu există evaluări
IRJET Prediction For Loan Approval Using12
Document4 pagini
IRJET Prediction For Loan Approval Using12
Altaf SMT
Încă nu există evaluări
John - Fields - HW1 Data Mining
Document10 pagini
John - Fields - HW1 Data Mining
Satya Narayan Shukla
Încă nu există evaluări
What Is Machine Learning
Document5 pagini
What Is Machine Learning
Hassan Saddiqui
Încă nu există evaluări
Unit-1 Introduction To Data Mining
Document33 pagini
Unit-1 Introduction To Data Mining
zeeshaan7290
Încă nu există evaluări
Synopsis Print
Document4 pagini
Synopsis Print
Ruchika khekare
Încă nu există evaluări
Improve Profiling Bank Customer Behavior Using ML
Document8 pagini
Improve Profiling Bank Customer Behavior Using ML
ranaya23
Încă nu există evaluări
DM Notes-1
Document71 pagini
DM Notes-1
Dheeraj Kumar
Încă nu există evaluări
Examples of Data Mining
Document11 pagini
Examples of Data Mining
mattew657
Încă nu există evaluări
Credit Risk Analysis Using Naive Bayes in Machine Learning
Document5 pagini
Credit Risk Analysis Using Naive Bayes in Machine Learning
IJRASETPublications
Încă nu există evaluări
Customer Loan Prediction: Term Project Report
Document11 pagini
Customer Loan Prediction: Term Project Report
Anusha Bhatnagar
100% (1)
Exercises 5
Document5 pagini
Exercises 5
Bhuvana Eswari
Încă nu există evaluări
Discussion Questions BA
Document11 pagini
Discussion Questions BA
sanjay.diddee
Încă nu există evaluări
56 Customer Churn Analysis and Prediction Using Data Mining Models in Banking Industry
Document6 pagini
56 Customer Churn Analysis and Prediction Using Data Mining Models in Banking Industry
abdul.moen.itti
Încă nu există evaluări
Link For Google Colab Note Book: Pa Ge
Document17 pagini
Link For Google Colab Note Book: Pa Ge
sid p
Încă nu există evaluări
Credit Card Fraud Detection
Document89 pagini
Credit Card Fraud Detection
Rahul Repala
Încă nu există evaluări
Data Mining Notes For BCA 5th Sem 2019 PDF
Document46 pagini
Data Mining Notes For BCA 5th Sem 2019 PDF
Aslam Nizami
Încă nu există evaluări
Data Mining Transparencies
Document50 pagini
Data Mining Transparencies
Rishika Shukla
Încă nu există evaluări
Suni
Document104 pagini
Suni
saitej
Încă nu există evaluări
Cluster Credit Risk R PDF
Document13 pagini
Cluster Credit Risk R PDF
Fabian Chahin
Încă nu există evaluări
10 1016@j Ins 2019 05 093
Document13 pagini
10 1016@j Ins 2019 05 093
唐皓鑫
Încă nu există evaluări
11 V May 2023
Document9 pagini
11 V May 2023
Brahim
Încă nu există evaluări
Fraud Detection and Analysis For Insurance Claim Using Machine Learning
Document9 pagini
Fraud Detection and Analysis For Insurance Claim Using Machine Learning
IJRASETPublications
Încă nu există evaluări
Data Mining and Data Warehousing
Document13 pagini
Data Mining and Data Warehousing
Kattineni Chaitanya
Încă nu există evaluări
Advanced Product Quality Planning: Why Plan
Document29 pagini
Advanced Product Quality Planning: Why Plan
Sudhagar
Încă nu există evaluări
Packplus 2018 - Exhibitors Details Invitation Show Area Stand No. Hall Form 1 Status Zone GST Number
Document75 pagini
Packplus 2018 - Exhibitors Details Invitation Show Area Stand No. Hall Form 1 Status Zone GST Number
Truck Trailer & Tyre Expo
Încă nu există evaluări
MC200912013 GSFM7514
Document6 pagini
MC200912013 GSFM7514
Yaga Kangga
Încă nu există evaluări
Entrep Pretest 2022-2023
Document3 pagini
Entrep Pretest 2022-2023
kate escarillo
Încă nu există evaluări
TSN Banking 2016-2017 1st Exam
Document18 pagini
TSN Banking 2016-2017 1st Exam
Sanchez Roman
Încă nu există evaluări
The Walt Disney Company
Document18 pagini
The Walt Disney Company
Personal trainer
Încă nu există evaluări
Rights of The Holder
Document6 pagini
Rights of The Holder
Van Openiano
Încă nu există evaluări
Havells Initiating Coverage 27 Apr 2011
Document23 pagini
Havells Initiating Coverage 27 Apr 2011
ananth999
Încă nu există evaluări
Pre Design Services
Document61 pagini
Pre Design Services
Clarice Clores
Încă nu există evaluări
Corporate Social Responsibility Practices in Garments Sector of Bangladesh, A Study of Multinational Garments, CSR View in Dhaka EPZ
Document11 pagini
Corporate Social Responsibility Practices in Garments Sector of Bangladesh, A Study of Multinational Garments, CSR View in Dhaka EPZ
Alexander Decker
Încă nu există evaluări
Christian Louboutin V Yves Saint Laurent
Document28 pagini
Christian Louboutin V Yves Saint Laurent
Fame Appeal
Încă nu există evaluări
Aviation Leaders Programme in Public Policy - Singapore
Document1 pagină
Aviation Leaders Programme in Public Policy - Singapore
Anonymous tSYkkHToBP
Încă nu există evaluări
En Banc: Aces Philippines Cellular Satellite Corporation, Cta Eb Case No. 1242
Document32 pagini
En Banc: Aces Philippines Cellular Satellite Corporation, Cta Eb Case No. 1242
Argie Capuli Rapisura
Încă nu există evaluări
Acct Summ Fy11
Document419 pagini
Acct Summ Fy11
Anousack Kittilath
Încă nu există evaluări
How Has Technology Impacted The Global Business Environment?
Document2 pagini
How Has Technology Impacted The Global Business Environment?
Husnain Mustafa
100% (1)
Introduction To Mutual Fund and Its Variousaspects.: Shares
Document2 pagini
Introduction To Mutual Fund and Its Variousaspects.: Shares
indu9sri
Încă nu există evaluări
Techknowlogia Journal 2000 Mac April
Document68 pagini
Techknowlogia Journal 2000 Mac April
Mazlan Zulkifly
Încă nu există evaluări
Burnmark Report Oct 2016
Document25 pagini
Burnmark Report Oct 2016
Farooq Miza
100% (1)
Study Id54247 Global Footwear Market
Document52 pagini
Study Id54247 Global Footwear Market
Tanvi Khandelwal (Ms.)
Încă nu există evaluări
Role of Sales Manager
Document2 pagini
Role of Sales Manager
Shy Nee
Încă nu există evaluări
Indias Top 500 Companies 2019
Document454 pagini
Indias Top 500 Companies 2019
arvindshastry
100% (3)
2.construction Assistant CV Template
Document2 pagini
2.construction Assistant CV Template
Prabath Danansuriya
100% (1)
Special Journal 1
Document59 pagini
Special Journal 1
Praygod Manase
Încă nu există evaluări
Retail Scenario in India Cii Report
Document21 pagini
Retail Scenario in India Cii Report
api-3823513
100% (1)
Request For Proposal "Empanelment of Indian Institute of Management, Jammu (Iim Jammu) "
Document11 pagini
Request For Proposal "Empanelment of Indian Institute of Management, Jammu (Iim Jammu) "
Pulkit Prajapati
Încă nu există evaluări
A1-Cash and Cash Equivalents - 041210
Document28 pagini
A1-Cash and Cash Equivalents - 041210
MRinaldiAulia
Încă nu există evaluări
Magic Quadrant For Content-Aware Data Loss Prevention
Document16 pagini
Magic Quadrant For Content-Aware Data Loss Prevention
pfv
Încă nu există evaluări
Distribution Strategy: Rural Marketing
Document47 pagini
Distribution Strategy: Rural Marketing
pnarona
Încă nu există evaluări
DMCI Construction
Document3 pagini
DMCI Construction
Mara Ysabelle Villena
Încă nu există evaluări
Adjusting Entries
Document11 pagini
Adjusting Entries
BenjaminJrMoronia
100% (1)