Usl 70 Marks Set 1

Uploaded by Roshan Kumar

UNSUPERVISED LEARNING

TOTAL MARKS: 70    DURATION: 4 HOURS

INSTRUCTIONS:
1. Candidates should answer all the questions in the same order provided in the question paper.
2. Any activity that compromises the integrity of the examination will not be permitted.
3. Students should complete the examination within the provided timeline.
4. Candidates are expected to check and ensure that the correct answer file (in .ipynb format) is uploaded
to the LMS.

Dataset Information:
The dataset gives TB prevalence, all forms (per 100,000 population per year), in different countries.
Group the countries by how similar their situation has been year by year, to understand the world situation
regarding tuberculosis. The cluster information is given for reference only. Please remove it
before building the models.

Note: State all the assumptions made. If a sub-question cannot be done, state the reason
for not doing it.

1. Data Understanding (5 marks)

a. Read the dataset (tab, csv, xls, txt, inbuilt dataset). What are the number of rows, the number of columns,
and the types of variables (continuous, categorical, etc.)? (1 MARK)
b. Calculate the five-point summary for the numerical variables. (1 MARK)
c. Summarize observations for the categorical variables: number of categories and % of observations in each
category. (1 MARK)
d. Generate the covariance and correlation tables for the data. (1 MARK)
e. Create visualization plots to find the relationships among the variables. (1 MARK)
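
A minimal sketch of parts (b) and (d), assuming the data has already been loaded into a pandas DataFrame with one row per country and one column per year. The tiny frame below, its values, and the helper name five_point_summary are hypothetical stand-ins for illustration, not the actual TB dataset.

```python
import pandas as pd

# Hypothetical stand-in data: made-up values, one row per country, one column per year.
df = pd.DataFrame(
    {
        "1990": [100.0, 20.0, 30.0, 400.0],
        "1991": [95.0, 22.0, 28.0, 390.0],
        "1992": [90.0, 21.0, 27.0, 385.0],
    },
    index=["Country A", "Country B", "Country C", "Country D"],
)


def five_point_summary(frame):
    """Return min, Q1, median, Q3 and max for every numeric column."""
    numeric = frame.select_dtypes(include="number")
    summary = numeric.quantile([0.0, 0.25, 0.5, 0.75, 1.0])
    summary.index = ["min", "Q1", "median", "Q3", "max"]
    return summary


summary = five_point_summary(df)   # part (b)
cov_table = df.cov()               # part (d): covariance between year columns
corr_table = df.corr()             # part (d): Pearson correlation between year columns
```

On the real data, `df.describe()` reports the same five numbers alongside the count, mean and standard deviation.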

2. Dimensionality Reduction (10 marks)

a. How will you decide when to apply PCA based on the correlation? (2 marks)
b. Apply PCA on the above dataset and determine the number of principal components needed to
explain 95% of the variance in the data. (8 marks)
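
One possible way to approach part (b) is sketched below using only NumPy: standardize the columns, take the singular values, and find the smallest number of components whose cumulative explained variance reaches 95%. The function name components_for_variance and the synthetic, highly correlated data are assumptions made for illustration; they are not part of the question paper or the TB dataset.

```python
import numpy as np


def components_for_variance(X, threshold=0.95):
    """Return (k, cumulative), where k is the smallest number of principal
    components whose cumulative explained-variance ratio reaches `threshold`."""
    X = np.asarray(X, dtype=float)
    # Standardize each column (assumes no column has zero variance).
    Z = (X - X.mean(axis=0)) / X.std(axis=0, ddof=1)
    # Squared singular values of the standardized data are proportional
    # to the variance captured by each principal component.
    s = np.linalg.svd(Z, compute_uv=False)
    explained = s**2 / np.sum(s**2)
    cumulative = np.cumsum(explained)
    # First index where the cumulative ratio is >= threshold, as a count.
    k = int(np.searchsorted(cumulative, threshold) + 1)
    return k, cumulative


# Synthetic example: five noisy copies of one underlying signal.
rng = np.random.default_rng(0)
base = rng.normal(size=(200, 1))
X = np.hstack([base + 0.05 * rng.normal(size=(200, 1)) for _ in range(5)])

k, cumulative = components_for_variance(X)
```

With scikit-learn available, `PCA(n_components=0.95)` applies the same cumulative-variance rule directly.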

3. Clustering: Use PCA dimensions to cluster the data. Apply K-means and Agglomerative clustering.
(30 Marks)
Some pointers that may help, but do not be limited by these:
a. Find the optimal K Value. (5 marks)
b. Apply clustering and check, using appropriate visualization, whether the data points have been
clustered correctly. (20 marks)
c. Evaluate the clusters formed using appropriate metrics to support the model built and compare
both the models. (5 marks)
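
A hedged sketch of how parts (a) and (c) might be approached with scikit-learn: fit K-means and Agglomerative clustering for a range of K values and compare them by silhouette score. The synthetic blobs stand in for the PCA scores, and the helper name silhouette_by_k, the Ward linkage, and the range of K values are assumptions chosen for illustration. The silhouette score is one of several reasonable metrics, not a prescribed one.

```python
from sklearn.cluster import AgglomerativeClustering, KMeans
from sklearn.datasets import make_blobs
from sklearn.metrics import silhouette_score

# Synthetic stand-in for the PCA scores: three well-separated blobs.
X, _ = make_blobs(
    n_samples=150,
    centers=[[0, 0], [10, 0], [0, 10]],
    cluster_std=0.6,
    random_state=42,
)


def silhouette_by_k(X, k_values):
    """Silhouette score of K-means and Agglomerative clustering for each k."""
    scores = {}
    for k in k_values:
        km = KMeans(n_clusters=k, n_init=10, random_state=42).fit(X)
        ag = AgglomerativeClustering(n_clusters=k, linkage="ward").fit(X)
        scores[k] = {
            "kmeans": silhouette_score(X, km.labels_),
            "agglomerative": silhouette_score(X, ag.labels_),
        }
    return scores


scores = silhouette_by_k(X, range(2, 7))
# Pick the k with the highest K-means silhouette score.
best_k = max(scores, key=lambda k: scores[k]["kmeans"])
```

An elbow plot of `KMeans.inertia_` against K and a dendrogram for the agglomerative model are common complements to this comparison.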

4. Use the cluster labels from the best method above and convert the problem into a supervised learning
classification problem. (15 marks)
a. Split dataset into train and test (70:30) (2 marks)
b. Are both train and test representative of the overall data? How would you ascertain this
statistically? (3 marks)
c. For a supervised machine learning problem, how will you decide when to apply
PCA, and how do you improve the accuracy of the model? State clearly the changes that
you will make before re-fitting the model. Fit the final model. Feel free to iterate as many times
as needed to reach the final answer. Marks are awarded based on the quality of the final model
you are able to achieve. (10 marks)
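
A minimal sketch for parts (a) and (b), assuming the cluster labels are integer-coded and the features are numeric. It makes a stratified 70:30 split, compares class shares, and runs a two-sample Kolmogorov-Smirnov test per feature. The random data, the four balanced classes, and the helper name class_shares are hypothetical; other tests (for example a chi-square test on the label counts) would be equally valid.

```python
import numpy as np
from scipy.stats import ks_2samp
from sklearn.model_selection import train_test_split

# Hypothetical stand-in: 200 observations, 3 features, 4 balanced cluster labels.
rng = np.random.default_rng(1)
X = rng.normal(size=(200, 3))
y = np.repeat([0, 1, 2, 3], 50)

# Part (a): 70:30 split, stratified so each class keeps its share.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.30, stratify=y, random_state=1
)


def class_shares(labels, n_classes):
    """Fraction of observations falling in each class."""
    return np.bincount(labels, minlength=n_classes) / len(labels)


train_shares = class_shares(y_train, 4)
test_shares = class_shares(y_test, 4)

# Part (b): KS test per feature. A large p-value means there is no evidence
# that the train and test distributions of that feature differ.
p_values = [
    ks_2samp(X_train[:, j], X_test[:, j]).pvalue for j in range(X.shape[1])
]
```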

5. Summarize as follows (10 marks)

a. Summarize the overall fit of the model. Compare all the clustering and classification models built
and list down the measures to prove that it is a good model.
b. Write down a business interpretation/explanation of the model.
c. Which variables affect the target the most? Explain the relationship. Feel free to use
charts or graphs to explain.
d. What changes from the base model had the most effect on model performance?
e. What are the key risks to your results and interpretation?
