IOD - Data Scientist

Încărcat de

Raveli

0% au considerat acest document util (0 voturi)

10 vizualizări3 pagini

Data Cleansing Test

Drepturi de autor

Formate disponibile

PDF, TXT sau citiți online pe Scribd

Partajați acest document

Partajați sau inserați document

Opțiuni de partajare

Vi se pare util acest document?

Este necorespunzător acest conținut?

Raportați acest document

Data Cleansing Test

Drepturi de autor:

Formate disponibile

Descărcați ca PDF, TXT sau citiți online pe Scribd

Indicator pentru conținut neadecvat

0% au considerat acest document util (0 voturi)

10 vizualizări3 pagini

IOD - Data Scientist

Încărcat de

Raveli

Data Cleansing Test

Drepturi de autor:

Formate disponibile

Descărcați ca PDF, TXT sau citiți online pe Scribd

Indicator pentru conținut neadecvat

Salt la pagina

Sunteți pe pagina 1din 3

Căutați în document

Data Cleansing Test

Background
This test was designed for Data Scientist prospects to see your approach in handling raw,
unreliable data.

We believe that this test mimics the position’s responsibilities at ilmuOne Data. The data that
our clients collect often do not align with their business goals or the analysis that they ask our
team to perform.

By completing this test, you will help us gain a better understanding of your potential with
regards to this vacancy. Aside from your programming skills, you should consider this test as
a chance to show your problem-solving skills and your ability to think outside the box.

Please note our evaluation will not be limited to your final submission. We will also consider
intangibles including but not limited to your effort and professionalism. Our evaluation will also
be subject to our judgement of your experience based on your CV and first interview. For
example, if you are a fresh graduate, this test would highlight your ability to learn new
concepts. Even if you are unable to finish the test, please submit your best possible attempt.
Good luck!
Problem Statement

Swiftnet is a conventional telecommunications company who conducts most of their Customer

Relationship Management (CRM) through inbound call centers. Swiftnet’s new management
has made it clear that their CRM operations are below their standards. As such, they are very
eager to analyze the call center data they have collected throughout the years. Unfortunately,
the data is messy and difficult to work with.

Swiftnet has enlisted your help to perform data cleansing on their call center data.
However, they wish to first see your capabilities before granting you access to their entire
database. Thus, you are provided with two (2) sample datasets taken from:
• Their customer service log (call_logging.csv); and,
• Their user database (user_data.csv)

With this data, Swiftnet has two (2) specific requests:

1. Call Unification

Rows in the customer service log do NOT represent unique calls, due to the way Swiftnet
tracks their data. Swiftnet’s CRM system creates a new line whenever a customer service
agent picks up the phone. However, a customer might speak to multiple service agents in the
throughout their call. As such, data from the same call is often dispersed to separate rows in
the database.

In order to reliably analyze customer experience, rows from the same call need to be marked
with unique identifiers called call_ID. So far, Swiftnet’s analysts usually perform this manually.
Thus, your task would be to automate this process. Please write a script which generates a
new column of call_ID, with a subset of the customer service log as input. The script must
meet the following two (2) criteria:
• If the script is performed multiple times using the same data point from the customer
service log, it should always generate the same call_id for each data point.
• If the script is performed using two non-overlapping subsets of the customer service
log, and the results are concatenated, the column call_id should still function as a
unique identifier
2. Descriptive Analysis

Please describe the sample datasets you have received and compile your findings in a
report. Assuming that the sample datasets are representative of Swiftnet’s data, please
highlight all insightful findings which you believe would be interesting for Swiftnet’s
management. You are also encouraged to list down questions you would like to ask the client
and recommend additional data points which may strengthen your analysis.

Expected Output
1. A Script (.ipynb, .py, or .R) which generates a new column call_id: unique identifiers
for their customer service log

2. A descriptive analysis report

S-ar putea să vă placă și

Fear: Trump in the White House
De la Everand
Fear: Trump in the White House
Bob Woodward
Evaluare: 3.5 din 5 stele
3.5/5 (738)
A Man Called Ove: A Novel
De la Everand
A Man Called Ove: A Novel
Fredrik Backman
Evaluare: 4.5 din 5 stele
4.5/5 (4609)
A Heartbreaking Work Of Staggering Genius: A Memoir Based on a True Story
De la Everand
A Heartbreaking Work Of Staggering Genius: A Memoir Based on a True Story
Dave Eggers
Evaluare: 3.5 din 5 stele
3.5/5 (231)
The Sympathizer: A Novel (Pulitzer Prize for Fiction)
De la Everand
The Sympathizer: A Novel (Pulitzer Prize for Fiction)
Viet Thanh Nguyen
Evaluare: 4.5 din 5 stele
4.5/5 (120)
Grit: The Power of Passion and Perseverance
De la Everand
Grit: The Power of Passion and Perseverance
Angela Duckworth
Evaluare: 4 din 5 stele
4/5 (588)
Yes Please
De la Everand
Yes Please
Amy Poehler
Evaluare: 4 din 5 stele
4/5 (1891)
Devil in the Grove: Thurgood Marshall, the Groveland Boys, and the Dawn of a New America
De la Everand
Devil in the Grove: Thurgood Marshall, the Groveland Boys, and the Dawn of a New America
Gilbert King
Evaluare: 4.5 din 5 stele
4.5/5 (266)
The Little Book of Hygge: Danish Secrets to Happy Living
De la Everand
The Little Book of Hygge: Danish Secrets to Happy Living
Meik Wiking
Evaluare: 3.5 din 5 stele
3.5/5 (399)
Never Split the Difference: Negotiating As If Your Life Depended On It
De la Everand
Never Split the Difference: Negotiating As If Your Life Depended On It
Chris Voss
Evaluare: 4.5 din 5 stele
4.5/5 (838)
Shoe Dog: A Memoir by the Creator of Nike
De la Everand
Shoe Dog: A Memoir by the Creator of Nike
Phil Knight
Evaluare: 4.5 din 5 stele
4.5/5 (537)
The Emperor of All Maladies: A Biography of Cancer
De la Everand
The Emperor of All Maladies: A Biography of Cancer
Siddhartha Mukherjee
Evaluare: 4.5 din 5 stele
4.5/5 (271)
The Subtle Art of Not Giving a F*ck: A Counterintuitive Approach to Living a Good Life
De la Everand
The Subtle Art of Not Giving a F*ck: A Counterintuitive Approach to Living a Good Life
Mark Manson
Evaluare: 4 din 5 stele
4/5 (5794)
The World Is Flat 3.0: A Brief History of the Twenty-first Century
De la Everand
The World Is Flat 3.0: A Brief History of the Twenty-first Century
Thomas L. Friedman
Evaluare: 3.5 din 5 stele
3.5/5 (2259)
The Hard Thing About Hard Things: Building a Business When There Are No Easy Answers
De la Everand
The Hard Thing About Hard Things: Building a Business When There Are No Easy Answers
Ben Horowitz
Evaluare: 4.5 din 5 stele
4.5/5 (344)
Principles: Life and Work
De la Everand
Principles: Life and Work
Ray Dalio
Evaluare: 4 din 5 stele
4/5 (599)
Rise of ISIS: A Threat We Can't Ignore
De la Everand
Rise of ISIS: A Threat We Can't Ignore
Jay Sekulow
Evaluare: 3.5 din 5 stele
3.5/5 (137)
Team of Rivals: The Political Genius of Abraham Lincoln
De la Everand
Team of Rivals: The Political Genius of Abraham Lincoln
Doris Kearns Goodwin
Evaluare: 4.5 din 5 stele
4.5/5 (234)
The Gifts of Imperfection: Let Go of Who You Think You're Supposed to Be and Embrace Who You Are
De la Everand
The Gifts of Imperfection: Let Go of Who You Think You're Supposed to Be and Embrace Who You Are
Brené Brown
Evaluare: 4 din 5 stele
4/5 (1090)
John Adams
De la Everand
John Adams
David McCullough
Evaluare: 4.5 din 5 stele
4.5/5 (2409)
The Glass Castle: A Memoir
De la Everand
The Glass Castle: A Memoir
Jeannette Walls
Evaluare: 4.5 din 5 stele
4.5/5 (1712)
Hidden Figures: The American Dream and the Untold Story of the Black Women Mathematicians Who Helped Win the Space Race
De la Everand
Hidden Figures: The American Dream and the Untold Story of the Black Women Mathematicians Who Helped Win the Space Race
Margot Lee Shetterly
Evaluare: 4 din 5 stele
4/5 (895)
Her Body and Other Parties: Stories
De la Everand
Her Body and Other Parties: Stories
Carmen Maria Machado
Evaluare: 4 din 5 stele
4/5 (821)
Sing, Unburied, Sing: A Novel
De la Everand
Sing, Unburied, Sing: A Novel
Jesmyn Ward
Evaluare: 4 din 5 stele
4/5 (1103)
Angela's Ashes: A Memoir
De la Everand
Angela's Ashes: A Memoir
Frank McCourt
Evaluare: 4.5 din 5 stele
4.5/5 (440)
Wolf Hall: A Novel
De la Everand
Wolf Hall: A Novel
Hilary Mantel
Evaluare: 4 din 5 stele
4/5 (3811)
A Tree Grows in Brooklyn
De la Everand
A Tree Grows in Brooklyn
Betty Smith
Evaluare: 4.5 din 5 stele
4.5/5 (1929)
The Woman in Cabin 10
De la Everand
The Woman in Cabin 10
Ruth Ware
Evaluare: 3.5 din 5 stele
3.5/5 (2322)
The Light Between Oceans: A Novel
De la Everand
The Light Between Oceans: A Novel
M.L. Stedman
Evaluare: 4.5 din 5 stele
4.5/5 (789)
The Constant Gardener: A Novel
De la Everand
The Constant Gardener: A Novel
John le Carré
Evaluare: 3.5 din 5 stele
3.5/5 (104)
The Perks of Being a Wallflower
De la Everand
The Perks of Being a Wallflower
Stephen Chbosky
Evaluare: 4.5 din 5 stele
4.5/5 (2101)
The Art of Racing in the Rain: A Novel
De la Everand
The Art of Racing in the Rain: A Novel
Garth Stein
Evaluare: 4 din 5 stele
4/5 (4200)
Elon Musk: Tesla, SpaceX, and the Quest for a Fantastic Future
De la Everand
Elon Musk: Tesla, SpaceX, and the Quest for a Fantastic Future
Ashlee Vance
Evaluare: 4.5 din 5 stele
4.5/5 (474)
The Outsider: A Novel
De la Everand
The Outsider: A Novel
Stephen King
Evaluare: 4 din 5 stele
4/5 (1839)
Giuliani Letter To Sen. Graham
Document4 pagini
Giuliani Letter To Sen. Graham
Fox News
83% (12)
The Unwinding: An Inner History of the New America
De la Everand
The Unwinding: An Inner History of the New America
George Packer
Evaluare: 4 din 5 stele
4/5 (45)
The Yellow House: A Memoir (2019 National Book Award Winner)
De la Everand
The Yellow House: A Memoir (2019 National Book Award Winner)
Sarah M. Broom
Evaluare: 4 din 5 stele
4/5 (98)
On Fire: The (Burning) Case for a Green New Deal
De la Everand
On Fire: The (Burning) Case for a Green New Deal
Naomi Klein
Evaluare: 4 din 5 stele
4/5 (73)
Little Women
De la Everand
Little Women
Louisa May Alcott
Evaluare: 4 din 5 stele
4/5 (104)
Brooklyn: A Novel
De la Everand
Brooklyn: A Novel
Colm Tóibín
Evaluare: 3.5 din 5 stele
3.5/5 (1937)
Manhattan Beach: A Novel
De la Everand
Manhattan Beach: A Novel
Jennifer Egan
Evaluare: 3.5 din 5 stele
3.5/5 (792)
Bad Feminist: Essays
De la Everand
Bad Feminist: Essays
Roxane Gay
Evaluare: 4 din 5 stele
4/5 (1015)
Gowtham Kumar Chitturi - HRMS Technical - 6 Yrs
Document4 pagini
Gowtham Kumar Chitturi - HRMS Technical - 6 Yrs
Anu
Încă nu există evaluări
Steve Jobs
De la Everand
Steve Jobs
Walter Isaacson
Evaluare: 4.5 din 5 stele
4.5/5 (806)
Labor Law 1
Document24 pagini
Labor Law 1
Naomi Cartagena
100% (1)
Midterm Exam Statcon
Document4 pagini
Midterm Exam Statcon
lhemnaval
100% (4)
Spa Claims
Document1 pagină
Spa Claims
Josephine Berces
100% (1)
Yamaha F200 Maintenance Schedule
Document2 pagini
Yamaha F200 Maintenance Schedule
Grady Sanders
Încă nu există evaluări
Electric Arc Furnace STEEL MAKING
Document28 pagini
Electric Arc Furnace STEEL MAKING
AMMASI A SHARAN
100% (3)
Tajima TME, TMEF User Manual
Document5 pagini
Tajima TME, TMEF User Manual
george000023
Încă nu există evaluări
MSDS Bisoprolol Fumarate Tablets (Greenstone LLC) (EN)
Document10 pagini
MSDS Bisoprolol Fumarate Tablets (Greenstone LLC) (EN)
ANNa
Încă nu există evaluări
Database Management System and SQL Commands
Document3 pagini
Database Management System and SQL Commands
dev gupta
Încă nu există evaluări
Sewing Machins Operations Manual
Document243 pagini
Sewing Machins Operations Manual
jemal
Încă nu există evaluări
VRIO
Document3 pagini
VRIO
Jane Apple Bulanadi
Încă nu există evaluări
Ahakuelo Indictment
Document24 pagini
Ahakuelo Indictment
HNN
Încă nu există evaluări
CORDLESS PLUNGE SAW PTS 20-Li A1 PDF
Document68 pagini
CORDLESS PLUNGE SAW PTS 20-Li A1 PDF
Αλεξης Νεοφυτου
Încă nu există evaluări
Solved - in Capital Budgeting, Should The Following Be Ignored, ...
Document3 pagini
Solved - in Capital Budgeting, Should The Following Be Ignored, ...
rifa hana
Încă nu există evaluări
Invoice Acs # 18 TDH Dan Rof - Maret - 2021
Document101 pagini
Invoice Acs # 18 TDH Dan Rof - Maret - 2021
Rafi Raziq
Încă nu există evaluări
Omae2008 57495
Document6 pagini
Omae2008 57495
Vinicius Cantarino Curcino
Încă nu există evaluări
Developments in Prepress Technology (PDFDrive)
Document62 pagini
Developments in Prepress Technology (PDFDrive)
Sur Velan
Încă nu există evaluări
Divider Block Accessory LTR Howden
Document4 pagini
Divider Block Accessory LTR Howden
jason
Încă nu există evaluări
ESK-Balcony Air-A
Document2 pagini
ESK-Balcony Air-A
JUANKI P
Încă nu există evaluări
SVPWM PDF
Document5 pagini
SVPWM PDF
mauricetappa
Încă nu există evaluări
Fammthya 000001
Document87 pagini
Fammthya 000001
Mohammad Norouzzadeh
Încă nu există evaluări
For Email Daily Thermetrics TSTC Product Brochure
Document5 pagini
For Email Daily Thermetrics TSTC Product Brochure
Ilku
Încă nu există evaluări
Carelink Connect: User Guide
Document41 pagini
Carelink Connect: User Guide
Miha Soica
Încă nu există evaluări
Ambient Lighting Vol 6 Compressed
Document156 pagini
Ambient Lighting Vol 6 Compressed
advait_etc
Încă nu există evaluări
X HM11 S Manual AUpdf
Document228 pagini
X HM11 S Manual AUpdf
Antonio José Domínguez Cornejo
Încă nu există evaluări
Chapter 11 Walter Nicholson Microcenomic Theory
Document15 pagini
Chapter 11 Walter Nicholson Microcenomic Theory
Umair Qazi
Încă nu există evaluări
Si Ka
Document12 pagini
Si Ka
nasmine
Încă nu există evaluări
16 Easy Steps To Start PCB Circuit Design
Document10 pagini
16 Easy Steps To Start PCB Circuit Design
jack
Încă nu există evaluări
How Can You Achieve Safety and Profitability ?
Document32 pagini
How Can You Achieve Safety and Profitability ?
Mohamed Omar
Încă nu există evaluări