www.dataseer.com
Contents
About
Become a data ninja
Our training clients
Our faculty
Meet the lead trainer: Isaac Reyes
What is data science?
The skills of the data scientist
Our Courses
Data Storytelling for Business
Course Overview
Course Outline Day 1
Course Outline Day 2
Become a data ninja.
Data is useless without the skill to analyse it.
Data alone is merely a commodity. It's data scientists and analysts who
breathe life into this data and create value, advantage and impact. And
the business world agrees: McKinsey predicts that the United States alone
faces a shortage of 140,000-190,000 people with deep analytical skills.
We train the region's analytics talent so that they are prepared to face the
challenges and opportunities posed by the new data environment.
All of our courses utilise real commercial datasets that will prepare you for
the information you will encounter in your next role as a data scientist or
analyst.
You cannot give me too much data. I see big data as storytelling whether it
is through information graphics or other visual aids that explain it in a way
that allows others to understand across sectors. I always push for the full
scope of the data over averages and aggregations and I like to go to the raw
data because of the possibilities of things you can do with it.
Mike Cavaretta
Data Scientist and Manager, Ford Motor Company
Trusted by industry
Data science and analytics are revolutionizing business across all industry verticals.
Since 2015, we've trained over 100 companies, government departments and NGOs in
fundamental data science skills. From banking to telcos and retail to real estate: we've
trained people in your field.
OUR FACULTY
Learn from thought leaders in the field
DataSeer is an analytics and data science training provider that has been offering innovative public and
private training courses since 2015.
What is data science?
Data Science is one of the fastest growing disciplines
in the business sector today. New findings from MIT
research show that companies with data-driven decision
making environments had 4% higher productivity and 6%
higher profits than other businesses.
In 2008, Dr DJ Patil and Jeff Hammerbacher, heads of analytics and data at
LinkedIn and Facebook respectively, coined the term "data science" to describe
the emerging field of study that focused on teasing out the hidden value in the
data that was being collected from touchpoints all over the retail and business
sectors.
Data Science is now the umbrella term used for a discipline that spans
Programming, Statistics, Data Mining, Artificial Intelligence, Networking,
Analytics, Business Intelligence, Visualisation and a host of other subject
areas. The science is constantly changing and evolving, as it moves to keep
abreast of technology and business practices alike. Data Science has
applications not only in business decisions, but also across a wide range of
verticals including biostatistics, astronomy and molecular biology. Wherever
you find large amounts of information, you'll find an application for data
science.
You Can't Hide From Data
The combination of distributed processing power in the cloud, ultra-fast
internet and cheap storage has made one thing clear: data is here to stay.
Unprecedented amounts of data are now being collected, saved, and stored safely
in the cloud. As exabyte upon exabyte is stored, a new discipline grows to
tunnel through the mountain of datasets to find the nuggets of gold: actionable
insights that can change the way you do business.
Without big data, companies are blind and deaf, wandering out
onto the web like deer on a freeway. - Geoffrey Moore
The Big Three Skills: Coding, Statistics and Business
[Figure: diagram of the data scientist's skill areas; legible labels include "Machine Learning" and "Data Science"]
COURSE ONE
COURSE DURATION:
DATA STORYTELLING FOR BUSINESS
Dataset
This course utilises a 50,000 row, 70 variable Customer Relationship Management (CRM) dataset as a
learning tool.
Data Fields
The dataset includes over 25 customer behavior variables including information about customer spend,
customer complaints, customer retention and purchase frequency. The dataset also features over 20
customer demographic variables including age, occupation and marital status.
Data Format
The data is provided to participants in unstructured .dat format. Participants are taught how to import
the dataset into Excel and convert the .dat file into an .xlsx file.
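The course itself performs this import in Excel. For readers who prefer a script, the same step can be sketched in Python using only the standard library. The sketch assumes the .dat file is delimited text with a header row, which this brochure does not confirm; the function names, the candidate delimiters and the sample field names (customer_id, age, spend) are hypothetical. It writes CSV rather than .xlsx, since Excel can open a CSV file and save it as .xlsx.

```python
import csv
import io


def read_dat(text, candidates=",;\t|"):
    """Parse delimited text from a .dat export into a header and data rows.

    Assumption: the file is plain delimited text whose first line is a header.
    """
    # Guess the delimiter: pick the candidate that occurs most often in the header line.
    first_line = text.splitlines()[0]
    delimiter = max(candidates, key=first_line.count)
    reader = csv.reader(io.StringIO(text), delimiter=delimiter)
    header = next(reader)
    rows = [row for row in reader if row]  # skip blank lines
    return header, rows


def to_csv(header, rows):
    """Serialise the header and rows as comma-separated text."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow(header)
    writer.writerows(rows)
    return buffer.getvalue()


# Hypothetical three-column sample standing in for the 70-variable CRM file.
sample = "customer_id|age|spend\n1001|34|250.00\n1002|51|90.50\n"
header, rows = read_dat(sample)
csv_text = to_csv(header, rows)
print(csv_text)
```

For a real file, the text would be read with `open(path, encoding=...)` first; the correct encoding and delimiter depend on how the .dat file was exported.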
COURSE TWO
ADVANCED VISUALIZATION AND DASHBOARD DESIGN
Take your visualization and dashboard skills to the next level.
COURSE DURATION:
1 DAY
PREREQUISITES:
None.
LAPTOP SPECS:
Intel i3 processor, 2GB RAM.
Either Mac or Windows operating system
REQUIRED SOFTWARE:
Any data visualization software package (e.g. Excel, Tableau, PowerBI, Qlik, R, Python) and PowerPoint
Advanced Visualization and Dashboard Design is aimed at the professional
who already possesses fundamental data visualization and data storytelling
skills. A natural continuation point from Data Storytelling for Business, this
course provides participants with the skills needed to produce stunning,
understandable business dashboards and graphs. Taught using a variety of
visualization tools, the course covers the keys to designing for interactivity and
drill down effects. The course also covers less commonly used but valuable
visualization methods, including methods for visualizing networks and flows.
Dashboard design is covered in detail, with participants creating a dashboard
makeover during the class practical workshop.
Suitable For
This course is suited to any professional who wants to improve their data
visualization and dashboard skills.
COURSE THREE
INTRODUCTION TO R PROGRAMMING FOR BUSINESS APPLICATIONS
R is the world's leading data science and statistics programming language.
COURSE DURATION:
3 DAYS
PREREQUISITES:
None.
LAPTOP SPECS:
Intel i3 processor, 4GB RAM
Windows operating system
Unrestricted PC that has install permissions
REQUIRED SOFTWARE:
Base R or Microsoft R Open
RStudio
Microsoft account (for Jupyter via Azure ML Studio or Azure
In this introduction to R, you will master the basics of this beautiful open
source language, including factors, lists and data frames. After completing
the course, you will be ready to undertake your very own end-to-end data
analysis projects using the world's most sophisticated data analysis tool. R
itself is completely free and can be used to extend the capabilities of data
warehousing software such as SQL Server 2016 and Microsoft Azure ML
Studio! Working on business datasets in class, you will leverage the power of R
to inform business decision making and analyses. Join millions of R users
worldwide in a user community that is growing by 40% every year!
Suitable For
This course is suited for quants and IT professionals who want a crash course
in an end-to-end data science workflow that is completely implemented in R. It
is also suitable for professionals who seek to understand the ecosystem and
community behind R and make it a powerful and cost-effective application for
their enterprise.
INTRODUCTION TO R PROGRAMMING FOR BUSINESS APPLICATIONS
X. R connections: samples of R APIs and bindings with other languages (3:30pm -- 4:15pm)
IV. Special values in R: missing values, nulls, infinite values, and NaNs (12:30pm -- 1:00pm)
COURSE FOUR
INTRODUCTION TO DATA SCIENCE AND MACHINE LEARNING IN R AND AZURE
Learn the fundamentals of data science and analytics, from problem
formulation through to model building and interpretation of results.
COURSE DURATION:
3 DAYS
PREREQUISITES:
It is recommended that participants have completed an introductory
R programming course or MOOC and at least one introductory statistics unit at
the university level
LAPTOP SPECS:
Intel i3 processor, 4GB RAM
Windows operating system
Unrestricted PC that has install permissions
REQUIRED SOFTWARE:
Excel 2010, 2013 or 2016
R or RStudio latest version
A free trial or paid subscription to Microsoft Azure ML Studio
Introduction to Data Science and Machine Learning in R and Azure is aimed at the
professional who wants an understanding of data science fundamentals with
a strong focus on business applications. By the end of the course, participants
will be capable of building, tuning and deploying regression and classification
models for a variety of business problems. Participants will also gain an
understanding of unsupervised learning techniques and big data architecture.
Taught using a variety of open source and cloud technologies, the course
teaches techniques for handling, manipulating and analyzing high volume
(millions of rows), high dimension (thousands of variables) business data. Real
world projects from the DataSeer analytics consulting team are extensively
used to illustrate how each model is used in the real world.
Suitable For
This course is suitable for any person who wants to acquire fundamental data
science skills.
COURSE FIVE
PREDICTIVE ANALYTICS AND ADVANCED MACHINE LEARNING IN R AND AZURE
Use R and Azure ML Studio to build and tune advanced machine learning
models.
COURSE DURATION:
3 DAYS
PREREQUISITES:
It is recommended that participants have completed an introductory
R programming course or MOOC and at least one 2nd year statistics unit at the
university level.
LAPTOP SPECS:
Intel i3 processor, 4GB RAM.
Windows operating system
Unrestricted PC that has install permissions
REQUIRED SOFTWARE:
Excel 2010, 2013 or 2016
R or RStudio latest version
A free trial or paid subscription to Microsoft Azure ML Studio
Predictive analytics and machine learning techniques are revolutionizing
business and government. Predictive Analytics and Machine Learning in R &
Azure is aimed at the person who wants to have a better understanding of the
mechanics behind the models and how these models are realistically applied
in the business setting. In addition to covering advanced machine learning
techniques in depth, the course covers the management of stakeholder
expectations during predictive analytics projects and analytics project
management. Advanced machine learning methods are discussed in depth,
including those used to win global data science competitions.
Suitable For
This course is suited to any professional who already understands
analytics and machine learning basics and is ready to progress to higher
levels of sophistication. It is also suitable for any professional who is
interested in how predictive analytics projects are conceptualized, scoped
and project managed.
PREDICTIVE ANALYTICS AND ADVANCED MACHINE LEARNING IN R & AZURE
Dataset
This course utilises the following datasets as learning tools:
A 50,000 row, 70 variable Customer Relationship Management (CRM) dataset
A 750,000 row, 30 variable digital marketing dataset from the insurance sector
A 227,000 row, 21 variable airlines dataset
Data Fields
The CRM dataset includes over 25 customer behavior variables including information about customer spend,
customer complaints, customer retention and purchase frequency. The dataset also features over 20
customer demographic variables including age, occupation and marital status.
The digital marketing dataset includes information about customer demographics, product category
purchased and the digital marketing channel the customer engaged with at each respective online
touchpoint. The airlines dataset includes information on domestic US flights that departed Houston in
2011. The fields include departure time, arrival time, flight number and destination location (alongside
17 other fields).
Data Format
The data is provided to participants in unstructured .dat format. Participants are taught how to import
the dataset into Excel and convert the .dat file into an .xlsx file.
PREDICTIVE ANALYTICS AND ADVANCED MACHINE LEARNING IN R & AZURE
VII. Workshop Feedback, Awarding of Certificates and Course Wrap-up (4:45pm -- 5:00pm)
DataSeer
www.dataseer.com
info@dataseer.com