Documente Academic
Documente Profesional
Documente Cultură
IBM
Note
Before using this information and the product it supports, read the information in Notices on page 35.
Product Information
This document applies to and may also apply to subsequent releases.
Licensed Materials - Property of IBM
Copyright IBM Corporation 2015.
US Government Users Restricted Rights Use, duplication or disclosure restricted by GSA ADP Schedule Contract
with IBM Corp.
Contents
Chapter 1. What is Watson Analytics? . . . . . . . . . . . . . . . . . . . . . .. 1
Notices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. 35
Index . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. 39
Some of our questions are about a status or situation. What is the revenue by
country in Europe? What is the trend of my product costs in the United States?
What is the breakdown of revenue by country and product line? Questions that are
about a status or a situation are handled in Explore.
Some of our questions are about why something happened. Why did certain sales
deals close while others did not? Why do some customers leave? What drives the
result that I see in my report? These questions are handled in Predict.
And when you find interesting information, you want to share these insights with
other people by creating beautiful dashboards quickly and easily. This is what
Assemble is all about. You can add insights from Explore and Predict to Assemble
and also create new visualizations on the go.
IBM Watson Analytics can help you understand your data better and find
insights that are hidden in your data. Watson Analytics offers you the benefits of
advanced analytics without the complexity. A smart data discovery service
available on the cloud, it guides data exploration, automates predictive analytics,
and makes it easy to create dashboards and infographics.
You can get answers and new insights to make confident decisions in minutes all
on your own.
When you start Watson Analytics, you are in the Welcome page where you access
all of the capabilities Explore, Predict, Assemble, and Refine where you can
shape how the data will appear.
In this tutorial, you're a Human Resources manager who has been given a big
project you'll be leading a new training initiative for your entire global company.
You want to better understand where the training budget is currently invested in
all areas of the company because at this moment, you just know how it's spent in
your area of the company.
We know your time is valuable. We created this tutorial to guide you through
some basic concepts and features using a sample data set so that you can quickly
learn more about Watson Analytics and its innovative way of bringing you closer
to your data. Then you can add your own data and start discovering new insights
in your business. Have fun!
Procedure
1. Go to Watson Analytics Resources page (https://
community.watsonanalytics.com/resources/).
2. Scroll down to find and tap Sample Dataset - Human Resources Training.
3. Tap the WA_HR_Training link and tap Save File.
4. In Watson Analytics, tap Add.
5. Drag WA_HR_Training.csv to Drop file or browse.
While uploading the file, Watson Analytics analyzes the data and metadata,
creates hierarchies from the metadata, and identifies concepts to use in
analyses.
The data set appears as a tile in the Welcome page and you're ready to get to
work.
If you refine a data set, the changes that you make to it are available in Predict,
Explore, and Assemble. If you modify the data in an exploration, prediction, or
view, the changed data is available only in that asset.
There are different reasons to refine your data sets. For example, you might want
to enrich the data by adding more value, such as calculations. Or, you might want
to filter the data in a particular area of your business. In addition, you might want
to make data more usable by renaming columns, changing data types, and
modifying the default aggregations. Additionally, you might want to create
hierarchies and groups.
When you refine a data set, a new data set is created that is related to your
original data set.
You can also learn more about your data. When you view the data metrics for
your data set, you see the following information.
v The quality score for each column, which indicates a column's potential
readiness for use in a prediction.
v The percentage of data that is missing from each column.
v Distribution graphs of the data in each numeric column.
You can also sort and scroll through your data to preview it.
Procedure
1. On the Welcome page, tap Refine and then select the WA_HR_Training data
set.
3. Select the items that will be filtered out: GO Accessories corporate and GO
Accessories operations.
The data set is filtered to show only these two.
Procedure
1. On the Welcome page, tap the refined data set, WA_HR_Training
Refinement.
IBM Watson Analytics generates some starting points for you. Each starting
point is a way to start diving into the data. You can select a starting point or
ask your own question about the data.
2. If you're wondering how to ask a question, there's a coach to help you. Tap
How to ask a question.
You see categories of questions. Take some time to look through the questions
in each category. If you select one of the questions, you'll see a new set of
starting points. But for this tutorial, let's return to previous dialog box.
This is the insight bar. Without you having to do a thing, Watson Analytics
identifies patterns and associations in your data and automatically creates
other starting points for you to explore. As you change the data in the
8. Let's find out which departments spend the most on training. Tap
Organization in the interactive title just above the visualization. Then tap
Department.
The blue text in the title shows you the most important columns in the
visualization. When you tap the blue text, the most relevant columns are
shown first but you can select any column.
You see that the sales departments have spent the most on courses each year.
It's that easy! Look how quickly you are interacting with the data and
discovering insights in Watson Analytics.
10. Let's set aside, or collect, this visualization to use later in a dashboard. Tap
, which is located at the bottom right side of the window, below the
visualization. You can collect only the visualization that you are currently
viewing.
11. Let's find out how many new hires are planned for each department. Tap 20
is the lowest Planned position count for Department Operations and tap
New page.
Procedure
1. On the Welcome page, tap Predict and select WA_HR_Training Refinement.
5. You can view details about each predictor. Hover over the blue circle, Position.
You see that the predictive strength is 74.3%. What does this mean? Predictive
strength measures how well a predictor accurately predicts a target. A
predictor with a predictive strength of 100% perfectly predicts a target.
Procedure
1. On the Welcome page, tap Assemble and select WA_HR_Training
Refinement.
You aren't required to select a data set in Assemble. You could simply create a
dashboard of the visualizations that you collected earlier in the tutorial. But
let's select a data set now so that you can easily create another visualization.
2. The name of the view defaults to the name of the data set. Change the name
to Tutorial - Dashboard in the Name your view field.
Templates contain predefined layouts and grid lines for easy arrangement and
alignment of the visualizations that you want to share.
4. Tap Create.
Let's start by creating a visualization in the dashboard.
5. From the data tray at the bottom of the window, drag Department to the top
left pane. You'll see a square with arrows in it appear in the middle of the
pane. Drop Department on top of the square.
8. Drag Tutorial - Expenses visualization to the top right pane and drop it on
the square that appears in the pane.
10. Add the second Tutorial - Course Costs visualization to the bottom right
pane.
You can complete this task only if you are subscribed to the Professional edition or
the Personal edition of Watson Analytics.
Procedure
1. On the Welcome page, tap Add and tap Upload data.
2. Tap Twitter.
Results
Because you can quickly tap into Twitter data and understand the sentiment of a
series of disparate or competitive hashtags through Watson Analytics, you can now
begin to focus on key aspects of your business that may have seemed elusive in
the past. This provides a means to better align with departmental goals and even
organizational agendas by using the power of Watson Analytics.
We prepared the sample data for this tutorial. You can use other sample data sets
from the Watson Analytics community. You can also upload your own data sets.
Before uploading other data sets, take a look at the data files to see if there are
things you want to change.
Watson Analytics can provide better predictions and explorations if the quality of
your data is high. If the quality of your data is low, the accuracy of the analyses in
your explorations and predictions is less reliable. You can improve the quality of
your data.
When a data set is loaded, Watson Analytics reads the data and determines a data
quality score that describes the data set's suitability for making predictions. The
higher the score, the better the quality of data. If you provide a high-quality data
set, Watson Analytics provides a high data quality score.
You can see the score that is associated with each data set in the list of assets on
the Welcome page. For example, a score of 68 indicates a data set of medium
quality. The lower the score, the higher the number of outliers or missing values
and other issues.
To obtain a high data quality score, clean your data before you import it into
Watson Analytics.
v Remove blank rows from your data file.
v Remove summary rows and columns from your data file.
v Eliminate nested column headings and nested row headings.
Learn more
There is much more you can do in Watson Analytics. Check out the Help menu.
In this menu, you have access to the following sources of more info:
v Detailed topics in the Docs
v Introductory Tours that open in a separate browser tab
v Little Hints that appear in various places in Watson Analytics
v Expert blogs, discussion forums, and sample data in the Community
https://www.youtube.com/user/watsonanalytics
If you are subscribed to the Professional edition or the Personal edition of Watson
Analytics, you have access to more types of data:
v Twitter metadata
v Cognos BI reports
v Data from SPSS Statistics .sav files
v Databases such as IBM DB2, IBM dashDB, IBMSQL Database for Bluemix,
Microsoft SQL Server, MySQL, Oracle, PostgreSQL
This material may be available from IBM in other languages. However, you may be
required to own a copy of the product or product version in that language in order
to access it.
IBM may not offer the products, services, or features discussed in this document in
other countries. Consult your local IBM representative for information on the
products and services currently available in your area. Any reference to an IBM
product, program, or service is not intended to state or imply that only that IBM
product, program, or service may be used. Any functionally equivalent product,
program, or service that does not infringe any IBM intellectual property right may
be used instead. However, it is the user's responsibility to evaluate and verify the
operation of any non-IBM product, program, or service. This document may
describe products, services, or features that are not included in the Program or
license entitlement that you have purchased.
IBM may have patents or pending patent applications covering subject matter
described in this document. The furnishing of this document does not grant you
any license to these patents. You can send license inquiries, in writing, to:
For license inquiries regarding double-byte (DBCS) information, contact the IBM
Intellectual Property Department in your country or send inquiries, in writing, to:
The following paragraph does not apply to the United Kingdom or any other
country where such provisions are inconsistent with local law: INTERNATIONAL
BUSINESS MACHINES CORPORATION PROVIDES THIS PUBLICATION "AS IS"
WITHOUT WARRANTY OF ANY KIND, EITHER EXPRESS OR IMPLIED,
INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF
NON-INFRINGEMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR
PURPOSE. Some states do not allow disclaimer of express or implied warranties in
certain transactions, therefore, this statement may not apply to you.
IBM may use or distribute any of the information you supply in any way it
believes appropriate without incurring any obligation to you.
Licensees of this program who wish to have information about it for the purpose
of enabling: (i) the exchange of information between independently created
programs and other programs (including this one) and (ii) the mutual use of the
information which has been exchanged, should contact:
The licensed program described in this document and all licensed material
available for it are provided by IBM under terms of the IBM Customer Agreement,
IBM International Program License Agreement or any equivalent agreement
between us.
All statements regarding IBM's future direction or intent are subject to change or
withdrawal without notice, and represent goals and objectives only.
This information is for planning purposes only. The information herein is subject to
change before the products described become available.
This information contains examples of data and reports used in daily business
operations. To illustrate them as completely as possible, the examples include the
names of individuals, companies, brands, and products. All of these names are
fictitious and any similarity to the names and addresses used by an actual business
enterprise is entirely coincidental.
Depending upon the configurations deployed, this Software Offering may use
session and persistent cookies that collect each user's
v name
v user name
v password
v personally identifiable information other than name, user name, password,
profile name and position
for purposes of
v session management
v authentication
v enhanced user usability
v single sign-on configuration
v usage tracking or functional purposes other than session management,
authentication, enhanced user usability and single sign-on configuration
If the configurations deployed for this Software Offering provide you as customer
the ability to collect personally identifiable information from end users via cookies
and other technologies, you should seek your own legal advice about any laws
applicable to such data collection, including any requirements for notice and
consent.
For more information about the use of various technologies, including cookies, for
these purposes, see IBM's Privacy Policy at http://www.ibm.com/privacy and
IBM's Online Privacy Statement at http://www.ibm.com/privacy/details in the
section entitled "Cookies, Web Beacons and Other Technologies" and the "IBM
Software Products and Software-as-a-Service Privacy Statement" at
http://www.ibm.com/software/info/product-privacy.
Trademarks
IBM, the IBM logo and ibm.com are trademarks or registered trademarks of
International Business Machines Corp., registered in many jurisdictions worldwide.
Other product and service names might be trademarks of IBM or other companies.
A current list of IBM trademarks is available on the Web at Copyright and
trademark information at www.ibm.com/legal/copytrade.shtml.
Notices 37
38 Last updated: 2015-10-29: Getting started with Watson Analytics
Index
A predictors 17
preparing data 3, 5
assembling visualizations 23 Professional edition 33
C Q
cleaning data 33 questions 9
collecting 9, 17
collection 23
communicating with others 23
R
refining data 5
D
data
additional types 33 S
cleaning 33 shaping data 5
preparing 5 spiral 17
Twitter 29
uploading 3
T
targets 17
E Tweets 29
exploring data 9 Twitter 29
Twitter 29
U
F upgrading
filtering editions 33
data set 5 uploading data 3
Twitter 29
I
importing 3 V
insights 9 visualizations
adding 23
details 9, 17
P
Personal edition 33
predictive insights 17