Documente Academic
Documente Profesional
Documente Cultură
Bas
2012-01064-MN-0
BSCE 5-2
Check List: The items to be performed for a task are listed so that,
as each is accomplished, it can be indicated as having been completed.
Sample Check Sheet
Darwin A. Bas
2012-01064-MN-0
BSCE 5-2
Example 1
Example 2
Example 1 shows how many customer complaints were received in each of five
categories.
Example 2 takes the largest category, documents, from Example 1, breaks it
down into six categories of document-related complaints, and shows cumulative
values.
2
Darwin A. Bas
2012-01064-MN-0
BSCE 5-2
Darwin A. Bas
2012-01064-MN-0
BSCE 5-2
4. Scatter Plot
4
Darwin A. Bas
2012-01064-MN-0
BSCE 5-2
A scatter plot is effectively a line graph with no line - i.e. the point intersections
between the two data sets are plotted but no attempt is made to physically draw a
line. The Y axis is conventionally used for the characteristic whose behavior we
would like to predict. It is used to define the area of relationship between two
variables.
The scatter diagram graphs pairs of numerical data, with one variable on
each axis, to look for a relationship between them. If the variables are
correlated, the points will fall along a line or curve. The better the
correlation, the tighter the points will hug the line.
When to Use, a Scatter Diagram
Darwin A. Bas
2012-01064-MN-0
BSCE 5-2
Then they look up the limit for N on the trend test table. For N = 24, the limit
is 6.
Q is equal to the limit. Therefore, the pattern could have occurred from
random chance, and no relationship is demonstrated.
5. Histogram
A histogram is a graphical representation of the distribution of numerical
data. It is an estimate of the probability distribution of a continuous
variable (quantitative variable) and was first introduced by Karl Pearson. To
construct a histogram, the first step is to "bin" the range of valuesthat is,
divide the entire range of values into a series of intervalsand then count
how many values fall into each interval. The bins are usually specified as
consecutive, non-overlapping intervals of a variable. The bins (intervals)
must be adjacent, and are often (but are not required to be) of equal size.
If the bins are of equal size, a rectangle is erected over the bin with height
proportional to the frequency the number of cases in each bin. A
histogram may also be normalized to display "relative" frequencies. It then
shows the proportion of cases that fall into each of several categories, with
the sum of the heights equaling 1.
However, bins need not be of equal width; in that case, the erected rectangle
is defined to have its area proportional to the frequency of cases in the
bin The vertical axis is then not the frequency but frequency density the
number of cases per unit of the variable on the horizontal axis. Examples of
variable bin width are displayed on Census bureau data below.
As the adjacent bins leave no gaps, the rectangles of a histogram touch each
other to indicate that the original variable is continuous.
Darwin A. Bas
2012-01064-MN-0
BSCE 5-2
Darwin A. Bas
2012-01064-MN-0
BSCE 5-2
6. Stratification
Stratification is a technique used in combination with other data analysis
tools. When data from a variety of sources or categories have been lumped
together, the meaning of the data can be impossible to see. This technique
separates the data so that patterns can be seen.
When to Use Stratification
Darwin A. Bas
2012-01064-MN-0
BSCE 5-2
The ZZ400 manufacturing team drew a scatter diagram to test whether product
purity and iron contamination were related, but the plot did not demonstrate a
relationship. Then a team member realized that the data came from three different
reactors. The team member redrew the diagram, using a different symbol for each
reactors data:
Now patterns can be seen. The data from reactor 2 and reactor 3 are circled. Even
without doing any calculations, it is clear that for those two reactors, purity
decreases as iron increases. However, the data from reactor 1, the solid dots that
are not circled, do not show that relationship. Something is different about reactor
1.
7. Control Charts
The control chart is a graph used to study how a process changes over time.
Data are plotted in time order. A control chart always has a central line for
the average, an upper line for the upper control limit and a lower line for the
lower control limit. These lines are determined from historical data. By
comparing current data to these lines, you can draw conclusions about
whether the process variation is consistent (in control) or is unpredictable
(out of control, affected by special causes of variation).
Control charts for variable data are used in pairs. The top chart monitors the
average, or the centering of the distribution of data from the process. The
bottom chart monitors the range, or the width of the distribution. If your data
9
Darwin A. Bas
2012-01064-MN-0
BSCE 5-2
were shots in target practice, the average is where the shots are clustering,
and the range is how tightly they are clustered. Control charts for attribute
data are used singly.
When to Use a Control Chart
10