Documente Academic
Documente Profesional
Documente Cultură
Jean-luc Doumont
Second law
Maximize the signal-to-noise ratio
Third law
Use effective redundancy
Discrete Continuous
variable variables
Designing the graph
Comparison
among data
Distribution
of a variable
Correlation
among variables
Evolution
of a variable
t1
t2
1.00 © 2010 by Principiae
Subsets distinguished
within one panel
To compare data, consider a length representation Showing the entire data set, as points along a scale,
(horizontal bars, starting necessarily from zero) or, is the most accurate way to convey its distribution.
for closely grouped data, a position representation For large data sets, it may be useful to summarize
(dots along a scale that need not start from zero). the distribution with a histogram or with a box plot,
Both of these afford more accuracy than a pie chart. showing five percentiles plus the outliers. Box plots
allow an easy comparison between subsets of data,
each summarized by one box, along the same scale.
Population [millions]
Germany 82.2
France 60.5
UK 58.8
Netherlands 15.9 0 10 20
Belgium 10.2
10 25 50 75 90%
France
Germany
Netherlands
UK Subset 1
Belgium Subset 2
78 79
Scatter plots are a powerful way to reveal correlation The evolution of one or more dependent variables
or simply to explore bivariate data by mapping it out. versus an independent one is best shown by lines.
For more than two variables, they can be combined Variables expressed in different units must be drawn
in arrays (one scatter plot for each pair of variables). in different panels, with a common horizontal scale.
Italy
15 0 50 100 µs
France
Finland
Current 100
European Union [µA]
Females 10
Belgium Germany 0
−100
Denmark Sweden
5 United Kingdom Voltage 10
Portugal
Austria
Netherlands Ireland [V]
Lux.
5
1.00 © 2010 by Principiae
0 0
0 5 10
Males 0 50 100 s
Constructing the graph
1.0
measured A poor graph
0.8 calculated
The graph exhibits a very low signal-to-noise ratio,
Output power [W]
Frequency [GHz]
0.6
calculated
The graph is plainer and therefore better contrasted:
the background no longer interferes with the data,
0.4 yet it provides sufficient information about them.
Frequency [GHz]
calculated
The graph shows the data and nothing but the data:
tick marks are relevant, not arbitrarily equidistant;
nondata lines are gray, to make the data prominent.
630 MHz
16 17.2 GHz 19