Documente Academic
Documente Profesional
Documente Cultură
We desire to use minimal sufficient statistics whenever possible as the greatest reduction
of the data.
n
P (S = T ( x) V = v) =
P (S = s V = v)
P (V = v )
n s
ns
s (1 )
=
n r1
n r2
n r1
n r2
r (1 ) + r (1 )
1
2
But this is still a function of , so V is NOT sufficient. S is sufficient. S is also minimal.
can exploit the relationship between ratios of likelihood functions and minimal
sufficiency.
f ( x)
f ( y)
sufficient statistic.
Consider a statistic as dividing the sample space into classes called equivalence classes.
Each class contains all observations X with the same value of S. If S is minimal
sufficient, then so is any 1 1 function of S (unique inverse). So minimal sufficiency is
somehow related to the set of equivalence classes but not to the particular labeling of
the equivalence classes.
= h ( z, x )
D ( x ) = z;
f ( x )
If T ( x ) = T ( z ) , then
g (T ( x ) ) = g (T ( z ) ) (since the function g is the same)
f (x ) = f (z )
f (x )
h ( x)
h (z)
= m ( x,z)
f (z )
But this implies that x and z are in the same equivalence class. Therefore, the partition
defined by the ratio of the likelihoods includes that based on the statistic T, so this
partition is minimal sufficient. (This result is essentially Theorem 6.2.13, which is
proved differently).
Consider S1 .
f (x )
f ( y )
exp ( n ) xi xi!
exp ( n )
xi y j
xi! y j!
yj
y j!
= h ( x , y ) i, j = 1,...n
x( i )
x( i )!
y( j )
y( j )!
exp ( n )
exp ( n )
sufficient.
Example: Assume that you are performing life testing. Suppose that of n components, r
die after y1 ,.....yr time periods and that n-r are still alive after y1 ',.....ym . Assuming that
the lives of Y1 ,.....Yn are ~iid f ( yi ) = exp ( yi ) , then the joint pdf is,
j =1
k =1
(The second factor is the probability of times to death exceeding y'k , k = 1,...m )
Then
f ( y ) = r exp y* ,
where y* =
j =1
k =1
y j + y'k .
For this problem, you have a sufficient statistic that is of dimension 2, while the
dimension of the parameter vector is 1. Is this a minimal sufficient statistic? Yes, check
the ratio.
Can we say anything special if we are dealing with the exponential family of
k
distributions: f ( x ) = h ( x ) c ( ) exp wi ( ) ti ( x )
i =1
Recall that
ti ( x j ) is sufficient.
j =1
and a member of the exponential family, these statistics are also minimal sufficient.
n
n
1 n
n
2 i =1
n
n
n
n
n
n
x
,
xi2 is minimal sufficient for , .
i =1 i =1
Note Example 6.2.14 in C&B. The densities are expressed in terms of x, s 2 . Not
surprisingly, the ratio of the likelihood functions is still independent of the parameters, so
these are minimal sufficient statistics as well. Neither sufficient nor minimal sufficient
x 1 exp ( x )
( )
1
= exp
+ ln + ( 1) ln ( x ) ln ( ( ) )
w`1 ( ) =
, t ( x ) = x, c ( ) = ln
ln ( ( ) ) , h ( x ) = ( 1) ln x
{
{
f ( x j ) = exp am ( ) b jm ( x j ) + c j ( ) + d j ( x j )
k
m =1
m =1
Tm ( x ) = b jm ( x j )
n
j =1
For the general exponential family with multiple unknown parameters, the dimension of
the parameter vector q and the index m of the sum over k terms are not necessarily the
same. (Recall the life testing example.)
If
, fixed. X ~ N ( , 2 2 )
( x )2
1
f ( x) =
exp
2 2 2
2
x2
x
1
1
= exp 2 2 + 2 2 ln 2 2 2
2
2
2
n
n