
Linear Inverse Problem

Qin Zhang
SAMSI/ CRSC Undergraduate Workshop, 2008

May 20, 2008

Outline

Forward Problem and Inverse Problem

Review of linear algebra

Ill-posedness of a linear inverse problem

Least squares solution for overdetermined systems

Spring-mass system

Without external forces the displacement y(t) is described by

m y'' + c y' + k y = 0

The solution is of the form

y(t) = A e^(-c t / 2m) sin(ω t + φ)

where the constants A and φ are determined by the initial conditions,
and ω = sqrt(k/m).
Question: What if we don't know m, c or k in advance?
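The forward direction of this problem is easy to sketch: given assumed values of m, c and k (the numbers below are hypothetical, not from the slides), evaluate the damped-oscillation solution.

```python
import math

def displacement(t, m, c, k, A=1.0, phi=0.0):
    """Damped free vibration y(t) = A * exp(-c*t/(2m)) * sin(omega*t + phi),
    with omega = sqrt(k/m) as on the slide."""
    omega = math.sqrt(k / m)
    return A * math.exp(-c * t / (2 * m)) * math.sin(omega * t + phi)

# Forward problem: parameters known, predict the output.
m, c, k = 1.0, 0.2, 4.0          # hypothetical parameter values
y0 = displacement(0.0, m, c, k, A=2.0, phi=math.pi / 2)
# at t = 0 the exponential factor is 1, so y(0) = A*sin(phi) = 2.0
```

The inverse problem, recovering (m, c, k) from measured y(t), is the hard direction the rest of the slides address.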

Spring-mass system

Without external forces the displacement y(t) is described by


my + cy + ky = 0
the solution is of the form
c

y(t) = Ae 2m t sin(t + )
where thep
constants A and are determined by the initial conditions,
and = k/m.
Question: What if we dont know m, c or k in advance?

Forward Problem vs Inverse Problem

Consider the following model

y = F(x, b)

where b is the model parameter, x is the input variable, and y is the
output variable.
Forward problem:
Given the parameter b, what is the value of y for x?
Inverse problem:
Having data (x, y), how do we calculate or estimate the parameter b?
In some sense, we need to find the inverse F^(-1).
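For the simplest linear model y = b x, both directions fit in a few lines (the data below are made up for illustration):

```python
# Forward problem: b is known, compute y for each x.
b_true = 3.0
xs = [1.0, 2.0, 4.0]
ys = [b_true * x for x in xs]      # predicted outputs

# Inverse problem: recover b from the (x, y) data.
# With exact, noise-free data any single pair suffices: b = y / x.
b_est = ys[1] / xs[1]
```

With noisy data no single pair gives the right answer, which is why the least squares machinery later in the slides is needed.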

Linear Inverse Problem

If the function F is linear in b, then solving for the model parameter b
given data (x, y) is a linear inverse problem.
Examples:

y = bx
y = b0 + b1 x + b2 x^2
y = b0 + b1 cos(x) + b2 sin(x^2) + b3 e^x

In general,

                                                  [ b0 ]
                                                  [ b1 ]
y = b0 + b1 x1 + ... + bn xn = [ 1  x1  ...  xn ] [ .. ]
                                                  [ bn ]
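Each example is linear in the coefficients b even when it is nonlinear in x; the row vector (1, cos x, sin x^2, e^x) plays the role of (1, x1, ..., xn). A sketch of evaluating the third model for assumed coefficients:

```python
import math

def model(x, b):
    """y = b0 + b1*cos(x) + b2*sin(x^2) + b3*e^x, linear in b = (b0, b1, b2, b3)."""
    features = [1.0, math.cos(x), math.sin(x * x), math.exp(x)]
    return sum(bi * fi for bi, fi in zip(b, features))

# at x = 0: cos(0) = 1, sin(0) = 0, e^0 = 1, so y = b0 + b1 + b3
y = model(0.0, [1.0, 2.0, 5.0, 3.0])   # 1 + 2 + 0 + 3 = 6.0
```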


Multidimension

In general, b and y can be vectors.
Suppose there are n parameters b1, b2, ..., bn and m observations
y1, y2, ..., ym:

y1 = x11 b1 + x12 b2 + ... + x1n bn
y2 = x21 b1 + x22 b2 + ... + x2n bn
 .
 .
ym = xm1 b1 + xm2 b2 + ... + xmn bn

Linear Model

[ y1 ]   [ x11 ... x1n ] [ b1 ]
[ y2 ]   [ x21 ... x2n ] [ b2 ]
[ .. ] = [  ..      .. ] [ .. ]
[ ym ]   [ xm1 ... xmn ] [ bn ]

In general, the linear model can be written as

y = Xb

where X is an m x n matrix, b is an n x 1 vector, and y is an m x 1 vector.

Example

b1 + 2b2 = 3

b1 + 2b2 = 3
2b1 - 3b2 = 1

b1 + 2b2 = 3
2b1 - 3b2 = 1
3b1 - 2b2 = 4

Linear System of Equations

y = Xb

where X is an m x n matrix, b is an n x 1 vector, and y is an m x 1 vector.
If m < n: underdetermined system; usually has infinitely many solutions.
If m = n: X is a square matrix; if X is nonsingular, then b = X^(-1) y.
If m > n: overdetermined system; usually has no exact solution.
Question: Can we find some solution in a proper sense?


Transpose of a matrix

The transpose of a matrix X, denoted X^T, is the matrix whose rows are
the columns of X.
For example,

    [ 1  2 ]
X = [ 0  1 ],    X^T = [ 1  0  3 ]
    [ 3  0 ]           [ 2  1  0 ]

It has the following properties:

(A^T)^T = A,    (A + B)^T = A^T + B^T,    (AB)^T = B^T A^T
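These properties are easy to verify numerically; a minimal sketch using plain lists of lists:

```python
def transpose(M):
    """Rows of the result are the columns of M."""
    return [list(col) for col in zip(*M)]

def matmul(A, B):
    """Plain matrix product of two lists of rows."""
    Bt = transpose(B)
    return [[sum(a * b for a, b in zip(row, col)) for col in Bt] for row in A]

X = [[1, 2],
     [0, 1],
     [3, 0]]
Xt = transpose(X)             # [[1, 0, 3], [2, 1, 0]], as on the slide

A = [[1, 2], [3, 4]]
B = [[5, 6], [7, 8]]
# (AB)^T equals B^T A^T
lhs = transpose(matmul(A, B))
rhs = matmul(transpose(B), transpose(A))
```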


Inner product

          [ b1 ]
(1, 2, 3) [ b2 ] = b1 + 2b2 + 3b3
          [ b3 ]

In general,

              [ b1 ]
(a1, ..., an) [ .. ] = sum_{i=1}^{n} ai bi
              [ bn ]

<a, b> = a^T b
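The inner product as a sum of products, matching <a, b> = a^T b:

```python
def inner(a, b):
    """<a, b> = sum of a_i * b_i."""
    return sum(x * y for x, y in zip(a, b))

# (1, 2, 3) . (b1, b2, b3) = b1 + 2*b2 + 3*b3
val = inner([1, 2, 3], [4, 5, 6])   # 4 + 10 + 18 = 32
```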

Inverse of a matrix

An n x n square matrix X is nonsingular (invertible) if there exists a
matrix X^(-1) such that X X^(-1) = X^(-1) X = I, where I is the identity
matrix.
Theorem: X is nonsingular if and only if det(X) ≠ 0.

       [ a  b ]
If X = [ c  d ]  then det(X) = ad - bc, and

            1     [  d  -b ]
X^(-1) = -------  [ -c   a ]
         ad - bc

In Matlab, we can use the command inv(X) to compute X^(-1).
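The 2 x 2 inverse formula above, as a small sketch (Matlab's inv(X) handles the general n x n case):

```python
def inv2x2(X):
    """Inverse of a 2x2 matrix via X^{-1} = 1/(ad - bc) * [[d, -b], [-c, a]]."""
    (a, b), (c, d) = X
    det = a * d - b * c
    if det == 0:
        raise ValueError("matrix is singular")
    return [[d / det, -b / det],
            [-c / det, a / det]]

Xinv = inv2x2([[1.0, 2.0], [3.0, 4.0]])   # det = -2
# spot-check X * Xinv = I on the (1,1) entry: 1*(-2) + 2*1.5 = 1
```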

Solve linear system of equations

Xb = y  =>  X^(-1) X b = X^(-1) y  =>  b = X^(-1) y

For example,

[ 1  2  3 ] [ b1 ]   [ 2 ]
[ 0  1  4 ] [ b2 ] = [ 3 ]
[ 3  0  1 ] [ b3 ]   [ 4 ]

has the solution

[ b1 ]          [ 2 ]   [  1 ]
[ b2 ] = X^(-1) [ 3 ] = [ -1 ]
[ b3 ]          [ 4 ]   [  1 ]

In Matlab, we have two ways to solve this problem:

inv(X)*y
X\y
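Matlab's backslash solves Xb = y by elimination rather than by forming the inverse; a minimal Gaussian-elimination sketch for a 3 x 3 system (the numbers are one consistent reading of the slide's example):

```python
def solve(X, y):
    """Solve Xb = y by Gaussian elimination with partial pivoting."""
    n = len(X)
    # Build the augmented matrix [X | y].
    M = [row[:] + [yi] for row, yi in zip(X, y)]
    for col in range(n):
        # Pivot: bring up the row with the largest entry in this column.
        piv = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[piv] = M[piv], M[col]
        for r in range(col + 1, n):
            f = M[r][col] / M[col][col]
            M[r] = [mr - f * mc for mr, mc in zip(M[r], M[col])]
    # Back substitution.
    b = [0.0] * n
    for i in reversed(range(n)):
        b[i] = (M[i][n] - sum(M[i][j] * b[j] for j in range(i + 1, n))) / M[i][i]
    return b

X = [[1.0, 2.0, 3.0],
     [0.0, 1.0, 4.0],
     [3.0, 0.0, 1.0]]
y = [2.0, 3.0, 4.0]
b = solve(X, y)   # close to [1.0, -1.0, 1.0]
```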


Vector norm

A vector norm is a positive-valued function that measures the length of
a vector.
In the Euclidean space R^2, the Euclidean norm of a vector v = (x, y) is
defined by

||v|| = sqrt(x^2 + y^2)

In general, the 2-norm of an n x 1 vector y in R^n is defined by

||y||_2 = ( sum_{i=1}^{n} y_i^2 )^(1/2)

so

||y||_2^2 = sum_{i=1}^{n} y_i^2 = y^T y = <y, y>
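The 2-norm and the identity ||y||_2^2 = <y, y> in a few lines:

```python
import math

def norm2(y):
    """||y||_2 = (sum of y_i^2)^(1/2)."""
    return math.sqrt(sum(yi * yi for yi in y))

v = [3.0, 4.0]
n = norm2(v)                       # sqrt(9 + 16) = 5.0
# ||y||_2 squared equals the inner product <y, y>
same = abs(n * n - sum(vi * vi for vi in v)) < 1e-12
```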


Matrix norm

The 1-norm of an m x n matrix X in R^(m x n) is defined by

||X||_1 = max_{1 <= j <= n} sum_{i=1}^{m} |Xij| = the largest absolute column sum

Similarly, we can define the 2-norm or the infinity-norm of a matrix X.
In Matlab, we use the command
norm(X, 1)
to compute the matrix 1-norm.
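The largest-absolute-column-sum definition, the same quantity norm(X, 1) returns in Matlab:

```python
def norm1(X):
    """Matrix 1-norm: largest absolute column sum of X (a list of rows)."""
    ncols = len(X[0])
    return max(sum(abs(row[j]) for row in X) for j in range(ncols))

X = [[1, -2],
     [0, 1],
     [-3, 0]]
# column sums: |1| + |0| + |-3| = 4 and |-2| + |1| + |0| = 3
n1 = norm1(X)   # 4
```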

Ill-conditioned system

Example:

[ 0.835  0.667 ] [ b1 ]   [ 0.168 ]
[ 0.333  0.266 ] [ b2 ] = [ 0.067 ]

[ 0.835  0.667 ] [ b1 ]   [ 0.168 ]
[ 0.333  0.266 ] [ b2 ] = [ 0.066 ]

You will notice a major difference here: the first system has the exact
solution (1, -1), while the second, whose right-hand side differs by only
0.001, has the exact solution (-666, 834). Why?

det(X) = -10^(-6)

A system is ill-conditioned if some small perturbation in the system
causes a relatively large change in the exact solution.
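Solving both systems makes the sensitivity concrete; a sketch using Cramer's rule for the 2 x 2 case:

```python
def solve2x2(X, y):
    """Cramer's rule for a 2x2 system Xb = y."""
    (a, b), (c, d) = X
    det = a * d - b * c
    return [(y[0] * d - b * y[1]) / det,
            (a * y[1] - y[0] * c) / det]

X = [[0.835, 0.667],
     [0.333, 0.266]]
b1 = solve2x2(X, [0.168, 0.067])   # close to (   1.0,  -1.0)
b2 = solve2x2(X, [0.168, 0.066])   # close to (-666.0, 834.0)
```

A 0.001 change in one data entry moves the solution by hundreds, the signature of a nearly singular matrix.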


Error estimate and condition number

Theorem:
Let b_e be the solution of Xb = y, and let b_c be the solution of
(X + ΔX) b = y + Δy. Then we have

||b_e - b_c|| / ||b_e||  <=  ||X|| ||X^(-1)|| ( ||ΔX|| / ||X|| + ||Δy|| / ||y|| )

We see the relative error depends on ||X|| ||X^(-1)||, which is defined as
the condition number of the nonsingular matrix X:

cond(X) = ||X|| ||X^(-1)||

A matrix X is
ill-conditioned if cond(X) is large,
well-conditioned if cond(X) is small.
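Combining the matrix 1-norm with the 2 x 2 inverse gives the condition number of the ill-conditioned example (Matlab: cond(X, 1)); the identity matrix is shown for contrast:

```python
def norm1(X):
    """Largest absolute column sum."""
    return max(sum(abs(row[j]) for row in X) for j in range(len(X[0])))

def inv2x2(X):
    """2x2 inverse via the ad - bc formula."""
    (a, b), (c, d) = X
    det = a * d - b * c
    return [[d / det, -b / det], [-c / det, a / det]]

X = [[0.835, 0.667],
     [0.333, 0.266]]
cond_X = norm1(X) * norm1(inv2x2(X))   # over a million: badly ill-conditioned

I = [[1.0, 0.0], [0.0, 1.0]]
cond_I = norm1(I) * norm1(inv2x2(I))   # 1.0: perfectly well-conditioned
```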


Example for overdetermined linear system

Consider an overdetermined linear system y = Xb, which has more rows
than columns (m > n). For example,

[ 1  2 ] [ b1 ]   [ 2 ]
[ 0  1 ] [ b2 ] = [ 3 ]
[ 3  0 ]          [ 4 ]

has NO exact solution!

Least Squares Solution

For an overdetermined linear system y = Xb with X in R^(m x n) and
m > n, we want to minimize the sum of squared errors (SSE):

min_b ||y - Xb||_2^2

The solution b̂ is called the least squares solution of the system.
In the previous example, we want to minimize

SSE = (2 - b1 - 2b2)^2 + (3 - b2)^2 + (4 - 3b1)^2

Vector partial derivative

Result: Let a and b be n x 1 vectors. Then

∂(a^T b)/∂b = a,    ∂(b^T a)/∂b = a

Proof: note that a^T b = a1 b1 + a2 b2 + ... + an bn = sum_{i=1}^{n} ai bi.
So

( ∂(a^T b)/∂b )_j = ∂(a^T b)/∂(bj) = aj
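The identity ∂(a^T b)/∂b = a can also be checked numerically with finite differences:

```python
def inner(a, b):
    """a^T b as a sum of products."""
    return sum(x * y for x, y in zip(a, b))

a = [2.0, -1.0, 3.0]
b = [0.5, 0.5, 0.5]
h = 1e-6

# Finite-difference partial derivative in each coordinate of b:
grad = []
for j in range(len(b)):
    bp = b[:]
    bp[j] += h
    grad.append((inner(a, bp) - inner(a, b)) / h)
# grad is close to a = [2.0, -1.0, 3.0], as the result predicts
```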

Normal equation

0 = ∂||y - Xb||_2^2 / ∂b = ∂[(y - Xb)^T (y - Xb)] / ∂b
  = -X^T (y - Xb) + [(y - Xb)^T (-X)]^T
  = -X^T (y - Xb) - X^T (y - Xb)
  = -2 X^T (y - Xb)
  = -2 (X^T y - X^T X b)

Normal Equation:

X^T X b = X^T y

If (X^T X)^(-1) exists, then

b = (X^T X)^(-1) X^T y
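Applying the normal equation to the earlier 3 x 2 example: X^T X is only 2 x 2, so the least squares solution can be computed with hand-rolled matrix products (in Matlab, X\y returns the same solution for overdetermined systems):

```python
def transpose(M):
    return [list(col) for col in zip(*M)]

def matmul(A, B):
    Bt = transpose(B)
    return [[sum(a * b for a, b in zip(row, col)) for col in Bt] for row in A]

X = [[1.0, 2.0],
     [0.0, 1.0],
     [3.0, 0.0]]
y = [[2.0], [3.0], [4.0]]

Xt = transpose(X)
XtX = matmul(Xt, X)          # [[10, 2], [2, 5]]
Xty = matmul(Xt, y)          # [[14], [7]]

# Solve the 2x2 normal equation (XtX) b = Xty by Cramer's rule.
(p, q), (r, s) = XtX
det = p * s - q * r          # 46
b_hat = [(Xty[0][0] * s - q * Xty[1][0]) / det,   # 56/46
         (p * Xty[1][0] - Xty[0][0] * r) / det]   # 42/46
```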

Simple Linear Regression

Consider the model

y_i = b x_i,    i = 1, ..., n

To write it in matrix form:

[ y1 ]   [ x1 ]
[ .. ] = [ .. ] b
[ yn ]   [ xn ]

then

X^T X = sum_{i=1}^{n} x_i^2,    X^T Y = sum_{i=1}^{n} x_i y_i

So the least squares solution b̂ is

b̂ = ( sum_{i=1}^{n} x_i y_i ) / ( sum_{i=1}^{n} x_i^2 )
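The closed-form estimate b̂ = Σ x_i y_i / Σ x_i^2 for the no-intercept model, with made-up data:

```python
def fit_through_origin(xs, ys):
    """Least squares slope for the model y_i = b * x_i (no intercept)."""
    return sum(x * y for x, y in zip(xs, ys)) / sum(x * x for x in xs)

# Hypothetical data generated from b = 2 with a small error on one point:
xs = [1.0, 2.0, 3.0]
ys = [2.0, 4.1, 6.0]
b_hat = fit_through_origin(xs, ys)   # (2 + 8.2 + 18) / (1 + 4 + 9)
```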

In summary, given the observations y = (y1, y2, ..., ym)^T, assume the
model is

y = Xb + ε

where ε is the observation error. The parameter that gives the smallest
error between the model and observation is

b̂ := arg min_b ||y - Xb||_2^2 = (X^T X)^(-1) X^T y

Remarks:
The estimate b̂ depends on whether cond(X^T X) is large or not.
How do we know if the parameter estimates are good or bad?
Uncertainty in the estimation is related to the measurement error
(to be addressed in an upcoming tutorial titled "Statistical View
of Linear Least Squares" by Dr. Michael D. Porter).
