Documente Academic
Documente Profesional
Documente Cultură
1
© 1999 Kluwer Academic Publishers. Printed in the Netherlands.
LI-YENG SUNG
Department of Mathematics, University of South Carolina, Columbia, SC 29208, U.S.A.
Key words: Kadomtsev–Petviashvili-I equation, inverse spectral method, Cauchy problem, unique-
ness of solutions.
1. Introduction
The Cauchy problem
for the Kadomtsev–Petviashvili-I (KPI) equation (cf. [8]) is solved in [6] by the
inverse spectral method, under the assumption that q0 is a small Schwartz function.
It is shown in [6] that the solution q(x, y, t) obtained by the inverse spectral method
is a C ∞ classical solution of (1.1) for (x, y) ∈ R2 , t 6= 0, and t → q(·, t) ∈
C 1 ((−∞, 0) ∪ (0, ∞), C0 (R2 )).
We will show in this paper that t → q(·, t) ∈ C(R, H ∞ (R2 )) and q(x, y, t)
is the unique generalized solution for the forward (respectively backward) prob-
lem of (1.1) in the class C([0, ∞), H 3 (R2 )) ∩ C 1 ((0, ∞), L∞ (R2 )) (respectively
C((−∞, 0], H 3 (R2 )) ∩ C 1 ((−∞, 0), L∞ (R2 ))).
The inverse spectral method for KPI is studied formally in [10] and [5]. Rigor-
ous aspects of this method have been investigated in [14] and [20]. The version of
the inverse spectral method used in this paper is essentially that of [20]. However,
the results in [14] and [20] are obtained under the additional assumption that
Z
dx q0 (x, y) = 0, (1.2)
R
2 LI-YENG SUNG
which is also assumed in many of the papers that study (1.1) by PDE techniques
(cf. [4, 12, 13, 16, 17]). As a consequence of the nonphysical constraint (1.2), the
scattering data have decay in all directions, which greatly simplifies the analysis.
When (1.2) is not assumed, the analysis is much more subtle due to the lack of
decay of the scattering data in certain directions.
The implications of the constraint (1.2) are also studied in [1, 2] and [3] using
the inverse spectral method. But the fact that the solution obtained by the inverse
spectral method (without assuming (1.2)) is the unique solution of (1.1) in a general
class of solutions has not been rigorously established until now.
Since there is an isomorphism (cf. [9]) between solutions of the KP equation
and the Johnson equation (cf. [7]) in the case of rapidly decaying initial data, the
results of this paper can also be applied to the Johnson equation.
The rest of the paper is organized as follows. The inverse spectral method for
(1.1) is described in Section 2, where we give both the solution from the left and
the solution from the right. We also recall some relevant results from [6]. Section 3
contains the proof that the inverse spectral solution from the left and from the right
are identical. The integrability of the time-dependent Jost function is studied in
Section 4. We prove in Section 5 that t → q(·, t) ∈ C(R, H ∞ (R2 )), and establish
the uniqueness of solutions in Section 6.
For the convenience of the reader we collect here some notation frequently used
in this paper.
(a) S(Rn ) is the space of Schwartz functions in n real variables and S 0 (Rn ) is the
space of tempered distributions equipped with the weak∗ -topology.
(b) H k (R2 ), k = 0, 1, T
2, . . ., are the L2 based Sobolev spaces in two real variables,
∞
and H ∞ (R2 ) = k 2
k=0 H (R ) is equipped with the natural Fréchet space
m p
topology. Wp (R) is the L based Sobolev space in one real variable.
(c) C(X, Y ) is the space of continuous maps from the topological space X into the
topological space Y and Cb (X, Y ) is the space of bounded continuous maps
from the Banach space X into the Banach space Y . Cb (Rn ) is the space of
bounded continuous functions on Rn equipped with the sup-norm and C0 (Rn )
is the subspace of Cb (Rn ) whose members vanish at infinity.
(d) Let I be an open interval and Y be a topological vector space. C 1 (I, Y ) is the
space of continuously differentiable Y -valued functions on I .
(e) Let X be a measure space and Y be a Banach space. L1 (X, Y ) is the space of
Y -valued Bochner integrable functions on X.
(f) The Fourier transforms fˆ and f˜ are defined by
Z Z
ˆ
f (ξ, η) = dx dy e −i(xξ +yη) ˜
f (x, y) and f (ξ, y) = dx e−ixξ f (x, y).
R2 R
Using L± we define the time-dependent Jost function from the left λ(x, y, t, k),
k ∈ R, by the following equation:
Z k Z ∞
− + + −
λ(x, y, t, k) = 1 + P dlL (k, l) + P dlL (k, l) ×
−∞ k
i[x(k−l)+y(l 2−k 2 )+4t (k 3 −l 3 )]
×e λ(x, y, t, l). (2.10)
We can now write down the solution of (1.1) from the left:
Z Z ∞ Z k
1 −
dlL (k, l) ei[x(k−l)+y(l −k )] ×
+ 2 2
q(x, y, t) = dk dlL (k, l) −
π R k −∞
3 −l 3 )
× e4it (k [i(k − l)λ(x, y, t, l) + λx (x, y, t, l)]. (2.11)
In summary, given a small q0 ∈ S(R2 ), the inverse spectral solution from the
left for (1.1) is obtained through the following steps: (I) Solve (2.5) and (2.6)
for µ± (x, y, k). (II) Define L± (k, l) by (2.8) and (2.9) using µ± (x, y, k). (III)
Solve (2.10) for λ(x, y, t, k). (IV) The solution q(x, y, t) is defined by (2.11) using
L± (k, l) and λ(x, y, t, k).
Note that we can also define the solution q(x, y, t) via the right scattering data:
Z
−i
dx dy e−i[x(k−l)+y(l −k )] q0 (x, y)µ− (x, y, k), k > l,
2 2
R −(k, l) = 2π R2 (2.12)
0, otherwise,
Z
−i
dx dy e−i[x(k−l)+y(l −k )] q0 (x, y)µ+ (x, y, k), k 6 l,
2 2
R +(k, l) = 2π R2 (2.13)
0, otherwise.
Let the time-dependent Jost function from the right ρ(x, y, t, k), k ∈ R, be defined
by the following equation:
Z ∞ Z k
− + + −
ρ(x, y, t, k) = 1 + P dlR (k, l) + P dlR (k, l) ×
k −∞
i[x(k−l)+y(l 2−k 2 )+4t (k 3 −l 3 )]
×e ρ(x, y, t, l). (2.14)
The solution of (1.1) from the right can then be expressed as
Z Z k Z ∞
1 −
dlR (k, l) ei[x(k−l)+y(l −k )] ×
+ 2 2
q(x, y, t) = dk dlR (k, l) −
π R −∞ k
3 −l 3 )
× e4it (k [i(k − l)ρ(x, y, t, l) + ρx (x, y, t, l)]. (2.15)
SOLUTIONS OF THE KADOMTSEV–PETVIASHVILI-I EQUATION 5
The relations among L± (k, l), R ± (k, l), λ(x, y, k) and ρ(x, y, k), and the equiv-
alence of (2.11) and (2.15) will be established in Section 3.
Finally we recall some relevant results from [6] which are obtained under the
smallness assumption
Z
dy dξ(1 + ξ 2 )|q̃0 (ξ, y)| 1. (2.16)
R2
(i) The integral equation (2.5) (resp. (2.6)) is uniquely solvable in the Banach
space Cb (Ry , L1 (Rξ ) ⊕ Cδ(ξ )) for Im k > 0 (resp. Im k 6 0).
(ii) The scattering data L− (k, l) and R + (k, l) (resp. L+ (k, l) and R − (k, l)) are
C ∞ for k 6 l (resp. k > l). Let F (k, l) be L± (k, l) or R ± (k, l). Then the
following estimates hold:
r α+β
∂ ∂ ∂
(k − l) (k − l )
m 2 2 n
+ F (k, l) . (1 + |k|)α+β
∂k ∂l ∂k∂l
for α, β, m, n > 0. (2.17)
(iii) In particular, we have
|F (k, l)| 6 Cq0 (1 + |k − l|)−2 , (2.18)
where Cq0 1 under the assumption (2.16), and the integral equations
(2.10) and (2.14) are uniquely solvable in L2 (Rk ) ⊕ C for (x, y, t) ∈ R3 .
(iv) The Jost functions µ± are related by
µ± (x, y, k) = µl (x, y, k) ∓
Z k
dl ei[x(k−l)+y(l −k )] L± (k, l)µl (x, y, l),
2 2
∓ (2.19)
∓∞
µ± (x, y, k) = µr (x, y, k) ∓
Z ±∞
dl ei[x(k−l)+y(l −k )] R ± (k, l)µr (x, y, l),
2 2
∓ (2.20)
k
where the left and right Jost functions µl and µr are the unique solutions of
∂ α1 +α2 +α3
(µ# (x, y, k) − 1) . (1 + |y|α3 ) for (x, y, k) ∈ R3 . (2.23)
∂x α1 ∂y α2 ∂k α3
(vi) Let q(x, y, t) be defined by either (2.11) or (2.15). Then we have
q(x, y, 0) = q0 (x, y). (2.24)
(vii) For t 6= 0, Equations (2.11) and (2.15) can be written as
Z Z ∞ Z k
∂ 1
q(x, y, t) = dk dlL− (k, l) − dlL+ (k, l) ×
∂x π R k −∞
!
i[x(k−l)+y(l 2 −k 2 )+4t (k 3 −l 3 )]
×e λ(x, y, t, l) , (2.25)
Z Z k Z ∞
∂ 1 − +
q(x, y, t) = dk dlR (k, l) − dlR (k, l) ×
∂x π R −∞ k
!
i[x(k−l)+y(l 2 −k 2 )+4t (k 3 −l 3 )]
×e ρ(x, y, t, l) , (2.26)
LEMMA 3.2. The operators I − (L± )t and I − (R± )t are bounded and invertible
on Lp (R) (1 6 p < ∞) and Cb (R).
8 LI-YENG SUNG
We define
φ # (x, y, k) = e−i(xk−yk ) µ# (x, y, k),
2
where # = +, −, l, or r. (3.7)
Then (2.19) and (2.20) can be rewritten as
φ ± = (I − L± )φ l and φ ± = (I − R± )φ r . (3.8)
From (3.8) and Lemma 3.1 we derive the following equations in Cb (R):
φ r = (I − R+ )−1 (I − L+ )φ l and φ r = (I − R− )−1 (I − L− )φ l . (3.9)
For k ∈ R, let the functions ηl (x, y, k) be defined by the following analog
of (2.21):
Z y
i 0
dy 0 ei(ξ −2kξ )(y−y ) qe0 ∗ η̃l (ξ, y 0 , k), (3.10)
2
η̃l (ξ, y, k) = 2π δ(ξ ) +
2π −∞
which is uniquely solvable in Cb (Ry , L1 (Rξ )⊕Cδ(ξ )) under the assumption (2.16).
Of course ηl enjoys the same properties as µl .
Let γ l (x, y, k) be defined by
LEMMA 3.3. Let ζ(ξ, y, k) (resp. τ (ξ, y, k)) be the Fourier transform of
(µl (x, y, k)−1) (resp. the complex conjugate of (η(x, y, k) − 1)) in the x variable.
Then we have
Z Z
lim dξ sup |ζ(ξ, y, k)| = lim dξ sup |τ (ξ, y, k)| = 0. (3.13)
y→−∞ R k∈R y→−∞ R s∈R
Proof. It suffices to discuss the case of ζ(ξ, y, k). From (2.21) we obtain
Z y
0
dy 0 ei(ξ −2kξ )(y−y ) q̃0 (ξ, y 0 ) +
2
ζ(ξ, y, k) = i
−∞
Z y
i 0
dy 0 ei(ξ −2kξ )(y−y ) (q̃0 ∗ ζ )(ξ, y 0 , k).
2
+ (3.14)
2π −∞
It follows immediately from (3.14) that ζ(ξ, y, k) is continuous in all the vari-
ables and hence supy,k∈R |ζ(ξ, y, k)| is a measurable function in ξ . Moreover, we
find from the Neumann series solution of (3.14) that
∞
X
sup |ζ(ξ, y, k)| 6 ζj (ξ ), (3.15)
y,k∈R j =0
where
SOLUTIONS OF THE KADOMTSEV–PETVIASHVILI-I EQUATION 9
j Z j +1
1
kζj kL1 (R) 6 dξ dy|q̃0 (ξ, y)| . (3.16)
2π R2
Proof. It follows from (2.23) and (3.12) that the iterated integral on the left-hand
side of (3.18) is independent of y. Using the notation in Lemma 3.3 we can also
rewrite the left-hand side of (3.18) by the Fourier inversion formula as
Z Z
1 l
dxφ (x, y, k) dsγ l (x, y, s)f (s)
2π R R
Z Z
1 −i(xk−yk 2 ) 2
= dx e (µ (x, y, k) − 1) ds ei(xs−ys ) (ηl (x, y, s) − 1)f (s)+
l
2π R R
Z Z
1 −i(xk−yk 2 ) 2
+ dx e (µ (x, y, k) − 1) ds ei(xs−ys ) f (s)+
l
2π R R
Z Z
1
dx e−i(xk−yk ) ds ei(xs−ys ) (ηl (x, y, s) − 1)f (s) + f (k)
2 2
+
2π R R
2 Z Z
eiyk
ds dξ ζ(k − s − ξ, y, k)τ (ξ, y, s)e−iys f (s)+
2
= 2
(2π ) R R
2 Z
eiyk
ds[ζ(k − s, y, k) + τ (k − s, y, s)]e−iys f (s) + f (k).
2
+ (3.19)
2π R
The lemma follows from (3.13) and (3.19). 2
where
10 LI-YENG SUNG
Since g ∈ L1 (R) is arbitrary and S(R) is dense in L2 (R), the identity (3.20)
follows from (3.24). 2
We now show that (2.11) and (2.15) define the same function q(x, y, t). We can
rewrite (2.10) and (2.14) as
−
P E(x,y,t )(I − L+ )E−1 + − −1
(x,y,t ) + P E(x,y,t )(I − L )E(x,y,t ) λ = 1, (3.25)
−
P E(x,y,t )(I − R+ )E−1 + − −1
(x,y,t ) + P E(x,y,t )(I − R )E(x,y,t ) ρ = 1, (3.26)
PROPOSITION 3.7. The formulas (2.11) and (2.15) define the same function
q(x, y, t).
Proof. The case where t = 0 follows from (2.24). For t 6= 0, in view of (2.25)
and (2.26), we can rewrite (2.11) and (2.15) as
q(x, y, t)
Z
1 ∂ + − −1
= dk E(x,y,t )((I − L ) − (I − L ))E(x,y,t ) λ(x, y, t, l) , (3.28)
π ∂x R
q(x, y, t)
Z
1 ∂ + − −1
= dk E(x,y,t )((I − R ) − (I − R ))E(x,y,t ) ρ(x, y, t, l) . (3.29)
π ∂x R
LEMMA 4.1. For each (y, t), L(y,t ) is a contraction on L2 (A × Rk ) for any A ⊆
Rx .
Proof. We have, by (2.18),
Z k
∓
dlL (k, l)e i[x(k−l)+y(l 2−k 2 )+4t (k 3 −l 3 )]
f (x, l)
2
±∞ L (A×Rk )
Z
6 Cq 0
−2
dl(1 + |k − l|) kf (·, l)kL2 (A)
. (4.2)
R L2 (Rk )
Next we have a technical lemma for functions in the Sobolev spaces Wpm (R).
LEMMA 4.2. Let 1 < p < ∞ and f (k) ∈ Wpm (R) for m > 0. Then
± ∓isk
P (e f (k))
m . (1 + s)−j kf k m+j+1 for s > 0 and j, m > 0. (4.3)
W (R) p Wp (R)
Proof. We will only prove the estimate for g = P+(e−isk f (k)). Also, it suffices
to prove (4.3) for j = 0 since
In the following two lemmas the function F (k, l) is piecewise C ∞ on the half
planes {(k, l) : k > l} and {(k, l) : k 6 l}, and it satisfies the estimates in (2.17).
Rk
LEMMA 4.3. Let H (x, y, t, k) be either P− −∞ dlF (k, l)ei[x(k−l)+y(l −k )+4t (k −l )]
2 2 3 3
R ∞
or P+ k dlF (k, l)ei[x(k−l)+y(l −k )+4t (k −l )] , and H̃ (ξ, y, t, k) be the Fourier trans-
2 2 3 3
form of H (x, y, t, k) in the x variable. Then, for 1 < p < ∞ and m > 0,
SOLUTIONS OF THE KADOMTSEV–PETVIASHVILI-I EQUATION 13
Z
1 + |t|m+2
dξ kH̃ (ξ, y, t, k)kWpm (Rk ) . for t ∈ R, y 6 0. (4.10)
R 1 + |y|1−(1/p)
Proof. We will treat only the case where H is defined by the first formula. By a
change of variables we can write
Z ∞
−
H (x, y, t, k) = P dl eixl e−ik(2yl) G(l, y, t, k), (4.12)
0
where
2 +4t l 3 ) 2 −l 2 k)
G(l, y, t, k) = ei(yl F (k, k − l)e12it (lk . (4.13)
The estimates in (2.17) and the piecewise smoothness of F imply that, for each
(y, t), the map l → e−ik(2yl)G(l, y, t, k) belongs to C((0, ∞), L2 (Rk ))∩L1 ((0, ∞),
L2 (Rk )). Therefore the bounded operator P− can be moved inside the integral in
(4.12) and we obtain from the Fourier inversion formula that
H̃ (ξ, y, t, k) = 2π E+ (ξ )P− e−ik(2yξ ) G(ξ, y, t, k) , (4.14)
where E+ (·) is defined in (2.7).
From (2.17) we also obtain the following estimate:
` `
∂
G(l, y, t, k) . 1 + |t| min 1, 1 for ` > 0. (4.15)
∂k ` 1 + |kl| 1 + |l|
Combining (4.3) (with j = 0, 1) and (4.15) we find, for ξ > 0 and y 6 0,
− −ik(2yξ )
P e G(ξ, y, t, k)
W m
p
m+1
1 + |t| 1 + |t| 1
. min , . (4.16)
|ξ |1/p 1 + |yξ | 1 + |ξ |
The estimate (4.10) follows immediately from (4.14) and (4.16).
The estimate (4.15) and the Lebesgue dominated convergence theorem imply
that the map (ξ, y, t) → e−ik(2yξ ) G(ξ, y, t, k) is continuous from the set {(ξ, y, t) :
ξ 6= 0} into Wpm (R). The precompactness statement follows immediately from the
boundedness of P− on Wpm (R). 2
or
Z ∞
P+
2−k 2 )+4t (k 3 −l 3 )]
dl(k − l)F (k, l)ei[x(k−l)+y(l .
k
where
Proof. We have
Z k Z ∞
[L(y,t )H ](x, k) = P− +
dlL (k, l) + P + −
dlL (k, l) ×
−∞ k
i[x(k−l)+y(l 2 −k 2 )+4t (k 3 −l 3 )]
×e H (x, y, t, l)
= g− (x, y, t, k) + g+ (x, y, t, k), (4.24)
where
g∓ (x, y, t, k)
Z Z
1
= dξ eixξ
dl eixl E± (l)P∓[K± (l, y, t, k)H̃ (ξ, y, t, k − l)], (4.25)
2π R R
and
2 +4t l 3 )
e−ik(2yl) L± (k, k − l)e12it (lk
2 −l 2 k)
K± (l, y, t, k) = ei(yl . (4.26)
We note that (2.17) and (4.10) together imply (ξ, l) → K± (l, y, t, k)H̃ (ξ, y, t,
k−l) ∈ L1 (R2ξ,l , L2 (Rk )) for fixed (y, t). Therefore by integrating against arbitrary
L2 (Rk ) functions we obtain (4.25) as an identity on C(Rx , L2 (Rk )) for each fixed
(y, t).
From (2.17) and (4.3) we obtain the following estimate for y 6 0:
∓
P [K± (l, y, t, k)H̃ (ξ, y, t, k − l)]
2
L (Rk )
. (1 + t 2 )kH̃ (ξ, y, t, ·)kL2 (R) + (1 + |t|)kH̃ (ξ, y, t, ·)kH 1 (R) +
1 1
+ kH̃ (ξ, y, t, ·)kH 2 (R) min , . (4.27)
1 + |yl| 1 + |l|
Combining (4.25), (4.27), (4.10) (with p = 2) and the Plancherel theorem we find
Z
kg∓ kL2 (Rx,k ) . dξ kP∓ [K± (l, y, t, k)H̃ (ξ, y, t, k − l)]kL2 (R2 )
k,l
R
1+t 4
. for y 6 0. (4.28)
1 + |y|
The estimate (4.22) follows from (4.24) and (4.28).
Let ε be an arbitrary positive number. From (4.25) we have
g∓ (x, y, t, k)
Z Z
1
= dξ + dξ eixξ ×
2π |ξ |<ε |ξ |>ε −1
Z
× dl e E± (l)P∓[K± (l, y, t, k)H̃ (ξ, y, t, k − l)]+
ixl
R
Z Z
1
+ dξ e ixξ
dl eixl E± (l)P∓ [K± (l, y, t, k)H̃ (ξ, y, t, k − l)]
2π ε<|ξ |<ε−1 R
= I1 (x, y, t, k) + I2 (x, y, t, k). (4.29)
16 LI-YENG SUNG
It follows from (4.14), (4.15) and (4.27) that for bounded t we can make
kI1 (x, y, t, k)kL2 (Rx,k ) arbitrarily small by choosing ε small enough.
On the other hand, since |K± (l, y, t, k)| . (1 + |l|)−1 by (2.17), the map
(y, t) → K± (l, y, t, k)f (k) ∈ C(R2 , L2 (R2k,l )) for any f ∈ L2 (R). This together
with (4.11) show that P± [K(l, y, t, k)H̃ (ξ, y, t, k − l)] forms a precompact subset
of L2 (R2l,k ) for ε < |ξ | < ε −1 and (y, t) ∈ B.
R
The Plancherel theorem then implies R dl eixl E± (l)P∓[K(l, y, t, k)H̃ (ξ, y, t,
k − l)] forms a precompact subset of L2 (Rx,k ) for ε < |ξ | < ε −1 and (y, t) ∈ B.
Hence, we have
Z
dl e E± (l)P [K(l, y, t, k)H̃ (ξ, y, t, k − l)]
ixl ∓
→0 (4.30)
R L2 (r )
Proof. Let χy,t (x, k) = (1 + |y|)[λ(x, y, t, k) − 1 − (L(y,t )1)(x, k)]. Then χ(y,t )
satisfies
Note that L(y,t )1 has the same properties as H in Lemma 4.3. The estimate (4.31)
then follows from (4.22) and (4.33).
Let ωy,t (x, k) = λ(x, y, t, k) − 1 − (L(y,t )1)(x, k), then we have
The limit (4.32) follows from Lemma 4.1, Lemma 4.3, (4.23) and (4.34). 2
−∞
× [λ(x, y, t, l) − 1 − (L(y,t )1)(x, l)]. (4.42)
For the case j = 1, the lemma follows from Lemma 4.1, (4.37), (4.39), (4.43),
(4.44) and the corresponding results for h+ (x, y, t, k). The higher order cases are
established by similar techniques and mathematical induction. 2
PROPOSITION 4.8. Lemma 4.7 (resp. Lemma 4.6) is valid if λ(x, y, t, k) is re-
∂`
placed by ∂y ` λ(x, y, t, k) (resp. λ(x, y, t, k) − 1 − (L(y,t )1)(x, k) is replaced by
∂ j+`
∂x j ∂y `
[λ(x, y, t, k) − 1 − (L(y,t )1)(x, k)]) for ` > 0 (resp. j, ` > 0).
REMARK 4.9. We have so far obtained results concerning λ(x, y, t). But of course
similar results (for y > 0) also hold for ρ(x, y, k, t). This is the reason why we
need both (2.11) and (2.15) to represent the solution.
THEOREM 5.1. Let q0 be a Schwartz function which satisfies the smallness as-
sumption (2.16) and q(x, y, t) be the solution of (1.1) obtained by the inverse
spectral method. Then for each t, q(·, t) ∈ H j (R2 ) for j > 0, and the map
t → q(·, t) is continuous from R into H j (R2 ).
Proof. Throughout this proof Qj is a polynomial in one real variable. From
(2.11) we can write
q(x, y, t) = q1,+ (x, y, t) + q1,− (x, y, t) + q2,+ (x, y, t) + q2,− (x, y, t),(5.2)
where
Z Z ±∞
i
dl ei[x(k−l)+y(l −k )+4t (k −l )] ×
2 2 3 3
q1,∓ (x, y, t) = dk
π R k
× (k − l)L∓ (k, l)λ(x, y, t, l), (5.3)
q2,∓ (x, y, t)
Z Z ±∞
1
dl ei[x(k−l)+y(l −k )+4t (k −l )] L∓ (k, l)λx (x, y, t, l). (5.4)
2 2 3 3
= dk
π R k
where
Z Z ∞
i 2−k 2 )+4t (k 3 −l 3 )]
v1 (x, y, t) = dk dl ei[x(k−l)+y(l (k − l)L− (k, l), (5.8)
π R k
Z Z ∞
i 2−k 2 )+4t (k 3 −l 3 )]
v2 (x, y, t) = dk dl ei[x(k−l)+y(l ×
π R k
where kGkL2 (R2 ) < ∞. It then follows from (5.13) and the Plancherel theorem that
kv1 (x, y, t)kL2 (R2x,y ) . 1 for t ∈ R. (5.14)
Next we investigate the square integrability of v2 (x, y, t). Let H (k, l) = i(k −
l)L−(k, l) and M(ξ, y, t, k) be the Fourier transform of (L(y,t )1)(x, k) in the x
variable. Note that the results for H̃ (ξ, y, t, k) in Lemma 4.3 are also valid for
M(ξ, y, t, k).
We can rewrite v2 (x, y, t) as
Z Z
1
dl dk ei(xl+yl +4t l ) e−i2kly e12it (lk −l k) ×
2 3 2 2
v2 (x, y, t) = 2
dξ eixξ
2π R R2
× E− (l)H (k, k − l)M(ξ, y, t, k − l), (5.15)
From (2.17) we have
20 LI-YENG SUNG
Z Z 3/2
dl dk|H (k, k − l)| 4/3
< ∞. (5.16)
R R
By the Plancherel theorem, Lemma 4.3, (5.15), (5.16) and Hölder’s inequality we
have
Z
Z
kv2 (x, y, t)kL2 (Rx ) . dξ
dk|H (k, k − l)M(ξ, y, t, k − l)|
R R L2 (Rl )
Z Z 3/2 1/2
. dl dk|H (k, k − l)|4/3 ×
R R
Z
× dξ kM(ξ, y, t, ·)kL4 (R)
R
6 Q5 (t)(1 + |y|)−3/4 for t ∈ R, y 6 0. (5.17)
It follows from (5.17) that
kv2 (x, y, t)kL2 ({(x,y):y 60}) 6 Q6 (|t|) for t ∈ R. (5.18)
Moreover, by splitting the integral in (5.15) over the sets {ξ : |ξ | < ε −1 or
ε < |ξ |} and {ξ : ε < |ξ | < ε −1 }, we obtain from Lemma 4.3 (cf. the arguments in
the proof of Lemma 4.5) that, for any bounded subset B of R2 ,
lim kv2 (x, y, t)kL2 (|x|>r) = 0 uniformly for (y, t) ∈ B. (5.19)
r→∞
It follows from (5.7), (5.12), (5.14), (5.18) and their analogs for q1,+ that
kq1,± (x, y, t)kL2 ({(x,y):y 60}) 6 Q7 (|t|) for t ∈ R. (5.20)
The estimates (5.6) and (5.20) together with (5.2) show that
kq(x, y, t)kL2 ({(x,y):y 60}) 6 Q8 (|t|) for t ∈ R. (5.21)
Similarly, using Proposition 3.7 and Remark 4.9, we find
kq(x, y, t)kL2 ({(x,y):y >0}) 6 Q9 (|t|) for t ∈ R. (5.22)
From (5.5), (5.11), (5.13) and (5.17) we see that the L2 ({(x, y) : y 6 0}) norm
of q(x, y, t) becomes arbitrarily small for |y| large and t bounded. So to prove
the continuous dependence of q(x, y, t) in t with respect to L2 ({(x, y) : y 6 0}),
it suffices to look at the case where (y, t) ∈ B and B is a bounded subset of
{(y, t) : y 6 0}. Then (4.32), (4.36), (5.13) and (5.19) further show that we may
also assume x be bounded. The continuity of q(x, y, t) in t with respect to the
L2 ({(x, y) : y 6 0}) norm therefore follows from (2.27). In view of Remark 4.9
and Proposition 3.7 this is also true in the L2 (R2x,y ) norm.
We have proved the case where j = 0. The other cases are established by similar
techniques using Lemmas 4.3–4.5 and Proposition 4.8. 2
SOLUTIONS OF THE KADOMTSEV–PETVIASHVILI-I EQUATION 21
Let 1ε (x, y, t) = q1,ε (x, y, t) − q2,ε (x, y, t), ωε (x, y, t) = (q12 )ε (x, y, t) −
(q2 )ε (x, y, t), and Rε (x, y, t) = r1,ε (x, y, t) − r2,ε (x, y, t). From (6.5), (6.6), (6.8)
2
Taking the real part of (6.10) we find dtd (1ε , 1ε ) = −3Re(ωε , (1ε )x ), which
together with (6.9) imply that
Z t
k1ε (·, t)k2 = −3 dt 0 Re(ωε (·, t 0 ), (1ε )x (·, t 0 )). (6.11)
0
where 1(x, y, t) = q1 (x, y, t)−q2 (x, y, t) and σ (x, y, t) = q1 (x, y, t)+q2 (x, y, t).
Let T > 0 be arbitrary. It follows from (6.3), (6.12) and the Sobolev inequality
(cf. [18]) that
Z t
k1(·, t)k 6 CT
2
dt 0 k1(·, t 0 )k2 + k1x (·, t 0 )k2 for 0 6 t 6 T . (6.13)
0
THEOREM 6.4. Let q0 be a Schwartz function which satisfies the smallness as-
sumption (2.16). The solution q(x, y, t) for (1.1) obtained by the inverse spectral
method is the unique generalized solution of the forward problem for (1.1) in
the class C([0, ∞), H 3 (R2 )) ∩ C 1 ((0, ∞), L∞ (R2 )). It is also the unique gen-
eralized solution of the backward problem in the class C((−∞, 0], H 3 (R2 )) ∩
C 1 ((−∞, 0), L∞ (R2 )).
LEMMA 6.5. Let q(x, y, t) be the solution obtained by the inverse spectral method.
Then we have
Z Z
dx dy q (x, y, t) =
2
dx dy q02 (x, y). (6.18)
R2 R2
Proof. We use the notation in the proof of Proposition 6.3. From Equations (6.5)
and (6.6) (for qε and rε ) we have
Z Z
(d/dt) dx dy (qε ) = −3
2
dx dy (q 2 )ε (qε )x . (6.19)
R2 R2
REMARK 6.6. When q0 is real, the uniqueness of solution for (1.1) implies that
q(x, y, t) is real for all t ∈ R and hence kq(x, y, t)kL2 (R2x,y ) is conserved.
24 LI-YENG SUNG
Acknowledgment
This work was supported in part by the National Science Foundation under Grant
DMS-94-96154.
References
1. Ablowitz, M. J. and Villarroel, J.: On the Kadomtsev–Petviashvili equation and associated
constraints, Stud. Appl. Math. 85 (1991), 195–213.
2. Boiti, M., Pempinelli, F. and Pogrebkov, A.: Solutions of the KPI equation with smooth initial
data, Inverse Problems 10 (1994), 505–519.
3. Boiti, M., Pempinelli, F. and Pogrebkov, A.: Properties of solutions of the Kadomtsev–
Petviashvili I equation, J. Math. Phys. 35 (1994), 4683–4718.
4. Faminskii, A. V.: The Cauchy problem for the Kadomtsev–Petviashvili equation, Russian Math.
Surveys 5 (1990), 203–204.
5. Fokas, A. S. and Ablowitz, M. J.: On the inverse scattering of the time-dependent Schrödinger
equation and the associated Kadomtsev–Petviashvili (I) equation, Stud. Appl. Math. 69 (1983),
211–228.
6. Fokas, A. S. and Sung, L.-Y.: The Cauchy problem for the Kadomtsev–Petviashvili-I equation
without the zero mass constraint, Math. Proc. Camb. Phil. Soc. 125 (1999), 113–138.
7. Johnson, R. S.: Water waves and Korteweg–de Vries equations, Fluid Mech. 97 (1980), 701–
719.
8. Kadomtsev, B. B. and Petviashvili, V. I.: On the stability of solitary waves in weakly dispersive
media, Soviet Phys. Dokl. 15 (1970), 539–541.
9. Lipovskij, V. D., Matveev, V. B. and Smirnov, A. O.: On the isomorphism between KP-equation
and Johnson equation, Zap. Semin. LOMI 150 (1986), 70–75.
10. Manakov, S. V.: The inverse scattering transform for the time-dependent Schrödinger equation
and Kadomtsev–Petviashvili equation, Physica D 3 (1981), 420–427.
11. Muskhelishvili, N. I.: Singular Integral Equations, Dover, New York, 1992.
12. Saut, J.-C.: Remarks on the generalized Kadomtsev–Petviashvili equations, Indiana Math. J.
42 (1993), 1011–1027.
13. Saut, J.-C.: Recent results on the generalized Kadomtsev–Petviashvili equations, Acta Appl.
Math. 39 (1995), 477–487.
14. Segur, H.: Comments on IS for the Kadomtsev–Petviashvili equation, in: M. Tabor and Y. M.
Treve (eds.), Mathematical Methods in Hydrodynamics and Integrability in Dynamical Systems,
AIP Conference Proceedings, No. 88, 1982, pp. 211–228.
15. Stein, E.: Singular Integrals and Differentiability Properties of Functions, Princeton University
Press, Princeton, 1970.
16. Tom, M. M.: On a generalized Kadomtsev–Petviashvili equation, Contemp. Math. 200 (1996),
193–210.
17. Ukai, S.: Local solutions of the Kadomtsev–Petviashvili equation, J. Fac. Sci. Univ. Tokyo,
Sect. IA Math. 36 (1989), 193–209.
18. Wloka, J. T.: Partial Differential Equations, Cambridge University Press, Cambridge, 1987.
19. Yosida, K.: Functional Analysis, Springer-Verlag, Berlin, 1995.
20. Zhou, X.: Inverse scattering transform for the time dependent Schrödinger equation with
applications to the KP-I equation, Comm. Math. Phys. 128 (1990), 551–564.
Mathematical Physics, Analysis and Geometry 2: 25–51, 1999.
25
© 1999 Kluwer Academic Publishers. Printed in the Netherlands.
WERNER KIRSCH
Institute of Mathematics of the Ruhr-University Bochum, D-44780 Bochum, Germany
VLADIMIR KOTLYAROV
Mathematical Division, B. Verkin Institute for Low Temperature Physics, 310164 Kharkov, Ukraine
Introduction
It has been known for a long time [1 – 3] that the asymptotics of the solution of
the initial value problem with decreasing initial data is a superposition of solitons.
This superposition is the main term of the asymptotics. The next term tends to zero
when t → ∞. Sometimes, the phases of these solitons are the same as in the reflec-
tionless case. They depend on the eigenvalues and the normalization coefficients
only. This situation takes place for those nonlinear evolution equations integrated
by the inverse scattering transform, for which the kernel of the Marchenko integral
equation has no stationary point in the soliton domain. These are, for example,
the Korteweg–de Vries equation and the modified one. For many other integrable
equations, the kernel of the Marchenko integral equation has a stationary point. In
this case, the phases of solitons depend, in addition, on the reflection coefficient
[4 – 6]. It is important to remark that the additional phase shift, contributed by
non-zero reflection coefficient, has a finite order for t → ∞.
The first regorous results on this subject were obtained in the well-known papers
by A. B. Shabat [7] for the Korteweg–de Vries equation with decreasing initial
data, and by E. Ya. Khruslov for a step-like initial function [8]. Recently, Deift, Its
26 WERNER KIRSCH AND VLADIMIR KOTLYAROV
and Zhou [9, pp. 181–204] formulated the steepest descent method for the study
of long-time asymptotics for integrable nonlinear wave equations, based on the
oscillatory Riemann–Hilbert problem. This is a modern and very powerful method
for the asymptotic investigation of decreasing solutions of nonlinear wave equa-
tions as well as for the Painlevé equations, and for some models of quantum and
statistical physics. The references to this approach can be found in the book [9].
The Riemann–Hilbert method was also applied by Fokas and Its to the study of
initial boundary value problems on the semi-axis [10, 11]. Nevertheless, in the case
of non-decreasing solutions it is more convenient to use the associated Marchenko
integral equations.
The aim of this paper is twofold. The first is to present rigorous results on
the soliton asymptotics of a solution of the sine-Gordon equation with decreas-
ing and some non-decreasing initial data based on direct asymptotic analysis of
the Marchenko integral equation. The second is to obtain explicitly the additional
phase shift caused by the non-zero reflection coefficient.
Taking into account the equivalence [2] of the sine-Gordon equation given in
laboratory coordinates and in light-cone coordinates, we restrict our consideration
to the equation
A large class of solutions of Equation (1) can be constructed as follows. Let the
system of integral equations
Z ∞
K1 (x, y, t) + K2 (x, z, t)H (y + z, t) dz = 0 (x < y < ∞),
x
Z (2)
∞
K2 (x, y, t) + K1 (x, z, t)H (y + z, t) dz = −H (x + y, t),
x
where
Z
H (y, t) = eiλy+2it /λ dρ(λ), ⊂ C,
have a unique solution, which is sufficiently smooth. Then the function u(x, t)
defined by the equation
THEOREM 1. Let w(x) be a real Schwarz type function (w(x) ∈ S(R)). Then,
uniformly with respect to x ∈ R for t → ∞, the solution of the problem (3), (4)
has the asymptotic form
∂ Im det[I + D(x, t)]
v(x, t) = 4 arctan + o(1), t → ∞.
∂x Re det[I + D(x, t)]
The entries of the matrix D have the form
imj exp[2it/λj ]
Dj l (x, t) = exp i(λj x + θj (X) + λl x + θl (X)) ,
λj + λl
where
Z
1 α(X)
ln[1 + |r(s)|2 ] ds 1 x
θj (X) = , α(X) = √ , X= .
2π −α(X) s − λj 2X 2t
where
vj (x, t) = 4σj arctan exp −2νj x − νj−2 t − xj + 2π σj (j − 1),
σj = sign Im(mj )
is the soliton-kink with the phase
Z νj
1 ln[1 + |r(s)|2 ] ds
xj = xj −
0
, j = 1, 2, . . . , n,
2π −νj s 2 + νj2
28 WERNER KIRSCH AND VLADIMIR KOTLYAROV
and
Im λn+j sin[2 Re λn+j (x + |λn+j |−2 t + βj )]
wj (x, t) = 4 arctan
Re λn+j cosh[2 Im λn+j (x − |λn+j |−2 t − λj )]
is the soliton-breather with the phases
Re θn+j Im θn+j
βj = βj0 + , γj = γj0 − ,
Re λn+j Im λn+j
where
Z |λn+j |
1 ln[1 + |r(s)|2 ] ds
θn+j = , j = 1, 2, . . . , p.
2π −|λn +j | s − λj
The soliton phases xj0 , βj0 and γj0 have to be taken in the same form as in the
reflectionless case (r(λ) ≡ 0).
In the non-decreasing case, the kernel of the Marchenko integral equation is as
follows
Z ∞ 2n Z
1 X X N
1
H (y) = r(λ) e dλ +
iλy
h(λ) e dλ +
iλy
mj (t) eiλj y . (6)
2π −∞ 2π k=1 γk j =1
This form of the kernel corresponds to the following structure of the spectrum of
the Dirac operator with a “step-like” potential w(x): this operator has a continuous
spectrum of multiplicity two on the real line R of the λ-plane, and a continuous
spectrum of multiplicity one on a set of analytic arcs γj (j = 1, 2, . . . , 2n) with
end-points that are zeros of the polynomial
Y
k Y
m+k+1
P (λ ) =
2
λ −
2
Ei2 λ2 − Ej2 λ2 − E 2j ,
i=1 j =k+1
Re Ei = 0, i 6 k, Im Ej > 0, k + 2m = n,
which corresponds to the Riemann surface of an almost periodic potential wn (x)
[12] and, possibly, a finite number of eigenvalues λj which lie on the imaginary
axis of the λ-plane and symmetrically with respect to this axis.
Let σ be the complete spectrum of the Dirac operator
d w(x)
L = iσ3 +i σ2 ,
dx 2
which acts in the space L2 (R). Suppose that the condition
C = max |λ|−2 > 0 (7)
λ∈σ \R
bj ∈ σ \
holds and the maximal value is attained in a finite number of points E
R, j = 1, 2, . . . , 2p.
SOLITON ASYMPTOTICS OF SOLUTIONS OF THE SINE-GORDON EQUATION 29
Let
1
DN = (x, t) : x > Ct − ln t N+1
, t > t (N) , (8)
2 Im E
bj .
where Im E = max16j 6n Im E
THEOREM 2. Let condition (7) be fulfilled and for all eigenvalues λj , |λj | > √1C .
Then for any natural number N, the solution of problem (3), (5) in the domain (8)
has the asymptotic form
∂ Im det[I + (ξ, t)]
v(x, t) = 4 arctan + o(1), ξ = x − Ct, t → ∞, (9)
∂x Re det[I + (ξ, t)]
p
where (ξ, t) = kB lr (ξ, t)kl,r=1 is a block matrix with the entries
X
N−k−1
wl(k+m) (t) X X lrpq
m j
lr
Bkj (x, t) = lr
G (t)Jp+q (ξ ),
m=0
k!m!t k+m+3/2 p=0 q=0 mj
Z ∞
b b
Jplr = τ p ei(El +Er )τ dτ.
ξ
lrpq
The functions ωl(n) (t) and Gmj (t) are determined by the scattering data of the
initial function w(x) and they are uniformly bounded with respect to t.
THEOREM 3. Under the conditions of Theorem 2, let us suppose that the max-
b1 = ia and E
imal value (7) is attained in two points E b2 = −ia, a > 0. Then
formula (9) takes the form
[(N+1)/2]
∂ X
v(x, t) = 2π(n − 1) +
∂x n=1
+ 4 arctan exp −2a x − a −2 t − αn (x, t) + o(1), t → ∞, (10)
where
Z √
1 1 t /x
ln[1 + |r(µ)|2 ] dµ
αn (x, t) = αn0 − ln t 2n−1/2 − √
. (11)
2a 2π − t /x µ2 + a 2
The numbers αn0 are determined (Equation (5.11)) by the scattering data of the
initial function w(x), and r(µ) is the reflection coefficient of the Dirac operator L.
THEOREM 4. Under the conditions of Theorem 2, let us suppose that the maxi-
mal value in (7) is attained in four points E b1,2 = ±E, Eb3,4 = ±E. Then formula
(9) takes the form
[(N+1)/2]
∂ X Im E sin[2 Re E(x + |E|−2 t + βn (x, t))]
v(x, t) = 4 arctan +
∂x n=1 Re E cosh[2 Im E(x − |E|−2 t + γn (x, t))]
+ o(1), (12)
30 WERNER KIRSCH AND VLADIMIR KOTLYAROV
Z √
1 t /x
ln[1 + |r(µ)|2 ] dµ
βn (x, t) = βn0 + Re √
, (13)
2π Re E − t /x µ−E
1
γn (x, t) = γn0 + ln t 2n−1/2 −
2 Im E
Z √t /x
1 ln[1 + |r(µ)|2 ] dµ
− Im √
. (14)
2π Im E − t /x µ−E
The numbers βn0 , γn0 are determined (Equations (5.12), (5.13)) by the scattering
data of the initial function w(x).
REMARK. √ If the Dirac operator has a discrete spectrum which lies inside the circle
of radius 1/ C, then the superposition of usual solitons, i.e. kinks (Re λk = 0) and
breathers (Re λk 6= 0), should be added to Equation (10) and (12) of asymptotic
solitons. Each asymptotic soliton gets an additional phase shift caused by solitons
generated by the discrete spectrum.
where K1 (x, y, t) = K 1 (x, y, t), K2 (x, y, t) = −K 2 (x, y, t), and these functions
solve the integral Marchenko equation (2). For this case, the kernel of Equation (2)
has the form
X N Z ∞
1
H (y, t) = mj (t) eiλj y
+ r(λ, t) eiλy dλ (1.2)
j =1
2π −∞
with
mj (t) = mj e2it /λj , r(λ, t) = r(λ) e2it /λ .
Here the numbers λj (j = 1, 2, . . . , N) are the eigenvalues of the Dirac operator
L. Since w(x) is a real function, the eigenvalues are distributed in two subsets [2]:
λj = iνj , νj > 0, j = 1, 2, . . . , n;
r(−λ) = −r̄(λ), λ ∈ R.
It is known [2] that for any t ∈ R Equation (1.4) has a unique solution in each
space L(i) [x, ∞) (i = 1, 2, ∞) and if r(λ) ∈ S(R), then K(x, y, t) is a Schwarz
type vector-function for each t, too.
and let L = (L1 (x, y, t), L2 (x, y, t)) be the solution of the integral equation
I +Rb L = GR , GR = (0, −R). (2.1)
and it is easy to obtain an estimate, uniform with respect to x, t, for the vector-
function L
Z ∞
L(x, ., t)
2 6 kGR k2 = kRk2 = σ (2x, t) = R(s, t)2 ds
2 2 2
Z ∞ 2x
1 2
6 r(λ) dλ = C 2 , (2.2)
2
2π −∞
32 WERNER KIRSCH AND VLADIMIR KOTLYAROV
where
Z
∞ 2
L(x, ., t)
2 = |L1 (x, y, t)|2 + L2 (x, y, t) dy.
2
x
Moreover, there is a uniform estimate in the space L∞ [x, ∞) for the components
Lj (x, y, t)
L1 (x, ., t)
6 kRk2 kL2 k2 6 σ (2x, t) 6 C 2 , (2.3)
∞ 2
L2 (x, ., t)
6 kRk∞ + kRk2 kL1 k2
∞
6 sup |R(y, t)| + σ (2x, t) 6 C1 + C22 , (2.4)
y>2x
where
Z ∞
1
C1 = |r(λ)| dλ.
2π −∞
Proof. The operator identities (2.6) are equivalent to pair equations of Marchenko
type on functions Q1 (y, z) and Q2 (y, z). Since the functions L1 (y, z) and L2 (y, z)
SOLITON ASYMPTOTICS OF SOLUTIONS OF THE SINE-GORDON EQUATION 33
are the solution of Equation (2.1), it is easy to check that the representations for
Q1 (y, z) and Q2 (y, z) are valid. 2
Z y
gj(1) (x, y, t) = hj(1)(y, t) + hj(1)(s, t)L1 (s, y, t) +
x
+ hj(2)(s, t)L2 (s, y, t) ds,
Z y
(1)
gj(2) (x, y, t) = hj(2)(y, t) + hj (s, t)L2 (s, y, t) +
x
+ hj(2)(s, t)L1 (s, y, t) ds.
The next lemma presents a degenerate integral equation for the problem (3), (4).
(1)
X
N
GN (x, y, t) = FN(1) (x, y, x, t) = mj (t) eiλj y h(2)
j (x, t),
j =1
(2.9)
(2)
X
N
GN (x, y, t) = FN(2) (x, y, x, t) = mj (t) eiλj y h(1)
j (x, t).
j =1
34 WERNER KIRSCH AND VLADIMIR KOTLYAROV
Proof. It follows from Equations (1.4), (2.1), (2.5) and (2.6) that
M +H bN I + Q b M = −H bN L + GHN .
bN = H
Then the operator F bN (I + Q)
b is degenerate and the corresponding kernel has
the form (2.8). The vector-function GN = −H bN L + GHN and the corresponding
components have the form (2.9). 2
LEMMA 3. The solution of Equation (2) with kernel (1.2) has the representation
X
N
K1 (x, y, t) = L1 (x, y, t) + Xj (x, t)gj(1) (x, y, t) +
j =1
+ Yj (x, t)gj(2) (x, y, t) ,
(2.10)
X
N
K2 (x, y, t) = L2 (x, y, t) + Xj (x, t)gj(2) (x, y, t) +
j =1
+ Yj (x, t)gj(1) (x, y, t) ,
where L = (L1 , L2 ) is the solution of Equation (2.1) and the N-dimensional vec-
tors X = (X1 , X2 , . . . , XN ) and Y = (Y1 , Y2 , . . . , YN ) are the solution of the
system of linear algebraic equations
X
N
Xj + (Aj l Xl + Bj l Yl ) = −mj (t)hj(2) ,
l=1
(2.11)
X
N
Yj + (Aj l Yl + Bj l Xl ) = −mj (t)hj(1) .
l=1
X
N X
N
M1 (x, y, t) = iλj y
Xj (x, t) e , M2 (x, y, t) = Yj (x, t) eiλj y .
j =1 j =1
Then Equation (2.7) is equivalent to the system (2.11) of linear algebraic equations,
and Equation (2.5) leads to the representation (2.10). 2
SOLITON ASYMPTOTICS OF SOLUTIONS OF THE SINE-GORDON EQUATION 35
COROLLARY 3. The solution of the problem (3), (4) has the form
X
N
v(x, t) = 4iL2 (x, x, t) + 4i Xj (x, t)h(2) (1)
j (x, t) + Yj (x, t)hj (x, t) .
j =1
So, the solution of the initial value problem (3), (4) is completely defined via
the solution L of Equation (2.1) and Jost type functions hj(1)(x, t) and hj(2)(x, t).
Therefore we have to study the asymptotic behaviour of these functions for a large
time.
So, the first estimate (3.5) is valid. For the second one we can write
kSk∞ 6 kGR1 k∞ + kR1 k2 kP k2 +
R b
kSk2 .
2
By using Equations (3.1), (3.6), (3.8), (2.2) and taking into account the first esti-
mate (3.5) we finally obtain
C b2
C(ρ)C C
kSk∞ 6 sup |ρ(Y )| + + C(ρ, X0 ) √ .
t 3/2 Y >X t t
Then Equation (3.11) and the complex conjugated one may be written as follows
N2 + b0t1 + b
0t2 N2 − b 0t3 + b
0t4 N 2 = −r̂,
4 2
N2 + b 0t + b0 t N2 − b 0t + b
3 1
¯
0 t N 2 = −r̂,
i.e.,
f +b0t f = g, (3.13)
b1 b2
0t −b0t3 0t −b0t4
b
0t = + . (3.14)
b
−0t
3
b 1
0t −b0t
4
b0
2
t
It is easy to see that
∗ b1 ∗ ∗
0t = b =b
1 3
0t1 = b
b 0t1 , 0t , b
0t3 0t .
Therefore the first summand in (3.14) is a self-adjoint operator in L2 [X, ∞).
The second summand is not self-adjoint in the same space.
LEMMA 5. For any X > X0 > −∞ and for any fixed t, the operator b 0t is a
compact operator in the space L2 [X, ∞), and Re(b 0t f, f ) > 0 for any vector-
function f (Z) ∈ L2 [X, ∞) and for sufficiently large t.
Proof. The compactness of the operator b0t follows from the fact that it is an op-
erator of the Hilbert–Schmidt type in the space L2 [X, ∞). This statement follows
from the inequality
Z ∞
2
Y r̂(Y + 2X) dY < ∞. (3.15)
0
38 WERNER KIRSCH AND VLADIMIR KOTLYAROV
Z ∞
X + Y 1/4 X + S 1/4
− + ×
X+S X+Y
X
r0 (X + Y )r̄0 (X + S)
× 4π i(S − Y + i0) ϕ(S) dS, i = 1,
lim b
0i ϕ = Z ∞ (3.16)
t →∞ t
X + Y 1/4 X + S 1/4
− − ×
X+S X+Y
X
r (X + Y )r0 (X + S)
× 0 ϕ(S) dS, i = 3,
4π(S − Y )
SOLITON ASYMPTOTICS OF SOLUTIONS OF THE SINE-GORDON EQUATION 39
√
holds with r0 (Y ) = r(1/ Y ) and r(λ) the reflection coefficient; the equalities are
valid in the metric of the space L2 [X, ∞).
Lemma 7 is proved. 2
It follows from the given proof that for the solution N2 (X, Y, t) of Equation
(3.11) the following estimate is valid
N2 (X, Y, t) − N(X, Y )
= o(1), t → ∞, (3.17)
2
LEMMA 8. Equation (3.18) has a unique solution in the space L2 [X, ∞) which
can be represented in the explicit form
N(X, Y ) = −r̂(X + Y ) ×
Z ∞
i X + Y 1/2 ln[1 + |r0 (X + S)|2 ]
× exp dS . (3.19)
2π X X+S S − Y + i0
The function N(X, Y ) is bounded for Y ∈ [X, ∞).
Proof. We look for the solution N(X, Y ) in the form
N(X, Y ) = −r̂(X + Y )A(X, Y ),
where
eiπ/4
r̂(X + Y ) = √ (X + Y )−3/4 r0 (X + Y ).
2π
Then
Z r r !
∞
X+Y X + Y |r0 (X + S)|2 A(X, S)
A(X, Y ) − 1+ dS −
X X+S X+S 4π i(S − Y + i0)
Z ∞r r !
X+Y X + Y |r0 (X + S)|2 Ā(X, S)
− 1− dS = 1.
X X+S X+S 4π i(S − Y )
LEMMA 9. Let L1 (x, y, t), L2 (x, y, t) be the solution of Equation (2.1), and let
L1(0)(x, y, t), L2(0)(x, y, t) be defined by Equation (3.20). Then the estimates
L1 (x, y, t) − L0 (x, y, t)
= o(1), t → ∞, (3.21)
∞
1
L2 (x, y, t) − L (x, y, t)
= o(1), t → ∞,
0
(3.22)
2 2
are valid.
Proof. It follows from Equations (3.2), (3.9), (3.10) and estimates (3.5), (3.17)
that
L2 (x, . , t) − L(0)(x, ., t)
2 L2 [x,∞)
6 2 N2 (x, ., t) − N(x, ., t)
+
S2 (x, ., t)
L2 [X,∞) L2 [x,∞)
= o(1), t → ∞.
Estimate (3.22) can be easily deduced from Equation (3.11). 2
1
h2j (x, t) = √ ηj (X, t) exp(iλj x). (4.2)
t
The functions εj (X, t) = o(1) as t → ∞, and the functions ηj (X, t) are uniformly
bounded with respect to X and t.
Proof. For h1j (x, t), by virtue of (3.20) and (3.21), we have
Z ∞
−iλj x
1
hj (x, t) e −1 = L1(0)(x, y, t) eiλj (y−x) dy −
x
Z ∞
(0)
− L1 (x, y, t) − L1 (x, y, t) eiλj (y−x) dy
Z ∞x
= L1(0)(x, y, t) eiλj (y−x) dy + o(1), t → ∞.
x
where ν(X) = 2π 1
ln[1 + |r0 (2X)|2 ], and the function A0 (X, Z) is continuous and
bounded when Z → X. By using the last equality, it is easy to obtain the estimate
1
J 1 (λj ) + J −λ̄j = O t −1 , t → ∞.
2
The integrals J 2 (λj ) and J (−λ̄j ) give us
Z α Z α
2 |r(µ)|2 B(X, µ) dµ |r(µ)|2 B(X, µ) dµ
J (λj ) + J −λ̄j = −
2
+ ,
0 2π i(µ − λj ) 0 2π i(µ + λj )
LEMMA 11. For the entries (2.13) of the matrices A and B, the relations
imj (t) ϕj l (X, t)
Aj l (x, t) = √ exp(i(λj + λl )x),
t λj + λl
and
imj (t)[1 + εj l (X, t)]
Bj l (x, t) = exp i(λj x + θj (X) + λl x + θl (X))
λj + λl
are true, and the functions ϕj l (X, t) are uniformly bounded with respect to X and
t.
From the previous results, we deduce the following statement.
THEOREM 10 . For t → ∞ the solution of the problems (3), (4) has the asymptotic
form
X
N
v(x, t) = 4i Yj(0)(x, t) eiλj x+iθj (X) + o(1), x > X0 t (X0 > −∞), (4.3)
j =1
where the functions Yj(0)(x, t) (together with Xj(0) ) solve the system of linear alge-
braic equations
X
N
Xj(0) (x, t) + Dj l (x, t)Yl(0) (x, t) = 0,
l=1
(4.4)
X
N
Dj l (x, t)Xl(0) (x, t) + Yj(0)(x, t) = −mj (t) eiλj x+iθj (X)
l=1
44 WERNER KIRSCH AND VLADIMIR KOTLYAROV
H (x + y, t) = HN (x + y, t) + RN (x + y, t), (5.1)
where
X
p
X
N−1
(ξ + η)n ωk(n) (t)
HN (x + y, t) = exp[iEk (ξ + η)]
k=1 n=0
t n+3/2
X
p
X
N−1 X
N−k−1
= eiEl (ξ +η) ηk (l)
ωkm (t)ξ m , (5.2)
l=1 k=0 m=0
The functions ωk(m) (t) were defined in [12]. It is important to remark here that they
are uniformly bounded with respect to t. The function R0 (y, t) is the same as that
in Equation (3.1), i.e.,
1 h √ √ i
−4it Y
e iπ4 −3/4 1
R0 (y, t) = √ r̂(Y ) e 4it Y
− r̂(Y ) e , r̂(Y ) = √ Y r √ .
t 8π Y
For the function RN1 (y, t), the estimate
1 N − Im Eη
R (y, t) 6 C1 ρ(Y ) + C2 + C3 |η| e (5.4)
N
t 3/2 t 2y2 t N+3/2
holds. Therefore, if L = (L1 , L2 ) is the solution of Equation (2.1) with kernel
(5.3), then for L = P + S we obtain now the estimates
1 ln t
kP1 k∞ = O(1), kP2 k∞ = O √ , kSk2,∞ = ON √ (5.5)
t t
as t → ∞.
Due to Equations (5.1)–(5.3) and estimates (5.4), (5.5), we are in the same
situation as in the previous case. But, since the degenerate kernel (5.2) has a form
different from (1.2), we need to introduce new functions of Jost type. Namely,
Z ∞
hn (x, t) = (x − Ct) e
l1 n iEl (x−Ct )
+ L1 (x, y, t)(y − Ct)n eiEl (y−Ct ) dy,
x
Z ∞ (5.6)
hn (x, t) =
l2
L2 (x, y, t)(y − Ct) e n iEl (y−Ct )
dy,
x
where L = (L1 , L2 ) is the solution of Equation (2.1) with the kernel (5.3).
Instead of Lemma 3 we obtain the following statement:
LEMMA 12. The solution of the initial value problem (3), (5) has the representa-
tion
X X
p N−1
v(x, t) = 4iL2 (x, x, t) + 4i n (x, t) +
Xnl (x, t)hl2
l=1 n=0
+ Ynl (x, t)hl1
n (x, t) , (5.7)
where the functions Xnl (x, t), Ynl (x, t) are the solution of the system of linear alge-
braic equations
X X
p N−1
X X
p N−1
Xkl + Alr r
ks Xs + Ys = akl ,
lr r
Bks l = 1, 2, . . . , p, (5.8)
r=1 s=0 r=1 s=0
X X
p N−1
X X
p N−1
Bks Xs + Ykl +
lr r
ks Ys = bk ,
Alr r l
k = 0, 1, . . . , N − 1.
r=1 s=0 r=1 s=0
46 WERNER KIRSCH AND VLADIMIR KOTLYAROV
m=0 x
X Z
N−k−1
(l)
∞
lr
Bks (x, t) = ωkm (t) m (y, t)hs (y, t) + hm (y, t)hs (y, t) dy,
hl1 r1 l2 r2
m=0 x
X
N−k−1
(l)
akl =− ωkm (t)hl2
m (x, t), (5.9)
m=0
X
N−k−1
(l)
bkl = − ωkm (t)hl1
m (x, t).
m=0
Since estimates (5.4) and (5.5) hold, Lemmas 5–9 are true under the restriction
that x, t ∈ DN . For the Jost type function (5.6), instead of Lemma 10 we obtain
the following one.
X
k
−1/2
k (x, t) = t
hl2 hl2 m
exp(iEl ξ ) km (X, t)ξ ,
m=0
where
k! ∂ k−m
km (X, t) = (−i)
hl1 +
k−m l
B0 (X, El ) 1 ε (X, t) ,
m!(k − m)! ∂Elk−m km
" Z √ #
i 1/ 2X
ln[1 + |r(µ)|2 ] dµ
B0 (X, El ) = exp √ , X = x/2t.
2π −1/ 2X µ − El
l
The function εkm (X, t) = o(1), as t → ∞, and the functions hl2
km (X, t) are
uniformly bounded with respect to X and t.
lj ∂ k lj
hk (x, t) = (−i)k h (x, t).
∂Elk 0
SOLITON ASYMPTOTICS OF SOLUTIONS OF THE SINE-GORDON EQUATION 47
COROLLARY 6. Since
x C ξ 1 ln t
X= = + = + O ,
2t 2 2t 2|El |2 t
the asymptotic equality
lj lj ln t
hkm (X, t) = hkm (t) 1 + ON
t
holds.
X
N−k−1 X
m X
s
−1/2 (l)
ks (x, t) = t
Alr lrpq lr
ωkm (t) Hms (t)Jp+q (ξ ),
m=0 p=0 q=0
X
N−k−1
(l)
X
m X
s
lr
Bks (x, t) = ωkm (t) Glrpq lr
ms (t)Jp+q (ξ ),
m=0 p=0 q=0
where
Z ∞
Jplr (ξ ) = τ p exp[i(Er + El )τ ] dτ,
ξ
lrpq
Hms (t) = hl1
mp (t)hsq (t) + hmp (t)hsq (t),
r2 l2 r1
X X
p N−1
Xkl + Ys = 0,
lr r
Bks l = 1, 2, . . . , p,
r=1 s=0
48 WERNER KIRSCH AND VLADIMIR KOTLYAROV
X X
p N−1
Xs + Ykl = bkl ,
lr r
Bks k = 0, 1, . . . , N − 1.
r=1 s=0
p
Let us introduce the block matrix (ξ, t) = kB lr (ξ, t)klr=1 . Then the system of
these equations takes the form
X + Y = 0,
X + Y = b.
Therefore
1
Y = (I + )−1 b + (I − )−1 b .
2
Equation (5.7) leads to the representation
∂ det[I + (ξ, t)]
v(x, t) = 2i ln + o(1), ξ = x − Ct, t → ∞,
∂ξ det[I + (ξ, t)]
for the solution of the problem (3), (5). Hence Equation (7) is valid and Theorem 2
is proved.
Now let us shortly consider Theorem 3. In this case, the entries of the matrix
(x, t) have the form
X
N−k−1
kj (ξ, t) = ωkm (t)wmj (ξ, t),
m=0
X
m X
j
wmj (ξ, t) = h1mp (t)h1j q (t) + 1 + O t −1 Ip+q (ξ ),
p=0 q=0
Z ∞
In (ξ ) = τ n e−2aτ dτ,
ξ
where
−1
ωkm (t) = −ih0 0(k + m + 3/2) k!m!(4a)2(k+m+1) t k+m+3/2 [1 + ψkm (t)]
with some constant h0 , the gamma-function 0(k + m + 3/2) and an estimate for
2
k + m2
ψkm (t) = O √ .
t
The determinant 1(ξ, t) = det[I + (ξ, t)] can be written in a standard way
X
N X
1(ξ, t) = Dk (ξ, t), Dk (ξ, t) = det i1 ...ik (ξ, t).
k=1 i1 <i2 <···<ik
SOLITON ASYMPTOTICS OF SOLUTIONS OF THE SINE-GORDON EQUATION 49
If n 6 [ N+1
2
], then the determinant of the matrix kwmj (x, t)k of n + 1 order can be
represented as follows
det
wmj (x, t)
1 1
h00 0 . . . 0 I0 I1 . . . In h00 h10 . . . hn0
1 1
h 1 h 1 . . . 0 I1 I2 . . . In+1 0 h111 . . . h1n1
10
= . .. .
11
.. .. .. .. .. .. ..
. . . . . .. . .
h1 h1 . . . h1 In In+1 . . . I2n 0 0 . . . h 1
n0 n1 nn nn
Y n
h1kk det
Im+j (ξ )
= B0 (X, ia) det
Im+j (ξ )
.
2 2n+2
=
k=0
X
N
1(ξ, t) = 1 + i n Pn (X, t)t −n(n+1/2) e−2naξ 1 + δn (t) , (5.10)
n=1
where
(−h0 )n B02n (X, ia)0 (n) 01(n) N +1
Pn (X, t) = Q , n6 ,
(4a)2n2 (2a)n2 ( n−1
k=0 k!)
2 2
N +1
Pn (X, t) = O(1), n >
2
with some δn (t) = O(n2 /t). This expansion leads to Equations (10), (11) with
1 |h0 | 1 0 (n) 01(n)
αn0 = ln , (5.11)
2a [(n − 1)!]2 (2a)6n−3 0 (n−1) 01(n−1)
where 0 (n) and 01(n) are the determinants with the gamma-function entries 0(k +
j + 3/2) and 0(k + j + 1). Theorem 3 is proved.
Below we give a sketch of the proof of Theorem 4. In this case
A(ξ, t) B(ξ, t)
(ξ, t) = ,
−B(ξ, t) −A(ξ, t)
where
X
N−k−1
Akj (ξ, t) = 1
ωkm (t)wmj (ξ, t),
m=0
X
N−k−1
Bkj (ξ, t) = 2
ωkm (t)wmj (ξ, t),
m=0
50 WERNER KIRSCH AND VLADIMIR KOTLYAROV
X
m X
j
1
wmj (ξ, t) = h1mp (t)h1j q (t) 1 + O t −1 Jp+q (ξ ),
p=0 q=0
X
m X
j
2
wmj (ξ, t) = h1mp (t)h1j q (t) 1 + O t −1 Ip+q (ξ ),
p=0 q=0
Z ∞ Z ∞
Jn (ξ ) = n 2iEτ
τ e dτ, In (ξ ) = τ n e−2 Im Eτ dτ,
ξ ξ
Theorem 4 is proved.
Acknowledgements
V.K. thanks the Fakultät und Institut für Mathematik, Ruhr Universität Bochum for
kind hospitality and the DFG for financial support.
References
1. Novikov, S. P., Manakov, S. V., Pitaevsky, L. P. and Zakharov, V. E.: Theory of Solitons. The
Inverse Scattering Method, Nauka, Moscow, 1980, 319 pp. English transl. Plenum, New York,
1984.
2. Faddeev, L. D. and Takhtadjan, L. A.: Hamiltonian Methods in the Theory of Solitons, Nauka,
Moscow, 1986. English transl. Springer-Verlag, 1987.
3. Schuur, P. C.: Asymptotic analysis of soliton problems, Lect. Notes in Math. 1232, Springer-
Verlag, 1986.
4. Its, A. R.: Method of the isomonodromic deformation in the theory of completely integrable
nonlinear evolution systems, Doctoral dissertation, Leningrad University, 1986.
5. Novokshenov, V. Yu.: Asymptotics as t → ∞ of the solution of the Cauchy problem for the
nonlinear Schrödinger equation, Sov. Math. Dokl. 21(2) (1980), v. 251, 529–533.
6. Kotlyarov, V. P.: Asymptotic analysis of the Marchenko integral equation and soliton
asymptotic of a solution of the nonlinear Schrödinger equation, Physica D 87(1–4) (1995),
176–185.
7. Shabat, A. B.: On the Korteweg–de Vries equation, Sov. Math. Dokl. 14 (1973), 1266–1269.
8. Khruslov, E. Ya.: Asymptotics of the solution of the Cauchy problem for the Korteweg–de
Vries equation with the step-like initial data, Mat. Sbornik 99(2) (1976), 261–281.
9. Fokas, A. S. and Zakharov, V. E. (eds.): Important Developments in Soliton Theory, Springer-
Verlag, 1993.
10. Fokas, A. S. and Its, A. R.: An initial-boundary value problem for the sine-Gordon equation in
laboratory coordinates, Teoreticheskaya i Matematicheskaya Fizika 92(3) (1992), 387–403.
11. Fokas, A. S. and Its, A. R.: The linearization of the initial-boundary value problem of the
nonlinear Schrödinger equation, SIAM J. Math. Anal. 27(3) (1996), 738–764.
12. Kotlyarov, V. P.: Asymptotic solitons of the sine-Gordon equation, Theor. and Math. Physics
80(1) (1989), 15–28.
13. Kotlyarov, V. P.: Influence of double continuous spectrum of the Dirac operator on the nonlinear
Schrödinger equation asymptotic solitons, Matem. Zametki 49(2) (1991), 84–94. (Math. Notes
(1991) (1–2), 172–180.)
14. Reed, M. and Simon, B.: Methods of Modern Mathematical Physics. 1. Functional Analysis,
Academic Press, NY, 1972.
Mathematical Physics, Analysis and Geometry 2: 53–81, 1999.
53
© 1999 Kluwer Academic Publishers. Printed in the Netherlands.
NAKAO HAYASHI
Department of Applied Mathematics, Science University of Tokyo, 1-3, Kagurazaka, Shinjukuku,
Tokyo 162, Japan; e-mail: nhayashi@rs.kagu.sut.ac.jp
PAVEL I. NAUMKIN
Instituto de Física y Matemáticas, Universidad Michoacana AP 2-82, CP 58040, Morelia,
Michoacan, Mexico; e-mail: naumkin@ifm1.ifm.umich.mx
paper is to prove the local existence in time of small solutions to the Cauchy problem (A) in the usual
Sobolev space, and the global-in-time existence of small solutions to the Cauchy problem (A) in the
weighted Sobolev space under some conditions on the complex conjugate structure of the nonlinear
terms, namely if N (eiθ v) = eiθ N (v) for all θ ∈ R.
Key words: Davey–Stewartson system, Ishimori system, elliptic–hyperbolic case, nonlocal nonlinear
Schrödinger equation.
1. Introduction
We study the initial value problem for the two-dimensional nonlinear nonlocal
Schrödinger equations
iut + 1u = N (v), (x, y) ∈ R2 , t ∈ R,
(1.1)
u(0, x, y) = u0 (x, y), (x, y) ∈ R2 ,
54 N. HAYASHI AND P. I. NAUMKIN
where the solution u is a complex valued function, the Laplacian is 1 = ∂x2 + ∂y2 ,
and the nonlinear term N (v) = N1 (v) + N2 (v) consists of the local nonlinear part
N1 (v), which is cubic with respect to the vector v = (u, ux , uy , ū, ūx , ūy ) in the
neighborhood of the origin, and the nonlocal nonlinear part N2 (v) =R (v, ∂x−1 Kx (v))
+ (v, ∂y−1 Ky (v)), where (·, ·) denotes the inner product, ∂x−1 = −∞ dx 0 , ∂y−1 =
x
Ry 0
−∞ dy and the vectors Kx and Ky are quadratic with respect to the vector v in the
neighborhood of the origin. We assume that N1 (z) ∈ C4 (C6 ; C), N1 (z) = O(|z|3 )
as z → 0, Kx (z), Ky (z) ∈ (C4 (C6 ; C))6 , Kx (z) = O(|z|2), Ky (z) = O(|z|2 )
as z → 0. We also assume that the components Kx(2) ≡ Kx(4) ≡ 0, and Ky(3) ≡
Kx(6) ≡ 0.
We show how to translate the Davey–Stewartson and Ishimori systems to the
nonlinear nonlocal Schrödinger Equation (1.1) and we include some historical
comments and remarks about previous results for these systems. The Davey–Ste-
wartson (D-S) system is written as
iut + αuxx + uyy = a|u|2 u + buωx , (x, y) ∈ R2 , t ∈ R,
ωxx + βωyy = ∂x |u|2 , (x, y) ∈ R2 , t ∈ R, (1.2)
u(0, x, y) = u0 (x, y), (x, y) ∈ R2 ,
where α, β ∈ R, a, b ∈ C. This system was derived by Davey and Stewartson
[11], Benney and Roskes [6] and Djordjevic and Redekopp [12] for describing
the evolution of weakly nonlinear water waves that travel predominantly in one
direction, and where the wave amplitude is modulated slowly in the horizontal
direction. Independently, Ablowitz and Haberman [1] and Cornille [10] obtained
a particular form of system (1.2) as an example of a completely integrable system
generalizing the one-dimensional Schrödinger eguation. Djordjevic and Redekopp
in [12] have shown that the parameter β can become negative when the capillarity
effects are important.
In the literature on the inverse scattering transform method the Davey–Stewart-
son system (1.2) with α = 1, β = −1, a = −1, b = 2, is referred to as the
DSI system. And if α = −1, β = 1, a = 2, b = −1, or α = −1, β = 1, a =
−2, b = 1, system (1.2) is called the DSII focusing system or the DSII defocusing
system, respectively. For these cases the existence of soliton-like solutions or so-
called lump solutions were obtained in [4] and [14], and the existence of solutions
to the Cauchy problem was proved via the inverse scattering transform method (see
[2, 3, 10, 15, 33]). Using the inverse scattering transform method, Sung [33] proved
the global existence and uniqueness for the Cauchy problem to DSII systems with
any (large) initial data û0 ∈ L1 (R2 ) ∩ L∞ (R2 ) for the defocusing case, and with
small initial data in the same space for the focusing case. Moreover, he proved some
regularity results and qualitative properties; for instance, that the solution u(t, x, y)
disperses to zero in L∞ (R2 ) as t → ∞. Beals and Coifman [7] proved the global
well-posedness of the Cauchy problem for the DSII defocusing system when the
initial data are in the Schwartz class. The same result was obtained by Fokas and
Sung [15] for the DSI system. However, all the cases of the Davey–Stewartson
DAVEY–STEWARTSON AND ISHIMORI SYSTEMS 55
system (1.2) to which the inverse scattering transform method can be applied are
very exceptional. Note also that the inverse scattering transform method is rather
complicated. In comparison with other functional analytical methods it leads to
more restrictions on the initial data.
Ghidaglia and Saut [16] classified the Davey–Stewartson system (1.2) as elliptic–
elliptic, elliptic–hyperbolic, hyperbolic–elliptic and hyperbolic–hyperbolic accord-
ing to the signs of the coefficients (α, β) as follows: (+, +), (+, −), (−, +)
and (−, −) respectively. In [16] they proved some results on the local and global
existence of solutions in the usual Sobolev spaces L2 (R2 ), H1 (R2 ) and H2 (R2 ) for
the more simple elliptic–elliptic and hyperbolic–elliptic cases. Also, they obtained
a blow-up effect for the solutions of the elliptic–elliptic Davey–Stewartson system.
Some results on the asymptotic behavior in time of solutions for the elliptic–elliptic
case were found by Constantin in [9]. As far as we know, the first results on the
local existence and uniqueness of solutions to the Cauchy problem for the elliptic–
hyperbolic and hyperbolic–hyperbolic Davey–Stewartson systems with sufficiently
small initial data were obtained via the functional analytical methods by Linares
and Ponce in [30]. They used a sharp version of the smoothing effect obtained by
Kato [27] for the evolution groups {eit ∂x ∂y }∞−∞ and {e
it 1 ∞
}−∞ .
Now let us reduce the Davey–Stewartson systems in the elliptic–hyperbolic case
to Equation (1.1). Without a loss of generality we now take β = −1 in system
(1.2). Let us assume that the function ω satisfies the following radiation condition
ω(x, y, t) → 0 as x + y → −∞ and x − y → −∞ in order to solve the second
equation of system (1.2). This condition guarantees the existence of the inverse
operator (∂x2 − ∂y2 )−1 in L1 (R2 ). Thus, after a rotation in the xy-plane: x = √12 (x 0 +
y 0 ), y = √12 (x 0 − y 0 ) the elliptic–hyperbolic Davey–Stewartson system (we have
chosen α = 1) takes the form (the primes we omit):
case of a = 1, system (1.4) was studied formally in [6] and [29] by virtue of the
inverse scattering transform method.
Via the stereographic projection we translate the Ishimori system (1.4) to a
system of two scalar equations. Let us put
−1
E = E (1) , E (2) , E (3) = 1 + |u|2 u + ū, −i(u − ū), 1 − |u|2 ,
so that u = (E (1) +iE (2) )/(1+E (3)) is a single complex valued unknown function.
Thus the restriction |E| = 1 is fulfilled automatically, and the boundary condition
E → (0, 0, 1) as |x| + |y| → ∞ leads to a natural decay property of the solution
u at infinity: u → 0 as |x| + |y| → ∞. Thus we get
iut + uxx + αuyy
2ū
= (ux )2 + α(uy )2 + ia(ux ωy + uy ωx ), (x, y) ∈ R2 , t ∈ R,
1 + |u| 2 (1.5)
2 −2
ωxx + βωyy = 2ib(1 + |u| ) (ux ūy − uy ūx ),
(x, y) ∈ R , t ∈ R,
2
where Kx(3) = −Ky(2) = ã(1 + |u|2 )−2 (ux ūy − ūx uy ) and ã = 2ab is an arbitrary
real constant. Thus the elliptic–hyperbolic Ishimori system is a particular case of
DAVEY–STEWARTSON AND ISHIMORI SYSTEMS 57
Equation (1.1) with N1 = 2(1 + |u|2 )−1 ū(∇u)2 , and Kx(3) = −Ky(2) = a(1 +
|u|2 )−2 (ux ūy − ūx uy ) and when all the other components of the vectors Kx and
Ky are equal to zero.
Note that the standard Lp –Lq time-decay estimates with the classical energy
method are useful for the nonlinear Schrödinger equation; however, they are not
applicable directly to Equations (1.3) and (1.6), because of the presence of deriv-
atives and nonlocal terms in the nonlinearities. There are few papers devoted to
Equations (1.3) and (1.6). In [24] and [25] the global existence of solutions to the
Cauchy problem for these equations with small analytic initial data was proved.
The local and global existence of small solutions for Equation (1.3) were also
shown in [17] and [21] in suitable weighted Sobolev spaces and in [8] and [22]
the usual Sobolev spaces were applied for proving the local existence of small
solutions for (1.3). Later the smallness condition was removed in [19].
The purpose of this paper is to study the nonlinear nonlocal Schrödinger equa-
tion (1.1), involving the elliptic–hyperbolic Davey–Stewartson (1.3) and Ishimori
(1.6) systems in the usual weighted Sobolev spaces. Our results of Theorems 1.1,
1.2, 1.4 improve the previous results on these systems [8, 20, 22 – 24, 30]. As far
as we know, the result of Theorem 1.3 is the first for the problem of the global
existence and the asymptotic behavior for large time of small solutions of the
elliptic–hyperbolic Ishimori system. Section 4 is devoted to the proof of local-
in-time existence of small solutions to the Cauchy problem for (1.1). Let us denote
the usual Sobolev space as follows: Hm,0 = {φ ∈ L2 ; k(1 − ∂x2 − ∂y2 )m/2 φk < ∞},
here and below k·k is the norm of the Lebesgue space L2 . For simplicity we denote
the norm of the Sobolev space Hm,0 as follows: k · kHm,0 = k · km,0 . We prove the
following result.
THEOREM 1.1. Let the initial data u0 ∈ H4,0 and the norm ku0 k2,0 be sufficiently
small. Then there exists a time T > 0 such that there exists a unique solution
u ∈ C([0, T ]; H3,0 ) ∩ L∞ (0, T ; H4,0 ) of the Cauchy problem (1.1). Moreover, if
the norm ε = ku0 k4,0 of the initial data is sufficiently small then there exists a time
T > 1 and a unique solution u ∈ C([0, T ]; H3,0 ) ∩ L∞ (0, T ; H4,0 ) to the Cauchy
problem (1.1) such that kuk4,0 6 Cε for all t ∈ [0, T ].
If we assume additional constraints on the nonlinear terms we can relax the
regularity condition on the initial data.
REMARK 1.1. Theorems 1.1 and 1.2 give us the local existence in time of so-
lutions to the elliptic–hyperbolic Davey–Stewartson and Ishimori systems in H2,0
and H4,0, respectively. These results are improvements of the previous works [22]
and [20]. Note that an additional smallness condition on the initial data allows us
to choose the existence time T > 1.
Kx(3) = iµ(1) |u|2 + µ(2) uūx + µ(3) ūux + µ(4) uūy + µ(5) ūuy + iµ(6) |ux |2 +
+ iµ(7) |uy |2 + iµ(8) ux ūy + iµ(9) ūx uy + Bx(3) , and
Ky(2) = iν (1) |u|2 + ν (2) uūx + ν (3) ūux + ν (4) uūy + ν (5) ūuy + iν (6) |ux |2 +
+ iν (7) |uy |2 + iν (8) ux ūy + iν (9) ūx uy + By(2) , where
Bj(l) = O(|v|3 ), l = 1, 2, 3, j = x, y.
We assume that the coefficients of these representations satisfy the following con-
ditions
λ(1)
j , λj(2) − λj(3) , λj(4) − λj(5), λj(6) , λj(7) , λj(8) + λj(9) ∈ R,
µ(1) , µ(2) − µ(3) , µ(4) − µ(5) , µ(6) , µ(7) , µ(8) + µ(9) ∈ R,
ν (1) , ν (2) − ν (3) , ν (4) − ν (5) , ν (6) , ν (7) , ν (8) + ν (9) ∈ R, (1.7)
where j = x, y.
are zero. We get the elliptic–hyperbolic Ishimori system when ν (8) = −ν (9) =
−i ã, µ(8) = −µ(9) = −i ã, and the other coefficients λ, µ, ν are equal to zero.
Thus the conditions (1.7) are fulfilled for these examples.
We denote the norm of the weighted Sobolev space Hm,l,p by k · kHm,l,p = k · km,l,p .
p p q
For simplicity we write Hm,l = Hm,l,2 , Lj = Lp (Rj ), Lx Ly = Lp (Rx ; Lq (Ry )),
Hm,l
j = Hm,l (Rj ), and we take into account the ordering of the norms, denoting
kφkY2 X1 = k kφkX1 kY2 , where X1 and Y2 are some Banach spaces. For exam-
ple: kφkCL2 = k kφ(t)kL2 kC = supt kφ(t)kL2 , etc. We denote by U(t) the linear
Schrödinger evolution group. We prove the following result.
In the following four theorems we always assume that condition (1.9) is fulfilled
and Kj(l) = 0, j = x, y, l = 4, 5, 6.
THEOREM 1.3. We assume that the initial data u0 ∈ H4,1 and the norm ku0 k4,1 =
ε is sufficiently small. Then there exists a unique solution u ∈ C(R; L2 ) of the
Cauchy problem (1.1) such that
p
(1 + |t|)−γ kU(−t)u(t)k4,1 + 1 + |t|(k(1 − 1)u(t)kL∞ 2+
x Ly
for all t ∈ R, where γ > 0 is sufficiently small and C is some positive constant.
For the case of Theorem 1.2 we have the corresponding global existence result.
for all t ∈ R, where γ > 0 is sufficiently small and C > 0 is some constant.
If the initial data decay more rapidly at infinity we will obtain in Section 5 the
decay estimate in the uniform norm and the large time asymptotic behavior of small
solutions to the Cauchy problem (1.1).
THEOREM 1.5. We assume that the initial data u0 ∈ H4,1 ∩ H3,2 and the norm
ku0 k4,1 + ku0 k3,2 = ε is sufficiently small. Then there exists a unique solution
u ∈ C(R, L2) of the Cauchy problem (1.1) such that
for all t ∈ R, where γ > 0 is sufficiently small and C is some positive constant.
Moreover, there exists a unique final state W + ∈ L2 ∩ L∞ and the real-valued
60 N. HAYASHI AND P. I. NAUMKIN
phase function Q+ ∈ L∞ such that the solution u of the Cauchy problem (1.1) has
the following asymptotics:
(1 − 1)u(t) = t −1 W + (χ)exp(iχ 2 t + iQ+ (χ) log t) + O(t −1−δ ), (1.8)
as t → ∞ uniformly in x ∈ R2 , where χ = ( 2tx , 2ty ), δ ∈ (0, 1/4) is some constant.
The next result corresponds to the case of Theorem 1.2.
THEOREM 1.6. We assume that the local part N1 and Kj(l) , j = x, y, l = 2, 3
depend on u, ū only, and Kj(1) , j = x, y, are linear with respect to ∇u and ∇ ū. We
also assume that the initial data u0 ∈ H2,1 ∩H1,2 and the norm ku0 k2,1 +ku0 k1,2 =
ε is sufficiently small. Then there exists a unique solution u ∈ C(R, L2 ) of the
Cauchy problem (1.1) such that
(1 + |t|)−γ (ku(t)k2,1 + kU(−t)u(t)k1,2 ) + (1 + |t|)ku(t)k∞ 6 Cε
for all t ∈ R, where γ > 0 is sufficiently small and C > 0 is some constant.
Moreover, there exists a unique final state u+ ∈ L2 ∩ L∞ and the real-valued
phase function Q+ ∈ L∞ such that the solution u of the Cauchy problem (1.1) has
the asymptotics
u(t) = t −1 u+ (χ) exp(iχ 2 t + iQ+ (χ) log t) + O(t −1−δ ),
as t → ∞ uniformly in x ∈ R2 , where χ = ( 2tx , 2ty ), δ > 0 is some constant
satisfying 0 < δ < 1/4.
REMARK 1.3. Theorems 1.3 and 1.4 show the global-in-time existence of small
solutions to the elliptic–hyperbolic Ishimori and Davey–Stewartson systems, re-
spectively. For the cases of the Ishimori and Davey–Stewartson systems we have
Q+ ≡ 0, therefore Theorems 1.5 and 1.6 show the existence of the usual scattering
states for these cases.
We organize the rest of the paper as follows. First we give the notations which
we use in this paper. Then in Section 2 we prepare some preliminary estimates.
In Lemma 2.1 we estimate commutators of the fractional derivatives of order α ∈
(0, 1). In Lemma 2.2 we obtain a smoothing property of the linear part of the
Schrödinger equations (1.1). Finally in Lemma 2.3 we estimate different terms
appearing in the nonlinearity. In Section 3 we prove Theorems 1.1 and 1.2 which
state the local existence of solutions to the Cauchy problem (1.1). Section 4 is
devoted to the proof of Theorems 1.3 and 1.4. And in Section 5 we prove Theorems
1.5 and 1.6.
R xj
Notations. We denote ∂j = ∂x∂ j , ∂j−1 = −∞ dxj0 , x1 ≡ x, x2 ≡ y, ∂x−1 =
∂1−1 , ∂y−1 = ∂2−1 . Let Fj φ or φ̂ be the Fourier transform of φ(xj ), namely φ̂(ξj ) =
R −iξ x
√1
2π R
e j j φ(xj ) dxj . We denote by Fj−1 φ or φ̌ the inverse Fourier transform
DAVEY–STEWARTSON AND ISHIMORI SYSTEMS 61
R
of the function φ(ξj ), defined by the formula φ̌(xj ) = √1
2π R
eiξj xj φ(ξj ) dξj . In
what follows we also use the following relation |∂j | = Fj−1 |ξj |Fj = −Hj ∂j . The
Hilbert transformation with respect to the variable x is defined as follows:
Z
1 φ(z, y) ξx
H1 φ(x, y) ≡ Hx φ(x, y) = PV dz = −iFx−1 Fx φ,
π R x −z |ξx |
where PV means the principal value of the singular integral. The operator H2 is
defined in the same way. We widely use the fact that the Hilbert transformations
Hj are bounded operators from L2j to L2j , j = x, y. The fractional derivative
|∂x |α , α ∈ (0, 1) is equal to
Z
dz
|∂x |α φ = Fx−1 |ξx |α Fx φ = C (φ(x + z, y) − φ(x, y)) 1+α
R |z|
and similarly we have
Z
dz
|∂x | Hx φ =
α
−iFx−1 signξx |ξx |α Fx φ =C (φ(x + z, y) − φ(x, y)) ,
R z|z|α
with some constant C, see [32]. The derivatives |∂y |α and |∂y |α Hy are defined
analogously. Let J = (J1 , J2 ) = (Jx , Jy ), where Jj = xj + 2it∂j .
where 1 = ∂x2 + ∂y2 and the function f (t, x, y) is a force. Below in Sections 3
and 4 we will consider the nonlinearity instead of f (t, x, y). Smoothing effects for
solutions to the Cauchy problem (2.1) were studied by many authors (see [13] and
[28] and references therein). In our key Lemma 2.2 of this section we will obtain a
simple and explicit modification of smoothing effects of Doi [13].
In the next lemma we prove that if φ is a sufficiently smooth function, then the
commutators [|∂j |α , φ], and [|∂j |α Hj , φ], j = x, y, are continuous operators from
L2j and L2j .
LEMMA 2.1. The following inequalities
k[|∂x |α , φ(x, y)]ψ(x, y, η)kL2x Lqη Lpy 6 C kφkL∞ s + kφx kL∞ Ls kψk 2 q r
x Ly x y Lx Lη Ly
and
k[|∂x |α Hx , φ(x, y)]ψ(x, y, η)kL2x Lqη Lpy 6 C kφkL∞ s + kφx kL∞ Ls kψk 2 q r
x Ly x y Lx Lη Ly
62 N. HAYASHI AND P. I. NAUMKIN
The commutators [|∂j |α ]Hj , φ are estimated in the same way. Lemma 2.1 is
proved. 2
this definition we easily see that the operator S acts continuously from L2 to L2
with the following estimate
kS(ϕ)ψk 6 2 exp(kϕk∞ )kψk,
where kϕk∞ = kϕ1 kL∞ x
+ kϕ2 kL∞
y
.
The inverse operator X (ϕ) = (1 + itanh(ϕ1 )Hx )−1 cosh(ϕ
−1 1
1)
also exists and is
continuous
kX−1 (ϕ)ψk 6 (1 − tanh(kϕk∞ ))−1 kψk 6 exp(kϕk∞ )kψk. (2.2)
The same is true for the operator Y.
The operator S helps us to obtain a smoothing property of the Schrödinger-
type equation (2.1) by virtue of the usual energy estimates. In the next lemma
we prepare an energy estimate, involving the operator S, in which we have an
additional positive term giving us the norm of the half derivative of the unknown
DAVEY–STEWARTSON AND ISHIMORI SYSTEMS 63
Now we apply the usual energy method to (2.4) (i.e., we multiply (2.4) by S(ϕ)u
integrate over R2 and take the imaginary part of the result) to get
1d
kSuk2 + Im(Su, Mu) 6 |(Su, Ru)| + |Im(Su, Sf )|. (2.6)
2 dt
Then using the estimates of Lemma 2.1 we obtain
Im(Su, Mu)
X X
=2 Su, ωj2 S|∂j |u = 2 ωj Su, ∂j ωj SHj u − [∂j , ωj S]Hj u
j =1,2 j =1,2
64 N. HAYASHI AND P. I. NAUMKIN
X p p
= −2 ωj S |∂j |u + |∂j |, ωj S u,
j =1,2
p p X
− ωj S |∂j |u + |∂j |Hj , ωj S Hj u − 2 (ωj Su, [∂j , ωj S]Hj u)
j =1,2
X
p
p
p
>2
ωj S |∂j |u
2 −
ωj S |∂j u
|∂j |, ωj S u
+
j =1,2
p
p
p
+
|∂j |Hj , ωj S Hj u
−
|∂j |, ωj S u
|∂j |Hj , ωj S Hj u
−
− 2|(ωj Su, [∂j , ωj S]Hj u)|
X
p
>
ωj S |∂j |u
2 − Ckuk2 exp(2kϕk∞ ) kωk4 + kωk∞ kωk1,0,∞ .
∞
j =1,2
(2.7)
We have the result of the lemma from (2.5)–(2.7). 2
and
|(Su, Sφ∂y−1 (ψ∂x w)|
p
2
p
2
6 4 exp(4kϕk∞ )
kφkL2y S |∂x |u
+
kψkL2y S |∂x |w
+
+ C kuk2 + kwk2 exp(6kϕk∞ ) kφk2L∞ L2 + kφx k2L∞ L2 +
x y x y
where 1r + 1s = p1 .
Next we note that
p p p
X |∂x |Hx ψ = ψX |∂x |Hx + cosh(ϕ1 ) |∂x |Hx , ψ +
p
+ isinh(ϕ1 ) |∂x |, ψ ,
whence by Lemma 2.1 we find
p
X |∂x |Hx ψ(x, y)w(x, y, η)
2 q p
Lx Lη Ly
p
6 ψ(x, y)X |∂x |Hx w(x, y, η)
L2 Lq Lp +
x η y
+ Ce2kϕk∞
kψkL∞ r + kψx kL∞ Lr kwk 2 q s ,
x Ly x y Lx Lη Ly (2.11)
whence using (2.11) with w(x, y, η) = u(x, y), and (2.12) with g = φ, q = r =
∞, p = s = 2, we find
∗
Y Su, Xφ∂x ψu
p
6
φY∗ S |∂x |u
+ Ckuke3kϕk∞ kφk1,0,∞(1 + kϕk1,0,∞ ) ×
p
× ekϕk∞
kψkL∞ y
S |∂x |u
+ Ckuke3kϕk∞ kψk1,0,∞ (1 + kϕk1,0,∞)
p
2
p
6 2e4kϕk∞
kφkL∞ S |∂ x |u
+
kψkL∞ S |∂x |u
2
y y
+ Ckuke 4kϕk∞
(1 + kϕk1,0,∞ )×
p p
×
|∂x |, φ(x, y) |∂x |Hx ψ(x, y)u(x, η)
L2 L2 L1 +
p
x η y
+e 2kϕk∞
kφkL∞ L2 + kφx kL∞ L2
X |∂ x |Hx ψ(x, y)u(x, η)
2 2 2
x y x y L L L
x η y
4kϕk∞
p
2
p
2
6 4e kφkL2y S |∂x |u + kψkL2y S |∂x |u
+ Ckuk2 e6kϕk∞ ×
× (kφkL∞ 2 + kφx kL∞ L2 + kψkL∞ L2 + kψx kL∞ L2 )(1 + kϕk1,0,∞ ). (2.13)
x Ly x y x y x y
2
This shows the second estimate in the lemma. Via estimate (2.8) with p = q = 2
we have
Su, Sφ∂ −1 (ψ∂x w)
y
= Y∗ Su, Xφ∂y−1 (ψ∂x w)
6 Y∗ Su, Xφ∂y−1 (∂x ψw) + kY∗ Suk
Xφ∂y−1 (ψx w)
6 Y∗ Su, Xφ∂y−1 (∂x ψw) + Ce2kϕk∞ kφkL∞ 2×
x Ly
× kψkL∞ 2 + kψx kL∞ L2 kukkwk.
x Ly x y
+ Ckuke4kϕk∞ (1 + kϕk1,0,∞ )×
p p
×
|∂x |, φ(x, y) |∂x |Hx ψ(x, η)w(x, η)
L2 L1 L2 +
p
x η y
+e 2kϕk∞
kφkL2y L∞ + kφx kL2y L∞
X |∂x |Hx ψ(x, y)w(x, y)
L2 L1 .
x x x y
In the same way as in the proof of (2.13), by virtue of estimate (2.11) with p =
1, q = ∞, r = s = 2 and w(x, y, η) = w(x, y), and after that using inequality
(2.12) with g = kφkL2y and g = kψkL2y , we get
∗
Y Su, Xφ∂ −1 (∂x ψw)
y
p
6 e2kϕk∞
kφkL2y S |∂x |u
+
+ Ckuke3kϕk∞ kφkL∞ x Ly
2 + kφx kL∞ L2 (1 + kϕk1,0,∞ ) ×
p
x y
× e2kϕk∞
kψkL2y S |∂x |w
+
+ Ckwke3kϕk∞ kψkL∞ x Ly
2 + kψx kL∞ L2 (1 + kϕk1,0,∞ )
x y
p
2
p
2
6 4e4kϕk∞
kφkL2y S |∂x |u
+
kψkL2y S |∂x |w
+
+ Ce6kϕk∞ kuk2 + kwk2 kφkL∞ 2 + kφx kL∞ L2 + kψkL∞ L2 +
x Ly
x y x y
+ kψx kL∞
x Ly
2 1 + kϕk1,0,∞ .
2
(2.14)
Thus the third estimate of the lemma follows from (2.14). Lemma 2.3 is proved. 2
3. Local Existence
In this section we prove Theorems 1.1 and 1.2, which state the local-in-time exis-
tence of solutions to the Cauchy problem (1.1). The main point is the use of the
smoothing operator S and the energy type estimate of Lemma 2.2.
6 Ckwk32,0 (3.2)
if the solution is sufficiently small kwk 6 ε 6 1. Now let us consider the linearized
version of Equation (3.1)
Lw = G(ṽ, w) + R(ṽ, w̃), (x, y, t) ∈ R3 ,
(3.3)
w(0, x, y) = (1 − 1)u0 (x, y), (x, y) ∈ R2 ,
where the vector ṽ = ((1 − 1)−1 w̃, (1 − 1)−1 w̃x , (1 − 1)−1 w̃y ,
¯ (1 − 1)−1 w̃
(1 − 1)−1 w̃, ¯ x , (1 − 1)−1 w̃
¯ y ) is defined by the known function w̃,
from the ball
n
B = w̃ ∈ C1 [0, T ]; L2 (R2 ) : sup kw̃k 6 2ε, sup kw̃k2,0 6 2ρ,
t ∈[0,T ] t ∈[0,T ]
−1
−1
o
sup
∂t ∂x kw̃kL2
L∞ +
∂t ∂y kw̃kL2
L∞ 6 4ρ 2 ,
2 2
y x x y
t ∈[0,T ]
where ε = ku0 k2,0 , ρ = ku0 k4,0 . Thus the Cauchy problem (3.3) defines a map-
ping M : w = M(w̃). First let us show that there exists a time T > 0, such that the
mapping M transforms the closed ball B into itself provided that the value ε > 0 is
sufficiently small. After that we prove that there exists a time T > 0, such that M
is a contraction mapping in the norm supt ∈[0,T ] k · k under the constraint that it acts
on the subspace B, and ε > 0 is sufficiently small. By the usual energy method we
have from Equation (3.3)
Z t
kw(t)k 6 ku0 k2,0 + Cρ
2 2 2
kw(τ )k22,0 dτ. (3.4)
0
In order to obtain the estimates of the norm supt ∈[0,T ] kw(t)k2,0 we consider the
function h = 1w. We use the smoothing operators S(ϕ) = X(ϕ)Y(ϕ), intro-
duced in Section 2, where X(ϕ) = cosh(ϕ1 ) + isinh(ϕ1 )Hx , Y(ϕ) = cosh(ϕ2 ) +
isinh(ϕ2 )Hy , the components of the vector ϕ(t, x, y) = (ϕ1 (t, x), ϕ2 (t, y)) are
defined as ϕ1 (t, x) = 1ε ∂x−1 kw̃(t, x, y)k2L2 and ϕ2 (t, y) = 1ε ∂y−1 kw̃(t, x, y)k2L2 ,
y x
70 N. HAYASHI AND P. I. NAUMKIN
Sh, S(N1 )u ∂x h = Sh, S|ṽ| (N1 )ux ∂x h
x
|ṽ|
p
2
6 Cε
ω1 S |∂x |h
+ Cρ 2 (1 + ρ)khk2 .
The value |(Sh, S(N1 )uy ∂y h)| is estimated in the same way. Using the fact that
Kj(l) (ṽ) are quadratic and hence |Kj(l) (ṽ)|/|ṽ| are linear with respect to the vector
ṽ in the neighborhood of the origin, we have
(l)
|Kj (ṽ)|
√
|ṽ|
2 6 CkṽkL2y 6 εCω1
Ly
and
(l)
(l)
|Kj (ṽ)|
|Kj (ṽ)|
|ṽ|
∞ 2 +
|ṽ|
Lx Ly x L∞ 2
x Ly
The estimate of the summand |(Sh, Shy ∂x−1 Kx(3) )| is obtained in the same way
as in the above. Finally, since the functions (Ky(l) (ṽ))ux are linear with respect
√
to the vector ṽ, we have the estimates k(Ky(l) (ṽ))ux kL2y 6 CkṽkL2y 6 εCω1
and k(Ky(l) (ṽ))ux kL∞ 2 + k((Ky (ṽ))ux )x kL∞ L2 6 CkṽkL∞ L2 + kṽx kL∞ L2 6 Cρ,
x Ly
(l)
x y x y x y
whence by the third estimate of Lemma 2.3 we have
p
Sh, Sv (l) ∂ −1 K (l) (ṽ) hx 6 Cε
ω1 S |∂x |h
2 + Cρ 2 (1 + ρ)khk2 ,
y y ux
for l = 1, . . . , 6. The terms |(Sh, Sv (l)∂x−1 (Kx(l) )uy hy )| are estimated analogously.
Thus we get for the second term G:
p
2
p
2
|Im(Sh, SG(ṽ, h))| 6 Cε
ω1 S |∂x |h
+ Cε
ω2 S |∂x |h
+
+ Cρ 2 (1 + ρ)khk2 . (3.8)
if ε > 0 is sufficiently small. Thus the mapping M transforms the ball B into itself.
Let us now show that M is a contraction mapping in the norm supt ∈[0,T ] k · k. Let
ω† satisfy Equation (3.3) with the known function w̃ † ∈ B instead of w̃. Then for
the difference g = w † − w we get
Lg = G(ṽ † , g) + G(ṽ † , w) − G(ṽ, w) +
+ R(ṽ † , w̃ † ) − R(ṽ, w̃), (x, y, t) ∈ R3 , (3.10)
g(0, x, y) = 0, (x, y) ∈ R .
2
if we choose the time interval T > 0 to be sufficiently small. Thus the trans-
formation M is a contraction mapping. Therefore there exists a unique solution
u ∈ C([0, T ]; H3,0 ) ∩ L∞ (0, T ; H4,0 ) of the Cauchy problem (1.1). Theorem 1.1
is proved. 2
Proof of Theorem 1.2. Now we assume that the local nonlinear part N1 =
N1 (u, ū), Kj(l) = Kj(l) (u, ū), j = x, y, l = 2, 3, 5, 6, the functions Kj(1) and
Kj(4) depend linearly on u, ū, so that the derivatives (Kx )uy , (Kx )ūy , (Ky )ux
and (Ky )ūx depend on u, ū only. Considering a linearized version of Equation
(1.1) similar to Equation (3.3), we take the known function ũ to be fixed from
the ball ũ ∈ B = {ũ ∈ C1 ([0, T ]; L2 ) : supt ∈[0,T ] kũk 6 2ε, supt ∈[0,T ] kũk2,0 6
2ρ, supt ∈[0,T ] (k∂t ∂x−1 kũk2L2 kL∞
x
+k∂t ∂y−1 kũk2L2 kL∞
y
) 6 4ρ 2 }, where ε = ku0 k, ρ =
y x
ku0 k2,0 . Now the local part N1 does not give us any derivative loss, so we can
choose the vector ϕ(t, x, y) = (ϕ1 (t, x), ϕ2 (t, y)) in the definition of the smooth-
ing operator S to be more simple ϕ1 (t, x) = 1ε ∂x−1 kũ(t, x, y)k2L2 and ϕ2 (t, y) =
y
1 −1
∂ kũ(t, x, y)k2L2 .
ε y
All the other costructions are similar to the proof of Theo-
x
rem 1.1, so we leave them to the reader. 2
4. Global Existence
Repeating the proof of Theorem 1.1 and applying the operators Jx , Jy we can
easily obtain an additional estimate of U(−t)u(t) in the weighted Sobolev space
H4,1 . (See also the proof of Theorem 1.3 below.)
THEOREM 4.1. Let the initial data u0 ∈ H4,1 and the norm ε = ku0 k2,0 of
the initial data is sufficiently small, then there exists a time T > 0 and a unique
solution U(−t)u ∈ C([0, T ]; H3,0 ) ∩ L∞ (0, T ; H4,1 ) to the Cauchy problem (1.1).
Moreover, we can choose the existence time T > 0 so that the following es-
timate supt ∈[0,T ] ku(t)k4,0 6 2ku0 k4,0 is valid for the solution u of the Cauchy
problem (1.1).
DAVEY–STEWARTSON AND ISHIMORI SYSTEMS 73
Proof of Theorem 1.3. Let us prove the following a-priori estimates for the
function w = (1 − 1)u
(1 + t)−γ
√ kwk2,0 < 2ε, (4.1)
kwk + 1 + t kwkL∞ L
x y
2 + kwx kL∞ L2 + kwkL∞ L2 + kwy kL∞ L2 < 100ε
x y y x y x
for all t > 0, where γ > 0 is small. We show (4.1) by arguments of the contra-
diction. We assume that there exists a time T > 0 such that estimates (4.1) are
violated in the interval [0, T ]. By Theorem 4.1 we have the following estimates
(1 + t)−γ
√ kwk2,0 6 4ε, (4.2)
kwk + 1 + t kwkL∞ 2 + kwx kL∞ L2 + kwkL∞ L2 + kwy kL∞ L2 6 100ε
x Ly x y y x y x
for all t ∈ [0, T ]. To estimate the norms kwk2,0 and kU(−t)wk2,1 we multiply
Equation (3.1) by the operators A = 1−1, and A = (1−1)Jl , where l = x, y and
then we use the same smoothing operator S(ϕ) as in Section 3. Applying Lemma
2.2 we obtain the estimates for the functions h = Aw
d
p
2
p
2
kShk2 +
ω1 S |∂x |h
+
ω2 S |∂y |h
dt
6 2|Im(Sh, SAG(v, w))| + 2|Im(Sh, SAR(v, w))|+
+ Ce2kϕk∞ kωk4∞ + kωk1,0,∞kωk∞ + kϕt k∞ khk2 . (4.3)
√
Now by virtue of (4.2) we get the estimates kωk1,0,∞ 6 C ε(1 + t)−1/2 , kϕk∞ 6
1 and kϕt k∞ 6 Cε(1 + t)−1 . Since kφx k2L4 L2 6 kφkL∞ 2 kφk2,0 in view of (4.2)
x Ly
x y
we also have the estimate
|Im(Sh, SAR(v, ω))| 6 Cε 3 (1 + t)−1+2γ + Cε(1 + t)−1 khk2 . (4.4)
Using the identities φ̄ψx = 2it 1
(φ̄Jx ψ − (Jx φ)ψ) − φ̄x ψ and φψx = 2it 1
(φJx ψ −
(Jx φ)ψ) + φx ψ with φ = v (l) , ψ = wx we can estimate the terms of the
form, for example, v̄ (l) wxx Jx v (k) in the operator G(v, w), so we obtain the estimate
kAG(v, w) − G(v, Aw)k 6 Cε 2 (1 + t)−1+γ + Cε(1 + t)−1 khk. Thus we get
|Im(Sh, SAG(v, w))|
6 |Im(Sh, SG(v, h))| + Cε 3 (1 + t)−1+2γ + Cε(1 + t)−1 khk2 . (4.5)
Then, as in Section 3, we obtain via (4.2)
p
2
p
2
|Im(Sh, SG(v, h))| 6 Cε
ω1 S |∂x kh
+ Cε
ω2 S |∂y kh
+
+ Cε 3 (1 + t)−1+2γ + Cε(1 + t)−1 khk2 . (4.6)
Substitution of (4.4)–(4.6) into (4.3) yields
d
p
2
p
2
kShk2 + (1 − Cε)
ω1 S |∂x kh
+ (1 − Cε)
ω2 S |∂y kh
dt
6 Cε(1 + t)−1 kShk2 + Cε 3 (1 + t)−1+2γ . (4.7)
74 N. HAYASHI AND P. I. NAUMKIN
Integration of (4.7) gives us the desired estimate kwk2,0 < 2ε(1 + t)γ and also
kU(−t)wk2,1 < Cε(1 + t)γ for all t ∈ [0, T ].
In order to estimate the norms kwk, kwkL∞ 2 , kwx kL∞ L2 , kwkL∞ L2 and
x Ly x y y x
1 iτ ξ 2 +iτ η2
kwy kL∞ 2 we change a dependent variable u(t, x, y) =
y Lx τ
e U (τ, ξ, η),
1 iτ ξ 2 +iτ η2
where ξ = 2τx , η = 2τy , τ = 1 + t, whence ux (t, x, y) = τ e Iξ U (τ, ξ, η),
and uy (t, x, y) = τ1 eiτ ξ +iτ η Iη U (τ, ξ, η)
2 2
with operators Iξ = iξ + 2τ1 ∂ξ , and
Iη = iη + 2τ1 ∂η . Taking into account the complex-conjugate structure of the
nonlinearity in Equation (1.1) we obtain
V V
1 −1
iU + 1U + τ N + τ V , ∂ K +
τ 2 1 ξ ξ
4τ V τ τ
(4.8)
+ τ V , ∂η−1 Kη = 0, (ξ, η) ∈ R2 , τ > 1,
τ
U (1, ξ, η) = u0 (ξ, η)e−iξ −iη ,
2 2
(ξ, η) ∈ R2 ,
where the vectors V = (U, Iξ U, Iη U, U , Iξ U , Iη U ). Note also that
i iτ ξ 2 +iτ η2
(Jx + 2i∂x )u(t, x, y) = (x + 2iτ ∂x )u(t, x, y) = e Uξ (τ, ξ, η)
τ
and similarly we have (Jy + 2i∂y )u(t, x, y) = τi eiτ ξ +iτ η Uη (τ, ξ, η).
2 2
kwx kL∞ 2 = √ kIξ W kL∞ L2 . Let us obtain the main term in the nonlinearity of
x Ly
1
τ ξ η
Equation (4.3) which decays slowly with time. Using the identity
1 1
ϕ̄Iξ ψ = (ϕ̄Iξ ψ − ϕIξ ϕ) + (ϕ̄ψ)ξ , (4.9)
2 4τ
where we take ϕ = ψ = U , we write the following representation iλj(2) U Iξ U =
i (2)
λ ((U Iξ U −U Iξ U )+ 2τ1 (|U |2 )ξ ) = −λ(2)
2 j
i (2)
j Im(U Iξ U )+ 4τ λj (|U | )ξ . The term
2
Using the identity ψIη φ = iηφψ + 2τ1 ψφη we get Iη U ∂ξ−1 Kξ(3) = U ∂ξ−1 iηKξ(3) +
1
U ∂ −1 Kξ(3). By virtue of (4.9), (4.10) as above we will extract the real part in
2τ η ξ
each term of iηKξ(3) . We have −µ(1) η|U |2 = −µ(1) Im(U Iη U ) − 2τi µ(1) (|U |2 )η ,
then we get iµ(2) ηU Iξ U = µ(2) Re(U Iη Iξ U )− 4τ1 µ(2) (U Iξ U )η − 4τ1 µ(2) (U Iη U )ξ .
The term iµ(3) ηU Iξ U has a similar representation. Then we obtain iµ(4) ηU Iη U =
µ(4) |Iη U |2 − 2τ1 µ(4) Uη Iη U , and the summand iµ(5) ηU Iη U is considered analo-
gously. Furthermore we get −µ(6) η|Iξ U |2 = −µ(6) Im(Iξ U Iη Iξ U )− 2τ1 µ(6) |Iξ U |η ,
and −µ(8) ηIη U Iξ U = −µ(8) Im(Iη U Iη Iξ U )+ 4τi µ(8) (|Iη U |)ξ − 2τi µ(8) Iη U Iξ Uη .
The values −µ(7) η|Iη U |2 and −µ(9) ηIξ U Iη U are considered in the same manner.
Therefore we obtain
(Iη U )∂ξ−1 Kξ(3) = U ∂ξ−1 −µ(1) Im(U Iη U ) + µ(2) + µ(3) Re(U Iη Iξ U )+
+ µ(4) + µ(5) |Iη U |2 − µ(6) Im(Iξ U Iη Iξ U )−
− µ(7) Im Iη U Iη2 U − µ(8) − µ(9) Im(Iη U Iη Iξ U )−
1
− 2iµ(1) (|U |2 )η − µ(2) U Iξ U η − µ(2) (U Iη U )ξ − µ(3) (U Iξ U )η −
4τ
− µ(3) (U Iη U )ξ − 2µ(4) Uη Iη U − 2µ(5) U η Iη U − 2µ(6) |Iξ U |2 η −
− 2µ(7) |Iη U |2 η + i µ(8) − µ(9) (|Iη U |)ξ − 2iµ(8) Iη U Iξ Uη +
1
+ 2iµ(9) Iξ U Iη Uη + U ∂ξ−1 iηBξ(3) + Uη ∂ξ−1 Kξ(3) . (4.12)
2τ
The summand (Iξ U )∂η−1 Kη(2) can be represented in the same manner. Thus via
(4.11) and (4.12) we get for the nonlinearity τ N ( Vτ ) = τ1 QU + P (U ) + τ N1 ( Vτ ),
where the coefficient at the main term is the real valued function
X (3) (5)
Q = ∂j−1 λ(1) (2) (4)
j |U | − λj − λj Im(U Iξ U ) − λj − λj Im(U Iη U ) +
2
j =x,y
(9)
+ λ(6) (7) (8)
j |Iξ U | + λj |Iη U | + λj + λj Re(Iξ U Iη U ) +
2 2
+ ∂ξ−1 − µ(1) Im(U Iη U ) + µ(2) + µ(3) Re(U Iη Iξ U ) +
+ µ(4) + µ(5) |Iη U |2 − µ(6) Im(Iξ U Iη Iξ U ) − µ(7) Im(Iη U Iη2 U ) −
− µ(8) − µ(9) Im(Iη U Iη Iξ U ) + ∂η−1 − ν (1) Im(U Iξ U ) +
+ ν (2) + ν (3) |Iξ U |2 + ν (4) + ν (5) Re(U Iη Iξ U ) − ν (6) Im Iξ U Iξ2 U −
− ν (7) Im(Iη U Iη Iξ U ) − ν (8) − ν (9) Im(Iξ U Iη Iξ U )
76 N. HAYASHI AND P. I. NAUMKIN
larly k(Yξ (τ ) − 1)φk 6 τ −α (kφk + kφξ k) are valid, where α ∈ [0, 12 ). Multiplying
(4.13) by Yξ (−τ ) and Yη (−τ ) we obtain
1
L̃η Yξ (−τ )Wj + Q(Yξ (−τ )Wj ) + A2j +1 = 0,
τ
1
L̃ξ Yη (−τ )W2j + Q(Yη (−τ )W2j ) + A2j +2 = 0, j = 0, 1, (4.14)
τ
where L̃j = i∂τ + 1 2
∂ ,
4τ 2 j
j = ξ, η, the remainder terms are
j V
A2j +1 = Yξ (−τ )Rj + (Yξ (−τ ) − 1) 1 − Iξ2 − Iη2 τ Iξ N −
τ
1
− Q(Yξ (−τ ) − 1)Wj ,
τ
j = 0, 1, and similarly
j V
A2j +2 = Yη (−τ )Rj + (Yη (−τ ) − 1) 1 − Iξ2 − Iη2 τ Iη N −
τ
1
− Q(Yτ (−τ ) − 1)W2j ,
τ
3 3γ −3/2
j = 0, 1. Via (4.2) we have kA2j +1 kL∞ 2 +kA2j +2 kL∞ L2 6 Cε τ
ξ Lη
, j = 0, 1.
η ξ
√
Integrating (4.14) with respect to time τ and using the representations τ kwkL∞ 2
x Ly
= kW kL∞ 2
ξ Lη
6 kY ξ (−τ )W k ∞
Lξ Lη2 + k(1 − Y ξ (−τ ))W k ∞ 2
Lξ Lη 6 kW (1)k ∞ 2
Lξ Lη +
78 N. HAYASHI AND P. I. NAUMKIN
10ετ −δ , δ = 14 −3γ > 0 by virtue of (4.2) we obtain the inequality supt ∈[0,T ] (kw(t)
kL∞ 2 + kwx (t)kL∞ L2 + kw(t)kL∞ L2 + kwy (t)kL∞ L2 ) < 100ε. The contradiction
x Ly x y y x y x
obtained proves estimate (4.1) for all t > 0. Now a standard continuation argument
yields the result of Theorem 1.3. 2
The proof of Theorem 1.4 is simpler than that of Theorem 1.3, so we leave it to
the reader.
Proof of Theorem 1.5. Now we assume that the initial data decay more rapidly at
infinity u0 ∈ H4,1 ∩ H3,2 . So in the same way as in the proof of Theorem 1.3 we
can obtain the following estimate
for all 1 < s < τ , where p = 2, ∞ and δ = 14 − 3γ > 0. Therefore there exist
unique limits W + ∈ L∞ ∩ L2 such that limt →∞ g(t) = W + in L∞ ∩ L2 . Hence
DAVEY–STEWARTSON AND ISHIMORI SYSTEMS 79
(1 − 1)u(t, x, y)
1
= eiτ ξ +iτ η W
2 2
τ
1 1
= eiτ ξ +iτ η Y(−τ )W + eiτ ξ +iτ η (1 − Y(−τ ))W
2 2 2 2
τ τ Z τ
1 iτ ξ 2 +iτ η2 + x y x y ds
= e W τ, , exp i Q s, , + O t −1−δ
τ 2τ 2τ 1 2τ 2τ s
(5.5)
Theorem 1.6 is proved in the same way as in the proof of Theorem 1.5, and so
we omit it.
References
1. Ablowitz, J. M. and Haberman, R.: Nonlinear evolution equations in two and three dimensions,
Phys. Rev. Lett. 35 (1975), 1185–1188.
2. Ablowitz, J. M. and Fokas, A. S.: On the inverse scattering transform of multidimensional
nonlinear equations, J. Math. Phys. 25 (1984), 2494–2505.
3. Ablowitz, J. M. and Segur, A.: Solitons and Inverse Scattering Transform, PA, SIAM, 1981.
4. Anker, D. and Freeman, N. C.: On the soliton solutions of the Davey–Stewartson equation for
long waves, Proc. R. Soc. A 360 (1978), 529–540.
5. Bardos, C., Sulem, C. and Sulem, P. L.: On the continuous limit for a system of classical spins,
Comm. Math. Phys. 107 (1986), 431–454.
6. Benney, D. J. and Roskes, G. L.: Wave instabilities, Stud. Appl. Math. 48 (1969), 377–385.
7. Beals, R. and Coifman, R. R.: The spectral problem for the Davey–Stewartson and Ishimori
hierarchies, Proc. Conf. on Nonlinear Evolution Equations: Integrability and Spectral Methods,
Manchester, U.K., 1988.
80 N. HAYASHI AND P. I. NAUMKIN
8. Chihara, H.: The initial value problem for the elliptic–hyperbolic Davey–Stewartson equation,
Preprint, 1995.
9. Constantin, P.: Decay estimates of Schrödinger equations, Comm. Math. Phys. 127 (1990),
101–108.
10. Cornille, H.: Solutions of the generalized nonlinear Schrödinger equation in two spatial
dimensions, J. Math. Phys. 20 (1979), 199–209.
11. Davey, A. and Stewartson, K.: On three-dimensional packets of surface waves, Proc. R. Soc. A
338 (1974), 101–110.
12. Djordjevic, V. D. and Redekopp, L. G.: On two-dimensional packets of capillary-gravity waves,
J. Fluid Mech. 79 (1977), 703–714.
13. Doi, S.: On the Cauchy problem Schrödinger type equations and regularity of solutions, J.
Math. Kyoto Univ. 34 (1994), 319–328.
14. Fokas, A. S. and Santini, P. M.: Recursion operators and bi-Hamiltonian structures in
multidimensions. I, II, Comm. Math. Phys. 115 (1988), 375–419, 449–474.
15. Fokas, A. S. and Sung, L. Y.: On the solvability of the N-wave, Davey–Stewartson and
Kadomtsev–Petviashvili equations, Inverse Problems 8 (1992), 673–708.
16. Ghidaglia, J. M. and Saut, J. C.: On the initial value problem for the Davey–Stewartson systems,
Nonlinearity 3 (1990), 475–506.
17. Hayashi, N.: Local existence in time of small solutions to the Davey–Stewartson system,
Annales de l’I.H.P. Physique Theorique 65 (1996), 313–366.
18. Hayashi, N.: On the initial value problem for the Davey–Stewartson and the Ishimori sys-
tems, in The Proceedings of the 4th MSJ International Reserch Institute on “Nonlinear
Waves”, Sapporo, July 10–21, 1995, GAKUTO International Series, Mathematical Sciences
and Applications, Gakkotosho, vol. 10, 1997, pp. 145–154.
19. Hayashi, N.: Local existence in time of solutions to the elliptic–hyperbolic Davey–Stewartson
system without smallness condition on the data, J. Analysé Mathématique 73 (1997), 133–164.
20. Hayashi, N.: Local existence in time of small solutions to the Ishimori system, Diff. Integral
Eqs., to appear.
21. Hayashi, N. and Hirata, H.: Global existence and asymptotic behavior in time of small solutions
to the elliptic–hyperbolic Davey–Stewartson system, Nonlinearity 9 (1996), 1387–1409.
22. Hayashi, N. and Hirata, H.: Local existence in time of small solutions to the elliptic–
hyperbolic Davey–Stewartson system in the usual Sobolev space, Proceedings of the Edinburgh
Mathematical Society 40 (1997), 563–581.
23. Hayashi, N. and Naumkin, P. I.: Asymptotic behavior in time of solutions to the derivative non-
linear Schrödinger equation revisited, Discrete and Continuous Dynamical Systems 3 (1997),
383–400.
24. Hayashi, N. and Saut, J. C.: Global existence of small solutions to the Davey–Stewartson and
the Ishimori systems, Diff. Integral Eqs. 8 (1995), 1657–1675.
25. Hayashi, N. and Saut, J. C.: Global existence of small solutions to the Ishimori system without
exponential decay of the data, Diff. Integral Eqs. 9 (1996), 1183–1195.
26. Ishimori, Y.: Multi vortex solutions of a two dimensional nonlinear wave equation, Progr.
Theor. Phys. 72 (1984), 33–37.
27. Kato, T.: On the Cauchy problem for the (generalized) Korteweg–de Vries equation, Advances
in Mathematics Supplementary Studies, Studies in Appl. Math. 8 (1983), 93–128.
28. Kenig, C. E., Ponce, G. and Vega, L.: Well-posedness and scattering results for the generalized
Korteweg–de Vries equation via contraction principle, Comm. Pure Appl. Math. 46 (1993),
527–620.
29. Konopelchenko, B. G. and Matkarimov, B. T.: Inverse spectral transform for the nonlinear
evolution equation generating the Davey–Stewartson and Ishimori equations, Stud. Appl. Math.
82 (1990), 319–359.
DAVEY–STEWARTSON AND ISHIMORI SYSTEMS 81
30. Linares, F. and Ponce, G.: On the Davey–Stewartson systems, Ann. Inst. Henri Poincaré, Anal.
non linéaire 10 (1993), 523–548.
31. Soyeur, A.: The Cauchy problem for the Ishimori equations, J. Funct. Anal. 105 (1992), 233–
255.
32. Stein, E. M.: Singular Integral and Differentiability Properties of Functions, Princeton Univ.
Press, Princeton Math. Series 30, 1970.
33. Sung, L. Y.: An inverse-scattering transform for the Davey–Stewartson II equations, Part III,
J. Math. Anal. Appl. 183 (1994), 477–494.
34. Sung, L. Y.: The Cauchy problem for the Ishimori equation, Preprint, 1993.
Mathematical Physics, Analysis and Geometry 2: 83–106, 1999.
83
© 1999 Kluwer Academic Publishers. Printed in the Netherlands.
P. BUSCH
Department of Mathematics, University of Hull, HU6 7RX Hull, U.K.
e-mail: p.busch@maths.hull.ac.uk
Mathematics Subject Classifications (1991): Primary: 47D45; Secondary: 47D20, 81Q99, 81R99.
Key words: Hilbert space, trace class, state operator, stochastic map, isometry, quantum mechanics,
reversibility.
1. Introduction
The mathematical modeling of a quantum dynamical system is based on the dual
concepts of states and observables. Of particular importance for the representation
of dynamics and symmetries are structure-preserving bijective maps of the sets
of states and observables, respectively. Such maps, referred to as affine automor-
phisms of states and Jordan automorphisms of observables, respectively, have been
quite thoroughly studied in the Hilbert space model ([1], for a recent systematic
account, see [4]) and in the C ∗ -algebraic formulation of quantum theory [7]. These
automorphisms are special instances of the larger classes of linear isometries act-
ing on the state space and the algebra of observables, respectively, which are of
interest in their own right. While isometries of operator algebras were analyzed by
Kadison in 1951 [8], applications of isometric transformations of states have been
considered only in recent years. The characterization of the class of isometric state
transformations in the Hilbert space model of quantum mechanics is the subject of
the present paper.
The paper is organized as follows. In Section 2 the notion of a linear state trans-
formation – also called stochastic map – in the Hilbert space model of quantum
mechanics is presented and some of its basic properties are reviewed. An isometric
map among the stochastic maps is distinguished by a feature that turns out to be
fundamental for the subsequent considerations: it sends orthogonal pairs of states to
orthogonal pairs (Proposition 2.1). In Section 3 the main result (Theorem 1) con-
cerning isometric stochastic maps is stated: any stochastic isometry decomposes
84 P. BUSCH
2. Stochastic Maps
Let H and H̃ be complex separable Hilbert spaces with inner products h· | ·i, taken
to be linear in the second argument and conjugate linear in the first. Let B∞ (H)
be the complex Banach space of linear operators of trace class on H, with trace
tr[·] and trace norm k · k1 . Its dual space can be identified with the Banach space
B(H) of bounded linear operators on H. The self-adjoint parts of B∞ (H) and
B(H), V := B1 (H)s and W := B(H)s , are real Banach spaces. By V + we
denote the set of positive linear operators in V , the convex positive cone of V .
With reference to the quantum physical application, V is called state space, the
elements of the subset S := V + ∩ {ρ ∈ V : tr[ρ] = 1} are called states. The
set S is convex, its extreme points are given by the orthogonal projections of rank
1. These are called pure states and denoted Pϕ , ϕ ∈ H, ϕ 6= 0. For H̃, we
denote the corresponding entities as Ṽ , W̃ , S̃, etc. A linear map T : V → Ṽ is
called a stochastic map (on V ) if it is positive [T (V + ) ⊂ Ṽ + ] and trace preserving
[tr ◦ T = tr], or equivalently, if it sends states to states [T (S) ⊆ S̃]. A stochastic
map that is isometric [kT (ρ)k1 = kρk1 , ρ ∈ V ] will be called stochastic isometry.
The term operator will be used throughout to denote a linear operator. By 0 and
I (or IH ) we will denote the zero and identity operators, respectively. According
to the spectral theorem, any self-adjoint operator a admits a decomposition into
positive and negative parts, denoted a = a+ − a− , where the ranges of a+ , a−
are mutually orthogonal subspaces. For a bounded operator a we define |a| :=
(a ∗ a)1/2 . If a is self-adjoint, one has |a| = a+ + a− . For ρ ∈ V we have kρk1 =
tr[ρ+ ] + tr[ρ− ] = kρ+ + ρ− k1 , and so
ρ ∈ V + ⇐⇒ kρk1 = tr[ρ]. (2.1)
STOCHASTIC ISOMETRIES IN QUANTUM MECHANICS 85
We recall some basic facts about stochastic maps and stochastic isometries.
A linear map T : V → Ṽ is called contractive [or a contraction], if for all
ρ ∈ V , kT (ρ)k1 6 kρk1 .
By the spectral theorem, ρ ⊥ σ exactly when ρ and σ are the positive and
negative parts of ρ − σ , respectively. Hence we have:
ρ ⊥ σ ⇐⇒ kρ − σ k1 = kρ + σ k1 , ρ, σ ∈ V + \{0}. (2.2)
3. Stochastic Isometries
In this section the main results concerning stochastic isometries will be stated. The
proofs will be developed in Sections 4 and 5.
A stochastic map is called pure if it sends pure states to pure states. An iso-
metric linear or antilinear map U : H → H̃ will be called unitary or antiunitary,
respectively. We first present the known case of surjective stochastic isometries.
X
N X
N
kT (ρ)k1 = wk kTk (ρ)k1 = wk kρk1 = kρk1 .
k=1 k=1
Pn+m Pn+m
Convergence in trace norm follows from k k=n wk Uk ρUk∗ k1 6 k=n wk kρk1
and the fact that Ṽ is complete. 2
X
N
T (ρ) = wν Tν (ρ), ρ ∈ V. (3.3)
ν=1
The sum converges in trace norm when N = ∞. The ranges of the Tν can be rep-
resented as subspaces of B1 (H̃ν )s , where the H̃ν are mutually orthogonal closed
subspaces of H̃.
⊥
Q 6 I , or P 6 I − Q =: W Q . The supremum of a family of projections
P , α ∈ A, will be denoted α∈A P (α) . Note that the projections 5ϕ introduced
(α)
This gives Pϕk ·T ∗ (5(T (ρ))) = 0 and therefore one has 5(ρ)⊥ ·T ∗ (5(T (ρ))) = 0.
Since T ∗ (5(T (ρ))) 6 I , it follows that T ∗ (5(T (ρ))) 6 5(ρ). Thus, T ∗ (5(T (ρ)))
= 5(ρ).
Conversely, assume (ii) holds. We show that the stochastic map T is orthogo-
nality preserving and thus, by Proposition 2.1, an isometry. Let ρ, σ ∈ S, ρ ⊥ σ .
Then hT (ρ), 5(T (σ ))i = hρ, T ∗ (5(T (σ )))i = hρ, 5(σ )i = 0. This implies
T (ρ) ⊥ T (σ ).
Property (iii) is entailed by (ii). The converse implication is a fairly straightfor-
ward consequence of the fact that T ∗ is normal. 2
LEMMA 4.2. Let T : V → Ṽ be a stochastic isometry and not pure. There exists
a strictly decreasing sequence w̃ν , 0 <Pw̃ν < 1, ν = 1, 2 . . . , N 6 ∞, and a
sequence of numbers mν ∈ N such that ν w̃ν mν = 1 and
X
N
T (Pϕ ) = w̃ν 5νϕ (4.1)
ν=1
for all ϕ ∈ H\{0}. Here the 5νϕ are projections with rank mν . Furthermore,
µ
5νϕ ⊥ 5ψ for all ϕ, ψ ∈ H\{0} if ν 6= µ.
Proof. For ϕ, ψ 6= 0, write the spectral decompositions of T (Pϕ ), T (Pψ ) as
X
N X
M
T (Pϕ ) = aν Pϕν , T (Pψ ) = bµ Pψµ , (4.2)
ν=1 µ=1
STOCHASTIC ISOMETRIES IN QUANTUM MECHANICS 89
T ∗ (Pϕν ) = aν Pϕ . (4.3)
Similarly,
T ∗ (Pψµ ) = bµ Pψ . (4.4)
Case 1: 0 =6 |hϕ | ψi|(6= 1). Take unit vectors ϕ 0 , ψ 0 in the span of ϕ, ψ such that
hϕ | ϕ i = hψ | ψ 0 i = 0. Then Pϕ + Pϕ0 = Pψ + Pψ 0 and therefore T (Pϕ ) + T
0
and so
2 2
T (Pψ ) T (Pϕ ) = T (Pψ ) T (Pϕ ),
On multiplication of Equation (4.5) with Pψµ from the left and with Pϕν from the
right, we obtain
The orthonormal systems of eigenvectors shall now be denoted ϕν,i , ψµ,j , where
i ∈ {1, . . . , nν }, j ∈ {1, . . . , mµ }. Now observe that due to (4.8), (4.9),
hT (Pψ ), Pϕν,i i = hT (Pψ ), T ∗ (Pϕν,i )i = aν hPψ , Pϕ i,
which implies that for each ν there must be µ, j such that hPψµ,j , Pϕν,i i 6= 0.
Therefore, by (4.7), aν = bµ . A similar reasoning entails that for each µ there
must be ν such that aν = bµ . Since the two sequences of eigenvalues are strictly
decreasing, they must be identical: aν = bν , for all values of ν. It remains to be
shown that the multiplicities coincide as well. Equation (4.11) can be written as
X
N
(nν − mν )aνn = 0, for all n ∈ N. (4.12)
ν=1
that is,
N
X aκ n xκ
= −1.
κ=2
a1 x1
But then
X
N aκ n xκ X aκ n xκ
N
1= 6 =: S(n) (< ∞).
aκ=2
x 1 a 1 x κ=2 1 1
aκ n+1 xκ a2 n
XN
S(n + 1) = 6 S(1).
κ=2
a1 x1 a1
STOCHASTIC ISOMETRIES IN QUANTUM MECHANICS 91
Proof of Proposition 3.3. For the w̃ν , mν given in Lemma 4.2, put wν = w̃ν mν ,
and define maps Tν via
1 ν
Tν (Pϕ ) := 5 = wν−1 Pν T (Pϕ )Pν ,
mν ϕ
with the projections Pν , ν = 1, 2, . . . , N, given in Lemma 4.3. Then (4.1) realizes
(3.3), and in view of (4.15), the maps Tν are mixing stochastic maps on V with
ranges in B1 (H̃ν )s , H̃ν = Pν H̃.
Re{ahϕj | 5ψ ϕk i} = 0.
Choosing for a the values a = 1, i, one concludes that hϕj | 5ψ ϕk i = 0. Hence (5.4)
follows, and so (5.3) is verified. 2
5ϕ 5ϑ 5ψ 5ϕ = a5(L)
ϕ + ā5ϕ ,
(A)
a = hϕ | ϑihϑ | ψihψ | ϕi. (5.9)
The proof will be based on the study of a number of special cases and on
exploiting the linearity of T . We note two trivial special cases: if any single pair
among the three vectors are mutually orthogonal then the left-hand side of Equation
(5.9) is identically 0; and if any two of these vectors are linearly dependent then
(5.9) reduces to (5.3).
94 P. BUSCH
LEMMA 5.2. Let T be an m-mixing stochastic isometry. For any pair of mutually
orthogonal unit vectors ϕ1 , ϕ2 ∈ H the following holds:
1 + i (L) 1 + i (A)
5ϕ1 5ϕ1 +iϕ2 5ϕ1 +ϕ2 5ϕ1 = 5ϕ1 + 5ϕ1 . (5.10)
4 4
Proof. We note first that at this stage we do not claim the independence of the
projections 5(L) (A)
ϕ1 , 5ϕ1 of the choice of ϕ2 . This will be established in a later step.
We will frequently use Equations (5.2) and (5.3) without explicit mentioning.
Let {ϕ1k | k = 1, . . . , m} be an arbitrary orthonormal basis (ONB) of the subspace
5ϕ1 H. Then, by virtue of Equation (5.3), for α ∈ C, |α| = 1, the system of vectors
ϕ2k (α) = 25ϕ2 5ϕ1 +αϕ2 ϕ1k , k = 1, . . . , m, form an ONB of 5ϕ2 H. Furthermore,
5ϕ1 +αϕ2 (ϕ1k + ϕ2k (α)) = 5ϕ1 +αϕ2 (ϕ1k ) + 25ϕ1 +αϕ2 5ϕ2 5ϕ1 +αϕ2 (ϕ1k )
= 25ϕ1 +αϕ2 (ϕ1k )
= 2(5ϕ1 + 5ϕ2 )5ϕ1 +αϕ2 (ϕ1k )
= ϕ1k + ϕ2k (α). (5.11)
In the third line we have used the fact that 5ϕ1 +αϕ2 ⊥5ϕ1 −αϕ2 (since T is orthogo-
nality preserving) and that, by virtue of the linearity of T ,
and therefore
Combining this with (5.14), it follows that uk ∈ {+i, −i} for all k. Denote the
spectral projections of the partial isometry U associated with the eigenvalues i, −i
as 5(L) (A)
ϕ1 , 5ϕ1 , respectively. Then the spectral decomposition of U is
U = i 5(L)ϕ1 − 5ϕ1 , 5ϕ1 + 5ϕ1 = 5ϕ1 .
(A) (L) (A)
(5.15)
5ϕ1 5ϕ1 +iϕ2 5ϕ1 +ϕ2 5ϕ1 = 5ϕ1 5ϕ1 +iϕ2 (5ϕ1 + 5ϕ2 )5ϕ1 +ϕ2 5ϕ1
1
= (5ϕ1 + U ),
4
where by virtue of (5.15) the last expression equals the right-hand side of (5.10). 2
1 + i (L) 1 − i (A)
5ϕ1 5ϕ1 +iϕ3 5ϕ1 +ϕ3 5ϕ1 = eϕ +
5 eϕ .
5 (5.19)
1 1
4 4
5ϕ1 5ϕ1 +αϕ3 5ϕ1 +βϕ2 5ϕ1 = 5ϕ1 5ϕ1 +αϕ3 (5ϕ1 + 5ϕ2 + 5ϕ3 )5ϕ1 +βϕ2 5ϕ1
1
= 5ϕ1 5ϕ1 +αϕ3 5ϕ1 5ϕ1 +βϕ2 5ϕ1 = 5ϕ1 .
4
Here we have used the orthogonalities: 5ϕ1 +αϕ3 5ϕ2 = 5ϕ3 5ϕ1 +βϕ2 = 0. 2
96 P. BUSCH
Approaching the proof of Proposition 5.2, we observe that any triple of pairwise
independent unit vectors ϕ, ψ, ϑ spans a subspace [ϕ, ψ, ϑ] of H of dimension 2
or 3. We assume dim H > 3. As will be made evident, the case dim H = 2
can be dealt with by restriction of the constructions to be carried out for the case
dim H > 2. Let ϕ1 , ϕ2 , ϕ3 be three mutually orthogonal unit vectors such that
[ϕ, ψ, ϑ] ⊆ [ϕ1 , ϕ2 , ϕ3 ] =: K. Then the nine operators Aϕk ϕ` (cf. (5.7)) form a
basis of the space of operators on K. By polarization, an alternative basis is given
by the operators
Proof of Proposition 5.2. Step 1. Excluding the trivial cases mentioned imme-
diately after Proposition 5.2, we need to verify Equation (5.9) for any triple of
unit vectors ϕ, ψ, ϑ ∈ H which are pairwise independent and nonorthogonal.
Consider the case dim H > 3. We work with a specific choice of ONB {ϕ1 , ϕ2 , ϕ3 }
of K, namely, ϕ1 = ϕ, ϕ2 ⊥ ϕ1 such that ψ ∈ [ϕ1 , ϕ2 ], and ϕ3 ⊥ [ϕ1 , ϕ2 ] such that
ϑ ∈ [ϕ1 , ϕ2 , ϕ3 ] = K. Using the operator bases (5.20) and (5.21), and denoting
these operators as Pξk , 5ξk in the order given, we can write
X
4 X
9
Pψ = sk Pξk , Pϑ = t` Pξ` ,
k=1 `=1
and
X
4 X
9
5ψ = sk 5ξk , 5ϑ = t` 5ξ` ,
k=1 `=1
X
4 X
9
5ϕ 5ϑ 5ψ 5ϕ = sk t` 5ϕ 5ξ` 5ξk 5ϕ = a5(L)
ϕ1 + ā5ϕ1 ,
(A)
(5.23)
k=1 `=1
where a turns out to be the same expression as that given in the braces {· · ·} in
Equation (5.22). Hence, a = hϕ | ϑihϑ | ψihψ | ϕi, and Equation (5.9) is verified.
Note that the case dim H = 2 is covered by putting sk = tk = 0 for k > 4.
Step 2. Next we show that the spectral projections occurring in Equation (5.9)
do not depend on the choice of ψ, ϑ ∈ H. Note that the result of Step 1 holds
for any choice of unit vectors ϑ ∈ H, and thus for any choice of ϕ3 ⊥ [ϕ1 , ϕ2 ] in
the case dim H > 3. The construction of 5(L) (A)
ϕ1 , 5ϕ1 so far depends on the choice
of ψ ∈ H, but only via ϕ2 ⊥ ϕ1 . In the case dim H = 2, the ray orthogonal to ϕ1
is uniquely determined by that vector, so that the spectral projections in Equation
(5.23) are the same for all ψ, ϑ. In the case dim H > 3, we have to show that these
projections are actually the same for all choices of unit vectors ϕ20 ⊥ ϕ1 (including
ϕ20 = ϕ3 , cf. Equation (5.19)). To this end we repeat the procedure of Step 1, this
time choosing a unit vector ϕ20 ⊥ ϕ1 = ϕ in the plane [ϕ, ϑ]. Then there exists
a unit vector ϕ30 such that ψ ∈ [ϕ1 , ϕ20 , ϕ30 ] = K. We obtain again a spectral
decomposition of the form (5.23),
0 0 0 0
5ϕ 5ϑ 5ψ 5ϕ = a5ϕ(L)
1
+ ā5ϕ(A)
1
, 5ϕ(L)
1
+ 5ϕ(A)
1
= 5ϕ , (5.24)
for the same set of vectors ϕ, ψ, ϑ. Provided that the eigenvalues a, ā do not coin-
0 0
cide, i.e. a ∈
/ R, then the spectral projections 5ϕ(L) 1
, 5ϕ(A)
1
constructed from ϕ1 , ϕ20
along the lines of the proof of Lemma 5.2 coincide with the projections 5(L) (A)
ϕ1 , 5ϕ1
constructed from ϕ1 , ϕ2 .
We show that, given ϕ, ψ, any ϕ20 ⊥ ϕ1 can be realized with a choice of ϑ such
that a = hϕ1 | ϑihϑ | ψihψ | ϕ1 i ∈ / R. Let ϕ20 be any unit vector such that ϕ20 ⊥ ϕ
0
and ϕ2 ⊥ / ψ. Choose ϑ = α1 ϕ1 + α2 ϕ20 , with α1 , α2 to be further specified shortly.
Take a unit vector ϕ30 orthogonal to ϕ1 , ϕ20 and such that ψ = β1 ϕ1 + β2 ϕ20 + β3 ϕ30 .
Since ϕ20 ⊥/ ψ, we have β1 6= 0 6= β2 . We obtain
a = α1 {ᾱ1 β1 + ᾱ2 β2 }β̄1 = |α1 |2 |β1 |2 + α1 ᾱ2 β2 β1 .
The first term on the right-hand side is real and positive. Then a 6= R can be easily
achieved by proper choice of α1 , α2 . Hence all choices of ϕ20 ⊥ ϕ1 , ϕ20 6= ϕ3 lead to
the same spectral decomposition in Equations (5.10) or (5.16). (Note that ϕ20 = ϕ3
would imply β2 = 0.)
98 P. BUSCH
A continuity argument finally shows that the case ϕ20 = ϕ3 can be included,
too, so that the spectral decompositions in Equations (5.16) and (5.19) do indeed
coincide. In fact, let ϕ2(n) be a sequence of unit vectors orthogonal to ϕ1 and different
from ϕ3 , such that kϕ2(n) − ϕ3 k → 0. Then Pϕ1 +αϕ(n) converges to Pϕ1 +αϕ3 in trace
2
norm. By the continuity of T , 5ϕ1 +αϕ(n) converges to 5ϕ1 +αϕ3 in trace norm. Then
2
1 + i (L) 1 − i (A)
5ϕ1 + 5ϕ1
4 4
= 5ϕ1 5ϕ1 +iϕ(n) 5ϕ1 +ϕ(n) 5ϕ1 −→ 5ϕ1 5ϕ1 +iϕ3 5ϕ1 +ϕ3 5ϕ1 ,
2 2
convergence in trace norm. It follows that the first and last expressions coincide.
This concludes Step 2.
We introduce the following notation for the operators of Equations (5.9):
ϕ ϕ ϕ
Wϑψ = 5ϕ 5ϑ 5ψ 5ϕ = aϑψ 5(L)
ϕ + aϑψ 5ϕ ,
(A)
ϕ
aϑψ = hϕ | ϑihϑ | ψihψ | ϕi.
ϕ =P
5(L) ϕ =P
(L)
5ϕ , 5(A) (A)
5ϕ , (5.28)
Proof. Let χ ∈ H be a unit vector not orthogonal to ϕ, and choose any unit
ϕ
vector ψ ∈ H neither orthogonal nor parallel to ϕ, χ and such that aχψ ∈
/ R.
STOCHASTIC ISOMETRIES IN QUANTUM MECHANICS 99
ϕ
We compute the operator 5χ Wχψ 5χ in two ways, using (5.3) and noting that
ϕ χ
aχψ = aψϕ :
ϕ χ ϕ ϕ
5χ Wχψ 5χ = hPχ , Pϕ iWψϕ = hPχ , Pϕ i aχψ 5(L)
χ + aχψ 5χ
(A)
ϕ ϕ
= aχψ ϕ 5χ + aχψ 5χ 5ϕ 5χ .
5χ 5(L) (A)
1 1 (L)
T (Pϕ ) = 5ϕ = P 5ϕ P (L) + P (A) 5ϕ P (A)
m m
(L)
m m(A) (A)
= T (L) (Pϕ ) + T (Pϕ ).
m m
Due to the linearity and continuity of T this equation extends to all (σ -)convex
combinations of pure states and thus to all states ρ and finally to all of V . 2
is an orthonormal basis of 5ψ H, cf. Equation (5.4). Moreover, for any two unit
vectors ψ, ϑ not orthogonal to ϕ, the corresponding basis vectors ψk , ϑ` are mu-
tually orthogonal if k 6= `. In fact, due to Equations (5.31), (5.32) we have (∼
denoting proportionality)
ϕ
hϑ` | ψk i ∼ h5ϑ ϕ` | 5ψ ϕk i = ϕk | Wϑψ ϕ` ∼ δk` . (5.34)
100 P. BUSCH
are mutually orthogonal. For any ψ ∈ H\{0} there exists a unique ray Pψk H, ψk ∈
5ψ H, such that
Pk 5ψ = 5ψ Pk = Pψk , k = 1, . . . , m. (5.36)
Then
_
Pk = Pψk , k = 1, . . . , m, (5.37)
ψ∈H\{0}
and
X
m _
Pk = 5ϕ . (5.38)
k=1 ϕ∈H\{0}
6 2kϑk(n) − ψk k kξ k.
We conclude that kPϑ (n) − Pψk k → 0 and so kPϑ (n) − Pψk k1 → 0. We also have
k k
kPϑ (n) − Pψ k1 → 0 and therefore, due to the continuity of T , k5ϑ (n) − 5ψ k1 → 0
and also kPk 5ϑ (n) − Pk 5ψ k1 → 0. But Pk 5ϑ (n) = Pϑ (n) → Pψk , and so Pk 5ψ =
k
Pψk . This proves (5.36).
The relation (5.37) is an immediate consequence
P of the fact just demonstrated
P
that for all ψ, Pψk 6 Pk . Finally we have ( k Pk )5ϕ = 5ϕ and so 5ϕ 6 k Pk .
Since all Pψk 6 5ψ , the converse ordering holds as well. This proves (5.38).
The maps Tk are clearly linear and positive. We have Tk (Pξ ) = Pk 5ξ = Pξk for
all vectors ξ 6= 0, so the Tk are trace preserving and thus pure stochastic maps. Let
ψ, ξ ∈ H\{0}, with ψ ⊥ ξ . Then 5ψ ⊥5ξ (since T is orthogonality preserving),
and so Pψk ⊥Pξk , that is, Tk (Pψ )⊥Tk (Pξ ). Thus the Tk are orthogonality preserv-
ing and therefore (pure stochastic) isometries. Equation (5.39) then follows by
straightforward application of (5.40). The existence of unitary or antiunitary maps
Uk satisfying (5.41) is a consequence of Theorem 2.3.1 of [5] (cf. Proposition 3.1
above).
If Uk is unitary, we find, using Uk∗ Uk = I :
Tk (Pϕ )Tk (Pϑ )Tk (Pψ )Tk (Pϕ ) = Uk (Pϕ Pϑ Pψ Pϕ )Uk∗ = aϑψ Tk (Pϕ ).
ϕ
Tk (Pϕ )Tk (Pϑ )Tk (Pψ )Tk (Pϕ ) = Uk (Pϕ Pϑ Pψ Pϕ )Uk∗ = aϑψ ϕ
Tk (Pϕ ).
P
Observing that Tk (Pξ )⊥T` (Pχ )(k 6= `) and 5ϕ = Tk (Pϕ ), etc., we obtain:
ϕ
X X
Wϑψ = 5ϕ 5ϑ 5ψ 5ϕ = Tk (Pϕ )Tk (Pϑ )Tk (Pψ )Tk (Pϕ ) = ak Tk (Pϕ ),
k k
ϕ ϕ
where ak = aϑψ or ak = aϑψ according to whether Uk is unitary or antiunitary.
Comparing this with Equations (5.31), (5.32), it is seen that all the Uk must either
be simultaneously unitary or antiunitary in order to reproduce the right-hand sides
of these equations.
From Tk (Pϕ ) · T` (Pψ ) = 0 for k 6= `, ϕ, ψ ∈ H\{0}, it follows that Uk∗ U` =
δk` IH . Also, it is obvious that the projection Uk Uk∗ has the same range as Pk , and
so Uk Uk∗ = Pk . 2
REMARK 5.1. The stochastic isometries T (L), T (A) of Proposition 5.3, Equa-
tion (5.30), are convex combinations of pure stochastic isometries generated by
linear and antilinear isometries of H, respectively. This explains the use of the
superscripts (L), (A) throughout this section.
102 P. BUSCH
This is apparent from the construction of the pure isometries Tk in Proposition 5.4,
which was based on the choice of some nonzero vector ϕ 0 ∈ H and Pan arbitrary
ONB ϕk of 5ϕ0 H. Accordingly, it can be shown that if T (ρ) = m1 k Uk ρUk∗ =
1
P ∗
m k Vk ρVk , where Vk is another set of unitary maps, then
X
Vk = γk` U` ,
`
with (γk` ) a unitary m × m matrix. It is easily verified that this condition ensures
that the Vk define the same mixing stochastic isometry as the Uk . The totality of
all projections Qk = Vk Vk∗ thus obtained from a given set Uk using all unitary
matrices (γk` ) commute with all elements of T (B1 (H)) and ◦ T (B(H)).
X
N
S(ρ) = Uk∗ ρUk + P0 ρP0 , ρ ∈ Ṽ .
k=1
Then T −1 = S|T (V ) .
Proof. The positivity of S is obvious. TracePpreservation follows from the fact
that Uk Uk∗ = Pk is the projection onto H̃k and Nk=0 Pk = IH̃ .
∗
Let ρ = T (σ ). Then, since Uk U` = δk` IH and P0 ρP0 = 0, one has
X
N X
N X
N
S(ρ) = S(T (σ )) = wk U`∗ Uk σ Uk∗ U` = wk σ = σ = T −1 (ρ).
`=1 k=1 k=1
Hence S|T (V ) = T −1 . 2
This result suggests the possibility that reversible physical state changes are not
necessarily represented by surjective, and hence pure, stochastic isometries, but
that a state change effected by any stochastic isometry is reversible: there exists
STOCHASTIC ISOMETRIES IN QUANTUM MECHANICS 105
a single dynamical map that sends all final states back to the initial states. This
interpretation is further elaborated in [3].
Pursuing further the dynamical interpretation of stochastic isometries, it may
be observed that such maps induce a reduction of the symmetries of the physical
system in question. In this sense it can be said that dynamical maps represented
as stochastic isometries describe the formation of structure (in a reversible way).
According to a theorem due to Wigner, any symmetry operation [1], defined as an
angle preserving map of the set of rays of H onto itself, is induced by a unitary
or antiunitary map according to Equation (3.1). For simplicity, we consider a com-
pletely positive stochastic isometry of the form (3.2), with all Uk unitary. Let U be
a unitary symmetry operation. Then we compute:
X
N
T (UρU ∗ ) = wk (Uk U Uk∗ )Uk ρUk∗ (Uk U ∗ Uk∗ ) = Ũ T (ρ)Ũ ∗ ,
k=1
where
X
Ũ = Uk U Uk∗ = ◦ T (U )
k
is unitary. The map U 7→ Ũ is injective. But to every U there do exist unitary maps
V 6= Ũ such that
Ũ T (ρ)Ũ ∗ = V T (ρ)V ∗ (6.1)
PN
for all ρ ∈ B1 (H). In fact, define a unitary map W = k=0 λk Pk , with Pk =
∗
Uk Uk , λk ∈ C, |λk | = 1, such that W 6= I . Let V = Ũ W . Then (6.1) holds
due to the fact that W commutes with all T (ρ). This shows that on the state space
T (B1 (H)s ) not all symmetries of H can be distinguished.
There is another interpretation of a stochastic isometry T and its associated map
◦
T defined in Remark 6.1. These maps lead to a physically equivalent description
of all states and observables of the given quantum system in the following sense:
for all states ρ and effects a, the corresponding states T (ρ) and effects ◦ T (a) give
the same probabilities:
hT (ρ), ◦ T (a)i = hρ, ai.
This observation can be elaborated into a general theory of extensions of the statis-
tical description of a quantum physical system on the basis of Theorem 1 [2].
Acknowledgement
The author wishes to thank Pekka Lahti for helpful comments on an earlier version
of the manuscript.
References
1. Bargmann, V.: Notes on Wigner’s theorem on symmetry operations, J. Math. Phys. 5 (1964),
862–868.
2. Busch, P.: Quantum extensions of quantum statistical models, Preprint, 1998.
3. Busch, P. and Quadt, R.: Operational characterization of irreversibility, Report Series, Depart-
ment of Mathematics, University of Hull, 1998.
4. Cassinelli, G., DeVito, E., Lahti, P. J. and Levrero, A.: Symmetry groups in quantum mechanics
and the theorem of Wigner on the symmetry transformations, Rev. Math. Phys. 9 (1997), 921–
941.
5. Davies, E. B.: Quantum Theory of Open Systems, Academic Press, New York, 1976.
6. Emch, G. G.: Algebraic Methods in Statistical Mechanics and Quantum Field Theory, Wiley,
New York, 1972.
7. Kadison, R. V.: Transformations of states in operator theory and dynamics, Topology 3 (Suppl.
2) (1965), 177–198.
8. Kadison, R. V.: Isometries of operator algebras, Ann. of Math. 54 (1951), 325–338.
9. Maeda, S.: Probability measures on projections in von Neumann algebras, Rev. Math. Phys. 1
(1990), 235–290.
10. Wright, R.: The structure of projection-valued states: a generalization of Wigner’s theorem, Int.
J. Theor. Phys. 16 (1977), 567–573.
Mathematical Physics, Analysis and Geometry 2: 113–139, 1999.
113
© 1999 Kluwer Academic Publishers. Printed in the Netherlands.
L. BOUTET DE MONVEL
Institut de Mathématiques de Jussieu, Université Pierre et Marie Curie, 4 place Jussieu,
75251 Paris Cedex 05, France; e-mail: boutet@math.jussieu.fr
1. Introduction
Let us first recall what a star product is (detailed definitions are given inPSection 2).
Let X be a manifold and let O b denote the algebra of formal series f = k>k fk hk ,
0
where the fk are smooth functions on X and h is a “small” formal parameter. A star
product on X is a unitary algebra law on O b for which the unit is 1 and the product
is local, i.e., given by a formula
X
f, g → B(f, g) = fg + hk Bk (f, g),
k >k0
isomorphism between the original D-algebras; however, there are in general many
more “exotic” star algebras which do not come from a D-algebra.
If X is of dimension 1 the classification depends on whether X is open, of genus
> 2, of genus 1 or of genus 0.
An inner automorphism of the algebra E of pseudo-differential operators on
X(U : P → AP A−1 ) has a symbol σ (U ) = d Log σ (A), which is a section of
the sheaf ω (on the “basis” B6 = 6/C× of closed forms homogeneous of degree
0 on 6, and an exponent which is the degree of A; we will see in Section 2 that
any automorphism U of E has likewise a symbol and an exponent ∈ C. Similarly,
a star algebras has a symbol σ (A) ∈ H 1 (B6, ω) and an exponent ∈ H 1 (B6, C).
We will see in Section 3 that if X is an open curve or a curve of genus > 1, star
algebras on 6 are completely determined by their exponent. The classification is
more subtle when X is closed of genus 1 or 0.
The techniques used in this paper are a mixture of noncommutative cohomol-
ogy, holomorphic cohomology, and the relation between the cohomology of a sheaf
with a filtration and the cohomology of the associated graded sheaf. This contains
nothing really new or difficult, but the mixture can be somewhat muddling.
As far as I know the questions studied here have not been investigated before
and the results are new.
In Sections 2 and 3 we recall the definition of star algebras, and some classifi-
cation principles.
In Section 4 we describe the classification when dim X > 2.
In Section 5 we describe the case where X is a curve (dim X = 1): results are
substantially different if X is open, X = P1 , X is of genus 1, or X is of genus
g > 2.
2. Star Algebras
2.1. CONES
(m an integer, m → −∞).
Remark 1. Sheaves are of course useless in the real case but must be used in
the complex case where global sections do not necessarily exist.
Remark 2. For analytic cones there is also a notion of convergent symbol (in-
troduced by the author in [6] to define analytic pseudo-differential operators).
These are in fact the more important and for many questions it is essential to
use convergent rather than formal symbols.? However, for the classification results
below, there is no significant qualitative difference between formal and convergent
symbols, so we will stick to formal symbols and avoid convergence technicalities.
? e.g., convergent rather than formal symbols are essential in the finiteness theorems of T. Kawai
and M. Kashiwara [22], or for going from E -modules to D-modules in the thesis of D. Meyer [23],
and probably in most problems involving a comparison between E and D-modules.
COMPLEX STAR ALGEBRAS 117
3. Pseudo-differential Algebras
3.1. E - ALGEBRAS
Let 6 = T ∗ X − {0} be the cotangent bundle (minus the zero section) of a complex
manifold X, equipped with its canonical symplectic structure. The basis is B6 =
6/C× = P X, the projective cotangent bundle. There is a canonical star algebra on
6, viz. the algebra of pseudo-differential operators, microlocalization of the alge-
bra of differential operators on X, whose Poisson bracket is the standard Poisson
bracket of T ∗ X. If we choose local coordinates x = (x1 , . . . , xn ) on X and the
COMPLEX STAR ALGEBRAS 119
To understand how local E-algebras can be patched together to make global ob-
jects, we have to know what automorphisms of E look like.
Let U ∈ D b−× be an automorphism of E : U preserves symbols and the unit 1, so
P
U − 1 is of degree 6 −1 and the logarithm D = Log U = − n>1 − n1 (1 − U )n is
well defined; it is a derivation of degree 6 −1 of E.
Now if D is a derivation of degree 6 k its symbol δ = σk (D) is a homogeneous
derivation of degree k of the Poisson algebra O, b i.e., a symplectic vector field on
6, homogeneous of degree k. This corresponds, via the symplectic structure of 6,
to a closed differential form α, homogeneous of degree k + 1.
Let ρ denote
P the radial vector field, infinitesimal generator of the action of
C× (ρ = ξj ∂ξj in local coordinates on X, T ∗ X as above): the associated Lie
derivation is Lρ = iρ d + diρ (iρ denotes the interior product) so
diρ α = (k + 1)α.
Hence α is exact (the differential of a homogeneous function) if k + 1 6= 0. If
k + 1 = 0, s = iρ α is locally constant, and α is locally the differential of a
homogeneous function of degree 0 iff s = 0.
By successive approximations, it follows that locally any derivation D of E is
of the form sad(Log P1 ) + ad Q with P1 elliptic of degree 1, Q ∈ E, and any
automorphism of E is locally of the form
U = (Int P1 )s Int Q0 (5)
with P1 elliptic of degree 1, Q0 elliptic of degree 0.? Int P denotes the inner
automorphism a → P aP −1 .
? As usual in the context of pseudo-differential operators, elliptic = invertible.
COMPLEX STAR ALGEBRAS 121
PROPOSITION 4. Let
0→A→B→C→0 (7)
3.5. SYMBOLS
If A ∈ AlgE ' H 1 (P X, Aut E) we have defined its symbol as the image of its
defining cocycle in H 1 (P X, ω). To compute H 0 and H 1 for automorphisms, it
will be useful to compute them first for symbols.
The following exact sequences of sheaves are also useful to handle ω:
0 → OP X /C → ω → C → 0, (9)
0 → C → OP X → OP X /C → 0. (10)
These give rise to long exact cohomology sequences. We will call “exponent
map” the cohomology maps coming from the map ω → C in (9).
With slight abuse we will call “Chern maps”? the maps:
ch : H j (Y, O/C) → H j +1 (Y, C) (11)
in the long exact cohomology sequence derived from (10).
The sheaf O/C (Y = X or P X) identifies with the sheaf of closed holomorphic
1-forms on Y . If Y is a Stein manifold we have H j (Y, O) = 0 for j > 1 so the
Chern map H j (Y, O/C) → H j +1 (Y, C) is an isomorphism for j > 1.
? The standard Chern map: H 1 (Y, O × ) → H 2 (Y, C) factors through H 1 (Y, O/C).
COMPLEX STAR ALGEBRAS 123
where (here, and whenever possible) H pq denotes the space of harmonic forms of
type p, q on Y .
PROPOSITION 5. (i) If n = dim X > 2, or if X is a closed curve of genus 6= 1,
the map H 0 (X, O/C) → H 0 (P X, ω) is an isomorphism.
(ii) If X is an open curve or a closed curve of genus 1, then w is split and
H 0 (P X, ω) ' H 0 (X, O/C) ⊕ H 0 (X, C).
Proof. A global section of ω is a closed 1-form on T ∗ X − {0}, homogeneous of
degree 0. Locally on X such a form α reads
X
α= αk dxk + βk dξk , (13)
where the coefficients αk resp. βk are of degree 0 resp. −1. If n > 2 this implies
βk = 0 so the αk only depend on x. Hence (i) for n > 2.
If X is a closed curve of genus 6= 1 (n = 1 so P X = X), then the Chern map
H 0 (X, C) ' C → H 1 (X, O/C) = C is injective: it maps s ∈ C to s ch O(1)
(where as above O(1) denotes the sheaf of homogeneous functions of degree 1 on
T ∗ X) and ch O(1) 6= 0 if g 6= 1.? So the exponent map H 0 (X, ω) → H 0 (X, C)
vanishes, and the map H 0 (X, O/C) → H 0 (X, ω) is an isomorphism, hence (i) in
this case.
If n = 1 and X is open or of genus 1, there exists a global nonvanishing vector
field, so ω is split: ω = O/C ⊕ C, hence (ii). 2
If X is open then H 1 (X, O/C) = 0 because in the long exact cohomology se-
quence from (10) we have H 1 (X, O) = H 2 (X, C) = 0, so H 1 (X, ω) ' H 1 (X, C).
If X is of genus g = 1, we have H 1 (X, O/C) = H 20 + H 11 = C and
H (X, ω) ' H 11 + H 1 (X, C) ' C3 .
1
3.6. FILTRATIONS
As mentioned above Aut E has a natural filtration (as well as E−× ⊂ Aut E): any
a ∈ Aut E is of degree 6 0 and a ∈ E−× is of degree n < 0 if a = Int (1+b), b ∈ En .
The corresponding graded sheaf is
M M
gr Aut E = (Aut E)k /(Aut E)k−1 ' ω + O(k). (14)
k 60 k<0
COMPLEX STAR ALGEBRAS 125
It is commutative, and this will help to extract more information. This also works
for any E-algebra A because the filtration above, and the leading terms, are by
definition invariant by automorphisms so gr Aut A ' gr Aut E.
(1 + bi )−1 aij (1 + bj )
Thus our map is well defined and onto (if H 0 (P X, Aut A) 6= 0 it may not be
injective because two cocycles of degree m with coefficients in Aut A can then be
equivalent although their symbols are not). 2
We first point out the following results (which will also be useful in Section 5):
is injective. It is bijective if A = E.
Proof. If n > 2, A and Aut A have no global section of degree < 0 so the sym-
bol map u → σ (u) is injective. More generally, if A, A0 are two E-algebras and
u, v two isomorphisms A → A0 the difference symbol σ (u−1 v) ∈ H 0 (P X, ω) is
well defined and completely determines v (given u) (note that we have σ (u−1 v) =
σ (vu−1 )).
On the other hand, if A = E (more generally if A comes from a D-algebra)
the symbol map is onto because, by Proposition 5, H 0 (P X, ω) ' H 0 (X, O/C) '
Aut D, and this obviously lifts to Aut E.
If X is a ball of Cn or more generally a Stein contractible domain, we have
H 1 (P X, ω) = 0 (Lemma 1) so H 1 (P X, E−× ) → H 1 (P X, Aut E) is onto, i.e., any
E-algebra can be defined by a cocycle with coefficients in E−× .
Now let A, A0 be two algebras defined by cocycles a = (aij ), a 0 = (aij0 ) ∈ E−×
and let u : A0 → A be an isomorphism, i.e., a family (ui ) ∈ Aut E such that
ui aij0 = aij uj . Then the symbols σ (ui ) patch together since σ (aij ) = σ (aij0 ) = 0,
and the resulting symbol σaa 0 (u) is well defined. It only depends on the classes of
a, a 0 in H 1 (P X, E−× ). However, it does depend on a, a 0 ∈ H 1 (P X, E−× ) and not
just on their images in H 1 (P X, Aut E): any other representatives are of the form
α · a, α 0 · a 0 with α, α 0 ∈ H 0 (P X, ω) for the action of H 0 (P X, ω) of Proposition 4,
and we get σα·a,α 0·a 0 (u) = σ (u) + α − α 0 .
If X ⊂ Cn , n > 2, is a Stein contractible domain, the exponent of σaa 0 (u)
vanishes: σaa 0 (u) ∈ H 0 (X, O/C), and again σaa 0 (u) completely
S determines u.
Let now A ∈ AlgE . There exists a covering X = Xi where all finite inter-
sections are isomorphic to Stein contractible domains of Cn . Then Ai = A|Xi can
be defined by a cocycle (ai ) with coefficients in E−× ; this being so the patching iso-
morphisms uij all have exponent 0 and are determined by their symbols σai aj (uij )
(for fixed Ai ). In particular we have proved:
COMPLEX STAR ALGEBRAS 127
If X is a ball and dim X > 3 we have H 1 (P X, O(−k)) = 0 for all k > 0, i.e.,
H 1 (P X, gr E−× ) = 0 (this is also true if X is a Stein manifold).
It follows that we have H 1 (P X, E−× ) = 0, and more generally for any E-algebra
A we have H 1 (P X, A× − ) = 0.
Hence if A is an E-algebra, it is built by patching together models of E over a
covering Xi of X, where the patching cocycle belongs to H 1 (X, OX /C).
Moreover, if A, B are two such algebras, any isomorphism B → A comes
from a ϕ ∈ H 0 (X, OX /C), i.e., comes locally from an inner automorphism P →
ϕP ϕ −1 . Summing up we have proved:
This result is closely related to the result of [8] on microlocally free D-modules
in dimension > 3.
If dim X = 2 what was said above remains true, in particular any symbol α ∈
H 1 (P X, ω) is the symbol of an E-algebra (in fact of a D-algebra). However,
the picture changes considerably because H 1 (P X, E−× ) is usually very large. The
following examples show what can happen, and also how, in global situations on
compact manifolds, things can nevertheless at least partially cancel out.
EXAMPLE 1. Let X be the unit ball of C2 (or more generally a Stein contractible
manifold).?
Then H 1 (X, ω) = H 1 (X, O/C) = 0 so H 1 (P X, Aut E) is the quotient of
H 1 (P X, E−× ) by the action of H 0 (X, Aut E) = H 0 (X, O/C).
Now P X is the union of the two Stein subcones Ui = {ξi 6= 0} (i = 1, 2) so a
cocycle is represented by just one section a12 ∈ E−× (U1 ∩ U2 ). It is elementary that
any a ∈ H 1 (P X, E−× ) has a unique normalized representative of the form
X p q
a12 = apq (x)ξ1 ξ2 , (16)
p,q<0
? What is used is H 1 (X, O/C) = 0 and the fact that T ∗ X is a trivial holomorphic vector bundle.
128 L. BOUTET DE MONVEL
i.e., with no holomorphic term in ξ1 or ξ2 (this is obvious for the additive co-
homology H 1 (P X, gr E−1 ) and follows by successive approximation for E−× ). So
H 1 (P X, Aut E) is the set of conjugate classes of normalized symbols a12 as above,
with a12 ∼ ϕ(x)a12 ϕ(x)−1 for ϕ a nonvanishing function on X. This set is still
very large; on the other hand such algebras tend to have very few global sections
or automorphisms.
The analysis of these algebras is closely related to that of “microlocally” free
D-modules in dimension 2, made by M. Carette [7].
For global compact manifolds, some things may cancel out.
EXAMPLE 2. Let X = P2 (C) be the complex projective plane: then P X is
isomorphic to the incidence manifold {x · ξ = 0} ⊂ X × X∗ (X∗ the dual projective
space). T ∗ X itself is the quotient of the incidence cone 0 = {x · ξ = 0} ⊂
C − {0} × C by the group action (x, ξ ) ∼ (λx, (1/λ)ξ ). The sheaf OP X (n) of
homogeneous functions of degree n on T ∗ X identities with the sheaf of restric-
tions to 0 of functions f (x, ξ ) such that f (λx, ξ ) = f (x, λξ ) = λn f (x, ξ ),
i.e., OP X (n) = OX (n) ⊗ OX∗ (n) (where exceptionally here OX (n) denotes the
canonical sheaf of the projective space). It follows easily that H 1 (P X, gr E−× ) = 0
so H 1 (P X, A×− ) = 0 for any E-algebra A.
The symbol map H 1 (X, Aut E) → H 1 (X, ω) = H 1 (X, O/C) is one to one,
and again, as in dimension > 3, the correspondence D-algebra → E-algebras is
an equivalence.
Note that in this case we have H 1 (P X, O/C) ' H 20 + H 11 ' H 11 = C, and
E-algebras ∼ D-algebras are parameterized by H 11 = C.
whose coefficients are translation invariant (independant of x). Two such cocycles
b, b0 define the same E-algebra iff b0 (ξ ) ' b(ξ + a) for some constant vector a.
P 1 α α
(This is an asymptotic relation between symbols: b(ξ + a) = ∂ ba .)
α! ξ
Note that in any case OP X (1) identifies with the sheaf of sections of T X (vector
fields) so the dual OP X (−1) identifies with the sheaf of sections of T ∗ X . If X is
of genus > 2, this is ample so we have H 1 (X, OP X (n)) = 0 for all n < 0, and it
follows that H 1 (X, gr E−× ) = 0, so H 1 (X, A×
− ) = 0 for any E-algebra A, and the
canonical map gr H 0 (X, Aut A) → H 0 (X, gr Aut E) is bijective for any E-algebra
A (it is surjective since on any curve we have H 2 (X, gr E−× ) = 0).
Furthermore, in this case the Chern map H 1 (X, O/C) → H 2 (X, C) is one
to one, as well as the map H 0 (X, C) → H 1 (X, O/C) (cf. 9). Finally the map
H 1 (X, Aut E) → H 1 (X, ω) = H 1 (X, O/C) is one to one. Thus
This is the most complicated of the cases examined here. Let X be a closed curve
of genus 1: X = C/ 0 where the group of periods 0 ' Z2 acts by translations.
We denote ξ the symbol of the constant vector field ∂/∂x on C.
Since T X is trivial, ω is split: ω = O/C + C. Also, for all n, we have
H 0 (O(n)) = H 00 ' H 1 (X, O(n)) = H 01 ' C, H 2 (O(n)) = 0.
We denote
G, resp. G− ⊂ G (28)
the group of automorphisms of E of the form Int ξ s Int(1 + a(ξ −1 )), resp. the sub-
group s = 0: this is the commutant of ξ , it is a constant subsheaf of Aut E.
For any α ∈ C we set ξa = eax ξ . This is only defined up to a multiplicative
constant eaµ , µ ∈ 0, but the inner automorphism Int (eax ξ ) is well defined, as well
as the corresponding commutator sheaf
Ga− ⊂ Ga , (29)
a = α 11 6= 0, b = α 10 6= 0, α 01 = 0.
with x̃ = x(1 + b/ξ )−1 so that σ (x̃) = x, [ξ + bLog ξ, x̃] = 1: with this choice the
Tµ (Int ξ )bµ (symbolically exp µ(ξ + bLog ξ )) commute with the Int eaµ x̃ , so again
the Uµ define an E-algebra with symbol α 11 = a, α 10 = b, α 01 = 0.
Proof. The fibers of this map are the orbits of the action of H 0 (X, ω) (Proposi-
tion 4). We will prove that this is transitive.
For this action σ (Int ξ ) acts by u = (uij ) → us = usij with Int usij = Int ξis Int uij
Int ξjs (mod coboundary equivalence), where ξi ∈ A(Xi ) has symbol ξ , and mul-
tiplication is the multiplication of A. Now in the local frame on Xi we have
ξj = Uij (ξ ) = x − αij + · · · so for leading terms we get
ξ
σ (usij ) = σ (uij ) = σ (u)(1 + αij ξ −1 ) (32)
ξ − αij
or with additive notation σ (us ) = σ (u) + sα 11 .
This proves Lemma 4, and the other assertions of Theorem 2 are immediate
consequences. 2
Let X be the projective line (X = P1 (C)). It is the union of the two open sets
X0 = {z = 6 ∞}, X∞ = {z 6= 0}, and since these are Stein, contractible (' C), E-
or D-algebras are classified by cocycles reduced to one function on X0 ∩ X∞ .
D-algebras are classified by H 1 (X, O/C) = H 2 (X, C) = C. The D-algebra
Ds (s ∈ C = H 1 (X, O/C)) is defined by the cocycle (Int z)s .
Let us introduce homogeneous coordinates x, y (z = xy ). We make use of the
sheaf of homogeneous differential operators D hom on C2 , i.e., differential operators
of x and y which commute with the generator of homotheties ρ = x∂x + y∂y ; this
algebra is generated by ρ and the operators
e = x∂y , h = x∂x − y∂y , f = y∂x (33)
which satisfy the relations
[h, e] = 2e, [h, f ] = −2f, [e, f ] = h,
h2 + 2(ef + f e) = ρ(ρ + 2). (34)
136 L. BOUTET DE MONVEL
This is because the two vector fields ∂0 = ∂z , ∂∞ = ∂1/z are globally holomorphic
and elliptic on X0 , resp. X∞ and their symbols are ζ, −z2 ζ , so any cocycle can
uniquely be reduced to the form above, as for the additive cohomology group
H 1 (X, gr E−× ).
To compare D-algebras and E-algebras it is convenient to use the following
intermediate exact sequence; let Int E0 ' E−× /C× be the group of inner automor-
phisms of E0 ; we have an exact sequence:
0 → Int E0 → Aut E → C → 0
hence a surjection
H 1 (X, Int E0 ) → H 1 (X, Aut E) (36)
whose fibers are the orbits of the action of C = H 0 (X, C) on H 1 (X, Int E) (q0∞ →
(Int ∂0 )s q0∞ (Int ∂∞ )−s , cf. Proposition 4). 2
It follows that Ds resp. D−s−2 give isomorphic E-algebras, although they are
not isomorphic D-algebras. This is the only case where two D-algebras on X = P1
give isomorphic E-algebras: the algebra of global sections is obviously an invariant
of an E-algebra, and in this the global sections e, h, f (with the notations above)
are well defined (up to an additive constant by their symbols, and the commutation
relations fix these constants). It follows that s(s + 2) = h2 + 2(ef + f e) is an
invariant of the E-algebra coming from Ds .
Note that D-algebras form a one-parameter family, so there are many E-algebras
which do not come from an E-algebra.
As last remark we turn to the following problem: does there exist a global
symbolic calculus, i.e., is the underlying sheaf of a given E-algebra isomorphic
b This is always true for real E-algebras, where one can patch global symbols
to O?
using a partition of the unity.
Let us examine what happens on X = P1 (C). There is a canonical 2-covering
of T ∗ X − {0} by C2 − {0}: (u, v) → (z = u/v, ζ = 12 v 2 ). If A is a E-algebra
on 6 = T ∗ X its pull-back on 6 0 = C2 − {0} is a star-algebra for the canonical
Poisson bracket ({v, u} = 1), equipped with an involution above the symmetry
(u, v) → (−u, −v) (note that on 6 0 , u and v are of degree 12 ). If A has a global
b 0 ).
symbolic calculus, its pull-back defines a star-product on O(6
b ) there is (up to isomorphism) only one star algebra law for the
Now on O(6 0
e = − 12 u2 , h = 2u ∗ v + 12 , f = 12 v 2 (38)
h2 + 2(ef + f e) = − 34 so s = − 12 or s = − 32 . (39)
We have proved: 2
PROPOSITION 12. The only D-algebras on P1 for which there is a global to-
tal symbolic calculus are D−1/2 and D−3/2 . In particular there is no global total
symbolic calculus for D.
138 L. BOUTET DE MONVEL
Acknowledgement
I thank J. J. Sansuc for his kind help in proofreading the manuscript, and the
reviewer for many detailed comments and suggestions.
References
1. Bayen, F., Flato, M., Fronsdal, C., Lichnerowicz, A. and Sternheimer, D.: Deformation theory
and quantization I, II, Ann. Phys. 111 (1977), 61–131.
2. Beilinson, A. and Bernstein, J.: Localisation de g-modules, C.R. Acad. Sci. Paris, Sér. I 292
(1981), 15–18.
3. Boutet de Monvel, L.: On the index of Toeplitz operators of several complex variables, Invent.
Math. 50 (1979), 249–272.
4. Boutet de Monvel, L.: Star products on conic Poisson manifolds of constant rank, Mat. Fiz.
Anal. Geom. (Kharkov) 2(2) (1995), 1–9.
5. Boutet de Monvel, L. and Guillemin, V.: The Spectral Theory of Toeplitz Operators, Ann. of
Math. Stud. 99, Princeton University Press, 1981.
6. Boutet de Monvel, L. and Krée, K.: Pseudodifferential operators and Gevrey classes, Ann. Inst.
Fourier 17 (1967), 295–323.
7. Carette, M.: Exotic D-modules in dimension 2, thèse Paris VI, en préparation.
8. D’Agnolo, A. and Schapira, P.: The Rado–Penrose correspondence II: Line bundles and simple
D-modules, J. Funct. Anal. 153(2) (1998), 343–356.
9. De Wilde, M. and Lecomte, P.: Existence of star products and of formal deformations of the
Poisson Lie algebras of arbitrary symplectic manifolds, Lett. Math. Phys. 7 (1983), 487–496.
10. De Wilde, M. and Lecomte, P.: Formal deformations of the Poisson Lie algebra of a symplectic
manifold and star products. Existence, equivalence, derivations, in M. Hazewinkel and M.
Gerstenhaber (eds), Deformation Theory of Algebras and Structures and Applications, Kluwer
Acad. Publ., Dordrecht, 1988, pp. 897–960.
11. Fedosov, B. V.: A simple geometrical construction of deformation quantization, J. Differential
Geom. 40 (1994), 213–238.
12. Frenkel, J.: Cohomologie à valeurs dans un faisceau non abélien, C.R. Acad. Sci. Paris 240
(1955), 2368–2370.
13. Frenkel, J.: Cohomologie non abélienne et espaces fibrés, Bull. Soc. Math. France 85 (1957),
135–220.
14. Gerstenhaber, M.: On the deformation of rings and algebras, Ann. of Math. 79 (1964), 59–103.
15. Guillemin, V. and Sternberg, S.: Geometrical Asymptotics, Amer. Math. Soc. Surveys 14, Amer.
Math. Soc., Providence, RI, 1977.
16. Giraud, J.: Cohomologie non abélienne, Grudlehren Math. Wis. 179, Springer-Verlag, 1971.
17. Gutt, S.: Equivalence of deformations and associated star products, Lett. Math. Phys. 3 (1979),
297–309.
18. Karasev, M. V. and Maslov, V. P.: Pseudodifferential operators and a canonical operator in
general symplectic manifolds, Math. USSR Izv. 23 (1984), 277–305.
19. Karasev, M. V. and Maslov, V. P.: Nonlinear Poisson Brackets: Geometry and Quantization,
Transl. Math. Monogr. 119, Amer. Math. Soc., Providence, 1993.
20. Kashiwara, M.: Quantization of contact manifolds, Publ. RIMS, Kyoto Univ. 32 (1996), 1–7.
21. Kawai, T. and Kashiwara, M.: On holonomic systems of microdifferential equations III –
systems with regular singularities, Publ. RIMS, Kyoto Univ. 17 (1981), 813–979.
22. Kontsevitch, M.: Deformation quantization of Poisson manifolds, Preprint, I.H.E.S., 1997.
23. Meyer, D.: D-modules et E-modules associés à un opérateur à caractéristiques simples, thèse
Paris VI, 1998.
COMPLEX STAR ALGEBRAS 139
24. Moyal, J.: Quantum mechanics as a statistical theory, Proc. Cambridge Philos. Soc. 45 (1965),
99–124.
25. Vey, J.: Déformation du crochet de Poisson sur une variété symplectique, Comment. Math.
Helv. 50 (1975), 421–454.
26. Weinstein, A.: Deformation quantization, Séminaire Bourbaki 789, 1994.
Mathematical Physics, Analysis and Geometry 2: 141–177, 1999.
141
© 1999 Kluwer Academic Publishers. Printed in the Netherlands.
S. YU. DOBROKHOTOV
Institute for Problems in Mechanics of Russian Academy of Sciences, Prosp. Vernadskogo 101,
Moscow 117526, Russia, e-mail: dobr@ipmnet.ru and Institut de Mathématiques de Jussieu, CNRS
UMR 9994, Laboratoire de Physique mathématique e Géometrie, case 7012, Université Paris 7
Denis Diderot, place Jussieu, F-75251 Paris Cedex 05, France, e-mail: dobro@math.jussieu.fr
A. I. SHAFAREVICH
M. V. Lomonosov Moscow State University, Vorob’evy Gory, Moscow 119899, Russia,
e-mail: shaf@difgeo.math.msu.su
Key words: complex Lagrangian manifold, tunneling, Laplace–Beltrami operator, Liouville tori,
semiclassical approximation.
E = H |3 = H (I1, . . . , In ), (1.1)
then the “asymptotic eigenvalues” Eν associated with the family 3 are defined by
the Bohr–Sommerfeld rule (see [28, 24]),
142 S. YU. DOBROKHOTOV AND A. I. SHAFAREVICH
σj
Ij = Ijν ≡ h + νj , (1.2)
4
and by the formula (see, e.g., [28, 24, 15, 7, 22])
Eν = H (I1ν , . . . , Inν ). (1.3)
Here, h > 0 is a small parameter (which must be specially interpreted in each
concrete situation), σj is the Maslov index of the cycle associated with the ac-
tion variable Ij , νj are integer, νj ∼ 1/ h. The Maslov canonical operator (or
Fourier integral operator, see, e.g., [24]) also allows one to construct the asymptotic
eigenfunctions (known also as quasimodes) 9ν , approximately (in some norm)
satisfying the original quantum equation
b9ν = Eν 9ν + O(hα ).
H (1.4)
The number α > 1 characterizes the precision of the approximation. The functions
9ν are localized in the neighborhood of the tori 3(I1ν , . . . , Inν ): supp 9ν → π(3)
as h → 0. Here π : T ∗ Q → Q is the natural projection. If H b is a selfadjoint
operator, then in the o(h)-neighborhood of Eν there always exists a number E e
from the spectrum of the operator H b. Therefore if the spectrum of the operator
Hb is discrete, then Eν is an approximation of some exact eigenvalue E. e Generally
speaking, this is not so for the functions 9ν : they may be close to (sometimes rather
complicated) combinations of true eigenfunctions of H b associated with different
eigenvalues (see, e.g., [2, 28]). Some examples of this type are connected with
a situation in which the Hamiltonian H and additional motion integrals of the
corresponding dynamic system are invariant with respect to a certain finite group.
In this case, the asymptotic eigenvalues Eν corresponding to some torus 3 and to
all images of 3 with respect to the group action coincide for all possible α. So we
can see the “asymptotic degeneration” of this part of the spectrum.
The simplest example of such a situation appears in the famous double-well
problem for the Schrödinger equation. In the one-dimensional classical problem
(see [22], and also [15, 39]), there exist two families of invariant circles asso-
ciated with each well, if the energy level is under the barrier between the wells
(the reflection of the coordinate x → −x maps one family onto the other). Each
asymptotic eigenvalue corresponds to two different asymptotic eigenfunctions 9ν±
(each localized in a certain well) for an arbitrarily large α. This “degeneration”
vanishes thanks to tunneling effects between the wells. This effect leads to the
appearance of an exponentially (with respect to the Plank constant h) small splitting
between eigenvalues, and also to the phenomenon that the true eigenfunctions 9 eν±
are localized at once in both wells:
1
eν± = √ (9ν+ ± 9ν− ) + O(h).
9 (1.5)
2
Of course, these effects exist in the multidimensional case (see, e.g., [15, 39, 40,
6]). However, rigorous mathematical investigations have been caried out at only
“MOMENTUM” TUNNELING BETWEEN TORI AND THE SPLITTING OF EIGENVALUES 143
for the lowest energy levels. In this case, we have two equilibrium points instead
of invariant tori, and the splitting of the asymptotic eigenvalues is defined by the
special trajectories of the corresponding Hamiltonian system called instantons. The
integral along the instanton defines something like a “tunnel distance” between
these 0-dimensional invariant sets (see [1, 8, 16, 25, 34 – 36] for the theoretical
treatment, and also [9, 15, 20, 35, 39, 40]).
The main difficulties in the multidimensional case for higher energy levels
are very similar to the difficulties appearing in the multidimensional saddle point
method. In an analysis of one-dimensional tunnel effects so-called canonical paths,
and Stokes and of Stokes lines in 1-D complex configuration space (on the complex
plane) play important roles. The Stokes and anti-Stokes lines are a zero-set of the
imaginary or real part of a certain complex function S(z); the canonical paths are
ones where Im S(z) does not decrease, and this means that it is possible to introduce
the orientation of these paths. Roughly speaking, one can use these objects and the
complex WKB method without knowing the classical (Hamilton) mechanics, the
phase space, invariant tori (circles) in the phase space, etc. But it is well known
that the passing from one-dimensional to multidimensional semiclassical meth-
ods (without tunnel effects) required to lift many objects from the configuration
space to the phase space and recall Hamilton mechanics. This is not trivial for
the construction of quasimodes associated with tori in real phase space (see, e.g.,
[15, 24, 25, 28]) and it is clear that it is much more complicated if one wants to take
into account the tunnel effects in complex phase space.? Also it is unclear how to
generalize the Stokes lines and canonical paths is the multidimensional situation.
That is why we think that it is interesting to consider such a multidimensional
example, where one can rigorously prove the formulas for quasimodes and splitting
connected with the complex phase space and interpret these formulas via some
multidimensional analogs of Stokes and anti-Stokes lines, paths (on the complex
Lagrangian manifold) in complex phase space, etc. This example must differ from
the double well problems, where the final formulas for splitting include integrals
over paths in real (not complex) configuration space (see [22, 15, 39]). Ya. G. Sinai
drew the attention of the first author (S. Yu. Dobrokhotov) to one of such examples
with additional discrete symmetry (see [18]). This is the case of the Laplace–
Beltrami operator h2 1 on a 2-D Liouville torus Q. This example admits separation
of variables and is well studied (without tunneling), in particular in the paper [18]
(see also the references in this paper). The corresponding classical Hamiltonian is
quadratic in momenta and it is obvious that if a torus 3+ = (p = P (ϕ), x = X(ϕ))
is invariant with respect to the phase flow gHt , then the “inverse in momentum”
torus 3− = (p = −P (ϕ), x = X(ϕ)) is also invariant. We denote by (p, x)
the coordinates in the phase space T ∗ Q (x = (x1 , . . . , xn ) are coordinates on Q),
? One of the explanations of this fact is connected with the complicated structure of related geo-
metrical objects. Even in the simplest examples of 1-D (of course integrable) situations, the analogs
of the Liouville tori (circles) are not tori in four real-dimensional space and have a more complicated
geometrical and topological structure.
144 S. YU. DOBROKHOTOV AND A. I. SHAFAREVICH
and by ϕ = (ϕ1 , . . . , ϕn ) the angle coordinates on the tori. If the tori 3+ and 3−
do not coincide, then one can construct two families of quasimodes 9ν+ and 9ν−
associated with these families of tori, which for all possible α in (1.4) correspond
to the same asymptotic eigenvalues Eν . From the point of view of Equation (1.4),
this means that, together with the asymptotic solution 9ν+ , the complex conjugated
+
function 9ν− = 9 ν is also an asymptotic solution of (1.4).?
Now the following natural question arises:
Does splitting of the corresponding asymptotic eigenvalues exist?
Another interesting problem related to this question is:
Is it possible to construct the modes (more exactly, the asymptotics of the real
eigenfunctions; see [2]) using the quasimodes? In particular, is it possible to use
formula (1.5)?
We answer these questions under some additional assumptions, and we not
only give analytical formulas for the splitting of the eigenvalues, and “modes via
quasimodes”, but also their geometrical interpretations. In fact it was not difficult
to obtain these formulas, using separation of variables and combination of some
ideas from [18] and analytical results of the complex WKB method from [13, 33].
Thus, we regard the main result of this paper to be the geometrical interpretation
and our observation that the splitting formula has a natural geometrical and topo-
logical structure. This interpretation includes some multidimensional analogs of
anti-Stokes lines, canonical paths on the complex Lagrangian manifold and also
singularities of this complex manifold and their projections to the related invariant
tori (which are absent in the 1-D case). It shows that it is sometimes possible to
add to the set of geometrical and topological objects (invariant tori 3, the cycles
γj on them, the corresponding action variables Ij and Maslov indices σj ), which
characterize the “spectral series”, certain new geometrical and topological objects
which characterize the splitting and tunneling.
It is important that in the example, considered the classical separation of vari-
ables (we mean angle-actions variables) does not coincide directly with the sepa-
ration of variables in the initial configuration space and the splitting formula really
includes complex paths in the complex phase space (in contrast to examples which
may be reduced to 1-D double-well problems).
The structure of the splitting formula is very transparent. We will describe it
below. Let 3+ and 3− be two “momentum” symmetric invariant tori of the geo-
desic flow on the Liouville torus implying the quasimodes ψν+ , ψν− = ψ̄ν+ and
asymptotic eigenvalues Eν and let there be no other tori with the same energy
and the same value of an additional integral. These tori 3+ and 3− belong to the
real phase space T ∗ Q. Let 8 be the related complex phase space with coordinates
? It is clear that all considerations about “inverse” tori are also true for the Schrödinger-type
operator −h2 1 + V (x) on Q, where V (x) is a potential, and that multidimensional Hamiltonian
systems sometimes can admit larger groups of discrete symmetries and therefore more than two
“identical” families of invariant tori.
“MOMENTUM” TUNNELING BETWEEN TORI AND THE SPLITTING OF EIGENVALUES 145
eν± = Eν + O(h2 ),
E
and
eν+ − E
E eν− = h (ω · m + O(h))e−ρ(3+ , 3− )/ h, (1.6)
π
R
where the “tunnel” distance ρ is equal to (1/2) 0 w dz.
We prove also that formula (1.5) for true asymptotics of eigenfunctions (modes)
via quasimodes holds if the latter are constructed “correctly”. This means the fol-
lowing. Each quasimode ψν± may be multiplied by some complex constant C ± . The
normalization condition ||ψν± || = 1 allows to fix the modulus of C ± , but not their
phases φ ± = arg C ± . It is clear from the semiclassical formulas that the choice
of φ ± is equivalent to the choice of initial points b± in the integral defining the
related action function on 3± . We explain that the choice of points b± is “true”
from the point of view of formula (1.5) if they are so-called Stokes points.? The
exact constructions and definitions are contained in Section 4.
We explain also the connection between “momentum" tunneling and band theory
and “over barrier” reflection. These two sections include well-known 1-D results,
which we only interpret from the point of view of their generalization to the mul-
tidimensional problem. Section 4 contains the main results of the paper and is
devoted to the Laplace–Beltrami operator on a torus; we give the exact definitions
and descriptions of geometrical objects for the splitting of eigenvalues in (1.6) as
well as for the asymptotics (1.5) of true eigenfunctions corresponding to a pair of
Liouville tori.
The preliminary version of this paper appeared in [11]; in [12] the results were
generalized for Laplace–Beltrami operators on 2D-surfaces with quadratically in-
tegrable geodesic flow.
Let us stress again that it was not difficult to obtain the analytical expressions
for the values and functions in formulas (1.5) and (1.6) by means of results [13] and
[18], but it required some calculations to see them and to catch their geometrical
structure. The main difficulties were strict and suitable definitions and descriptions
of “the oriented tunnel path” (this defines the phase ρ(3+ , 3− ) in (1.6)) and the
construction of the projection of complex singularities of the Lagrangian manifold
to (real) invariant tori (this defines the vector m in the amplitude in (1.6)). Let us
add that the structure of formula (1.6) and its interpretation looks natural enough,
and therefore we hope that (1.6) should hold in other concrete problems. Also we
hope that the objects introduced in Sections 2–4 can help to obtain formulas (1.5),
(1.6) and similar formulas in other examples directly from the multidimensional
complex WKB-method.
then there exist two functions 9ν± (9ν+ (x) = 9ν− (−x)), satisfying up to O(h2 ) the
corresponding Schrödinger equation:
h2 d2
− + V (x) 9ν± = Eν 9ν± + O(h2 ).
2 dx 2
Here x ± are the points of intersection of the circle 3+ with the x-axis. However
it is well known ([22, 13, 14]), that the exact spectrum of the Schrödinger opera-
tor is nondegenerate and the true eigenfunctions 9 eν± correspond to two different
eigenvalues Eeν± ; the difference between these eigenvalues has the form
Z x− p !
eν− − E
eν+ = ωh 1
E exp − 2(V (x) − Eν ) dx (1 + O(h)) (2.2)
π h −x −
eν± up to O(h) coincide with the odd and even linear combina-
and the functions 9
±
tions of 9ν :
eν± = 9ν+ ± 9ν− + O(h).
9
Here ω = dH/dI is the frequency of classical motion along the circle 3+ (or 3− ),
I is action variable.
Now we give an obvious geometric interpretation of the formula (2.2) (see, e.g.,
[6]). Consider the complexification 8 of the real plane R 2 ; we denote by (z, w)
complex coordinates, corresponding to real coordinates (x, p), respectively. In 8
consider the curve (Riemannian surface) L, defined by the equation H (z, w) = Eν .
On this surface, consider a loop 0 (we will call this loop a tunnel cycle) such that
the projection of this loop onto the z-plane coincides with the segment −x − , x − of
the real axis. The surface L and the loop 0 are plotted on Figure 2. The surface is
obtained by gluing together two copies of the complex plane, each cutt along the
segments [−x + , −x − ] and [x − , x + ].
Then the formula (2.2) can be rewritten in the form:
where
Z
1
ρ= Im(w dz).
2 0
We call ρ the tunnel distance between the circles 3± .
All considerations about “inverse” tori from subsection are also true for the Schrö-
dinger-type operator −h2 1 + V (x) on Q, where V (x) is a potential. Let us discuss
here the 1-D case and recall the relationship between the 1-D spectral problem
with tunneling and the band theory of the periodic Sturm–Liouville problem. If
Q = R, the one-dimensional invariant tori (circles) 3+ and 3− coincide and the
mentioned questions do not arise. But if one considers the 1-D periodic problem
for the Schrödinger equation (i.e., the Schrödinger equation on the circle, V (x) is
a periodic function), then the phase space T ∗ Q is a cylinder and if the classical
energy level exceeds max V (x), then 3+ and 3− do not coincide and the question
about splitting makes sense. Generally speaking, the answer to this question is well
known and can be easily understood in terms of the spectral properties of a periodic
Sturm–Liouville operator in L2 (R).
Actually (see, e.g., [17, 23]), the spectrum of the Schrödinger operator H b =
−h (d /dx )+V (x) with periodic coefficient V (x) (the Sturm–Liouville operator)
2 2 2
distance between two different eigenvalues is equal to the length of the gap. The
length δk of the kth gap tends to zero as k → ∞; if V (x) is an infinitely smooth
function then δk = o(k −N ) for arbitrary N > 0 (see, e.g., [23, 13]). So in this
case (non-degenerate gaps), each asymptotic eigenvalue corresponds to a pair of
exact eigenvalues (each coinciding with one end of the gap); the distance between
these two eigenvalues is o(hN ). This implies the appearance of the “asymptotically
degenerate” spectrum of the operator H b on the circle, which we mentioned above.
The degeneration of gaps is a non-generic phenomenon, although the finite gap
case (only a finite number of gaps are non-degenerate) plays a very important role
in the theory of integration of nonlinear equations (see, e.g., [10, 23, 29]).
In other words, the quasimodes 9ν+ and 9ν− associated with the circles 3+ and
3− , asymptotically describe an invariant subspace of the operator H associated
with two ends of the gap of its spectrum in L2 (R). Of course, all this is true for the
operator H b = 1/c2 (x)d2 /dx 2 on the unit circle, where c2 (x) 6= 0 is a smooth 2π -
periodic function. If the potential V (x) or the function c2 (x) are analytic functions
in some neighborhood of the real axis (for instance, if V (x) is a trigonometric
polynomial), then the formula for splitting or for the length of the gaps is known
(see [13]). This formula describes the tunneling between the circles 3+ and 3−
and includes integrals along certain paths or cycles in the complex phase space;
they take the role of the instanton in this case (see below). Conditions under the
potential when this formula applies are the sufficient ones of non-degeneration of
the corresponding gap. In Appendix 3 (see Assertion A3.2 and the last paragraph)
we describe the algorithm enabling at least numerically to check their validity.
This “splitting” effect is close to the well-known quantum over barrier re-
flection (see, e.g., [22, 5, 26]). We emphasize again that the principle difference
between these examples and 1-D double well problem is that the splitting formula
here really includes the complex paths on a complex plain. Also the supp ψν =
supp ψ̄ν , and π 3+ = π 3− , but the projections of 3+ and 3− onto the momentum
axis do not coincide. That is why we call this type of splitting a “momentum
tunneling”.
h2 ϕ 00 + c2 (x)Eϕ = 0, h → 0. (3.1)
150 S. YU. DOBROKHOTOV AND A. I. SHAFAREVICH
Figure 4. Tunnel cycle and anti-Stokes points in the periodic Sturm–Liouville problem.
(preserving the circles and interchanging the rest points of ξ ). Thus each segment
of a separatrix, connecting some rest point a with some point b+ on the circle 3+
induces a cycle, connecting the points a, ā, b+ , b− , b− ∈ 3− (see Figure 4). As a
result we obtain a number of loops on L. Consider integrals of the form Im θ along
each loop (note that the loops carry the natural orientation induced by the field ξ ).
DEFINITION. The loop 0 with the minimal value of this integral is called a
tunnel cycle on L. Points of intersection b± = 3± ∩ 0 as well as their projections
π(b±) to the configuration space are called Stokes points on the circles 3± or on
the circle Q.
Now, we introduce a number (a tunnel distance) between 3+ and 3−
Z
1
ρ= Im θ
2 0
and two functions
Z !
± −1/4 i x
9ν = (Eν c (x))
2
exp θ . (3.2)
h π(b± )
ASSERTION 3.1. Assume that the tunnel cycle on L is unique up to a homo-
eν± and two
topy. Then for every integer ν = O(1/ h) there exist two eigenvalues E
eigenfunctions 9eν of the periodic Sturm–Liouville problem (3.1), satisfying the
±
following conditions:
wEeν± = Eν + O(h),
e − e + ωh ρ
Eν − Eν = exp − (1 + O(h))
π h
2νh2 e−ρ/ h
= R (1 + O(h)), (3.3)
π ( 1 2π c(x) dx)2
2π 0
eν± = 9ν+ ± 9ν− + O(h).
9 (3.4)
Here ω is the frequency of the classical motion along the circle 3+ (or 3− ).
The proof is presented in Appendix 3, and it follows from the formulas of the
book [13].
152 S. YU. DOBROKHOTOV AND A. I. SHAFAREVICH
Remarks. (1) The real and imaginary parts of the Kählerian metric defined
above on the surface L defines there Riemannian and symplectic structures; ac-
cording to these the vector field ξ is simultaneously a locally R Hamiltonian and
locally gradient
R one. The local Hamiltonian for this field is Re θ, and the local
potential is Im θ.
(2) Functions 9ν± are “quasimodes”, corresponding to the circles 3± ; exponents
in (3.2) are defined by the action functions on these circles.
(3) Stokes points b± play an important role in the definition of “quasimodes”
±
9ν . The choice of these points as initial ones for the integrals in (3.2) guaran-
tees that the true eigenfunctions are close to the odd and even (with respect to
permutations) combinations of the quasimodes (see (3.4)). If one chooses initial
points in another way, the true eigenfunctions will be close to some other (more
complicated) linear combinations of the quasimodes.
(4) It is possible to avoid the assumption of uniqueness of the tunnel cycle. How-
ever, if there are many non-homotopic tunnel cycles, the formula for the splitting
must be replaced by another one (see [13]).
(5) The limit h → 0 for eigenvalues of the operator h2 c−2 d2 /dx 2 is of course
equivalent to the limit ν → ∞ for eigenvalues of the operator c−2 d2 /dx 2 (i.e., as
h = 1) where ν is the number of the eigenvalue.
Figure 5. Circles forming two invariant tori of the geodesic flow on a torus.
condition is satisfied for an open set in the plane E, E and means that
dH |3 6k dS|3 .
each torus 3 is a direct product of two circles S1,2 where Sj on the corresponding
cylinder xj , pj is defined by the equation
pj2 − qj (x) = 0,
Figure 6. Tunnel cycle connecting two tori 3± and anti-Stokes points on the tori.
The spectrum of the operator 1 is discrete, its asymptotic properties for large
E are studied in detail in the paper [18]. Let us recall some known facts from [18].
The first integral S of the classical Hamiltonian system defines an operator
b U2 (x2 ) ∂2 U1 (x1 ) ∂2
S = −h 2
+ , (A1.2)
U1 (x1 ) − U2 (x2 ) ∂x12 U1 (x1 ) − U2 (x2 ) ∂x22
commuting with 1. The last fact enables to separate the variables in the problem
(A1.1) and reduce it to two problems on the circles x1 ∈ [0, 2π ], x2 ∈ [0, 2π ] for
the functions ψ1 (x) and ψ2 (x), x ∈ [0, 2π ]:
It is clear, that due to the separation of the variables, both the classical motion
and the related spectral problem are decomposed to two one-dimensional problems
on the circles (xj ∈ [0, 2π )). So to write the formulas, one has only to look over
the different cases and combine the related formulas from the one-dimensional case
(see Appendix 3).
Consider first the corresponding one-dimensional classical mechanics. This is
induced by the Hamilton functions Hj (xj , pj ) = pj2 − qj (xj ), where
Hj |Sj = 0. (A1.7)
If there are two such trajectories, we will denote them Sj± : the index “+” corre-
sponds to the positive values of momenta pj and the index “–” – to the negative
ones. Let us denote c = E/E, c1 = max U1 , c2 = min U1 , c3 = max U2 , c4 =
min U2 . The structure of the common level set
H = E, S=E (A1.8)
depends essentially on relations between the constants c and cj (in the previous
section we considered the case c2 < c < c1 ).
“MOMENTUM” TUNNELING BETWEEN TORI AND THE SPLITTING OF EIGENVALUES 159
H = H (I1 , I2 ) ≡ E(I1 , I2 ),
S = S(I1 , I2 ) ≡ E(I1 , I2 ).
ωj (I1 , I2 ) = ∂H/∂Ij .
E/E ≡ c = c2 ; E/E ≡ c = c3 .
The passage of the parameter c across the numbers c2 and c3 relates to the passage
of the parameters (E, E) across the discontinuity set of the functions I1 (E, E),
I2 (E, E) and to the passage across the separatrices of the family of the trajectories
of the Hamiltonian system with the Hamiltonian H in T ∗ Q. Both the classical
motion and the structure of eigenfunctions in the neighborhood of this set becomes
more complicated (see [18]) and we do not consider this case here.
Now we describe the asymptotics of eigenvalues and eigenfunctions of the
Laplace–Beltrami operator for the different values of the parameter c. We will
always assume that conditions (i1 ), (i3 ) of the previous section are fulfilled. In this
section, we give explicit formulas for the case considered in the previous section
leaving the other cases for the next section. So we assume that
(1) c2 < c < c1 . The Equations (A1.8) define in the phase space two tori
3 = S1 × S2± . Their projections to the configuration space Q coincides with the
±
annulus x1− 6 x1 6 x1+ , x2 -arbitrary on the torus; here x1± are the roots of the
trigonometric polynomial q1 (x) (according to the condition (i3 ) there are only two
roots). The quasimodes 9ν± have the form
±
9ν± = K3b ± (1)
160 S. YU. DOBROKHOTOV AND A. I. SHAFAREVICH
Z !
1 1 x p π
ψ1 = 1/4
cos q1 (y, Eν ) dy + . (A1.10)
(q1 (x1 , Eν )) h x10 4
Here x20 = π(b2±) is the Stokes point on [0, 2π ] (see Sections 3, 4, 7), x10 =
π(b1 ) ∈ [x1− , x1+ ]. The formula (A1.10) is valid inside the interval (x1− , x1+ ); out-
side this interval the function ψ1 = o(hN ) for all N. In the neighborhood of the
points x1± the function ψ1 has the form different from (A1.10); it can be expressed
in terms of Airy functions (see, e.g., [13, 33, 28, 18]). The true eigenfunctions are
close to the sum and difference of 9ν± (see Assertion 4.1 in the previous section).
them again is the product ψ1 ψ2± . The functions ψ2± are defined by the formula
(A1.9) and ψ1 has the form
√
−(x1 −x10 )2 ω0 (x1 − x10 ) ω0
ψ1 = e 2h Hν1 √ . (A2.3)
h
ν ν
E eν+ = hω2 (I1 , I2 ) exp(−ρ/ h)(1 + O(h)),
eν− − E (A2.6)
π
where ρ is the tunnel distance between the circles S1 and S2 in (for now) the 2-D
complex phase space,
I2ν = hν2 , I1ν = h(ν1 + 1/2), ω2 = ∂H/∂I2 ,
νj ∈ Z+ , ν2 = O(1/ h), ν1 = O(1).
Remark. Formula (A2.6) can be obtained as a limit case of formula (4.6) as I
tend to their critical values. However, in the double-well problem (see Section 2)
the situation is quite different: the limit of the formula (2.2) as E tends to min V
exists but leads to the wrong value for the splitting (see [9]).
The cases
(4) c4 < c < c3 and
(5) c = c4 are quite equivalent to the cases (1) and (3), respectively.
“MOMENTUM” TUNNELING BETWEEN TORI AND THE SPLITTING OF EIGENVALUES 163
and (A3.1), (A3.2) is the spectral problem on a unit circle for the Schrödinger
operator and the operator −h2 c−2 (x) dx d2
2 , respectively. Also, let us assume that the
set q(x, E) > 0 for E ∈ ME on the interval [0, 2π ] is either a point or a segment
[x − , x + ], possibly coinciding with [0, 2π ], and q 6= 0 inside [x − , x + ] (compare
with (i1 )–(i3 )). If the set q(x, E) = 0 consists of the unique point x = x0 , then we
assume that this point is a nondegenerate extremal one:
∂ 2q
ω02 = − (x0 , E) 6= 0. (A3.5)
∂x 2
We have already recalled in Section 3 (see, e.g., [17, 23]) certain properties of
solutions to Equation (A3.1) with the function q (A3.3). Let us also add that in
the “allowed” zones one can choose a basis of solutions constituted by the Floquet
solutions (Bloch functions) (ϕ(x), ϕ(x)), where
ϕ(x + 2π ) = ei2πβ ϕ(x). (A3.6)
The number β (Floquet exponent or quasi momentum) is not unique, but one can
always choose ϕ and β in such a way that the integral part [β] coincides with the
minimal number of zeros (oscillations) on any segment [x, x +2π ] of real solutions
associated with the “allowed” zone. We will suppose that β is chosen in this way.
At one of the ends of each allowed zone, (A3.1) possesses a 2π -periodic solution,
and we can number the zones by k, k = 0, 1, 2, . . . , according to the increase of
eigenvalues Ek of the periodic problem. The periodic solutions are associated with
the left ends of the even allowed zones and with the right ends of the odd ones.
All of these facts are true for any 1-D Schrödinger operator on the circle; but our
operator also includes a small parameter h, and further we study properties of the
spectrum and solutions in the limit h → 0.
164 S. YU. DOBROKHOTOV AND A. I. SHAFAREVICH
Let us also note that if E → V (x0 ) = min V (x) and x − , x + → x0 (x0 is the
minimum point of V (x)) then the classical system passes to a stationary state (at
the equilibrium point x0 ). However the function (A3.9) has a limit and we have
r r
∂ 2q ∂ 2V
ω0 = − 2 (x0 ) = (x0 ) 6= 0. (A3.10)
∂x ∂x 2
This is not true in the case (A3.4), when “stationary” state means that I = 0, the
motion of all the points from the circle S 1 stop and ω → 0. Our future consid-
erations are local with respect to the parameter E and it is sufficient to assume
that q(x, E) is defined in a certain neighborhood of the fixed point E 0 where there
exists a diffeomorphism I = I (E) to its image and 0 < |∂I /∂E| < ∞.
A3.2. Now, we describe the asymptotic solutions of Equation (A3.1) as h → 0.
The corresponding formulas depend on the structure of the set
H(p, x, E) ≡ p 2 − q(x, E) = 0.
The following situations are possible.
(1) q > 0 and q = 0 at the unique point x0 . We have the “zero-dimensional
invariant torus” (equilibrium point) 30 = (p = 0, x = x0 ). The asymptotics of the
periodic solutions and the spectral parameters E are easily obtained with the help
of the harmonic oscillator approximation. It has the form
√
−(x−x0 )2 ω0 (x − x0 ) ω0
ϕ = ϕν ≡ Cν e 2h Hν √ + O(h ) ,
1/2
(A3.11)
h
1
Eν = H (Iν ) + O(h ) ≡ H (x0 , 0) + hω0
2
+ ν + O(h2 ). (A3.12)
2
Here, ω0 is defined by (A3.10), Hν are Hermite polynomials, Cν are normalization
constants, and ν = 0, 1, . . . are the numbers of the functions ϕν . These are arbitrary
positive integers, but they must not depend on h (this means that ν = O(1) as
h → 0). The formula (A3.11) defines the solution ϕ in an arbitrary small (but
independent of h) neighborhood Uδ of the point x0 . Outside this neighborhood let
us continue this function first to the interval [x0 − π, x0 + π ], multiplying it by
a cut off function. We obtain a finite function ϕ̃ν from L2 (R), and of course it
is not periodic. Now let us continue it periodically to the whole real axis and we
construct a periodic solution belonging to the νth allowed zone. It is clear that we
can do this in another way; namely, continue the function ϕν to the whole axis by
the rule (A3.6) and again obtain an asymptotic solution of Equation (A3.1). This
is not surprising: for instance, in the case (A3.3) this means only that the lengths
of the allowed zones “under the barrier” max V (x) are exponentially small with
respect to the parameter h as h → 0. The length δν of the νth allowed zone is
defined by tunneling effects, and
Z x0 +2π p
h ln δν = − |q| dx + o(1) (A3.13)
x0
166 S. YU. DOBROKHOTOV AND A. I. SHAFAREVICH
(see, e.g., [13]). The lengths of the gaps between these exponentially small zones
are equal to ω0 h + O(h2). We do not need any deeper analysis of the formulas for
ϕν and δν in this case.
So in this case formulas (A3.11), (A3.12) and (A3.6) allow us to construct the
asymptotics of the problem (A3.1) and (A3.2). The numbers ν come from the
quantization rule for Eν and they are also the numbers of allowed zones related to
these solutions.? Let us also note that it is natural to choose the quasi-momentum
β in the νth zone in the interval β ∈ (ν, ν + 1).
(2) The function q is strictly positive inside the interval
(x − , x + ) ⊂ [x̃ − π, x̃ + π ], x̃ = (x − + x + )/2,
3 = (p 2 − q = 0, x ∈ [x − , x + ]). (A3.14)
Eν = H (I ν ) + O(h2 ). (A3.16)
The related eigenfunctions inside the interval (x1 , x2 ) have the form??
Z !
1 1 xp π
ϕν = cos q(ξ, Eν ) dξ + + O(h). (A3.17)
(q(x, Eν ))1/4 h x+ 4
particularly the periodic solution in the νth zone. The length 1ν of this zone is
equal to (compare with (A3.13))
Z x+ p
h ln 1ν = − q(x, Eν ) dx + o(1) (A3.18)
x−
(see [13]). The lengths of the gaps between the zones with the numbers ν and ν + 1
are equal to the distances between Eν and Eν+1 . By means of the formula (A3.16)
one obtains the lengths of the gaps in the form hω(I ν ) + O(h2 ). Thus again the
Bohr–Sommerfeld quantization rules and formulas (A3.17) allow asymptotics of
periodic solutions of Equation (A3.1) to be constructed and, as before, the number
ν of the solution coincides with the number ν from the quantization rule and with
the number of the allowed zones related to this solution. (Here again one has
to count only those zeros of the solution which belong to the segment [x − , x + ];
compare with the previous case.)
Finally, let us note that formulas (A3.12) and (A3.13) can be rewritten in the
form (A3.16) and (A3.18), respectively. It is possible to show (see, e.g., [37, 18]),
that both of these formulas are true simultaneously for some numbers ν (these
numbers must be “not too big” for the asymptotics (A3.11) and “not too small”
for the asymptotics (A3.17)). Thus the formulas (A3.15), (A3.16) and (A3.18)
give the quantitative characteristics for the “zone structure” in the “under-barrier”
area. In the case (A3.4), the action variable, according to the quantization rule,
“enumerates” the allowed (stable) zones. Of course this is due to the fact that the
zero-level of the function p 2 − q(x, E) possesses only one connected component.
(3) If the parameter E in (A3.4) is close to the maximum of V (x), then formulas
(A3.15)–(A3.18) break down; at least they must be revised. This corresponds to the
fact, that now the classical motion takes place in the neighborhood of the separatrix
defined by the equation
p 2 − q(x, Es ) = 0, Es ≡ max V (x).
Once the energy E passes through the point max V (x), we see a bifurcation of the
phase picture: the curve 3 splits into two smooth curves (circles) 3+ and 3− in
the phase space T ∗ S 1 which are symmetric with respect to the axis p = 0. Let
us note that the separatrix is related to a singular variety (curve) and the standard
WKB-method does not work here. The turning point x + merges with the turning
point x − (on the circle), we have an unstable turning point and the true asymptotics
in its neighborhood is presented by means of the parabolic cylinder functions (see
[30, 18, 33, 37]). We do not consider this case here. But let us also note that from the
point of view of the spectral theory in L2 (R) the passage of the energy E (going
from smaller to bigger) across the separatrix leads to an increase of the allowed
zones and to a decrease of the gaps. Near the separatrix their lengths have the same
order O(h). When the distance E − Es > 0 becomes O(1), the situation becomes
directly opposite to the under-barrier case (E < Es ): the lengths of the “allowed”
zones are of the order O(h) and the lengths of the gaps are O(h∞ ) (see Figure 8).
168 S. YU. DOBROKHOTOV AND A. I. SHAFAREVICH
Figure 8. Gaps (black segments) and “allowed zones” (white segments) on the E-axis.
(4) Now we consider the main case: q(x, E) > 0 for all x. Here, we have two
invariant curves in T ∗ Q symmetric with respect to p = 0:
√
3± = (p = ± q, x = [0, 2π ]). (A3.19)
Obviously, if the motion on 3+ is defined by some functions (P (t), X(t)), then
the motion on 3− is defined by the functions (−P (−t), X(−t)). Let us fix the
orientation on these curves according to the last formulas. Then the action variable
I will be positive on each of them and the energy H in terms of the action variable
has the same form for each family 3± (and, of course, coincide on 3+ and 3−
for each fixed value of E). Now, let us recall the well-known result about the
asymptotics of the solutions to Equation (A3.1) (see, e.g., [13, Ch. III, §9]).
ASSERTION A3.1. (a) For real q(x, E) > 0 and any arbitrary natural N, there
exist two linearly-independent solutions to Equation (A3.1), ϕ + and ϕ − = ϕ̄ + ,
having in each finite interval [a, b] ⊂ R the following asymptotics as h → 0:
ϕ + = ϕN+ + O(hN+1 ), ϕ − = ϕN− + O(hN+1 ) uniformly in x, where
Z xp Z x !
1 i XN
ϕN+ = exp q(x, E) dx + hm αm (x, E) dx
(q(x, E))1/4 h x0 m=1 x0
Z !
1 i xp
= exp q(x, E) dx (1 + O(h)), (A3.20)
(q(x, E))1/4 h x0
ϕN− = ϕ̄N+ .
Here αm are smooth 2π -periodic functions, x0 is an arbitrary real number.
(b) Let I ν be defined by the Bohr–Sommerfeld quantization rule (note that the
Maslov index of the curves 3± is equal to zero)
I ν = hν, ν ∈ Z+ , ν ∼ 1/ h. (A3.21)
ν
Then there exists a set of “corrections” I(k) , k = 2, 3, . . . , such that for any
+ −
N > 0, the functions ϕN and ϕN for
X
N
Eν = E(I˜Nν ), I˜Nν = I ν + ν k
I(k) h
k=2
“MOMENTUM” TUNNELING BETWEEN TORI AND THE SPLITTING OF EIGENVALUES 169
are 2π -periodic functions of x. The function E(I ) is inverse to the function (A3.7).
Let us discuss this result. It is obvious that the functions ϕN+ and ϕN− are associ-
ated with the curves 3+ , 3− respectively. The parameter E changes continuously
(of course the case (A3.3) is included). If E coincides with E(I˜Nν ), then the func-
tions ϕN+ and ϕN− become 2π -periodic. However, this does not mean that the related
exact solutions ϕ + and ϕ − are also 2π -periodic: the Assertion A3.1 is true for
any segment [a, b], but only if it is finite. In the case (A3.3) this means that the
spectrum (in L2 (R)) of the periodic Sturm–Liouville operator as a rule has the gap
(E eν− ). The ends E
eν+ , E eν± coincide with Eν mod O(hN+1 ). The lengths of the gaps
are O(h ). The functions ϕN+ , ϕN− only define the asymptotics of a certain basis of
∞
consist of analytic curves defined in the complex configuration space (i.e. on the
cylinder) S by the equation
Z zp
∗
Im S(z , z) ≡ Im q(ξ, E) dξ = 0. (A3.23)
z∗
According to the conditions on the function q, the canonical domain D of the band
type (an annulus) in S (see [13]) exists and has the following properties:
(a) The boundary ∂D of the domain D consists of certain Stokes lines and
includes two connected components.
(b) It is symmetric with respect to the real axis x (the circle S 1 ).
(c) There are no turning points and Stokes lines inside D.
(d) Certain turning points, say zj and zj , Im zj > 0 belong to the upper and
lower connected components of the boundary ∂D. These points are the singular
ones of the boundary: ∂D is not smooth at these points (in the generic case only
one pair of the turning points z0 and z0 belong to ∂D).
The closed domain D is covered by two leaves of the manifold L which is
the Riemannian surface defined by Equation (A3.22). These leaves coalesce in D
only at the turning points zj . Let us fix in√D the continuous branch of the function
√
q(z, E)√by means of the assumption q(z, E)|z=x > 0, x ∈ S 1 and denote
W (z) = q(z, E). According to the ideas of the saddle-point method (see, e.g.,
[14]), let us consider in D two vector fields iW (z) and iW −1 (z) = iW /|W |2 (see
also [32]). It is easy to see that the latter field coincides with the projection of the
field ξ , defined in Section 3 with the help of the Kählerian metric h , i. However
the field iW −1 is singular in the turning points while the corresponding field ξ is
smooth everywhere on the complex curve L. Obviously the vector fields iW (z)
and iW −1 (z) are parallel on D and the difference between them is connected with
a choice of the time t on their trajectories. Let gt0 and gt be the related shifts along
their trajectories in the domain D during the time t and let x 0 be a point on the
circle S 1 = [0, 2π ). In the domain D let us consider the complex phase (the action
function)
Z z
S(x , z) =
0
W (z) dz, (A3.24)
x0
and denote
Z e
z
ρ = 2 Im W (z) dz, (A3.25)
x0
where z̃ is an arbitrary point on the upper part of ∂D and the path of integration
belongs to D. Obviously ρ does not depend on the choice of x 0 and z̃.
ASSERTION A3.2. (a) The trajectories of the vector fields iW (z) and iW −1 (z)
in D coincide with the lines of the steepest ascent of the function Im S(z, x 0 ) (with
respect to z):
∂Im S ∂Im S
iW = +i .
∂x ∂y
“MOMENTUM” TUNNELING BETWEEN TORI AND THE SPLITTING OF EIGENVALUES 171
We call the cycle 0k a tunnel cycle and the number ρ = ρ(3+ , 3− ) a tunnel
distance between 3+ , 3− . It is easy to see that this definition agrees with those of
Sections 2–4.
Remark. It is easy to see that the tunnel distance can be presented in the fol-
lowing equivalent forms:
Z zj p
ρ= q(z, E) dz, (A3.26)
zj
where the integrating path is an arbitrary one from the domain D, and
I
1
ρ= w dz, (A3.27)
2 0
172 S. YU. DOBROKHOTOV AND A. I. SHAFAREVICH
where 0 is an arbitrary cycle on L homotopic to one of the tunnel cycles. Also one
can see that the Stokes point x 0,k is the (unique) solution of the equation
Z x0 p Z zk p
q(x, E) dx + Re q(z, E) dz = 0. (A3.28)
x 0,k x0
eν+ =
eν− − E 2h2 ν
E 1
H exp(−ρ̃/ h)(1 + O(h)). (A3.32)
π( 2π c(x) dx)2
The tilde over ρ in the last formula means that one must only consider the closed
curves with the energy E = 1 and take this E in the integrals (A3.25)–(A3.27).
The formulas mentioned give the splitting for the exponentially close eigenval-
ues of the Schrödinger operator (A3.1) on the circle. Actually, they show that the
“support” (the wave-front) of the “true asymptotics” of the eigenfunctions belongs
to both curves 3± . In other words, there exists a tunnel resonance between these
two curves, just as in the double-well problem. Let us also note that as in the double
well problem (see [22]), the even combination of ϕN+ and ϕN− corresponds to the
smaller energy level and the odd combination corresponds to the larger one.
“MOMENTUM” TUNNELING BETWEEN TORI AND THE SPLITTING OF EIGENVALUES 173
Proof. It is contained, for instance, in the book [13] (see Ch. III, §3) and here we
only present some of its parts. According to [13] Equation (A3.1) has the solution
−1/4 1
w1 (z, h) = q (z, E) exp − S(z0 , z) (1 + hε(z)),
h
Z zp
S(z0 , z) = q(z, E) dz,
z0
In conclusion let us point out the following fact. The form of the dependence
of q on E is not essential, and all the results mentioned are local with respect to
E. In a certain sense, the parameter I plays a more important role, than E. The
main assumption is that the derivative ∂I /∂E is bounded and bounded away from
“MOMENTUM” TUNNELING BETWEEN TORI AND THE SPLITTING OF EIGENVALUES 175
Acknowledgements
The first author (S. Yu. Dobrokhotov) is grateful to Yacov G. Sinai, who attracted
attention to the discussed problem. We are also grateful to Anne Boutet de Monvel
for the stimulating discussion about this problem, and to Peter Grinevich, Michael
Karasev, Andreas Knauf, Vladimir Nazaikinski, and especially Alexey Sossinsky
and Leonid Pastur for very useful discussions. The first author is much indebted to
Jochen Bruening and the Volkswagen Fund for the opportunity to do part of this
work at the Humbolt University of Berlin. The second author is grateful to Heiner
Zieschang for the opportunity to do part of this work at the Ruhr University in
Bochum. Also, this work was possible thanks to partial financial support from the
Russian Fund for Basic Research (Grant 96-01-00720).
References
1. Agmon, S.: Lectures on Exponential Decay of Solution of Second-Order Elliptic Equations,
Math. Notes 29, Princeton University Press, 1980.
2. Arnold, V. I.: Modes and quasimodes, Funct. Anal. Appl. 6(2) (1972), 12–20.
3. Arnold, V. I.: Mathematical Methods in Classical Mechanics, Nauka, Moscow, 1974; Engl.
transl. Springer-Verlag, New York, 1989.
4. Babich, V. M. and Buldyrev, V. S.: Short-Wavelength Diffraction Theory. Asymptotic Methods,
Nauka, Moscow, 1972; Engl. transl. Springer-Verlag, Berlin, 1991.
5. Baz, A. I., Zeldovich, Ya. B. and Perelomov, A. M.: Scattering, Reactions and Decompositions
in Nonrelativistic Quantum Mechanics, Nauka, Moscow, 1971; Engl. transl. Israel Program for
Scientific Translations, Jerusalem, 1969.
176 S. YU. DOBROKHOTOV AND A. I. SHAFAREVICH
6. Creagh, S. C.: Tunneling in multidimensional systems, J. Phys. A: Math. Gen. 27 (1994), 4969–
4993.
7. Colin de Verdiere, Y.: Spectre conjoint d’operateurs pseudo-differenttiels qui commitent II. Le
cas integrable, Math. Z. 171(1) (1980), 51–73.
8. Dobrokhotov, S. Yu., Kolokoltsov, V. N. and Maslov, V. P.: Quantization of the Bellman
equation exponential asymptotics and tunneling, Adv. Soviet Math. 13 (1992), 1–46.
9. Dobrokhotov, S. Yu. and Kolokoltsov, V. N.: The double-well splitting of the low energy levels
for the Schrödinger operator of discrete 84 -models on tori, J. Math. Phys. 36(3) (1995), 1038–
1053.
10. Dubrovin, B. A., Matveev, V. B. and Novikov, S. P.: Nonlinear equations KdV-type, finite-gap
linear operators and Abelian manifold, Russian Math. Surveys 31(1) (1976), 55–136.
11. Dobrokhotov, S. Yu. and Shafarevich, A. I.: “Momentum” tunneling between tori and the split-
ting of eigenvalues for the Beltrami–Laplace operators on Liouville surfaces, Preprint Institute
for Problems in Mechanics, 1997.
12. Dobrokhotov, S. Yu. and Shafarevich, A. I.: Tunnel splitting of the spectrum for the Beltrami–
Laplace operators on 2D surfaces with quadratically integrable geodesic flow, Funct. Anal.
Appl. (to appear).
13. Fedorjuk, M. V.: Asymptotic Analysis: Linear Ordinary Differential Equations, Nauka,
Moscow, 1993; Engl. transl. Springer-Verlag, Berlin, New York, 1993.
14. Fedorjuk, M. V.: Saddle Point Method, Nauka, Moscow, 1977.
15. Gultzwiller, M. C.: Chaos in Classical and Quantum Mechanics, Springer-Verlag, New York,
1982.
16. Helffer, B. and Sjöstrand, J.: Puit multiples en limit semiclassic, 2, Ann. Inst. Henri Poincare
42 (185), 127–212.
17. Krein, M. G.: Generalizations of A. M. Lyapunov study about linear equations with periodic
coefficients, Dokl. AN SSSR 14 (1993), 1879–1883.
18. Kosygin, D. V., Minasov, A. A. and Sinai, Ya. G.: Statistical properties of the spectra of the
Laplace–Beltrami operators on Liouville surfaces, Russian Math. Surveys 48(4) (1993), 3–130.
19. Kobayashi, S. and Nomizu, K.: Foundations of Differential Geometry, Interscience Publishers,
London, New York, 1963.
20. Katsnelson, M. I., van Schilfgaarde, M., Antropov, V. P. and Harmon, B. N.: Ab-initio in-
stanton molecular dynamics for the description of tunneling phenomena, Phys. Rev. A (1996),
December.
21. Lazutkin, V. F.: KAM Theory and Semiclassical Approximations to Eigenfunctions, Springer-
Verlag, Boston, Berlin, New York, 1993.
22. Landau, L. D. and Lifshits, E. M.: Quantum Mechanics, Addision-Wesley, Reading, MA, 1965.
23. Marchenko, V. A.: Sturm–Liouville Operators and Their Applications, Naukova Dumka, Kiev,
1977; Engl. transl. Birkhäuser, Basel, 1986.
24. Maslov, V. P.: Perturbation Theory and Asymptotic Methods, Moscow State University,
Moscow, 1965; Engl. transl. Dunod, Paris, 1972.
25. Maslov, V. P.: Nonstandard characteristics in asymptotic problems, Proc. Int. Congr. Math. 1
(1983), 139–184.
26. Maslov, V. P.: The scattering problems in quasiclassical approximation, Dokl. AN SSSR 151(2)
(1963), 306–309.
27. Maslov, V. P.: The Complex WKB Methods in Nonlinear Equations, Nauka, Moscow, 1977;
Engl. transl. Birkhäuser-Verlag, Basel, Boston, Berlin, 1994.
28. Maslov, V. P. and Fedorjuk, M. V.: Semiclassical Approximation in Quantum Mechanics,
Nauka, Moscow, 1976; Engl. transl. D. Reidel, Boston, 1981.
29. McKean, H. P. and van Moerbeke, P.: The spectrum of Hill’s equation, Invent. Math. 30 (1975),
217–274.
“MOMENTUM” TUNNELING BETWEEN TORI AND THE SPLITTING OF EIGENVALUES 177
30. Marz, Ch.: Spectral asymptotics for Hill’s equation near potential maximum, Asymptotic Anal.
5(3) (1992), 221–267.
31. Mishenko, A. S., Sternin, B. Yu. and Shatalov, V. E.: Lagrangian Manifolds and Canonical
Operator Methods, Nauka, Moscow, 1978.
32. Neishtadt, A. I. and Sidorenko, V. V.: The delayed stability loss in Ziegler’s system, Preprint,
Keldysh Institute of Applied Mathematics RAS, Moscow, 1995.
33. Olwer, F. W. J.: Introduction to Asymptotics and Special Functions, Academic Press, New York,
London, 1974.
34. Polyakov, A.: Quark confinement and topology of gauge theories, Nuclear Phys. B 120 (1977),
429.
35. Rajaraman, R.: Solitons and Instantons, North-Holland, Amsterdam, 1984.
36. Simon, B.: Semiclassical analysis of low lying eigenvalues, Ann. Inst. H. Poincaré 38 (1983),
295–307.
37. Slavyanov, S. Yu.: Asymptotic Solutions One-Dimensional Schrödinger Equation, Leningrad
State University, 1991; Engl. transl. Amer. Math. Soc., Providence, 1996.
38. Thomas, L. E. and Wassell, S. R.: Semiclassical approximation for Schrödinger operators on
two-sphere at high energy, J. Math. Phys. 36(10) (1995), 5480–5505.
39. Wilkinson, M.: Tunneling between tori in phase space, Physika D 21 (1986), 341–354.
40. Wilkinson, M. and Hannay, J. H.: Multidimensional tunneling between excited states, Physika
D 27 (1987), 201–212.
Mathematical Physics, Analysis and Geometry 2: 179–196, 1999.
179
© 1999 Kluwer Academic Publishers. Printed in the Netherlands.
Nonclassical Thermomechanics of
Granular Materials
PASQUALE GIOVINE
Dipartimento di Meccanica e Materiali, Via Graziella, Località Feo di Vito,
I-89060 Reggio Calabria, Italy
1. Introduction
The theory of Dunn and Serrin (1985) concerning the thermomechanics of inter-
stitial working was mainly motivated by the incompatibility of the constitutive
equation for Korteweg fluids with Coleman’s and Noll’s (1963) interpretation of
the second law of thermodynamics.
Furthermore, in the Coulomb model for granular materials with incompressible
grains, the ‘equilibrium’ portion of the Cauchy stress tensor T is a special example
of an elastic material of grade 3; precisely because the volume fraction ν of the
rigid grains is proportional to the bulk mass density ρ of the body, one has
where grad(·) denotes the spatial gradient, 1(·) the Laplacian (i.e., the trace tr(·) of
grad[grad(·)]) and I the identity tensor; moreover, the βi , for i = 0, 1, 2, 3, 4, are
material constants (see Equation (5.1) of Goodman and Cowin (1971) or Equation
(17.5) of Capriz (1989)).
180 PASQUALE GIOVINE
Moreover, Giovine and Oliveri (1995) proved that, in the conservative case,
granular materials with inelastic grains behave like continua with latent microstruc-
ture (see the definition in Capriz (1989)) for which the description of mechanical
actions requires displacement gradients of higher order than the first, without the
need to introduce an extra ‘order parameter’, as in Goodman and Cowin (1972)
and Giovine (1995).
Hence, these particular bodies must be studied using the broader thermody-
namic proposal of Dunn and Serrin, who posit the existence of a rate of supply of
mechanical energy, the interstitial working, to overcome the inconsistencies that
arise between the consequences of the Clausius–Duhem inequality and the consti-
tutive equations; but, unlike them, here we also modify the mechanical principle of
angular momentum balance, while characteristic assumptions of granular materials
are applied in the balance of linear momentum.
In fact, it is well known that classical kinetic energy is inadequate to describe
the micromotions of the grains relative to one another, even if rotations of the
grains themselves are neglected. Both continuum theory, which introduces a ki-
netic energy of the local dilatational motions, and frictional–collisional theory,
which defines a pseudothermal energy associated with deviations of the motion
of individual particles from the local average velocity, suppose the existence of
a new term in the expression of the total energy (see Equation (2.14) of Bedford
and Drumheller (1983) and Equation (2.2) of Johnson and Jackson (1987), respec-
tively). For consistency, the linear momentum must also be altered by the addition
of the Lagrangian derivative of this extra kinetic energy term.
Moreover, the condition of symmetry of the stress tensor is derived, together
with a new requirement that the static part of interstitial work be divergence–
free, by means of the action–reaction principle: this last restriction assures one
that the material does not support an equilibrium flux of heat–like classical fluids,
but unlike Korteweg fluids.
Finally we observe that we can obtain general properties of the model if we
reduce to study the conservative case; in fact this theory represents a new example
of the general Euler–Poincaré theory of Holm, Marsden and Ratiu (1998) applied
to continuous bodies (cf., the incompressible conservative case studied in Section 6
of Giovine and Oliveri (1995) by means of a Hamiltonian variational principle of
local type). Thus it can be used as an example to generalize that theory in order to
include entropic and dissipative processes.
2. Kinematics
means the closure of an open connected set with smooth boundary ∂B∗ . If x∗ is the
place taken by a material element of C in B∗ , the motion of C is a smooth mapping
x = χ(x∗ , τ ) (2.1)
of C onto a time-sequence of regions in E with the time τ varying in some interval
[τ0 , τ1 ]. The deformation gradient, denoted by
∂χ
F := (x∗ , τ ), (2.2)
∂x∗
is therefore non-singular and has positive determinant indicated with
ι := det F > 0. (2.3)
During the motion the velocity and the acceleration are given by
∂χ ∂ 2χ
v := (x∗ , τ ) and a := (x∗ , τ ). (2.4)
∂τ ∂τ 2
Alternatively, velocity and acceleration at a given time τ can be considered as fields
on the region Bτ := χ(B∗ , τ ) occupied by C in the current placement, so
v = χ̇ (x, τ ) and a = χ̈ (x, τ ). (2.5)
To complete the kinematic specification of the model of a granular continuum
with inelastic grains, the choices of an explicit expression of the principle of mass
conservation and of the kinetic energy of the material are necessary.
The first choice is classical, since it is intended to exclude phenomena of diffu-
sion (typical of the mixtures). The mass of any part S of the continuum body C,
which occupies the subregion P∗ of B∗ in the reference placement, is conserved,
so that, if we call ρ the mass density, the balance of mass is
Z
d
ρ = 0, (2.6)
dτ Pτ
where Pτ := χ(P∗ , τ ).
The local form of (2.6) gives the continuity equation, i.e.
ρ̇ + ρdivv = 0, (2.7)
˙ denotes the material time derivative:
where div(·) := tr[grad(·)] and the dot (·)
˙ := ∂(·) + v · grad(·).
(·) (2.8)
∂τ
From (2.6) the material form can also be obtained, which relates the mass density
ρ with the determinant of the displacement gradient F :
ρι = ρ∗ (2.9)
(ρ∗ is the value of ρ in B∗ ).
182 PASQUALE GIOVINE
The choice of a suitable kinetic energy is more delicate. Besides the absence
of diffusion of grains, we also neglect the contribution due to the rotations of the
granules themselves (see Ahmadi (1982) for a generalized theory which includes
this possibility). Whereas, in addition to the usual translational kinetic energy, we
consider that joined to the ‘dilatancy’ as defined by Reynolds (1885), i.e., the
kinetic energy associated to the variations of the volume of interstitial voids (see
Appendix A for an evaluation of this term in a simple fluctuating motion).
Then the density per unit mass of the total kinetic energy of the material turns
out to be
κ = 12 v 2 + 12 γ (ρ)ρ̇ 2 , (2.10)
where the function γ depends on the microgeometry of the material element and
on the kind of admissible microdeformations. Equation (A.7) of Appendix A gives
γ (ρ) = 19 ξ∗2 ρ∗2/3 ρ −8/3 (2.11)
with ξ∗ a constant length. The quantity 13 γ (ρ)ρ̇ 2 is recognized to be what it remains
in our model of the ‘grain temperature’ of granular materials as introduced by
Johnson and Jackson (1987) in a more general context; the grain temperature satis-
fies their ‘pseudothermal energy’ Equation (2.7): we shall see that, in our inelastic
grains theory, this extra equation reduces to an explicit expression for the rate of
dissipation.
By the way, a further generalization of the constitutive expression for the addi-
tional kinetic energy could cause radical changes in the inferences (see the balance
Equation (3.13) of Giovine and Oliveri (1995) for the micromomentum of a gran-
ular material with elastic grains or the Equation (4.6) of ‘granular heat transfer’ of
the extended theory of Capriz and Mullenger (1995)).
Now, it is worth noting that generalized Lagrangians which include kinetic
energy as in expression (2.10) were considered by Holm et al. (1998) in their
stress tensor formulation of Section 7; hence the principal adiabatic nondissipative
aspects of our theory emerge from the more general framework of that Euler–
Poincaré theory. In particular, the dynamic part of our model acquires the following
additional structure, in the conservative case: the Hamilton’s principle formula-
tion; the Kelvin–Noether circulation theorem; the Lie–Poisson Hamiltonian struc-
ture; the characterization of equilibrium solutions as critical points of conserved
quantities (see also Holm et al. (1985)).
3. Dynamic Balances
The characteristic expression of kinetic energy for granular media with rigid grains
is now used to denote, in the classical balance of linear momentum, fluxes through
the boundary of different physical meaning; a balance that has the usual form
Z Z Z
d
ρv = ρf + T n, (3.1)
dτ Pτ Pτ ∂Pτ
NONCLASSICAL THERMOMECHANICS OF GRANULAR MATERIALS 183
with f ∈ V the body force per unit mass and n ∈ V the exterior normal to ∂Pτ ,
but in which the stress tensor T is the sum of a partial stress tensor T p and of a
tensor of inertia flux T i :
T = T p + T i , with T i := −ρ 2 γ (ρ)ρ̈ + 12 γ 0 (ρ)ρ̇ 2 I (3.2)
(see again Johnson and Jackson (1987) for the collisional–translational contri-
bution to the stress tensor in Equation (2.6) or the review article by Hutter and
Rajagopal (1994) in which many granular theories that split the stress tensor in the
Cauchy equation are reviewed); in (3.2) the prime (·)0 denotes differentiation with
respect to ρ.
The tensor T i represents the Lagrangian derivative (times ρ) of the extra kinetic
energy term 12 ργ (ρ)ρ̇ 2 (see Appendix B for the proof) and Equation (3.1) indicates
that any subbody Pτ acts as if it were a body with variable mass for which the influx
of linear momentum per unit area of ∂Pτ is ρ 2 (γ (ρ)ρ̈ + 12 γ 0 (ρ)ρ̇ 2 )n.
Remark. Again we must note that expression (3.2) for the stress tensor is also
deduced in the stress formulation in Section 7 of Holm et al. (1998), if we suppose
that the partial stress depends on a potential energy σ (ρ) per unit mass (see also
Giovine and Oliveri (1995)). The momentum density m of Equation (7.7) of Holm
et al. (1998) is here m = ρv+grad(ρ 2 γ (ρ)ρ̇), while their stress T̂ of Equation (7.8)
is
T̂ = m ⊗ v + ρ 2 γ (ρ)ρ̇(grad v)T + ρ 2 σ 0 (ρ) − 18 γ 0 (ρ)ρ̇ 2 I ;
finally, their Euler–Poincaré Equation (7.6) for continua in the momentum conser-
vation form, ∂m/∂τ = −div T̂ , coincides with our dynamic balance with T p =
−ρ 2 σ 0 (ρ)I , when the external forces f vanishes and the continuity Equation (2.7)
is used.
ρa = ρf + div T , (3.3)
184 PASQUALE GIOVINE
we take the scalar product of both sides of (3.3) by the velocity v and integrate
along regular motions over a subbody Pτ . Using Equations (2.7) and (3.2)1 , the
relation
Z Z Z Z
d
ρκ = ρf · v + (T + T )n · v −
p i
Tp ·L (3.4)
dτ Pτ Pτ ∂Pτ Pτ
The last integral on the right-hand side of Equation (3.5) is the power of internal
forces, while the first two are, respectively, the power of external body actions and
of external surface actions; on the left-hand side there appear, in succession, the
time-rate of change of the total kinetic energy and the flux of kinetic energy through
the boundary.
The power of internal forces has to satisfy the principle of action-reaction, thus
its density −(T p · L + div u) is zero for all rigid body velocity distributions, i.e.,
T p and u must be such that
T p · L + div u = 0, for all velocity fields of the form
(3.6)
v(x, τ ) = c(τ ) + p(τ ) × x,
with c ∈ V translation velocity and p ∈ V angular velocity.
The consequences of this demand become more evident when we take into
consideration the results of Section 5. In particular, we show that the interstitial
work flux u decomposes into the sum of a dynamic part ud = (z ⊗ I ) · L and a
static part us , the first one linear in the trace of L and the second independent of L
itself:
u = us + ud (3.7)
(see Equation (5.13)). Thus, relation (3.6) is modified as follows:
div us + [T p + (div z)I ] · L + (I ⊗ z) · grad L = 0; (3.8)
but, for rigid velocity fields v = c + p × x, it is L = −ep ∈ Skw, where e is the
Ricci permutation tensor, and grad L = 0, then principle (3.8) gives
−ep · T p + div us = 0, (3.9)
NONCLASSICAL THERMOMECHANICS OF GRANULAR MATERIALS 185
div us = 0 (3.10)
and hence it follows from the remaining part of (3.9) that the skew part of the partial
stress tensor T p must necessarily be zero. Moreover, as the inertia flux tensor T i
is spherical, we recover the classical condition of symmetry of the Cauchy stress
tensor T .
Nevertheless, it is worth noting that in our theory, unlike those of Dunn and Ser-
rin (1985) and Goodman and Cowin (1972), the classical condition of symmetry of
T is not assumed in advance but it is inferred by the principle of action/reaction as
well as the condition (3.10), which is absent in other theories on granular materials
with rigid grains and which assure us that the flux of the static part of interstitial
working through the boundary of any subbody Pτ of Bτ vanishes.
4. Thermodynamics
We preserve the purely thermal Clausius–Duhem inequality in exactly its classical
form, but we modify the thermomechanical energy balance consistently with the
condition of balance of mechanical energy formulated in (3.5).
During every admissible thermodynamic process, we suppose the existence of
a density of internal energy ε per unit mass, of a rate of heat generation λ per
unit mass due to radiation or other sources, of a heat flux through the boundary of
vector q ∈ V, of a density of entropy η per unit mass and of an absolute positive
temperature θ.
Also, we postulate the validity, for each process and for each subpart Pτ of Bτ ,
of the following generalized balance of energy
Z Z Z
d
ρ(ε + κ) = ρ(λ + f · v) + [T T v + u − q] · n, (4.1)
dτ Pτ Pτ ∂Pτ
With sufficiently regular processes, which also assure the validity of the kinetic
energy theorem (3.5), the balance (4.1) is replaced by
Z Z Z
d
ρε = (ρλ + T · L + div u) −
p
q · n, (4.3)
dτ Pτ Pτ ∂Pτ
Using Equation (4.4) and (4.2) and the usual localizing procedure, the reduced
version of the Clausius–Duhem inequality follows
ρ(ψ̇ + θ̇η) 6 T p · L + div u − θ −1 q · g, (4.5)
where ψ := ε − θη is the Helmholtz free energy per unit mass and g := grad θ.
The system of balance equations in local form (2.7), (3.3), (3.6), (4.4) and
(4.5) regulates the motion and the temperature field of a granular material with
incompressible grains; more generally, it applies to nonclassical continua that have
the equilibrium portion of the Cauchy stress tensor T which behaves as an elastic
material of grade 3, in order to assure a constitutive equation of the type (1.1), and
an augmented kinetic energy like (2.10) (or, alternatively, if the additional energy is
a more generic nonnegative function κ̄ of ρ and ρ̇ with the properties κ̄(ρ, 0) = 0
and ∂ 2 κ̄/∂ ρ̇ 2 6= 0).
We perform the thermodynamic consequences of the quoted system of balance
equations for the constitutive structure of the fields ψ, η, T p , q and u within the
procedure outlined by Coleman and Noll (1963), appropriately adapted to this
context. Substantially, they differentiate the analytical character of the relations:
Equations (3.3) and (4.4) describe the ambient actions on the body, actions which
are represented by the densities f and λ, respectively; (3.6) and (4.5) character-
ize the internal constitutive structure of the continuum and have to be identically
satisfied during every admissible thermodynamic process.
and
for all and for all proper orthogonal tensors Q (Q ∈ Orth+ ), respectively; we
suppose that the smoothness of the tilde functions in (5.1) is sufficient for all the
following developments and that Q in (5.2) depends on and Q as follows:
We may now incorporate the functional dependence of the free energy ψ and
of the interstitial work u expressed in (5.1) in the Clausius–Duhem inequality (4.5)
and use the chain rule, the identity
ḋ = grad(ρ̇) − LT d (5.3)
and the conservation of mass (2.7). Consequently, we may assert that the constitu-
tive fields ψ, η, T p , q and u must be such that the entropy imbalance
0 6 T p + ρd ⊗ ψd + ρ(ρψρ + d · ψd )I · L + (uL + ρ 2 I ⊗ ψd ) · t grad L −
− ρψS · Ṡ − ρ(η + ψθ )θ̇ − ρψg · ġ − ρψL · L̇ + uρ∗ · d∗ + uρ · d + (5.4)
+ ud · S + uS · grad S + uθ∗ · g∗ + (uθ − θ −1 q) · g + ug · grad g
holds for every motion x and for every temperature field θ; here subscripts denote
partial differentiation, while the left exponent t (·) indicates minor left transposition,
i.e., (t grad L)ij k = (grad L)j ik in components.
In the relation (5.4) we can specify the values of θ̇ in R, of ġ, d∗ and g∗ in V,
of Ṡ and grad g in Sym, of L̇ in Lin and of grad S and grad L in the subsets of
third-order tensors symmetric in all places (grad S ∈ Sym3 ) and in the last two
(grad L ∈ Sym32 ), respectively, independently of and arbitrarily, hence (5.4) is
linear in those quantities: thus its fulfillment implies that, at each , the relative
terms must all vanish. Hence, we obtain the following restrictions:
η + ψθ = 0, (5.5)
ψS = 0, ψg = 0, ψL = 0, (5.6)
uρ∗ = 0, uS · s = 0, ∀s ∈ Sym3 , uθ∗ = 0, ug · G = 0, ∀G ∈ Sym, (5.7)
(uL + ρ 2 I ⊗ ψd ) · t r = 0, ∀r ∈ Sym32 , (5.8)
Now we can use the expression (5.14) of the static part of the flux u in (5.15)1
to derive the following constraint for functions ωi :
ω2ρ − 2ω1δ + ω3θ = 0, for all ρ, δ and θ ∈ R+ . (5.16)
Also, by inserting (5.15)1 into the local energy law (4.4) and the reduced Clausius–
Duhem inequality (5.9), we obtain the following simplified forms:
ρ ε̇ = ρλ + T p · L − div ρ 2 φδ (tr L)d − div q and (5.17)
p
T + ρφδ d ⊗ d + ρ ρ ωρ + 2 φδ − div(ρφδ d) I · L−
1
− θ −1 q · g > 0, (5.18)
NONCLASSICAL THERMOMECHANICS OF GRANULAR MATERIALS 189
and
T e := T̃ (ρ∗ , ρ, d, S, θ∗ , θ, 0, 0)
= −ρφδ d ⊗ d + ρ 2 (ρ −1 φδ + φδρ )δ + φδ (tr S) + (6.4)
+ 2φδδ (d · Sd) − ωρ − 12 φρ I,
respectively. From a comparison of (6.3) with Equation (4.9) of Dunn and Serrin
(1985), we may observe that, again unlike Korteweg materials, granular media
cannot support heat fluxes in equilibrium: this follows from the new relation (3.10)
which, with the requirement of symmetry of the stress T , expresses the balance of
moment of momentum. Instead, our material can sustain shear stresses, a property
that is uncommon in ordinary fluids, owing to the first term on the right-hand side
of (6.4), term proportional to the diad (d ⊗ d) which accounts for the Ericksen
stress. The term (−ρ 2 ωρ I ), independent of δ, appears similar to the pressure term
in the stress tensor for a compressible fluid, whereas the configuration pressure
term (− 12 ρ 2 φρ I ) is related to the variation of the volume distribution of granules.
Furthermore, the Hessian matrix of σ must be nonnegative in equilibrium, in
the sense that its determinant and all of its principal sub-determinants have to be
nonnegative or, equivalently, that the inequality
e
σgg · (g ⊗ g) + 2σgL
e
· (g ⊗ L) + σLL
e
· (L ⊗ L) > 0 (6.5)
must be verified for each g ∈ V and L ∈ Lin; the exponent e denotes values of
functions in equilibrium.
By substituting in (6.5) the expressions of the gradients σgg , σgL and σLL eval-
uated at (ρ∗ , ρ, d, S, θ∗ , θ, 0, 0), we obtain
−qge · (g ⊗ g) + θ Tge − ρ 2 φδθ I ⊗ d · (L ⊗ g)−
(6.6)
− qLe · (g ⊗ L) + θTLe · (L ⊗ L) > 0,
which is satisfied if and only if
qge · g ⊗ g 6 0 (6.7)
and
−4θ qge · (g ⊗ g) TLe · (L ⊗ L)
t e T 2 (6.8)
> θ (Tg ) − θρ 2 φδθ d ⊗ I − qLe · (g ⊗ L) ,
for all vectors g and all tensors L.
The term on the right-hand side of (6.8) is always nonnegative, which means
that, for (6.7), the viscosity tensor TLe is positive semi-definite at equilibrium.
The large number of restrictions on the constitutive equations given above are
still inadequate to provide uncomplicated expressions for the unknown fields: the
NONCLASSICAL THERMOMECHANICS OF GRANULAR MATERIALS 191
with
with β̄0 , β̄1 and β̄ material constants, we have that the equilibrium Cauchy stress
value, which coincides with the partial equilibrium stress value (6.11)3 , can be
further specialized to
T e = β̄0 − β̄1 ρ 2 + β̄δ + 2β̄ρ(tr S) I − 2β̄d ⊗ d; (6.13)
where
T p = − ρ 2 ωρ + ρϕ + 12 ρ 2 ϕρ δ + ρ 2 ϕ(tr S) + λ̄(tr D) I −
(6.22)
− ρϕd ⊗ d + 2µ̄D,
ε = ω − θωθ + 12 (ϕ − θϕθ )δ (6.23)
NONCLASSICAL THERMOMECHANICS OF GRANULAR MATERIALS 193
and the scalar fields must satisfy the inequalities (6.10), (6.17), (6.19) and (6.20).
Finally, it is interesting to note that the constitutive expression (3.2) for the
Cauchy stress tensor T , with the partial stress T p given by (6.22), can be recog-
nized rather implicity in Goodman and Cowin (1972) and not enough explicitly in
the joined paper of Cowin and Nunziato (1981).
In fact Equation (4.19)1 of Cowin and Nunziato (1981) for the stress tensor is
2α
TCN = − 3 + λ̂(tr D) I − 2 d ⊗ d + 2µ̄D, (6.24)
ρm
with
ρ 2 π̄ 2α
3 = ρ ωρ +
2 1 2
2
ρ ϕρ δ + ρ γ ρ̈ +
2
(tr D) − ρdiv 2 d , (6.25)
ρm ρm
when some minor changes in notation are made and l = 0: the so-called ‘equili-
brated inertia’ k(:= γ /ρm2 ) is the supposed constant, so we can add to the right-
hand side of (6.25) the null term 12 ρ 2 γ 0 ρ̇ 2 . Moreover, by choosing α := 12 ρρm2 ϕ̄(ρ),
ρ 2 π̄
λ̂ := λ̄ + ρm
and substituting (6.25) in (6.24), we now obtain the completely
explicit
TCN = T = T p + T i , (6.26)
% = ρm ν (A.1)
and the total volume change of the granular material with inelastic grains is only
due to the increase or decrease of the volume of interstitial voids: no additional
independent kinematic variable is then necessary (as introduced in Goodman and
Cowin (1972) or Giovine and Oliveri (1995) for elastic granules).
194 PASQUALE GIOVINE
1 X
m
κM = 12 z2 + 1
2 4 3
µ(ξ̇i )2 , (A.3)
3
π ς i=1
where here the dot denotes the time derivative and z is the velocity of the center of
mass.
On the other hand, conservation of mass gives the relation
1/3
%∗
ς = ς∗ , (A.4)
%
so that, from (A.2) and (A.4), the rate of change of ξi is
ξ̇i = − 13 ξ∗ %∗1/3%−4/3 %̇, (A.5)
while the mass of a grain is now
4
µ= %π ς 3 . (A.6)
3m
By using (A.5) and (A.6) in (A.3), we obtain the following expression for κM :
1 2 2/3 −8/3 2
κM = 12 z2 + ξ % %
18 ∗ ∗
%̇ . (A.7)
When a variation δx(x, τ ), with x in Bτ and the time τ in the interval [τ0 , τ1 ]
(τ1 > τ0 ) of duration of the process, is added to the motion x, a corresponding
variation of the domain Bτ and of the fields defined on Bτ ×[τ0 , τ1 ] is implied. The
total variation of the mass density is restricted by the conservation of mass (3.4),
i.e.,
δρ + ρdiv δx = 0. (B.1)
Then, for variations δx and δρ which vanish at the extremes of the interval
[τ0 , τ1 ], the following equality holds
Z τ1 Z Z τ1 Z
δ dτ 1
2
ργ (ρ)ρ̇ =
2
dτ ρ 2 γ (ρ)ρ̈ + 12 γ 0 (ρ)ρ̇ 2 n · δx −
τ0 Pτ τ0 ∂Pτ
Z (B.2)
− δx · grad ρ 2 γ (ρ)ρ̈ + 12 γ 0 (ρ)ρ̇ 2 .
Pτ
In fact, by using the transport theorem, adapted to this context, and the property
that the variation δ and the material time derivative (·) ˙ commute (see (3.10)1 of
Giovine and Oliveri (1995)), we have
Z τ1 Z Z τ1 Z
δ dτ 1
2
ργ (ρ)ρ̇ 2
= dτ 1
2
ρδ γ (ρ)ρ̇ 2
τ0 Pτ τ P
Z 0τ1 Z τ
˙ + 1 γ 0 (ρ)ρ̇ 2 δρ
= dτ ρ γ (ρ)ρ̇ δρ 2
τ0 Pτ
Z τ1 Z (B.3)
= dτ ˙
ρ [γ (ρ)ρ̇δρ] −
τ0 P
Z τ1 τZ
˙ ρ̇] − 1 ργ 0 (ρ)ρ̇ 2 δρ.
− dτ ρ [γ (ρ) 2
τ0 Pτ
From the classical transport theorem and from Equation (B.1), it follows:
Z τ1 Z "Z #τ1
δ dτ 1
2
ργ (ρ)ρ̇ 2 = ργ (ρ)ρ̇δρ −
τ0 Pτ Pτ
τ0
Z Z
τ1
− dτ ρ γ (ρ)ρ̈ + 12 γ 0 (ρ)ρ̇ 2 δρ (B.4)
τ Pτ
Z τ1 0 Z
= dτ ρ 2 γ (ρ)ρ̈ + 12 γ 0 (ρ)ρ̇ 2 div δx,
τ0 Pτ
since δρ vanishes for τ = τ0 and τ = τ1 . Now integration by parts and the use of
the divergence theorem proves the equality (B.2).
Acknowledgements
We thank one reviewer for suggestions and observations about the compatibility
of our work with the general Euler–Poincaré theory. This research was supported
196 PASQUALE GIOVINE
by the Italian M.U.R.S.T. through ‘Fondi per la ricerca scientifica 60%’ and by the
Italian C.N.R. through the contract n.94.04665.CT26.
References
Ahmadi, G.: A generalized continuum theory for granular materials, Internat. J. Non-Linear Mech.
17 (1982), 21–33.
Arthur, J. R. F. and Menzies, B. K.: Inherent anisotropy in sand, Geotechnique 22 (1972), 115–129.
Bedford, A. and Drumheller, D. S.: On volume fraction theories for discretized materials, Acta
Mechanica 48 (1983), 173–184.
Capriz, G.: Continua with Microstructure, Springer Tracts Nat. Philos. 35, Springer, New York, 1989.
Capriz, G. and Mullenger, G.: Extended continuum mechanics for the study of granular flows, Rend.
Accad. Lincei, Mat. 6 (1995), 275–284.
Coleman, B. D. and Noll, W.: The thermodynamics of elastic materials with heat conduction and
viscosity, Arch. Rational Mech. Anal. 13 (1963), 167–178.
Cowin, S. C. and Nunziato, J. W.: Waves of dilatancy in a granular material with incompressible
granules, Internat. J. Engrg. Sci. 19 (1981), 993–1008.
Dunn, J. E. and Serrin, J.: On the thermomechanics of interstitial working, Arch. Rational Mech.
Anal. 88 (1985), 95–133.
Giovine, P.: Termodinamica dei Continui Granulari Dilatanti, Atti XII Congresso AIMETA ‘95,
Napoli, Italy, Meccanica dei Fluidi IV, pp. 287–292, 1995.
Giovine, P. and Oliveri, F.: Dynamics and wave propagation in dilatant granular materials, Meccanica
30 (1995), 341–357.
Goodman, M. A. and Cowin, S. C.: Two problems in the gravity flow of granular materials, J. Fluid
Mech. 45 (1971), 321–339.
Goodman, M. A. and Cowin, S. C.: A continuum theory for granular materials, Arch. Rational Mech.
Anal. 44 (1972), 249–266.
Holm, D. D., Marsden, J. E. and Ratiu, T. S.: The Euler–Poincaré equations and semidirect products
with applications to continuum theories, Adv. Math. 137 (1998), 1–81.
Holm, D. D., Marsden, J. E., Ratiu, T. S. and Weinstein, A.: Nonlinear stability of fluid and plasma
equilibria, Phys. Rep. 123 (1985), 1–116.
Hutter, K. and Rajagopal, K. R.: On flows of granular materials, Continuum Mech. Thermodyn. 6
(1994), 81–139.
Jenkins, J. T. and Savage, S. B.: A theory for the rapid flow of identical, smooth, nearly elastic,
spherical particles, J. Fluid Mech. 130 (1983), 187–202.
Johnson, P. C. and Jackson, R.: Frictional–collisional constitutive relations for granular materials,
with applications to plane shearing, J. Fluid Mech. 176 (1987), 67–93.
Reynolds, O.: On the dilatancy of media composed of rigid particles in contact, Philos. Magazine 20
(1885), 469–481.
Truesdell, C. and Noll, W.: The Non-Linear Field Theories of Mechanics, 2nd edn, Springer-Verlag,
Berlin, 1992.
Mathematical Physics, Analysis and Geometry 2: 197–220, 1999.
197
© 1999 Kluwer Academic Publishers. Printed in the Netherlands.
DANIEL H. LENZ
Fachbereich Mathematik, J. W. Goethe Universität, 60054 Frankfurt/Main, Germany.
e-mail: dlenz@math.uni-frankfurt.de
Introduction
Families of random operators arise in the study of disordered media. More pre-
cisely, one is given a topological space X and a family of operators (Hx )x∈X on
L2 (G). Here, X represents the set of “all manifestations” of a fixed kind of disorder
on the locally compact Abelian group G [3, 4].
The simplest example of a disordered medium is given by the periodic structure
of a crystal. In this case X is the quotient of G by the subgroup of periods. In the
general case X will not be a quotient of G, but there will still be an action α of G
on X. The fact that all points of X stem from the same kind of disorder structure is
taken account of by requiring the action to be ergodic.
Whereas for a fixed x ∈ X the operator Hx may not have a large symmetry
group, the whole family of operators will be G-invariant. This leads to the study of
this family as a new object of interest. This study is best performed in the context
of C ∗ -algebras. In fact, it turns out that the crossed products G ×α C0 (X) provide
a natural framework for these objects [3, 6, 7, 10, 36].
As it is, one is even led to a more general algebraic structure, viz C ∗ -algebras
of groupoids when studying certain quasicrystals modelled by tilings [21, 26, 27].
But this is not considerd here.
198 DANIEL H. LENZ
L2 (G × X) can easily be seen to be a Hilbert algebra (cf. [18]), i.e. to fulfil the
following conditions:
(i) ha | bi = hb∗ | a ∗ i a, b ∈ A;
(ii) ha ∗ b | ci = hb | a ∗ ∗ ci a, b, c ∈ A;
(iii) For a ∈ A the mapping b 7→ a ∗ b is continuous;
(iv) {a ∗ b : a, b ∈ A} is dense in L2 (G × X).
In particular, the action of A on itself from the left yields a representation of
A on L2 (G × X), which can be extended to a representation of G ×α C0 (X) on
L2 (G × X). These considerations will be given a more precise form in Section 2
in order to study G ×α L∞ (X). Now we prefer to introduce two representations,
that allow a direct integral decomposition.
where B(H ) denotes the algebra of bounded operators on the Hilbert space H .
Then it is easy to see that π and πx , x ∈ X, are continuous representations of
Cc (G × X). Their extensionsR to G ×α C0 (X) will also be denoted by π and πx .
⊕
Identifying L2 (G × X) with X L2 (G) dm, we get
Z ⊕
π(A) = πx (A) dm, A ∈ G ×α C0 (X).
X
Thus the crossed product is the norm closure of an algebra of certain integral
operators.
We remark that the mapping X 3 x 7→ πx (A) is strongly continuous for
A ∈ G ×α C0 (X), as can be directly calculated for A ∈ Cc (G × X), and then
follows by density arguments for all A ∈ G ×α C0 (X). The representation π has
RANDOM OPERATORS AND CROSSED PRODUCTS 201
two interesting and well known symmetry properties, that will be given in the next
proposition.
PROPOSITION 1.2.1. (a) Let DR be the algebra of diagonalisable operators on
⊕
the Hilbert space L2 (G × X) ' X L2 (G) dm, then
π(G ×α C0 (X)) ⊂ D 0 .
(b) π(G ×α C0 (X)) ⊂ {Tt ⊗ S−t : t ∈ G}0 . Moreover for t ∈ G, x ∈ X and
A ∈ G ×α C0 (X) the formula Tt παt (x) (A) Tt∗ = πx (A) holds.
Proof. (a) This is the fact that π(A) permits a direct integral decomposition.
(b) This can be directly calculated for A ∈ Cc (G × X) and then follows for
arbitrary A ∈ G ×α C0 (X) by a density argument. 2
Z
π tˆξ(x) := a(s, x)ξ(α−s (x))(tˆ−s | ) ds, a ∈ Cc (G × X), ξ ∈ L2 (X),
G
b For tˆ ∈ G
where (· | ·) denotes the dual pairing between G and G. b the mapping π tˆ
is then a representation of Cc (G × X), which has a unique continuous extension
to a representation of G ×α C0 (X), again denoted by π tˆ. For A ∈ G ×α C0 (X)
the mapping tˆ 7→ π tˆ(A) is strongly continuous, as can be seen using the same
arguments as in the case of the mapping x 7→ πx (A). Therefore, we can define a
representation
Z ⊕ Z ⊕ !
ˆt
e
π := π dtˆ : G ×α C0 (X) → B L2 (X) dtˆ ,
b
G b
G
U := (FG ⊗ I )W ∗ : L2 (G × X) → L2 (G b × X),
where FG : L2 (G) → L2 (Gb) is the Fourier transform and I the identity. Then we
R
have e ∗
π = U π U , where L2 (Gb × X) is identified with b⊕ L2 (X) dtˆ.
G
Remarks. (1) The crossed product G ×α C0 (X) is just the group C ∗ -algebra
C ∗ (G), if X consists of only one point. In this case we identify Cc (G × X) with
Cc (G) and we get π(ϕ)ξ = ϕ ∗ ξ = Tϕ ξ, e π (ϕ)ξ = M(F (ϕ))ξ, where Tϕ
denotes the operator of convolution with ϕ ∈ Cc (G) and M(ψ) denotes the op-
erator of multiplication with ψ. This implies e π (G ×α C0 (X)) = M(C0 (G)) b and
π(G ×α C0 (X)) = {F −1 Mψ F | ψ ∈ C0 (G)}. b
(2) The direct integral decomposition of e
π relies essentially on the symmetry
π(G ×α C0 (X)) ⊂ {Tt ⊗ S−t : t ∈ G}0 ,
as can be seen in the following way: Using W (Tt ⊗ I ) W ∗ = Tt ⊗ S−t , one gets
immediately U (Tt ⊗ S−t )U ∗ = Mt ⊗ I, where Mt denotes the operator of multipli-
b Therefore, we have U π(A)U ∗ ∈ {Mt ⊗ I | t ∈ G}0 ,
cation with (t | ·) on L2 (G).
and this implies (cf. 5, Ch. 2, II in [18]) that U π(A)U ∗ has a direct integral
decomposition.
The lemma and the definitions of G ×α L∞ (X) and L(A) directly yield
THEOREM 2.1.4. AdW : G ×α L∞ (X) → L(A), A 7→ W ∗ AW is a spatial
isomorphism of von Neumann algebras.
Those operators which are inverse images of left bounded operators under AdW
will play an important role.
DEFINITION 2.1.5. (a) A function a ∈ L2 (G × X) is called the kernel of the
operator A ∈ G ×α L∞ (X) if a is left bounded and A = W La W ∗ .
(b) Let K := {A ∈ G ×α L∞ (X) | A has a kernel}.
We study K in the next proposition.
C := {Tt ⊗ I, W (I ⊗ Mv )W ∗ : t ∈ G, v ∈ L∞ (X)}.
CW Ra W ∗ = W Ra W ∗ C, a ∈ Cc (G × X), C ∈ C.
holds, where the Atˆ are uniquely determined up to a set of measure zero. From now
on we will identify G ×α C0 (X) with π(G ×α C0 (X)). For A in G ×α C0 (X) we
b := U AU ∗ . For A ∈ G ×α L∞ (X) we define
set Ax := πx (A), Atˆ := π tˆ(A) and A
ˆt
the Ax and A by
Z ⊕ Z ⊕
A= Ax dm and U AU = ∗
Atˆ dtˆ.
X b
G
The fact that these families are only defined up to a set of measure zero will be no
inconvenience.
Remark 1. It is always possible to choose the Ax such that
Tt∗ Aαt (x) Tt = Ax
holds for all x ∈ X and all t ∈ G. This can be seen in the following way: Theo-
rem 2.1.7 implies G ×α L∞ (X) ⊂ {Tt ⊗ S−t | t ∈ G}0 . In particular, we have for
fixed t ∈ G
Tt Aαt (x) Tt∗ = Ax , a.e. x ∈ X.
Therefore we get, using the Fubini theorem, that the family of operators defined by
fx ξ | ηi := M(t 7→ hTt Aαt x Tt∗ ξ | ηi),
hA
where M is the mean on the Abelian group G, coincides almost everywhere with
fx has the required
the family Ax . Moreover, it is easy to see that the family A
invariance property.
206 DANIEL H. LENZ
EC = CE = C, C ∈ C.
τ1 (CIλ C ∗ ) = τ2 (CIλ C ∗ ).
τ (AA∗ ) = ha | ai
for A with kernel a. The trace τ is semifinite and τ = τc ◦ AdW ∗ , where τc is the
canonical trace on L(A) (cf. 2, Ch. 6, I of [18]). Moreover
Proof. Clearly the identity of B(L2 (G × X)) is contained in the von Neumann
algebra G ×α L∞ (X) = π(G ×α C0 (X))00 and we can apply the foregoing lemma
with J = K to get the uniqueness.
As AdW is an isomorphism by Theorem 2.1.4, the remaining statements follow
easily from the corresponding statements in 2, Ch. 6, I of [18]. 2
RANDOM OPERATORS AND CROSSED PRODUCTS 207
for Borel measurable B ⊂ R. The map µ is called the integrated density of states
(IDS) for A (cf. [6]).
There is another way to calculate the trace that can be seen as an application of
[13] (cf. Remark 1 below).
for positive g ∈ L∞ (G), where tr denotes the usual trace on B(L2 (G)).
Proof. Uniqueness is obvious. Existence will follow from the uniqueness of the
Haar measure on G, once we have shown that the RHS of the equation induces an
invariant measure.
As A is positive there exists C ∈ G ×α L∞ (X) with A = C ∗ C. We calculate
Z Z
µ(B) := tr(MχB Ax MχB ) dm = tr(MχB Cx∗ Cx MχB ) dm.
X X
R
Using that tr is a trace we conclude µ(B) = X tr(Cx MχB Cx∗ ) dm.
This formula and some simple monotone convergence arguments show that µ
is a measure with
Z Z Z
g 2 dµ = tr(Cx Mg 2 Cx∗ ) dm = tr(Mg Ax Mg ) dm.
G X X
208 DANIEL H. LENZ
THEOREM 2.2.7. 3 = τ .
Proof. As K is a strongly dense ideal in G ×α L∞ (X) by Proposition 2.1.6, it
is by Lemma 2.2.1 enough toR show 3(A∗ A) = τ (A∗ A) for A ∈ K. Choosing
a positive g ∈ L∞ (G) with G g 2 dt = 1, we calculate for A ∈ K with kernel
a ∈ L2 (G × X)
Z
∗
3(A A) = tr(Mg A∗x Ax Mg ) dm
Z Z
X
= |g(t)a(t − s, αt (x))|2 dt ds dm
Z
X G×G
Z Z !
(Fubini) = |g(t)|2 |a(t − s, αt (x))|2 dm ds dt
G G X
Z Z Z !
(m, ds transl. inv. ) = |g(t)|2
|a(s, x)| dm ds dt,
2
G G X
follows from (∗) immediately, as the equation χI (C) = χI (C)χI (C)∗ holds for
arbitrary operators C. Note that tr(χI (Bx |Hn )) is essentially the number of eigen-
values of Bx |Hn in I . For I = (−∞, E], E ∈ R, this type of equation has been
established for pseudodifferential operators with almost periodic coefficients in
[36], for Schrödinger operators in [6] using heat equation methods and for discrete
G in [3]. It is called Shubin’s trace formula.
Remarks. (1) In [13] transverse functions and transverse measures on groupoids
are introduced and studied (cf. I, 5 of [14] as well). It is possible to give G × X the
structure of a groupoid. The measure m then induces a unique transverse measure
3 with certain properties. It is possible to show that 3 satisfies the equation
Z Z
3(ξ ) f dt = ξx (f ) dm
G X
for transverse functions ξ and f ∈ L∞ (G). A direct calculation shows that for
A ∈ (G ×α L∞ (X))+ the mapping ξ x (B) := tr(χB Ax χB ) is a transverse function
(if the components Ax are chosen according to Remark 1 in Section 2.1). In this
context Theorem 2.1.7 says essentially 3(ξA ) = τ (A).
(2) In [1] it is shown that for a family Aω , ω ∈ , of almost periodic Schrödinger
operators and F ∈ C0 (R) the equation
Z Z
tr(Mg F (Aω )Mg ) dm = F dk
R
holds, where the measure dk is given by a certain limit procedure. Using Definition
2.2.3 and Theorem 2.2.7 we see dk = dµA .
210 DANIEL H. LENZ
If m(X) < ∞ (e.g. if X is compact) there exist two alternative formulas for the
trace on G ×α L∞ (X). They will now be discussed.
Define for A ∈ G ×α L∞ (X) the operator Am : L2 (G) → L2 (G) by
Z
hAm ξ | ηi := hAx ξ | ηi dm, ξ, η ∈ L2 (G).
X
where 1 denotes the function of L2 (X) with constant value 1. Then the following
holds.
THEOREM 2.3.1. τ = µ = τ∞ ◦ J .
Proof. We will show (1) τ = τ∞ ◦ J and (2) τ = µ.
(1) τ = τ∞ ◦ J : By Theorem 2.2.7, it is enough to show 3(A) = τ∞ ◦ J (A)
for A ∈ (G ×α L∞ (X))+ . For such an A let Am and the function φ be defined as
above, i.e. Am = F −1 Mφ F and Mφ = J (A). Choosing a positive g ∈ L∞ (G) with
kgkL2 (G) = 1 we calculate
Z
3(A) = tr(Mg Ax Mg ) dm = tr(Mg Am Mg )
X
= tr(Mg F −1 Mφ1/2 F F −1 Mφ1/2 F Mg ).
For φ ∈ L1 (G) b the operator K = Mg F −1 Mφ1/2 F is a Hilbert–Schmidt opera-
Rtor with kernel k(t, x) = g(t)F −1 (φ 1/2 )(t − s). Thus the formula tr(K ∗ K) =
G×G |k(t, s)| dt ds holds and we get
2
Z Z Z
−1
3(A) = |g(t)F (φ )(t − s)| dt ds = kφ k 2 b =
1/2 2 1/2
φ(tˆ) dtˆ,
G G
L (G)
Gb
where we used the translation invariance of dt, kgkL2 (G) = 1, and the fact that the
Fourier transform is an isometry. R
For arbitrary φ the equality 3(A) = G b φ(tˆ) dtˆ now follows by a simple
monotone limit procedure.
RANDOM OPERATORS AND CROSSED PRODUCTS 211
hMφ ξ | ηi = hAm F −1 ξ | F −1 ηi
Z
= hAx F −1 ξ | F −1 ηi dm
X
= hA(I ⊗ F −1 )(1 ⊗ ξ ) | (I ⊗ F −1 )(1 ⊗ η)i
= hAU ∗ (1 ⊗ ξ ) | U ∗ (1 ⊗ η)i
= hÂ(1 ⊗ ξ ) | (1 ⊗ η)i
= hMψ ξ | ηi.
Remarks. (1) The expression µ was used in [25] (cf. also [9]).
(2) The mapping J was first introduced by Coburn, Moyer and Singer [10] (cf.
also [36]) in their paper on almost periodic operators.
(3) If X consists of only one point it is a forteriori compact. The positive oper-
ators in G ×α C0 (X) are just the operators A = FG−1 Mϕ FG where ϕ ∈ C0 (G) b is
positive (cf. Remark 1 in Section 1). The trace of such an A is then given by
Z
τ (A) = ϕ(tˆ) dtˆ,
b
G
We close this section with some remarks on the special case that G is discrete. As
in this case the function δe : G → C defined by δe (t) = 1 for t = e and δe (t) = 0
for t 6= e is positive, bounded with kδe kL2 (G) = 1, we get easily
Z
τ (A) = hAx δe | δe i dm.
X
J : G ×α L∞ (X) → L∞ (X)
212 DANIEL H. LENZ
with τ = τ∞ ◦ J , where τ∞ is the usual trace on L∞ (X) (cf. [31] for a thorough
discussion of this case).
We will look at the following situation: Let G and X be locally compact Abelian
groups, and let j : G → X be a group homomorphism. This induces a homomor-
b → G,
phism j ∗ : X b where Xb (resp. G)
b denotes the dual group of X (resp. G). Then
there is an action α of G on X given by
αt : X → X, αt (x) := x + j (t),
b on G
and an action of X b given by
b → G,
α̂ x̂ : G b α̂ x̂ (tˆ) := tˆ + j ∗ (x̂).
Similarly to the unitaries Tt and St resp., acting on L2 (G) and L2 (X) resp. for
t ∈ G, there are unitaries Tx̂ and Sx̂ defined by
b → L2 (G),
Sx̂ : L2 (G) b Sx̂ ξ(tˆ) := ξ(α̂ (−x̂) (tˆ)),
b → L2 (X),
Tx̂ : L2 (X) b Tx̂ ξ(ŷ) := ξ(ŷ − x̂).
Moreover, we define
W b × G)
b : L2 (X b → L2 (X
b × G)
b by W
b ξ(x̂, tˆ) := ξ(x̂, α̂ x̂ (tˆ))
e by
and U
e := (FX ⊗ FG )W ∗ = (FX ⊗ I )U,
U
where FX and FG are the Fourier transfom on L2 (X) and L2 (G) resp. and U and
W are as defined in the first section.
We will first establish a spatial isomorphism between the von Neumann algebras
G ×α L∞ (X) and X b ×α̂ L∞ (Gb). This can be done quite easily, as the generators
of these von Neumann algebras are known explicitly.
RANDOM OPERATORS AND CROSSED PRODUCTS 213
We will then provide proofs for some statements first appearing in [6] and [7],
that yield much more, namely an isomorphism between G ×α C0 (X) and X b ×α̂
b).
C0 (G
We will need the following propositions.
PROPOSITION 3.1.1. (a) W ∗ (Tt ⊗I ) = (Tt ⊗St )W ∗ , W b ∗ (Tx̂ ⊗I ) = (Tx̂ ⊗Sx̂ )W ∗ .
(b) W (Tt ⊗ I ) = (Tt ⊗ S−t )W, Wb (Tx̂ ⊗ I ) = (Tx̂ ⊗ S−x̂ )W .
∗
e b
(c) U W = W U . e
Proof. This can be seen by direct calculation. 2
b ×α̂ C0 (G).
X b We remark that both of these facts have already been stated in [6]
and [7], where, however, no proof was given.
LEMMA 3.1.4. Let a ∈ L2 (G × X) be the kernel of a bounded operator A on
L2 (G × X), i.e.
Z
Aξ(t, x) = a(t − s, αs (x))ξ(s, x) ds.
G
e := U
Then A eAU is a bounded operator on L2 (X
e∗ b × G)
b with kernel
b × G),
â := (FX ⊗ FG )W a ∈ L2 (X b
i.e.
Z
e x̂, tˆ) =
Aξ( â(x̂ − ŷ, α̂ x̂ (tˆ))ξ(ŷ, tˆ) dŷ.
X̂
b × G).
Proof. Let ξ̂ := (FX ⊗ FG )(ξ ) be an arbitrary function in L2 (X b We
calculate
eAU
U e∗ ξ̂(x̂, tˆ)
= (FX ⊗ FG )W ∗ AW ξ(x̂, tˆ)
Z !
= (FX ⊗ FG ) (t, x) 7→ a(t − s, x)W ξ(s, α−t (x)) ds (x̂, tˆ)
G
Z !
= (FX ⊗ FG ) (t, x) 7→ (W a)(t − s, αt −s (x))ξ(s, αt −s (x)) ds (x̂, tˆ).
G
R
As for fixed t ∈ G the mapping x 7→ G |(W a)(t − s, αt −s (x))ξ(s, αt −s (x))| ds
belongs to L1 (X), this expression equals
Z Z ! !
(I ⊗ FG ) (x̂ | −x) (W a)(t − s, αt −s (x))ξ(s, αt −s (x)) ds dx (x̂, tˆ),
X G
b ×α̂ C0 (G
THEOREM 3.1.5. The mapping AdUe : G ×α C0 (X) → X b), A 7→ U
eAU
e∗
∗
is an isomorphism of C -algebras.
RANDOM OPERATORS AND CROSSED PRODUCTS 215
It is easy to see that D is in fact dense in G ×α C0 (X). The lemma yields that for
A = π(W ∗ (g ⊗ (h1 ∗ h2 ))) ∈ D the operator AdUe (A) has the kernel
We now provide a proof for another theorem which was already stated (without
proof) in [6] and [7].
THEOREM 3.1.6. Let τ (resp. τ̂ ) be the trace on G ×α L∞ (X) (resp. X b ×α̂
∞ b e e ∗
L (G)) defined in the last section. Then the equation τ (A) = τ̂ (U AU ) holds
for all A ∈ (G ×α L∞ (X))+ .
Proof. It is enough to consider the case A = BB ∗ with B ∈ K with kernel b
(cf. Lemma 2.2.1). Then the kernel of U eB Ue∗ is given by (FG ⊗ FX )(W b) and we
have
Z Z
eAU
τ̂ (U e∗ ) = |(FG ⊗ FX )(W b)(x̂, tˆ)|2 dx̂ dtˆ
b
G b
X
Z Z
= |W b(t, x)|2 dt dx
G X
Z Z
= |b(t, x)|2 dt dx
X G
= τ (BB ∗ ).
τ (AA∗ ) = b
τ (AdŨ (A)AdŨ (A)∗ ).
216 DANIEL H. LENZ
Remarks. (1) Kaminker and Xia show in [25] by the use of a spectral duality
principle that certain elliptic periodic operators have purely continuous spectra on
the complement of the set of discontinuities of λ 7→ τ (EA ((−∞, λ])). Theorem
3.2.2 shows in particular that this holds for arbitrary periodic operators for purely
algebraic reasons.
(2) For periodic Schrödinger operators it is possible to show that the spectrum
is purely absolutely continuous using some analyticity arguments (cf. [33] and
references therein).
We finish this section with a short discussion of another formula for τ . Let
H⊥ ⊂ Gb be the annihilator of H and let q: Gb → G/Hb ⊥ denote the canonical
RANDOM OPERATORS AND CROSSED PRODUCTS 217
we calculate
Z
τ (A) = hAtˆδe | δe i dtˆ
b
ZG
The RHS of this equation is essentially the integrated density of states defined
in Ch. XIII of [33] for periodic operators.
4. Spectral Duality
By spectral duality we mean a relation between the spectral types of Ax , x ∈ X,
b of the form “If Atˆ has pure point spectrum a.e.
and the spectral types of Atˆ, tˆ ∈ G,
b, then Ax has purely (absolutely) continuous spectrum a.e. x ∈ X”.
tˆ ∈ G
Theorems of this form have been stated in [7, 9, 25]. We cite the theorem of [25].
THEOREM 4.1. Let A ∈ G ×α L∞ (X) be selfadjoint with purely continuous
spectrum on a Borel set E s.t. Atˆ has pure point spectrum on E for almost all
b then Ax has purely continuous spectrum on E for almost all x ∈ X.
tˆ ∈ G,
In [22] another form of duality is proven for the Almost Mathieu Equation. The
method developed there can be carried over with only small changes to give
THEOREM 4.2. Let (Z, α, X, m) be a dynamical system and assume that X is a
complete metric space. Let A ∈ Z×α L∞ (X) be selfadjoint with spectral family EA
s.t. Ax has only pure point spectrum with simple eigenvalues for almost all x ∈ X.
Then µ(B) := τ (EA (B)) is a spectral measure for Aη for almost all η ∈ bZ =: S 1 .
218 DANIEL H. LENZ
s.t. ϕjl (x) = 0 for all j ∈ Z, x ∈ X, l ∈ N with Nj (x) < l, the set
for all x ∈ X as either ϕjl (x) and Tk ϕjl (αk (x)) = ϕjl +k (x) are different members of
an ONB or at least one of them is zero.
The simplicity of the eigenvalues is crucial to get this measurable section of
eigenfunctions. We will now show (cf. [22])
(1) Fix ψ ∈ L2 (X), F ∈ C0 (R), j ∈ Z, l ∈ N. Let ξ(z, x) := ψ(x)ϕjl (x)(z),
ξ̂η (x) := U ξ(η, x) and
µη (F ) := hξ̂η | F (Aη )ξ̂η iL2 (X) .
Then µη (F ) is independent of η a.e. η.
(2) µη does not depend on η a.e. η.
(3) There exist νη not depending on η s.t. νη is a spectral measure for Aη for a.e. η.
By the remarks at the beginning
R of the proof, the theorem follows from (3).
(1) It is enough to show 0 = S 1 (η | z)µη (F ) dη for all z ∈ Z with z 6= 0. We
calculate
Z
(η | z)µη (F ) dη = h(I ⊗ Mz )U ξ | U F (A)ξ i
S1
= hU (Tz ⊗ Sz )ξ | U F (A)ξ i
= h(Tz ⊗ Sz )ξ | F (A)ξ i.
Using that ϕjl (x) is an eigenvector corresponding to the eigenvalue ejl (x) say, we
get
Z
··· = F (ejl (x))ψ(αz (x))ψ(x)hTz ϕjl (αz (x)) | ϕjl (x)i dm
X
= 0,
RANDOM OPERATORS AND CROSSED PRODUCTS 219
where we used the relation hTz ϕjl (αz (x)) | ϕjl (x)i = 0.
(2) As C0 (R) is separable, this follows from (1).
(3) Let {ψm } be an ONB of L2 (X). Then {ψm ⊗ ϕjl } is an ONB in L2 (Z × X)
and, as U is unitary, it follows that the ξl,j,m := U (ψm ⊗ ϕjl ) form an ONB in
L2 (Ẑ × X). Thus the set T := {ξl,j,m (η, ·) | l, j, m} is total in L2 (X) for almost all
η ∈ S 1 . (Notice that the set Mϕ := {η ∈ S 1 | ϕ ⊥ ξl,j,m (η, ·)∀l, j, m} has measure
zero for each ϕ ∈ L2 (G).) Therefore the measures ν η defined by
X
ν η (B) := hχB (Aη )ξl,j,m (η, ·) | ξl,j,m (η, ·)iL2 (X)
l,j,n
are spectral measures for almost all η ∈ S 1 , which do not depend on η by (2). The
theorem follows. 2
Acknowledgements
The author would like to thank P. Stollmann for many useful discussions. Financial
support from Studienstiftung des deutschen Volkes (Doktorandenstipendium) is
gratefully acknowledged.
References
1. Avron, J. and Simon, B.: Almost periodic Schrödinger Operators, II. The integrated density of
states, Duke Math. J. 50 (1982), 369–391.
2. Bellissard, J.: Schrödinger operators with an almost periodic potential, in R. Schrader and T.
Seiler (eds), Mathematical Problems in Theoretical Physics, Springer, Berlin, 1982, pp. 356–
363.
3. Bellissard, J.: K-Theory of C ∗ -algebras in solid state physics, in T. C. Dorlas, N. M. Hugenholz
and M. Winnik (eds), Statistical Mechanics and Field Theory: Mathematical Aspects, Lecture
Notes in Phys. 257, Springer, Berlin, 1986, pp. 99–156.
4. Bellissard, J.: Gap labelling theorems for Schrödinger operators, in M. Waldschmitt, P. Moussa,
J. M. Luck and C. Itzykson (eds), From Number Theory to Physics, Springer, New York, 1992,
pp. 538–630.
5. Bellissard, J., Bovier, A. and Ghez, J.-M.: Gap labelling theorems for Schödinger operators,
Rev. Math. Phys. 4 (1992), 1–37.
6. Bellissard, J., Lima, R. and Testard, D.: Almost periodic Schrödinger operators, in L. Streit
(ed.), Mathematics + Physics, World Scientific, Singapore, 1985, pp. 1–64.
7. Bellissard, J. and Testard, D.: Quasi-periodic Hamiltonians. A mathematical approach, Proc.
Sympos. Pure Math. 38 (1982), 579–582.
8. Bohr, H.: Almost Periodic Functions, Chelsea Publishing Company, 1947.
9. Chojnacki, W.: A generalized spectral duality theorem, Comm. Math. Phys. 143 (1992), 527–
544.
10. Coburn, L. A., Moyer, R. D. and Singer, I. M.: C ∗ -algebras of almost periodic pseudo-
differential operators, Acta Math. 139 (1973), 279–307.
11. Combes, J. M. and Hislop, P. D.: Localization for some continuous, random Hamiltonians in
d-dimensions, J. Funct. Anal. 124 (1994), 149–180.
12. Combes, J. M., Hislop, P. D. and Mourre, E.: Spectral averaging, perturbation of singular
spectra and localization, Trans. Amer. Math. Soc. 348 (1996), 4883–4894.
13. Connes, A.: Sur la théorie non commutative de l’intégration, Lecture Notes in Math. 725,
Springer, Berlin, 1979.
220 DANIEL H. LENZ
14. Connes, A.: Noncommutative Geometry, Academic Press, San Diego, 1994.
15. Davidson, K. R.: C ∗ -Algebras by Example, Fields Institute Monographs, 1996.
16. Delyon, F. and Souillard, B.: The rotation number for finite difference operators and its
properties, Comm. Math. Phys. 89 (1983), 415–426.
17. Dixmier, J.: C ∗ -Algebras, North-Holland, Amsterdam, 1975.
18. Dixmier, J.: Von Neumann Algebras, North-Holland, Amsterdam, 1981.
19. van Elst, A.: Gap-labelling theorems for Schrödinger operators on the square and cubic lattice,
Rev. Math. Phys. 6 (1994), 319–342.
20. Folland, G. B.: A Course in Abstract Harmonic Analysis, Stud. Adv. Math., CRC Press, Bocca
Raton, 1995.
21. Forrest, A., Hunton, J. and Kellendonk, J.: Projection quasicrystals I: Toral rotations, Preprint.
22. Gordon, A., Jitomirskaya, S., Last, Y. and Simon, B.: Duality and singular continuous spectrum
in the almost Mathieu equation, Acta Math. 178 (1997), 169–183.
23. Haagerup, U.: On the dual weights for crossed products of von Neumann algebras. I. Removing
separability conditions; II Application of operator valued weights, Math. Scand. 43 (1978),
99–118, 119–140.
24. Johnson, R. and Moser, J.: The rotation number for almost periodic potentials, Comm. Math.
Phys. 84 (1982), 403–438.
25. Kaminker, J. and Xia, J.: The spectrum of operators elliptic along the orbits of R n actions,
Comm. Math. Phys. 110 (1987), 427–438.
26. Kellendonk, J.: Nonncommutative geometry of tilings and gap labelling, Rev. Math. Phys. 7
(1995), 1133–1180.
27. Kellendonk, J.: The local structure of tilings and their integer group of coinvariants, Comm.
Math. Phys. 187 (1997), 115–157.
28. Kirsch, W., Stollmann, P. and Stolz, G.: Localization for random perturbations of periodic
Schrödinger operators, Random Operators Stochastic Equations 6 (1998), 241–268.
29. Klopp, F.: Localization for some continuous random Schrödinger operator, Comm. Math. Phys.
167 (1995), 553–569.
30. Maak, W.: Fastperiodische Funktionen, Springer, Berlin, 1950.
31. Pedersen, G. K.: C ∗ -Algebras and Their Automorphism Groups, Academic Press, London,
1987.
32. Pimsner, M. and Voiculescu, D.: Exact sequences for K-groups and Ext groups of certain cross
product C ∗ -algebras, J. Operator Theory 4 (1980), 93–118.
33. Reed, M. and Simon, B.: Analysis of Operators IV, Academic Press, San Diego, 1980.
34. Rieffel, M.: C ∗ -algebras associated with irrational rotations, Pacific J. Math. 93 (1981), 415–
429.
35. Rudin, W.: Functional Analysis, Tata McGraw-Hill, New Delhi, 1974.
36. Shubin, M. A.: The spectral theory and the index of elliptic operators with almost periodic
coefficients, Russian Math. Surveys 34 (1979), 109–157.
37. Takesaki, M.: Tomita’s Theory of Modular Hilbert Algebras and Its Applications, Lecture Notes
in Math. 128, Springer, New York, 1970.
38. Takesaki, M.: Theory of Operator Algebras, I, Springer, New York, 1979.
39. Tomiyama, J.: Invitation to C ∗ -Algebras and Topological Dynamics, World Scientific, 1987.
40. Tomiyama, J.: The Interplay between Topological Dynamics and Theory of C ∗ -Algebras, Lec-
ture Note Series, Vol. 2, Research Institute of Mathematics, Global Analysis Research Center,
Seoul National University, 1992.
41. Tomiyama, J.: C ∗ -algebras and topological dynamical systems, Rev. Math. Phys. 8 (1996),
741–760.
42. Weidmann, J.: Linear Operators in Hilbert Spaces, Grad. Texts in Math. 68, Springer, New
York, 1980.
Mathematical Physics, Analysis and Geometry 2: 223–244, 1999.
223
© 1999 Kluwer Academic Publishers. Printed in the Netherlands.
MICHAEL DEMUTH
Institute of Mathematics, Technical University of Clausthal, 38678, Clausthal-Zellerfeld, Germany
KALYAN B. SINHA?
Indian Statistical Institute, New Delhi, India
for some α > 1. This result is applied to potential perturbations and to perturbations by imposing
Dirichlet boundary conditions.
1. Introduction
The main objective of this article is to prove the absence of the singularly continu-
ous spectrum for self-adjoint operators H in L2 (Rd ) which are perturbations of the
Laplacian H0 = −1. The proofs are based on the theory of complete scattering
systems. Instead of studying H0 and H directly we investigate scattering systems
for functions of H0 and H , respectively. To this aim, the abstract time-dependent
completeness theory is generalized to such functions. In particular, the function
t → e−λt is admissible. In many cases, the corresponding semigroups are integral
operators, the kernels of which can be represented by the Feynman–Kac formula
in terms of the conditional Wiener measure. Using this, the abstract operator theo-
retical conditions can be simplified. For potential perturbations and for the case of
obstacle scattering, one gets explicit conditions for the potential and the obstacle
? Research partly supported by Jawaharlal Nehru Centre for Advanced Scientific Research,
Bangalore.
224 MICHAEL DEMUTH AND KALYAN B. SINHA
2. Results
Some of the following ideas go back to Enss [5 – 7]. His proof of completeness was
oriented towards studying potential perturbations of the Laplacian. Here we have
mainly the obstacle scattering in mind where the potentials are arbitrarily high over
certain regions in Rd . For that we transform the problem of the completeness of the
scattering system {H, H0 } to the question of whether the system {ϕ(H ), ϕ(H0)} is
complete, where ϕ is a real-valued function with positive or negative first deriv-
ative. The abstract operator theoretical conditions in Theorem 1 ensure that the
singularly continuous subspace of H vanishes, i.e. the singularly continuous spec-
trum is empty. In Corollary 4, these general conditions are translated to semigroups,
i.e. to ϕ(H ) = e−λH . If the semigroups are ultracontractive one gets integral
conditions for their kernels which guarantee the absence of singularly continuous
spectrum (see Theorem 5).
The present section gives the results which will be proved in Section 3. In Sec-
tion 4, we apply the theory to potential perturbations and to singular perturbations
obtained by imposing Dirichlet conditions on the obstacle region. It turns out that
the following conditions are sufficient: For potential scattering (see Theorem 13,
Remark 14)
or
Z
dx(1 + |x|2 )α |V (x)| < ∞, with some α > 1.
Rd
For scattering by an obstacle, the conditions are given in terms of the equilibrium
potential v0 (·) (cf. (79)) of the singularity region 0 or in terms of its capacity (see
(81) and
R Theorem 15, Corollary 16). The theory can be applied if cap(0) is finite
and if dxv0 (x)(1 + |x|2 )α < ∞ for some α > 1. This allows unbounded 0 (see
Example 17).
We introduce the following notation. Let H0 be the self-adjoint realization of
the Laplacian in L2 (Rd ). Let H be an arbitrary self-adjoint operator in L2 (Rd ) and
assume H > −1. For the sake of abbreviation, we sometimes use H := L2 (Rd ).
Let ϕ be a real-valued function in C ∞ (R). We assume either ϕ 0 (λ) > 0 or ϕ 0 (λ) <
0 for all λ ∈ σc (H0 ) ∪ σc (H ), where σc (·) denotes the continuous spectrum of the
corresponding operator. Functions ϕ with these properties are called admissible.
SCHRÖDINGER OPERATORS WITH EMPTY SINGULARLY CONTINUOUS SPECTRA 225
In fact, we prove somewhat more. Assumptions (i)–(iii) imply the existence and
completeness of the wave operators
± (ϕ(H ), ϕ(H0)) = s − lim eit ϕ(H ) e−it ϕ(H0 ) Pac (ϕ(H0 )), (7)
t →±∞
where Pac (ϕ(H0 )) is the projection operator onto the absolutely continuous sub-
space of the self-adjoint operator ϕ(H0 ); here Pac (ϕ(H0))H = H. We prove
ran[± (ϕ(H ), ϕ(H0))] = Hc (ϕ(H )), (8)
226 MICHAEL DEMUTH AND KALYAN B. SINHA
i.e., the range of + and the range of − are equal to the continuous subspace of
ϕ(H ).
Remark 2. H and ϕ(H ) have the same spectral measure. Hence, Hc (ϕ(H )) =
Hc (H ). On the other hand, Hac (ϕ(H )) = Hac (H ) because ϕ 0 6= 0 almost every-
where (see [10], p. 518, Example 1.9). On the other hand,
ran[+ (ϕ(H ), ϕ(H0))] = ran[− (ϕ(H ), ϕ(H0))] ⊆ Hac (ϕ(H ))
⊆ Hc (ϕ(H )).
This implies
Hac (H ) = Hac (ϕ(H )) = Hc (ϕ(H )) = Hc (H )
or
Hsc (H ) = {θ}.
Thus, the existence of the wave operators in (7) and the identity (8) imply the
assertions in (5) and (6).
and
Hsc (H ) = {2},
i.e., σsc (H ) = ∅.
THEOREM 5. Let λ > 0 be fixed but arbitrary and let e−λH be an integral
operator in L2 (Rd ) with the kernel (e−λH )(x, y) admitting an estimate of the form
|x−y|2
|e−λH (x, y)| 6 cλ− 2 e−γ
d
λ ecλ (11)
with 0 < γ < 14 .
Assume, furthermore, that the kernel Dλ (x, y) of the integral operator
Dλ := e−λH − e−λH0 (12)
satisfies one of the following conditions: Either
Z
sup |x|α |Dλ (x, y)| dy < ∞ (13)
|x|>R >1 2|y|>R
3. Proofs
3.1. PROOF OF THEOREM 1
DEFINITION 6. Let F be the Fourier transform from L2 (Rd , dx) to L2 (Rd , dk).
Let U be a transformation from L2 (Rd , dk) to L2 (R, db; L2 (S d−1 )) given by
1 d b
(U Ff )b (ω) = √ e 4 b (Ff )(e 2 ω), (16)
2
228 MICHAEL DEMUTH AND KALYAN B. SINHA
ω ∈ S d−1 , where S d−1 is the unit sphere of Rd and L2 (R, b; L2 (S d−1 )) denotes the
space of square-integrable L2 (S d−1 )-valued functions on R.
(Note that U and F are unitary. For a smooth function ψ(·): R → R we obtain
(U F ψ(H0 )f )b (ω) = ψ(eb )(U Ff )b (ω), f ∈ dom(ψ(H0 )),
i.e., U F maps into the spectral representation in which ln(H0 ) is diagonal.)
Let G be the one-dimensional Fourier transform with respect to b, i.e.,
Z
1
(GU Ff )a (ω) := √ e−iab (U Ff )b (ω) db. (17)
2π R
G maps L2 (R, db; L2 (S d−1 )) to L2 (R, da; L2 (S d−1 )).
Now we define the operators P+ , P− by
P+ f := F ∗ U ∗ G∗ Mχ(a>0) GU Ff, (18)
P− f := F ∗ U ∗ G∗ Mχ(a<0) GU Ff. (19)
Here (Mg f )(x) := g(x)f (x) is the multiplication operator by the function g (see
(3)) and χ(a>0) denotes the characteristic function of the set {a : a > 0}.
Clearly, P+ , P− are orthogonal projection operators satisfying (15).
Similarly, we define an operator Pαt on L2 (Rd ) for α > 0 by
Pαt f := F ∗ U ∗ G∗ Mχ(|a|6αt )GU Ff. (20)
First we prove the last corollary, then Lemma 7. Finally, we prove that (21),
(22) are valid.
Proof of Corollary 8.
Preremark. Assumption (ii) in Theorem 1 is only used to show that the dif-
ference ψ(H ) − ψ(H0 ) is a compact operator in H for any function ψ(·) ∈
Cc∞ (R).
Proof. Set
Z T
1
ε+ (g(·)) := lim g(t) dt (28)
T →∞ T 0
[12], p. 23 ff). The term in (32) vanishes because of the intertwining relation and
(25). Finally, one has
kf k2 = ε+ (e−iϕ(H )· f, + ψ(H0 )P− e−iϕ(H )· f ). (35)
Since D is dense, the identity in (35) is true for any f ∈ Hc (ϕ(H )). Now, if
ran(+ ) ⊂ Hc (ϕ(H )) there would be an f0 , f0 6= θ, with f0 ⊥ ran(+ ), i.e.
∗+ f0 = 0. Then the intertwining property of the wave operator + together with
(35) would imply f0 = θ. This is a contradiction. Hence, ran(+ ) = Hc (ϕ(H )). 2
Proof of Lemma 7. First we prove (25) and (26). For that we define the positive
self-adjoint operator A2 in L2 (Rd ) by
A2 f := F ∗ U ∗ G∗ Ma 2 GU Ff, (36)
which is densely defined with
dom(A2 ) := {f ∈ L2 (Rd ), A2 f ∈ L2 (Rd )}.
Then, by (21),
k(1 + A2 )−1 ei|t |ϕ(H0 ) ψ(H0 )P+ k (37)
6 k(1 + A2 )−1 (1 − Pαt )k + k(1 + A2 )−1 Pαt ei|t |ϕ(H0 ) ψ(H0 )P+ k
1
6c + ckPαt ei|t |ϕ(H0 ) ψ(H0 )P+ k
1 + α2t 2
1
6c .
1 + α2t 2
Hence,
lim kP+ ψ(H0 )e−i|t |ϕ(H0 ) (1 + A2 )−1 k = 0. (38)
t →∞
The range of (1 + A2 )−1 is dense in H by the definition of A2 and (38) is true for
any ψ ∈ Cc∞ (R). The set {ran[ψ(H0 )(1 + A2 )−1 ], ψ ∈ Cc∞ } is dense in H. Thus,
(21) and (38) imply
s − lim P+ e−it ϕ(H0 ) = 0.
t →∞
Similarly, (26) will follow from (22).
Now we show the compactness of ζ2−1 (H )(− − 1I)ψ(H0 )P+ . For the moment,
the wave operator is assumed to exist (the proof is given in Lemma 10). Thus, we
have
ζ2−1 (H )(− (ϕ(H ), ϕ(H0)) − 1I)ψ(H0 )P+
Z 0
= −i ds eisϕ(H ) ζ2−1 (H )[ϕ(H ) − ϕ(H0 )]ζ1−1 (H0 )e−isϕ(H0 ) ζ1 (H0 )ψ(H0 )P+ .
−∞
SCHRÖDINGER OPERATORS WITH EMPTY SINGULARLY CONTINUOUS SPECTRA 231
From Assumption (i) we know that ζ2−1 (H )[ϕ(H )−ϕ(H0)]ζ1−1 (H0 ) is compact.
Therefore the assertion is true if
Z ∞
kζ2−1 (H )(ϕ(H ) − ϕ(H0 ))ζ1−1 (H0 )e−isϕ(H0 ) ζ1 (H0 )ψ(H0 )P+ k ds
0
is finite.
For that, we use the decomposition
× k(1 + A2 )− 2 −ε (1I − Pαs )e−isϕ(H0 ) ζ1 (H0 )(H0 + 1)ψ(H0 )P+ k +
1
+ k(1 + A2 )− 2 −ε Pαs e−isϕ(H0 ) ζ1 (H0 )(H0 + 1)ψ(H0 )P+ k .
1
ζ1 (H0 )(H0 + 1)ψ(H0 ) is bounded, while in the second term in the curly brackets
ζ1 (·)(· + 1)ψ(·) is again in Cc∞ (R \ {0}) such that the estimate in (21) is applicable.
Finally, (23) follows from (22) in the same way. 2
Now we will show that the inequality in (21) holds for an appropriate α given
the assumptions in Theorem 1.
Proof. Define
Then
with
Z
1 0
K(a, a 0 ) = eib(a−a ) ρ(eb ) db. (41)
2π R
Hence,
Therefore,
for |a| 6 αt and a 0 > 0. Using integration by parts, the last integral is equal to
∞
ψ(eb )
−ie i[b(a−a 0 )+t ϕ(eb )] , (44)
(a − a 0 ) + teb ϕ 0 (eb ) −∞
Z ∞
1 0 b )] d ψ(eb )
− db ei[b(a−a )+t ϕ(e . (45)
i −∞ db a − a 0 + teb ϕ 0 (eb )
The term in (44) is zero because ψ(e−∞ ) = ψ(0) = 0 and ψ(∞) = 0, and because
the denominator in (44) is bounded, which will be shown next.
We are in the case ϕ 0 < 0. Thus, there are two positive constants ã1 , ã2 with
as long as eb ∈ supp ψ. Hence, for t > 0 there are two positive constants a1 , a2
such that
Thus, the term in (44) vanishes. For estimating the term in (45), we repeat this
procedure l-times. Then
c 1
|K(a, a 0 )| 6 . (47)
t l−2 |a − a 0 + teb ϕ 0 (eb )|2
This implies
Z Z ∞
c
da da 0 |K(a, a 0 )|2 6 . (48)
|a|6αt 0 t 2l−2
Here l is arbitrary because ϕ is assumed to be in C ∞ (R). The proof of (22) is
similar and this completes the proof of Lemma 9. 2
LEMMA 10. Under the assumptions (i), (ii), (iii) of Theorem 1, the wave opera-
tors ± (ϕ(H ), ϕ(H0)) exist.
Proof. Again we consider only the case ϕ 0 < 0 and the existence of + . It
suffices to prove the existence on a dense set. For that we choose
For any ψ1 (·) ∈ Cc∞ (R \ {0}), [ψ1 (H ) − ψ1 (H0 )] is compact and therefore we have
the identity
s − lim eit ϕ(H ) e−it ϕ(H0 ) ψ1 (H0 ) = s − lim ψ1 (H )eit ϕ(H ) e−it ϕ(H0 ) (49)
t →∞ t →∞
if one of these strong limits exist. We show the existence of the second limit. By
restricting to a dense set again, it is sufficient to establish that
The first factor is bounded since ψ1 ∈ Cc∞ , the second is assumed to be bounded
in Assumption (iii), the third is bounded according to [11] (see (39)). It remains to
estimate the t-dependence of the last factor.
Because
c
k(1 + A2 )− 2 −ε (1I − Pβ|t | )k 6 1+ε for each β > 0,
1
t
it is enough to estimate
kPβ|t | e−it ϕ(H0 ) (H0 + 1)ζ1 (H0 )ψ2 (H0 )f k.
Because f = (1 + A2 )−1 g, we consider
kPβ|t | e−it ϕ(H0 ) (H0 + 1)ζ1 (H0 )ψ2 (H0 )(1 + A2 )−1 k.
For α > 0
c
k(1I − Pα|t | )(1 + A2 )−1 k 6 ,
t2
thus the proof will be complete if we can get an integrable estimate for
Integration by parts produces here the denominator a − a 0 − teb ϕ 0 (eb ). For suitable
α, β, this is bounded from below by
|a − a 0 − teb ϕ 0 (eb )| > c|t|,
which implies
kPβ|t | e−it ϕ(H0 ) (H0 + 1)ζ1 (H0 )ψ2 (H0 )Pα|t | k 6 cl · |t|−l , l ∈ N.
and that
or if
Z Z
dx dy |D1 (x, y)|(1 + |y|2 )1+ε < ∞. (56)
Rd Rd
6 c sup(1 + |u|2 ) 2 +ε
1
|D1 (u, v)| dv + c. (61)
u |v|62|u|
For |u| 6 R, the supremum in (61) is always finite. For |u| > R > 1
Z
2 12 +ε
sup (1 + |u| ) |D1 (u, v)| dv (62)
|u|>R >1 Rd
Z
(|u|−|v|2
6 c sup |u| 1+2ε
e− 8 dv +
|u|>R |v|6 R2
Z
+ c sup |u|1+2ε |D1 (u, v)| dv.
|u|>R 2|v|>R
The first term in (62) is always finite, the second is assumed to be finite in (55).
The second factor in (57) can be estimated in a similar way. In brief, we have
Z Z Z
dv |e−H (x, u)||D1 (u, v)|e−H0 (v, y)(1 + |y|2 ) 2 +ε
1
sup dx du
y Rd Rd Rd
Z Z
dv |D1 (u, v)|e−H0 (v, y)(1 + |y|2 ) 2 +ε
1
6 c sup du
y Rd Rd
Z
e−H0 (v, y)(1 + |y|2 ) 2 +ε dv +
1
6 c sup
y |v|6 |y|
Z 2
Z
du |D1 (u, v)|(1 + |v|2 ) 2 +ε |e−H0 (v, y)|
1
+ c sup dv
y 2|v|>|y| Rd
|y|2
6 c sup(1 + |y|2 ) 2 +ε |y|d e−
1
16
y
Z
1 +ε
+ c sup(1 + |v| ) 2 2 |D1 (u, v)| du. (63)
v Rd
Hence, the condition in (55) implies that both factors on the left-hand side of (57)
are finite.
SCHRÖDINGER OPERATORS WITH EMPTY SINGULARLY CONTINUOUS SPECTRA 237
LEMMA 12. The difference e−λH − e−λH0 is compact if for each β > 0
Z
sup |x|β
|Dλ (x, y)| dy < ∞ (66)
|x|>R >1 Rd
or if
Z Z
dx dy |Dλ/2 (x, y)| < ∞. (67)
Rd Rd
It suffices to prove
Z
lim sup dy |χ(|x| 6 n)Dλ (x, y)χ{|y| 6 2n} − Dλ (x, y)| = 0, (68)
n→∞ x Rd
which tends also to zero as n → ∞. This proves e−λH − e−λH0 to be compact if the
condition in (66) is satisfied.
For (67), we use the standard decomposition
Remark. The proof of Theorem 5 is complete now. The conditions in (55) and
(56) are stronger than the ones in (66) or (67), respectively. Therefore, we need
either the condition in (13) or that in (14). As we will see in the applications,
SCHRÖDINGER OPERATORS WITH EMPTY SINGULARLY CONTINUOUS SPECTRA 239
these two conditions have a common intersection but generally allow different
perturbations. The dependence on λ in (13) or (14) is not critical.
4. Applications
4.1. POTENTIAL PERTURBATIONS
σsc (H0 + MV ) = ∅.
2
Z Z λ
R2
−c |x|λ
ds s − 2 e−c
d
6 ce du |V (u)| s
|u|6 R2 0
2
Z
−c |x|λ
6 cλe |V (u)| du.
|u|6 R2
The last term is finite because each Kato-class potential is in L1loc (Rd ).
For |u| > R/2 we obtain
Z Z λ
sup |x|α du ds |(e−sH0 )(x, u)|V (u)|
|x|>R |u|> R2 0
Z Z λ
d |x−u|2 |u|α
6 c sup |x| α
du ds s − 2 e− 4s |V (u)| +
|x|>R |x|
2 >|u|> 2
R
0 Rα
Z Z λ
|u|α
+ c sup |x|α du ds |(e−sH0 )(x, u)|V (u)|
|x|>R |u|> |x| 0 |x|α
Z Z
2
λ
d (|x|−|u|)2
6 cR −α sup |V (u)||u|α sup |x|α du ds s − 2 e− 4s +
|u|> R2 |x|>R |u|6 |x|
2 0
Z Z λ
+ c sup sup |u|α |V (u)| dv ds |(e−sH0 )(x, v)|
|x|>R |u|> R2 Rd 0
Z λ
|x|2 R2
6 cR −α sup (|V (u)||u|α ) · sup |x|α e−c ds s − 2 e−c
d
λ s
|u|> R2 |x|>R 0
|u|> R2
Remark 14. The condition in (69) or (13) is the usual one for short-range po-
tentials
1
|V (x)| 6 c , for |x| > R. (70)
(1 + |x|)1+ε
The condition in (14) is satisfied if
Z
dx (1 + |x|2 )α |V (x)| < ∞, (71)
Rd
which also allows unbounded potentials.
The Laplace operator H0 in L2 (Rd ) is associated to the Wiener process. Its semi-
group e−λH0 can be represented by
(e−λH0 f )(x) = Ex {f (X(λ))} (72)
SCHRÖDINGER OPERATORS WITH EMPTY SINGULARLY CONTINUOUS SPECTRA 241
for all f ∈ L2 (Rd ). Here Ex {·} is the expectation with respect to the Wiener
measure.
The obstacle region is denoted by 0, where 0 is a closed set in Rd . We denote
its complement by 6 = Rd \0. Define S0 := inf{s, X(s) ∈ 0} the first hitting time
of 0 for the underlying Wiener process. We assume that the set of regular points of
0 coincide with the regular points of the interior of 0. Setting
(Tλ f )(x) = Ex {f (X(λ)), S0 > λ}, (73)
we obtain a strongly continuous semigroup {Tλ , λ > 0} on L2 (6). The generator
of this semigroup is the Friedrichs extension of H0 restricted to Cc∞ (6) (see, e.g.,
[1]). Denoting this generator by H̃6 , we have
e−λH6 := e−λH̃6 ⊕ 0
on L2 (6) ⊕ L2 (0). The difference of the two semigroups in L2 (Rd ) is given by
(e−λH0 f − e−λH6 f )(x) = Ex {[1 − χ{S0 > λ}]f (X(λ))}
= Ex {f (X(λ)), S0 < λ}. (75)
y,λ
Using the conditional Wiener measure Ex {·}, this semigroup difference is an
integral operator with the kernel
Dλ (x, y) = (e−λH0 )(x, y) − (e−λH6 )(x, y)
= Exy,λ {S0 < λ}. (76)
Note that this kernel is symmetric in x and y, and recall the crucial role in the
conditions of Theorem 5 played by the integral of |Dλ (x, y)| with respect to one of
the variables. With regard to the condition (14), we note that
Z
|Dλ (x, y)| dx = Ey {S0 < λ}. (77)
Rd
Then the wave operators ± (H6 , H0 ) exist and are complete. The singularly
continuous spectrum of H6 is empty, i.e., σsc (H6 ) = ∅.
for x ∈/ 0. This means that Theorem 15 is true for all bounded 0. Hence, we get
the following result.
EXAMPLE 17. Let the dimension d satisfy d > 3, let 0 be a union of balls
B(an , rn ) = Bn centered in an = (n, 0, 0, . . . , 0), i.e., |an | = n, and with radii
rn < 1.
SCHRÖDINGER OPERATORS WITH EMPTY SINGULARLY CONTINUOUS SPECTRA 243
n Rd
XZ Z
6 dx (1 + |x|2 )α + Ex {SBn < λ}(1 + |x|2 )α dx}
n Bn Rd \B n
X Z
(|x−an |−rn )2
1/q
6c rn |an |2α + cap(Bn )1/p dx e−c λ (1 + |x|2αq )
n Rd \Bn
X d−2
6c {rn |an |2α + rn p (1 + |an |2α )}
n
X d−2
6c rn p n2α . 2
n
P 1 d−2−2α
In particular, one can take rn = (1/n)p , then n n is finite for d > 6 and
1 6 α < 1, 5.
References
1. Baumgärtel, H. and Demuth, M.: Decoupling by a projection, Rep. Math. Phys. 15 (1979),
173–186.
2. Baumgärtel, H. and Wollenberg, M.: Mathematical Scattering Theory, Birkhäuser, Basel, 1983.
3. Cycon, H. L., Froese, R. G., Kirsch W., and Simon, B.: Schrödinger Operators with Applica-
tions to Quantum Mechanics, Springer-Verlag, Berlin, 1987.
4. Demuth, M., McGillivray, I., and Noll, A.: Capacity and spectral theory, In: M. Demuth, E.
Schrohe, B.-W. Schulze and J. Sjöstrand (eds), Spectral Theory, Microlocal Analysis, Singular
Manifolds, Adv. in Partial Differential Equations 14, Akademie-Verlag, Berlin, 1997, pp. 12–
77.
5. Enss, V.: Asymptotic completeness for quantum-mechanical potential scattering I. Short range
potentials, Comm. Math. Phys. 61 (1978) 285–291.
6. Enss, V.: Asymptotic completeness for quantum-mechanical potential scattering II. Singular
and long-range potentials, Ann. Phys. 119 (1979), 117–132.
7. Enss, V.: Geometrical methods in spectral and scattering theory of Schrödinger operators, In:
G. Velo and A. S. Wightman (eds), Rigorous Atomic and Molecular Physics, Plenum Press,
New York, 1981.
8. Eder, S.: Factorization theorems for compact operators and semigroup differences, PhD Thesis,
TU Clausthal, 1999.
244 MICHAEL DEMUTH AND KALYAN B. SINHA
9. Fukushima, M., Oshima, Y., and Takeda, M.: Dirichlet Forms and Symmetric Markov
Processes, de Gruyter, Berlin, 1994.
10. Kato, T.: Perturbation Theory for Linear Operators, Springer-Verlag, Berlin, 1976.
11. Muthuramalingam, Pl. and Sinha, K. B.: Asymptotic completeness in long-range scattering II,
Ann. Sci. Ecole Norm. Sup. 4 18 (1985), 57–87.
12. Perry, P. A.: Scattering Theory by the Enss Method, Math. Rep. 1, Harwood Acad. Publ., Chur,
1983.
13. Reed, M. and Simon, B.: Methods of Modern Mathematical Physics, Vol. III: Scattering Theory,
Academic Press, New York, 1979.
14. Simon, B.: Schrödinger semigroups, Bull. Amer. Math. Soc. (N.S.) 7 (1982), 447–526.
15. Sinha, K. B.: Mathematical theory of scattering in quantum mechanics – a review, Current Sci.
56 (1987), 293–297.
Mathematical Physics, Analysis and Geometry 2: 245–278, 1999.
245
© 1999 Kluwer Academic Publishers. Printed in the Netherlands.
FRANZ MERKL
Courant Institute of Mathematical Sciences, 251 Mercer Street, New York, NY 10012, U.S.A.
e-mail: merkl@cims.nyu.edu?
Abstract. This article describes the solution of the Kadomcev–Petviashvilli equation with C 10 real
periodic initial data in terms of an asymptotic expansion of Bloch functions. The Bloch functions
are parametrized by the spectral variety of a heat equation (heat curves) with an external potential.
The mentioned spectral variety is a Riemann surface of in general infinite genus; the Kadomcev–
Petviashvilli flow is represented by a one-parameter-subgroup in the real part of the Jacobi variety
of this Riemann surface. It is shown that the KP-I flow with these initial data propagates almost
periodically.
Key words: Kadomcev–Petviashvilli flow, Jacobi variety, infinite genus Riemann surfaces, Riemann–
Roch theorem.
1. Introduction
The Kadomcev–Petviashvilli partial differential equation (KP-I) is given by
ut = 14 uxxx + 32 ayy − 32 ux uxx , (1)
u = 2ax ; (2)
we impose the constraint that the mean value of u averaged in x-direction vanishes.
The KP-I equation reduces to the Korteweg–de Vries equation when there is no y-
dependence, ayy = 0. Bourgain [1] proved that the initial value problem with real
periodic L2 initial data u(x, y, t = 0) = q(x, y) is globally well posed. Feldman,
Knörrer and Trubowitz [2] showed that the KP flow propagates almost periodically
in t in case of real-valued real analytic periodic initial data. Here we prove almost
periodicity for real-valued periodic initial data q under the following regularity
condition: the Fourier coefficients q̂ of q have to satisfy k|j |r q̂(j )k1 < ∞ for
some r > 8; C 10 initial data fulfill this condition. The method differs from the
? Current address: Eurandom, PO Box 513, 5600 MB Eindhoven, The Netherlands. e-mail:
merkl@eurandom.tue.nl
246 FRANZ MERKL
Laurent expansion
ψ = exp(tz3 + yz2 + xz + az−1 + bz−2 + cz−3 + · · ·) (3)
at ∞ satisfies the KP-I-equation. More details are described in Section 8; a closely
related algebraic treatment of the KdV-equation is described in [14]; see [4 – 6, 8]
for other interesting approaches to solutions of the KP-equation and modified KP-
equation. In a geometric language, we have a one-point deformation at ∞ of the
line bundle O(P ) defined by the divisor P . The deformation is parametrized by
(x, y, t); the KP-I-flow is obtained by a deformation in t-direction. It corresponds
to a one-parameter-subgroup in the Jacobi variety of X.
We will generalize this to some Riemann surfaces of infinite genus: It is well
known (see, e.g., [2]) that the heat curves are invariant as the external potential
flows with the KP flow. Heat curves may be viewed as infinitely many handles
glued to a complex plane; the handles accumulate only at infinity, but they become
rather dense there. The meromorphic functions ψ that we are going to expand may
have one pole per handle; they grow like exp(tz3 + yz2 + xz) near infinity; z is a
coordinate in the complex plane with holes for the handles cut out. The presence
of infinitely many handles accumulating near infinity, each of them containing one
pole of ψ, is an obstruction for a convergent Laurent series (3), but we will derive
an asymptotic expansion of finite order:
ψ = exp(tz3 + yz2 + xz + az−1 + bz−2 + cz−3 + · · · + O(z−n )); (4)
it is valid outside the handles. The coefficient a = a(x, y, t) again solves the KP-
equation, and for an appropriate choice of the pole distribution we are able to fulfill
the given initial condition. The maximal order n of the expansion (4) that we can
control depends on the regularity of the potential.
We review some basic facts on heat curves and Bloch functions in the prepara-
tory Section 2, mostly without proofs. The main results are stated in Section 3. We
introduce line bundles with growth conditions at infinity in the first half of Section
4; this gives us a uniform geometric language to make the proofs more clear. The
main technical ingredient to derive the asymptotic expansion (4) is the Fredholm
AN ASYMPTOTIC EXPANSION FOR BLOCH FUNCTIONS 247
theory of the Cauchy–Riemann operator in certain line bundles over infinite genus
Riemann surfaces; especially we use a Riemann–Roch theorem for Riemann sur-
faces of infinite genus. This tool was prepared in [12]; we check the hypotheses
of the Riemann–Roch theorem for the relevant line bundles in the second half of
Section 4. We prove in Section 5 that meromorphic functions with the prescribed
pole distribution and with the prescribed asymptotics are unique; this is an infinite-
genus analogue for the nonspeciality of the divisor P in the above example. The
asymptotic expansion (4) and bounds for the derivatives of the coefficients with
respect to the parameters x, y, z, P are derived in Section 6, based on a Fredholm
theory for the Cauchy–Riemann operator. In Section 7, a L2 -bound for the error
term is used to derive a pointwise bound O(z−n ). Section 8 proves the KP-I equa-
tion for the Laurent coefficient a in the expansion. Compactness of the real part of
the Jacobi variety yields the almost periodicity of the KP-I flow; this is shown in
Section 9. Finally we prove that the initial conditions are fulfilled in Section 10.
Some parts of the material presented here are also contained in the author’s
doctoral thesis [13]; the latter was done under the supervision of Horst Knörrer at
the ETH Zürich.
2. Heat Curves
To start, we review the definition and some fundamental properties of heat curves.
For a detailed description of these constructions and for proofs we refer to [2], Part
III, §15, §16, [2], Part IV, §23, [7], Ch. I, §2, and [13].
Let 0 = γ1 Z ⊕ γ2 Z be a lattice in R2 , where γ1 = (γ11, 0), γ2 = (γ21, γ22 )
R γ11 γ11 , γ22 > 0. We fix a potential q ∈ ∗L (R ∗/ 0) with zero spatial mean,
2 2
with
0 q(x, y) dx = 0. For ξ = (ξ1 , ξ2 ) ∈ C × C let Fξ (q) denote the set of
all distributional solutions ψ in L2loc (R2 ) of the heat equation
The set H(q) is an analytic variety; it is invariant as q flows with respect to the
KP-flow; for proofs of these facts see [2], §15.
For free heat curves one has simply
2
H(0) = {(ξ1 , ξ2 ) = (eγ11 z , eγ21 z+γ22 z ) | z ∈ C} (7)
248 FRANZ MERKL
2
corresponding to the exponential solutions ψ(x, y) = exz+yz of the free heat
equation; there are countably many transversal self intersections of this surface
located at the points z = σ±j , j = (j1 , j2 ) ∈ (Z \ {0}) × Z with
iπ γ11 j2 γ21
σj := j1 + − ;
γ11 2γ22 j1 2γ22
at these special points the space Fξ (q) is two-dimensional; over all other points of
the free heat curve this space is one-dimensional.
The double point singularities of the free heat curve H(0) open up to handles
(or branch cuts) as the potential q is turned on; Krichever [7] has given an explicit
description of heat curves in terms of cutting and gluing operations:
Take a complex plane C (coordinate z) and cut it open along countably many
pairs of parallel lines [v1,j , v2,j ], [v1,−j , v2,−j ] with v1,j − v1,−j = v2,j − v2,−j ; the
index j runs over the index set J = N × Z. Then glue corresponding border lines
together by translation with v1,j − v1,−j to form a handle; we get an oval aj from
the the two identified branch cuts. In exceptional cases, v1,j = v2,j ; this means
that the two quadratic branch points degenerate to a double point singularity. For
the lines [v1,±j , v2,±j ] chosen appropriately, there is a natural biholomorphic map
between the Riemann surface obtained in this way and the heat curve H(q); let
[ [
z : H(q) \ aj → C \ [v1,±j , v2,±j ]
j j
denote this biholomorphic map outside the branch cuts. The coordinate z is charac-
terized modulo 2π i/γ11 by the condition ezγ11 = ξ1 ; here ξ1 is defined by the Bloch
condition (6). The additive constant in 2π iZ/γ11 is determined by the condition
|z(ξ )|→∞
ξ2 e−(γ21 z(ξ )+γ22 z(ξ ) )
2
−→ 1, (8)
ξ in the domain of z; the fact that one can fulfill (8) is a consequence of [2], Part
III, §16, Theorem 16.1. One should compare this with the special case (7).
All geometric facts on heat curves that we need are reviewed in Propositions
1 and 2 below. The first proposition describes the distribution of the ramification
points; it also provides bounds on the width |v1,j − v1,−j | of the handles:
PROPOSITION 1. Let q be real-valued and periodic, and P let r > 4. Assume that
the Fourier coefficients q̂(j ) of q satisfy q̂(0) = 0 and j |j |r |q̂(j )| < ∞. Then
|v1,j − v2,j | 6 O(|j1 |−1 |j |−r ) and |v1,j − σj | → 0 as |j | → ∞, j = (j1 , j2 ) ∈
(Z \ {0}) × Z. The branch cuts remain on lines parallel to the real axis:
iπj1
Im v1,j = Im v2,j = Im σj = .
γ11
Proof. See [2], Part IV, Theorem 23.4, Lemma 23.5 and [7], Ch. I, §2. 2
smaller but also more dense as |z| → ∞: asymptotically the potential term may be
treated as a small perturbation.
Complex conjugation ι: (ξ1 , ξ2 ) 7→ (ξ1 , ξ2 ) maps the heat curve H(q) onto
itself; it is described by z 7→ z in the coordinate z; it leaves the ovals aj lying over
the branch cuts pointwise fixed.
We introduce the domain U on which the asymptotic expansion will be valid;
we obtain it by removing neighbourhoods of the branch cuts: We choose some
small ε > 0; it will be held fixed. Then we remove all the ellipses described by
|z − vj,1 | + |z − vj,2 | < |vj,1 − vj,2 | + ε/|j1 | from the z-plane; for sufficiently small
ε these ellipses are pairwise disjoint, since neighbouring branch cuts to [vj,1 , vj,2 ]
have a distance of order const · |j1 |−1 . Let U denote the z-plane with all these
ellipses removed; let U0 be obtained by a similar construction with ε replaced by
ε/2; so smaller ellipses are removed here: U ⊆ U0 .
In the exceptional cases when some of the handles degenerate to double points,
we work with the normalization of H(q); no singularities of the Riemann surface
remain here.
The second proposition controls the Bloch solutions:
PROPOSITION 2. (1) When ξ ∈ H(q) is not a double point, the space of so-
lutions Fξ (q) has complex dimension 1. For 0 6= ψξ ∈ Fξ (q) the map Fx,y :
ξ 7→ ψξ (x, y)/ψξ (0, 0) is meromorphic on H(q) outside the double points; these
functions are regular at the normalization of the double points. There are at most
poles of first order at the zeros of ψξ (0, 0); there is precisely one zero of ψξ (0, 0)
over every cycle aj , and there are no other zeros.
(2) The Bloch solutions behave asymptotically over U like exponential solutions
of the free heat equation:
e−(xz(ξ )+yz(ξ ) ) ψξ (x, y)/ψξ (0, 0) = 1.
2
lim
z(ξ )→∞,ξ ∈U
We do not include a proof of this proposition in this article; but one may consult
[7], §2, especially Theorem 2.2 for a proof of the first statement in the case of
real analytic potentials; see also [13] for less regular potentials. The last citation
contains also a proof of part (2) of the proposition.
3. Statement of Results
We choose a divisor P of infinite degree over H(q) that assigns multiplicity 1 to
one point Pj on every oval aj , but 0 to all other points. The divisor of zeros of a
meromorphic function f is denoted by (f ); negative values mean poles.
We are going to prove:
THEOREM 1 (Asymptotic expansion of meromorphic functions). (1) Assume that
the potential q satisfies k|j |−r q̂(j )k1 < ∞ with r > 6. Let x = (x, y, t) ∈ R3 .
Then the complex vector space of global meromorphic functions ψ on the heat
curve with the following properties has the dimension 1:
250 FRANZ MERKL
(3) The derivatives D mak (x, P ) exist when v(m) < Q r − max{4, n + 1}; these
derivatives are continuous functions of (x, P ) ∈ R3 × j aj ; here the divisor P
is viewed as an element of the Cartesian product of all ovals aj endowed with the
product topology. The derivatives D m ψx,P (ξ ) exist when r > v(m) + 4Qtoo (with
poles for ξ at P ); they are are continuous functions of (x, P ) ∈ R3 × j aj and
meromorphic functions of ξ .
The next theorem shows us how to get a solution of the KP-I initial value
problem in terms of this expansion:
THEOREM 2 (Solution of the KP-I initial value problem). (1) For fixed P , the
first coefficient a(x) := α1 (x, P ) in the expansion (9) and u(x) := 2 ∂x
∂
a(x) yield
a solution of the KP-I equation (1). For fixed P , t and ξ , the function (x, y) 7→
ψx,P (ξ ) solves the heat equation
THEOREMQ 3 (Jacobi group structure; almost periodicity of the KP flow). (1) Let
O, P , Q ∈ j aj . Then there is a nonvanishing global meromorphic function φ on
the heat curve which is bounded over U and satisfies (φ) > O −P −Q; it is unique
up to a multiplicative constant.Q Let P ⊕O Q := (φ)+P +Q−O Q denote
Q the divisor
Q
of zeros of φ; then P ⊕O Q ∈ j aj ; the operation Q ⊕O : j aj × j aj → j aj
defines a continuous Abelian group structure on j aj with neutral elementQ O.
(2) Let Q := (ψx,P ) + P denote the divisor of zeros of ψx,P . Then Q ∈ j aj ;
Q Q
we define the map v: R3 × j aj → j aj , v(x, P ) := Q. Then v is a continuous
Q
group operation of (R3 , +) on j aj : the relations v(x + y, P ) = v(x, v(y, P ))
and v(x, P ) = v(x, O) ⊕O P hold.
(3) Let u denote the solution of the KP-I initial value problem with real periodic
initial data q with k|j |−r q̂(j )k1 < ∞, r > 8. Then we have almost periodicity of
the solution in the following sense: For every ε > 0 there is L > 0 so that every
real interval of length L contains a number τ with
sup |D mu(x, y, t + τ ) − D m u(x, y, t)| < ε
(x,y,t )∈R3
From these basic examples we build the bounded line bundles of interest by
tensor products; the tensor product for two pairs is defined in a natural way by
(L, φ) ⊗ (L0 , φ 0 ) = (L ⊗ L0 , φ · φ 0 ), and the dual of (L, φ) is defined by (Ľ, 1/φ).
We abbreviate L(P ) := L ⊗ Ob (P ) and L1,0 := L ⊗ Ob1,0 for every bounded line
bundle L; the space of global holomorphic sections in L is denoted by 0(L).
The first statement in Theorem 1 claims that dim 0(Ex (P )) = 1; the Riemann–
Roch theorem for Riemann surfaces of infinite genus proved in [12] yields
dim 0(Ex (P )) − dim 0(E−x
1,0
(−P )) = 1;
one should note that every handle receives precisely one point of the divisor P ; in
the language of reference [12] this implies that the codegree of P vanishes; this
justifies the right-hand side 1 in the Riemann–Roch formula. Similarly the first
part of Theorem 3 states dim 0(Ob (P + Q − O)) = 1; the Riemann–Roch theorem
yields in this case:
dim 0(Ob (P + Q − O)) − dim 0(Ob1,0(−P − Q + O)) = 1;
one should note that the codegree of P + Q − O vanishes too, since the number of
points per handle of the divisor counted with multiplicity is 1.
It remains to check the hypotheses of the Riemann–Roch theorem for Ex (P )
and Ob (P + Q − O). This is a rather technical matter; the reader who searches
only an overview may want to skip the rest of this section except the introduction
of the handle coordinates zj : Uj → C in Equations (11); only Lemma 1 will be
used later too.
We show that the hypotheses (X1)–(X6) in [12] are fulfilled: First, coordinates
zj in the handles are required: We introduce scaling constants cj = c−j > 0 for
every handle; they should satisfy 0 < lim infj cj /|j1 | 6 lim supj cj /|j1 | < ∞;
this means that the distance of neighbouring handles is of order cj−1 in the z-plane.
More details for cj are specified below. Set sj := (v1,j + v2,j )/2 (center of the j -th
√
branch cut) and tj := |v2,j − v1,j |cj /2 (rescaled width of the j -th branch cut);
define zj by the equations
tj p
(z − sj )cj = zj + for |zj | > tj ,
zj
tj p
(z − s−j )cj = zj + for |zj | 6 tj (11)
zj
in a neighbourhood of the oval aj . We have the equation z−j zj = tj ; the ovals aj
√
are described by |zj | = tj . The equations |zj | = const describe ellipses with
focii at the ramification points; for example the boundary ellipses of U0 are given
by |zj | = ε1 (j ) for some constants ε1 (j ) = ε1 (−j ).
Define the j -th handle Uj to consist of all ξ ∈ H(q) such that |zj (ξ )| < 1 and
|z−j (ξ )| < 1; we restrict the coordinate zj to be defined only in this domain. We
choose the scaling constants cj so that:
• the constants ε1 (j ) do not depend on j ∈ J at least for large |j |;
AN ASYMPTOTIC EXPANSION FOR BLOCH FUNCTIONS 253
• ε1 (j ) < 1;
• the handles Uj are pairwise disjoint and disjoint from U .
This all can be done if the constant ε in the definition of U0 is sufficiently small.
The intersection Uj ∩ U0 consists of two connected components Vj , V−j ; these are
the annuli V±j = {ξ ∈ Uj : 1 > |z±j (ξ )| > ε1 (j )}. Obviously we can include the
elliptic annuli z[Vj ] – at least for large |j | – in pairwise disjoint discs in the z-plane
centered at sj with a radius rj 6 O(|j1 |−1 ).
These considerations show that the hypotheses (X1)–(X3) in [12] are fulfilled.
We check (X4): We look for bounds on the derivatives of the coordinate changes:
dz tj
sup cj − 1 = sup 2 6 const · tj
Vj dzj V j zj
and
d2 z tj
sup cj 2 = 2 sup 3 6 const · tj .
Vj dzj V j zj
√
Using Proposition 1 and the fact that |cj | scales like |j1 |, we get tj 6 O(oj ) for
large |j | with oj := |j |−r ; but also tj 6 O(oj ) for large |j |; from this we obtain
the hypothesis (X4) in [12].
The hypothesis (X5) in [12] states parabolicity of the surface; thisR means: for all
N b X and ε > 0 there is χ ∈ Cc∞(X, [0, 1]) with χ|N = 1 and X dχ ∧ ∗dχ <
ε. This is easy to prove by explicitly constructing χ from a cutoff-function in C
interpolated through the handles; we refer the reader either to [2] or to [13].
All quantitative estimates on the scaling constants that are required for the
Riemann–Roch theorem are assembled in the hypothesis (X6) in [12]; we check
them here:
• αj := |cj ||sj |2 is bounded from below by a positive constant since both factors
are bounded from below;
• tj 6 O(oj2 );
P have |α2j | 6P O(|j1 ||s
• We j | ) 6 O(|j ||σj | ) 6 O(|j | ); consequently
2 2 3
j ∈J |j |
6−2r
j ∈J (αj oj ) < ; the sum runs over a two-dimensional lattice;
hence it is convergent for r > 4.
• oj αj2 6 O(|j |6−r ) is bounded for r > 6;
• lim supj |rj /sj | < 1, since |rj | → 0, while |sj | → ∞.
Q
Let P , Q, O ∈ j aj . The bounded line bundles Ob (P ) and Ob (P + Q − O) were
already treated as examples in the cited reference based on the hypotheses (X1)–
(X6) only; but it remains to check the remaining hypotheses (L1)–(L2) in [12] of
the Riemann–Roch theorem for the “deformed” bounded line bundles Ex (P ): we
observe that deformations with asymptotically almost constant transition functions
preserve the hypotheses of the Riemann–Roch theorem: let the bounded line bundle
(L, ψ0 ) satisfy the hypotheses of the Riemann–Roch theorem with basis sections
254 FRANZ MERKL
ψj over Uj , and let (E, φ0 ) be a bounded line bundle with basis sections φj over Uj
that satisfy the following bounds: for some constants ζj of modulus 1 and ν = ±1,
ν
φ0 d φ0ν
−1 ν
sup oj sup ν − ζj < ∞ and sup oj sup −1
ν < ∞. (12)
j Vj φj j Vj dzj φj
LEMMA 1. (1) There exist imaginary numbers αj,l = −α−j,l and holomorphic
functions ηj,l = η−j,l : Uj → C for l ∈ Z with |αj,l | 6 |sjl |,
The functions ηj,l can be chosen antisymmetric with respect to complex conjuga-
tion: ηj,l (ι(ξ )) = ηj,l (ξ ), ξ ∈ Uj . (2) Similarly there exist imaginary numbers
αj,log = −α−j,log and holomorphic functions ηj,log = η−j,log : Uj → C with
sup oj−1 sup | Log z − ηj,log − αj,log | < ∞
j Vj
and (15)
d
sup oj−1 sup (Log z − ηj,log ) < ∞,
j Vj dzj
and |αj,log | 6 π . Here “Log” means the principal branch of the logarithm. The
functions ηj,log can be chosen antisymmetric with respect to complex conjugation,
too.
Proof. (1) We define ηj,l := wjl + w−j l
− Re s−j
l
with wj := zj /cj + sj over Uj ,
and αj,l := −i Im s−j ; obviously |αj,l | 6 |s−j | = |sjl | holds. We get supVj |wj −
l l
z| 6 O(|tj /cj |) and supVj | dzdj (wj − z)| 6 O(|tj /cj |) since wj − z = −tj /(zj cj )
over Vj ; recall infj infVj |zj | > 0.
AN ASYMPTOTIC EXPANSION FOR BLOCH FUNCTIONS 255
This implies
Z 1
sup |wjl − z | 6 |l| sup |wj − z|
l
|(1 − t)z + twj |l−1 dt 6 O(|sjl−1 tj /cj |);
Vj Vj 0
one should note that |sj |−1 |(1 − t)z + twj | is bounded from above and from below
on Vj uniformly in j by positive constants.
The derivative is estimated by
d l−1 d
l−1 d
sup l
(wj − z ) 6 |l| sup z
l
(wj − z) + |wj − z |
l−1
wj
Vj dz j Vj dz j dz j
and
d l d l d
sup (z − ηj,l ) = sup (z − wj ) −
l
w−j 6 O(|sjl−1 tj /cj |);
l
Vj dzj Vj dzj dzj
these bounds hold for all l ∈ Z. The estimates (14) follow from O(|sjl−1 tj /cj |) 6
O(|j |−2r+l−1) 6 O(|j |−r ) = O(oj ) with the help of |sj | 6 O(|j |), |cj−1 | 6 const,
tj 6 O(|j |−2r ), and l − 1 6 r.
The symmetry properties for ηj,l follow from w−j (ι(ξ )) = wj (ξ ) and s−j = sj .
(2) The proof of these bounds is similar to the previous one; one just has to
replace l-th powers in the proof by logarithms and the (l − 1)-st powers by recip-
rocals. One should note that z[Vj ] does not meet a neighbourhood of the negative
real axis; so Log z is continuous and bounded on every fixed Vj . 2
5. Nonspeciality of Divisors
We are going to prove uniqueness of holomorphic sections in Ob (P ), Ex (P ) and
Ob (P + Q − O), starting with the simplest case:
256 FRANZ MERKL
LEMMA Q 2. 0(Ob (P )) consists only of the constant functions for every divisor
P ∈ j aj .
Proof. We have to show dim 0(Ob (P )) = 1. By the Riemann–Roch theorem
dim 0(Ob (P )) − dim 0(Ob1,0(−P )) = 1; so it suffices show 0(Ob1,0(−P )) = {0}.
So assume that there was an ω ∈ 0(Ob1,0 (−P )), ω 6= 0. We may assume that ω
is antisymmetric, ι∗ ω = −ω, by replacing it either by ω − ι∗ ω or by iω − ι∗ (iω); at
least one of these two R forms is nonzero. Since all aj -cycles are symmetric, ι∗ aj =
aj , the aj -periods aj ω are imaginary. We cut the heat curve open along all ovals
aj ; the resulting surface Y may be viewed as the complex plane with all branch cuts
[v1,j , v2,j ] cut open. The homology of Y is generated by these ovals; and σ := Re ω
has zero aj -periods; so σ is exact over Y , say σ = df on Y . ω ∈ 0(Ob1,0(−P ))
implies |ω/(z−2 dz)| 6 const over U0 ; by integrating this bound over semi-infinite
pathes γ in U0 we get supU0 |zf | < ∞; we have fixed an integration constant. To
derive this bound, one cannot use radial semiinfinite lines and arcs centered at 0
in the z-plane for γ in general; these might hit the elliptic holes in U0 . But one
may modify the lines making a little detour around the elliptic holes; this locally
increases the length of the path only at most by a multiplicative constant.
Using the maximum principle for bounded subharmonic functions on parabolic
surfaces we conclude supU0 f = sup∂U0 f ; the boundary ∂U0 consists of the union
of all the ellipses ∂(Uj \ U0 ).
The sup∂U0 f is reached at one point ξ ∗ ∈ ∂(Uj ∗ \ U0 ) for some handle index
∗
j , since otherwise we would have sup∂U0 f = lim sup|j |→∞ sup∂(Uj \U0 ) f = 0;
the latter is impossible since f does not vanish everywhere and is antisymmetric,
ι∗ f = −f .
The symmetric oval aj ∗ divides the handle Uj ∗ into two pieces; let W be the
half of Uj ∗ in which ξ ∗ lies. f |W has to reach its maximum at a point ζ ∗ on
the boundary ∂W . ∂W consists of the symmetric oval aj ∗ and of one of the two
connected components of ∂Uj ∗ . f |W cannot reach its maximum at a ζ ∗ ∈ ∂Uj ∗ ,
since this would imply f (ζ ∗ ) > f (ξ ∗ ) = sup∂U0 f, which is impossible as ζ ∗ is
an interior point of U0 and f is nonconstant. Hence, ζ ∗ ∈ aj ∗ ; to be precise, aj ∗
is doubled, consisting of two opened branch cuts, since we have cut X open along
the aj ; we consider only the branch cut in the border of W .
f is constant along aj ∗ : to show this, take any vector v tangent to aj ∗ . Then
ι∗ v = v, hence hdf, vi = hι∗ df, ι∗ vi = −hdf, vi, i.e. hdf, vi = 0.
We know that f |W reaches its maximum on the whole oval aj ∗ , but does not
reach it on W \aj ∗ . We also know that df = σ = Re ω has a zero at one point Pj ∗ ∈
aj ∗ of the divisor P . The following lemma, applied to f written in logarithmic polar
√
coordinates z±j ∗ = tj ∗ el+iϕ , shows that this is impossible; this contradiction
yields that ω does not exist. 2
at the boundary. Assume that u(z) > 0 for Re z > 0 and u(z) = 0 for Re z = 0.
Then
∂
∀ϕ ∈ R : u(l + iϕ) > 0.
∂l l=0
Proof. By the maximum principle we get u(z) > 0 for 0 < Re z < L, and
therefore for every fixed 0 < L1 < L:
ε := inf u(z) > 0.
Re z=L1
Let us compare the following two Dirichlet boundary conditions on the compact
cylinder ([0, L1 ] + iR)/2π iZ:
0, Re z = 0,
r1 (z) =
ε, Re z = L1 ,
r2 (z) = u(z), Re z ∈ {0, L1 },
r1 6 r2 implies that the (unique) solutions u1 resp. u2 of the Dirichlet problem
on the cylinder with boundary conditions r1 resp. r2 satisfy u1 6 u2 . But u1 (z) =
ε
L1
Re z, u2 (z) = u(z), hence
∂u u2 (l + iϕ) u1 (l + iϕ) ε
(l + iϕ) = lim > lim = > 0. 2
∂l l&0 l l&0 l L1
6. Asymptotic Expansion
We want to determine the asymptotic behaviourQ of ψ ∈ 0(Ob (P + Q − S)) or ψ ∈
0(Ex (P )) for varying x ∈ R3 , P , Q, S ∈ j aj . Both cases are included when
P
we consider line bundles F = Ex ( N nκ P κ ) with divisors P κ = (Pjκ )j ∈J ∈
Q PN κ=1
j aj and multiplicities nκ ∈ Z, κ=1 nκ = 1. We are interested in an asymptotic
expansion of the global section in F (which is unique up to a constant) and its
dependence on the parameters x and P κ . We avoid to examine even more general
bounded line bundles to keep the treatment
Q simpler.
We fix a reference divisor O ∈ j aj ; the calculations will be done using the
fixed bounded line bundle Ob (O)⊗Ob (−n∞), n ∈ N; this is convenient to analyze
the parameter-dependence.
We will define only smooth global sections ϕ in F, ϕl in F(−l∞) and µj
in F(O); these sections will be “almost holomorphic” sections in F; with these
functions we build an ansatz for the global holomorphic section ψ in F:
X X
n−1
ψ = rϕ + βj µj + αl ϕl . (16)
j ∈J l=0
The complex coefficients αl and βj and the error term r = rn have to be deter-
mined; r is an only smooth section in Ob (O) ⊗ Ob (−n∞).
We have dropped the parameters x, P for simplicity of the notation; but we will
write them explicitly at some places to avoid ambiguity. Detailed specifications of
the ansatz functions µj , ϕl , ϕ are given below; but we describe the leading idea
first:
• The first summand rϕ in (16) will describe the error term in the expansion
2 3
(9). ϕ equals ezx+z y+z t over the domain U where the asymptotic expan-
sion is valid, but it is interpolated through the handles only smoothly but
“almost P meromorphically” with zeros/poles with orders bounded by the di-
visor − N κ
κ=1 nκ P . The factor r in the error term should satisfy a bound
−n
O(|z |) over U ; it will be only smooth in the handles with poles allowed at
the reference divisor O. The finiteness of a certain weighted Sobolev-norm
will guarantee these conditions on the error term r. We allow an additional
zero per handle of ϕ (below described by the divisor P 0 ) to get equal numbers
of zeros and poles per handle (counted with multiplicity) in the ansatz func-
tion; theP location of this zero is chosen in a way that the bounded line bundle
E(x,y,t )( N κ
κ=0 nκ P ) is almost isomorphic to Ob , meaning that it has a almost
holomorphic global basis section ϕ.
• The lth term in the last sum in (16) will describe the lth term in the ex-
pansion (9). To get this, we will choose the ansatz functions ϕl to equal
z−l ezx+z y+z t near infinity over U ; they are interpolatedPthrough the handles
2 3
of J -indexed sequences β =
The space `2w (J ) is defined to be the Hilbert space P
(βj )j ∈J ∈ CJ with the weighted norm kβk22,w := j ∈J |cj−2 sj2n−4 | · |βj |2 being
finite.
We define the smooth line bundle L := O(O) and L0,1 over the heat curve; the
latter denotes the bundle of L-valued one-forms of type (0, 1). It was shown in [12]
AN ASYMPTOTIC EXPANSION FOR BLOCH FUNCTIONS 261
that there are Hermitian metrics | · |A on L, | · |B̌ on L0,1 and a volume form with
the following properties:
• Let B̌ be Rthe completion of Cc∞ (H(q), L0,1 ) with respect to the norms
kωk2B̌ := H(q) |ω|2B̌ . Further let V be be the completion of Cc∞ (H(q), L)
R
with respect to the norm krk2V := H(q) (|r|2A + |∂r|2B̌ ). Then the Cauchy–
Riemann operator ∂: V → B̌ is a Fredholm operator of index 1 − n.
• ∼ d2 (1/z) over U0 near infinity, ∼ |cj−2 sj−4 | d2 zj over Vj ; we use the
abbreviation d2 w := d Re w∧d Im w; the relation “∼” means that the quotient
of both sides is bounded from above and from below by positive constants not
depending on the handle index j .
• |1|A ∼ |zn| over U0 near infinity, |1|A ∼ |sjn | over Vj ;
• |dz|B̌ ∼ |zn+2 | over U0 near infinity, |dzj |B̌ ∼ |cj sjn+2 | over Vj ;
over U but are interpolated almost meromorphically through the handles; the smooth
cutoff function χ serves only to remove the singularity of z−l near z = 0; so
the function χ should vanish near z = 0; but it should equal 1 outside a small
neighbourhood of z = 0. The functions φl will be specified below.
We abbreviate:
x̂ = (−l, x) := (−l, x, y, t);
ẑ = (Log z, z) := (Log z, z, z2, z3 );
η̂ = (ηj,log , η) := (ηj,log , ηj,1 , ηj,2 , ηj,3 );
α̂ = (αj,log , α) := (αj,log , αj,1 , αj,2 , αj,3 );
ûv̂ = ulog vlog + uv := ulog vlog + u1 v1 + u2 v2 + u3 v3
for û = (ulog , u) = (ulog , u1 , u2 , u3 ), v̂ = (vlog , v) = (vlog , v1 , v2 , v3 ); û, v̂ ∈
C4 ; u, v ∈ C3 ; the data ηj,... and αj,... were
Q introduced in Lemma 1. We define
an additional divisor P 0 = (Pj0 )j ∈J ∈ j aj with multiplicity n0 = −1 by the
relation
YN
zj (Pjκ )nκ = e2x̂ α̂ ; (18)
κ=0
262 FRANZ MERKL
one should note that P 0 depends on x and on l, although we suppress writing this
explicitly. The divisor P 0 serves as a location for an additional zero per handle of
the ansatz functions.
Morally we define:
X
φl = exp χ0,1 log φ0,l + χj,1 log φj,l
j
with
to see the last equation one uses zj − zj (Pjκ ) = −zj · zj (Pjκ ) · (z−j − z−j (Pjκ ))/tj ,
P
the relation (18), and N κ=0 nκ = 0. To make φj,l well-defined, we have to specify
which branch of the logarithm should be taken over Vj ; so we set over Vj :
X XN !
zj (Pjκ )
φl := exp χ0,1 · x̂ ẑ + χj,1 · x̂ α̂ + x̂ η̂ − nκ Log 1 − .
j κ=0
zj
“Log” means the principal branch of the logarithm; one should note that |zj | >
√
|zj (Pjκ )| = tj over Vj .
φl is meromorphic (poles at P ) outside the union of all Vj . We calculate to
which extent it deviates from being holomorphic over Vj ;
−1 ∂φl XN
zj (Pjκ )
φ = ∂χj,1 · − x̂ ẑ + x̂ α̂ + x̂ η̂ − nκ Log 1 −
l ∂z ∂z z
j j j
κ=0 √
tj
6 const1 · |x̂(−ẑ + α̂ + η̂)| + log 1 −
|zj |
6 const · |j |−r · (|x| + 1); (21)
We need to estimate also derivatives with respect to x and Pjκ over Vj ; this is a
rather technical matter. The reader might want to read only the statement of the
four lemmas in this section, but skip their proofs.
AN ASYMPTOTIC EXPANSION FOR BLOCH FUNCTIONS 263
ν
YN
∂ νκ
∂j := νκ .
κ=1
∂pj,κ
One should notice that there is no derivative with respect to pj,0 ; this variable is
not considered independent, since
X
N
pj,0 = 2i x̂ α̂ + nκ pj,κ (22)
κ=1
(modulo 2π ), see Equation (18). We examine the case (ν, m) 6= (0, 0):
ν m −1 ∂φl ∂χj,1 ν m
∂ D φ =
j l
∂zj ∂z · (∂j D x)(−z + α + η) −
j
XN
ν m zj (Pjκ )
− nκ ∂j D Log 1 − .
zj
κ=0
ν
∂j D m x equals 0 unless m1 +m2 +m3 = 1 and ν = 0; Lemma 1 yields the following
bound:
sup |(∂j D m x)(−z + α + η)| 6 const · |j |−r · δν,0 .
ν
Vj
To estimate the derivatives of the Log-terms over Vj , set f (t) := Log(1 − t) and
√ ν
gκ := tj eipj,κ /zj . By the chain rule, ∂j D m f (gκ ) is a finite sum of terms of the
form
Y
k
ν
f (k) (gκ ) ∂j ` D m` gκ
`=1
Pk P
with k > 1, `=1 ν ` = ν and k`=1 m` = m. The |gκ | are bounded over Vj
√
uniformly in j by a constant less than 1; one should note |zj | > const > tj over
Vj . We get a bound |f (k) (gκ )| < constk uniformly in j over Vj .
Let us estimate in the case κ = 0:
ν
∂j ` D m` g0 = (−2α)m (in)ν ` g0 ;
k
Y
ν ` m`
∂j D g0 = |nν (−2α̂)m g0k | 6 constν,k,m · |sj |v(m) |j |−r
`=1
Q
we used the notation v(m) = m1 + 2m2 + 3m3 , ζ m := n ζnmn ; recall |αj,l | 6 |sjl |
from Lemma 1, k > 1 and
sup |gκ | 6 min{1, const · |j |−r }. (23)
Vj
264 FRANZ MERKL
The case κ > 0 is easier since there is no x-dependence; here we get as well
k
Y Y
ν ` m`
∂j D gκ = |gκk |δ0,m δ0,νκ 0
0
`=1 κ 6=κ
It results
ν m −1 ∂φl
∂ D φ 6 constν,m,x̂ · |sj |v(m) |j |−r ; (25)
j l
∂z j
we included the case (ν, m) = (0, 0) in the last two bounds too; see the estimates
(21) and (23).
The next lemma examines ϕ = φ0 only; it yields a bound for the left upper
corner M11 in the matrix Mx,P :
X Z 2 !1/2
|dz | ∂ϕ
6 |f |A j B̌
D m
ϕ −1
|1| ∂z
V A j
±j ∈J \J 0 j
|dzj |B̌ m −1 ∂ϕ
6 sup sup D ϕ kf kV
±j ∈J \J 0 Vj |1|A ∂zj
6 constm,x̂ sup |cj ||sj |v(m)+2 |j |−r kf kV
±j ∈J \J 0
we utilized the bound (25) and the bounds on the Hermitian metrics | · |A and
| · |B̌ . The last supremum goes to 0 as J 0 increases towards J ; this shows that the
multiplication operator with ϕ −1 ∂ϕ can be approximated in the operator norm by
compact operators; hence it is compact.
AN ASYMPTOTIC EXPANSION FOR BLOCH FUNCTIONS 265
Q
It remains to examine the dependence on P κ : Let Qκ , R κ ∈ j aj . We join
Q
Qκ and R κ by a path [0, 1] → ( j aj )N , s 7→ (P κ (s))κ , P κ (0) = Qκ , P κ (1) =
R κ , zj (Pjκ (s)) = zj (Qκj )eisqj,κ with |qj,κ | 6 π ; this means that we have para-
metrized the shorter arc on aj joining Qκj and Rjκ . To avoid confusion, we write the
parameters explicitly in the next computation:
f · D m ∂ϕx,Q − D m ∂ϕx,R
ϕx,Q ϕx,R
B̌
Z
X 1 ∂ m ∂ϕx,P (s)
=
f · qj,κ D ds
∂p ϕ
j,κ 0 j,κ x,P (s) B̌
−r
6 constm,x sup |qj,κ ||cj ||sj |
v(m)+2
|j | kf kV .
j,κ
This time we used the estimate (25) for ν being the κ-th unit vector. Q The last
supremum converges to 0 when R → Q in the product topology on ( j aj )N ,
i.e. qj,κ → 0 for all j ∈ J and κ = 1, . . . , N; this means that D m (∂ϕx,Q /ϕx,Q )
depends continuously on Q. 2
−1
LEMMA 5. Assume that v(m) + n − r < −1. Then the norm kD m (ϕx,P ∂ϕl,x,P )kB̌
m −1
is finite. The function D (ϕx,P ∂ϕl,x,P ), viewed as an element of B̌, depends con-
Q
tinuously on the divisors P κ ∈ j aj .
Proof. We estimate ϕl /ϕ over Vj . We work with varying values of l; to avoid
0,l,x
confusion, we have to write the parameter-dependence explicitly: Pj0 = Pj . The
cutoff function χ equals 1 over Vj , hence
X
ϕl −l
= z exp χj,1 · (l · (Log z − αj,log − ηj,log ) + h) (26)
ϕ j
with
0,l,x 0,0,x
zj (Pj ) zj (Pj )
h := Log 1 − − Log 1 − ;
zj zj
one should note that all terms using zj (Pjκ ) with κ > 0 cancel. The ratio (26) is
uniformly bounded:
ϕl
sup sup < ∞ (27)
j Vj ϕ
ν
We estimate the derivatives ∂j D m (ϕl /ϕ) over Vj too: One observes that the only
(x, P )-dependent term in the right-hand side of expression (26) is h; its derivatives
are bounded by the estimate (24):
|∂j D mh| 6 constν,m · |sj |v(m) |j |−r .
ν
ν
We apply the chain rule to write the derivatives ∂j D m ϕϕl as a finite sum:
ν ϕl ϕl X Y
k
ν
∂j D m = const(ν ` ),(m` ) ∂j ` D m` h
ϕ ϕ (ν ` ),(m` ) `=1
Pk Pk
with `=1 ν ` = ν and `=1 m` = m; so we get
ϕl X Yk
const(ν ` ),(m` ) (|sj |v(m` ) |j |−r ) 6 constν,m,x̂
ν
|∂j D m |6 (28)
ϕ (ν )(m ) `=1
` `
Q
we have used the estimate (27); note that the product k`=1 may be empty for
m = 0.
We apply the product rule to ϕ −1 ∂ϕl /∂zj and bound the terms by (28) and (25):
ν m −1 ∂ϕl
∂ D ϕ
j ∂zj
X X
ν 00 m00 ϕl ν 0 m0 −1 ∂ϕl
6 constν,ν ,m,m ∂j D ∂ D ϕl
ϕ j ∂zj
0 0
00 0 00
ν +ν =ν m +m =m0
X X 0
6 constν,ν 0 ,m,m0 ,x̂ · |sj |v(m ) |j |−r
ν 0 6ν m0 6m
The summation index j runs over a two-dimensional lattice; therefore the norm is
finite when v(m) + n − r < −1.
QWe estimate the dependence on the divisors P κ : We use again a path [0, 1] →
( j aj )N , s 7→ (P κ (s)), P κ (0) = Qκ , P κ (1) = R κ , zj (Pjκ (s)) = zj (Qκj )eisqj,κ
with |qj,κ | 6 π as in the proof of the previous lemma. We do not suppress the
parameters x, P here to avoid confusion. Using the estimate (29) this time for ν
being a unit vector, we get similarly to the previous bound:
2
d m −1
D ϕ
ds x,P (s) ∂ϕl,x,P (s)
XB̌
6 constm,x̂ + constm,x̂ |qj,κ |2 |j |2(v(m)+n−r) < ∞
j,κ
−1
when v(m) + n − r < −1. This shows that D m (ϕx,P ∂ϕl,x,P ) depends continuously
Q
on the divisors P ∈ j aj .
κ
2
There is no dependence on x and P κ here; hence this definition makes sense in the
κ=0,l=0,x κ=0,l=0,x
degenerate case Oj = Pj too. In the nondegenerate case Oj 6= Pj
we have
ϕ(Oj ) resOj r
(M21 r)j = .
resOj µj j ∈J
and consequently
X
kM21 rk22,w = |cj−2 sj2n−4 | · |(M21 r)j |2
j ∈J
XZ
6 const · |r|2A
j Vj ∪V−j
6 const · krk2V .
This shows that M21 is bounded. 2
AN ASYMPTOTIC EXPANSION FOR BLOCH FUNCTIONS 269
and
ν m X p k m kipj,0
1 α e
sup ∂j D
√ ipj,0 = sup const k,ν,m tj √ ipj,0 k+1
Vj z j − tj e Vj (z j − tj e )
06k 6m
X
const0k,ν,m |j |−rk |sj | 6 const000
v(m) v(m)
6 ν,m |sj |
06k 6m
with m depending on m > 0 and ν > 0. The product rule and supj supVj |zj /(zj −
z(Oj ))| < ∞ imply
ν m ∂µj p
sup ∂j D 6 const(4) tj |s v(m) | sup |∂χj,2 |
ν.m j B̌
Vj ϕ B̌ Vj
The last sum is finite when v(m) − r + 3 < −1 because of |cj | 6 O(|j |) and
|sj | 6 O(|j |); recall that j runs through a two-dimensional lattice. This shows that
M12 is a bounded operator. Sums over finite subsets of J yield operators of finite
rank; since these partial sums approximate M12 in the operator norm, M12 and its
derivative D m M12 must be compact operators. Using the estimates for ν being the
κ-th unit vector, we see that these operators depend continuously on P κ . 2
We
P summarize the outcome of the previous four lemmas; we use the divisor P :=
κ
n P
j,κ κ j :
the corollary is applied to the bounded line bundle Ex (P ); we know that Riemann–
Roch theorem for infinite genus Riemann surfaces is valid for this bounded line
bundle. 0(Ex (P )) being one-dimensional implies that the kernel of Mx,P is one-
dimensional too: The kernel is at least one-dimensional since the index of Mx,P
equals 1, but itP cannot be more than one-dimensional:
Pn−1 The latter would imply that
−1 −1
there is a r = j ∈J βj ϕ µj,x,P + l=0 αj ϕ ϕl,x,P ∈ V with (β, α) 6= (0, 0).
If l < n were the smallest index with αl 6= 0 then |r| ∼ |z−l | near infinity over U ;
this contradicts r ∈ V . Consequently α must vanish. But then the βj must vanish
too; else r had a pole at Pj0 (pole of second order in the degenerate case Oj = Pj0 );
this would contradict r ∈ V too.
We show next that Mx,P (r, β, α)T = 0 and α0 = 0 imply (r, β, α)T = 0: In
this case, we can improve the above estimate (31) of the L2 -norm of ψ (defined as
above) by one order:
Z Z n−1 2
X
|ze−xz ψ|2 = αl z1−l + zr < ∞. (32)
U ∩{|z|>1} U ∩{|z|>1}
l=1
272 FRANZ MERKL
Q know that the line bundle Ex (P ) is isomorphic to some Ob (Q) for a Q−1∈
We
j aj ; when ψ 6= 0 such an isomorphism is given by multiplication with ψ ,
since the only global sections in Ob (Q) are constants. But then ψ −1 must fulfill the
bound
This lower bound for ψ contradicts the upper L2 -bound (32). We conclude that
only ψ = 0, (r, β, α)T = 0 is possible when α0 = 0. We summarize these
considerations:
LEMMA 8. Let f0 be a holomorphic function over supp χj,0 ∩ {|z| > 1}, fj be
a meromorphic function defined over Uj with at most a simple pole at Pj . Assume
that the bounds
Z
|zn e−xz f0 |2 < ∞ (33)
supp χj,0
and
m
with positive constants kl
. We used that
X
n+v(k)−1
D rn = D rn+v(k) +
l l
z−l D l αl
l=n
X m n+v(k)−1
X
f0 := D ψ − e m xz
zv(k)−l D l αl over supp χj,0 , (37)
k+l=m
k l l=0
X m n+v(k)−1
X
fj := D ψ − m
φj,l−v(k) D l αl over Uj ; (38)
k+l=m
k l l=0
X m n+v(k)−1
X
n −xz
z e (f0 − fj ) = −z n
(zv(k)−l − e−xz φj,l−v(k) )D l αl
k+l=m
k l l=0
X m X n−1
= − zn−l · (1 − zl e−xz φj,l) )D l αl+v(k) .
k+l=m
k l l=−v(k)
the last bound holds when Proposition 5 is applicable, i.e. when the inequality (36)
is valid. Lemma 8 yields that the following limit exists:
X m
n −xz
lim z e f0 = lim zn+v(k) D l rn+v(k) .
z→∞,z∈U z→∞,z∈U
k+l=m
k l
We build linear combinations to eliminate the terms that are growing at infinity:
u · ψ := ψxx − ψy
= 2ax + (axx − ay + 2bx )z−1 + O(z−2 ) ψ; (39)
w · ψ := 2ψt + ψxxx − 3ψxy
= (3axx − 3ay ) + 2at − 3by + 3ax2 − 3axy + 3bxx + axxx z−1 +
+ O(z−2 ) ψ. (40)
and the defining equation (39) for u now turns out to be the heat equation (10).
We take a linear combinations of derivatives of Equations (41) and (42); we
want the dependence on b to cancel. We end up with the KP-I-equation:
and consequently
|D m uP (x, y, t) − D m uP (x, y, t + τ )|
= |D m uv((x,y,t ),P )(0) − D m uv((x,y,t +τ ),P )(0)| < ε.
AN ASYMPTOTIC EXPANSION FOR BLOCH FUNCTIONS 277
here γn = (γn1 , γn2 ) is a basis vector in the lattice 0; see the asymptotic description
of heat curves (8) resulting from [2], Part III, §16, Theorem 16.1. This means that
4n ∈ 0(Eγn ); we have set γn := (γn1 , γn2 , 0). Multiplication with 4n defines
therefore an isomorphism Ex (P ) → Ex+γn (P ) of bounded line bundles. This
isomorphism also acts on the one-dimensional space of global sections:
note that both sides are normalized by their behaviour at infinity. The last equation
just states the Bloch condition (6).
Let
2 +t z3
ψx,P = exz+yz (1 + a(x)z−1 + O(z−2 ))
denote the asymptotic expansions in U near infinity; then the Bloch condition
(43) yields aγn + a(x) = a(x + γn ). Differentiation with respect to x implies
0-periodicity of the solution of the KP-I equation: u(x) = u(x + γn ) and that the
mean value in x-direction of u(x) equals a constant:
Z γ11
u(x + s, y, t) ds = 2(a(x + γ11, y, t) − a(x, y, t)) = 2aγ1 ;
0
this integral must vanish when it vanishes for the initial conditions u(x, y, t = 0).
Let ψξ (x, y) 6= 0 solve the heat equation (5) with the Bloch condition (6). Then
we can reconstruct the potential q by
Acknowledgements
The author wants to thank Horst Knörrer and Henry McKean for their continuous
support, their advice and many discussions. The author also would like to thank
Giovanni Felder, Joel Feldman, Fritz Gesztesy, Igor Krichever, Alain-Sol Sznitman
and Eugene Trubowitz for interesting discussions. This work was supported by the
Swiss National Science Foundation, Bern and by the Deutsche Forschungsgemein-
schaft, Bonn.
References
1. Bourgain, J.: On the Cauchy Problem for the Kadomcev–Petviashvili equation, Geom. Funct.
Anal. 3 (1993), 315–341.
2. Feldman, J., Knörrer, H., and Trubowitz, E.: Riemann surfaces of infinite genus I–IV, ETH
Preprint, Zürich, 1994.
3. Feldman, J., Knörrer, H., and Trubowitz, E.: Infinite genus Riemann surfaces, In: J. Carell and
R. Murty (eds), Canadian Mathematical Society 1945–1995, Volume 3, 1996, pp. 91–111.
4. Gesztesy, F., Holden, H., Saab, E., and Simon, B.: Explicit construction of solutions of the
modified Kadomcev–Petviashvili equation, J. Funct. Anal. 98(1) (1991), 211–228.
5. Gesztesy, F. and Schweiger, W.: Rational KP and mKP-solutions in Wronskian form, Rep.
Math. Phys. 30(2) (1991), 205–222.
6. Gesztesy, F., and Unterkofler, K.: On the (modified) Kadomcev–Petviashvili hierarchy, Differ-
ential Integral Equations 8(4) (1995), 797–812.
7. Krichever, I. M.: Spectral theory of two-dimensional periodic operators and its applications,
Russian Math. Surveys 44(2) (1989), 145–225.
8. McKean, H. P.: Integrable systems and algebraic curves, In: Global Analysis, Lecture Notes in
Math. 755, Springer, New York, 1978, pp. 83–200.
9. McKean, H. P. and Trubowitz, E.: Hill’s surfaces and their theta functions, Bull. Amer. Math.
Soc. 84(6) (1978), 1042–1085.
10. McKean, H. P. and Trubowitz, E.: Hill’s operator and hyperelliptic function theory in the
presence of infinitely many branch points, Comm. Pure Appl. Math. 29 (1976), 143–226.
11. McKean, H. P. and van Moerbeke, P.: The spectrum of Hill’s equation, Invent. Math. 30 (1975),
217–274.
12. Merkl, F.: A Riemann–Roch theorem for infinite genus Riemann surfaces, submitted to Invent.
Math. (1999).
13. Merkl, F.: A Riemann–Roch theorem for infinite genus riemann surfaces with applications to
inverse spectral theory, Dissertation ETH Zürich No. 12469, 1997.
14. Mumford, D.: An algebro-geometric construction of commuting operators and of solutions to
the Toda lattice equation, Korteweg–de Vries equation and related non-linear equations, In:
Proceedings of the International Symposium on Algebraic Geometry, Kyoto, 1977, pp. 115–
153.
Mathematical Physics, Analysis and Geometry 2: 279–289, 1999.
279
© 1999 Kluwer Academic Publishers. Printed in the Netherlands.
PETER STOLLMANN
Department of Mathematics, Johann Wolfgang Goethe-Universität, Frankfurt, Germany
1. Introduction
In this note, we present a very simple approach to proving Lifshitz asymptotics
for random operators and apply it to Schrödinger operators with Anderson and
Poisson potentials. Let us first briefly recall what Lifshitz asymptotics is about:
consider H0 = −1 on Rd and a random potential term Vω . The operator H0 + Vω
is to be thought of as the effective Hamiltonian of fixed a realization of a random
solid. If we assume that the Vω are bounded below uniformly in ω, Vω > 0 say,
the restriction (H0 + Vω )3 of H0 + Vω to an open cube 3 with Neumann boundary
conditions has compact resolvent. Therefore, the spectral counting function
n(E, (H0 + Vω )3 ) := tr χ[0,E] (H0 + Vω )3
which gives the number of eigenvalues below E, counted with multiplicity, is fi-
nite. This function bears important information about the random potential under
consideration. In fact, its limit as 3 exhausts the whole space has an asymptotic
behaviour characteristic of disorder. To see that, let us first recall that by the cel-
ebrated Weyl asymptotic formula, in absence of disorder, i.e. for Vω = 0, we
have
(where we use |3| for the volume of the cube) which means that
1
lim n(E, (H0 )3 ) =: N0 (E) = Cd E d/2 .
3%Rd |3|
280 P. STOLLMANN
(Here we assume that 0 is the inf of the spectrum of H0 + Vω a.e. for notational
convenience.) His reasoning is as follows:
first of all n(E, (H0 + Vω )3 ) 6 n(E, (H0 )3 ) as the nonnegative potential term
shifts the eigenvalues to the right. Therefore, with E1 (. . .) denoting the bottom
eigenvalue of the operator in question, we have
Z
1
N(E) 6 n(E, (H0 + Vω )3 )χ{E1 ((H0 +Vω )3 )6E} dP(ω)
|3|
6 CE d/2 P{ω : E1 ((H0 + Vω )3 ) 6 E}.
Of course, this is not a mathematically rigorous proof. The point which certainly
has to be made precise is the existence of a large enough region where Vω = 0.
LIFSHITZ ASYMPTOTICS VIA LINEAR COUPLING OF DISORDER 281
For due to tunneling effects, φ might still live on parts of space where Vω 0. Of
course, Vω may not increase the potential energy too much.
Our way around that difficulty goes as follows: Let H (ω) = H0 + Vω be a
random Schrödinger operator with Vω > 0. By what we said above, Lifshitz be-
haviour for the integrated density of states can be deduced from an estimate of the
following form, where H3 (ω) denotes H0 +Vω in L2 (3), with Neumann boundary
condition (b.c.), 3 = 3l (0) an open cube with sidelength l in Rd and E1 (·) the first
eigenvalue.
P{E1 (H3 (ω)) 6 E1 (H0 ) + bl −2 } 6 4 exp(−l d γ ). (1.1)
This latter inequality states in precise terms that it is very unlikely to find really
small eigenvalues of H3 (ω). In order to prove such an inequality, one has to over-
come the following main problem: for a simple-minded lower bound on the first
eigenvalue one would need a uniform lower bound on the perturbation Vω . Such
uniform lower bounds only hold with small probability. On the other hand, what
one knows by standard probabilistic tools are lower bounds for the mean of Vω
for typical ω. So what we need is a relation between the mean of Vω and the first
eigenvalue. In our approach we choose a derivative related with E1 (ω) as such a
link. As you will see, that provides a conceptually simple proof of inequality (1.1).
More precisely, let us consider
H3 (ω, t) = H0 + tVω on L2 (3) with Neumann b.c.
Then the first eigenvalue E1 (ω, t) of this operator behaves like
E1 (H3 (ω)) > E1 (ω, t) ≈ E1 (H0 ) + tE10 (ω, 0) for small t, (1.2)
with
E10 (ω, 0) = (Vω φ0 |φ0 ),
where φ0 is the normalized ground state of H0 .
Now, let us take a closer look at (1.2). We have to find out just how large we
may take t. Analytic perturbation theory suggests t ≈ l −2 , as this is the distance of
E1 (H0 ) to E2 (H0 ) for typical Schrödinger operators. With this choice and b small
enough, from E1 (ω) 6 E1 (H0 ) + bl −2 it follows that E10 (ω, 0) has to be small.
But, in the Anderson case,
1 X
E10 (ω, 0) = (Vω φ0 |φ0 ) = ω(i) const. (1.3)
|3| i∈3
is essentially the mean of a sum of |3| = l d i.i.d. variables. The probability that
this mean differs from the expectation by some fixed constant goes to zero expo-
nentially in the number of independent copies, i.e. exp(−γ |3|) which is exactly
the decay we need. For Poisson potentials, we provide a rather elementary large
deviation estimate, reducing it to something like (1.3). The above considerations
282 P. STOLLMANN
constitute already the main idea of our method which we call ‘linear coupling of
disorder’ for obvious reasons. The rest of the paper is devoted to carrying out the
details needed to turn the above heuristics into a rigorous proof.
In principle, this requires three steps: firstly, the standard procedure to deduce
Lifshitz tails from (1.1) above. To prove (1.1) we need, of course, large deviation
results for (Vω φ0 |φ0 ). As a last ingredient, a remainder estimate for the first-order
approximation to E1 (ω, t) is needed, which, again, is quite standard.
As all three steps are rather straightforward, the expert reader could stop at
this point. (But please, read on.) To appreciate the simplicity of our approach, the
reader should compare it with the proofs available so far; see [1, 10], where detailed
references to the literature can be found. Usually, there is some tricky part when
it comes down to showing the main point: small E1 (ω) come from large deviation
from the typical ω.
For the Anderson model, Temple’s or Thirring’s inequality is used at that point.
The Poisson model was treated using the celebrated work of Donsker and Varad-
han [2] on the asymptotics of the Wiener sausage. A beautiful introduction to this
circle of ideas can be found in [8].
In our approach we single out a very convenient link, namely E10 (ω, 0).
This enabled us to apply our method to a random quantum waveguide model [5],
quite reminiscent of the Anderson model. For this model, however, determining the
derivative E10 (ω, 0) is harder (and more interesting), and the methods using Tem-
ple’s or Thirring’s inequality fail. Despite of all the advertisement for our method,
we should stress that, so far, we haven’t achieved ‘the right constant’ in the Poisson
case.
d
(P) THE POISSON MODEL. Let denote the point P measures on R and P the
Poisson measure on , which is concentrated on { i δXi ; for some discrete se-
quence (Xi )}. Define
X
VωP (x) = f ∗ ω(x) = f (x − Xi (ω)).
i
LIFSHITZ ASYMPTOTICS VIA LINEAR COUPLING OF DISORDER 283
We refer to [1, 4, 10] for a discussion of this very important quantity. Note that the
trace appearing above simply counts the number of eigenvalues below t, so that
N(t) is interpreted as the number of energy levels per unit volume of H (ω). The
fact that
1
E tr χ[0,t ] (H3 (ω))
|3|
decreases as 3 increases is due to our choice of the boundary condition. Since
we are working with Neumann boundary conditions the spectral counting func-
tion is subadditive on disjoint open sets. It is also possible to work with Dirichlet
boundary conditions instead, in which case the spectral counting function is super-
additive. For reasonably well defined Vω the limits are in fact the same. See [4] for
a thorough discussion of this point.
The estimate given in the next theorem is usually referred to as Lifshitz tail
behaviour and is one of the central topics of disordered systems ever since Lifshitz’
seminal contribution [9]:
THEOREM 2.1. The integrated density of states N(t) satisfies
log N(t)
lim sup −d/2 6 −γ (2.1)
t &0 t
for some γ > 0. For the Anderson model, γ = γA depends upon f, M, S and for
the Poisson model, γ = γP depends upon f .
The inequality (2.1) will easily follow from the next result, as we will show at
the end of this section. In the supplement given there one can see the dependence
quite clearly. Note that we write E1 (·) for the first eigenvalue of the operator in
question.
PROPOSITION 2.2. (A) There exist universal constants c, K > 0 such that with
cA = c · S · kf kp /kf k1 , for every
284 P. STOLLMANN
π 2 M2
b 6 min , ,
4 cA2
we have
P{E1 (H3A (ω)) 6 b · l −2 }
√ √
d M − cA b S · (M − cA b)
6 K exp −l log 1 + . (2.2)
K ·S v
(P) There exists a universal constant c0 such that, for M = (e − 1)/e = v,
kf kp
cP = c0 · ,
kf k1
and every
π 2 M2
b 6 min , ,
4 cP2
we have
P{E1 (H3P (ω)) 6 b · l −2 }
ld
√ ld
√
M − cP (l−2) d b M − cP (l−2) d b
6 K exp −(l − 2) d
log 1 + .(2.3)
K v
Let us first single out an important step in the proof of Proposition 2.2. To this
p
end, fix V ∈ Lloc,unif(3), let H (t) = −1 + t · V in L2 (3) with Neumann b.c. and
denote its first eigenvalue by E1 (t). Note that E1 (0) = 0.
LEMMA 2.3. There exists a universal constant C such that for τ = C·kV k−1
p,loc,unif
and 0 6 t 6 τ l −2 we have
π2 2 2
|E1 (t) − tE10 (0)| 6 ·l ·t .
4τ 2
Proof. To estimate the remainder term in the Taylor expansion we want to use
[3], formula II(3.6). The isolation distance ϑ defined as the distance of E1 (0) to
the rest of the spectrum of H (0) is given by ϑ = π 2 / l 2 .
As 0 we choose a circle around E0 with radius ϑ/2. We need an estimate for
the r0 appearing in [3], II(3.3), which means that we have to consider
r(ζ ) = kV (H (0) − ζ )−1 k−1 for ζ ∈ 0.
As (H (0) − ζ )−1 maps L2 to the Sobolev space W 2,2 with norm controlled by
dist(ζ, σ (H (0))) > ϑ/2, we have by Sobolev’s inequality that
2
kV (H (0) − ζ )−1 k 6 c0 · · kV kp,loc,unif,
ϑ
LIFSHITZ ASYMPTOTICS VIA LINEAR COUPLING OF DISORDER 285
so that
ϑ
r0 = min r(ζ ) > C 00 ,
ζ ∈0 kV kp,loc,unif
and an appeal to [3], estimate II(3.6) finishes the proof. 2
Define
π · S · kf kp
cA = ,
C · kf k1
so that
π
6 cA
kf k1 · τ
Now, if 0 6 b 6 M 2 /cA2 it follows that
−2
1 X √
P E1 (ω, 1) 6 b · l 6 P ω(i) 6 cA b
|3| i∈3
1 X √
6 P ω(i) − M > M − cA b .
|3| i∈3
where Xi(m) (ω) is one of the Poisson points in 31 (m), if Ym (ω) = 1, and zero else.
Clearly,
Wω (x) 6 Vω (x) for all ω ∈ , x ∈ Rd ,
and, hence,
P{E1 (H3 (ω)) 6 b · l −2 } 6 P{E1 (−1 + Wω ) 6 b · l −2 }.
Now the latter probability can be estimated by the same calculation as in the case
(A) above, since
X
(l − 2)d 1
(Wω φ0 |φ0 ) > kf k1 Ym ,
ld (l − 2)d m
LIFSHITZ ASYMPTOTICS VIA LINEAR COUPLING OF DISORDER 287
and
kWω kp,loc,unif 6 3d · kf kp ,
where for the last inequality we counted the neighbouring boxes and thus the max-
imal number of nontrivial terms in the sum which defines Wω . Thus, we are again
left with applying a large deviation result for sums of i.i.d. variables. We get, with
π kf kp
cP = · ,
C kf k1
that
P E1 (ω, 1) 6 b · l −2
X
1 ld √
6P Ym 6 cP b
(l − 2)d m∈3 (l − 2)d
l−2
1 X ld √
6P
Ym − M > M − cP b
|3| m∈3 (l − 2)d
l−2
ld
√ ld
√
M − cP (l−2) d b M − cP (l−2) d b
6 K exp − (l − 2) d
log 1 + ,
K v
by [11], Thm. 1.4. 2
3. Concluding Remarks
Of course, one could shorten the above proof if one isn’t interested in the exponent.
There are different quite easy perturbation theoretic proofs for Lifshitz tail
asymptotics which use Temple’s or Thirring’s inequality. See [4] for a detailed
explanation and references. The method presented here has the advantage that the
link between spectral and probability theory provided by the derivative allows for
a conceptually more transparent proof, at least in our opinion. Moreover, the deriv-
ative is in many cases easy to calculate or at least easy to guess, which provides a
road map for the rigorous proof. An example is the application of the above method
in [5], where we didn’t see how to use the methods previously available.
So far we haven’t been able to strengthen our arguments so as to obtain the
d/2
correct value of the exponent which is known to be C · γd , where C is a known
constant and γd is the lowest eigenvalue of the Dirichlet Laplacian on a ball of unit
volume in Rd ; see [10] for an extensive discussion. This correct value is related
with isoperimetric inequalities and is obtained by using the celebrated results of
Donsker and Varadhan; see [2, 8, 4, 10].
To date there are more detailed results available for the bottom (i.e. principal)
eigenvalue of a Schrödinger operator with Poissonian obstacles; we refer the reader
to [12] and the literature cited there.
A recent thorough investigation of the attractive Poissonian case (i.e. the case
where f is nonpositive) is given in [7].
Let us further mention recent deep work of Klopp, [6], which deals with what
is called internal Lifshitz tails. Consider H0 = −1 + V0, where V0 is a Zd -periodic
potential, and H (ω) = H0 +VωA . The spectrum of H0 consists of a number of closed
intervals, called bands, separated by open intervals called gaps. The same is true for
H (ω), where the bands are usually shifted and somewhat enlarged, depending on
sign and size of VωA . Lifshitz predicted that the behaviour of the integrated density
of states N0 for H0 near band edges E0 should have the same power law decay as
in the case V0 = 0 at energy 0, i.e.
N0 (E0 + ε) ∼ N0 (E0 ) + ε d/2 as ε & 0
if E0 is the left endpoint of one of the bands. Moreover, for the randomized op-
erator H (ω) he claimed that the integrated density of states N should exhibit
the exponential decay discussed above for the inf of the spectrum. Interestingly
enough, both claims are still not proved nor disproved in general. Klopps work
LIFSHITZ ASYMPTOTICS VIA LINEAR COUPLING OF DISORDER 289
Acknowledgement
Heartfelt thanks go to F. Kleespies and R. Lang for most useful discussions.
References
1. Carmona, R. and Lacroix, J.: Spectral Theory of Random Schrödinger Operators, Birkhäuser,
Boston, 1990.
2. Donsker, M. D. and Varadhan, S. R. S.: Asymptotics for the Wiener sausage, Comm. Pure Appl.
Math. 28 (1976), 525–565.
3. Kato, T.: Perturbation Theory for Linear Operators, Springer-Verlag, New York, 1980.
4. Kirsch, W.: Random Schrödinger operators: A course, In: H. Holden and A. Jensen (eds),
Schrödinger Operators (Sonderborg DK, 1988), Lecture Notes in Phys. 345, Springer, Berlin,
1989.
5. Kleespies, F. and Stollmann, P.: Lifshitz asymptotics and localization for random quantum
waveguides, Rev. Math. Phys., to appear.
6. Klopp, F.: Internal Lifshitz tails for random perturbations of periodic Schrödinger operators,
Duke Math. J. 98(2) (1999), 335–396.
7. Klopp, F. and Pastur, L.: Lifshitz tails for random Schrödinger operators with negative singular
Poisson potential, Preprint, 1998.
8. Lang, R.: Spectral Theory of Random Schrödinger Operators. A Genetic Introduction, Lecture
Notes in Math. 1498, Springer, New York, 1991.
9. Lifshitz, I. M.: Energy spectrum structure and quantum states of disordered quantum systems
(in Russian), Uspekhi Fiz. Nauk 83 (1964), 617–663.
10. Pastur, L. and Figotin, A.: Spectra of Random and Almost-Periodic Operators, Springer-Verlag,
Berlin, 1992.
11. Talagrand, M.: New concentration inequalities in product spaces, Invent. Math. 126(3) (1996),
505–563.
12. Sznitman, A.-S.: Fluctuations of principal eigenvalues and random scales, Comm. Math. Phys.
189 (1997), 337–363.
Mathematical Physics, Analysis and Geometry 2: 291–321, 1999.
291
© 1999 Kluwer Academic Publishers. Printed in the Netherlands.
LECH ZIELINSKI
Institut de Mathématiques de Paris-Jussieu UMR9994, Université Paris 7 (D. Diderot), 2 Place
Jussieu, 75252 Paris Cedex 05, Case Postale 7012
Key words: spectral asymptotics, Weyl formula, self-adjoint elliptic operators with nonsmooth co-
efficients, sharp remainder estimates.
1. Introduction
This paper is motivated by the well-known result of L. Hörmander [6] concerning
the asymptotic formula for the spectral function (i.e. the kernel of the spectral
projector) of a self-adjoint elliptic differential operator A with smooth coefficients
on a smooth manifold M. If 2m is the order of A and d is the dimension of M, then
d
e(A, y, y, λ) = ω(y)λ 2m (1 + O(λ−µ )) (1.1)
holds with µ = 1/2m. If M is compact, then (1.1) implies the Weyl formula
where N(A, λ) is the counting function of A (i.e. the number of eigenvalues less
than λ, counted with multiplicities).
It is natural to ask if (1.2) still holds for elliptic operators with irregular coeffi-
cients. In [29] we proved that (1.2) holds with µ < r/2m if the coefficients of A are
Hölder continuous of exponent r ∈ ]0; 1]. In particular, for Lipschitz continuous
coefficients, the exponent µ can take an arbitrary value strictly smaller than 1/2m
and we called these results ‘intermediate remainder estimates’.
The aim of this paper is to obtain ‘sharp remainder estimates’, i.e. with the
optimal exponent µ = 1/2m (the value µ = 1/2m cannot be improved in general
292 LECH ZIELINSKI
without additional hypotheses on the Hamiltonian flow). Our basic result in this
direction is
Concerning the sharp spectral asymptotics, we begin by noting that Step 1 can
be made adopting the regularization procedure used in [28 – 31]. However, pro-
found differences between intermediate and sharp remainder estimates appear in
SHARP SPECTRAL ASYMPTOTICS AND WEYL FORMULA 293
Step 2. Indeed, the proof of [29] uses a pseudodifferential calculus based on the
fact that P is a pseudodifferential operator of degree 2mµ < 1 and the evolution
propagator e−it P does not change the wave front set. To obtain sharp remainder
estimates, we must use P of degree 1 and we expect that e−it P propagates the
wave front set along the trajectories of the Hamiltonian flow. Thus, a reasonable
regularity hypothesis should allow to solve and to control the Hamiltonian system.
A good hypothesis of this type is the hypothesis that the Hamiltonian field is Lip-
schitz continuous, i.e. the hypothesis that first-order derivatives of coefficients are
Lipschitz continuous.
We note that the analysis of all spectral problems mentioned before begins by
localization. More precisely, it suffices to study functions supported in a suitable
open subset of M and the corresponding quadratic form in local coordinates can be
considered on the whole Rd , i.e. it suffices to consider the quadratic form
X
A0 [ϕ, ψ] = 0
(aα,β D α ϕ, D β ψ) (1.5)
|α|,|β|6m
Then the ellipticity hypothesis of Theorem 1.1 means that there is c > 0 such
that
X
0 0
apr (x, ξ ) = aα,β (x)ξ α+β > c|ξ |2m (1.6)
|α|=|β|=m
and the regularity hypothesis of Theorem 1.1 means that there is C > 0 such that
0 0
|∇aα,β (x) − ∇aα,β (y)| 6 C|x − y| for x, y ∈ Rd , (1.7)
where it suffices to assume (1.7) for |α| = |β| = m.
Further on we fix 0 6 δ < 1 and recall that Step 1 of our approach leads
to pseudodifferential operators with symbols belonging to Hörmander’s classes of
type 1, δ (defined in Section 2 of this paper). More precisely, under the regular-
0
ity condition (1.7), instead of apr introduced in (1.6) we consider a symbol a 0
satisfying
0 0
|∂ξα ∂xα a 0 (x, ξ )| 6 Cα,α 0 hξ i2m−|α|+δ(|α |−2)+ (1.8)
for (x, ξ ) ∈ Rd × Rd , α, α 0 ∈ Nd , where hξ i = (1 + |ξ |2 )1/2 and s+ denotes
the positive part of the real number s, i.e. (|α 0 | − 2)+ = 0 when |α 0 | 6 2 and
(|α 0 | − 2)+ = |α 0 | − 2 when |α 0 | > 2.
Moreover, similarly as in [28] (using (2.6) of [30]) the ellipticity and homo-
0
geneity of apr guarantee the existence of constants C, c > 0 such that
THEOREM 1.2. Let m > 0 and 0 6 δ < 1. Let A0 be given by (1.11) with
a 0 satisfying (1.8)–(1.10) and assume that A0 is self-adjoint in L2 (Rd ) (with the
domain H 2m ). Then A0 is bounded from below, every spectral projector E(A0 , λ) ∈
9 −∞ has a smooth integral kernel e(A0 , ·, ·, λ) and the estimate
e(A0 , y, y, λ) = ω(a 0 , y, λ)(1 + O(λ−1/(2m))) (1.12)
holds with
Z
0 −d
ω(a , y, λ) = (2π ) dξ,
Re a 0 (y,ξ )<λ
THEOREM 1.10 . Let M and A be as in Theorem 1.1. Then (1.2) holds with µ =
1/2m if second order derivatives of coefficients (in local coordinates) are Hölder
continuous with a certain exponent r > 0.
The proof of Theorem 1.2 in its general form will be given in the subsequent
paper [32], where we will develop the analysis described in this paper.
SHARP SPECTRAL ASYMPTOTICS AND WEYL FORMULA 295
At the end of this introduction, we note that the regularization procedure of Step
1 was considered in the paper of Kumano-Go, Nagase [12] to study hyperbolic sys-
tems (cf. [8] and [24]). In particular, we follow their notations for classes Sρm ((r))
and we refer to the book of Kumano-Go [11, Chapter 10, Section 1] for the details
concerning the justification of the composition formulas for pseudodifferential and
Fourier integral operators described in Section 4. The properties of the Hamiltonian
systems considered in Lemma 4.1 are well known (cf. [8, 13, 24]) but we join their
simple proof in the Appendix. However, instead of the theory of multiproducts
of phase functions considered in [8, 13, 24], we preferred to present the standard
construction of Fourier integral operators via transport equations with an adequate
version of the Egorov theorem needed in the proof of Theorem 1.2.
LEMMA 2.1. Let A0 > I be as in Theorem 1.2 and let a 0 be its symbol satisfying
Re a 0 > 1. If P = A0 1/(2m), then P = p(x, D) + R with R ∈ 9 −∞ and
∂xα p ∈ S1,δ
1
if |α| 6 2, (2.2)
Similarly as in [6], the proof of Theorem 1.2 follows from the asymptotic
formula for the spectral function of P ,
with
Z
−d
ω(p0 , y, λ) = (2π ) dξ,
p0 (y,ξ )<λ
and introducing the holomorphic function u(·, y) in the lower complex half-plane
C− = {t ∈ C: Im t < 0} by
Z
u(t, y) = e−it λ dλ e(P , y, y, λ) = he−it P δy , δy i for t ∈ C− (2.7)
(where δy denotes the delta of Dirac in y) we can apply Theorem 3.1.11 of [7] to
define the distributional boundary value
Then the approach of [6] uses the expression of e−it P by means of Fourier integral
operators to write
Z
u(t, y) = (2π )−d ei 8̃(t,y,ξ )q(t, y, ξ ) dξ for t ∈ ]−θ; θ[, (2.9)
where θ > 0 is small enough, and to show that u(·, y) ∈ C ∞ (]−θ; θ[\{0}) has a
singularity in 0 with the principal part
Z Z
−it λ −d
e dλ ω(p0 , y, λ) = (2π ) e−itp0 (y,ξ )d ξ.
In the case δ < 1/2, our proof of Theorem 1.2 will use the analogical schema. In
particular, we shall prove Theorem 2.4 allowing us to express e−it P as a Fourier
integral operator with an amplitude q ∈ S1−δ,δ
0
satisfying q(0, y, ξ ) = 1. However,
(2.6) cannot be obtained from the result of Theorem 2.4 in the way described in
[6]. Indeed, even if we could replace 8̃ by −tp0 in (2.9), the only estimate of the
time derivative of q we know is ∂t q ∈ S1−δ,δ
δ
(cf. the remark at the end of the
proof of Theorem 2.4 in Section 4), which is not sufficient to follow the Tauberian
reasoning of [6] based on the estimate q(t, y, ξ ) = 1 + O(t) when t → 0.
SHARP SPECTRAL ASYMPTOTICS AND WEYL FORMULA 297
Thus, we propose another approach of studying u(·, y). Similarly as in [28, 29],
we begin by noting that e−it P is a bounded family of pseudodifferential operators
of type 1, δ if t belongs to an angular region {t ∈ C− : −Im t > θ 0 |Re t|} with
θ 0 > 0. In Section 3, we describe a parabolic-type approximation of e−it P by a
sequence of operators
QN (t) = (e−itp0 qN (t))(x, D), (2.10)
defining a sequence of holomorphic functions in C− by the formula
Z
−d
uN (t, y) = hQN (t)δy , δy i = (2π ) e−itp0 (y,ξ )qN (t, y, ξ ) dξ. (2.11)
Then the distributional boundary value for t ∈ R gives uN (·, y) smooth on R \ {0}
and our aim is to prove
THEOREM 2.2. Let θ > 0 be small enough. Then for every l0 ∈ N, we can find
N = N(l0 ) ∈ N and C = C(N, l0 ) > 0 such that
|∂tl (u − uN )(t, y)| 6 C (2.12)
holds for t ∈ ]−θ; θ[, y ∈ Rd and l = 0, . . . , l0 .
which allows us to obtain (2.6) from (2.12) via the Tauberian argument of [6].
An idea similar to integrations by parts is used in Section 5 to prove (2.12) and
the reasoning is based on a result being a version of the Egorov theorem, saying
that eit P a(x, D)e−it P is a pseudodifferential operator of type 1 − δ, δ if the symbol
a is of type 1 − δ, δ. A suitable version of the Egorov theorem is proved in Section
4 by means of Fourier integral operators.
Before giving more details about our framework of Fourier integral operators,
let us consider the change of variables
ã(y, η) = a(x(y, η), ξ(y, η)) (2.14)
defined by functions x: Y × Rd → X, ξ : Y × Rd → Rd , where X, Y are open sets
0
of Rd .
298 LECH ZIELINSKI
∂ξ ∂ξ
∈ Sρ1 (Y × Rd ), ∈ Sρ0 (Y × Rd ), (2.15ii)
∂y ∂η
X 1/2
|b| = |bj,k | 2
.
16j 6d1 , 16k 6d2
LEMMA 2.3. (a) Let a ∈ Sρm (X×Rd ) and let ã be defined by (2.14) with functions
x, ξ satisfying (2.15i,ii). Assume, moreover, that there is a constant C > 0 such that
Then
THEOREM 2.4. Let P be as in Lemma 2.1. Assume that δ < 1/2 and set ρ =
1 − δ. If θ > 0 is small enough then
where
where
3. Parabolic Approximation
Let p be as in Lemma 2.1 and denote p0 = Re p. We note that (2.3) implies
p − p0 = Im p ∈ S1,δ
0
.
Let N ∈ N, N > 3 and define PN : C ∞ (Rdx × Rdξ ) → C ∞ (Rdx × Rdξ ) by
X
PN : a → PN a = (−i)|α| ∂ξα (a ∂xα p̄)/α!. (3.1)
|α|6N
300 LECH ZIELINSKI
Then we have
such that
where
qN,n+1 = −q̃N,n,n
0
/(n + 1), (3.4)
with
X
q̃0 = i(p̄ − p0 )q − (−i)|α|+1 ∂ξα (q ∂xα p̄)/α!,
16|α|6N
X
q̃k = cα0 ,...,αk ∂ξα0 (q ∂xα0 +···+αk p̄)∂ξα1 p0 . . . ∂ξαk p0 (1 6 k 6 N)
|α0 +···+αk |6N
αj 6=0 if j6=0
SHARP SPECTRAL ASYMPTOTICS AND WEYL FORMULA 301
for n = 2.
Finally, if (3.10(n)) holds for a given n ∈ {2, . . . , N}, then we have
qN,n+1 = −q̃N,n,n
0
/(n + 1) ∈ S1,δ
nδ̄
and
X (n+k)δ̄
t n+1 P̃N qN,n+1 ∈ t n+1 S1,δ
nδ̄
+ t n+2 S1,δ
1+nδ̄
+ t n+1+k S1,δ ,
26k 6N
Since all constants are translation invariant (can be chosen independent of y0 ), the
estimate (2.6) will hold for all y ∈ Rd .
For c > 0 and j = 1, . . . , d we introduce
0±j (c) = {ξ ∈ Rd : ±∂ξj p0 (y0 , ξ ) > 2c}. (3.11)
Due to (2.5), we can choose c > 0 small enough to have
[
0j (2c) ⊃ {ξ ∈ Rd : |ξ | > 1/c}. (3.12)
0<|j |6d
(b) If c > 0 is small enough to guarantee (3.12), then there exist χj,c,r ∈ S1,0
0
(j = ±1, . . . , ±d) such that supp χj,c,r ⊂ B(y0 , 2r) × 0j (c) and
X
χj,c,r (x, ξ ) = 1, when x ∈ B(y0 , 3r/2), |ξ | > 1/c.
0<|j |6d
COROLLARY 3.3. In order to prove Theorem 1.2 it suffices to prove that (2.12)
holds if u is given by (2.7) and uN is given by (2.11) with qN described in Proposi-
tion 3.1.
Proof. It suffices to show the existence of qN0 ∈ S 0 (Cθ ) satisfying (2.13). If
χj,c,r are as in Lemma 3.2, then for k = 2, . . . , N there exists qN,k,j ∈ S1,δ k−1
such
that χj,c,r qN,k = (i∂ξj p0 ) qN,k,j and it remains to write
k−1
Z Z
e−itp0 (y,ξ )t k χj,c,r qN,k (y, ξ ) dξ = e−itp0 (y,ξ )t∂ξk−1
j
qN,k,j (y, ξ ) dξ. 2
00
Further, q ∈ Sρ,δ m
(X × Rd × Rd ) means that q(x, ξ, x 0 ) = q̃((x, x 0 ), ξ ) with
00
q̃ ∈ Sρ,δ
m
((X × Rd ) × Rd ).
Then for q ∈ Sρ,δ m
(Rd × Rd × Rd ) we write Op(qe−itp0 ) to denote the operator
on S(Rd ) with the distributional kernel
Z
0
Kt (q)(x, x 0 ) = (2π )−d ei(x−x )ξ −itp0 (x,ξ )q(x, ξ, x 0 ) dξ.
SHARP SPECTRAL ASYMPTOTICS AND WEYL FORMULA 303
Let A∗ denote the adjoint of A. Then the calculus of [28, Section 2] gives
where
n+1−N(1−δ̄)
The form of qN (t) gives q̃N,n ∈ S1,δ (Rd × Rd × Rd ) such that
X
(q̃N0 + q̃N1 )(t) = t n q̃N,n , (3.15)
06n62N+1
holds when −Im t > θ 0 |Re t|, |t| < θ, τ ∈ [0; Re t].
It is easy to see (cf. [29, Section 5]) that the left-hand side of (3.16) tends to
(u − uN )(y, Re t) (in S 0 (Rd )) when Im t → 0, hence (3.15) implies the estimate
(2.12) for −Im t > θ 0 |Re t|, |t| < θ, τ ∈ [0; Re t], k = 0, N large enough. It
is clear that the analogical reasoning may be applied to estimate ∂tk (u − uN ) for
k = 0, . . . , k 0 when N = N(k 0 ) is large enough.
It remains to prove estimates (2.12) for t ∈ 40 (θ, θ 0 ), where
for every l ∈ N.
where
v = (y, t, τ 0 ) ∈ V qn,k,v (x, ξ, x 0 ) = qn,k,v (x 0 , ξ ).
]
and
In Section 5 we shall prove the following:
To end this section we check that Proposition 3.4 implies Theorem 2.2. Let q̃N,n
be given by (3.15). Then setting
qn,0,v (x, ξ ) = (1 − τ 0 /t) q̃N,n (y, ξ, x) for v = (y, t, τ 0 ) ∈ V
n
for (v, τ ) ∈ Ṽ allows to write the right-hand side of (3.16) in the form
Z X
dτ t n J (qn,0,v , an,0,v )(τ ), (3.23)
[0; Re t ] 06n62N+1
SHARP SPECTRAL ASYMPTOTICS AND WEYL FORMULA 305
uniformly with respect to (v, τ ) ∈ Ṽ (cf. the last remark of Appendix) and u −
uN = O(1) follows from (3.23).
Calcultaing the lth derivative of (3.23), we obtain a linear combination of sim-
ilar expressions, where an,0,v (τ, ·, ·) is a symbol of a power of P (i.e. satisfying
(3.21(n,0)i,ii) with m0 (n, 0) 6 l) and qn,0,v satisfy (3.20(n,0)) with m(n, 0) 6 l +
n+1−N(1− δ̄). Therefore, the general statement of Theorem 2.2 holds by a similar
reasoning if N = N(l0 ) is large enough to ensure 2l0 + 1 − N(1 − δ̄) 6 −2(d + 1).
|x̃(t, y, η)| 6 C|t| < 1/2, |ξ̃ (t, y, η)| 6 C|t|hηi 6 hηi/2 for |t| < θ
if θ > 0 is chosen small enough. Therefore
hηi/2 6 hξ(t, y, η)i 6 2hηi for |t| < θ (4.3)
and in the Appendix we check that
hence ξ̄ ∈ Sρ1 ((1))(Cθ ) and Lemma 2.3 implies ∂x 8̃ ∈ Sρ1 ((1))(Cθ ) and ∂t 8̃ ∈
Sρ1 ((1))(Cθ ) by (4.5). Moreover, calculating the ∂η -derivative of (4.6) we find
|∂η ∂x 8̃(t, x, η)| 6 C|t|, hence choosing θ > 0 small enough, we have
with
q̃0 (t, x, η, x 0 , η0 ) = a(t, x, η0 + ζ(t, x, x 0 , η))q(t, x 0 , η).
Then modulo S −∞ (Cθ ) we may replace q̃0 by q̃0 χ with χ(η, η0 ) = χ0 (η0 hηi−1 ),
χ0 (x) = 1 for |x| 6 1/4, χ0 (x) = 0 for |x| > 1/3 (the contribution of the
SHARP SPECTRAL ASYMPTOTICS AND WEYL FORMULA 307
0 0
integral (4.11) in the region |η0 | > hηi/4 is easy to estimate using |η0 |2n ei(x−x )η =
0 0
1nx0 ei(x−x )η to integrate by parts, cf. [11]). Therefore, the standard Taylor formula
0
in η = 0 gives the asymptotic expansion
X
q̃(t, x, η) ∼
= (−i)−|α| ∂xα0 ∂ηα0 q̃0 (t, x, η, x 0 , η0 )|x 0 =x, η0 =0 /α!.
α∈Nd
Proof of Theorem 2.4. Let ã1,j ∈ Sρ0 ((1))(Cθ ), ã1 ∈ Sρ0 (Cθ ), be obtained from
(4.13(j)) and (4.14) by using a(t, x, ξ ) = p0 (x, ξ ).
Let us introduce the first-order differential operator
X
L = ã1,0 + ∂t + ã1,j ∂xj , (4.17)
16j 6d
Rt
Z t Rt
−
e−
0 (s,y,η) ds 0 (s,y,η)ds
qn0 (t, y, η) = δ0,n e 0 a1,0
+ s0 a1,j
fn0 (s 0 , y, η) ds 0
0
and qn0 (t, y, η) = qn (t, x 0 (t, y, η), η). To complete the proof we note that choosing
θ > 0 small enough and reasoning as in the proof of Lemma 4.1, we find that the
application y → x 0 (t, y, η) is a bijection of Rd and its inverse
x → y 0 (t, x, η) = x + ỹ 0 (t, x, η) with ỹ 0 ∈ Sρ0 ((1))(Cθ ),
Remark. The above reasoning ensures ∂t q00 ∈ Sρ0 (Cθ ), but we do not know if
∂t q0 ∈ Sρ0 (Cθ ).
Since |∂ ζ̃ /∂η(t, x, x 0 , η)| 6 C|t|, following the proof of Lemma 2.3, we find
that η → ζ(t, x, x 0 , η) is a bijection of Rd with the inverse ζ → η(t, x, x 0 , ζ ) =
ζ + η̃(t, x, x 0 , ζ ) such that η̃ ∈ Sρ1 ((1))((]−θ; θ[×Rd × Rd ) × Rd ) and
hζ i/2 6 hη(t, x, x 0 , ζ )i 6 2hζ i. (4.90 )
m
If qj ∈ Sρ j (Cθ ) for j = 1, 2, then the operator
with
−1
∂ζ
0
q(t, x, ζ, x ) = q1 (t, x, η)q̄2 (t, x, η) det 0
(t, x, x , η) .
∂η
η=η(t,x,x 0,ζ )
is pseudodifferential with q ∈ Sρm1 +m2 (Cθ ) due to Lemma 2.3. Indeed, the distrib-
utional kernel of (4.21) is
Z
0 0
ei(x−x )ζ(t,x,x ,η) q1 (t, x, η)q̄2 (t, x, η) dη (4.22)
with
Z
−d 0 0
q̃(t, x, η) = (2π ) ei(η−η )(y−z(t,x,η,η )) q(t, x, η0 )b(t, y, η) dy dη0
Z
−d 0 0
= (2π ) ei(η−η )x q̃0 (t, x, η, x 0 , η0 ) dx 0 dη0 , (4.25)
where
q̃0 (t, x, η, x 0 , η0 ) = q(t, x, η0 )b(t, x 0 + z(t, x, η, η0 ), η).
and there is a constant C > 0 such that for (v, τ ) ∈ Ṽ one has
X
tJ (q1,0,v , a1,0,v )(τ ) − (J (q1,k,v , a1,k,v ) +
16k 6k0
+ tJ (q1,−k,v , a1,−k,v ))(τ ) 6 C. (5.3)
SHARP SPECTRAL ASYMPTOTICS AND WEYL FORMULA 311
To justify that Proposition 5.1 implies Proposition 3.4 with n = 1, we note that
after having written (5.3) we may express J (q1,−k,v , a1,−k,v ) (k = 1, . . . , k0 ) in
the analogical way, i.e. applying Proposition 5.1 to q1,−k,v , a1,−k,v (k = 1, . . . , k0 )
instead of q1,0,v , a1,0,v . Repeating this procedure l times, we obtain the expression
(5.3) with some q1,k,v satisfying (3.20(1,k)), a1,k,v satisfying (3.21(1,k)i,ii) and
m(1, k) + m0 (1, k) 6 m(1, 0) + m0 (1, 0) − l(2ρ − 1) for k < 0.
Thus for l large enough we obtain the assertion of Proposition 3.4 with n = 1.
Further on, instead of writing the following double condition
{av }v∈V is a bounded subset of Sρm (Cθ ) and
m+l(1−ρ)
{∂τl av |τ =0 }v∈V is a bounded subset of S1,1−ρ for every l ∈ N
LEMMA 5.2. Let j ∈ {1, . . . , d} and let {av }v∈V be a bounded subset of S̃ρm (Cθ ).
If xj stands for the operator of multiplication by j th coordinate, then
[xj , av (τ, x, D)] = av+ (τ, x, D) + τ av− (τ, x, D), (5.4)
where
{av+ }v∈V is a bounded subset of S̃ρm−1 (Cθ ),
{av− }v∈V is a bounded subset of S̃ρm+1−2ρ (Cθ ).
Proof. We have
av (τ, x, ξ ) = av (0, x, ξ ) + τ bv (τ, x, ξ ),
with
Z 1
bv (τ, x, ξ ) = ds(∂τ av )(sτ, x, ξ ).
0
Since {bv }v∈V is a bounded subset of S̃ρm+1−ρ (Cθ ), it is easy to check that the
assertion of lemma holds if we take
av+ (τ, x, ξ ) = i∂ξj av (0, x, ξ ), av− (τ, x, ξ ) = i∂ξj bv (τ, x, ξ ). 2
Indeed, the first assertion (5.6) follows from Corollary 4.2 and to obtain the second
one we note that
Proof of Proposition 5.1. To begin, we note that due to Remark (a) at the end of
Appendix, the partition of unity on B(y0 , 3r/2) × Rd introduced in Lemma 3.2(b)
allows to replace q1,0,v by q1,0,v χj,c,r , where j = ±1, . . . , ±d. Further on, we
assume that j is fixed and j > 0 (to treat the case j < 0 it suffices to replace xj
and ∂ξj by x−j and −∂ξ−j in the reasoning described below). Thus we may assume
that
Indeed, using remarks and notations explained at the end of the Appendix we find
) • q1,0,v form a bounded subset of S −∞ due to (5.7).
]
that (1 − χj,c,r
0
(i.e. the symbol py ∈ S10 does not depend on the x-variable) and for v = (y, t, τ 0 ) ∈
V we set
τ0 τ0
P̃v (τ ) = P̃1 (τ ) + 1 − Py = p̃v (τ, x, D) + R0,v,τ . (5.9)
t¯ t¯
and due to (5.6) it is clear that {p̃v }v∈V is a bounded subset of S̃ρ0 (Cθ ).
SHARP SPECTRAL ASYMPTOTICS AND WEYL FORMULA 313
Assuming that c, θ, θ 0 > 0 are small enough and τ ∈ ]−θ; θ[, y ∈ B(y0 , 3r),
η ∈ 0j (c/2), |η| > 1/c, we have ∂ξj p0 (ϑ(τ, y, η)) > c/3, hence there is χ0 ∈
C0∞ (Rd ) such that the symbols
Setting av = a1,0,v /p̃˜ v it is easy to check that {av }v∈V is a bounded subset of
0
S̃ρm (1,0)(Cθ ) and we can decompose
6. Appendix
and
0 k+m +δ−1 0 2m−1+m +δ−1 0
Pj ∈ 91,δ
m
⇒ [Pj , P k ] ∈ 91,δ ⇒ Bj ∈ 91,δ .
Therefore
max{1, m0 +δ−1}
2mPj = (A0 j − Bj )P 1−2m ∈ 91,δ
with
X
B= p1 (x, D)k P 2m−1−k .
06k 62m−1
Since the asymptotic formula for the symbol of a 0 (x, D)∗ gives Im a 0 ∈ S1,δ
2m−1
, we
have
p1 (x, D)2m − (Re a 0 )(x, D) ∈ 91,δ
2m−1
⇒ p1 (x, D)2m − P 2m ∈ 91,δ
2m−1
Proof of Lemma 2.3. Let n ∈ N \ {0} and assume that for 0 < |α̃| 6 n,
|∂ α̃ x(y, η)| + hηi−1 |∂ α̃ ξ(y, η)| 6 Cα̃ hηim(α̃) , (A.2)
0
where α̃ = (α̃ 00 , α̃ 0 ) ∈ Nd × Nd and m(α̃) = −|α̃ 0 | + (1 − ρ)(|α̃| − 1).
We are going to check that the above hypothesis implies
0
|∂ α ã(y, η)| 6 Cα hηim−|α |+(1−ρ)|α| (A.3)
d0
for all α = (α 00 , α 0 ) ∈ N × Nd such that |α| 6 n.
To begin the proof we consider
0
β = (β1 , . . . , βd ) ∈ Nd , β̄ = (β̄1 , . . . , β̄d 0 ) ∈ Nd
and
σ = (αk,j )16j 6d, 06k6βj , σ̄ = (ᾱk,j )16j 6d, 06k6β̄j ,
where
00 0 0 00 0 0
αk,j = (αk,j , αk,j ) ∈ Nd × Nd , ᾱk,j = (ᾱk,j , ᾱk,j ) ∈ Nd × Nd
where
Since
β
|(∂ξ ∂xβ̄ a)(x(y, η), ξ(y, η))| 6 Cβ,β̄ hηim−|β|+(1−ρ)(|β|+|β̄|) ,
Now it is possible to adopt aR reasoning of [7, Chapter 18]. Instead, we may use
γs ∈ C0∞ (B(0, s)) such that γs (x) dx = 1, γs > 0, setting
Z
χj,c (ξ ) =
0
γs (hξ i−1 (ξ 0 − ξ ))hξ i−d dξ 0 , (A.5)
0j (3c/4)
Since 1 − χj,c 0
can be expressed by taking Rd \ 0j ( 34 c) instead of 0j ( 34 c) on the
right-hand side of (A.5), χj,c 0
(ξ ) 6= 1 implies the existence of ξ 0 ∈ Rd \ 0j ( 34 c)
such that |ξ 0 − ξ | < shξ i, implying ξ 0 ∈ Rd \ 0j ( 34 c + 2Cs) due to (A.4) and we
may take s small enough to ensure 34 c + 2Cs < c.
Next for s > 0 we introduce χs ∈ C0∞ (B(y0 , s)) such that χs = 1 on
B(y0 , 34 s). Then to complete the proof of (a) it suffices to take χj,c,r 0
(x, ξ ) =
R α
χ3r (x − y0 )χj,c (ξ ) and it is not difficult to show the estimates |∂ξ γs (hξ i−1 (ξ 0 −
0
0
|∂ α K(t, y, η)| 6 Cα hηi(1−ρ)|α|−|α | , (A.6)
where
∂x ∂x ∂x
hηi
∂t ∂y ∂η
K(t, y, η) = (t, y, η).
∂ξ ∂ξ ∂ξ
hηi−1 hηi−1
∂t ∂y ∂η
Since
∂t K(t, y, η) = M(t, y, η)K(t, y, η), (A.7)
holds with a uniformly bounded matrix
∂(∂ p ) ∂(∂ξ p0 )
ξ 0
hηi
∂x ∂ξ
M(t, y, η) = (ϑ(t, y, η)),
∂(∂x p0 ) ∂(∂x p0 )
−hηi−1 − ∂ξ
∂x
it is clear that (A.6) holds when α = 0.
Next we fix n ∈ N \ {0} and assume that (A.6) holds when |α| 6 n − 1. This
assumption implies that
0
|∂ ᾱ x(t, y, η)| + hηi−1 |∂ ᾱ ξ(t, y, η)| 6 Cᾱ hηi−|ᾱ |+(1−ρ)|ᾱ|
for |ᾱ| 6 n and reasoning as in the proof of Lemma 2.3, we obtain
0
|∂ ᾱ M(t, y, η)| 6 C̃ᾱ hηi(1−ρ)|ᾱ|−|ᾱ | for |ᾱ| 6 n. (A.8)
318 LECH ZIELINSKI
Assume now that |α| = n. Then, for 0 6= ᾱ 6 α, we can use (A.8) and (A.6) with
α − ᾱ instead of α, getting
0
|Fα (t, y, η)| 6 C̄α hηi(1−ρ)|α|−|α | , (A.10)
when α = (α 00 , α 0 ) ∈ Nd+1 × Nd is such that |α| = n. Since
∂η p0 (y, η) I 0
K(0, y, η) = ,
−hηi−1 ∂y p0 (y, η) 0 I
it is clear that (A.10) still holds if Fα (t, y, η) is replaced by K(0, y, η), hence
the Gronwall inequality ensures that (A.10) still holds if Fα (t, y, η) is replaced by
∂ α K, i.e. (A.6) still holds when |α| = n. Then Lemma 2.3 implies ∂t x = ∂ξ p0 ◦ϑ ∈
Sρ0 ((1))(Cθ ) and ∂t ξ = −∂x p0 ◦ ϑ ∈ Sρ1 ((1))(Cθ ).
(b) We write α = (α 00 , α 0 ) ∈ Nd+1 ×Nd and denote K(t, y, η) = ∂x/∂y(t, y, η).
Then (A.6) still holds and
∂ x̃
∂t ∂y x̃ ∈ Sρ (Cθ ) ⇒ |K(t, y, η) − I | = (t, y, η) 6 C|t|.
0
∂y
Therefore choosing θ > 0 small enough, we find that y → x − x̃(t, y, η) is a con-
traction with a unique fixed point y(t, x, η) for every x ∈ Rd , i.e. y → x(t, y, η) is
a bijection of Rd and x → y(t, x, η) is its inverse. We note also that |K(t, y, η)−1 −
I | 6 C|t| and ∂ α K −1 is a linear combination of terms K −n0 (∂ α1 K)K −n1 . . .
(∂ αk K)K −nk with n0 , . . . , nk ∈ N, n0 +· · ·+nk = |α|+1, α1 , . . . , αk ∈ N2d+1 \{0},
α1 + · · · + αk = α, which allows to obtain
0
|∂ α (K(t, y, η)−1 )| 6 Cα hηi(1−ρ)|α|−|α | . (A.11)
Since
∂t ỹ(t, x, η) = −(K −1 ∂t x)(t, y(t, x, η), η), (A.12)
we have |∂t ỹ| 6 C and clearly
0
|∂ α y(t, x, η)| 6 Cα hηi−|α |+(1−ρ)(|α|−1)+ (A.13)
holds when |α| = 0. Using, moreover,
∂y
(t, x, η) = K(t, y(t, x, η), η)−1 − I, (A.14)
∂x
SHARP SPECTRAL ASYMPTOTICS AND WEYL FORMULA 319
∂y ∂x
(t, x, η) = − K −1 (t, y(t, x, η), η), (A.15)
∂η ∂η
it is clear that the estimates (A.13) hold if |α| = 1. Next we fix n ∈ Nd \ {0} and
assume that the estimates (A.13) hold when |α| 6 n. Then reasoning as in the proof
of Lemma 2.3, it is easy to estimate the derivative ∂ α of the right-hand sides of
(A.14)–(A.15) and to conclude that the estimates (A.13) hold when |α| 6 n + 1. 2
ADDITIONAL REMARKS
m0
(b) Let {av }v∈V be a bounded subset of Sρ,δ . Then we have the composition
formula
av (x, D) Op(qv e−itp0 )∗ = Op((av • qv )e−itp0 )∗ + R̃v,t , (A.16)
where {R̃v,t }v∈V , Im t 60, |t |<θ is a bounded subset of 9 −∞ , {av • qv }v∈V is a bounded
m+m0
subset of Sρ,δ (Rd × Rd × Rd ) satisfying the asymptotic expansion
X
(a • qv )(x, ξ, x 0 ) ∼ = (−i)|α| ∂ξα ā(x 0 , ξ )∂xα0 qv (x, ξ, x 0 )/α!. (A.160 )
α∈Nd
References
1. Agmon, S.: Asymptotic formulas with remainder estimates for eigenvalues of elliptic operators,
Arch. Rat. Mech. Anal. 28 (1968), 165–183.
320 LECH ZIELINSKI
29. Zielinski, L.: Asymptotic distribution of eigenvalues of some elliptic operators with intermedi-
ate remainder estimates, Asymptotic Anal. 17 (1998), 93–120.
30. Zielinski, L.: Asymptotic distribution of eigenvalues for elliptic boundary value problems,
Asymptotic Anal. 16 (1998), 181–201.
31. Zielinski, L.: Spectral asymptotics for elliptic operators with Hölder continuous coefficients, 2:
The ergodic case, in: Rep. 17th Conf. Operator Theory (Timisoara 1998), to appear.
32. Zielinski, L.: Sharp spectral asymptotics and Weyl formula for elliptic operators with non-
smooth coefficients, part 2, to be published.
Mathematical Physics, Analysis and Geometry 2: 323–415, 1999.
323
© 2000 Kluwer Academic Publishers. Printed in the Netherlands.
MISHA GROMOV
Department of Mathematics, Institut Hautes Études Scientifiques, 35 Route de Chartres,
91440 Bures sur Yvette, France. e-mail: gromov@ihes.fr and Courant Institute, NYU, New York,
U.S.A.
Key words: symbolic dynamics, mean dimension, holomorphic maps, complex subvarieties.
0. Introduction
0.1. FROM X TO X0
Start from some category of spaces X and the maps between them. These can be
bare sets with no additional structure and all maps, topological spaces and con-
tinuous maps, smooth manifolds, algebraic varieties, linear or affine spaces, etc.
Then, given a group 0, we have a functionally defined 0-space X, i.e. a space with
a 0-action, namely the Cartesian power X0 thought of as the space of X-valued
functions on 0. Here the action of 0 on X0 is induced by the left action 0 on 0,
γ 0 x(γ ) = x(γ 0 γ ).
This action is called the shift and X0 is called the (full) shift space over 0 with the
alphabet X, where the basic example is 0 = Z and X consisting of finitely many
elements called letters.
0 as follows: each function x(γ ) goes to y(γ ), γ ∈ 0, by the usual ‘finite differ-
ence operator’ recipe,
y(γ ) = ϕ x(γ δ1 ), x(γ δ2 ), . . . , x(γ δd ) .
In other words, y(γ ) for each γ ∈ 0 is determined by the value of ϕ on the
restriction of x to the γ -translate of D,
y(γ ) = ϕ(x|γ D),
where ϕ is interpreted as a map from Xγ D = Xd to Y . In this case, we say that our
map x 7→ y is based on D and/or defined by ϕ.
Notice that this construction can be used as a definition of 0-morphisms f over
categories of certain spaces X, e.g., for algebraic varieties. But for the topological
category, there are additional continuous 0-maps X0 → Y 0 , not coming by the
way of ϕ: XD → Y . In fact, every continuous map ϕ: X0 → Y (which may
essentially depend on infinitely many x(γ ), γ ∈ 0) defines f = fϕ : X0 → Y 0 by
the same rule f : x → y for y(γ ) = ϕ(γ x).
0.2. SUBSPACES IN X0
The simplest 0-invariant subset in X0 consists of the fixed point set Fix 0 ⊂ X0
which obviously identifies with X itself, realized by the construct maps 0 → X.
More interesting subspaces in X = X0 appear as pull-backs of fixed points in Y
by 0-maps f : X → Y . One can think of such a subspace X0 = f −1 (y0 ), y0 ∈ Y ,
as the set of solutions to the ‘difference’ equation f (x) = y0 and if f = fϕ for
ϕ: XD → Y with D = {δ1 , . . . , δd } as earlier, then this equation turns into the
following system of algebraic equations denoted (ϕγ ), γ ∈ 0,
ϕ x(γ δ1 ), x(γ δ2 ), . . . , x(γ δd ) = y 0 ∈ Y . (ϕγ )
In fact, one can drop Y from this definition and start with an arbitrary subset L ⊂
XD = Xd , d = card D (corresponding to ϕ −1 (y 0 ) in the previous setting). Then
X0 = X0 (L) ⊂ X0 is defined as the space of functions x: 0 → X such that the
restriction of x to each translate γ D is contained in L, where we identify Xγ D with
XD via the correspondence γ δ ↔ δ, δ ∈ D, and where we view Xγ D as the space
of functions γ D → X. These X0 = X0 (L) ⊂ X0 are called subshifts of finite type
in X0 (where ‘finite’ refers to the finiteness of D ⊂ 0) and L is regarded as a ‘law’
distinguishing ‘legal’ function on 0.
example is the quotient space X0 / Fix 0, where the fixed point set Fix 0 ⊂ X0 is
shrunk to a single (fixed) point. Finding more interesting 0-equivariant equivalence
relations
R ⊂ X0 × X0 ⊂ X0 × X0 = (X × X)0
of finite type is a nontrivial matter which we will not discuss at this stage.
Given our category of spaces X, take an invariant (property, theory) in this category
and try to extend it to a class of 0-spaces including X0 and subshifts of finite
type in X0 . Our extension must satisfy Inv0 X0 = Inv X and the essential (formal
functorial) properties of Inv0 must be similar to those of Inv. Besides, we want
our new invariant Inv0 to be ‘dynamical’ which expresses a vague idea of Inv0
depending on the overall behavior of the 0-orbits. For example, we wish
Inv0 (X0 / Fix 0) = Inv0 X0 ,
so that the ‘few’ fixed points of 0 should not matter.
Here is a specific example indicating what we have in mind.
symplectic structure, complex structure, etc.) and one wishes to study (groups of)
0-homeomorphisms preserving such a structure.
0.3.4. The 0-topology can be naturally relaxed to 0-homotopy with many standard
invariants (such as homology) passing from X to X = X0 . For example, the global
homological dimension becomes Fildim(X : 0) within this framework (see 1.1.6).
The simplest nonembedding theorem is the pigeon hole principle: there is no em-
bedding X → Y if card X > card Y for finite sets X and Y . The dynamical
version of the cardinality, or rather of the entropy =def log(cardinality), is the
mean topological entropy defined for arbitrary compact (and sometimes noncom-
pact) topological 0-spaces X (i.e. with continuous actions of groups 0) denoted
ent(X : 0) (see 1.7 for a definition). If 0 is an amenable group (see 1.3), then, not
surprisingly,
for all finite sets X. This is a common knowledge. (Probably, something like this
must be true for sets X of infinite cardinality where the interesting 0’s are those
with card 0 > card X.) Also, one knows that
for most reasonable (see 1.1.5) spaces X and amenable groups 0. Furthermore,
The most interesting spaces from our point of view appear as solutions of elliptic
differential equations over manifolds V with groups 0 acting on V . A basic ex-
ample is the space of harmonic maps V → X between Riemannian manifolds V
and X, where V is noncompact, X is compact and where V comes along with an
isometry group 0, such that V / 0 is compact. For instance, one may take V = Rn ,
where 0 is either taken to be all Rn or some lattice 3 ⊂ Rn .
328 MISHA GROMOV
The full space of the harmonic map V → X is too big and usually has infi-
nite mean dimension but it has interesting 0-invariant subspaces where the mean
dimension is finite and, sometimes, different from zero. A particular space of this
kind, denoted Xc , is distinguished by the pointwise bound on the differential of
such maps x: V → X, namely
for n = dim V and some constant b = b(V , X, 0), where b = a(V , X) vol(V / 0)
for discrete groups 0. Moreover,
Remark. If V = Rn , then (?)∞ holds true for all c > 0 as follows by an obvious
scaling argument. Probably, this remains valid for nonflat metrics on Rn invariant
under Zn but, in general, the asymptotics of dim(Xc : 0) for c → 0 should depend
on the growth rate of the group 0.
for n = dimR V and some positive constant b0 = b0 (V , X, 0), which is of the form
a 0 (V , X) vol(V / 0) for discrete 0.
(b) It seems that the strict inequality dim(Xc : 0) > 0 manifesting the abun-
dance of our maps V → X is intimately linked to the bubbling phenomenon, i.e.
the presence of instantons, highly localized solutions of our elliptic equations. Here
is a specific conjecture:
Remark on continuity of dim(Xc : 0). It is easy to see in many cases that the
mean dimension dim(Xc : 0) is continuous in c ∈ R+ and whenever it is positive, it
is also nonconstant as a function of c. Thus we get 0-spaces with mean dimension
taking continuous spectra of values. To see it clearer, take the case of meromorphic
functions, i.e. holomorphic maps x: C → P 1 where we bound the (spherical)
derivative by one, i.e. take X = X1 = {x | kDxk 6 1}. Then consider a lattice
3 = λZ2 ⊂ C, for λ ∈ C× , and observe (this is nearly obvious) that
as |λ|2 equals the volume (area in this case) of the fundamental domain of 3 in C.
Thus, by varying 3 with λ we get a continuum of mean dimensions of 3-spaces.
Next we observe that the restriction map ρλ : X → (P 1 )3 , where we evaluate
our maps x: C → P 1 at the points z ∈ 3, is injective for all sufficiently small λ.
In fact this follows from the Cauchy inequality and yields the finiteness property
(?) for the present case as
dim(X1 : C) = |λ|−2 dim(X : 3) 6 |λ|−2 dim (P 1 )3 : 3 = 2|λ|−2
(see 3.4). Now, our space X is embedded into the shift space (P 1 )3 = (P 1 )Z ,
2
the interval (0, 2], since for large λ, where the lattice 3 = λZ2 is sufficiently
rare, the restriction map ρλ : X1 → 3 becomes onto as every map 3 → P 1 can
be extended (interpolated) to a holomorphic map X: C → P 1 with kDxk 6 1
(see 3.6, where such an interpolation is used to show that dim(X1 : C) > 0).
Remark on the bound kDxk 6 1. This may look quite restrictive but, in fact,
harmonic (holomorphic) maps x with kDxk 6 1 often give a fair representation
330 MISHA GROMOV
instance, but the equality between the two dimensions, one referring to all maps
C → CP N and the other to iZ2 -invariant maps, does not seem obvious even for
N = 1. (Actually,√the easiest case concerns not maps of C but rather of C/Z versus
maps of C/Z ⊕ i −1Z, i = 1, 2, . . . , to CP 1 .)
Vol2n (M ∩ B) 6 d
for some const = const(W, 0, d). Furthermore, if 0 is discrete and the quotient
space W/ 0 is projective algebraic, then, for 0 6 n = dimC M < dimC W , one
has
ed : 0) > const0 d n+1 ,
dim(M
for all sufficiently large d > d0 (W ) and some positive constant const0 =
const0 (W, 0) > 0.
Remark. This example should be taken with a pinch of salt as our proof of the
lower bound on dim(M ed : 0) is based on a 0-embedding of M ed to ([0, 1]N1 )0
while the lower bound exploits an embedding ([0, 1] ) → M
N2 0 ed .
332 MISHA GROMOV
Consequently,
Remark. The above will be used in 2.4 for evaluating the mean dimension of
(sub)-linear subshifts Y ⊂ B 0 ⊂ (Rs )0 , where we shall need another lemma:
1.1.3. TRIVIAL LEMMA. Let Y be a closed subset in a Banach space X and let
p: X → RN be a bounded linear operator. Then, for arbitrary metrics on Y and on
p(Y ) ⊂ RN compatible with their topologies, one has Widimε Y > Widimε p(Y )
for all ε > 0.
Proof. As the fibers of the map p: Y → p(Y ) are all nonempty convex, there is
a continuous section, i.e. a map q: p(Y ) → Y such that p ◦ q = Id: Y → Y . Thus
one has Widimε Y > Widimε qp(Y ) > Widimε p(Y ). 2
1.1.4. OPEN QUESTIONS. The Widim inequality allows a lower bound on Widimε
of the intersection of a linear subspace Y in a Banach space X with the unit ball,
(compare 2.6). Then we wish to have a similar inequality for nonlinear subvarieties
Y ⊂ X. For example:
Does (∗) hold true for X = CN and Y being a complex analytic subvariety
passing through the origin?
We would not mind (∗) with a slightly smaller ε > 0 but the answer is not
even known for ε = εN > 0. On the other hand, it is not hard to prove (∗) with
ε depending on the degree of Y in the case Y is complex algebraic. In fact, (∗)
holds true with ε = ε(Vold (Y ∩ 2B)), d = dimR Y for all minimal subvarieties
in R2N by the usual compactness argument. It would be interesting to make such
an argument work uniformly for all dimensions and thus applicable for evaluating
of the mean dimension of (local) algebraic subvarieties in (C1 )0 (compare 2.5).
334 MISHA GROMOV
On the other hand, one may ask on the possible range of Widimε for given ε on a
given class of subvarieties and then one is tempted to extend this question to other
‘slicing invariants’ of Y ∩ B defined in App. 1 of [GroFRM ].
It seems Widimε has not been evaluated even for simple convex subsets in Rn ,
e.g. for the simplex 1n−1 = {xi > 0, 6xi = 1}, where one expects (maybe too
navely) that
Widimε 1n ∼ constε n.
P
Another interesting example is the Euclidean ball B`2 = { ni=1 xi2 6 1} whose
Widimε is to be measured with respect to the sup-product metric (with the corre-
sponding norm kxk`∞ = supi=1,...,n |Xi |). More generally, one asks what is
Widimε B`p with respect to the `q -norm in Rn ?
1.1.5. It is clear that Widimε X < ∞ for all compact metric spaces X and all ε > 0
but it may become infinite for noncompact spaces X (where, in fact, the definition
must be modified by replacing Diam f −1 (p) by lim supU →p Diam f −1 (U ) where
U runs over the neighbourhoods of U in P ) and this inequality is strict. It is also
clear that Cartesian product X1 × X2 with the sup-product metric, that is
dist (x1 , x2 ), (x10 , x20 ) = max dist(x1 , x10 ), dist(x2 , x20 ) ,
It follows, that Widimε is also subadditive for taking maxima of metrics on the
same space X,
Warning. One should be careful with the additivity of Widimε for Cartesian
products. In fact, even the Lebesgue dimension is not always additive, but the extent
of the nonadditivity is completely clarified by the work of Dranishnikov (see [Dra])
who kindly explained this to me.
exists and does not depend on a sequence i (see 1.3.5), exactly as it happens to the
entropy (see [Orn-Weis]). Then we use this limit for the definition of Widimε (X :
0) (see 1.4).
1.3. AMENABILITY
Remark. (a) Clearly, the existence of the limit for all amenable sequences
implies its independence of a choice of a sequence.
(b) if h() is monotone increasing for 0 ⊃ , then it suffices to assume (∗)
only for disjoint subsets 1 and 2 .
Sketch of the Proof. Take two subsets 0 and in 0, where will be eventually
taken much larger than 0 , and consider some translates γi 0 ⊂ 0, i = 1, 2, . . . ,
such that:
(a) all γi 0 are contained in ; S
(b) the intersection of γi (0 ) with the union U0i−1 = i−1
j =1 γj 0 satisfies
(γi 0 ) ∩ U i−1 6 ε|0 | (∗)ε
0
and so
Z
− −1
| | |U0k ∩ γ 0 | dγ 6 |U0k | |0 | |+ |−1 (1 − 2α)−1
−
6 |U0k | |0 | ||−1 (1 − 2α0 )−1 . (1)
Next, by the maximality of k, (∗)ε must be violated for all γ ∈ − , i.e.
|U0k ∩ γ 0| > ε|0 |
for all γ ∈ − and thus
Z
−1
|− | |U0k ∩ γ 0 | dγ > ε|0 |.
−
Hence,
ε 6 |U0k | ||−1 (1 − 2α|−1
and (+)ε is proven.
Now we are ready to prove the existence of the limit (?) by adopting the clas-
sical (and trivial) argument establishing convergence of h(t)/t for sublinear func-
tions h(t). Denote by `− the lower limit
lim inf h(i )/|i |,
i→∞
finally cover much of by translates of iµ , namely the union of all these translates
has total measure at least (1−(1−ε)s )(1−2α)s ||, where α is the upper bound on
the relative amenability constants. Since (we may assume) α is much smaller than
ε s , we cover almost all of . On the other hand, our covering is (1 − ε)-efficient,
i.e. the total measure of our translates does not exceed (1 + ε) according to (∗)ε .
Thus, the union of all our translates say U ⊂ has h(U ) bounded by something
of the order `+ + 2ε. On the other hand, the complement \U has small measure
and retains some ‘amenability’ having |(\U ) ∪ ∂1 (\U )| also small, say 6 ε. It
follows, by subadditivity of h, that h(\U ) is bounded by something of the order
of ε|| and h() is bounded by `−|| + O(ε)||. This yields the Ornstein–Weiss
lemma.
The above mean ε-dimensions Widimε (X : {i }) and Widimε (X : 0) are monotone
decreasing in ε. Thus, we can go to the limit and set
and
if we want to be i -free.
Also, we observe that this definition makes sense for every (not necessarily
invariant) subset Y ⊂ X (as we may work with the metrics |x − x 0 | restricted to
Y ) and we shall be using this for compact subsets Y ⊂ X.
If X is itself a compact metric space, then the above definition of Widim does
not depend on the original metric |x − x 0 | in X. In general, one could make things
invariant by first taking supY Widim(Y : {i }) over all compact Y ⊂ X and then
taking infimum over all metrics |x − x 0 | on X compatible with the topology of X
and such that the action of 0 on X is uniformly continuous. (We shall return to this
later on when it becomes relevant.)
dim(Y : 0) 6 dim(X : 0)
There are certain topological spaces X, which admit weak compactification, i.e. a
compact topological space X• along with a bijective continuous map e: X → X• .
340 MISHA GROMOV
For example, the unit ball {kXk 6 1} in each Banach space is like that. Clearly, if
X• exists it is unique up to homeomorphism.
Now, let X come along with an action of 0 and let weak compactification refer
to a compact 0-space X• with a bijective continuous 0-equivariant map X → X• .
This (X• , 0) is also (obviously) unique, if it exists, and its 0-invariants, such as
dim(X• : 0) can be regarded as invariants of (X, 0).
BASIC EXAMPLE. Let 0 isometrically act on a Banach space and thus on the unit
ball X in this space. A 0-invariant weak compactification is obvious for reflexible
spaces and it also exists for some (all?) other examples, such as `∞ (0). Then one
may speak of
def
dim(X : 0) = dim(X• : 0).
It is clear, that
dim(X : 0) = s
for X being the unit ball in the `∞ -space of bounded functions 0 → Rs (0 is
discrete here) and that this dimension 6 n for all other `p -spaces. But I could
not decide if it is actually positive for p < ∞ (where the problem is related to
evaluation of `∞ -width of `p -balls, compare 1.1.4) and nontrivially depends on p.
(If so, this would imply the spaces (X, 0) are mutually 0-nonhomeomorphic for
different p, which, I guess, is unknown for infinite groups 0.) This problem, on the
one hand, and the idea of the Von Neumann dimension, on the other hand, lead to
the following modification of our dim(X : 0) (see 1.12 – 1.12 3).
Our definition of the mean dimension mimics that of the topological entropy where,
instead of our Widimε X, one uses entε X = log covε X, where covε is the minimal
number of the open subsets in X of diameter 6 ε needed to cover X. In fact,
one can avoid any metric in the definition of both invariants ent(X : {i }) and
dim(X S: {i }) by a direct appeal to (sufficiently fine) finite
T open covers of X, say
X = ν Uν and the associated covers by the intersections γ ∈ γ (Uν ). This defin-
ition of the mean dimension has an advantage of being applicable to nonmetrizable
spaces and it is adopted in [Lin-Wei]. We choose here Widim as it is easier on
the level of notations and also more flexible when it comes to generalizations. For
example, our definition does not truly need any action: every family 1 of metrics
|x − x 0 |δ , δ ∈ 1, on X will do. Such a situation naturally comes up in the study of
spaces X of X-valued functions over a given background space 1 replacing 0 in
the example of X = X0 . Here each point δ ∈ 1 gives rise to a metric on functions
x(δ) via some weight function w(δ, δ1 ) on 1 × 1 by the formula
|x − x 0 |δ = sup w(δ, δ1 )|x(δ1 ) − x 0 (δ1 )|X ,
δ1 ∈1
0
where |x − x |X refers to a preassigned metric on X. Typically, 1 itself is a metric
space (e.g. a graph as in [Gro]ESAV ), and
w(δ, δ1 ) = exp −β dist1 (δ, δ1 ).
‘Microscopic’ observations. One can think of a subset ⊂ 0 (or more generally
⊂ 1) as a ‘microscope’ applied to the metric space X = (X, |x − x 0 |) and en-
larging its visual image to the greater size X = (X, |x −x 0 | ), where the resolving
power of depends on the presence of transformations γ : X → X, γ ∈ , which
expand the original metric in X. This expansion brings invisibly small geometric
details of X = (X, |x − x 0 |) to the observable scale ε where we have a vari-
ety of ‘macroscopic’ geometric techniques at our disposal (see [GroFRM , GroAI ]
and [GroPCMD ]). The magnification may be highly nonuniform in different direc-
tions and so when we eventually send ε → 0 we arrive at a new ‘non-isotropic’
image of X quite different from the original (X, |x − x 0 |) (compare § 4.10 in
[GroCC ]). Thus various ‘macroscopic’ invariants discussed in the above-cited pa-
pers (e.g. Widimε X, Fildimε X, etc.) are getting transported from the geometric
realm to the domain of topological dynamics.
1.8.3. Variation. Rather than intersecting the ε-neighbourhoods for the metrics
|x−x 0 |γ , one could take the ε-neighbourhood with respect to the metric |x−x 0 |i =
supγ ∈i |x−x 0 |γ . This may be only smaller than Uε i and so the resulting dimension
is smaller than Min dim. (Probably, there are easy examples where it is strictly
smaller.)
In fact legdim = prodim in many cases (see Section 2) and it is useful to bring
in an intermediate notion of stable legal dimension, denoted stablegdim(Y : {i }),
where i is increasing sequence of subsets in 0. To define this we project Lj to
Xi for all j > i and let Lij ⊂ Xi denote the images of these projections. Then
we set stablegdim(Y : {i }; L) = lim infi→∞ limj →∞ dim Lij /|ij |. Observe
that this stablegdim extends to nontopological categories in most cases where it
is possible for legdim and this sometimes allows such an extension for prodim (see
below).
PROBLEM. Find less restrictive conditions ensuring the above equalities between
different dimensions. (See Section 2 for practical results in this direction.)
Definition of Strong Stability. Call L strongly stable if there exists ρ0 > 0, such
that the following condition (locρ0 ) is sufficient for extendability of a function
X0 : 0 → X to our x: 0 → X belonging to Y (L) ⊂ X0 ,
(locρ0 ) For every ρ0 -ball B ⊂ 0 the restriction x0 | ∩ B is extendable to an
L-legal function x1 on B, i.e. the restriction of xi to each translate of D
inside B must be in L.
Remarks (a). Besides the limit of dim Fix 0i /(0/ 0i ) the totality of the numbers
dim Fix 0i for all lattices 0i ⊂ 0 carries an interesting information about (X, 0).
P if 0 = Z and 0i = iZ, this information is encoded in the generating
For example,
function i t i dim Fix 0i which we shall study in the second part of this paper.
(b) One can make the above definition of resdim more robust by using δ-fixed
points Fixδ 0i , i.e. moved by judiciously chosen generators of 0i by at most δ.
Also, one may use Widimε Fix 0i for the metric supγ ∈0 |x − x 0 |γ on Fix 0i instead
of dim Fix 0i , where eventually δ, ε → 0.
TOPOLOGICAL INVARIANTS OF DYNAMICAL SYSTEMS: I 347
Given an arbitrary field K, one may take a vector space X over K, e.g. X = K s ,
and speak of linear laws (i.e. subspaces) L ⊂ XD . Then, if 0 is an amenable group,
we have our (mean) projective dimension
prodim(Y : 0) for Y = Y (L) ⊂ X0
defined with an amenable exhaustion of 0.
Next, given a finitely generated left module B over A, we define its dimension
relative to {Ai } as follows. Take some finite K-dimensional linear subspace B0 ⊂
B generating B over A and set
dimA B | {Ai } = lim inf dim Ai B0 / dim Ai .
i→∞
Clearly, this dimension does not depend on the choice of B0 and it gives the ‘right’
number for free moduli: dimA As = s for all amenable exhaustions. Furthermore,
if A equals the group ring K(0) of some 0, this reduces to the notion of le-
gal (or stable) dimension over 0, but I do not know if the existence of the limit
limi→∞ Ai B0 / dim Ai holds in full generality.
Remark (made by Ofer Gabber). Since lim inf is nonadditive, we cannot claim
the additivity
dimA B1 ⊕ B2 = dimA B1 ⊕ dimA B2
prior to proving the existence of the limit. Yet we always can take some general-
ized limit (the best here, I think, is an ultralimit) and thus recapture the additivity.
Eventually we shall be interested in additivity of dimA for exact sequences, 0 →
B1 → B → B2 → 0, where some extra problems arise (as was also pointed out to
me by Ofer).
Let us relax the assumption of B being finitely generated over A by giving
B a topology where the action of A is continuous and such that B admits a dense
finitely generated submodule B 0 . (For example, if A = K(0), one can take B equal
the space of all functions 0 → K s with the product topology in this B = (K s )0 ,
where K s comes with the discrete topology. Clearly, the finitely generated module
B 0 = K s (0) densely embeds to this B.) Then we can define dim B as dim B 0 or
(which is essentially equivalent) by approximating the above B0 by some Bε and
taking
lim inf lim Ai Bε / dim Ai .
ε→0 i→∞
The major drawback of all this is the amenability assumption on A. This can
be overcome in the context of the Von Neumann algebras, e.g. for the rings R(0)
for arbitrary countable groups 0. Here K = R and the relevant modules are those
of `2 -functions 0 → Rs as well as their submodules and factor modules (com-
pare 1.12.1. below). The resulting Von Neumann dimension dim`2 B is well defined
for all 0 and if 0 is amenable it equals the above dimA B as an easy argument shows
(explained to me by Alain Connes about 20 years ago and exposed in the case of
`2 -cohomology in [Dod-Mat]).
TOPOLOGICAL INVARIANTS OF DYNAMICAL SYSTEMS: I 349
Let Y ⊂ (Rs )0`2 ⊂ (Rs )0 be a 0-invariant Hilbert space inside (Rs )0`2 = `2 (0, Rs ),
the space of the square summable functions 0 → Rs . Then for every subset we
define the restriction operator (map) R : Y → (Rs ) `2 for R (x) = x| and let
∗
R : (R )`2 → Y be the adjoint operator. The 0-invariance of Y (trivially) implies
s
that
trace R R∗ card
∗ =
trace R R0
0 card 0
dim`2 (Y : 0) = |−1 ∗
def
i | trace R R (+)
for some (and so for each) finite subset ⊂ 0, where || =def card (see [Con],
[GroAI ] and references therein).
0
Remark. In what follows we use standard embeddings (Rs ) → (Rs ) for all
0
⊃ where we just extend functions by zero outside . In particular, we embed
(Rs ) ⊂ (Rs )0 and observe that R∗ = R0∗ |(Rs ) , and so we abbreviate R∗ to
simple R ∗ for all ⊂ 0.
To see this more geometrically in the case of an amenable group 0 we indicate
the following (well known, I believe)
and assume that the restriction of R ∗ (x) to the complement of is β-small, i.e.
We claim that for small α and β the number λ must be close to zero or one. Namely
λ(1 − λ) 6 2α + β. (?)
350 MISHA GROMOV
i.e.
(1 − λ)
R R∗ (x)
6 (α + β)kxk.
Now use (α) again and conclude
λ(1 − λ)kxk 6 (α + β + α(1 − λ))kxk
and, finally,
λ(1 − λ) 6 α + β + α(1 − λ).
In particular we get (?) as well as the relations,
λ(1 − λ) = 0(α + β)
and
α+β
1−λ=0 .
λ
Next let −ρ < be the ρ-interior of , i.e. γ ∈ −ρ iff the ρ-ball B(γ , ρ) ⊂
for a given ρ > 0. We claim that the majority of functions x: −ρ → Rs satisfy
(β)0\ with some β = β(ρ) → 0 for ρ → ∞, at least for finite subsets ⊂ 0.
−ρ
To say it precisely, we denote by Sρ : (Rs ) → (Rs )0\ the operator R0\R ∗ on
−ρ
(Rs ) ⊂ (Rs ) and show that
trace Sρ∗ Sρ 6 β(ρ)|−ρ | (β)ρ
where β(ρ) = β(ρ, 0, s) → 0 for ρ → ∞ and where Sρ∗ : (Rs )0\ → (Rs )0 is
the adjoint to Sρ .
In fact every δ-function x = xγ on 0 concentrated at some γ ∈ 0 satisfies
kR ∗ xγ k ∈ 1 since kR ∗ k 6 1. It follows, that the restriction of kR ∗ xγ k to the
complement of the ball B(γ , ρ) has norm 6 β(ρ) for β(ρ) −−→ 0. Therefore,
ρ→∞
TOPOLOGICAL INVARIANTS OF DYNAMICAL SYSTEMS: I 351
kSρ (xγ )k 6 β(ρ) for all γ ∈ −ρ as well, because 0\ is contained in the
complement of the balls B(γ , ρ) for γ ∈ −ρ . Then the same inequality is clearly
−ρ −ρ
satisfied by Sρ∗ Sρ : (Rs ) → (Rs ) ,
∗
S Sρ (xγ )
6 β(ρ)
ρ
−ρ
that implies (β)ρ , since the δ-function make an orthonormal basis in (Rs ) .
Now we prove our proposition by first evaluating ni [a, b] for small intervals
[a, b], namely for those where |a − b| = α for some α > 0 specified later on.
We denote by Xi = Xi,a,b ⊂ (Rs )i the span of the λ-eigenfunctions of Ri R ∗
for λ ∈ [a, b] and observe that all x ∈ Xi are α-approximate λ-eigenfunctions for
every λ ∈ [a, b].
Next we consider those x ∈ Xi which vanish on the ρ-boundary of i , i.e.
ρ −ρ
x ∈ Xi =def Xi ∩ (Rs )i and observe that
dim Xρ − dim Xi /|i | −−→ 0
i
i→∞
for every fixed ρ, a and b by the amenability of {i }. Thus the estimate for dim Xi
reduces to that for Xiρ . Then we take the intersection of Xi−ρ with the span of the
eigenfunctions of Sρ∗ Sρ , (with i in place of ) corresponding to the eigenvalues
ρ,β −ρ
6 β 2 . We denote this by Xi ⊂ Xiρ and notice that the operator Sρ : (Rs )i →
ρ,β
(Rs )0\i has norm 6 β on Xi . Furthermore, according to (β)ρ , the dimension of
ρ,β
Xi is rather close to that of Xiρ for large i and ρ. Namely, ∀β > 0, ε > 0∃ρ, s.t.
ρ,β
lim sup dim Xiρ − dim Xi /|| 6 ε.
i→∞
ρ,β
Thus all we need is to estimate the dimension of Xi . To do this we invoke (?),
and apply it to λ = a ∈ [a, b] with a − b = α and get
a(1 − a) 6 2(a − b) + β,
ρ,β
provided some space Xi has positive dimension. In other words, the inequality
|a − b| < a(1 − a) − β /2
ρ,β
forces dim Xi = 0; consequently
ρ
lim sup dim Xi /|i | 6 ε
i→∞
since ε → 0 for ρ → ∞. Thus we proved our propositions for all intervals [a, b],
where
|a − b| < a(1 − a)/2, (∗)
352 MISHA GROMOV
since β > 0 can be chosen arbitrarily small. Finally, we cover an arbitrary interval
lying strictly within [0, 1] by those satisfying (∗) and thus conclude the proof. 2
n(ε) 6 Widimε E
trivially follows from 1.1.2. Probably, it is not hard to evaluate the critical ε 0 for
which n(ε 0 ) = Widimε E.
1.12.3. The restriction maps Ri : Y → (Rs )i arise from the evaluation map
Re : Y → Rs for y 7→ y(e) for the identity element e ∈ 0. Now, let R: Y → RN
be an arbitrary bounded operator and let Ri : Y → (RN )i be the orthogonal sum
of the γ -translates of R for γ ranging over i . We define Ei ⊂ (RN )i as above
with Ri in place of Ri and let
Then a straightforward generalization of the above arguments shows that the supre-
mum of dR over all operators R: Y → RN , N = 1, 2, . . ., equals the Von Neumann
dimension dim`2 (Y : 0).
QUESTION. What is dim`p (Rs )0`q for amenable groups 0? One may (?) expect
that dim`p (Rs )0`p = s for all p in the interval 1 < p < ∞, where the major issue is
the inequality dim`p R0`p < ∞. This would imply, in particular, that the `p -spaces
(Rs1 )0`p and (Rs2 )0`p are not 0-isomorphic for s1 6= s2 , at least for amenable groups
0. This seems to be unknown even for 0 = Z.
Remark. While our mean dimension parallels the topological entropy, the above
Von Neumann dimension is reminiscent of the metric entropy. This may suggest the
following questions. Which (infinite-dimensional) 0-manifolds X have dimA χ 6
dim(X : 0) and when does supµ dimA χ = dim(X : 0) for µ running over
all invariant probability measures on X? However, we do not expect the positive
answer, unless the definitions are modified in some (?) way (compare 2.1).
We shall not pursue this algebraic line of thinking anymore but shall return to
proalgebraic varieties Y in 2.7 where their dimension will be studied within a
topological framework. (If Y is irreducible then we can pass to the function field F
and define dim(Y : 0) as trandeg(F : 0). But isolating an irreducible component
in (reducible) Y may cause a problem as this may be not 0-invariant.)
For example, the zero degree term of this limit equals exp(topextent(comp Y :
{i }) where comp Y denotes the space of connected components of Y .
for all amenable sequences i ⊂ 0. (See Section 0 and 1.9 for notations.)
Proof. Let L ⊂ X be the space of legal K s -valued functions on , denote
by Mρ ⊂ L the subspace of functions → K s vanishing on the ρ-boundary of
, i.e. on ∩ ∂ρ , and observe that dim Mρ > dim L − card ∂ρ . On the other
hand, if ρ is sufficiently large, i.e. if the ρ-ball in 0 around the identity contains our
D ⊂ 0, then Mρ naturally embeds into Y = Y (L), where each function x: →
K s extends by the identical zero on 0\ and where the inclusion D ⊂ ρ-ball
ensures the L-legality of such extension. Now, if i is an amenable sequence, then
(card i )−1 dim Mρ i and (card i )−1 dim Li have the same asymptotice behavior
for i → ∞ by the above inequality and our claim follows. 2
Idea of the proof. If a linear space of functions on a finite set , e.g. our Y | has
dimension close to cardP , then it contains many sharply localized (concentrated)
functions y where ( w∈ y 2 (w))1/2 is of the order supw∈ |y(w)|. Furthermore,
one can find many, about dim(our space), such y’s, which vanish on a given subset
in provided this subset has relatively small cardinality (such as ∂ρ i in i for
large i). All this follows by simple-minded linear algebra and, when applied to
Y |i , yields in the limit for i → ∞ ‘many’ nonzero functions y ∈ Y`2 and thus
in all Y`p for p > 2. Actually there is the following standard trick of doing this
very quickly. Let Pi denote the normal projection from `2 (0; Rs ) to the space of
functions Y | i extended by zero outside i ⊂ 0. We think of these operators on
`2 (0; Rs ) as matrices indexed by 0 with entries in GLs R, written Pi (γ , γ 0 ), and
observe that
X
dim Y | i = trace Pi (γ , γ ).
γ ∈i
Now, the argument in 1.12 shows that if the space X1 is weakly closed in `∞ (0; Rs ),
then
2.1.3. Remarks. (a) The present condition of weak closeness is by far less
demanding than the one in 2.1. In particular, the above (∗) applies to the spaces
coming from solutions of linear elliptic PDE.
(b) Every `2 -function on 0 is bounded. Furthermore every nonzero x ∈ X ∩
2 (0; R ) gives rise to many functions in X ⊂ `∞ (0 : R ) by taking sums
s s
`P
γ ∈0 cγ γ x for square summable (i.e. `2 ) functions γ 7→ cγ ∈ R on 0. But it
is unclear if
It is not even clear what kind of `2 -condition ensures the positivity of dim`∞ and/or
of the mean dimension. On the other hand, if X contains a single nonzero `1 -
function, then dim`∞ (X : {i }) is positive for every amenable exhaustion {i } of
P an x ∈ X ∩ `1 (0; R ), we get lots of bounded functions in X by
s
0. In fact, given
taking sums γ ∈0 eγ λ(x) for bounded functions γ 7→ cγ ∈ R on 0. These suffice
to prove that dim`∞ > 0 and, probably, to show that dimł1 6 dim`∞ for a suitably
defined dimension dim`1 = dim`1 (X`1 : {i }).
358 MISHA GROMOV
0
Squaring `2 -functions. Suppose we are given a bilinear map Rs ⊗ Rs → Rs ,
denoted (x, y) 7→ x • y and observe that so defined product of `2 -function 0 →
0 0
Rs lands in `1 (0, Rs ). Denote by X`22 ⊂ `1 (0, Rs ) the set of the products of all
x, y ∈ X`22 , take the linear span of X`2 and let `∞ (X`22 ) be the closure of this span
0
in `∞ (0 : Rs ). If our product is sufficiently nondegenerate, then
dim`2 X`2 : 0 6= 0 ⇒ dim `∞ X`22 : 0 6= 0.
For example, is this true for Cs -valued functions with the component-wise
product Cs ⊗ Cs → Cs ?
for every finite subset D and each amenable sequence i ⊂ 0. (Notice that if 0
has no torsion, then | ÷ D| 6 || − |D| + 1.) It is equally clear that the subspace
L ⊂ X of L-legal functions on (i.e. those x: → X whose restriction to
every translate γ D ⊂ is contained in L) has
codim L 6 | ÷ D| codim L
and so
(a) One can relax the condition kεk < 1 to kεk 6 1 provided the equality
kε(x)k = k(x)k, for x = (x 1 , x 2 , . . . , x |D| ), x i ∈ Rs , is possible only for x 1 =
x 2 = . . . x |D| . This situation arises, for example, for diffusion operators on R(0).
Remarks. (b) The above applies to every normed field K, e.g. to Q with a p-
adic norm. For example, if ε is given by an (s|D| × s) matrix with integer entries
divisible by a prime number, then the corresponding morphism 1 + ε: Qs (0) →
Q(0) is injective.
In fact, 1 + ε remains injective if we replace Q by an arbitrary field of char-
acteristic zero. More generally, let λ be an arbitrary complex number and let all
entries of the above matrix ε be integer polynomials in λ without constant terms
(i.e. divisible by λ). Then the corresponding morphism 1 + ε: Cs (0) → Cs (0) is
injective, provided either λ is transcendental or an algebraic integer which is not
a root of unity.
In fact, the field Q(λ) obviously admits a norm making kεk < 1 under the above
assumptions.
(c) Our (implicite function) argument, shows that an injective morphism α:
K r (0) → K s (0) remains injective under small perturbations if it admits a right
inverse, i.e. a morphism β: K s (0) → K r (0), such that β ◦ α: K r (0) → K r (0)
equals 1. Such β obviously exists (and this was used above) for α induced by
an embedding K r → K s , but it is unclear how typical such invertibility is for
general α. To get a perspective, let 0 = Zn and K = C. Then K(0) = C(Zn )
identifies with the ring of regular functions on the torus (C× )n and morphisms
Cr (Zn ) → Cs (Zn ) become homomorphisms from the trivial vector bundle of
rank r over this torus to such bundle of rank s. Then injectivity of α translates
to injectivity of the vector bundle homomorphism on some fiber, while invertibility
of α amounts to injectivity on all fibers. Thus we see that those α for which α is
injective (i.e. those from In) constitute a nonempty Zariski open subset in Csr|D| for
s > r and all finite subsets D ⊂ Zn , while α corresponding to invertible α have a
similar property only for s > r + n.
(d) If one replaces the space Rs (0) by its dual (Rs )0 , then the corresponding
implicit function argument yields surjectivity of maps 1+ε for suitably contracting
(possibly nonlinear) maps ε: (Rs )0 ←- of finite type (i.e. defined via ε: (Rs )D →
Rs for finite D ⊂ 0). In fact, such an argument yields bounded (and also `p for
p < ∞) solutions to the equation x + ε(x) = y for y ∈ Rs (0) ⊂ `∞ (0; Rs ) ⊂
(Rs )0 and then one uses density of Rs (0) in (Rs )0 . Notice that all this applies
to (K s )0 for all complete normed fields K as well as some subvarieties in these
(K s )0 .
(e) Another class of injective examples is provided by positive selfadjoint oper-
ators Rs (0) ←-. For example, if β: Rs (0) → Rt (0) is injective, then β ∗ β : Rs (0)
is positive selfadjoint and so injective.
γ ∩ D = {γ w} and γ 0 ∩ D = {γ 0 w}.
Remark. If the group 0 is amenable, then an easy argument shows that there is
no injective morphism K r (0) → K s (0) for r > s. On the other hand, such maps
may exist for all nonamenable goups. Also one can ask which nonamenable groups
admit star-like subsets where the picture is unclear, for example, for torsion groups
with sufficiently rare sets of relations. (It is easy to construct a non0-invariant
embedding K 2 (0) → K 1 (0) of finite type for every nonamenable group using a
bounded measure contracting ‘vector field’ or such 0.)
for card K• → ∞.
Next we observe that the topological entropy of the space Y = Y (L(K•0 )) for
a given law L ⊂ (K•)D obviously equals prodim(Y : 0) log card K• . Thus we
obtain many examples of subshifts of finite type where we know what the entropy
is,
for ρ > Diam D, one can extend such x0 to a legal function on all of 0 vanishing
outside 0 + 2ρ.
transversality of L⊥1 with L . Then we take the 2ρ-ball B1 = B(γ1 , 2ρ) around γ1
and project (K s )1 to (K s )B1 ∩1 . We denote by L⊥
1 and L the images of L1 and
⊥
of L under this projection and observe that the transversality of these images is
equivalent to the transversality of the original spaces L and L⊥ 1 . But in the stable
0
case these images are the same for smaller , namely for = ∩ B(γ1 , 5ρ),
where the transversality follows from our assumptions. Thus, the proof follows by
induction on card . 2
COROLLARY. The intersection of the subset Tran0 ⊂ Gr(K s )D with the set of
ρ-stable laws on ρ , say Tran0 ∩Stρ , is Zariski open in Stρ .
(where ‘stability’ means ‘ρ-stability on 0 for some ρ’) and in many cases (e.g. for
groups with bounded asymptotic dimensions, see [GroESAV ]) the disjoint transver-
0
sality implies the existence of a stable sublaw L0 ⊂ LD0 ⊂ (K s )D for some finite
subset D 0 ⊂ 0, such that L00 = Y = L0 . On the other hand, disjoint transversality
of the spaces L for all finite ⊂ 0 (with an obvious modification of the definition
where takes the role of 0) is equivalent to the stability as an easy argument
shows.
364 MISHA GROMOV
2.4.2. COROLLARY. If s > r, then the subset of the laws L’s in Grr (Rs )D giving
Y of mean dimension s −r is residual. Furthermore, it always contains a nonempty
open subset and, if 0 is uncoiled, then ‘open’ can be strengthened to ‘Zariski open’.
stems from the problem of bounding from below Widimε Y ∩ B in the finite-
dimensional case (see 1.1.4). Yet, even without resolving the finite-dimensional
problem from 1.1.4, one expects (∗) in many cases, e.g., for Y = Y (L) where
L ⊂ XD is real analytic.
Another (rather technical) issue, which comes about in the smooth category, is
a possible bad behaviour of the singularities of Y = Y (L) and of the intermediate
finite-dimensional spaces L ⊂ X , ⊂ 0. It is unclear if L can be as bad as
any other closed subset but, in any case, one can rule out major (?) pathologies by
imposing genericity assumptions on L, making all L stratified subsets (and often
just smooth submanifolds) in X (compare 2.7).
Although we have no general result at the moment for ‘local’ mean dimension it is
possible to obtain some lower bounds for ‘global’ infinite dimensional varieties.
TOPOLOGICAL INVARIANTS OF DYNAMICAL SYSTEMS: I 367
2.6.2. TOPOLOGICAL LEMMA. There exists a positive ε = ε(X) > 0, such that
the ε 0 -dimension of Y for ε 0 = ε/2 satisfies
Widimε0 (Y ) > N,
where this ε does not depend on M, Y , α and N.
Proof. If Y admits an ε-embedding to a polyhedron P then the cylinder of this
map, say Z ⊃ Y admits a metric extending this of Y , such that dist(z, Y ) 6 ε/2 for
all z ∈ Z (compare 2.5). Next we consider the M projections of XM to X, compose
them with α and observe that the resulting maps Y → X extend to continuous maps
P → X for ε 0 6 ε0 = ε0 (X) > 0. In fact X embeds into some Euclidean space,
say X ⊂ Rn , where we may assume our original metric in X is induced by this
embedding. Since our map from Y to X ⊂ Rn is λ-Lipschitz, it extends to a λδn-
Lipschitz map from Z ⊃ Y to Rn ⊃ X. Now, as all of Z is ε 0 /2-close to Y , for
368 MISHA GROMOV
Next define a similar stable legal dimension for Y = Y (L) ⊂ X0 coming from
L ⊂ XD by
stlehodim = prohomdim
in the category of compact metric spaces. This combines with the above topological
lemma and leads to the following
Proof. All one has to add to the above discussion is the following obvious com-
parison between the sup-product metric in X and |x − x 0 | in X0 : the projection
(X0 , |x − x 0 | ) to (X , sup-metric) is 1-Lipschitz. 2
TOPOLOGICAL INVARIANTS OF DYNAMICAL SYSTEMS: I 369
for a given B ⊂ A. Clearly, (?) implies that the latter corank bounds from below
the maximal dimension k where the homomorphism H k (C) → H k (B) (induced by
B ⊂ A → C ) does not vanish.
| ↑
Finally we observe that
where i may run over an arbitrary index set I . We say (and this is
Qall we care about)
that some such h× ∈ H × does not vanish, written h× 6= 0, if i∈J hi 6= 0 for all
finite subsets J ⊂ I .
We shall apply the above convention to H = H ∗ (X0 ) and denote the corre-
sponding H × by H × (X0 ). Here the most interesting infinite products are of the
form
[
h× = γh
γ ∈0
for some h ∈ H ∗ (X0 ) and we want to decide when such an h× does not vanish.
More specifically, we define 3× (Y ) ⊂ H × (X0 ) as the set of the products
^γ ∈0 γ λγ for all assignments γ 7→ λγ ∈ 3∗ and we introduce the following
2.6.6. PROPOSITION. The above corank bounds from below the stable legal ho-
mological dimension of Y = Y (L) for compact laws L ⊂ XD ,
stlehodim(Y : {i } > lim inf corank(Y : i ).
i→∞
The proof is clear with the preceding discussion. Also, the following corollary
is now obvious.
2.6.10. Remarks. (a) This corollary is most powerful if applied to the coho-
mology with finite (e.g. Z/2Z) coefficients where the monomial condition is not
so restrictive. Thus starting with a monomial µ0 in H ∗ (XD ; Z) nondivisible by an
integer p, one gets nonmonomial classes of the form µ0 + pµ0 where the corollary
may apply.
(b) If we work with real coefficients, then the nonvanishing of an (infinite)
integer monomial µ0 obviously yields this for µ0 + 2µ0 for an integer µ0 and all
transcendental 2 ∈ R. Unfortunately it is not useful as the cohomology H ∗ (L) ⊂
H ∗ (XD ) lives over Z but it suggests that nonvanishing of products of the form
^γ ∈0 γ λ and issuing lower bound on the mean dimension are generic phenomena.
This is also confirmed by the examples we study below.
Also, observe that no monomial in π− includes a factor coming from Hγ0 , since
− D does not contain γ0 . Thus π = π− h• = π− ⊗ hγ0 + ε, where no ε-term
includes hγ0 as a factor. Hence, no cancellation is possible and π 6= 0. 2
2.6.13. Remark on the Kaplansky problem. This refers to the following ques-
tion. Let 0 have no torsion. Can then the group ring K(0) have zero divisors? The
above generalizes the standard argument showing there is no zero divisor in K(0)
if 0 is uncoiled (see Section 2.2 and [Pass]).
Standard order on H ∗ (X; R). The space of real exterior forms on Cn has a nat-
ural (minimal in some sense) order where positive 2k-forms are defined as positive
combinations of pull-backs of the standard (positive!) volume from an Ck under
nonsingular C-linear maps Cn → Ck . (This is the only GLn C-invariant order on
32R (Cn ) but it seems unclear what are other orders on 32kR (C ) for k > 2.) Observe
n
2k
that our positive form lies√ in the subspace of 3R consisting of the form invariant
under the action z 7→ −1z on C , where they constitute a convex cone with
n
nonempty interior. Forms in the interior are then called strictly positive.
Next, given a complex manifold X, a class h ∈ H ∗ (X; R) is called (strictly)
positive if it can be represented by a form which is (strictly) positive on the tangent
spaces Tx (X) for x ranging over an open dense subset in X. Clearly, this is a bona
fide order on H ∗ (X) in our sense.
t: (Z+ ) → ZQ
D 0
+ . This map sends each 0-family of monomials {mγ ∈ Z+ , γ ∈ 0}
0 D
→ Z+ and declare this to be our infinite product over 0. Notice, that for pγ = γp0
0
TOPOLOGICAL INVARIANTS OF DYNAMICAL SYSTEMS: I 375
the resulting measure is 0-invariant. Also notice that for X = CP m with m < ∞,
we deal with smaller spaces, namely, the finite set {0, . . . , m − 1}D (instead of ZD
+)
and the Cantor set {0, . . . , m − 1}0 (instead of Z0+ ).
Summing up, we see that the ‘fundamental cohomology class’ of the infinite
intersection
\
γ YL, L ⊂ (CP m )D , D ⊂ 0,
γ ∈0
We want to extend the results of 2.2 to nonlinear laws L ⊂ XD and show that
generically the mean dimension of Y = Y (L) is bounded by what one may expect,
dim(Y : 0) 6 dim X − codim L.
Intuitively, we think that the γ -translates of the equations defining L remain essen-
tially independent for generic laws L ⊂ XD .
CONCLUSION.
legdim(Y (Lε ) : 0) 6 dim X − codim Lε (?)
for generic ε ∈ E, where ‘generic’ means away from a countable union of proper
subvarieties in E.
Indeed, the dimension of our intersection is semicontinuous in ε ∈ E and if it is
small for some (possibly asymptotic) value of ε, then it is generically small.
In fact the upper bound on legdim follows by the above reduction argument
while the lower bound depends on the homological positivity argument in 2.6.
QUESTION. Does this conclusion (or at least the upper bound on legdim(Y : 0))
remain valid for all projective embeddings Xd ⊂ P N ?
TOPOLOGICAL INVARIANTS OF DYNAMICAL SYSTEMS: I 379
where the constant C depends only on the curvature of V , i.e. on supv∈V kKv (V )k,
and on the curvature of the implied vector bundle. In particular, we always have
(∗∗)V with some C < ∞ if (V , E) is acted upon by a cocompact (isometry) group.
Then we have the vanishing corollary and embedding property provided (V , E)
has bounded curvature. Furthermore, if (V , E) is invariant under a cocompact
amenable group 0, then, clearly,
dim(XE : 0) 6 constV ,E vol(V / 0).
3.2.2. Remarks and generalizations. (a) The vanishing corollary trivially extends
to manifolds with unbounded curvature if the density ε of a net 1 is allowed to
depend on v. Essentially, we need ε(v) 6 constn kK(v)k−1/2 , where K incorpo-
rates the curvatures of V and E and their first derivatives if so needed. Similarly,
one may admit unbounded section x with ε(v) ≈ (supv∈Bv kx(v)k)−1 for Bv ⊂ V
being the unit ball around v ∈ V .
In fact, one expects here a more generous density bound on 6 in the spirit of
the first main theorem of the Nevanlinna theory.
(b) The above have an obvious version in a general setting where V is an arbi-
trary metric space and XE is replaced by a subspace Y in the space of bounded maps
x: V → Rs . All one needs is uniform compactness of the restriction operators from
Y to functions on the balls B(v, 1) ⊂ V , for all v ∈ V . Actually, one needs even
less: if X is a linear space of bounded functions x: V → Rs where all x ∈ X with
supv∈V kx(v)k 6 1 are uniformly continuous with a given modulus of continuity
then dim(X : 0) < ∞.
QUESTION. What are the L∞ -counterparts of the above properties? For example,
does the mean dimension of the space of bounded harmonic functions vanish for
all amenable groups 0? (It is clear that dim− introduced in 2.1.2 does vanish.)
(e) Let E → V be a 0-equivariant Euclidean vector bundle of rank s and let
1E be the (Bochner) Laplace operator on sections V . Then the function ϕE (λ) =
dimL2 (X1E (λ) : 0) is related to the above ϕ(λ) = dimL2 (XD (λ) : 0) by the
following classical
KATO INEQUALITY.
Z ∞ Z ∞
e−λδ ϕE0 (λ) dλ 6 s e−λδ ϕ 0 (λ) dλ
0 0
for all δ > 0, where, observe, the derivatives ϕE0 and ϕ 0 are positive (measures)
since our functions are monotone increasing.
COROLLARY.
QUESTIONS. (a) Can one improve over the eλ -factor in (+)? (Here one may be
willing or unwilling to bring the curvature of E into play.) Can one bound the mean
dimension of the space of bounded harmonic k-form in the spirit of (◦)?
Notice that a bound similar (◦) (but with a poorer dependence on n) holds
true under less restrictive assumption of the sectional curvatures of V (rather than
R(V )) being bounded from below by −1. This is shown in [GroCDB ] for 0 =
{e} but the argument equally applies to all 0. Furthermore, that argument ap-
plies to the homology Hk (V ; K) for an arbitrary field K and yields a bound on
prodim(Hk (V ; K) : 0) for amenable groups 0.
(b) What is the relation between prodim(Hk (V ; R) : 0) and the mean dimen-
sion of the space of bounded harmonic k-forms on V ? (If one had a full-fledged
Hodge theory for bounded forms one could immediately claim the equality of the
two dimensions.)
x∞ (v0 ) for all v ∈ V and since x∞ |6∞ = 0 this x∞ vanishes everywhere. This
yields the vanishing of x as sup kxk = kx(v0 )k.
Next, forget about 0 and just suppose (V , E) has locally bounded geometry.
Then, instead of translating V , we move ourselves to the points vi and pass to the
(pointed Hausdorff) limit manifold V∞ = limi→∞ (V , vi ) with the limit operator
E∞ on V∞ . Then the maximum principle applies to x∞ on V∞ and the proof
follows. 2
The L2 -index theorem provides many instances where dimL2 (XE : 0) does not
vanish but it is unclear if this implies nonvanishing of the mean dimension. On
the other hand, the presence of a nonzero L1 -solution of the equation Ex = 0
(trivially) yields sufficiently many bounded solutions to ensure nonvanishing of
the mean dimension dim(XE : 0). An obvious way to go from L2 to L1 is by
taking ‘squares’ of x’s (compare), but this is usually incompatible with the equation
Ex = 0. A happy exception is the Cauchy Riemann ∂ operator as the square of a
holomorphic function is holomorphic. More generally, if V is a complex manifold
and E → V is a holomorphic vector bundle, then one can take, for instance, the
symmetric square of E, denoted E 2 , and observe that the symmetric square of a
holomorphic section is holomorphic. Thus
for the ∂-operator (which is essentially based on the Fredholm alternative, a baby
version of the index theorem) yields lots of L2 -sections of E i without any 0-action
at all.
v0 and still having x1 (v) < 1 for all v 6= v1 and finally take xi = x1i . This xi
√ 1
is > 1/2 in the ball of radius ≈ 1/ i since kx0 (v)k > const(dist(v, v0 )) 2 for v
close to v0 and so its L2 -norm is at least const i −n . On the other hand, ∂(x1 (v)) is
different from zero away from v0 where x1 (v) 6 1 − ε and so k∂xi (v)k is bounded
by const i(1 − ε)i−1 as required by (5). 2
Remarks. (a) The L2 -claim remains valid for every line bundle Ei having the
same positivity as E i . Moreover, the holomorphic sections obtained by the above
argument have a controlled decay at infinity. Indeed, let x0 be a continuous section
with compact support and h be the L2 -nearest holomorphic section, i.e. the normal
projection of x0 to the space of holomorphic L2 -sections. Then y0 = x0 − h is
holomorphic outside some ball, say B(v0 , r) ⊂ V , and it is orthogonal to all
holomorphic L2 -sections. Now, take the function ϕ: V → R+ which equals 1
outside a large concentric R-ball B(v0 , R) ⊃ B(v0 , r), which vanishes on B(v0 , r)
and which equals 1 − (R − dist(v, v0 ))/(R − r) for all v in the annulas between
the two balls. Consider the section y1 = ϕy0 and observe that
Z Z
def
(i) hy1 , y0 i = y1 (v)y0 (v) dv > |y0 (v)|2 dv,
V C(R)
It follows that
Z
0 = hy, y0 i > |y0 (v)|2 dv − const(R − r)−1 ky0 kL2 ,
C(R)
and so
Z
|y0 (v)|2 dv 6 const R −1 |y0 |L2
C(R)
for every ε 6 i −1 .
for ε 6 i −1 .
3.3.9. Remarks. (a) Notice that (+) and (++) are local properties where the
holomorphicity of x is only required on the ball B. Thus we can apply (++) to
the solutions y1 of the ∂-problem ∂yi = ∂xi satisfying the basic L2 -estimate (∗)
from 3.3.2. These yi are holomorphic (as well as xi ) in a small (but fixed!) ball
B(v0 , δ) ⊂ V and then (++) applies to smaller ε-balls B(v, ε) ⊂ B(v0 , δ). It
follows, that the holomorphic sections xi0 = xi − yi converge to xi uniformly (and
exponentially fast for i → ∞) on every concentric ball B(v0 , δ 0 < δ). In fact,
such convergence takes place also on larger balls, where ∂xi 6= 0 anymore, since
the Gårding inequality remains valid for nonhomogeneous situation, but we do not
need this for our purposes.
(b) The constants Cv and Cv0 depend on local geometry of V and E near v.
Actually Cv can be bounded in terms of the curvatures of V and E while Cv0 also
depends on the injectivity radius of V . (In general, ε −n in (++) must be replaced
by (Vol B(v, e0 ))−1/2 for some ε 0 somewhat smaller than ε.) In particular, Cv and Cv0
are bounded if V and E have bounded local geometry, e.g. if there is a cocompact
isometry group 0 acting on V and on E.
This follows from Remark (a) in 3.3.5 and the Gårding inequality.
3.3.14. Take a subset S in the jet bundle J r (E i ) over V and let us try to move a
given holomorphic section V → E i away from S. This presupposes some metric
on J r (E i ) and ‘away from S’ means that the r-jet V → J r (E i ) does not intersect
an ε-neighbourhood of S for some ε > 0. In what follows, we assume that V and E
have bounded local geometry and observe that then J r (E i ) also admits a Hermitian
structure of bounded local geometry compatible with this in V . We choose and fix
such structure in each J r (E i ).
We say that S is uniformly k-dimensional, if for each unit ball B ⊂ J r (E i ) and
every δ > 0, the intersection S ∩ B can be covered by at most Cδ −k balls of radius
δ for some constant C = C(S).
(d) The classical correspondence between divisors and line bundles extends
to the framework of bounded geometry. This allows, in particular, construction
of many bounded sections V → E i vanishing on a hypersurface W ⊂ V with
(sufficiently) bounded geometry.
(e) Here are several problems which seem to be solvable in the present frame-
work:
(1) Extension of bounded (and Lp ) holomorphic sections of E i from a submani-
fold 6 ⊂ V of (sufficiently, depending on i) bounded local geometry (where
the case dim 6 = 0 is covered by the interpolation theorem).
(2) Decomposition of bounded holomorphic sections of E i into convergent sums
of L1 -sections.
(3) Construction of bounded sections of affine subbundles P of sufficiently positive
vector bundles. For example, solution of the equation sr=1 cr ⊗ xr = a for
given bounded sections cr of E j and of E i with the unknown xr bounded
sections of E i . Similarly, one is interested in the equation
X
xr ⊗ yr = a
r
present context)? Here one exercises a good control over L2 -sections especially
for the tensorial powers E ⊗i in terms of the index of the twisted operator EE⊗i
but it is unclear when there are nontrivial Lp -sections x of E ⊗i satisfying the
equation EE⊗ i (x) = 0. Similar question arises for the Hodge–Laplace operator
acting on 3∗ (V ) where nonzero harmonic L1 -form may (?) appear in the presence
of a nontrivial cup-product.
kD i xk 6 Ci (kDxk) (+)
and
using the parallel transport in X along the geodesic [x1 (v), x2 (v)] ⊂ X and take
the difference Dx1 (v) − Dx2 (v). Thus we can speak of the C 1 -distance kDx1 (v) −
Dx2 (v)k and set
def
kDx1 − Dx2 k = sup kDx1 (v) − Dx2 (v)k.
v∈V
394 MISHA GROMOV
Notice that this C 1 -distance is well defined if x1 and x2 are C 0 -close, i.e.
def
kx1 − x2 k = sup distX (x1 (v), x2 (v)) 6 ε0 < InjRad X,
v∈V
About the proof of (a) and (b) for our examples. The property (a) is well-known
for the classes of maps indicated in the above (i)–(iii) where it is derived from
the corresponding elliptic regularity for nonhomogeneous linear equations via the
standard implicit function argument. The sufficient condition on V and X is a
uniform C 1 -bound on their curvatures. Then (b) follows by the trivial interpolation
property of smooth maps,
kDx1 − Dx2 k 6 Ckx1 − x2 k
for C = C(V , X, kD 2 xk, kD 2 xk).
in (∗) can be assumed independent of λ and µ in-so-far as these λ and µ are > 1,
since such scaling diminishes the curvature. On the other hand, when we scale the
metrics, the norms of the differentials of the maps x: V → X scale by the rule,
kDxkλ,µ = λ−1 µkDxk,
where kDxkλ,µ is the norm measured with respect to metrics in λV and µX. It fol-
lows, that the constant C in (∗) is bounded by c const(V , X, E, ε0 ) if c = sup kDxk
is > 1. This is seen by taking λ = c and µ = 1. Consequently, the above ε is
bounded from below by δc−1 , δ > 0, and so we obtain the bound of the mean
dimension of Xc for large c by bcn as was stated in (?)∞ of 0.6.1.
Next let us see what happens if c = sup kD(x)k is small. Such a map x sends
large R-balls in V to small ones, of radii cR in X, and if we scale these small balls
to the unit size by passing to µX with µ = (cR)−1 we get maps from B(R) ⊂ V
to almost Euclidean unit balls, where we assume that c is much smaller than R −1 .
Thus, we can think of the harmonic equation for map B(R) → µX on each B(R)
as a small perturbation of the ordinary Laplace equation for maps B(R) → RN ,
N = dim X. Namely, if x1 and x2 are two harmonic maps from B(R) to a unit
ball in µX, then the difference x1 − x2 is approximately harmonic in the Euclidean
sense, where the difference is taken in the Euclidean geometry approximating the
Riemannian one in µX. Now (b) and (b0 ) from 3.2.3 imply the following
Remarks. (a) We treated above only harmonic maps, but the same argument ap-
plies to the pseudo-holomorphic maps between almost complex manifolds (where
it somewhat simplifies in the honestly holomorphic case).
(b) It is not hard to quantify the above and give a specific bound on dim(Xc : 0)
for harmonic maps and small c in terms of c, the upper bound on the sectional
curvature of X and the rate of decay of the heat kernel in V .
dim(Xc : C) 6 2mC 2 /π ;
(d) More general (but less precise) results are available for harmonic maps,
where the elliptic estimates are controlled by the lower bound on Ricci curvature
of V and the upper bound on the sectional curvature of X as (apparently) follows
from the Yau gradient estimate (compare (b) in 3.4.2).
(e) Our embedding result states, in effect, that two distinct harmonic (line) maps
x and x 0 with bounded differentials cannot coincide on a sufficiently dense subset
6 in the manifold V where the maps are defined. Much more is known for holo-
morphic maps, where the first main theorem of the Nevanlinna theory provides a
bound on the density of 6 in terms of the growth of kDxk and kDx 0 k on V . This
leads to the following
and
for all v ∈ V .
Here again, one expects the bound on a suitable density of the zero set of δ(v) in
terms of the asymptotic growth of σ (v) for v → δ. More generally, one wishes to
show, that if δ(v) is small on a rather dense set, then it is also small on a much larger
set, provided we have some bound on σ (v). For example, a holomorphic function
x with many zeros in a ρ-disk and with a bound on kx(v)k in the consecutive
2ρ-disk is much smaller on the ρ-disk than was suggested by the original bound
on kx(v)k. Another general phenomenon of this kind is the unique continuation
TOPOLOGICAL INVARIANTS OF DYNAMICAL SYSTEMS: I 397
property for elliptic PDE but all this seems far away from a desirable solution of
the above problem.
Remark. Probably the conclusion remains true for all complex (not necessarily
algebraic) X and, possibly, for more general harmonic (like) maps.
Remark. By varying σi and/or rotating the sphere S 2 , one can easily make an
x: C → S 2 with kdxk 6 c and prescribed values on a given, sufficiently sparce
(depending on c > 0) net 6 0 ⊂ C. (See [Ere] for a finer construction of such
interpolating maps C → CP n .)
3.6.2. Now, given a suitable holomorphic Lipschitz (i.e. with bounded differential)
map x0 : V → CP N we want to generate a larger space of such maps. To do this we
take the pull-back E → V of the O(1)-bundle over CP N and use bounded sections
of E i for this purpose. So we need E to be rather positive which is ensured by the
following condition generalizing the ‘locally bi-Lipschitz’ property.
near by points in U and e ∈ Eui and e ∈ Eu0 0 are close vectors, then the leaves V eu and
eu0 are close on large balls B
V eu ⊂ Veu and B eu0 ⊂ V
eu0 . Then the holomorphic section
e
h̃e , on Bu0 can be moved to an almost holomorphic section h̃0 on Bu which lies close
to h̃e0 on B eu0 and has h̃0 (u) = e. This h̃0 can be made zero outside B eu by applying
an obvious cut-off argument and then we observe that the section h̃00 = 12 (h̃e + h̃0 )
is also almost holomorphic and thus can be turned holomorphic as we did before
using the L2 -estimate. The resulting holomorphic h̃• is L2 -close to h̃00 and may be
assumed to have h̃• (u) = e. On the other hand, if h̃e0 were far from h̃e , then h̃0 is far
from h̃e as well, and then their mean would have significantly smaller L2 -norm than
h̃e (as we could assume kh̃e0 kL2 6 kh̃e kL2 by interchanging u and u0 otherwise)
which contradicts to minimality of the norm kh̃e kL2 . Next, we square each h̃u thus
making it L1 and then push down the resulting section e e to a section ϕ 2 (v)
ϕ 2 from V
of E → U by integrating e
2i
ϕ (u, ṽ) with respect to u over the leaf Vv = Vu and
2
e
summing up over all ṽ in Vu over v, where we need ϕ to be bounded (as well as
continuous) on U. In particular, if U is compact, we obtain, by varying ϕ, lots of
continuous sections of E 2i → U holomorphic along the leaves and conclude that
U admits a continuous leafwise holomorphic and leafwise locally bi-Lipschitz map
to some CP N .
Notice, that such foliations exist, for example, on locally homogeneous spaces
U of the form K\G/ 0 where G is a semi-simple group without compact quotients,
K ⊂ G a (nonmaximal!) compact subgroup and 0 ⊂ G is a cocompact lattice.
(d) Singular spaces V . Probably, our results extend to singular spaces V with
an obvious extension of the idea of bounded local geometry. For example, one can
easily handle submanifolds V of a manifold W with bounded geometry, such that
Vol2n (V ∩ B) 6 const for all unit balls B in W . In general, one needs a suitable
version of ∂-technique where a natural idea is to embed V into a nonsingular mani-
fold. Alternatively, one may resolve the singularity of V and adjust the ∂-lemma to
sections constant (vanishing) on the pull-back of the singular locus. Alternatively,
one may try L2 -techniques on the Ĉech resolutions of the relevant sheaves.
(e) As we mentioned earlier, the space of harmonic maps between generic Rie-
mannian manifolds seems rather small but there are some exceptional cases besides
the Kähler manifolds. For example, one may look from this angle at harmonic maps
Rn → S N for all n > 2.ROne can also consider n-harmonic maps which locally
minimize the energy = kDxkp dv with p = n which bubble very much like
ordinary harmonic
R maps for n = 2. Here one should probably replace the uniform
metric for RkDxkp by ‘the energy metric’ and study maps x: V → X locally
minimizing kDxkp dv and having this integral uniformly bounded over the unit
balls in V . This is similar to bounding holomorphic maps x: V → X by their
‘local degrees’, i.e. by the volumes of their graphs within unit balls in V × X,
where one can use estimates from 4.1 or, alternatively, the (first main theorem of
the) Nevanlinna theory which, when it applies, gives better bounds on the mean
dimensions of these spaces than 4.1 (as was pointed out to me by Alex Eremenko).
Notice that in all these cases the spaces of maps with bounded local energy (or
TOPOLOGICAL INVARIANTS OF DYNAMICAL SYSTEMS: I 403
degree) are not compact and one should compactify them by allowing suitable
‘singular maps’ best represented by certain subsets in V × X appearing as limits
of graphs of the maps in question.
(f) It is worthwhile to recall at this stage that the mean dimension of a space
of maps V → X appears as a limit of the ε-dimensions of spaces of maps of
relatively compact domains ⊂ V . A more general class of geometric problems
can be formulated for an arbitrary V , allowed to be noncomplete and/or to have a
boundary and for a relatively compact in the interior of V . Here we take some
space Xc of our (harmonic like) maps x: V → X with a bound like kDxk < c,
or a similar bound on the (local or global) energy of x. Then we restrict the maps
x ∈ Xc to and evaluate the ε-dimension Widimε of the resulting space Xc | of
maps → X with respect to some metric in this space, e.g. the uniform metric
or some energy metric. What we want to know is the asymptotic behavior of the
resulting Widimε (Xc |) for growing V and , where remains much smaller
than V . Here it is equally interesting to evaluate the minimal number Nε of the
ε-balls needed to cover Xc |, where the expected growth is roughly ε − Vol .
4. Spaces of Subvarieties
Let W be a Hermitian manifold and consider the space of complex analytic sub-
varieties M ⊂ W of given dimension n. All? possible topology in M comes from
the Hausdorff convergence on compact subsets in W . We shall use below a slightly
different topology induced on M from the space of currents on W . Namely, for
every collection of continuous forms w on W of degree 2n with compact support,
we set
404 MISHA GROMOV
Z Z
|M1 − M2 | = sup w− w .
w∈ M1 M2
intersection of M with this tube. Clearly, the projection of M0 to B0⊥ (ε) is a proper
map of multiplicity d0 = d0 (ε) 6 constn (Vol M ∩ B(ρ))/ε 2n. Thus M0 is rep-
resented by the graph of d0 -valued map over B0⊥ (ε), say ϕ0 : B0⊥ (ε) → Sd0 D m .
Then we consider such tubes centered at all points in M and cover M by a min-
imal number of these. Here we set ρ = 1 and denote by d the supremum of the
volumes of intersections of the unit balls in CN with M. Then we see with (∗)
that there is a covering of M by ε-tubes, where the number of such tubes meeting
each unit ball in CN is bounded by constN dε −2n 6 const0N d 2n+1 , where we use
(+) in the form ε > constN d −1 (and where we exercise the usual freedom with
the notation ‘const’N ). Finally we observe with (+) that d0 6 constN d 2n and so
dim Sd0 D m 6 constN md 2n 6 const0N d 2n . Thus the total number of ‘parameters
per unit volume’ defining M is bounded by constN d 4n+1 . This makes plausible
that the mean dimension of the space M ed of n-dimensional complex subvarieties
M ⊂ C with the local degrees bounded by d satisfies the inequality
N
ed : CN ) 6 constN d 4n+1 .
dim(M (?)
Actually, the natural conjecture (justified later on) reads
ed : CN ) 6 constN d n+1 ,
dim(M (??)
but we are not able to prove even the weaker inequality (?).
Here are two difficulties.
1. The above heuristic argument only applies to subvarieties close to a given
one and we lack a good localization theorem saying that ‘the local mean dimen-
sion equals the global one’. Thus we have to vary the tubes covering M which
unpleasantly enlarges the exponent 4n + 1 to something of order N 2 .
2. As we change a covering of M by ε-tubes, we change our representation of
M by a collection of maps (this already happens near a fixed M as we appeal to
Cauchy inequality). This introduces an ambiguity in our choice of a metric in M of
order d d (which probably could be greatly reduced) and this makes our exponent
(even in the local case) comparably large.
Remark. One can improve the covering argument of M by ε-tubes (using tubes
of variable size at different points in M) but I doubt you can bring the exponent
down to n + 1 this way. (The ‘difficult’ M’s are those having large intersections
with small balls, e.g., having conical singularities of degrees ≈ d.) On the other
hand, the above 1 and 2 are purely technical problems and should be eventually
resolved.
4.1.3. Parametrizations of Sd Cm
The symmetric powers of C are nonsingular. In fact, Sd C can be identified with C
in several ways. For example, given a symmetric configurationQof complex numbers
v1 , . . . , vd one can associate to them the polynomial p(z) = di=1 (z − vi ) and then
TOPOLOGICAL INVARIANTS OF DYNAMICAL SYSTEMS: I 407
one uses the metric on Sd C corresponding to the sup-norm of functions on the disk
D(2) ⊂ C of radius 2. Another useful representation of Sd C is by means of the
symmetric functions,
X
d X
d X
d
s1 = mi , s2 = m2i , ..., sd = mdi ,
i=1 i=1 i=1
where the corresponding metric is the sup-norm in the (s1 , . . . , sd )-space. The two
norms are bi-Lipschitz equivalent, at least in the region corresponding to mi ’s with
|mi | 6 1 where the Lipschitz constant can be trivially bounded by something like
d d . Indeed, going from si to polynomials amounts to expressing the elementary
symmetric functions as polynomials in si . Conversely, one reconstructs si out of
p(z) by taking Cauchy integral of zi p 0 (z)/p(z) over the circle of radius 2 since
zi p 0 (z)/p(z) has simple poles at mi with residues mdi .
Next we observe that the natural map Sd Cm → (Sd C)m is finite-to-one, we
take, additionally, the composition of this projection with a generic linear map of
Cm . Then the resulting map Sd Cm → (Sd C)m × (Sd C)m = (Sd C)2m becomes
one-to-one.
Clearly, there exists a system of disks with above properties, such that the num-
ber of these per unit volume in CN , i.e. meeting each unit ball (or cube) in CN does
not exceed constN ε0−K for
K = dim Isom CN + n = N(2N + 3) + n 6 3N 2 .
4.1.5. MAIN LEMMA. Let D be a collection of disks satisfying A, B, C and let
M1 and M2 be n-dimensional subvarieties in CN from the class M ed . Suppose, that
every disk D ∈ Dε-transversal to both M1 and M2 with the above ε, satisfies
D ∩ M1 = D ∩ M2 . Then M1 = M2 .
Proof. Since M1 ∪ M2 ⊂ M2d and our ε 0 is so small, we can cover CN by ε-
tubes g(D m × B(ε)) for D = D(1) ⊂ C, B(ε) ⊂ Cn and ε much larger than ε,
say ε = constN d −1 (see (+)) such that all disks gD m are in our collection and,
moreover, 2ε-transversal to both M1 and M2 . Thus, M1 and M2 are represented by a
collection of Sdg Cm -valued maps ϕg on the corresponding ε 0 -balls g(B(ε)) ⊂ CN ,
where dg are bounded by const0N d/(ε)2n 6 N 2N d 2n+1 . The intersection condition,
with B, says that these functions are equal on ε 0 -dense subsets in these balls. Now,
let δ denote the supremum of the distances between these functions over all our
balls. The argument as in Section 3 appealing to the Cauchy inequality makes
this distance λδ-small on concentric balls of radii, say 0.9ε with very small λ,
something of order 12 d −d . Thus δ-distance for one covering implies 12 d −d δ-distance
for another covering which then yield 12 δ-distance for the original covering by the
discussion in 3.1. It follows δ = 0 and the proof follows. 2
Remark. We did not try to be sharp in the above estimates but used notations
clarifying relative roles of n, N and d. Besides, there are little details to fill in, like
requiring covering by 0.8ε 0 -tubes (rather than the ε 0 -tubes), etc.
4.1.6. We want to interprete 4.1.5 as an embedding result and thus bound the mean
dimension of M ed . Denote by 1 = 1(D n ) the union of the cones of Si D m , i =
1, . . ., d0 joint at the vertex
_
d0
1= cone(Si D m ),
i=0
where d0 is the smallest integer > N 2N d 2n+1 , and let 1g = 1(gD m ). Now, for
every collections of disks gD m , g running over some subset g ∈ Isom CN , we
map Md to the Cartesian product ×g∈g 1g as follows. If gD m is 2ε-transversal to
M, then the g’s component of our map sends M to M ∩ gD m. If gD m is not ε-
transversal, we go to the joint vertex of the cones and we interplate between the
two maps in some standard way. Now Main Lemma shows that this map is an
embedding.
ed is bounded by
COROLLARY. The mean dimension of M
ed : CN ) 6 const(N, d),
dim(M (++)
TOPOLOGICAL INVARIANTS OF DYNAMICAL SYSTEMS: I 409
Remark. Our bound on K, ε and d0 = dim 1 are pretty awful. Better leave it
just as const(N, d).
Remarks and open questions. (a) As we mentioned earlier, the constant in (++)
should be bounded by constN d 2n , where it will be interesting to explicitely com-
pute constN .
(b) The above argument can be, probably, extended to two-dimensional mini-
mal subvarieties in Riemannian manifolds and also to pseudo-holomorphic (one-
dimensional) subvarieties in almost complex manifolds W (where the easiest case
if of dimR W = 4 as we have at our disposal pseudo-holomorphic curves ε-
transversal to our M ⊂ W ). On the other hand, the situation seems more difficult
for higher dimensional minimal subvarieties. In fact, it seems unknown if the space
of n-dimensional minimal subvarieties of volume 6 d < ∞ in a compact Rie-
mannian manifold W has finite topological dimension. (On the other hand, generic
W ’s contain few minimal subvarieties and so, typically, their mean dimension
should be zero for infinite groups 0.)
(c) Clearly Med is empty for d 6 d0 = d0 (W ), where the critical d0 equals
1 for W = C . It is not hard to see that the mean dimension is continuous at
N
If N = n + 1, this becomes
and gives us the following bound on the residual dimension of the space M ed =
e
Md (C ) (of n-dimensional subvarieties M with Vol M ∩ B(1) 6 d for all unit
N
balls B(1)), resdim M ed 6 const d n+1 . This improves our earlier bound (with a poor
dependence on d) and suggests that the mean dimension of M ed must be asymptotic
to d n+1 . Here is a more general
Let W admit a positive line bundle E of locally bounded geometry. Then W admits
a holomorphic uniformly nondegenerate Lipschitz map x to CP N , N = dim W .
The pull-backs of subvarieties in CP N are, clearly, in our class M ed→∞ and by
e
varying x one sees that Md has positive mean dimension. Actually, by a direct ap-
plication of the uniform transversality theorem one obtains bounded holomorphic
sections x: W → E i which are uniformly transversal to the zero section in the
obvious sense. The zero set x −1 (0) ⊂ W of such an X is a manifold with bounded
local geometry of dimension equal dim W − 1 and so one obtains by induction
such submanifolds of all codimensions. This combines with an obvious scaling ar-
gument and shows, in particular, that in the presence of cocompact amenable action
the mean dimension of the space M ed of n-dimensional submanifolds M ⊂ W with
the bound Vol B ∩ M 6 d for all unit balls B ⊂ W satisfies
ed : 0) > const d n+1
dim(M
for all n 6 N, some const = const(W, 0) > 0 and all sufficiently large d.
Remarks and final questions. Since every complex subvariety is minimal, one
sees with the above theorem, for example, that the space Md of 2m-dimensional
minimal subvarieties M ⊂ RN with the volume bound by Vol M ∩ B 6 d has
dim(Md : RN ) > 0 for all N > 2m + 2 and d > Vol B 2m . But it is unclear if this
dimension is positive for minimal surfaces in R3 (where one can use the Weirstrass
representation to generate minimal surfaces).
Another situation where one may expect positive mean dimension is that of
pseudo-holomorphic subvarieties M ⊂ W with dimR M = 2, but here one needs
a different technique for producing sufficiently many of them in suitable almost
complex manifolds W .
Finally, we mention special Lagrangian submanifolds and related classes of
complex submanifolds, e.g., M ⊂ CN isotopic relative to a given (symmetric or
anti-symmetric) bi-linear form on CN . Unfortunately, the lack of examples pre-
cludes us from asking meaningful questions.
414 MISHA GROMOV
References
[Con] Connes, A.: Noncommutative Geometry, Academic Press, San Diego, 1994.
[Delz] Delzant, T.: Sur l’anneau d’un groupe hyperbolique, C.R. Acad. Sci. Paris Sér. I
Math. 324 (1997), 381–384.
[Dod-Mat] Dodziuk, J. and Mathai, V.: Approximating L2 invariants of amenable covering
spaces: A combinatorial approach, J. Funct. Anal. 154(2) (1998), 359–378.
[Dra] Dranishnikov, A.: Homological dimension theory, Russian Math. Surveys 43 (1988),
11–63.
[Ere] Eremenko, A.: Normal holomorphic curves from parabolic regions to projective
spaces, Preprint, 1998.
[Gal-Mey] Gallot, S. and Meyer, D.: Opérateur de courbure et laplacien des formes dif-
férentielles d’une variété riemannienne, J. Math. Pures Appl. (9) 54(3) (1975),
259–284.
[Gri-Ha] Griffiths, P. and Harris, J.: Principles of Algebraic Geometry, Wiley, New York, 1978.
[GroAI ] Gromov, M.: Asymptotic invariants of infinite groups, In: Niblo and Roller (eds),
Geometric Group Theory, Vol. 2, Proc. Symp. Sussex Univ., Brighton, July 14–
19, 1991, London Math. Soc. Lecture Notes Ser. 182, Cambridge Univ. Press,
Cambridge, 1993, pp. 1–295.
[GroCC ] Gromov, M.: Carnot–Carathéodory spaces seen from within sub-Riemannian
geometry, In: A. Bellaiche (ed.), Proc. journées nonholonomes; géométrie sous-
riemannienne, théorie du contrôle, robotique, Paris, June 30–July 1, 1992, Progr.
Math. 144, Birkhäuser, Basel.
[GroCDB ] Gromov, M.: Curvature, diameter and Betti numbers, Comm. Math. Helv. 56 (1981),
179–195.
[GroESAV ] Gromov, M.: Endomorphisms of symbolic algebraic varieties, J. Europ. Math. Soc.
(1999), 109–197.
[GroFRM ] Gromov, M.: Filling Riemannian manifolds, J. Differential Geom. 18 (1983), 1–147.
[GroMIKM ] Gromov, M.: Metric Invariants of Kähler manifolds, In: Caddeo and Tricerri (eds),
Proc. Workshops on J. Differential Geom. and Topology, Alghero, Italy, 20–26 June
1992, World Scientific, Singapore, 1993, pp. 90–117.
[GroNLS ] Gromov, M. and Piatetski-Shapiro, I.: Non-arithmetic groups in Lobatchevski
spaces, Publ. Math. IHÉS 66 (1988), 27–45.
[GroPCMD ] Gromov, M.: Positive curvature, macroscopic dimension, spectral gaps and higher
signatures, In: Gindikin and Simon (eds), Functional Analysis on the Eve of the
21st Century, Volume II. In honor of the eightieth birthday of I. M. Gelfand. Proc.
Conf. Rutgers Univ., New Brunswick, NJ, U.S.A., Oct. 24–27, 1993. Progr. Math. 132,
Birkhäuser, Basel, 1996, pp. 1–213.
[GroPDR ] Gromov, M.: Partial Differential Relations, Ergeb. der Math. 3. Folge, Bd. 9,
Springer-Verlag, New York, 1986.
[G-L-P] Gromov, M., LaFontaine, J. and Pansu, P.: Metric Structures for Riemannian and
Non-Riemannian Spaces, Based on Structures métriques des variétés riemanniennes,
edited by J. LaFontaine and P. Pansu, English translation by Sean Michael Bates,
Progr. Math. 152, Birkhäuser, Basel, 1999.
[Hed] Hedlund, G.: Endomorphisms and automorphisms of the shift dynamical system,
Math. Systems Theory 3 (1969), 320–375.
[Lin] Lindenstrauss, E.: Mean dimension, small entropy factors and an embedding theo-
rem, Preprint, 1998.
[Lin-Wei] Lindenstrauss, E. and Weiss, B.: Mean topological dimension, to appear in Israel J.
Math.
[Lück] Lück, W.: L2 -invariants of regular coverings of compact manifolds and CW -
complexes, Preprint.
TOPOLOGICAL INVARIANTS OF DYNAMICAL SYSTEMS: I 415
[Orn-Weis] Ornstein, D. and Weiss, B.: Entropy and isomorphism theorems for actions of
amenable groups, J. Anal. Math. 48 (1987), 1–141.
[Nap] Napier, T.: Convexity properties of coverings of smooth projective varieties, Math.
Ann. 286(1–3) (1990), 433–479.
[Pass] Passman, D. S.: The Algebraic Structure of Group Rings (Reprint with corrections of
the orig. edn publ. 1977 by Wiley, New York), Krieger Publ., Florida.
[Schm] Schmidt, K.: Dynamical Systems of Algebraic Origin, Progr. Math., Birkhäuser,
Basel, 1995.
[Tian] Tian, C.: On a set of polarized Kähler metrics on algebraic manifolds, J. Differential
Geom. 32 (1990), 99–130.
[Ve-Go] Vershik, A. and Gordon, E.: Groups which locally embed into the class of finite
groups, Algebra Anal. 9(1) (1997), 71–97 (in Russian).
Mathematical Physics, Analysis and Geometry 2: 417–418, 1999.
Contents of Volume 2