Riesz–Thorin theorem

Theorem on operator interpolation From Wikipedia, the free encyclopedia

In mathematics, the Riesz–Thorin theorem, often referred to as the Riesz–Thorin interpolation theorem or the Riesz–Thorin convexity theorem, is a result about interpolation of operators. It is named after Marcel Riesz and his student G. Olof Thorin.

This theorem bounds the norms of linear maps acting between Lp spaces. Its usefulness stems from the fact that some of these spaces have rather simpler structure than others. Usually that refers to L2 which is a Hilbert space, or to L1 and L. Therefore one may prove theorems about the more complicated cases by proving them in two simple cases and then using the Riesz–Thorin theorem to pass from the simple cases to the complicated cases. The Marcinkiewicz theorem is similar but applies also to a class of non-linear maps.

Motivation

Summarize
Perspective

First we need the following definition:

Definition. Let p0, p1 be two numbers such that 1 ≤ p0 < p1 ≤ ∞. Then for 0 < θ < 1 define pθ by: 1/pθ = 1 − θ/p0 + θ/p1.

By splitting up the function f in Lpθ as the product |f| = |f|1−θ |f|θ and applying Hölder's inequality to its pθ power, we obtain the following result, foundational in the study of Lp-spaces:

Proposition (log-convexity of Lp-norms)  Each fLp0Lp1 satisfies:

This result, whose name derives from the convexity of the map 1p ↦ log ||f||p on [0, ∞], implies that Lp0Lp1Lpθ.

On the other hand, if we take the layer-cake decomposition f = f1{|f|>1} + f1{|f|≤1}, then we see that f1{|f|>1}Lp0 and f1{|f|≤1}Lp1, whence we obtain the following result:

Proposition  Each f in Lpθ can be written as a sum: f = g + h, where gLp0 and hLp1.

In particular, the above result implies that Lpθ is included in Lp0 + Lp1, the sumset of Lp0 and Lp1 in the space of all measurable functions. Therefore, we have the following chain of inclusions:

Corollary  Lp0Lp1LpθLp0 + Lp1.

In practice, we often encounter operators defined on the sumset Lp0 + Lp1. For example, the Riemann–Lebesgue lemma shows that the Fourier transform maps L1(Rd) boundedly into L(Rd), and Plancherel's theorem shows that the Fourier transform maps L2(Rd) boundedly into itself, hence the Fourier transform extends to (L1 + L2) (Rd) by setting for all f1L1(Rd) and f2L2(Rd). It is therefore natural to investigate the behavior of such operators on the intermediate subspaces Lpθ.

To this end, we go back to our example and note that the Fourier transform on the sumset L1 + L2 was obtained by taking the sum of two instantiations of the same operator, namely

These really are the same operator, in the sense that they agree on the subspace (L1L2) (Rd). Since the intersection contains simple functions, it is dense in both L1(Rd) and L2(Rd). Densely defined continuous operators admit unique extensions, and so we are justified in considering and to be the same.

Therefore, the problem of studying operators on the sumset Lp0 + Lp1 essentially reduces to the study of operators that map two natural domain spaces, Lp0 and Lp1, boundedly to two target spaces: Lq0 and Lq1, respectively. Since such operators map the sumset space Lp0 + Lp1 to Lq0 + Lq1, it is natural to expect that these operators map the intermediate space Lpθ to the corresponding intermediate space Lqθ.

Statement of the theorem

Summarize
Perspective

There are several ways to state the Riesz–Thorin interpolation theorem;[1] to be consistent with the notations in the previous section, we shall use the sumset formulation.

Riesz–Thorin interpolation theorem  Let 1, Σ1, μ1) and 2, Σ2, μ2) be σ-finite measure spaces. Suppose 1 ≤ p0 , q0 , p1 , q1 ≤ ∞, and let T : Lp0(μ1) + Lp1(μ1) → Lq0(μ2) + Lq1(μ2) be a linear operator that boundedly maps Lp0(μ1) into Lq0(μ2) and Lp1(μ1) into Lq1(μ2). For 0 < θ < 1, let pθ, qθ be defined as above. Then T boundedly maps Lpθ(μ1) into Lqθ(μ2) and satisfies the operator norm estimate

In other words, if T is simultaneously of type (p0, q0) and of type (p1, q1), then T is of type (pθ, qθ) for all 0 < θ < 1. In this manner, the interpolation theorem lends itself to a pictorial description. Indeed, the Riesz diagram of T is the collection of all points (1/p, 1/q) in the unit square [0, 1] × [0, 1] such that T is of type (p, q). The interpolation theorem states that the Riesz diagram of T is a convex set: given two points in the Riesz diagram, the line segment that connects them will also be in the diagram.

The interpolation theorem was originally stated and proved by Marcel Riesz in 1927.[2] The 1927 paper establishes the theorem only for the lower triangle of the Riesz diagram, viz., with the restriction that p0q0 and p1q1. Olof Thorin extended the interpolation theorem to the entire square, removing the lower-triangle restriction. The proof of Thorin was originally published in 1938 and was subsequently expanded upon in his 1948 thesis.[3]

Proof

Summarize
Perspective

We will first prove the result for simple functions and eventually show how the argument can be extended by density to all measurable functions.

Simple functions

By symmetry, let us assume (the case trivially follows from (1)). Let be a simple function, that is for some finite , and , . Similarly, let denote a simple function , namely for some finite , and , .

Note that, since we are assuming and to be -finite metric spaces, and for all . Then, by proper normalization, we can assume and , with and with , as defined by the theorem statement.

Next, we define the two complex functions Note that, for , and . We then extend and to depend on a complex parameter as follows: so that and . Here, we are implicitly excluding the case , which yields : In that case, one can simply take , independently of , and the following argument will only require minor adaptations.

Let us now introduce the function where are constants independent of . We readily see that is an entire function, bounded on the strip . Then, in order to prove (2), we only need to show that

for all and as constructed above. Indeed, if (3) holds true, by Hadamard three-lines theorem, for all and . This means, by fixing , that where the supremum is taken with respect to all simple functions with . The left-hand side can be rewritten by means of the following lemma.[4]

Lemma  Let be conjugate exponents and let be a function in . Then where the supremum is taken over all simple functions in such that .

In our case, the lemma above implies for all simple function with . Equivalently, for a generic simple function,

Proof of (3)

Let us now prove that our claim (3) is indeed certain. The sequence consists of disjoint subsets in and, thus, each belongs to (at most) one of them, say . Then, for , which implies that . With a parallel argument, each belongs to (at most) one of the sets supporting , say , and

We can now bound : By applying Hölder’s inequality with conjugate exponents and , we have

We can repeat the same process for to obtain , and, finally,

Extension to all measurable functions in Lpθ

So far, we have proven that

when is a simple function. As already mentioned, the inequality holds true for all by the density of simple functions in .

Formally, let and let be a sequence of simple functions such that , for all , and pointwise. Let and define , , and . Note that, since we are assuming , and, equivalently, and .

Let us see what happens in the limit for . Since , and , by the dominated convergence theorem one readily has Similarly, , and imply and, by the linearity of as an operator of types and (we have not proven yet that it is of type for a generic )

It is now easy to prove that and in measure: For any , Chebyshev’s inequality yields and similarly for . Then, and a.e. for some subsequence and, in turn, a.e. Then, by Fatou’s lemma and recalling that (4) holds true for simple functions,

Interpolation of analytic families of operators

Summarize
Perspective

The proof outline presented in the above section readily generalizes to the case in which the operator T is allowed to vary analytically. In fact, an analogous proof can be carried out to establish a bound on the entire function from which we obtain the following theorem of Elias Stein, published in his 1956 thesis:[5]

Stein interpolation theorem  Let 1, Σ1, μ1) and 2, Σ2, μ2) be σ-finite measure spaces. Suppose 1 ≤ p0 , p1 ≤ ∞, 1 ≤ q0 , q1 ≤ ∞, and define:

S = {zC : 0 < Re(z) < 1},
S = {zC : 0 ≤ Re(z) ≤ 1}.

We take a collection of linear operators {Tz : zS} on the space of simple functions in L1(μ1) into the space of all μ2-measurable functions on Ω2. We assume the following further properties on this collection of linear operators:

  • The mapping is continuous on S and holomorphic on S for all simple functions f and g.
  • For some constant k < π, the operators satisfy the uniform bound:
  • Tz maps Lp0(μ1) boundedly to Lq0(μ2) whenever Re(z) = 0.
  • Tz maps Lp1(μ1) boundedly to Lq1(μ2) whenever Re(z) = 1.
  • The operator norms satisfy the uniform bound for some constant k < π.

Then, for each 0 < θ < 1, the operator Tθ maps Lpθ(μ1) boundedly into Lqθ(μ2).

The theory of real Hardy spaces and the space of bounded mean oscillations permits us to wield the Stein interpolation theorem argument in dealing with operators on the Hardy space H1(Rd) and the space BMO of bounded mean oscillations; this is a result of Charles Fefferman and Elias Stein.[6]

Applications

Summarize
Perspective

Hausdorff–Young inequality

It has been shown in the first section that the Fourier transform maps L1(Rd) boundedly into L(Rd) and L2(Rd) into itself. A similar argument shows that the Fourier series operator, which transforms periodic functions f : TC into functions whose values are the Fourier coefficients maps L1(T) boundedly into (Z) and L2(T) into 2(Z). The Riesz–Thorin interpolation theorem now implies the following: where 1 ≤ p ≤ 2 and 1/p + 1/q = 1. This is the Hausdorff–Young inequality.

The Hausdorff–Young inequality can also be established for the Fourier transform on locally compact Abelian groups. The norm estimate of 1 is not optimal. See the main article for references.

Convolution operators

Let f be a fixed integrable function and let T be the operator of convolution with f, i.e., for each function g we have Tg = fg.

It follows from Fubini's theorem that T is bounded from L1 to L1 and it is trivial that it is bounded from L to L (both bounds are by ||f||1). Therefore the Riesz–Thorin theorem gives

We take this inequality and switch the role of the operator and the operand, or in other words, we think of S as the operator of convolution with g, and get that S is bounded from L1 to Lp. Further, since g is in Lp we get, in view of Hölder's inequality, that S is bounded from Lq to L, where again 1/p + 1/q = 1. So interpolating we get where the connection between p, r and s is

The Hilbert transform

The Hilbert transform of f : RC is given by where p.v. indicates the Cauchy principal value of the integral. The Hilbert transform is a Fourier multiplier operator with a particularly simple multiplier:

It follows from the Plancherel theorem that the Hilbert transform maps L2(R) boundedly into itself.

Nevertheless, the Hilbert transform is not bounded on L1(R) or L(R), and so we cannot use the Riesz–Thorin interpolation theorem directly. To see why we do not have these endpoint bounds, it suffices to compute the Hilbert transform of the simple functions 1(−1,1)(x) and 1(0,1)(x) − 1(0,1)(−x). We can show, however, that for all Schwartz functions f : RC, and this identity can be used in conjunction with the Cauchy–Schwarz inequality to show that the Hilbert transform maps L2n(Rd) boundedly into itself for all n ≥ 2. Interpolation now establishes the bound for all 2 ≤ p < ∞, and the self-adjointness of the Hilbert transform can be used to carry over these bounds to the 1 < p ≤ 2 case.

Comparison with the real interpolation method

While the Riesz–Thorin interpolation theorem and its variants are powerful tools that yield a clean estimate on the interpolated operator norms, they suffer from numerous defects: some minor, some more severe. Note first that the complex-analytic nature of the proof of the Riesz–Thorin interpolation theorem forces the scalar field to be C. For extended-real-valued functions, this restriction can be bypassed by redefining the function to be finite everywhere—possible, as every integrable function must be finite almost everywhere. A more serious disadvantage is that, in practice, many operators, such as the Hardy–Littlewood maximal operator and the Calderón–Zygmund operators, do not have good endpoint estimates.[7] In the case of the Hilbert transform in the previous section, we were able to bypass this problem by explicitly computing the norm estimates at several midway points. This is cumbersome and is often not possible in more general scenarios. Since many such operators satisfy the weak-type estimates real interpolation theorems such as the Marcinkiewicz interpolation theorem are better-suited for them. Furthermore, a good number of important operators, such as the Hardy-Littlewood maximal operator, are only sublinear. This is not a hindrance to applying real interpolation methods, but complex interpolation methods are ill-equipped to handle non-linear operators. On the other hand, real interpolation methods, compared to complex interpolation methods, tend to produce worse estimates on the intermediate operator norms and do not behave as well off the diagonal in the Riesz diagram. The off-diagonal versions of the Marcinkiewicz interpolation theorem require the formalism of Lorentz spaces and do not necessarily produce norm estimates on the Lp-spaces.

Mityagin's theorem

Summarize
Perspective

B. Mityagin extended the Riesz–Thorin theorem; this extension is formulated here in the special case of spaces of sequences with unconditional bases (cf. below).

Assume:

Then

for any unconditional Banach space of sequences X, that is, for any and any , .

The proof is based on the Krein–Milman theorem.

See also

Notes

References

Loading related searches...

Wikiwand - on

Seamless Wikipedia browsing. On steroids.