Again our starting point is a measure space \( (S, \ms S , \mu) \), so that \( S \) is a set, \( \ms S \) is a \( \sigma \)-algebra of subsets of \( S \), and \( \mu \) is a positive measure on \( \ms S \).
In the last section we defined the integral of certain measurable functions \( f: S \to \R \) with respect to the measure \( \mu \). Recall that the integral, denoted \( \int_S f \, d\mu \), may exist as a number in \( \R \cup \{-\infty, \infty\} \) (in which case \( f \) is integrable), and in particular may exist as a number in \(\R\) (in which case \( f \) is absolutely integrable). The integral may also fail to exist. Here is a review of how the definition is built up in stages:
Definition of the integral
So \(f: S \to \R\) is integrable if it is measurable and either \(\int_S f^+ \, d\mu \lt \infty\) or \(\int_S f^- \, d \mu \lt \infty\). If both are finite, equivalently \(\int_S |f| \, d\mu \lt \infty\), then \(f\) is absolutely integrable.
Consider a statement on the elements of \(S \), for example an equation or an inequality with \( x \in S \) as a free variable. (Technically such a statement is a predicate on \( S \).) For \( A \in \ms S \), we say that the statement holds on \( A \) if it is true for every \( x \in A \). We say that the statement holds almost everywhere on \( A \) (with respect to \( \mu \)) if there exists \( B \in \ms S \) with \( B \subseteq A \) such that the statement holds on \( B \) and \( \mu(A \setminus B) = 0 \).
A few properties of the integral that were essential to the motivation of the definition were given in the last section. In this section, we extend some of those properties and we study a number of new ones. As a review, here is what we know so far.
Properties of the integral
Parts (a) and (b) are the linearity properties; part (a) is the additivity property and part (b) is the scaling property. Parts (c) and (d) are the order properties; part (c) is the positive property and part (d) is the increasing property. Part (e) is a continuity property known as the monotone convergence theorem. Part (f) is the additive property for disjoint domains. Properties (a)–(e) hold with \( S \) replaced by \( A \in \ms S \).
Our first new results are extensions dealing with equality and order. The integral of a function over a null set is 0:
Suppose that \( f: S \to \R \) is measurable and \( A \in \ms S \) with \( \mu(A) = 0 \). Then \( \int_A f \, d\mu = 0 \).
The proof proceeds in stages via the definition of the integral.
Two functions that are indistinguishable from the point of view of \( \mu \) must have the same integral.
Suppose that \( f: S \to \R \) is integrable. If \( g: S \to \R \) is measurable and \( g = f \) almost everywhere on \( S \), then \( \int_S g \, d\mu = \int_S f \, d\mu \).
Note that \( g = f \) if and only if \( g^+ = f^+ \) and \( g^- = f^- \). Let \( A = \{x \in S: g^+(x) = f^+(x)\} \). Then \( A \in \ms S \) and \( \mu(A^c) = 0 \). Hence by the additivity property and , \[ \int_S g^+ \, d\mu = \int_A g^+ \, d\mu + \int_{A^c} g^+ \, d\mu = \int_A f^+ \, d\mu + 0 = \int_A f^+ \, d\mu + \int_{A^c} f^+ \, d\mu = \int_S f^+ \, d\mu \] Similarly \( \int_S g^- \, d\mu = \int_S f^- \, d\mu\). Hence the integral of \( g \) exists and \( \int_S g \, d\mu = \int_S f \, d\mu \)
Next we have a simple extension of the positive property.
Suppose that \( f: S \to \R \) is measurable and \( f \ge 0 \) almost everywhere on \( S \). Then
So, if \( f \ge 0 \) almost everywhere on \( S \) then \( \int_S f \, d\mu \gt 0 \) if and only if \( \mu\{x \in S: f(x) \gt 0\} \gt 0 \). The simple extension of the positive property in turn leads to a simple extension of the increasing property.
Suppose that \( f, \, g: S \to \R \) are integrable, and that \( f \le g \) almost everywhere on \( S \). Then
So if \( f \le g \) almost everywhere on \( S \) then, except in the two cases mentioned, \( \int_S f \, d\mu \lt \int_S g \, d\mu \) if and only if \( \mu\{x \in S: f(x) \lt g(x)\} \gt 0 \). The exclusion when both integrals are \( \infty \) or \( -\infty \) is important. A counterexample when this condition does not hold is given in . The next result is the absolute value inequality.
Suppose that \( f: S \to \R \) is integrable. Then \[ \left| \int_S f \, d\mu \right| \le \int_S \left|f \right| \, d\mu \] If \( f \) is absolutely integrable, then equality holds if and only if \( f \ge 0 \) almost everywhere on \( S \) or \( f \le 0 \) almost everywhere on \( S \).
First note that \( -\left|f\right| \le f \le \left|f\right| \) on \( S \). The integrals of all three functions exist, so the increasing property and scaling properties give \[ -\int_S \left|f\right| \, d\mu \le \int_S f \, d\mu \le \int_S \left|f \right| \, d\mu \] which is equivalent to the inequality above. If \( f \) is integrable, then by the increasing property , equality holds if and only if \( f = -\left|f\right| \) almost everywhere on \( S \) or \( f = \left|f\right| \) almost everywhere on \( S \). In the first case, \( f \le 0 \) almost everywhere on \( S \) and in the second case, \( f \ge 0 \) almost everywhere on \( S \).
Suppose that \( (T, \ms T) \) is another measurable space and that \( u: S \to T \) is measurable. By the change of variables theorem for positive measures, \( \nu \) defined by \[ \nu(B) = \mu\left[u^{-1}(B)\right], \quad B \in \ms T \] is a positive measure on \( (T, \ms T) \). The following result is known as the change of variables theorem for integrals:
\( f: T \to \R \) is integrable with respect to \(\nu\) if and only if \(f \circ u\) is integrable with respect to \(\mu\) and \[ \int_T f \, d\nu = \int_S (f \circ u) \, d\mu \]
We will show that if either of the integrals exist then they both do, and are equal. The proof is a classical bootstrapping argument that parallels the definition of the integral.
The change of variables theorem will look more familiar if we give the variables explicitly. Thus, suppose that we want to evaluate \[ \int_S f\left[u(x)\right] \, d\mu(x) \] where again, \( u: S \to T \) and \( f: T \to \R \). One way is to use the substitution \( u = u(x) \), find the new measure \( \nu \), and then evaluate \[ \int_T g(u) \, d\nu(u) \]
We start with a simple but important corollary of the monotone convergence theorem that extends the additivity property to a countably infinite sum of nonnegative functions.
Suppose that \( f_n: S \to [0, \infty) \) is measurable for \( n \in \N_+ \). Then \[ \int_S \sum_{n=1}^\infty f_n \, d\mu = \sum_{n=1}^\infty \int_S f_n \, d\mu \]
Let \( g_n = \sum_{i=1}^n f_i \) for \( n \in \N_+ \). Then \( g_n: S \to [0, \infty) \) is measurable and \( g_n \) is increasing in \( n \). Moreover, by definition, \( g_n \to \sum_{i=1}^\infty f_i \) as \( n \to \infty \). Hence by the monotone convergence theorem, \( \int_S g_n \, d\mu \to \int_S \sum_{i=1}^\infty f_i \, d\mu \) as \( n \to \infty \). But we know the additivity property holds for finite sums, so \(\int_S g_n \, d\mu = \sum_{i=1}^n \int_S f_i \, d\mu\) and again, by definition, this sum converges to \(\sum_{i=1}^\infty \int_S f_i \, d\mu\) as \( n \to \infty \).
Theorem below gives a related result that relaxes the assumption that \( f \) be nonnegative, but imposes a stricter integrability requirement. Our next result is the additivity of the integral over a countably infinite collection of disjoint domains.
Suppose that \( f: S \to \R \) is integrable, and that \( \{A_n: n \in \N_+\} \) is a disjoint collection of sets in \( \ms S \). Let \( A = \bigcup_{n=1}^\infty A_n \). Then \[ \int_A f \, d\mu = \sum_{n=1}^\infty \int_{A_n} f \, d\mu \]
Suppose first that \( f \) is nonnegative. Note that \( \bs{1}_A = \sum_{n=1}^\infty \bs{1}_{A_n} \) and hence \( \bs{1}_A f = \sum_{n=1}^\infty \bs{1}_{A_n} f \). Thus from , \[ \int_A f \, d\mu = \int_S \bs{1}_A f \, d\mu = \int_S \sum_{n=1}^\infty \bs{1}_{A_n} f \, d\mu = \sum_{n=1}^\infty \int_S \bs{1}_{A_n} f \, d\mu = \sum_{n=1}^\infty \int_{A_n} f \, d\mu \] Suppose now that \( f: S \to \R \) is measurable and \( \int_S f \, d\mu \) exists. Note that for \( B \in \ms S \), \( \left(\bs{1}_B f\right)^+ = \bs{1}_B f^+ \) and \( \left(\bs{1}_B f\right)^- = \bs{1}_B f^- \). Hence from the previous argument, \[ \int_A f^+ \, d\mu = \sum_{n=1}^\infty \int_{A_n} f^+ \, d\mu, \quad \int_A f^- \, d\mu = \sum_{n=1}^\infty \int_{A_n} f^- \, d\mu \] Both of these are sums of nonnegative terms, and one of the sums, at least, is finite. Hence we can group the terms to get \[ \int_A f \, d\mu = \int_A f^+ \, d\mu - \int_A f^- \, d\mu = \sum_{n=1}^\infty \int_{A_n} (f^+ - f^-) \, d\mu = \sum_{n=1}^\infty \int_{A_n} f \, d\mu \]
Of course, applies if \( f \) is nonnegative or if \( f \) is absolutely integrable. Next we give a minor extension of the monotone convergence theorem that relaxes the assumption that the functions be nonnegative.
Monotone Convergence Theorem. Suppose that \( f_n: S \to \R \) is integrable for each \( n \in \N_+ \) and that \( f_n \) is increasing in \( n \) on \( S \). If \( \int_S f_1 \, d\mu \gt -\infty \) then
\[ \int_S \lim_{n \to \infty} f_n \, d\mu = \lim_{n \to \infty} \int_S f_n \, d\mu \]
Let \( f(x) = \lim_{n \to \infty} f_n(x) \) for \( x \in S \) which exists in \( \R \cup \{\infty\} \) since \( f_n(x) \) is increasing in \( n \in \N_+ \). If \( \int_S f_1 \, d\mu = \infty \), then by the increasing property, \( \int_S f_n \, d\mu = \infty \) for all \( n \in \N_+ \) and \( \int_S f \, d\mu = \infty \), so the conclusion of the monotone convergence theorem trivially holds. Thus suppose that \( f_1 \) is integrable. Let \( g_n = f_n - f_1 \) for \( n \in \N \) and let \( g = f - f_1 \). Then \( g_n \) is nonnegative and increasing in \( n \) on \( S \), and \( g_n \to g \) as \( n \to \infty \) on \( S \). By the ordinary monotone convergence theorem, \( \int_S g_n \, d\mu \to \int_S g \, d\mu \) as \( n \to \infty \). But since \( \int_S f_1 \, d\mu \) is finite, \( \int_S g_n \, d\mu = \int_S f_n \, d\mu - \int_S f_1 \, d\mu \) and \( \int_S g \, d\mu = \int f \, d\mu - \int_S f_1 \, d\mu \). Again since \( \int_S f_1 \, d\mu \) is finite, it follows that \( \int_S f_n \, d\mu \to \int_S f \, d\mu \) as \( n \to \infty \).Details:
Here is the complementary result for decreasing functions.
Suppose that \( f_n: S \to \R \) is integrable for each \( n \in \N_+ \) and that \( f_n \) is decreasing in \( n \) on \( S \). If \( \int_S f_1 \, d\mu \lt \infty \) then \[ \int_S \lim_{n \to \infty} f_n \, d\mu = \lim_{n \to \infty} \int_S f_n \, d\mu \]
The functions \( -f_n \) for \( n \in \N_+ \) satisfy the hypotheses of the monotone convergence theorem for increasing functions and hence \(\int_S \lim_{n \to \infty} -f_n \, d\mu = \lim_{n \to \infty} -\int_S f_n \, d\mu \). By the scaling property, \( \int_S \lim_{n \to \infty} f_n \, d\mu = \lim_{n \to \infty} \int_S f_n \, d\mu \).
The additional assumptions on the integral of \( f_1 \) in the last two extensions of the monotone convergence theorem are necessary. A counterexample is given in .
Our next result is also a consequence of the montone convergence theorem, and is called Fatou's lemma in honor of Pierre Fatou. Its usefulness stems from the fact that no assumptions are placed on the integrand functions, except that they be nonnegative and measurable.
Fatou's Lemma. Suppose that \( f_n: S \to [0, \infty) \) is measurable for \( n \in \N_+ \). Then \[ \int_S \liminf_{n \to \infty} f_n \, d\mu \le \liminf_{n \to \infty} \int_S f_n \, d\mu \]
Let \( g_n = \inf\left\{f_k: k \in \{n, n + 1, \ldots \}\right\} \) for \( n \in \N_+ \). Then \( g_n: S \to [0, \infty) \) is measurable for \( n \in \N_+ \), \( g_n \) is increasing in \( n \), and by definition, \( \lim_{n \to \infty} g_n = \liminf_{n \to \infty} f_n \). By the monotone convergence theorem, \[ \int_S \liminf_{n \to \infty} f_n \, d\mu = \lim_{n \to \infty} \int_S g_n \, d\mu \] But \( g_n \le f_k \) on \( S \) for \( n \in \N_+ \) and \( k \in \{n, n + 1, \ldots\} \) so by the increasing property, \( \int_S g_n \, d\mu \le \int_S f_k \, d\mu\) for \( n \in \N_+ \) and \( k \in \{n, n + 1, \ldots\} \). Hence \( \int_S g_n \, d\mu \le \inf\left\{\int_S f_k \, d\mu: k \in \{n, n+1, \ldots\}\right\} \) for \( n \in \N_+ \) and therefore \[ \lim_{n \to \infty} \int_S g_n \, d\mu \le \liminf_{n \to \infty} \int_S f_n \, d\mu \]
Given the weakness of the hypotheses, it's hardly surprising that strict inequality can easily occur in Fatou's lemma. An example is given in .
Our next convergence result is one of the most important and is known as the dominated convergence theorem. It's sometimes also known as Lebesgue's dominated convergence theorem in honor of Henri Lebesgue, who first developed all of this stuff in the context of the Euclidean measure space\( (\R^n, \ms R^n, \lambda^n) \). The dominated convergence theorem gives a basic condition under which we may interchange the limit and integration operators.
Dominated Convergence Theorem. Suppose that \( f_n: S \to \R \) is measurable for \( n \in \N_+ \) and that \( \lim_{n \to \infty} f_n \) exists on \( S \). Suppose also that \( \left|f_n\right| \le g \) for \( n \in \N \) where \( g: S \to [0, \infty) \) is absolutely integrable. Then \[ \int_S \lim_{n \to \infty} f_n \, d\mu = \lim_{n \to \infty} \int_S f_n \, d\mu \]
First note that by the increasing property, \( \int_S \left|f_n\right| \, d\mu \le \int_S g \, d\mu \lt \infty \) and hence \( f_n \) is integrable for \( n \in \N_+ \). Let \( f = \lim_{n \to \infty} f_n \). Then \( f \) is measurable, and by the increasing property again, \( \int_S \left| f \right| \, d\mu \lt \int_S g \, d\mu \lt \infty \), so \( f \) is integrable.
Now for \( n \in \N_+ \), let \( u_n = \inf\left\{f_k: k \in \{n, n + 1, \ldots\}\right\} \) and let \( v_n = \sup\left\{f_k: k \in \{n, n + 1, \ldots\}\right\} \). Then \( u_n \le f_n \le v_n \) for \( n \in \N_+ \), \( u_n \) is increasing in \( n \), \( v_n \) is decreasing in \( n \), and \( u_n \to f \) and \( v_n \to f \) as \( n \to \infty \). Moreover, \( \int_S u_1 \, d\mu \ge - \int_S g \, d\mu \gt -\infty \) so by the version of the monotone convergence theorem above, \( \int_S u_n \, d\mu \to \int_S f \, d\mu \) as \( n \to \infty \). Similarly, \( \int_S v_1 \, d\mu \lt \int_S g \, d\mu \lt \infty \), so by the monotone convergence theorem , \( \int_S v_n \, d\mu \to \int_S f \, d\mu \) as \( n \to \infty \). But by the increasing property, \( \int_S u_n \, d\mu \le \int_S f_n \, d\mu \le \int_S v_n \, d\mu \) for \( n \in \N_+ \) so by the squeeze theorem for limits, \( \int_S f_n \, d\mu \to \int_S f \, d\mu \) as \( n \to \infty \).
As you might guess, the assumption that \( \left| f_n \right| \) is uniformly bounded in \( n \) by an absolutely integrable function is critical. A counterexample when this assumption is missing is given in . The dominated convergence theorem remains true if \( \lim_{n \to \infty} f_n \) exists almost everywhere on \( S \). The follow corollary of the dominated convergence theorem gives a condition for the interchange of infinite sum and integral.
Suppose that \( f_i: S \to \R \) is measurable for \( i \in \N_+ \) and that \( \sum_{i=1}^\infty \left| f_i \right| \) is absolutely integrable. then \[ \int_S \sum_{i=1}^\infty f_i \, d\mu = \sum_{i=1}^\infty \int_S f_i \, d\mu \]
The assumption that \( g = \sum_{i = 1}^\infty \left| f_i \right| \) is integrable implies that \( g \lt \infty \) almost everywhere on \( S \). In turn, this means that \( \sum_{i=1}^\infty f_i \) is absolutely convergent almost everywhere on \( S \). Let \( f(x) = \sum_{i=1}^\infty f_i(x) \) if \( g(x) \lt \infty \), and for completeness, let \( f(x) = 0 \) if \( g(x) = \infty \). Since only the integral of \( f \) appears in the theorem, it doesn't matter how we define \( f \) on the null set where \( g = \infty \). Now let \( g_n = \sum_{i=1}^n f_i \). Then \( g_n \to f \) as \( n \to \infty \) almost everywhere on \( S \) and \( \left| g_n \right| \le g \) on \( S \). Hence by the dominated convergence theorem , \( \int_S g_n \, d\mu \to \int_S f \, d\mu \) as \( n \to \infty \). But we know the additivity property holds for finite sums, so \( \int_S g_n \, d\mu = \sum_{i=1}^n \int_S f_i \, d\mu \), and in turn this converges to \( \sum_{i=1}^\infty \int_S f_i \, d\mu \) as \( n \to \infty \). Thus we have \( \sum_{i=1}^\infty \int_S f_i \, d\mu = \int_S f \, d\mu \).
The following corollary of the dominated convergence theorem is known as the bounded convergence theorem.
Bounded Convergence Theorem. Suppose that \( f_n: S \to \R \) is measurable for \( n \in \N_+ \) and there exists \( A \in \ms S \) such that \( \mu(A) \lt \infty \), \( \lim_{n \to \infty} f_n \) exists on \( A \), and \( \left| f_n \right| \) is bounded in \( n \in \N_+ \) on \( A \). Then \[ \int_A \lim_{n \to \infty} f_n \, d\mu = \lim_{n \to \infty} \int_A f_n \, d\mu \]
Suppose that \( \left|f_n\right| \) is bounded in \( n \) on \( A \) by \( c \in (0, \infty) \). The constant \( c \) is integrable on \( A \) since \( \int_A c \, d\mu = c \mu(A) \lt \infty \), and \( \left|f_n\right| \le c \) on \( A \) for \( n \in \N_+ \). Thus the result follows from the dominated convergence theorem .
Again, the bounded convergence remains true if \( \lim_{n \to \infty} f_n \) exists almost everywhere on \( A \). For a finite measure space (and in particular for a probability space), the condition that \( \mu(A) \lt \infty \) automatically holds.
Suppose now that \( (S, \ms S, \mu) \) and \( (T, \ms T, \nu) \) are \( \sigma \)-finite measure spaces. Recall the basic facts about the product \( \sigma \)-algebra \( \ms S \times \ms T \) of subsets of \( S \times T \), and the product measure \( \mu \times \nu \) on \( \ms S \times \ms T \). The product measure space \( (S \times T, \ms S \times \ms T, \mu \times \nu) \) is the standard one that we use for product spaces. If \( f: S \times T \to \R \) is measurable, there are three integrals we might consider. First, of course, is the integral of \( f \) with respect to the product measure \( \mu \times \nu \) \[ \int_{S \times T} f(x, y) \, d(\mu \times \nu)(x, y) \] sometimes called a double integral in this context. But also we have the nested or iterated integrals where we integrate with respect to one variable at a time: \[ \int_S \left(\int_T f(x, y) \, d\nu(y)\right) \, d\mu(x), \quad \int_T \left(\int_S f(x, y) d\mu(x)\right) \, d\nu(y)\] How are these integrals related? Well, just as in calculus with ordinary Riemann integrals, under mild conditions the three integrals are the same. The resulting important theorem is known as Fubini's Theorem in honor of the Italian mathematician Guido Fubini.
Fubini's Theorem. If \( f: S \times T \to \R \) is integrable with respect to \(\mu \times \nu\) then \[ \int_{S \times T} f(x, y) \, d(\mu \times \nu)(x, y) = \int_S \int_T f(x, y) \, d\nu(y) \, d\mu(x) = \int_T \int_S f(x, y) \, d\mu(x) \, d\nu(y) \]
We will show that \[ \int_{S \times T} f(x, y) \, d(\mu \times \nu)(x, y) = \int_S \int_T f(x, y) \, d\nu(y) \, d\mu(x) \] The proof with the other iterated integral is symmetric. The proof is a bootstrapping argument that proceeds in stages, paralleling the definition of the integral.
Of course, the double integral exists, and so Fubini's theorem applies, if either \( f \) is nonnegative or absolutely integrable with respect to \( \mu \times \nu \). When \( f \) is nonnegative, the result is sometimes called Tonelli's theorem in honor of another Italian mathematician, Leonida Tonelli. On the other hand, the iterated integrals may exist, and may be different, when \(f\) is not integrable with respect to \(\mu \times \nu\). A counterexample is given in and a second counterexample in .
A special case of Fubini's theorem (and indeed part of the proof) is that we can compute the measure of a set in the product space by integrating the cross-sectional measures.
If \( C \in \ms S \times \ms T \) then \[ (\mu \times \nu)(C) = \int_S \nu\left(C_x\right) \, d\mu(x) = \int_T \mu\left(C^y\right) \, d\nu(y) \] where \( C_x = \{y \in T: (x, y) \in C\} \) for \( x \in S \), and \( C^y = \{x \in S: (x, y) \in C\} \) for \( y \in T \).
In particular, if \( C, \; D \in \ms S \times \ms T \) have the property that \( \nu(C_x) = \nu(D_x) \) for all \( x \in S \), or \( \mu\left(C^y\right) = \mu\left(D^y\right) \) for all \( y \in T \) (that is, \( C \) and \( D \) have the same cross-sectional measures with respect to one of the variables), then \( (\mu \times \nu)(C) = (\mu \times \nu)(D) \). In \( \R^2 \) with area, and in \( \R^3 \) with volume (Lebesgue measure in both cases), this is known as Cavalieri's principle, named for Bonaventura Cavalieri, yet a third Italian mathematician. Clearly, Italian mathematicians cornered the market on theorems of this sort.
A simple corollary of Fubini's theorem is that the double integral of a product function over a product set is the product of the integrals. This result has important applications in probability to random variables that are independent, as we will see in the chapter on probability spaces.
Suppose that \( g: S \to \R \) and \( h: T \to \R \) are measurable, and either both are nonnegative or both are absolutely integrable. Then \[ \int_{S \times T} g(x) h(y) \, d(\mu \times \nu)(x, y) = \left(\int_S g(x) \, d\mu(x)\right) \left(\int_T h(y) \, d\nu(y)\right) \]
Define \(f: S \times T \to \R\) by \(f(x, y) = g(x) h(y)\) for \((x, y) \in S \times T\). Then \(f\) is measurable. If \(g\) and \(h\) are nonnegative, then so is \(f\) and hence \(f\) is integrable. If \(g\) and \(h\) are absolutely integrable then by Fubini's theorem applied to \(|f|\) we have \begin{align*} \int_{S \times T} |f(x, y)| \, d(\mu \times \nu)(x, y) & = \int_T\left(\int_S |g(x)| |h(y)| \, d\mu(x)\right) \, d\nu(y) = \int_T |h(y)| \left(\int_S |g(x)| \, d\mu(x)\right) \, d\nu(y) \\ & = \left(\int_S |g(x)| \, d\mu(x)\right) \left(\int_T |h(y)| \, d\nu(y)\right) \lt \infty \end{align*} Hence \(f\) is absolutely integrable. So in either case, applying Fubini's theorem to \(f\), with the same computations as above we have \begin{align*} \int_{S \times T} f(x, y) \, d(\mu \times \nu)(x, y) & = \int_T\left(\int_S g(x) h(y) \, d\mu(x)\right) \, d\nu(y) = \int_T h(y) \left(\int_S g(x) \, d\mu(x)\right) \, d\nu(y) \\ & = \left(\int_S g(x) \, d\mu(x)\right) \left(\int_T h(y) \, d\nu(y)\right) \end{align*}
Recall that a discrete measure space consists of a countable set with the \( \sigma \)-algebra of all subsets and with counting measure. In such a space, integrals are simply sums and so Fubini's theorem allows us to rearrange the order of summation in a double sum.
Suppose that \( I \) and \( J \) are countable and that \( a_{i j} \in \R \) for \( i \in I \) and \( j \in J \). If the sum of the positive terms or the sum of the negative terms is finite, then \[ \sum_{(i, j) \in I \times J} a_{i j} = \sum_{i \in I} \sum_{j \in J} a_{i j} = \sum_{j \in J} \sum_{i \in I} a_{i j} \]
Often \( I = J = \N_+ \), and in this case, \( a_{i j} \) can be viewed as an infinite array, with \( i \in \N_+ \) the row number and \( j \in \N_+ \) the column number:
\( a_{11} \) | \( a_{12} \) | \( a_{13} \) | \( \ldots \) |
\( a_{21} \) | \( a_{22} \) | \( a_{23} \) | \( \ldots \) |
\( a_{31} \) | \( a_{32} \) | \( a_{33} \) | \( \ldots \) |
\( \vdots \) | \( \vdots \) | \( \vdots \) | \( \vdots \) |
The significant point is that \( \N_+ \) is totally ordered. While there is no implied order of summation in the double sum \( \sum_{(i, j) \in \N_+^2} a_{i j} \), the iterated sum \( \sum_{i=1}^\infty \sum_{j=1}^\infty a_{i j} \) is obtained by summing over the rows in order and then summing the results by column in order, while the iterated sum \( \sum_{j=1}^\infty \sum_{i=1}^\infty a_{i j} \) is obtained by summing over the columns in order and then summing the results by row in order.
Of course, only one of the product spaces might be discrete. Propositions and on the interchange of sum and integral can be viewed as applications of Fubini's theorem, where one of the measure spaces is \( (S, \ms S, \mu) \) and the other is \( \N_+ \) with counting measure.
In the first three exercises below, \( (\R, \ms R, \lambda) \) is the standard one-dimensional Euclidean space, so \( \ms R \) is \( \sigma \)-algebra of Borel measurable sets and \( \lambda \) is Lebesgue measure.
Let \( f = \bs{1}_{[1, \infty)} \) and \( g = \bs{1}_{[0, \infty)} \). Show that
This example shows that the strict increasing property in can fail when the integrals are infinite.
Let \( f_n = \bs{1}_{[n, \infty)} \) for \( n \in \N_+ \). Show that
This example shows that the monotone convergence theorem can fail if the first integral is infinite. It also illustrates strict inequality in Fatou's lemma .
Let \( f_n = \bs{1}_{[n, n + 1]} \) for \( n \in \N_+ \). Show that
This example shows that the dominated convergence theorem can fail if \( \left|f_n\right| \) is not bounded by an integrable function. It also shows that strict inequality can hold in Fatou's lemma .
Consider the product space \( [0, 1]^2 \) with the usual Borel measurable subsets and Lebesgue measure. Let \( f: [0, 1]^2 \to \R \) be defined by \[ f(x, y) = \frac{x^2 - y^2}{(x^2 + y^2)^2} \] Show that
This example shows that Fubini's theorem can fail if the integrand is not integrable with respect to the product measure.
For \( i, \, j \in \N_+ \) define the sequence \( a_{i j} \) as follows: \( a_{i i} = 1 \) and \( a_{i + 1, i} = -1 \) for \( i \in \N_+ \), \( a_{i j} = 0 \) otherwise.
This example shows that the iterated sums can exist and be different when the double sum does not exist, a counterexample to corollary of Fubini's theorem, for sums when the hypotheses are not satisfied.
Compute \( \int_D f(x, y) \, d(x,y) \) in each case below for the given \( D \subseteq \R^2 \) and \( f: D \to \R \).
Integrals of the type in are useful in the study of exponential distributions.