In real analysis, the Heine–Borel theorem, named after Eduard Heine and Émile Borel, states:
For a subset <math>S</math> of Euclidean space <math>\mathbb{R}^n</math>, the following two statements are equivalent:
- <math>S</math> is compact, that is, every open cover of <math>S</math> has a finite subcover
- <math>S</math> is closed and bounded.
History and motivation
The history of what today is called the Heine–Borel theorem starts in the 19th century, with the search for solid foundations of real analysis. Central to the theory was the concept of uniform continuity and the theorem stating that every continuous function on a closed and bounded interval is uniformly continuous. Peter Gustav Lejeune Dirichlet was the first to prove this and implicitly he used the existence of a finite subcover of a given open cover of a closed interval in his proof. Later Eduard Heine, Karl Weierstrass and Salvatore Pincherle used similar techniques. Émile Borel in 1895 was the first to state and prove a form of what is now called the Heine–Borel theorem. His formulation was restricted to countable covers. Pierre Cousin (1895), Lebesgue (1898) and Schoenflies (1900) generalized it to arbitrary covers.
Proofs
Proof 1
If a set is compact, then it must be closed.
Let <math>S</math> be a subset of <math>\mathbb{R}^n</math>. Observe first the following: if <math>a</math> is a limit point of <math>S</math>, then any finite collection <math>C</math> of open sets, such that each open set <math>U\in C</math> is disjoint from some neighborhood <math>V_U</math> of <math>a</math>, fails to be a cover of <math>S</math>. Indeed, the intersection of the finite family of sets <math>V_U</math> is a neighborhood <math>W</math> of <math>a</math> in <math>\mathbb{R}^n</math>. Since <math>a</math> is a limit point of <math>S</math>, <math>W</math> must contain a point <math>x</math> in <math>S</math>. This <math>x\in S</math> is not covered by the family <math>C</math>, because every <math>U</math> in <math>C</math> is disjoint from <math>V_U</math> and hence disjoint from <math>W</math>, which contains <math>x</math>.
If <math>S</math> is compact but not closed, then it has a limit point <math>a\not\in S</math>. Consider a collection <math>C'</math> consisting of an open neighborhood <math>N(x)</math> for each <math>x\in S</math>, chosen small enough to not intersect some neighborhood <math>V_x</math> of <math>a</math>. Then <math>C'</math> is an open cover of <math>S</math>, but any finite subcollection of <math>C'</math> has the form of <math>C</math> discussed previously, and thus cannot be an open subcover of <math>S</math>. This contradicts the compactness of <math>S</math>. Hence, every limit point of <math>S</math> is in <math>S</math>, so <math>S</math> is closed.
The proof above applies with almost no change to showing that any compact subset <math>S</math> of a Hausdorff topological space <math>X</math> is closed in <math>X</math>.
If a set is compact, then it is bounded.
Let <math>S</math> be a compact set in <math>\mathbb{R}^n</math>, and <math>U_x</math> a ball of radius 1 centered at <math>x\in\mathbb{R}^n</math>. Then the set of all such balls centered at <math>x\in S</math> is clearly an open cover of <math>S</math>, since <math>\cup_{x\in S} U_x</math> contains all of <math>S</math>. Since <math>S</math> is compact, take a finite subcover of this cover. This subcover is the finite union of balls of radius 1. Consider all pairs of centers of these (finitely many) balls (of radius 1) and let <math>M</math> be the maximum of the distances between them. Then if <math>C_p</math> and <math>C_q</math> are the centers (respectively) of unit balls containing arbitrary <math>p,q\in S</math>, the triangle inequality says:
<math display="block">
d(p, q)\le d(p, C_p) + d(C_p, C_q) + d(C_q, q)\le 1 + M + 1 = M + 2.
</math>
So the diameter of <math>S</math> is bounded by <math>M+2</math>.
Lemma: A closed subset of a compact set is compact.
Let <math>K</math> be a closed subset of a compact set <math>T</math> in <math>\mathbb{R}^n</math> and let <math>C_K</math> be an open cover of <math>K</math>. Then <math>U=\mathbb{R}^n\setminus K</math> is an open set and
<math display="block"> C_T = C_K \cup \{U\} </math>
is an open cover of <math>T</math>. Since <math>T</math> is compact, then <math>C_T</math> has a finite subcover <math> C_T'</math>, that also covers the smaller set <math>K</math>. Since <math>U</math> does not contain any point of <math>K</math>, the set <math>K</math> is already covered by <math> C_K' = C_T' \setminus \{U\} </math>, that is a finite subcollection of the original collection <math>C_K</math>. It is thus possible to extract from any open cover <math>C_K</math> of <math>K</math> a finite subcover.
If a set is closed and bounded, then it is compact.
If a set <math>S</math> in <math>\mathbb{R}^n</math> is bounded, then it can be enclosed within an <math>n</math>-box
<math display="block"> T_0 = [-a, a]^n</math>
where <math>a>0</math>. By the lemma above, it is enough to show that <math>T_0</math> is compact.
Assume, by way of contradiction, that <math>T_0</math> is not compact. Then there exists an infinite open cover <math>C</math> of <math>T_0</math> that does not admit any finite subcover. Through bisection of each of the sides of <math>T_0</math>, the box <math>T_0</math> can be broken up into <math>2^n</math> sub <math>n</math>-boxes, each of which has diameter equal to half the diameter of <math>T_0</math>. Then at least one of the <math>2^n</math> sections of <math>T_0</math> must require an infinite subcover of <math>C</math>, otherwise <math>C</math> itself would have a finite subcover, by uniting together the finite covers of the sections. Call this section <math>T_1</math>.
Likewise, the sides of <math>T_1</math> can be bisected, yielding <math>2^n</math> sections of <math>T_1</math>, at least one of which must require an infinite subcover of <math>C</math>. Continuing in like manner yields a decreasing sequence of nested <math>n</math>-boxes:
<math display="block"> T_0 \supset T_1 \supset T_2 \supset \ldots \supset T_k \supset \ldots </math>
where the side length of <math>T_k</math> is <math>(2a)/2^k</math>, which tends to 0 as <math>k</math> tends to infinity. Let us define a sequence <math>(x_k)</math> such that each <math>x_k</math> is in <math>T_k</math>. This sequence is Cauchy, so it must converge to some limit <math>L</math>. Since each <math>T_k</math> is closed, and for each <math>k</math> the sequence <math>(x_k)</math> is eventually always inside <math>T_k</math>, we see that <math>L\in T_k</math> for each <math>k</math>.
Since <math>C</math> covers <math>T_0</math>, then it has some member <math>U\in C</math> such that <math>L\in U</math>. Since <math>U</math> is open, there is an <math>n</math>-ball <math>B(L)\subseteq U</math>. For large enough <math>k</math>, one has <math>T_k\subseteq B(L)\subseteq U</math>, but then the infinite number of members of <math>C</math> needed to cover <math>T_k</math> can be replaced by just one: <math>U</math>, a contradiction.
Thus, <math>T_0</math> is compact. Since <math>S</math> is closed and a subset of the compact set <math>T_0</math>, then <math>S</math> is also compact (see the lemma above).
Proof without Choice
The theorem can be proved without the use of the axiom of choice. First we prove
Proof: Let <math>\mathcal{U}</math> be an open cover of <math>I</math>. Consider the set
:<math>E = \{ x \in [0, 1] \mid [0, x] \text{ is covered by a finite subcover of } \, \mathcal{U} \}.</math>
It is nonempty and bounded. Thus, <math>b := \sup(E)</math> exists as a finite number. We claim <math>b = 1</math>. Suppose otherwise; then <math>b < 1</math>. Since <math>\mathcal{U}</math> is a cover, <math>b</math> is in some <math>U \in \mathcal{U}</math>. By the definition of sup, there is some <math>x</math> in <math>E</math> with <math>[x, b] \subset U</math>. Then for <math>b' > b</math> close to <math>b</math>,
:<math>[0, b'] = [0, x] \cup [x, b']</math>
is covered by a finite subcover of <math>\mathcal{U}</math>; i.e., <math>b' \in E</math>, contradicting that <math>b</math> is an upper bound of <math>E</math>. Hence, <math>b = 1</math>. Then, by a similar argument, <math>b = 1</math> is in <math>E</math>. <math>\square</math>
The theorem now follows easily. Indeed, by Tychonoff's theorem for finite products, the standard cube <math>I^n</math> is compact (the full version of the theorem is equivalent to the axiom of choice, but the theorem for finite products is much easier and does not use Choice.) Since cubes are homeomorphic to each other, arbitrary cubes are compact and each closed bounded set is contained in some cube; thus, compact. Finally, it is not hard to see the converse (a compact set in <math>\mathbb{R}^n</math> is closed and bounded) without the axiom of choice. <math>\square</math>
We also have, still not assuming the axiom choice:
Indeed, "compact" implies "sequentially compact", without Choice, as follows. Given a sequence <math>x_i</math> in a compact set in <math>\mathbb{R}^n</math>, take <math>E_i = \overline{\{ x_j \mid j \ge i \</math>. The family <math>E_i, i \ge 1</math> has the finite intersection property and so has nonempty intersection by compactness. Let <math>x \in \cap_i E_i</math>. Then let <math>U_m</math> be the open ball with center at <math>x</math> and of radius <math>1/m</math>. Since <math>U_1</math> intersects <math>\{ x_j \mid j \ge 1 \}</math>, let <math>i_1</math> be the least integer such that <math>x_{i_1}</math> is in <math>U_1</math>, possible since <math>\mathbb{N}</math> is well-ordered. Then let <math>i_2</math> be the least integer such that <math>i_2 > i_1</math> and <math>x_{i_2}</math> is in <math>U_2</math>, and so on. The sequence <math>x_{i_k}</math> is a convergent subsequence then. <math>\square</math>
Note the above version implies the Bolzano–Weierstrass theorem, that a bounded sequence in <math>\mathbb{R}^n</math> has a convergent sequence.
With the axiom of choice, the converse of the above theorem holds. Indeed, clearly, "sequentially compact" implies "closed". Also, if the set is unbounded, then clearly we can construct a sequence <math>x_i</math> in the set such that <math>|x_i - x_j| \ge 1</math> for <math>j < i</math>; in particular, the sequence has no convergent sequence. This step uses the axiom of choice (or more precisely countable choice).
Without the axiom of choice, the converse of the above theorem can fail; in facf, there is a model of ZF in which <math>\mathbb{R}</math> has a sequentially compact subset that is neither closed nor bounded.
Generalization
The Heine–Borel theorem is somewhat subsumed in the following more general result.
The usual Heine–Borel theorem follows from the above since a set in <math>\mathbb{R}^n</math> is bounded if and only if it is totally bounded.
In general, given a sequence <math>x_n</math>, each point in the intersection
:<math>\cap_n \overline{ \{ x_m \mid m \ge n \} }</math>
is called a cluster point. In other words, a point is a cluster point of <math>x_n</math> if each neighborhood of that point contains infinitely many (possibly repeated) terms in the sequence. For a metric space, a sequence has a convergent subsequence if and only if it has a cluster point. Thus, the statement (2) above is equivalent to saying each sequence has a cluster point.
Proof: (1) <math>\Rightarrow</math> (2) clear (cf. ). (2) <math>\Rightarrow</math> (3): if <math>S \subset X</math> is an infinite subset, by the axiom of choice, it contains a countable subset whose limit points are among the limit points of <math>S</math>. Thus, we can assume <math>S = \{ x_n \mid n \in \mathbb{N} \}</math> is countable. Then, by (2), <math>x_n</math> has a cluster point, which is a limit point of <math>S</math>.
(3) <math>\Rightarrow</math> (4): First, <math>X</math> is complete since a Cauchy sequence has a limit by (3). Next, assume <math>X</math> is not totally bounded; i.e., there is some <math>\epsilon > 0</math> with the property that no finite number of open balls of radius <math>\epsilon</math> covers <math>X</math>. Then, recursively, choose a sequence <math>x_n</math> such that
:<math>x_n \not\in B(x_1, \epsilon) \cup \cdots \cup B(x_{n-1}, \epsilon)</math>.
Then the set <math>S = \{ x_n \mid n \}</math> is infinite and has no limit point.
(4) <math>\Rightarrow</math> (1) is by adapting an argument of Bourbaki. We shall show a family <math>F</math> of closed subsets of <math>X</math> with the finite intersection property (i.e., each finite subset of <math>F</math> has nonempty intersection) has nonempty intersection. Consider the set of all families of subsets of <math>X</math> with the finite intersection property, ordered by set inclusion. Clearly, the hypothesis of Zorn's lemma is satisfied and so it has a maximal element <math>M</math> that contains <math>F</math>. We note <math>M</math> has the property
- If <math>X = S_1 \cup \cdots \cup S_n</math> for some sets <math>S_j</math>'s, then <math>S_j \in M</math> for some <math>j</math>.
Indeed, suppose no <math>S_j</math> is in <math>M</math>. Since each <math>M \cup \{ S_j \}</math> does not have the finite intersection property, we have <math>T_j \cap S_j = \emptyset</math> for some intersections <math>T_j</math> of finite subsets of <math>M</math>. Then
:<math>\emptyset \ne \cap_j T_j \subset \cap_j (X - S_j)</math>,
meaning <math>S_j</math>'s do not cover <math>X</math>.
Now, since <math>X</math> is totally bounded, for each integer <math>n > 0</math>, <math>M</math> contains an open ball <math>B_n</math> of radius <math>1/n</math> by the above property. For each <math>n</math>, let <math>x_n</math> be in <math>B_1 \cap \cdots \cap B_n</math>, which is possible by the finite intersection property. For <math>m \ge n</math>, <math>x_m, x_n</math> are both in <math>B_n</math>. Hence, the sequence <math>x_n</math> is Cauchy and, by completeness, it converges to a limit <math>x</math>. For each <math>\epsilon > 0</math>, we have:
:<math>B_n \subset B(x, \epsilon)</math>
if <math>n</math> is large enough. Indeed, for <math>y</math> in <math>B_n</math>,
:<math>d(y, x) \le d(y, x_n) + d(x_n, x) < 2/n + d(x_n, x) < \epsilon</math>
if <math>n</math> is large. Hence, for such <math>n</math>, <math>\emptyset \neq B_n \cap S \subset B(x, \epsilon) \cap S</math> for <math>S</math> in <math>M</math>. That is, <math>x</math> is in <math>\overline{S}</math> for each <math>S</math> in <math>M</math>. A fortiori, <math>x \in \cap F</math> then. <math>\square</math>
The proof of (4) <math>\Rightarrow</math> (1) above in fact shows the following more general result for uniform spaces; indeed, Bourbaki's original argument was for uniform spaces.
Heine–Borel property
The Heine–Borel theorem does not hold as stated for general metric and topological vector spaces, and this gives rise to the necessity to consider special classes of spaces where this proposition is true. These spaces are said to have the Heine–Borel property.
In the theory of metric spaces
A metric space <math>(X,d)</math> is said to have the Heine–Borel property if each closed, bounded set in <math>X</math> is compact.
Many metric spaces fail to have the Heine–Borel property, such as the metric space of rational numbers (or indeed any incomplete metric space). Complete metric spaces may also fail to have the property; for instance, no infinite-dimensional Banach spaces have the Heine–Borel property (as metric spaces). Even more trivially, if the real line is not endowed with the usual metric, it may fail to have the Heine–Borel property.
A metric space <math>(X,d)</math> has a Heine–Borel metric which is Cauchy locally identical to <math>d</math> if and only if it is complete, <math>\sigma</math>-compact, and locally compact.
In the theory of topological vector spaces
A topological vector space <math>X</math> is said to have the Heine–Borel property (R.E. Edwards uses the term boundedly compact space) if each closed bounded set in <math>X</math> is compact. No infinite-dimensional Banach spaces have the Heine–Borel property (as topological vector spaces). But some infinite-dimensional Fréchet spaces do have, for instance, the space <math>C^\infty(\Omega)</math> of smooth functions on an open set <math>\Omega\subset\mathbb{R}^n</math> and the space <math>H(\Omega)</math> of holomorphic functions on an open set <math>\Omega\subset\mathbb{C}^n</math>. More generally, any quasi-complete nuclear space has the Heine–Borel property. All Montel spaces have the Heine–Borel property as well.
See also
- Bolzano–Weierstrass theorem
Notes
References
- BookOfProofs: Heine-Borel Property
External links
- Mathworld "Heine-Borel Theorem"
- "An Analysis of the First Proofs of the Heine-Borel Theorem - Lebesgue's Proof"
