Geometry of the Hausdorff and Gromov-Hausdorff distances


154 46 992KB

English Pages [97] Year 2019

Report DMCA / Copyright

DOWNLOAD PDF FILE

Table of contents :
1 Elements of general topology
2 Introduction to Metric Spaces
3 Curves in Metric Spaces
4 Extreme graphs and networks
5 Hausdorff distance
6 Gromov–Hausdorff Distance
7 Gromov–Hausdorff space
8 Calculating GH-distances to simplexes and some applications
Recommend Papers

Geometry of the Hausdorff and Gromov-Hausdorff distances

  • 0 0 0
  • Like this paper and download? You can publish your own PDF file online for free in a few minutes! Sign Up
File loading please wait...
Citation preview

Chapter 1

Elements of general topology. Schedule. Definition of topology and topological space, induced topology, subspace of topological space, discrete and anti-discrete topologies, metric spaces and metric topology, standard topology on Euclidean space, base of topology, cover of set and subset, Zariski topology, Sorgenfrey topology, subbase of topology, disjoint union of topological spaces, Cartesian product of topological spaces, Tychonoff or product topology, quotient topology and quotient space, Vietoris topology, continuous mapping, homeomorphism, embedding, convergence of sequences, closure, interior, boundary, dense subsets, separability, separated or Hausdorff topological space, connected and disconnected topological spaces, connected components, path-connected topological spaces, open cover, subcover, compact and sequentially compact topological spaces, bounded metric spaces, hyperspaces.

In this chapter we present an introduction to general topology.

1.1

Basic facts of general topology

For a set X, let 2X denote the collection of all subsets of X. If 𝒜 ⊂ 2X is a family of subsets of X, then ∪𝒜 and ∩𝒜 denote the union and the intersection of the elements from 𝒜, respectively. If 𝒜 is an indexed family, i.e., 𝒜 = {Ai }i∈I , then we use ∪i∈I Ai and ∩i∈I Ai for the union and the intersection. If different elements of 𝒜 do not intersect each other (such family 𝒜 is called disjoint), then we write ⊔𝒜 instead of ∪𝒜, to emphasize that 𝒜 is disjoint; similarly, we write ⊔i∈I Ai instead of ∪i∈I Ai for indexed families. We can define ⊔i∈I Ai also in the case when some differen Ai intersect each other, in particular, when they coincide. In this situation we simply consider Ai for different i as nonintersecting { sets.} This can be done in a formal way if we change the elements ai ∈ Ai to (ai , i) and identify Ai with the set (ai , i) a ∈A . i

i

Definition 1.1. A set τ = {Uα }α∈A ⊂ 2X is called a topology on X if τ satisfies the following properties: (1) ∅, X ∈ τ ; (2) for any 𝒜 ⊂ τ we have ∪𝒜 ∈ τ (the union of arbitrary collection of elements from τ belongs to τ ); (3) for any finite 𝒜 ⊂ τ it holds ∩𝒜 ∈ τ (the intersection of arbitrary finite collection of elements from τ belongs to τ ). Definition 1.2. A set X with a given topology τ ⊂ 2X is called a topological space. It is also convenient to denote the topological space X as the pair (X, τ ). Also, speaking about the topological space X, we will often denote the topology defined on it by τX , without specifically mentioning it every time. The elements of X are usually called points, and the elements of τ are called open sets. A set F ⊂ X is called closed if its complement is open. Problem 1.1. Show that the family of all closed subsets of a topological space X contains ∅ and X, and that the intersection of any collection of closed subsets, as well as the union of any finite collection of closed subsets are some closed sets. Take an arbitrary Y ⊂ X and consider the family τY := {U ∩ Y : U ∈ τX }. Problem 1.2. Prove that τY is a topology on Y . Definition 1.3. The τY is called the topology on Y induced from X. The Y with the topology τY is usually called a subspace of the topological space X. 1

1.2. Base and subbase

2

Generally speaking, there are many different topologies on each set X, and the inclusion relation generates a partial order on the set of all such topologies. The smallest topology in this order consists of two elements: τa = {∅, X}. It is called anti-discrete. The largest topology consists of all subsets: τd = 2X . It is called discrete. If 𝒯 is a collection of topologies defined on the same set X, then ∩𝒯 is a topology as well; it is smaller than each topology τ ∈ 𝒯 ; for any topology τ ′ on X that is smaller than all topologies from 𝒯 it holds τ ′ ⊂ ∩𝒯 . The most important for us example of topology will be generated by metrics. Namely, suppose that for a set X a function ρ : X × X → ℝ is given, which has the following properties: (1) for any x, y ∈ X it holds ρ(x, y) ≥ 0, and ρ(x, y) = 0 iff x = y (positive definiteness); (2) for any x, y ∈ X we have ρ(x, y) = ρ(y, x) (symmetry); (3) for any x, y, z ∈ X it holds ρ(x, y) + ρ(y, z) ≥ ρ(x, z) (triangle inequality). Then ρ is called a metric, and the set X with the metric ρ is called a metric space. It is also convenient to denote the metric space X as the pair (X, ρ). Each Y ⊂ X endowed with the restriction of ρ to Y × Y is called a subspace of X. Example 1.4. In calculus, the standard example of a metric space is the arithmetic ℝn with the Euclidean √∑space n 2 metric defined on it: for x = (x1 , . . . , xn ) and y = (y1 , . . . , yn ) it holds ρ(x, y) = i=1 (xi − yi ) . This metric is called Euclidean. We will also call Euclidean the arithmetic space itself, endowed with the Euclidean metric. Let X be a metric space with a metric ρ. For every x ∈ X and r > 0 we put { } Ur (x) := y ∈ X : ρ(x, y) < r and call an open ball of radius r > 0 and center x. Using the metric ρ, we construct the natural topology τρ , called the metric topology: we assign the subset U ⊂ X to open sets of the metric topology τρ if and only if U is either empty, or U is a union of open balls. Equivalent definition: U ∈ τρ if and only if for any point x ∈ U there exists r > 0 such that Ur (x) ⊂ U (together with each point the set U contains some open ball with the center at this point). Problem 1.3. Prove that the family τρ is a topology. Remark 1.5. Unless otherwise stated, on the real line ℝ and, more generally, on the arithmetic space ℝn , we consider the topologies generated by the Euclidean metric (see. Example 1.4). This topology is called standard.

1.2

Base and subbase

The construction of metric topology described above leads to the following important notion. Similar to linear algebra, where to describe a linear space it is enough to choose a family of vectors that, using linear combinations, generates the whole space, to define a topology, one can also select a subfamily of open sets and generate the topology by means of set-theoretic operations. Definition 1.6. A family β ⊂ τ is called a base of the topology τ if every nonempty open set U ∈ τ is representable as a union of some elements from β. Thus, by the definition of metric topology, its base is the family of all open balls. Remark 1.7. Note that a given topology can have many different bases. For example, not all balls can be selected as the base of the metric space, but only, say, of radii not exceeding 1, or of only rational radii, or of only radii of the form 1/n, etc. On the Euclidean line, for example, only rational numbers can be selected as centers. We note two important properties of the topology base on the set X: (1) each point x ∈ X is contained in some element from the base (otherwise the set X cannot be obtained as the union of some elements from the base); (2) a nonempty intersection of any two elements of the base is representable as the union of some elements from the base (otherwise this intersection will not belong to the topology). It turns out that these two properties completely characterize the families that are the bases of some topologies. Before formulating the corresponding criterion, we introduce a definition of cover, which will be useful to us both here and hereinafter.

1.3. Standard constructions of topologies

3

Definition 1.8. A family 𝒜 ⊂ 2X is called a cover of the set X if X = ∪𝒜. A family 𝒜 ⊂ 2X is called a cover of Y ⊂ X if Y ⊂ ∪𝒜. It is clear that each base of a topological space X is a cover of X. Problem 1.4. Prove that a family β ⊂ 2X is a base of some topology τ on X if and only if β is a cover of X, and for any intersecting B1 , B2 ∈ β their intersection B1 ∩ B2 is the union of some elements from β. Moreover, each family satisfying these properties, generates a unique topology. Notice that a collection β of open sets in a topological space X which satisfies the conditions of Problem 1.4 may generate a topology τ different from τX . What do we need to add for to be sure that τ = τX ? The answer can be obtained from the following more general result that is often used in proving the coincidence of topologies. Problem 1.5. Let some topologies τ1 and τ2 with bases β1 and β2 be given on a set X. Then τ1 = τ2 if and only if for any x ∈ X the following condition is fulfilled: for any B2 ∈ β2 , x ∈ B2 , there is B1 ∈ β1 for which x ∈ B1 ⊂ B2 , and vice versa, for any B1 ∈ β1 , x ∈ B1 , there exists B2 ∈ β2 for which x ∈ B2 ⊂ B1 . In particular, for a topological space X, a collection β of open sets satisfying the conditions of Problem 1.4 is a base of the topology τX if and only if for each open set U ∈ τX and any point x ∈ U there exists some B ∈ β such that x ∈ B ⊂ U . Recall that two sets are called equivalent if there exists a bijection between them. The equivalence classes of sets are called cardinalities or cardinal numbers. The cardinality of a set X will be denoted by #X. Example 1.9. Let X be an infinite set of cardinality n, and m be an infinite cardinal number, with m ≤ n. Consider a family ℱ of all F ⊂ X such that #F < m, and let βm = {X \ F : F ∈ ℱ}. Then βm is a base of some topology τ , which we call the Zariski topology of the weight m. Problem 1.6. Prove that the family βm from Example 1.9 is a base of some topology. Example 1.10. As we noted above, on the Euclidean line we can take the family of all intervals as the base of the standard topology. Another interesting example of topology is obtained if, instead of intervals, we take all possible half-intervals of the form [a, b). The corresponding topology is called the arrow topology or the Sorgenfrey topology. Remark 1.11. The Sorgenfrey topology contains the standard topology of the line, since each interval (a, b) can be represented as a union of half-intervals [a + 1/n, b), n ∈ ℕ. Problem 1.7. Show that the collection of all possible half-intervals of the form [a, b) ⊂ ℝ form a base of some topology that contains the standard topology. If, to generate a topology, we allowed also to use finite intersections, then the generating family can, generally speaking, be reduced. Definition 1.12. A family σ ⊂ τ is called a subbase of the topology τ if the set of all finite intersections of elements from σ forms a base of the topology τ . It is clear that, like the base, each subbase of a topological space X is a cover of X. Problem 1.8. Prove that a family σ ⊂ 2X is a subbase of some topology on X if and only if σ is a cover of X. Moreover, each cover of X generates a unique topology. Example 1.13. The family of all subsets of the real line ℝ, each of which is an open ray, forms a subbase of the standard topology and is not a base of this topology.

1.3

Standard constructions of topologies

This section provides examples of standard constructions that allow to build new examples of topological spaces from existing ones. Construction 1.1. Let σ be an arbitrary family of subsets of a set X, and 𝒯σ be the family of all topologies on X containing σ. Then τ := ∩𝒯σ is the smallest topology containing σ. If σ is a cover of X, then σ is a subbase of τ . Construction 1.2 (Disjoint union). Let 𝒜 = {Xi }i∈I be a family of topological spaces. We define a topology on ⊔𝒜 = ⊔i∈I Xi , setting its base to be equal to ⊔i∈I τXi . The set ⊔i∈I Xi with the corresponding topology is called the disjoint union of the topological spaces Xi .

1.4. Continuous mappings

4

Construction 1.3 (Cartesian product). Let 𝒜 = {Xi }i∈I be an arbitrary family of topological spaces. The set of all mappings w : I →∏⊔i∈I X∏ i such that w(i) ∈ Xi for every i ∈ I is called the Cartesian product of the spaces Xi and is denoted by W := 𝒜 = i∈I Xi . In particular, if all Xi are equal to the same space X, then W = X I , where the latter, recall, denotes the set of all mappings from I to X. For ∏ convenience, we will often write wi instead of w(i), and we will call this value the i-th coordinate of the point w ∈ i∈I Xi . ∏ We define a topology on W , choosing as its subbase the family ∏ of all products i∈I Ui , Ui ∈ τXi , in which only one Ui can differ from Xi . The corresponding base consists of i∈I Ui , Ui ∈ τXi , in which only a finite number of Ui can differ from Xi . This topology is called the product topology or the Tychonoff topology. In the case when I is a finite set, say, I = {1, . . . , n}, then the Cartesian product of the spaces Xi is denoted by X1 × · · · × Xn . In particular, in this way one can define the standard topology on the n-dimensional arithmetic space ℝn . Problem 1.9. Show that the standard topology of the Euclidean space ℝn coincides with the topology of the Cartesian product ℝ × · · · × ℝ of real lines endowed with the standard topology. Example 1.14. Sorgenfrey space is the Cartesian product of the Sorgenfrey lines from Example 1.10 and is used in general topology to illustrate numerous exotic possibilities. Construction 1.4 (Quotient topology). Let X be a topological space, and ν be some equivalence relation on X. Denote by X/ν the set of classes of this equivalence. For each x ∈ X denote by [x] ∈ X/ν the ν-equivalence class containing x, and let π : X → X/ν, π : x ⊦→ [x], be the canonical projection. Then the family of all U ⊂ X/ν such that π −1 (U ) ∈ τX forms a topology called the quotient topology. The set X/ν endowed with the quotient topology is called the quotient space. Construction 1.5 (Vietoris topology). Let X be an arbitrary topological space. For each finite collection of open sets U1 , . . . , Un ∈ τX we put ⟨U1 , . . . , Un ⟩ = {Y ⊂ X : Y ⊂ ∪ni=1 Ui and Y ∩ Ui ̸= ∅ for all i = 1, . . . , n}. Note that if at least one of Ui is empty, then ⟨U1 , . . . , Un ⟩ = ∅. Problem 1.10. Show that the families { } { } { } σ = ⟨U ⟩ : U ∈ τX ∪ ⟨X, U ⟩ : U ∈ τX and β = ⟨U1 , . . . , Un ⟩ : U1 , . . . , Un ∈ τX form respectively a subbase and the corresponding base of some topology on 2X . The topology on 2X defined in Problem 1.10 is called the Vietoris topology. Remark 1.15. Usually, Vietoris topology is defined on the family of all nonempty closed subsets of a topological space. Problem 1.11. Prove that each construction of these section provides a topology.

1.4

Continuous mappings

As a rule, all considered mappings between topological spaces are consistent with topologies. Such mappings are called continuous. We give three equivalent definitions of continuity. Definition 1.16. A neighborhood of a point x ∈ X of a topological space X is an arbitrary open set U ⊂ X containing x. A neighborhood of a subset Z of a topological space X is an arbitrary open set U ⊂ X containing Z. Remark 1.17. For convenience, we denote an arbitrary neighborhood of a point x ∈ X by U x . Definition 1.18. A mapping f : X → Y between topological spaces is continuous at x ∈ X if for any neighborhood U f (x) there exists a neighborhood U x such that f (U x ) ⊂ U f (x) . The mapping f , continuous at all points x ∈ X, is called continuous. Definition 1.19. A mapping f : X → Y between topological spaces is continuous if for any open set U ⊂ Y its preimage f −1 (U ) ⊂ X is open (the preimage of each open set is open).

1.5. Homeomorphisms, embeddings

5

Definition 1.20. A mapping f : X → Y between topological spaces is continuous if for any closed set F ⊂ Y its preimage f −1 (F ) ⊂ X is closed (the preimage of each closed set is closed). Problem 1.12. Prove that the definitions 1.18, 1.19, and 1.20 are equivalent. Problem 1.13. Let f : X → Y be a mapping of topological spaces and σ a subbase of the topology on the space Y . Prove that f is continuous if and only if f -preimage of each element from the subbase σ is open in X. Remark 1.21. When we speak of the continuity of a function f : X → ℝ or, more generally, of a vector-valued mapping f : X → ℝn from a topological space X, then, unless otherwise stated, on ℝ and ℝn we consider the standard topologies (see. Remark 1.5). Problem 1.14. Show that the identity mapping and the composition of continuous mappings are continuous. Problem 1.15. Let X be a topological space, and Z ⊂ X be its subspace. Show that the inclusion mapping i : Z → X, i(z) = z for each point z ∈ Z, is continuous. Problem 1.16. Let X, Y be topological spaces, W ⊂ Y be a subspace of Y , and f : X → W be a continuous mapping. Let g : X → Y be a mapping coinciding with f : for each x ∈ X it holds f (x) = g(x). Prove that the mapping g is continuous. Let f : X → Y be an arbitrary mapping of sets. Choose arbitrary subsets Z ⊂ X and W ⊂ Y such that f (Z) ⊂ W . Then the restriction f ∣Z,W of the mapping f to Z and W is the mapping g : Z → W that coincides on the domain with the mapping f , i.e., for any x ∈ Z it holds f (x) = g(x). Problem 1.17. Let f : X → Y be a continuous mapping of topological spaces, Z ⊂ X, W ⊂ Y , f (Z) ⊂ W . Then the restriction f ∣Z,W : Z → W is also continuous as the mapping of the topological spaces Z and W with the topologies induced on them from X and Y , respectively. Problem 1.18. Let {Xi }i∈I be a cover of a topological space X by open subsets Xi , and f : X → Y a mapping to a topological space Y . Show that f is continuous if and only if all the restrictions f ∣Xi are continuous. In particular, this holds when X = ⊔i∈I Xi is the disjoint union of some topological spaces. Will this result remain true if we replace {Xi } with a cover of X by arbitrary sets? Problem 1.19. Let X = ⊔i∈I Xi be the disjoint union of some topological spaces and f : X → Y be a map into a topological space Y . Show that f is continuous if and only if all its restrictions f ∣Xi are continuous. ∏ Problem 1.20. Let {Xi }i∈I be a family of topological spaces and X = i∈I Xi . We define the canonical projection πi : X → Xi by setting πi (x) = xi . Prove that the product topology on X is the smallest of those topologies in which all the projections πi are continuous. ∏ Problem 1.21. Let {Yi }i∈I be a family of topological spaces, ∏ Y = i∈I Yi , and fi : X → Yi be mappings from some topological space X. We construct the mapping F := i∈I fi : X → Y by associating with each point x ∈ X the element y ∈ Y defined as follows: yi = fi (x). Prove that the mapping F is continuous if and only if all fi are continuous. Problem 1.22. Let A ⊂ ℝn be an arbitrary subset, (x1 , . . . , xn ) the Cartesian coordinates on ℝn , f : A → ℝm a continuous mapping, (y 1 , . . . , y m ) the Cartesian coordinates on ℝm , and y i = y i (x1 , . . . , xn ) the coordinate functions of the mapping f . Prove that the mapping f is continuous if and only if all the coordinate functions y i = y i (x1 , . . . , xn ) are continuous. Problem 1.23. Describe all continuous functions on a topological space with Zariski topology.

1.5

Homeomorphisms, embeddings

An important particular case of continuous mapping is a homeomorphism. Definition 1.22. A mapping f : X → Y of topological spaces is called a homeomorphism if it is bijective, and both the maps f and f −1 are continuous. Topological spaces between which there is a homeomorphism are called homeomorphic.

1.6. Convergence of sequences

6

Remark 1.23. A homeomorphism, being a bijection, identifies not only points of spaces, but also identifies topologies, establishing a one-to-one correspondence between them. For clarity, we can imagine that the homeomorphism f : X → Y is a replacement for the names of points in the space X: the point x ∈ X is “renamed” to f (x), without changing the topology. From these considerations it follows that all topological properties of homeomorphic spaces are the same. The next exercise follows directly from Problem 1.17. Problem 1.24. Let f : X → Y be a homeomorphism, and Z ⊂ X, W = f (Z). Prove that the restriction f ∣Z,W : Z → W is also a homeomorphism. An injective mapping f : X → Y of topological spaces is called an embedding of X into Y if the restriction f ∣X,f (X) is a homeomorphism. Problem 1.25. Show that every embedding is continuous. Give an example of a continuous injective mapping of topological spaces that is not an embedding.

1.6

Convergence of sequences

A sequence in a set X is an arbitrary mapping x : ℕ → X from the set of natural numbers ℕ = {1, 2, . . .}. For convenience, the points x(n) are usually denoted by xn . Also, for brevity, it is customary to say that a sequence of points xn is given. Definition 1.24. A sequence of points xn in a topological space X is called convergent if, for some x ∈ X, called a limit of this sequence, the following holds: for any neighborhood U x there exists N ∈ ℕ such that for all n ≥ N we have xn ∈ U x . If the sequence is not convergent, then it is called divergent. ¯ = ℕ ∪ {ω}, taking as Problem 1.26. Let ω be a character not contained in ℕ. We define a topology on the set N a base all points from ℕ, as well as all sets of the form {n ≥ N } ∪ {ω}, N ∈ ℕ. Show that a sequence x : ℕ → X ¯ converges if and only if the mapping x can be extended to a continuous mapping on ℕ. Problem 1.27. Show that a continuous mapping f : X → Y of topological spaces takes convergent sequences to convergent ones. Show that if X is a metric space, then every mapping g : X → Y that takes convergent sequences into convergent ones is continuous. Give an example of a topological space X and a mapping h : X → Y into a topological space Y , which takes convergent sequences into convergent ones, but is not continuous nonetheless. Problem 1.28. Let x1 , x2 , . . . be a sequence of points in a metric space X. Suppose that for some point x ∈ X each neighborhood of x intersects the set {xi }∞ i=1 \ {x}. Prove that the sequence x1 , x2 , . . . contains a convergent subsequence. Prove that if a sequence of points in a metric space does not contain any convergent subsequence, then for each x ∈ X there exists r > 0 such that the open ball Ur (x) does not contain points of this sequence other than x.

1.7

Closure, interior, boundary, dense subsets, separability

Let Y be a subset of a topological space X. A point x ∈ X is called an adherent point, or a closure point, or a contact point for Y if every neighborhood of x intersects Y . The set of all adherent points of the set Y is called its closure and is denoted by Y . Problem 1.29. Prove that the closure of a set Y ⊂ X is the smallest closed subset of X containing Y , i.e., Y is the intersection of all closed sets containing Y . A subset of Y of a topological space X is called everywhere dense in X if Y = X. Example 1.25. The set of all rational numbers, like the set of all irrational numbers, are everywhere dense in the real line. Problem 1.30. Let the topology of Zariski of weight m be given on an infinite set X. Then a subset Y ⊂ X is everywhere dense in X if and only if #Y ≥ m. A topological space is called separable if it contains an everywhere dense sequence.

1.8. Separated spaces

7

Example 1.26. Each finite space is separable. The Euclidean space ℝn is also separable: as an everywhere dense sequence we can take arbitrary numbered set of all points with rational coordinates. Each space with a countable Zariski topology is separable. Sorgenfrey space (Example 1.14) is separable. Problem 1.31. Show that in metric space, separability is equivalent to having a countable base. Extract from this that every subset of a separable metric space is separable. Show that an open subset of an arbitrary separable topological space is separable. Give an example of a separable topological space containing an non-separable subset (use the Sorgenfrey plane). A point x from a subset Y of a topological space X is called interior for Y if some neighborhood of x is contained in Y . The family of all interior points of the set Y is called its interior and is denoted by Int Y . Problem 1.32. Show that the interior Int Y is the largest open subset of X contained in Y . Problem 1.33. Prove that a subset Y of the topological space X is closed if and only if Y = Y , and is open if and only if Y = Int Y . A point x ∈ X of a topological space X is called a boundary point for a subset Y ⊂ X if each neighborhood of x intersects both Y and its complement X \ Y . The set of all boundary points of the set Y is called its boundary and is denoted by ∂Y . Problem 1.34. Prove that the boundary ∂Y is a closed subset of X, and ∂Y = Y \ Int Y = Y ∩ X \ Y .

1.8

Separated spaces

There are a number of separation axioms that generate various classes of topological spaces. We will not dwell on this in detail here, but formulate only one axiom of separation, which will be useful to us in the future. A topology on a set X, as well as the topological space X itself, is called Hausdorff or separated if any two points of X have disjoint neighborhoods. Example 1.27. Each discrete topology is Hausdorff. Each metric space is Hausdorff. If the set X consists of more than one point, then the anti-discrete topology is not Hausdorff. Also, the Zariski topology is not a Hausdorff topology (see Example 1.9). Problem 1.35. Show that in a Hausdorff topological space every point is closed. Give an example of a non-Hausdorff topological space in which all points are closed. Problem 1.36. Show that the disjoint union and the Cartesian product of Hausdorff topological spaces are also Hausdorff. Problem 1.37. Show that in a Hausdorff topological space the limit of a convergent sequence is uniquely defined. Give an example of a topological space in which each sequence converges to each point. Problem 1.38. Describe what sequences in a space with Zariski topology are convergent, and what limits each convergent sequence has.

1.9

Connected spaces

We say that a set X is partitioned into subsets {Xi }i∈I if X = ⊔i∈I Xi . A topological space (its topology) is called disconnected if it can be partitioned into two nonempty open (equivalently, closed) sets. If such a partition does not exist, then the topological space is called connected. A subset of a topological space is connected (disconnected ), if such is the topology induced on it. In other words, a subset Y of a topological space X is disconnected if there exist U, V ∈ τX such that Y ⊂ U ∪ V , and both intersections Y ∩ U and Y ∩ V are nonempty and do not intersect each other. Problem 1.39. Prove that each segment [a, b] ⊂ ℝ is connected. Problem 1.40. Prove that the closure of a connected subset of a topological space is connected.

1.10. Path-connected spaces

8

Problem 1.41. Let {Ai }i∈I be a family of connected pairwise intersecting subsets of a topological space X, then the set ∪i∈I Ai is connected. Problem 1.42. Show that the image of a connected topological space under a continuous mapping is also connected. Problem 1.43. Prove that every continuous function on a connected topological space takes all intermediate values. The maximum (by inclusion) connected subset of a topological space is called a connected component of this space. Problem 1.44. Show that each connected component is closed, and that each topological space is uniquely partitioned into its connected components. If such a partition is finite, then connected components are also open. Give an example of a topological space in which some connected components are not open.

1.10

Path-connected spaces

A curve in a topological space X is an arbitrary continuous map γ : [a, b] → X. It is said that the curve γ joins the points γ(a) and γ(b). A topological space X is called path-connected if any two of its points can be connected by a curve. Problem 1.45. Prove that a path-connected topological space is connected. Give an example of a connected space that is not path-connected.

1.11

Compact and sequentially compact spaces

A subcover of a cover is a subfamily of a cover, which itself is a cover. A cover of a topological space composed of open sets is called open. Definition 1.28. A topological space X is called compact if a finite subcover can be found in any of its open covers. Remark 1.29. To define a cover for a subset Y of a topological space X, it is more convenient to modify Definition 1.28, rather than reduce it to the corresponding concept for the induced topology. Namely, an open cover of Y is a family {Ua }a∈A of open subsets of X such that Y ⊂ ∪a∈A Ua . Other definitions do not change. Problem 1.46. Show that a finite union of compact subsets of a topological space is compact. Problem 1.47. Prove the following statements: (1) the image under a continuous mapping from a compact topological space is compact; (2) a closed subset of a compact topological space is compact; (3) a compact subset of a Hausdorff topological space is closed; (4) a continuous bijective mapping from a compact topological space to a Hausdorff space is a homeomorphism; (5) give an example of an infinite topological space in which all subsets are compact. Note that in such a space there are compact subsets that are not closed; (6) give an example of a continuous bijective mapping of topological spaces that is not a homeomorphism. Problem 1.48 (Alexander subbase theorem). Let X be a topological space and σ its subbase. Prove that X is compact if and only if each cover of X by elements of the subbase σ has a finite subcover. ∏ Problem 1.49 (Tychonoff’s theorem). Prove that the Cartesian product i∈I Xi of topological spaces Xi , endowed with Tychonoff topology, is compact if and only if all Xi are compact. Problem 1.50. Prove that each segment [a, b] ⊂ ℝ is compact. A subset of a metric space is called bounded if it is contained in some ball. Problem 1.51. Prove that a subset of a Euclidean space is compact if and only if it is closed and bounded.

1.12. Hyperspaces

9

Problem 1.52. Prove that every compact metric space is bounded. Prove that a continuous function on a compact topological space is bounded and takes its largest and smallest values. Definition 1.30. A topological space is called sequentially compact if every sequence of its points has a convergent subsequence. Problem 1.53. Prove that every sequentially compact metric space is bounded. Prove that a continuous function on a sequentially compact topological space is bounded and takes its largest and smallest values. Remark 1.31. Note that compactness and sequential compactness in the case of general topological spaces are not related to each other: there are spaces that have one of these properties and do not have the other. Since the examples are quite complicated, we will omit them here. However, everything is much simpler for metric spaces, see Theorem 2.20 in Chapter 2.

1.12

Hyperspaces

A family of various subsets of a topological or metric space endowed with a certain topology or metric is called a hyperspace. In Construction 1.5 above, we defined the Vietoris topology on the set 2X of all subsets of a topological space X. Thus, 2X is a special case of hyperspace. Here are a few more examples of hyperspaces (in all these spaces, the topology is induced from 2X in the standard way): • by 𝒫0 (X) ⊂ 2X we denote the set of all nonempty subsets of X; • by 𝒞ℒ(X) ⊂ 𝒫0 (X) we denote the set of all nonempty closed subsets of X; • by 𝒞(X) ⊂ 𝒞ℒ(X) the set of all nonempty closed connected subsets of X; • by 𝒞n (X) ⊂ 𝒞ℒ(X) the set of all nonempty closed subsets of X having at most n components; • by 𝒞∞ (X) ⊂ 𝒞ℒ(X) the set of all nonempty closed subsets of X, each of which has finitely many components; • by 𝒦(X) ⊂ 𝒫0 (X) the set of all nonempty compact subsets of X; • by ℱn (X) ⊂ 𝒦(X) the set of all nonempty at most n-point subsets of X; • by ℱ∞ (X) ⊂ 𝒦(X) the set of all nonempty finite subsets of X. There are numerous, usually obvious, connections between these spaces. For example, if the space X is Hausdorff, then 𝒦(X) ⊂ 𝒞ℒ(X); if X is Hausdorff and compact, then 𝒦(X) = 𝒞ℒ(X). A connected nonempty compact Hausdorff topological space is called a continuum. If the space X is Hausdorff, then 𝒦(X) ∩ 𝒞(X) is the set of all continua. In some literature, for example, in [2], this space is denoted by 𝒞(X). { } Problem 1.54. Let X = {a, b}. We define the following topology on X as follows: τ = ∅, X, {a} . Find out what the space CL(X) is. Definition 1.32. A topological space is called a space of class T0 if, for any two different points of this space, at least one of them has a neighborhood that does not contain the second point. Problem 1.55. Prove that the space CL(X) is always a space of class T0 . Problem 1.56. A topological space is called a space of class T1 if, for any two different points of this space, each of them has a neighborhood that does not contain the remaining point. Problem 1.57. Prove that if X is a space of class T1 , then CL(X) is also a space of class T1 . Give an example that demonstrates that the converse is not true. Problem 1.58. Prove that the space 𝒫0 (X) belongs to the class T1 if and only if the space X is discrete.

References to Chapter 1 [1] R. Engelking, General Topology, Heldermann, Berlin, 1989. [2] S. Nadler, Hyperspaces of Sets, 1978.

10

Exercises to Chapter 1.

11

Exercises to Chapter 1 Exercise 1.1. Show that the family of all closed subsets of a topological space X contains ∅ and X, and that the intersection of any collection of closed subsets, as well as the union of any finite collection of closed subsets are some closed sets. Exercise 1.2. Let X be a topological space and Y ⊂ X. Consider the family τY := {U ∩ Y : U ∈ τX }. Prove that τY is a topology on Y . Exercise 1.3. For a metric space (X, ρ) define τρ ⊂ 2X as the collection consisting of the empty set and all possible unions of open balls. Prove that the family τρ is a topology. Exercise 1.4. Prove that a family β ⊂ 2X is a base of some topology τ on X if and only if β is a cover of X, and for any intersecting B1 , B2 ∈ β their intersection B1 ∩ B2 is the union of some elements from β. Moreover, each family satisfying these properties, generates a unique topology. Exercise 1.5. Let some topologies τ1 and τ2 with bases β1 and β2 be given on a set X. Then τ1 = τ2 if and only iff for any x ∈ X the following condition is fulfilled: for any B2 ∈ β2 , x ∈ B2 , there is B1 ∈ β1 for which x ∈ B1 ⊂ B2 , and vice versa, for any B1 ∈ β1 , x ∈ B1 , there exists B2 ∈ β2 for which x ∈ B2 ⊂ B1 . In particular, for a topological space X, a collection β of open sets satisfying the condition of Exercise 1.4 is a base of the topology τX if and only if for each open set U ∈ τX and any point x ∈ U there exists some B ∈ β such that x ∈ B ⊂ U . Exercise 1.6. Let X be an infinite set of cardinality n, and m be an infinite cardinal number, with m ≤ n. Consider a family ℱ of all F ⊂ X such that #F < m, and let βm = {X \ F : F ∈ ℱ}. Prove that the family βm is a base of some topology. Exercise 1.7. Show that the collection of all possible half-intervals of the form [a, b) ⊂ ℝ form a base of some topology that contains the standard topology. Exercise 1.8. Prove that a family σ ⊂ 2X is a subbase of some topology on X if and only if σ is a cover of X. Moreover, each cover of X generates a unique topology. Exercise 1.9. Show that the standard topology of the Euclidean space ℝn coincides with the topology of the Cartesian product ℝ × · · · × ℝ of real lines endowed with the standard topology. Exercise 1.10. Let X be an arbitrary topological space. For each finite collection U1 , . . . , Un ∈ τX we put ⟨U1 , . . . , Un ⟩ = {Y ⊂ X : Y ⊂ ∪ni=1 Ui , and Y ∩ Ui ̸= ∅ for all i = 1, . . . , n}. Show that the families { } { } { } σ = ⟨U ⟩ : U ∈ τX ∪ ⟨X, U ⟩ : U ∈ τX and β = ⟨U1 , . . . , Un ⟩ : U1 , . . . , Un ∈ τX form respectively a subbase and the corresponding base of some topology on 2X . Exercise 1.11. Prove that each construction from the section “Standard constructions of topologies” provides a topology. Exercise 1.12. Prove that the definitions 1.18, 1.19, and 1.20 are equivalent. Exercise 1.13. Let f : X → Y be a mapping of topological spaces and σ be a subbase of the topology on the space Y . Prove that f is continuous if and only if f -preimage of each element from the subbase σ is open in X. Exercise 1.14. Show that the identity mapping and the composition of continuous mappings are continuous. Exercise 1.15. Let X be a topological space, and Z ⊂ X be its subspace. Show that the inclusion mapping i : Z → X, i(z) = z for each point z ∈ Z, is continuous. Exercise 1.16. Let X, Y be topological spaces, W ⊂ Y be a subspace of Y , and f : X → W be a continuous mapping. Let g : X → Y be a mapping coinciding with f : for each x ∈ X it holds f (x) = g(x). Prove that the mapping g is continuous.

Exercises to Chapter 1.

12

Exercise 1.17. Let f : X → Y be a continuous mapping of topological spaces, Z ⊂ X, W ⊂ Y , f (Z) ⊂ W . Then the restriction f ∣Z,W : Z → W is also continuous as the mapping of the topological spaces Z and W with the topologies induced on them from X and Y , respectively. Exercise 1.18. Let {Xi }i∈I be a cover of a topological space X by open subsets Xi , and f : X → Y a mapping to a topological space Y . Show that f is continuous if and only if all the restrictions f ∣Xi are continuous. In particular, this holds when X = ⊔i∈I Xi is the disjoint union of some topological spaces. Will this result remain true if we replace {Xi } with a cover of X by arbitrary sets? Exercise 1.19. Let X = ⊔i∈I Xi be the disjoint union of some topological spaces and f : X → Y be a map into the topological space Y . Show that f is continuous if and only if all its restrictions f ∣Xi are continuous. ∏ Exercise 1.20. Let {Xi }i∈I be a family of topological spaces and X = i∈I Xi . We define the canonical projection πi : X → Xi by setting πi (x) = xi . Prove that the product topology on X is the smallest of those topologies in which all the projections πi are continuous. ∏ Exercise 1.21. Let {Yi }i∈I be a family of topological spaces, ∏ Y = i∈I Yi , and fi : X → Yi are mappings from some topological space X. We construct the mapping F := i∈I fi : X → Y by associating with each point x ∈ X the element y ∈ Y defined as follows: yi = fi (x). Prove that the mapping F is continuous if and only if all fi are continuous. Exercise 1.22. Let A ⊂ ℝn be an arbitrary subset, (x1 , . . . , xn ) the Cartesian coordinates on ℝn , f : A → ℝm a continuous mapping, (y 1 , . . . , y m ) the Cartesian coordinates on ℝm , and y i = y i (x1 , . . . , xn ) the coordinate functions of the mapping f . Prove that the mapping f is continuous if and only if all the coordinate functions y i = y i (x1 , . . . , xn ) are continuous. Exercise 1.23. Describe all continuous functions on a topological space with Zariski topology. Exercise 1.24. Let f : X → Y be a homeomorphism, and Z ⊂ X, W = f (Z). Prove that the restriction f ∣Z,W : Z → W is also a homeomorphism. Show that the letters b, c, f, g, i, h, o are pairwise non-homeomorphic. Exercise 1.25. Show that every embedding is continuous. Give an example of a continuous injective mapping of topological spaces that is not an embedding. ¯ = ℕ ∪ {ω}, taking as Exercise 1.26. Let ω be a character not contained in ℕ. We define a topology on the set N a base all points from ℕ, as well as all sets of the form {n ≥ N } ∪ {ω}, N ∈ ℕ. Show that a sequence x : ℕ → X ¯ converges if and only if the mapping x can be extended to a continuous mapping on ℕ. Exercise 1.27. Show that a continuous mapping f : X → Y of topological spaces takes convergent sequences to convergent ones. Show that if X is a metric space, then every mapping g : X → Y that takes convergent sequences into convergent ones is continuous. Give an example of a topological space X and a mapping h : X → Y into a topological space Y , which takes convergent sequences into convergent ones, but is not continuous nonetheless. Exercise 1.28. Let x1 , x2 , . . . be a sequence of points in a metric space X. Suppose that for some point x ∈ X each neighborhood of x intersects the set {xi }∞ i=1 \ {x}. Prove that the sequence x1 , x2 , . . . contains a convergent subsequence. Extract from this that if a sequence of points in a metric space does not contain any convergent subsequence, then for each x ∈ X there exists r > 0 such that the open ball Ur (x) does not contain points of this sequence other than x. Exercise 1.29. Prove that the closure of a set Y ⊂ X is the smallest closed subset of X containing Y , i.e., Y is the intersection of all closed sets containing Y . Exercise 1.30. Let the topology of Zariski of weight m be given on an infinite set X. Then a subset Y ⊂ X is everywhere dense in X if and only if #Y ≥ m. Exercise 1.31. Show that in metric space, separability is equivalent to having a countable base. Extract from this that every subset of a separable metric space is separable. Show that an open subset of an arbitrary separable topological space is separable. Give an example of a separable topological space containing an non-separable subset (use the Sorgenfrey plane). Exercise 1.32. Show that the interior Int Y is the largest open subset of X contained in Y .

Exercises to Chapter 1.

13

Exercise 1.33. Prove that a subset Y of the topological space X is closed if and only if Y = Y , and is open if and only if Y = Int Y . Exercise 1.34. Prove that the boundary ∂Y is a closed subset of X, and ∂Y = Y \ Int Y = Y ∩ X \ Y . Exercise 1.35. Show that in a Hausdorff topological space every point is closed. Give an example of a non-Hausdorff topological space in which all points are closed. Exercise 1.36. Show that the disjoint union and the Cartesian product of Hausdorff topological spaces are also Hausdorff. Exercise 1.37. Show that in a Hausdorff topological space the limit of a convergent sequence is uniquely defined. Give an example of topological space in which each sequence converges to each point. Exercise 1.38. Describe what sequences in a space with Zariski topology are convergent, and what limits each convergent sequence has. Exercise 1.39. Prove that each segment [a, b] ⊂ ℝ is connected. Exercise 1.40. Prove that the closure of a connected subset of a topological space is connected. Exercise 1.41. Let {Ai }i∈I be a family of connected pairwise intersecting subsets of a topological space X, then the set ∪i∈I Ai is connected. Exercise 1.42. Show that the image of a connected topological space under a continuous mapping is also connected. Exercise 1.43. Prove that every continuous function on a connected topological space takes all intermediate values. Exercise 1.44. Show that each connected component is closed, and that each topological space is uniquely partitioned into its connected components. If such a partition is finite, then connected components are also open. Give an example of a topological space in which some connected components are not open. Exercise 1.45. Prove that a path-connected topological space is connected. Give an example of a connected space that is not path-connected. Exercise 1.46. Show that a finite union of compact subsets of a topological space is compact. Exercise 1.47. Prove the following statements: (1) the image under a continuous mapping from a compact topological space is compact; (2) a closed subset of a compact topological space is compact; (3) a compact subset of a Hausdorff topological space is closed; (4) a continuous bijective mapping from a compact topological space to a Hausdorff space is a homeomorphism; (5) give an example of an infinite topological space in which all subsets are compact. Note that in such a space there are compact subsets that are not closed; (6) give an example of a continuous bijective mapping of topological spaces that is not a homeomorphism. Exercise 1.48 (Alexander subbase theorem). Let X be a topological space and σ its subbase. Prove that X is compact if and only if each cover of X by elements of the subbase σ has a finite subcover. ∏ Exercise 1.49 (Tychonoff’s theorem). Prove that the Cartesian product i∈I Xi of topological spaces Xi , endowed with Tychonoff topology, is compact if and only if all Xi are compact. Exercise 1.50. Prove that each segment [a, b] ⊂ ℝ is compact. Exercise 1.51. Prove that a subset of a Euclidean space is compact if and only if it is closed and bounded.

Exercises to Chapter 1.

14

Exercise 1.52. Prove that every compact metric space is bounded. Prove that a continuous function on a compact topological space is bounded and takes its largest and smallest values. Exercise 1.53. Prove that every sequentially compact metric space is bounded. Prove that a continuous function on a sequentially compact topological space is bounded and takes its largest and smallest values. 2

Notation. The following matrix groups consist of real matrices of size n × n and are considered as subsets of ℝn with the induced topology (their rows or columns are written out one after another and form vectors): O(n) consists of all orthogonal matrices (orthogonal group); SO(n) consists of all orthogonal matrices with determinant 1 (special orthogonal group); GL(n) consists of all nondegenerate matrices (general linear group); SL(n) consists of all matrices with determinant 1 (special linear group). Exercise 1.54. Find out which of the following matrix groups are connected, which are compact: O(n), SO(n), GL(n), SL(n). { } Exercise 1.55. Let X = {a, b}. We define the following topology on X: τ = ∅, X, {a} . Find out what the space CL(X) is. Definition 1.33. A topological space is called a space of class T0 if, for any two different points of this space, at least one of them has a neighborhood that does not contain the second point. Exercise 1.56. Prove that the space CL(X) is always a space of class T0 . Definition 1.34. A topological space is called a space of class T1 if, for any two different points of this space, each of them has a neighborhood that does not contain the remaining point. Exercise 1.57. Prove that if X is a space of class T1 , then CL(X) is also a space of class T1 . Give an example that demonstrates that the converse is not true. Exercise 1.58. Prove that the space 𝒫0 (X) belongs to the class T1 if and only if the space X is discrete.

Chapter 2

Introduction to Metric Spaces. Schedule. Distance, pseudometric or semimetric, metric, open ball, closed ball, sphere, distance to a nonempty subset, open and closed rneighborhoods of a nonempty subset, diameter of a subset, natural quotient of pseudometric space, Lipschitz mappings, Lipschitz constant, dilatation of a Lipschitz mapping, uniform continuity, bi-Lipschitz mappings, isometric mapping, isometry, isometry group, standard constructions of metrics, multiplying a metric by a number, adding to metric a constant, induced distance, semidirect product of metric spaces, examples, Levenshtein distance, elements of graph theory, metric construction for graphs, distance on a connected graph, distance on a connected weighted graph, Cayley graph of a group, quotient pseudometric and metric, generalized distance, disjoint union of generalized spaces, maximal pseudometric, the relation between quotient and maximal pseudometrics and metrics, isometries groups actions and quotient spaces, metrized graphs, polyhedron spaces, convergence of sequences and completeness, fundamental sequences, completion of a metric space, equivalence of compactness and sequential compactness for metric spaces, completeness and total boundedness equivalent to compactness for metric spaces, canonical isometric embeddings of metric spaces, Frechet-Kuratowski embedding to the space of bounded continuous functions, Frechet embedding of a separable metric space to the space of bounded sequences.

In this section, we discuss some basic facts from metric spaces theory.

2.1

Distance

Let X be an arbitrary set. A function ρ : X × X → ℝ is called a distance, if it is non-negative, it is equal to zero on all pairs of the form (x, x), and it is symmetric: ρ(x, y) = ρ(y, x) for any x, y ∈ X. If the distance function ρ satisfies the triangle inequality, namely, ρ(x, y) + ρ(y, z) ≥ ρ(x, z), then such ρ is called a pseudometric or a semimetric, and the set X with the introduced pseudometric on it is a pseudometric space or a semimetric space. Thus, the metric defined above is a pseudometric not equal to zero on the pairs (x, y) with x ̸= y. Remark 2.1. It will be convenient for us to introduce a universal notation for the distance function defined on an arbitrary set, namely, the distance between points x and y from this set will be denoted by ∣xy∣. Even if several sets are considered at the same time with the distances given on them, we will denote these distances in the same way, while understanding what distance is used according to which set the corresponding points belong to. For each distance given on the set X, a number of subsets arise that play an important role in the study of geometry: { } • an open ball of radius r > 0 and center x ∈ X: Ur (x) := y ∈ X : ∣xy∣ < r (above we defined an open ball for a metric); { } • a closed ball of radius r ≥ 0 and center x ∈ X: Br (x) := y ∈ X : ∣xy∣ ≤ r ; { } • a sphere of radius r ≥ 0 and center x ∈ X: Sr (x) := y ∈ X : ∣xy∣ = r ; Note that in metric space, for r = 0, both a closed ball and a sphere of radius r degenerate to a point and are called degenerate. For any point x ∈ X and a nonempty set A, define the distance from x to A by setting { } ∣xA∣ = ∣Ax∣ = inf ∣xa∣ : a ∈ A . This concept gives rise to a number of objects: { } • for r > 0 an open r-neighborhood of the set A is Ur (A) = y ∈ X : ∣Ay∣ < r ; 15

2.2. Lipschitz mappings and isometries

16

{ } • for r ≥ 0 a closed r-neighborhood of the set A is Br (A) = y ∈ X : ∣Ay∣ ≤ r . In addition, for a nonempty A ⊂ X, a numerical characteristic of the set A is defined — its diameter { } diam A = sup ∣aa′ ∣ : a, a′ ∈ A . Moreover, the diameter can also be naturally determined for the empty set by setting diam ∅ = 0. Remark 2.2. Sometimes an object A under consideration can simultaneously belong to different metric spaces, for example, in one space it is a subset, and in another it is a point. Then, in the notation introduced above, Ur (A), Br (A), etc., as an upper index, we will add either the name of the space to which A belongs or the metric of this space. A typical case: A can be considered both as a nonempty subset of the space X with the metric d, and as a point in the space ℋ(X) of all nonempty closed bounded subsets of X with some metric dH . In this situation, instead ℋ(X) of Ur (A) we will write UrX (A) or Urd (A) in the first case, and Ur (A) or UrdH (A) in the second one. Each pseudometric on the set X defines a natural equivalence relation: x ∼ y if and only if ∣xy∣ = 0. Let X/∼ be the set of classes of this equivalence, and for each x ∈ X denote by [x] the equivalence class containing x. Problem 2.1. Prove that for any x, y ∈ X and x′ ∈ [x] and y|′ ∈ [y]| it is true that ∣x′ y ′ ∣ = ∣xy∣. Thus, on the set X/∼ the corresponding distance function is correctly defined: |[x][y]| = ∣xy∣. Show that this distance function is a metric. Remark 2.3. If ρ is the pseudometric on X, then the quotient space X/∼ from Problem 2.1 is sometimes denoted by X/ρ. Problem 2.2. Let X be an arbitrary metric space, x, y ∈ X, r ≥ 0, s, t > 0, and A ⊂ X be nonempty. Verify that ( ) ( ( ) ( (1) Us {x} = Us x) and Br {x} = Br x); (2) the functions y ⊦→ ∣xy∣, y ⊦→ ∣yA∣ are continuous; (3) an open neighborhood Us (A) is an open subset of X, and a closed neighborhood Br (A) is a closed subset of X; ( ) (4) Ut Us (A) ⊂ Us+t (A) and construct an example demonstrating that the left-hand side can be different from the right-hand side; ( ) (5) Bt Bs (A) ⊂ Bs+t (A) and construct an example demonstrating that the left-hand side can be different from the right-hand side; (6) ∂Us (x), ∂Bs (x) are not related by any inclusion; ∂Us (x) ⊂ Ss (x) and ∂Br (x) ⊂ Sr (x); the both previous inclusions can be strict; (7) diam Us (x) ≤ diam Bs (x) ≤ 2s; (8) diam Us (A) ≤ diam Bs (A) ≤ diam A + 2s.

2.2

Lipschitz mappings and isometries

Mappings of metric spaces that distort distances no more than a certain finite number of times are called Lipschitz. More formally, a mapping| f : X → |Y of metric spaces is called Lipschitz, if there exists C ≥ 0 such that for any x, x′ ∈ X the inequality |f (x)f (x′ )| ≤ C∣xx′ ∣ holds. Each such C is called a Lipschitz constant. Sometimes, for brevity, a Lipschitz mapping with Lipschitz constant C is called C-Lipschitz. For 1-Lipschitz mapping is reserved the term nonexpanding. Problem 2.3. Let ℒ(f ) ⊂ ℝ be the set of all Lipschitz constants for a Lipschitz mapping f . Prove that inf ℒ(f ) is also a Lipschitz constant. The smallest Lipschitz constant for a Lipschitz mapping f is called the dilatation of the mapping f and is denoted by dil f . A mapping f : X → Y of metric spaces is called uniformly continuous if for any ε > 0 there exists δ > 0 such that for any points x, x′ ∈ X, ∣xx′ ∣ < δ, we have ∣f (x)f (x′ )∣ < ε.

2.3. Standard constructions of metrics

17

Problem 2.4. Show that each Lipschitz mapping is uniformly continuous, and each uniformly continuous mapping is continuous. A bijective mapping f between metric spaces such that f and f −1 are Lipschitz is called bi-Lipschitz. It is clear that each bi-Lipschitz mapping is a homeomorphism. | | A mapping of metric spaces f : X → Y is called isometric if it preserves distances: |f (x)f (x′ )| = ∣xx′ ∣ for any x, x′ ∈ X. A bijective isometric mapping is called an isometry. Problem 2.5. Show that each isometry is a homeomorphism, in particular, each isometric mapping from one metric space into another one is an embedding, i.e., we recall, it is a homeomorphism with its image. Problem 2.6. Verify that the identity map, the composition of isometries, and the inverse mapping to an isometry are also isometries, i.e., the set of all isometries of an arbitrary metric space forms a group. Definition 2.4. The set of all isometries f : X → X of the metric space X endowed with the composition operation is called the isometry group of the space X and is denoted by Iso(X). Problem 2.7. Let X be an arbitrary metric space, x, y ∈ X and A ⊂ X be nonempty. Prove that ∣Ax∣ + ∣xy∣ ≥ ∣Ay∣, so that the function ρA (x) = ∣Ax∣ is 1-Lipschitz and, therefore, uniformly continuous.

2.3

Standard constructions of metrics

This section provides examples of standard constructions that build various metric spaces. Construction 2.1 (Multiplying a metric by a number). If d is a distance function on the set X, then for each real λ > 0 the function λ d is also a distance on X; moreover, if d was a metric (pseudometric), then λ d is the distance of the same type. The corresponding space will be denoted by λ X. Construction 2.2 (Adding a constant). Let d be the distance function on X. We define an analogue of the Kronecker symbols for x, y ∈ X by setting δxy = 0 for any x ̸= y, and δxx = 1 for any x. Then for every real c ≥ 0 the function (d + c)(x, y) = d(x, y) + c (1 − δxy ) is a distance. Moreover, if d is a pseudometric (metric) and c > 0, then d + c is a metric. Note that this construction can also be extended to some negative numbers c. Problem 2.8. Let d be a metric. Find the least possible c such that d + c is a pseudometric. Verify that for such c and any c′ > c the function d + c′ is a metric. Construction 2.3 (Induced distance). In fact, we have already implicitly used this obvious construction. Let d be a distance function on the set X, and Y be a nonempty subset of X. We define a distance function on Y by setting ∣yy ′ ∣ = d(y, y ′ ) for any y, y ′ ∈ Y , and call it induced from d or the restriction of d to Y . Note that the restriction of a metric (pseudometric) is always a metric (pseudometric). To emphasize that Y is equipped with the induce metric, we call such Y a subspace of X. However, in what follows, each subset of a metric space will be considered as a subspace, unless otherwise stated. Construction 2.4 (Product). Let {Xi }i∈I∏be some family of sets, and di be a distance function on Xi . We call each ′ ′ Xi equipped with ( di a′ )space. We set X = i∈I Xi , and for any x, x ∈ X we consider the function dI (x, x ) : I → ℝ, ′ dI (x, x )(i) = di xi , xi . Put DI = ∪x,x′ ∈X {dI (x, x′ )}, and let 𝒱I ⊃ DI be some linear subspace of the linear space ℱI consisting of all real-valued functions defined on the set I, and 𝒱I+ ⊂ 𝒱I consists of all functions with non-negative values. We call the subset 𝒱+ ⊂ 𝒱 by the first orthant of 𝒱. Denote by 0 ∈ ℱI the zero function: 0(i) = 0 for any i ∈ I. Let ρ : 𝒱I+ → [0, ∞) (be an arbitrary function such ) that ρ(0) = 0. This function ρ generates a distance function dρ on X: dρ (x, x′ ) = ρ dI (x, x′ ) . The function ρ is sometimes called a binder, and the function dρ is called the distance generated by ρ; the space X with the distance function dρ is called the semidirect product of the spaces Xi w.r.t. ρ. Below we will show a few examples in which all di are metrics, and the resulting dρ is a metric as well. Example 2.5. Consider the standard partial ordering on the first orthant 𝒱+ of 𝒱: v ≤ w if and only if v(i) ≤ w(i) for all i ∈ I. We call the function ρ from Construction 2.4

2.3. Standard constructions of metrics

18

• positively definite if it vanishes only at the origin; • subadditive if for any v, w ∈ 𝒱+ it holds ρ(v + w) ≤ ρ(v) + ρ(w); • monotone if it is monotone w.r.t. the standard partial order described above. Proposition 2.6. Under notations of Construction 2.4, let∏all Xi be metric spaces, and ρ be positively definite, subadditive and monotone. Then the distance function dρ on i∈I Xi is a metric. Proof. Positive definiteness of dρ follows from the one of metrics and of the function ρ. It remains to verify the triangle inequality. Take arbitrary x, x′ , x′′ ∈ X, then ( ) ( ) ( ) ( ) dρ (x, x′′ ) = ρ dI (x, x′′ ) ≤ ρ dI (x, x′ ) + dI (x′ , x′′ ) ≤ ρ dI (x, x′ ) + ρ dI (x′ , x′′ ) = dρ (x, x′ ) + dρ (x′ , x′′ ). Here the first inequality follows from monotonicity of ρ and triangle inequalities for metrics, and the second one from subadditivity of ρ. Example 2.7. Let I be a finite set, say, I = {1, . . . , n}, and 𝒱I = ℱI = ℝn . For ρ we choose the corresponding restriction of one of the following standard norms on 𝒱I : ┌ ⎹ n ⎹∑ ‖ ‖ ‖ ‖ { } p ‖(v1 , . . . , vn )‖ = ⎷ ∣vi ∣p for 1 ≤ p < ∞ and ‖(v1 , . . . , vn )‖∞ = max ∣vi ∣ : i = 1, . . . , n . p i=1

Then the distance function dρ is a metric because ρ is positively definite, subadditive and monotone (verify), thus we can apply Proposition 2.6. For p = 2 (when the norm is Euclidean) we get the direct product of the spaces Xi . Example 2.8. On the first orthant ℝ2+ of the plane ℝ2 with coordinates (x, y) consider the following function √ ρ(x, y) = x + y. This function is obviously positively definite and monotonic. To prove it is subadditive, make the following calculation: √ √ √ ρ(x + x′ , y + y ′ ) = x + x′ + y + y ′ ≤ x + y + x′ + y ′ = ρ(x, y) + ρ(x′ , y ′ ). Let X1 = X2 = ℝ, then, by Proposition 2.6, the distance function dρ is a metric. In explicit form √ ( ) dρ (x, y), (x′ , y ′ ) = ∣x − x′ ∣ + ∣y − y ′ ∣. Example 2.9. Let I be an arbitrary set, and the family {Xi }i∈I consists of spaces with diameters bounded by the same constant, i.e., the set of numbers {diam Xi }i∈I is bounded. Then, for 𝒱, { we can}choose the linear space of all | | bounded functions v : I → ℝ, and for ρ we can choose the norm ‖v‖∞ = supi∈I |v(i)| . If all Xi are metric spaces, then dρ is a metric. ∏ The previous construction can be generalized if we consider not all the Cartesian product X = i∈I Xi , but only a part of it extracted by some condition. ∏ Example 2.10. We choose the set of natural numbers ℕ as I, and the real line ℝ as Xi , then each element of i∈ℕ Xi can be represented as a sequence (x1 , x2 , . . .), where xi ∈ Xi = ℝ. ∏ Consider a subset ℓp ⊂ i∈ℕ Xi consisting of all sequences x = (x1 , x2 , . . .) for which {√ ∑ ∞ p p for 1 ≤ p < ∞, and i=1 ∣xi ∣ ‖x‖p := { } sup ∣xi ∣ : i ∈ ℕ for p = ∞ is finite. Then ‖ · ‖p is a norm on ℓp . The resulting metric space is denoted by ℓp . In particular, the space ℓ∞ consists of all bounded sequences. Example 2.11. We choose an arbitrary set of∏indices I, and consider I as topological space with discrete topology. We put all Xi equal to ℝ, and as a subset of i∈I Xi we take the family FinI of all maps x : I → ℝ with compact support, i.e., such that x(i) ̸= 0 only for a finite number of indices i. It is clear that FinI forms a linear space, and each function √∑ { } ∣xi ∣p for 1 ≤ p < ∞ and ‖x‖∞ = max ∣xi ∣ ‖x‖p = p i∈I

i∈I

is a norm on the space FinI . If all Xi are metric space, and ρ is a norm described above, then dρ is a metric.

2.3. Standard constructions of metrics

19

Construction 2.5 (Levenshtein distance). Let A be some set, and A∗ be the family of all finite sequences of elements from A, as well as an empty sequence λ. We will interpret A as an alphabet of some language. Then elements from A are naturally called letters, and the elements from A∗ are words. An editorial operation on A∗ is a word transformation consisting either in the exclusion of one of the letters from the word (deletion), or in the insertion of a letter into the word (insertion), or in replacing one letter with another one (substitution). The smallest number of editorial operations needed to move from one word to another is called the Levenshtein distance and generates a metric on A∗ . This distance plays an important role in linguistics and bioinformatics. Now we need to recall the notion of graph. We consider more general graphs permitting infinite numbers of vertices and edges.

2.3.1

Elements of graph theory, and metric constructions for graphs

Consider a triple G = (V, E, i), where V and E are arbitrary sets, and i : E → ℱ2 (V ) a mapping to the set of all at most 2-point nonempty subsets of V , see Section 1.12. The set V is also denoted by V (G) and called the set of vertices; the set E is also denoted by E(G) and called the set of edges; the mapping i : E → ℱ2 (V ) is also denoted by iG and called the incidence mapping, and it defines how edges are glued ( to)vertices; the triple G is called a non-oriented graph. An edge e is called a loop, if #i(e) = 1. The number #i−1 i(e) is called multiplicity of the edge e. An edge e is called multiple, if its multiplicity is more that 1. Graph without loops and multiple edges is called simple. In a simple graph each edge e can be considered as the pair {v, w} = i(e) of different vertices v and w, that is why we can reduce the definition by setting G = (V, E), where E is a subset of V (2) , where V (2) is the set of all 2-point subsets of V . A subgraph H of the graph G is a triple (V ′ , E ′ , i′ ) such that V ′ ⊂ V , E ′ ⊂ E, and i′ = i∣E ′ . If instead ℱ2 (V ) we consider V × V , then we obtain definition of an oriented graph. Let G = (V, E, i) be an arbitrary non-oriented graph. For each edge e ∈ E and each vertex v ∈ i(e) we say that e and v are incident; if i(e) = {v, w}, the vertices v and w are called adjacent, and we say that e joins v and w; if i(e) = {v}, we say that v is adjacent to itself, that e joins v with v. The number of non-loop edges plus the double number of loops, incident to a vertex v ∈ V , is called the degree of v and is denoted by deg v or degG v. A walk of length n joining some vertices v and w is a sequence v = v0 , e1 , v1 , e2 , . . . , en , vn = w of alternating vertices and edges such that each edge ek joins vk−1 and vk . If G is a simple graph, we do not need to indicate the edges since each pair of adjacent vertices defines uniquely the edge incident to these vertices. The walt is called closed if v0 = vn , and it is called open otherwise. A trail is a walk with no repeated edges. A path is an open trail with no repeated vertices. A circuit is a closed trail. A cycle is a circuit with no repeated vertices. In the case of an oriented graph G, if i(e) = (v, w), then we say that e starts at v and ends at w. We redefine the walk just demanding that each ek starts at vk−1 and ends at vk . All other definitions remain the same. A graph G is called connected, if each pair of its vertices are joined by a walk. A graph without cycles is called a forest, and a connected forest is called a tree. Clearly that each forest is a simple graph. A weighted graph is a graph G = (V, E, i) equipped with a weight function ω : E → [0, ∞) (sometimes it is useful to consider more general weight functions, for instance, with possibility of negative values). Sometimes we denote the weighted graph as (V, E, i, ω) or (G, ω), and, in the case of simple graph G, by (V, E, ω). The weight of a subgraph of a weighted graph is the sum of the weights of edges from this subgraph. We can extend this definition to paths and cycles considering them as subgraphs of G. In the case of the walk, its weight is defined as the sum of weights of its consecutive edges. For graphs without weight functions these notions are defined as well by assigning the weight 1 to each edge by default. Construction 2.6 (Distance on a connected graph). Let Γ = (V, E, i) be a connected graph. We define a distance function on V , setting it equal to the infimum of length of walks joining a given pair of vertices (we can always change the walks to paths with the same result). This function is a metric on V (verify). Construction 2.7 (Distance on a connected weighted graph). If Γ = (V, E, i, ω) is a weighted connected graph with the weight function ω : E → [0, ∞), then we define the distance function by setting it equal to the infimum of the weights of all walks (or paths) joining a given pair of vertices. The resulting function is a pseudometric on V . Note that even if ω is everywhere positive, the constructed distance function does not have to be a metric (consider a graph with a positive weight function in which some pair of vertices are connected by an infinite number of paths with weights tending to zero). Construction 2.8 (Cayley graph of a group). Let G be an arbitrary group with a set S of generators. The Cayley graph of the pair (G, S) is the directed graph Γ(G, S) = (G, E, i), in which (g, h) = i(e) for some e ∈ E if and only if h = gs for some s ∈ S.

2.3. Standard constructions of metrics

20

We note that in geometric group theory the set S usually satisfies the following properties: the neutral element is not contained in S, and S = S −1 , i.e., if s ∈ S, then s−1 ∈ S. In this case, the graph Γ(G, S) has no loops, and if (g, h) = i(e) for some e ∈ E, then (h, g) = i(e′ ) for some e′ ∈ E, since h = gs implies g = hs−1 . Under these assumptions, by Cayley graph we mean a simple non-oriented graph in which each pair of mutually opposite oriented edges were replaced by one (non-oriented) edge. For such Cayley graph, Construction 2.6 defines a metric on G, which is used to determine the growth rate of the group G. Remark 2.12. In what follows, we will always consider only those S that satisfies the both conditions: S = S −1 and S does not contain the neutral element of G. However, we will not list in S all inverses of its elements supposing that this holds by default. Problem 2.9. Describe the Cayley graphs for the following groups G and generating sets S: (1) G = ℤ and S = {1}; (2) G = ℤm and S = {1}; { } (3) G = ℤ2 and S = (1, 0) (0, 1) ; { } (4) G = ℤ2 and S = (1, 0) (0, 1), (1, 1) ; (5) G is a free group with generators a and b. Construction 2.9 (Quotient spaces). Let (X, ρ) be a metric (pseudometric) space, and ∼ an equivalence relation on X. Define the following quotient distance function on X: ρ∼ (x, y) = inf

n {∑

} ρ(pi , qi ) : p0 = x, qn = y, n ∈ ℕ, qi ∼ pi+1 for all i .

i=0

The terms in the right-hand side can be naturally visualized as p0 − q0 ∼ p1 − q1 ∼ · · · ∼ pn − qn , where pi − qi indicates that we calculate the distance between pi and qi , and qi ∼ pi+1 says that these points are taken from the same equivalency class, and thus the distance between them is zero. We ∑ncall such sequences p0 −q0 ∼ p1 −q1 ∼ · · · ∼ pn −qn an admissible one joining the classes [x] and [y], and the value i=0 ρ(pi , qi ) the weight of this admissible sequence. Problem 2.10. Prove that ρ∼ is a pseudometric on X. Now, let x ∼ y, then we can put p0 = x = q0 and p1 = q1 = y, so ρ∼ (x, y) ≤ ρ(x, x) + ρ(y, y) = 0, thus ρ∼ (x, y) = 0. Therefore, each equivalence class of ∼ consists of points on ρ∼ -zero distance from each other, thus ρ∼ generates correctly a pseudometric on X/∼ which we also denote by ρ∼ and call in the quotient pseudometric w.r.t. ∼; the space X/∼ we will call the the quotient pseudometric space w.r.t. ∼. The next step is to consider the space X/ρ∼ instead of the X/∼: these two spaces are different when the distance between some distinct classes of ∼ vanishes. As above, we denote by the same ρ∼ the corresponding metric on X/ρ∼ , and the ρ∼ and X/ρ∼ we call the quotient metric space and the quotient metric space w.r.t. ∼, respectively. To work effectively with the ρ∼ , let us define the following notions. The admissible sequence p0 − q0 ∼ p1 − q1 ∼ · · · ∼ pn − qn from definition of ρ∼ is called reducible if it is possible to delete a part of it in such a way that its ends remains to belong to the classes [x], [y] ∈ X/∼, and its weight did not increase. Otherwise, its is called irreducible. It is evident that to calculate the distance ρ∼ , it is sufficient to consider irreducible admissible sequences only. If a and b not equivalent, then we write it as a ̸∼ b. Problem 2.11. Let ξ = (p0 − q0 ∼ p1 − q1 ∼ · · · ∼ pn − qn ) be an irreducible admissible sequence. Prove that (1) for any i < j we have pi ̸∼ pj , qi ̸∼ qj ; (2) qi ∼ pj if and only if j = i + 1; (3) for any i we have qi ̸= pi+1 . Example 2.13. Let us show how Problem 2.11 can be applied. Let X = [0, a] ⊂ ℝ be a segment of real line. Identify its ends. This means, that x, y ∈ X are equivalent if and only if either x = y, or {x, y} = {0, a}. Describe all irreducible admissible sequences ξ. The are two possibilities (verify): (1) ξ consists of two different points, say x and y, and its weight equals ∣xy∣;

2.3. Standard constructions of metrics

21

(2) ξ consists of 4 points, its ends are distinct from 0 and a, but the middle points are these 0 and a; w.l.o.g., ξ = (x − 0 ∼ a − y), thus its weight equals ∣0x∣ + ∣ya∣. Thus, on the topological circle [0, a]/ ∼ we introduce the distance in the same way as we do for the standard circle when we choose the length of the shortest arc between the points. Problem 2.12. Let ∼ be the trivial equivalence on a metric space (X, ρ), i.e., x ∼ y if and only if x = y. Prove that ρ∼ = ρ. Problem 2.13. Let ∼ be an equivalence on a pseudometric space (X, ρ). Prove that for any x, y ∈ X it holds ρ∼ (x, y) ≤ ρ(x, y). Thus, if we define the function b : X → X → ℝ such that b(x, y) = 0 for x ∼ y, and b(x, y) = ρ(x, y) otherwise, then ρ∼ ≤ b. Construction 2.10 (Generalization of distance). It is useful to allow infinite distances. A function ρ : X ×X → [0, ∞] that satisfies the axioms of distance (pseudometric, metric) will be called generalized one. The corresponding spaces X with such distances we call generalized as well. Remark 2.14. In some monographs the authors work with generalized distances from the very beginning and because of that they call such distances without the word “generalized”. However, the distances with values in [0, ∞) they call finite ones. The generalization of distance gives rise more metric constructions. { } Construction 2.11 (Disjoint union of spaces). Let (Xi , ρi ) i∈I be an arbitrary family of generalized spaces. Consider on ⊔i∈I Xi the distance function that is equal to ρi on Xi , and to ∞ for any pair of points from different i. The resulting space is called the disjoint union of spaces Xi . Evidently, if Xi are generalized pseudometric (metric) spaces, then ⊔i∈I Xi is the space of the same type. Let X be a generalized pseudometric space. So, we can define two equivalence relations: x ∼1 y if and only if ∣xy∣ < ∞; and x ∼2 y if and only if ∣xy∣ = 0. Problem 2.14. Prove that each class of equivalence ∼1 is a pseudometric space (with finite distance), and that the distance between points from different classes equals ∞. Thus, if we denote by Xi the classes of equivalence ∼1 , then X = ⊔Xi . Prove that the space X/∼2 equals the disjoint union ⊔(Xi /∼2 ) of metric spaces Xi /∼2 . Now we combine disjoint union and quotient operation. Example 2.15. Let X and Y be metric spaces, Z ⊂ X, and f : Z → Y a mapping. Consider on X ⊔ Y with generalized metric ρ the equivalence relation ∼ which identifies each a ∈ f (Z) with all b ∈ f −1 (a). The quotient space X ⊔f Y := (X ⊔ Y )/ρ∼ is called the result of gluing the spaces X and Y over the mapping f . Problem 2.15. Suppose that f is isometric. Prove that the restrictions of ρ∼ to X and Y coincides with the initial metrics of X and Y , respectively. Problem 2.16. Let y0 ∈ Y and f (X) = y0 . Prove that X ⊔f Y is isometric to Y . Construction 2.12 (Maximal pseudometric). Consider an arbitrary function b : X × X → [0, ∞], and denote by 𝒟b the set of all generalized pseudometrics d : X × X → [0, ∞] such that d(x, y) ≤ b(x, y) for any x, y ∈ X. Lemma 2.16. There exists and unique db ∈ 𝒟b such that db ≥ d for all d ∈ 𝒟b . Proof. We put db (x, y) = supd∈𝒟b d(x, y). It is evident that db is nonnegative, symmetric, db (x, x) = 0 for all x ∈ X, and db ≤ b. It remains to verify the triangle inequality: ( ) db (x, z) = sup d(x, z) ≤ sup d(x, y) + d(y, z) ≤ sup d(x, y) + sup d(y, z) = db (x, y) + db (y, z). d∈𝒟b

d∈𝒟b

d∈𝒟b

d∈𝒟b

The uniqueness of db is evident. We denote db by sup 𝒟b and call it maximal pseudometric w.r.t. the function b.

2.3. Standard constructions of metrics

22

Theorem 2.17. Let (X, ρ) be a pseudometric space, and ∼ an equivalence relation. We put { 0, if x ∼ y, b(x, y) := b∼ (x, y) = ρ(x, y), otherwise. Then db = sup 𝒟b = ρ∼ , thus X/ρ∼ = X/db . Proof. By Problem 2.13, we have ρ∼ ≤ b, thus ρ∼ ∈ 𝒟b . It remains to prove that ρ∼ ≥ d for each d ∈ 𝒟b . To do that, we write down n {∑ } ρ∼ (x, y) = inf ρ(pi , qi ) : p0 = x, qn = y, n ∈ ℕ, qi ∼ pi+1 for all i ≥ i=0

{ } ≥ inf d(p0 , q0 ) + d(q0 , p1 ) + · · · + d(qn−1 , pn ) + d(pn , qn ) : n ∈ ℕ, qi ∼ pi+1 for all i ≥ d(x, y).

Now we apply the quotation technique to obtain a few more important classes of metric spaces. Construction 2.13 (Groups actions). Let X be a metric space and G ⊂ Iso(X) a subgroup of its isometry group. Consider the action of G on X, i.e., the mapping φ : G × X → X such that φ(g, x) =: g(x) satisfies the following conditions: (1) e(x) = x for the neutral element e ∈ G and any x ∈ X; ( ) (2) (hg)(x) = h g(x) for any g, h ∈ G and x ∈ X. Consider the following equivalence relation on X: x ∼ y if and only if g(x) = y for some g ∈ G. We say that the equivalence ∼ is generated by the action of G on X. The corresponding quotient space X/∼ is usually denoted by X/G. The sets G(x) are called orbits, they are elements of X/G; the set X/G is called the orbit-space. Problem 2.17. Let X be a (metric space) and G{⊂ Iso(X) a subgroup of its isometry group. For each two elements } G(x), G(y) ∈ X/G we set d G(x), G(y) = inf ∣x′ y ′ ∣ : x′ ∈ G(x), y ′ ∈ G(y) . Prove that d = ρ∼ , where the equivalence ∼ is generated by the action of G on X. Problem 2.18. Let S 1 be the standard unit circle in the Euclidean plane. As a distance between x, y ∈ S 1 we take the length of the shortest arc of S1 between x and y. By the standard torus we mean the direct product T 2 = S 1 × S 1 (with the Euclidean binder). We describe the points on the both S 1 by their polar angles φ1 and φ2 , defined up to 2π. So, the shifts sa,b : (φ1 , φ2 ) ⊦→ (φ1 + a, φ2 + b) are isometries of T 2 . Fix some (a, b) ∈ ℝ2 and consider a subgroup Ga,b ⊂ Iso(T 2 ) consisting of all shifts sta,tb , t ∈ ℝ. For different a and b, find the corresponding pseudometric and metric quotient spaces. { } Construction 2.14 (Metrized graphs). Take an arbitrary collection of segments [ak , bk ] k∈K , and on the set ( ) ⊔{ak , bk } consider an equivalence ∼. The generalized pseudometric metric space ⊔[ak , bk ] / ∼ is called a metrized graph. To obtained its combinatorial it as we did before, namely, as a) triple G = (V, E, i). ( ) structure,{we represent } ( To do that, we put V := ⊔{ak , bk } / ∼, E := [ak , bk ] k∈K , and π : ⊔ {ak , bk } → ⊔{ak , bk } / ∼ be the canonical ( ) ( ) projection, then i [ai , bi ] := π {ai , bi } . Construction 2.15 (Polyhedral spaces). By a polyhedron of dimension n we mean an intersection of a finite number of half-spaces in ℝn that has nonempty interior. For each point x of a polyhedron W we define its dimension dimW x as follows: consider all affine subspaces L containing x such that x is an interior point of W ∩ L, and put dimW x to be the maximal dimension of such L. Denote by W k the subset in W consisting of all points of dimension k. One can show that W k consists of connected component each of which belongs of an affine subspace L of dimension k. The closures of these connected components are called the faces of W of dimension k. Now, consider an arbitrary collection {Wk }k∈K of polyhedra (probably, of different dimensions), and for some pairs (Wk , Wl ), k ̸= l, choose some faces Fkp ⊂ Wk and Flq ⊂ Wl such that there exists an isometry φ : Fkp → Flq . Consider the generalized metric space ⊔k∈K Wk and the equivalence relation generated by the isometries φ: we put in one class each point x ∈ Fkp and φ(x) ∈ Flq , for all isometries φ (we glue the faces by the isometry φ). The resulting quotient generalized space is called a polyhedron space. Problem 2.19. Represent the standard torus from Problem 2.18 as a polyhedron space. Problem 2.20. Verify that the above constructions actually, as declared, define (pseudo-)metrics. A huge collection of metric spaces can be found in [1].

2.4. Convergence and completeness

2.4

23

Convergence and completeness

Since each metric space is Hausdorff, the following result holds. Proposition 2.18. If a sequence converges in a metric space, then its limit is uniquely determined. A sequence x1 , x2 , . . . of points of a metric space X is called fundamental if for any ε > 0 there exists N such that for all m, n ≥ N the inequality ∣xm xn ∣ < ε holds. If every fundamental sequence in a metric space X is convergent, then such X is called complete. Problem 2.21. Show that a subspace of a complete metric space is complete if and only if it is closed. Theorem 2.19. Each metric space X is an everywhere dense subspace of some complete space called a completion of X. The completion of the space X is uniquely defined: for any two completions X ′ ⊃ X and X ′′ ⊃ X there is an isometry f : X ′ → X ′′ that is identical on X. Construction 2.16 (Completion). The standard construction of completion of a metric space X is as follows: • the set W of all fundamental sequences in the space X is considered; • on W , a pseudometric is set equal to the limit of the distances between the points of two fundamental sequences (the existence of the limit follows from the fundamentality); • X can be realized as a subset of W by associating each point x ∈ X with the constant sequence x, x, . . .. • the desired completion is obtained by factorization as in Problem 2.1. The uniqueness of completion implies the following result. Problem 2.22. Let X be an arbitrary subspace of a complete metric space. Then the closure X of the set X is a completion of the space X. Problem 2.23. Let f : X → Y be a bi-Lipschitz mapping of metric spaces. Prove that X is complete if and only of Y is complete. Construct a homeomorphism of metric spaces that does not preserves completeness. Problem 2.24. Show that a metric space is complete if and only if the following condition holds: for any sequence of closed subsets X1 ⊃ X2 ⊃ X3 ⊃ · · · such that diam Xn → 0 as n → 0, the intersection ∩∞ i=1 Xi is not empty (in fact, it consists of unique element). Show that the condition diam Xn → 0 is essential. Problem 2.25 (Fixed-point theorem). Let f : X → X be a C-Lipschitz mapping of a complete metric spaces X. Prove that for C < 1 there exists and unique a point x0 such that f (x0 ) = x0 (it is called the fixed point of the mapping f ).

2.5

Compactness and sequential compactness

Recall that a topological space is called compact if a finite subcover can be extracted from any of its open cover, and sequentially compact if any sequence of its points has a convergent subsequence. As noted in Section 1, in the case of general topological spaces, compactness and sequential compactness are different concepts. However, for metric spaces this is not so. Theorem 2.20. A metric space is compact if and only if it is sequentially compact. Proof. Let X be a compact metric space and x1 , x2 , . . . be an arbitrary sequence of points from X. We must show that there is a convergent subsequence in this sequence. Suppose this is not so, then • the sequence xi contains an infinite number of different points; • for each point x ∈ U := X \ {x1 , x2 , . . .} there is an (open) neighborhood U x that does not contain points of the sequence xi , therefore U = ∪x∈U U x and, thus, U is an open set; • for each xi there is a neighborhood U xi for which U xi ∩ {x1 , x2 , . . .} = {xi }.

2.5. Compactness and sequential compactness

24

But then the family {U, U x1 , U x2 , . . .} is an open cover of X from which it is impossible to choose a finite subcover (each finite subcover contains only finitely many points of the sequence xi ). The obtained contradiction shows that X is sequentially compact. Now let the space X be sequentially compact. Suppose that there exists an open covering 𝒰 = {Uα } of the space X that does not contain finite subcovers. On X we define a function ρ : X → ℝ as follows: ρ(x) = sup{r ∈ ℝ ∣ ∃Uα ∈ 𝒰 : Ur (x) ⊂ Ua }. This function is everywhere finite, since the sequentially compact space is bounded, and everywhere positive by virtue of the definition of metric topology. Lemma 2.21. The function ρ defined above is 1-Lipschitz and, therefore, continuous. Proof. Assume the contrary, i.e., that for some x, y ∈ X the inequality ∣ρ(y) − ρ(x)∣ > ∣xy∣ holds. Without loss of generality, we assume that ρ(y) > ρ(x), then ρ(y) > ρ(x) + ∣xy∣. Increase the number ρ(x) a little to ρ′ and slightly decrease the number ρ(y) to ρ′′ so that ρ′′ > ρ′ + ∣xy∣ is still true, then • Uρ′ (x) ⊂ Uρ′′ (y), because for an arbitrary point z ∈ Uρ′ (x) we have ∣zy∣ ≤ ∣zx∣ + ∣xy∣ < ρ′ + ∣xy∣ < ρ′′ ; • there is Uα ∈ 𝒰 for which Uρ′′ (y) ⊂ Uα . But then Uρ′ (x) ⊂ Uα , therefore ρ(x) ≥ ρ′ by the definition of the function ρ, a contradiction. By virtue of Problem 1.53, the function ρ achieves the smallest value ρ0 , which is, therefore, strictly positive. Put r = ρ0 /2. Then for each point x ∈ X there exists Uα ∈ 𝒰 such that Ur (x) ⊂ Uα . Choose an arbitrary point x1 ∈ X, and let U1 ∈ 𝒰 be such that Ur (x1 ) ⊂ U1 . There is x2 ∈ X \ U1 . Choose U2 ∈ 𝒰 such that Ur (x2 ) ⊂ U2 . In general, if x1 , . . . , xn and U1 , . . . , Un are selected, then there is xn+1 ∈ X \ ∪ni=1 Ui and Un+1 ∈ 𝒰 such that Ur (xn+1 ) ⊂ Un+1 . Since 𝒰 does not have a finite subcover, we construct an infinite sequence x1 , x2 , . . ., and it is clear that every point xn+1 lies outside ∪ni=1 Ur (xi ), so for any xi and xj , i ̸= j, we have ∣xi xj ∣ ≥ r. But such a sequence does not contain a convergent subsequence, which contradicts the sequential compactness of X. Problem 2.26. Give an example of a topological space that is (1) compact, but not sequentially compact; (2) sequentially compact, but not compact. Problem 2.27 (Lebesgue’s lemma). Let X be a compact metric space. Prove the following statement: for any open cover {Ui }i∈I of X there exists ρ > 0 such that for any x ∈ X one can find Ui with Bρ (x) ⊂ Ui . Problem 2.28. Show that each continuous mapping f : X → Y from a compact metric space to an arbitrary metric space is uniformly continuous. Problem 2.29. Prove that the diameter diam X of a compact metric space X if finite, and that there exist x, y ∈ X such that diam X = ∣xy∣.

2.5.1

Completeness and compactness

Theorem 2.20 easily yields the following result. Corollary 2.22. A compact metric space is complete. Proof. Let X be a compact metric space. In the space X we choose an arbitrary fundamental sequence x1 , x2 , . . .. By Theorem 2.20, there exists a subsequence xi1 , xi2 , . . ., converging to some point x ∈ X. Due to fundamentality of x1 , x2 , . . ., the whole this sequence converges to x, therefore the space X is complete. Definition 2.23. For ε > 0, a subset S of a metric space X is called an ε-net if for any point x ∈ X there exists s ∈ S such that ∣xs∣ < ε. A metric space is called totally bounded if for any ε > 0 there exists a finite ε-net in it.

2.6. Canonical isometric embeddings of metric spaces

25

Theorem 2.24. A metric space is compact if and only if it is complete and totally bounded. Proof. Let X be a compact metric space. Then, by virtue of Corollary 2.22, it is complete. We choose an arbitrary ε > 0, then {Uε (x)}x∈X is an open cover of X. Since X is compact, there exists a finite subcover {Uε (xi )}ni=1 . But then {x1 , . . . , xn } is a finite ε-net. Thus, the space X is totally bounded. Now let X be a complete and totally bounded metric space. We prove that X is sequentially compact and apply Theorem 2.20. Consider an arbitrary sequence xi in X. For each n ∈ ℕ, consider a finite 1/n-net Sn . The balls {U1 (s)}s∈S1 cover X, so there is a ball U1 among them that contains infinitely many elements of the sequence xi . The balls {U1/2 (s)}s∈S2 cover U1 , so there is a ball U2 among them for which C2 = U1 ∩ U2 contains infinitely many xi . If U1 , U2 , . . . , Uk are already selected so that Ck = ∩kj=1 Uj contains infinitely many xi , then the family {U1/(k+1) (s)}s∈Sk+1 , since it covers Ck , contains a ball Uk+1 for which there are infinitely many xi in Ck+1 = ∩k+1 j=1 Uj . Now, we choose an arbitrary xi1 ∈ C1 . Since there are infinitely many points of our sequence in C2 , there exists i2 > i1 such that xi2 ∈ C2 . Continuing this process, we construct a subsequence xi1 , xi2 , . . . such that xik , xik+1 , . . . ∈ Ck for each k, and since diam Ck → ∅ as k → ∞, this subsequence is fundamental and, therefore, converges because X is complete. Thus, X is sequentially compact. Problem 2.30. Prove that every compact metric space is separable. A subset of a topological space is called nowhere dense if its closure has empty interior. Problem 2.31 (Baire’s theorem). Prove that a complete metric space cannot be covered by at most countably many nowhere dense subsets. Moreover, the complement of the union of at most countably many nowhere dense subsets is everywhere dense. Problem 2.32. Prove that a compact metric space X cannot be isometrically mapped to a subspace Y ⊂ X such that Y ̸= X. In other words, each isometric mapping f : X → X for a compact metric space X is surjective. Problem 2.33. Let X be a compact metric space and f : X → X be a mapping. Prove that (1) if f is surjective and nonexpanding, then f is an isometry; | | (2) if |f (x)f (y)| ≥ ∣xy∣ for all x, y ∈ X, then f is an isometry.

2.6

Canonical isometric embeddings of metric spaces

Let X be an arbitrary metric space. We denote by C(X) the vector space of functions continuous on X, and by C | b (X)| the subspace of C(X) composed of all bounded functions, and consider on Cb (X) the norm ‖f ‖∞ = supx∈X |f (x)| and the corresponding metric ∣f g∣∞ = ‖f − g‖∞ . We define a mapping ν : X → Cb (X) as follows. For each point x ∈ X, by dx we denote the function dx : X → [0, ∞) defined by the rule dx (y) = ∣xy∣. This function is 1-Lipschitz and, therefore, dx ∈ |C(X). Now we fix some point | p ∈ X and consider the function dx − dp . Then for each y ∈ X we have |dx (y) − dp (y)| ≤ ∣xp∣, so the function dx − dp is bounded and, thus, belongs to Cb (X). Theorem 2.25 (Frechet, Kuratowski). The mapping ν : X → Cb (X) defined by the formula ν : x ⊦→ dx − dp is an isometric embedding. Proof. By virtue of Problem 2.5, it suffices to verify that the mapping ν is isometric. Choose arbitrary x, y ∈ X, then | | | | |(dx − dp )(dy − dp )| = sup |dx (z) − dy (z)| ≤ ∣xy∣ ∞ z∈X

| | by triangle inequality. On the other hand, if z = y, then |dx (z) − dy (z)| = ∣xy∣, which implies the required result. Let us prove a finer result. Recall that by ℓ∞ we denote the space of all bounded sequences with the metric defined by the norm ‖ · ‖∞ . Theorem 2.26 (Frechet). Let X be a separable metric space, then X can be isometrically embedded into ℓ∞ .

2.6. Canonical isometric embeddings of metric spaces

26

Proof. Consider an everywhere dense sequence x1 , x2 , . . . in X, which exists due to separability. Choose an arbitrary point p ∈ X and associate with each point x ∈ X a sequence ν(x) obtained by restricting the function dx − dp to the sequence x|1 , x2 , . . ., |namely, ν(x)(i) = dx (xi ) − dp (xi ). Since the functions dx − dp are bounded, then ν(x) ∈ ℓ∞ . The inequality |ν(x)ν(y)|∞ ≤ ∣xy∣ is verified in the same way as in the proof of Theorem 2.25. The converse inequality follows from the fact that the subset {xi } is everywhere dense in X: consider xik → x, then | | | | | | |ν(x)ν(y)| = sup |dx (z) − dy (z)| ≥ |dx (xi ) − dy (xi )| → dy (x) = ∣xy∣ as k → ∞. k k ∞ z∈X

The theorem is proved.

References to Chapter 2 [1] M.M.Deza, E.Deza, Encyclopedia of Distances. Springer, 2009.

27

Exercises to Chapter 2.

28

Exercises to Chapter 2 Exercise 2.1. Let X be a pseudometric space and ∼ is the natural equivalence relation: x ∼ y if and only if ∣xy∣ = 0. ′ For each x ∈ X denote by [x] the equivalence class containing x. Prove that for any x, y ∈ X, x′ ∈ [x], and | y ∈| [y] it is ′ ′ | true that ∣x y ∣ = ∣xy∣. Thus, on the set X/∼ the corresponding distance function is correctly defined: [x][y]| = ∣xy∣. Show that this distance function is a metric. Exercise 2.2. Let X be an arbitrary metric space, x, y ∈ X, r ≥ 0, s, t > 0, and A ⊂ X be nonempty. Verify that ( ) ( ( ) ( (1) Us {x} = Us x) and Br {x} = Br x); (2) the functions y ⊦→ ∣xy∣, y ⊦→ ∣yA∣ are continuous; (3) an open neighborhood Us (A) is an open subset of X, and a closed neighborhood Br (A) is a closed subset of X; ( ) (4) Ut Us (A) ⊂ Us+t (A) and construct an example demonstrating that the left-hand side can be different from the right-hand side; ( ) (5) Bt Bs (A) ⊂ Bs+t (A) and construct an example demonstrating that the left-hand side can be different from the right-hand side; (6) ∂Us (x), ∂Bs (x) are not related by any inclusion; ∂Us (x) ⊂ Ss (x) and ∂Br (x) ⊂ Sr (x); the both previous inclusions can be strict; (7) diam Us (x) ≤ diam Bs (x) ≤ 2s; (8) diam Us (A) ≤ diam Bs (A) ≤ diam A + 2s. Exercise 2.3. Let ℒ(f ) ⊂ ℝ be the set of all Lipschitz constants for a Lipschitz mapping f . Prove that inf ℒ(f ) is also a Lipschitz constant. Exercise 2.4. Show that each Lipschitz map is uniformly continuous, and each uniformly continuous map is continuous. Exercise 2.5. Show that each isometry is a homeomorphism, in particular, each isometric mapping of one metric space into another one is an embedding, i.e., we recall, it is a homeomorphism with an image. Exercise 2.6. Verify that the identity map, the composition of isometries, and the inverse mapping to an isometry are also isometries, i.e., the set of all isometries of an arbitrary metric space forms a group. Exercise 2.7. Let X be an arbitrary metric space, x, y ∈ X and A ⊂ X be nonempty. Prove that ∣Ax∣ + ∣xy∣ ≥ ∣Ay∣, so that the function ρA (x) = ∣Ax∣ is 1-Lipschitz and, therefore, uniformly continuous. Exercise 2.8. Describe the Cayley graphs for the following groups G and generating sets S: (1) G = ℤ and S = {1}; (2) G = ℤm and S = {1}; { } (3) G = ℤ2 and S = (1, 0) (0, 1) ; { } (4) G = ℤ2 and S = (1, 0) (0, 1), (1, 1) ; (5) G is a free group with generators a and b. Exercise 2.9. Let (X, ρ) be a metric (pseudometric) space, and ∼ an equivalence relation on X. Define the following quotient distance function on X: ρ∼ (x, y) = inf

n {∑ i=0

Prove that ρ∼ is a pseudometric on X.

} ρ(pi , qi ) : p0 = x, qn = y, n ∈ ℕ, qi ∼ pi+1 for all i .

Exercises to Chapter 2.

29

Exercise 2.10. Let ξ = (p0 − q0 ∼ p1 − q1 ∼ · · · ∼ pn − qn ) be an irreducible admissible sequence. Prove that (1) for any i < j we have pi ̸∼ pj , qi ̸∼ qj ; (2) qi ∼ pj if and only if j = i + 1; (3) for any i we have qi ̸= pi+1 . Exercise 2.11. Let ∼ be the trivial equivalence on a metric space (X, ρ), i.e., x ∼ y if and only if x = y. Prove that ρ∼ = ρ. Exercise 2.12. Let ∼ be an equivalence on a pseudometric space (X, ρ). Prove that for any x, y ∈ X it holds ρ∼ (x, y) ≤ ρ(x, y). Thus, if we define the function b : X × X → ℝ such that b(x, y) = 0 for x ∼ y, and b(x, y) = ρ(x, y) otherwise, then ρ∼ ≤ b. Exercise 2.13. Let X be a generalized pseudometric space. We can define two equivalence relations: x ∼1 y if and only if ∣xy∣ = ∞, and x ∼2 y if and only if ∣xy∣ = 0. Prove that each class of equivalence ∼1 is a pseudometric space (with finite distance), and that the distance between points from different classes equals ∞. Thus, if we denote by Xi the classes of equivalence ∼1 , then X = ⊔Xi . Prove that the space X/∼2 equals the disjoint union ⊔(Xi /∼2 ) of metric spaces Xi /∼2 . Exercise 2.14. Let X and Y be metric spaces, Z ⊂ X, and f : Z → Y is an isometric mapping. Let ρ be the metric on the X ⊔f Y . Prove that the restrictions of ρ to X and Y coincides with the initial metrics of X and Y , respectively. Exercise 2.15. Let y0 ∈ Y and f (X) = y0 . Prove that X ⊔f Y is isometric to Y . Exercise 2.16. Let X be a (metric space) and G{⊂ Iso(X) a subgroup of its isometry group. For each two elements } G(x), G(y) ∈ X/G we set d G(x), G(y) = inf ∣x′ y ′ ∣ : x′ ∈ G(x), y ′ ∈ G(y) . Prove that d = ρ∼ , where the equivalence ∼ is generated by the action of G on X. Exercise 2.17. Let S 1 be the standard unit circle in the Euclidean plane. As a distance between x, y ∈ S 1 we take the length of the shortest arc of S1 between x and y. By the standard torus we mean the direct product T 2 = S 1 × S 1 (with the Euclidean binder). We describe the points on the both S 1 by their polar angles φ1 and φ2 , defined up to 2π. So, the shifts sa,b : (φ1 , φ2 ) ⊦→ (φ1 + a, φ2 + b) are isometries of T 2 . Fix some (a, b) ∈ ℝ2 and consider a subgroup Ga,b ⊂ Iso(T 2 ) consisting of all shifts sta,tb , t ∈ ℝ. For different a and b, find the corresponding pseudometric and metric quotient spaces. Exercise 2.18. Represent the standard torus from Exercise 2.17 as a polyhedron space. Exercise 2.19. Verify that the constructions given in Section 2.3 do define (pseudo-)metrics, as declared. Exercise 2.20. Let d be a metric. Find the least possible c such that d + c is a pseudometric. Verify that for such c and any c′ > c the function d + c′ is a metric. Exercise 2.21. Show that a subspace of a complete metric space is complete if and only if it is closed. Exercise 2.22. Let X be an arbitrary subspace of a complete metric space. Then the closure X of the set X is a completion of the space X. Exercise 2.23. Let f : X → Y be a bi-Lipschitz mapping of metric spaces. Prove that X is complete if and only of Y is complete. Construct a homeomorphism of metric spaces that does not preserves completeness. Exercise 2.24. Show that a metric space is complete if and only if the following condition holds: for any sequence of closed subsets X1 ⊃ X2 ⊃ X3 ⊃ · · · such that diam Xn → 0 as n → 0, the intersection ∩∞ i=1 Xi is not empty (in fact, it consists of unique element). Show that the condition diam Xn → 0 is essential. Exercise 2.25 (Fixed-point theorem). Let f : X → X be a C-Lipschitz mapping of a complete metric spaces X. Prove that for C < 1 there exists and unique a point x0 such that f (x0 ) = x0 (it is called the fixed point of the mapping f ). Exercise 2.26. Give an example of a topological space that is (1) compact, but not sequentially compact;

Exercises to Chapter 2.

30

(2) sequentially compact, but not compact. Exercise 2.27 (Lebesgue’s lemma). Let X be a compact metric space. Prove the following statement: for any open cover {Ui }i∈I of X there exists ρ > 0 such that for any x ∈ X one can find Ui with Bρ (x) ⊂ Ui . Exercise 2.28. Show that each continuous mapping f : X → Y from a compact metric space to an arbitrary metric space is uniformly continuous. Exercise 2.29. Prove that the diameter diam X of a compact metric space X if finite, and that there exist x, y ∈ X such that diam X = ∣xy∣. Exercise 2.30. Prove that every compact metric space is separable. Exercise 2.31 (Baire’s theorem). A subset of a topological space is called nowhere dense if its closure has empty interior. Prove that a complete metric space cannot be covered by at most countably many nowhere dense subsets. Moreover, the complement of the union of at most countably many nowhere dense subsets is everywhere dense. Exercise 2.32. Prove that a compact metric space X cannot be isometrically mapped to a subspace Y ⊂ X such that Y ̸= X. In other words, each isometric mapping f : X → X for a compact metric space X is surjective. Exercise 2.33. Let X be a compact metric space and f : X → X be a mapping. Prove that (1) if f is surjective and nonexpanding, then f is an isometry; | | (2) if |f (x)f (y)| ≥ ∣xy∣ for all x, y ∈ X, then f is an isometry.

Chapter 3

Curves in Metric Spaces. Schedule. Curves in a topological space, parameter of a curve, reparametrization, polygonal line in a metric space, its edges, the length of the edge, the length of the polygonal line, the length of a curve in a metric space, rectifiable curves, properties of the length functional, intrinsic metric, generalized intrinsic pseudometric, maximal and minimal generalized pseudometrics, minimum of generalized intrinsic pseudometrics, quotients of generalized intrinsic pseudometric spaces, Hopf-Rinow condition, Hopf-Rinow theorem Part 1, convergence and uniform convergence in terms of the corresponding product spaces, limits of sequences of Lipschitz mappings, arc-length and uniform curves, reparametrizations, uniform reparametrizations, Arzela-Ascoli theorem, shortest curves and geodesics, existences theory for shortest curves, geodesic metric space, midpoints and ε-midpoints, existence of shortest curves in term of midpoints, intrinsic metrics and ε-midpoints.

In this section we discuss some results related to the geometry of curves in metric spaces. Recall that a curve in a topological space X is any continuous mapping γ : [a, b] → X from a segment [a, b] ⊂ ℝ with the standard topology; the variable t ∈ [a, b] is called the parameter of the curve γ, and the curve γ is sometimes written in the form γ(t). Each homeomorphism φ : [c, d] → [a, b] generates a new curve γ ◦ φ : [c, d] → X, about which we say that it is obtained from γ by replacement φ of the parameter t with the parameter s ∈ [c, d]. Moreover, if there is no misunderstanding, instead of the curve (γ ◦ φ)(s) we simply write γ(s). Such replacement φ is also called a reparametrization. Note that each reparametrization is a strictly monotonic continuous function. If the function φ grows, then we say that φ reverses the direction, otherwise that it changes the direction.

3.1

Rectifiable curves

Let X be a metric space. A finite sequence L = (A0 , . . . , An ) of points in the space X we called a polygonal line in X; moreover, the pairs (Ai−1 , Ai ) will be called edges of the polygonal line L, and the numbers ∣Ai−1 Ai ∣ the lengths of these edges. The sum of the lengths of all these edges we call the length of the polygonal line L and denote by ∣L∣. Let γ : [a, b] → X be an arbitrary( curve. For each) partition ξ = (a = t0 < t1 < · · · < tm = b), consider the corresponding polygonal line Lγ (ξ) = γ(t0 ), . . . , γ(tm ) (such polygonal lines will be called inscribed in the curve γ), then the value } {| | ∣γ∣ = sup |Lγ (ξ)| : ξ is a partition of the segment [a, b] is called the length of the curve γ. A curve γ is called rectifiable if ∣γ∣ < ∞. Let us give some examples of rectifiable curves. Example 3.1. Each C-Lipschitz curve γ : [a, b] → X is rectifiable, because for any partition ξ of the segment [a, b] we have ∣Lγ (ξ)∣ ≤ C(b − a) and, therefore, ∣γ∣ ≤ C(b − a) < ∞. Denote by Ω(X) the family of all curves in a metric space X, and by Ω0 (X) ⊂ Ω(X) the subfamily of all rectifiable curves. Note that on Ω(X) and on Ω0 (X) there are defined (1) the restriction of each curve γ : [a, b] → X to each subsegment [c, d] ⊂ [a, b]; (2) the gluing γ1 · γ2 of those pairs of curves γ1 : [a, b] → X, γ2 : [b, c] → X for which γ1 (b) = γ2 (b), namely, (γ1 · γ2 ) : [a, c] → X is the curve whose restrictions to [a, b] and [b, c] coincide with γ1 and γ2 , respectively; (3) the reparametrization and equivalence identifying curves that differ by parameterization. The following proposition describes some properties of the length of a curve. 31

3.1. Rectifiable curves

32

Proposition 3.2. Let X be an arbitrary metric space. Then (1) generalized triangle inequality: if γ ∈ Ω(X) joins the points x, y ∈ X, then ∣γ∣ ≥ ∣xy∣; (2) additivity: if γ = γ1 · γ2 is the gluing of curves γ1 , γ2 ∈ Ω(X) then ∣γ∣ = ∣γ1 ∣ + ∣γ2 ∣; | | (3) continuity: for any γ ∈ Ω0 (X), γ : [a, b] → X, the function f (t) = |γ∣[a,t] | is continuous; (4) independence from parameter: for each γ ∈ Ω(X), γ : [a, b] → X, and reparametrization φ : [c, d] → [a, b], it holds ∣γ∣ = ∣γ ◦ φ∣; (5) consistency with topology: for each x ∈ X, ε > 0, y ∈ X \ Uε (x), and a curve γ ∈ Ω(X) joining x and y, it holds ∣γ∣ ≥ ε; (6) lower semicontinuity: for any sequence γn ∈ Ω0 (X) that converges pointwise to some γ ∈ Ω0 (X), we have ∣γ∣ ≤ lim inf ∣γn ∣. n→∞

Proof. Only the items (3) and (6) are nontrivial, we prove them. (3) Choose |an arbitrary| t ∈ [a, b] and show that for any ε > 0 there is δ > 0 such that for all s ∈ [a, b] ∩ (t − δ, t + δ) the inequality ||f (t) −|f (s)| < ε holds. Put ℓ = ∣γ∣. By definition, there exists a partition ξ of the segment [a, b] such that ℓ − ε/2 < |Lγ (ξ)| ≤ ℓ. If t ̸∈ ξ, add | it to | ξ (we denote the resulting partition by the same letter). It is clear that for the resulting partition, ℓ − ε/2 < |Lγ (ξ)| ≤ ℓ is still satisfied. For δ1 we take the distance from t to the nearest element of the partition ξ, other than t. Since subdivisions of the partition ξ can change the length of the polygonal line Lγ (ξ) only within (ℓ−ε/2, ℓ], then for each s ∈ [a,| b]∩(t−δ|1 , t+δ1 ) | | the length ℓts = |f (t) − f (s)| of the fragment of the curve γ between points γ(t) and γ(s) differs from |γ(t)γ(s)| by less than ε/2.| On the other hand, since the map γ is continuous, there exists δ2 > 0 such that for all s ∈ [a, b]∩(t−δ2 , t+δ2 ) | we have |γ(t)γ(s)| < ε/2. It remains to put δ = min{δ1 , δ2 }. (6) Choose an arbitrary ε > 0 and show that for sufficiently large n the inequality ∣γ∣ ≤ ∣γn ∣ + ε holds, thus ∣γ∣ ≤ lim inf n→∞ ∣γn ∣ + ε and, due to the arbitrariness of ε, we get what is required. So,| let ε| > 0 be fixed. Choose a partition ξ = (a = t0 < t1 < · · · < tm = b) |of the segment [a, b] such that | ε ∣γ∣ − |Lγ (ξ)| < ε/2. There is N such that for any n > N and all i the inequality |γ(ti )γn (ti )| < 4m holds. This immediately implies that | | | | |γ(ti−1 )γ(ti )| < |γn (ti−1 )γn (ti )| + ε , 2m | | | | | | | | therefore Lγ (ξ) < Lγn (ξ) + ε/2. Thus, | | ∣γ∣ < |Lγ (ξ)| + ε/2 < ∣Lγn (ξ)∣ + ε/2 + ε/2 ≤ ∣γn ∣ + ε, as required. Problem 3.1. Prove the remaining items of Proposition 3.2. Problem 3.2. Will the items (3) and (6) of Proposition 3.2 remain true if we change Ω0 (X) to Ω(X)? Problem 3.3. Show that a piecewise smooth curve in ℝn is Lipschitzian with a Lipschitz constant equal to the maximum modulus of the velocity vector of the curve, therefore every such curve is rectifiable. Let X be a metric space in which any two points are connected by a rectifiable curve. Then for any x, y ∈ X the quantity { } din (x, y) = inf ∣γ∣ : γ is a curve joining x and y is finite. Problem 3.4. Let X be a metric space in which any two points are connected by a rectifiable curve. (1) Prove that din is a metric. (2) Denote by τ the metric topology of X w.r.t. the initial metric on X, by τin the metric topology w.r.t. din , by Xin the set X with metric din and topology τin . Show that τ ⊂ τin . In particular, if a mapping γ : [a, b] → Xin is continuous, then the mapping γ : [a, b] → X is continuous as well.

3.2. Maximal and minimal pseudometrics, quotients

33

(3) Construct an example when τ ̸= τin . (4) Prove that for each rectifiable curve γ : [a, b] → X the mapping γ : [a, b] → Xin is continuous. (5) Denote by ∣γ∣in the length of a curve γ : [a, b] → Xin . Show that for each curve γ : [a, b] → X which is also a curve in Xin , it holds ∣γ∣ = ∣γ∣in . Thus, the sets of rectifiable curves for X and Xin coincide, and each non-rectifiable curve in X is either a non-rectifiable one in Xin , or the mapping γ : [a, b] → Xin is discontinuous. (6) Construct an example of continuous mapping γ : [a, b] → X such that the mapping γ : [a, b] → Xin is not continuous. Notice that the curve γ : [a, b] → X can not be rectifiable. Definition 3.3. If din coincides with the original metric, then the original metric is called intrinsic. A metric space with an intrinsic metric is also called intrinsic. Problem 3.5. Let X be a metric space in which any two points are connected by a rectifiable curve. Prove that the metric din is intrinsic. Example 3.4. Let S 1 be the standard circle on the Euclidean plane ℝ2 . (1) If for x, y ∈ S 1 we put d(x, y) equal to the distance in ℝ2 between these points, then the metric d on S 1 is not intrinsic. (2) If, for d(x, y), we choose the length of the smaller of the two arcs of the circle S 1 into which x and y divide it, then the resulting metric will be intrinsic. Remark 3.5. If we allow generalized pseudometrics, then we can define din and intrinsic metric not only for those metric spaces where each pair of points is connected by a rectifiable curve, but for generalized pseudometric spaces too. To do that, we need only put inf ∅ = ∞, where inf is applied to subsets of [0, ∞]. Thus, if x, y ∈ X cannot be joined by a curve, then we get din (x, y) = ∞. The same holds for x and y joined by non-rectifiable curves only. Recall that in the previous chapter we introduced the equivalence relation ∼1 such that x ∼1 y if and only if the generalized distance between these points equals ∞. Applying this equivalence to the generalized din , we get a partition of the space X into metrics subspaces with finite din .

3.2

Maximal and minimal pseudometrics, quotients

In Chapter 2 we defined maximal pseudometric, see Construction 2.12, and we demonstrated its relation with quotient distance. Now, let us define maximal and minimal pseudometrics for arbitrary families of generalized pseudometrics. Also, we apply these generalizations to investigation of intrinsic distances. Construction 3.1 (Maximal and minimal pseudometric for arbitrary family of pseudometrics). Let X be a set, and 𝒟 an arbitrary nonempty family of generalized pseudometrics on X. Then we consider the standard partial order on the set of all generalized pseudometrics on X, namely, d1 ≤ d2 if d1 (x, y) ≤ d2 (x, y) for all x, y ∈ X. By means of this partial order, we define inf 𝒟 and sup 𝒟 in the standard way. Put d𝒟 (x, y) = supd∈𝒟 d(x, y), then the same proof as for Lemma 2.16 can be carried out to obtain that d𝒟 is a generalized pseudometric and d𝒟 = sup 𝒟, thus, sup 𝒟 exists for any nonempty 𝒟. We call d𝒟 the maximal generalized pseudometric for the family 𝒟. Notice that Construction 2.12 is a particular case of the present one: db = sup 𝒟b . Now, let us put b(x, y) = inf d∈𝒟 d(x, y). This b is not necessarily a generalized pseudometric (construct an example), however, the db = sup 𝒟b is, and we denote it by d𝒟 . Proposition 3.6. Under above notations, d𝒟 = inf 𝒟, and, thus, inf 𝒟 exists for any nonempty 𝒟. Proof. Since for any d′ ∈ 𝒟b and d ∈ 𝒟 we have d′ ≤ b ≤ d, then d𝒟 ≤ 𝒟, i.e., d𝒟 is a lower bound for 𝒟. Now, let d′ be an arbitrary lower bound for 𝒟, then d′ ≤ b and, therefore, d′ ∈ 𝒟b . Thus d′ ≤ d𝒟 . We call d𝒟 the minimal generalized pseudometric for the family 𝒟. Also, we can define sup 𝒟 and inf 𝒟 for empty family 𝒟: the first is the zero pseudometric, and the last is the generalized pseudometric equal ∞ for any pair of distinct points. Below we will use the following result.

3.3. Hopf–Rinow condition

34

Problem 3.6. Let ρ1 ≤ ρ2 be generalized pseudometrics on a set X, and Y be a topological space. Prove that each mapping f : Y → X, continuous w.r.t. ρ2 , is also continuous w.r.t. ρ1 , in particular, if γ is a curve in (X, ρ2 ), then γ is also a curve in (X, ρ1 ); moreover, if ρ′1 and ρ′2 denote the corresponding generalized intrinsic pseudometrics, then ρ′1 ≤ ρ′2 . Proposition 3.7. Let X be a set, and 𝒟 an arbitrary family of intrinsic generalized pseudometrics on X. Then d𝒟 = inf 𝒟 is intrinsic. Proof. We put b(x, y) = inf d∈𝒟 d(x, y), ρ = d𝒟 = db , and denote by ρ′ ≥ ρ the generalized intrinsic pseudometric corresponding to ρ′ . Since ρ ≤ d for all d ∈ 𝒟, and all d are intrinsic, then ρ′ ≤ d for all d ∈ 𝒟 due to Problem 3.6. Thus, ρ′ ≤ b and, therefore, ρ′ ∈ 𝒟b . However, ρ is maximal for the class 𝒟b , so ρ ≥ ρ′ and we get ρ = ρ′ . In the previous chapter we considered the construction of quotient spaces. What can we say about the quotient space if the distance function of the initial one is intrinsic? Proposition 3.8. Let X be a generalized pseudometric space whose distance function is intrinsic, and ∼ an arbitrary equivalence on X. Then the generalized pseudometric of X/∼ is intrinsic as well. Proof. Define d : X × X → [0, ∞] by setting d(x, y) = 0 if x ∼ y, and d(x, y) = ∞ otherwise. It is easy to see that d is a generalized pseudometric. Thus, the space (X, d) is partitioned into subspaces Xi such that ∣Xi Xj ∣ = ∞ for i ̸= j, and the distance between points of each Xi vanishes. Thus, the topology on each Xi is anti-discrete, so each mapping [a, b] → Xi is continuous, i.e., it is a curve, and such curve has zero length. This implies that the distance function of X is intrinsic. Let ρ be the original distance function of X. Let us put 𝒟 = {ρ, d}, then ρ∼ = inf 𝒟, because min{ρ, d} equals the function b∼ from the definition of the quotient space. It remains to apply Proposition 3.7. Problem 3.7. Let X be an arbitrary set covered by a family {Xi }i∈I of generalized pseudometric spaces. Denote the distance function on Xi by ρi , and consider the set 𝒟 of all generalized pseudometrics d on X such that for any i and x, y ∈ Xi it holds d(x, y) ≤ ρi (x, y). Extend each ρi to the whole X by setting ρ′i (x, y) = ∞ if at least one of x, y does not belong to Xi , and ρ′i (x, y) = ρ(x, y) otherwise (it is easy to see that each ρ′i is a generalized pseudometric). Denote by 𝒟′ the set of all such ρ′i . Prove that sup 𝒟 = inf 𝒟′ , and if all ρi are intrinsic, then sup 𝒟 is intrinsic as well. Problem 3.8. Let 𝒟 be a collection of generalized pseudometrics defined on the same set X, and Xd for d ∈ 𝒟 denote the generalized pseudometric space (X, d). Put W = ⊔d∈𝒟 Xd and denote by ρ the generalized pseudometric of W . Define on W an equivalence relation ∼ by identifying those points xd ∈ Xd and xd′ ∈ Xd′ which correspond to the same point x of the set X. The equivalence class of these points xd and xd′ we denote by [x]. Denote by ρ∼ the quotient generalized pseudometric on W/∼. Define the mapping φ : W/∼→ X as φ : [x] → x, then φ is bijective, and ρ∼ can be considered as a generalized pseudometric on X. Prove that ρ∼ = inf 𝒟. Problem 3.9. Let ρ1 and ρ2 be intrinsic metrics on a set X. Suppose that these metrics generate the same topology, and that each x ∈ X has a neighborhood U x such that the restrictions of ρ1 and ρ2 to U x coincide. Prove that ρ1 = ρ2 . Show that the condition “ρ1 and ρ2 are intrinsic” is essential.

3.3

Hopf–Rinow condition

General metric spaces can be geometrically very different from ℝn . For example, in discrete spaces, balls of nonzero radius can coincide with their centers. In particular, the distance from an arbitrary point to such a ball will be equal to the distance from this point to the center. In spaces with an internal metric, this does not occur. Theorem 3.9 (Hopf–Rinow condition). Let X be a space with an intrinsic metric, x, y ∈ X, x ̸= y, and 0 < r ≤ ∣xy∣. Then | | |yUr (x)| = ∣xy∣ − r. Remark 3.10. For general| metric|spaces X, Theorem 3.9 does not hold. For example, if X = {x, y}, ∣xy∣ = 1, and r = 0.5, then Ur (x) = {x}, |yUr (x)| = 1 ̸= ∣xy∣ − r = 0.5.

3.4. Local compactness

35

| | Proof of Theorem 3.9. For any point z ∈ Ur (x) we have ∣yz∣ ≥ ∣yx∣ − ∣zx∣ > ∣xy∣ − r, therefore |yUr (x)| ≥ ∣xy∣ − r. Let us prove that the converse inequality also holds. For each 0 < ε < r we consider a |rectifiable curve γ : [0, 1] → X, x = γ(0) and y = γ(1), for which ∣γ∣ ≤ ∣xy∣+ε. We | define a continuous function f (t) = |xγ(t)|, f (0) = 0, f (1) = ∣xy∣, and choose an arbitrary t0 such that f (t0 ) = r − ε. We denote by γ1 the part of the curve γ between 0 and t0 , and by γ2 the remaining part of γ. Then ∣γ1 ∣ ≥ r−ε | the curve | |γ(t0 )y | ≤ ∣γ2 ∣ ≤ ∣xy∣ − r + 2ε. by Item (1) of Proposition 3.2, so |∣γ2 ∣ ≤ ∣xy∣ − r + 2ε and, by the same proposition, | However, γ(t0 ) ∈ Ur (x), therefore |yUr (x)| ≤ ∣xy∣ − r + 2ε. Since ε is arbitrary, we obtain what is required. Remark 3.11. The Hopf–Rinow condition can also be satisfied in spaces whose metric is not intrinsic, for example, in the metric space ℚ of all rational numbers (with the standard distance function). We give some corollaries from Theorem 3.9. First we give a necessary definition. Let X be a metric space, x ∈ X, r ≥ 0. Note that a closed ball Br (x) is a closed set, but, generally speaking, different from the closure of the open ball Ur (x): if, as in the above example, X consists of two points x and y at the distance 1, then U1 (x) = {x}, B1 (x) = {x, y}, U1 (x) = {x} ̸= B1 (x). However, if the metric of the space X is intrinsic, then Theorem 3.9 immediately implies the following result. Corollary 3.12. Let X be a space with an intrinsic metric. Then Br (x) = Ur (x). Proof. A point y is adherent for a ball Ur (x) if and only if ∣yUr (x)∣ = 0, thus ∣xy∣ ≤ r, i.e., y ∈ Bx (r) and, therefore, Ur (x) ⊂ Br (x). Let us prove the reverse inclusion. Let y ∈ Br (x). If ∣xy∣ < r, then y ∈ Ur (x) ⊂ Ur (x). If ∣xy∣ = r, then, by Theorem 3.9, we have ∣yUr (x)∣ = ∣xy∣ − r = 0, thus y ∈ Ur (x). The following result will be used in the proof of the first part of Hopf–Rinow Theorem. Corollary 3.13. Let X be a space with intrinsic metric and ε > 0. Then for each ε-net S in the ball Br (x) ⊂ X and any δ ′ > δ > 0 we have Br+δ (x) ⊂ ∪s∈S Uε+δ′ (s), i.e., S is (ε + δ ′ )-net for Br+δ (x). Proof. For any| point |y ∈ Br+δ (x) we have ∣xy∣ ≤ r + δ, therefore either y ∈ Ur (x) and, thus, ∣yUr (x)∣ = 0, or, by Theorem 3.9, |yUr (x)| ≤ δ holds. Thus, for any δ ′ > δ there exists z ∈ Ur (x) ⊂ Br (x) such that ∣yz∣ < δ ′ . On the other hand, there exists s ∈ S for which Uε (s) ∋ z, whence ∣sy∣ ≤ ∣sz∣ + ∣zy∣ < ε + δ ′ , therefore y ∈ Uε+δ′ (s), as required.

3.4

Local compactness

Definition 3.14. A metric space X is called locally compact if for every point x ∈ X there exists ε > 0 such that the closed ball Bε (x) is compact. Problem 3.10. Prove that a metric space X is locally compact if and only if for each point x ∈ X there exists a neighborhood with compact closure. Remark 3.15. Unlike compactness, local compactness, even in combination with the intrinsic metric, does not guarantee the completeness of the metric space. An obvious example is an open ball in Euclidean space. Another example is the Euclidean space with a point removed. Theorem 3.16 (Hopf-Rinow, Part 1). Let X be a locally compact space with intrinsic metric. Then the space X is complete if and only if every closed ball in X is compact. Proof. Suppose first that each closed ball is compact. We prove the completeness. Consider an arbitrary fundamental sequence x1 , x2 , . . .. Then there exists r such that all xn are contained in Br (x1 ). By Theorem 2.24, the ball Br (x1 ) is complete, therefore the sequence x1 , x2 , . . . converges to some point x ∈ Br (x) ⊂ X, as required. Now let the space X be complete. On X we define a function ρ : X → [0, ∞] as follows: ρ(x) = sup{r > 0 : the ball Br (x) is compact}. Lemma 3.17. Suppose that there exists a point x0 ∈ X such that ρ(x0 ) = ∞. Then each ball Br (x) is compact and, therefore, ρ is identically equal to ∞.

3.5. Lipschitz, convergence and uniform convergence

36

Proof. For every x and r > 0, the ball Br (x) is contained in some compact ball Br′ (x0 ), therefore, since the set Br (x) is closed, the ball Br (x) is also compact. Thus, it suffice to prove that there exists a point x0 ∈ X such that ρ(x0 ) = ∞. Assume the contrary, i.e., that the function ρ is everywhere finite. Lemma 3.18. The function ρ is 1-Lipschitz and, therefore, continuous. | | Proof. Otherwise, there exists x, y ∈ X such that |ρ(x) − ρ(y)| > ∣xy∣. To be definite, assume that ρ(x) ≥ ρ(y), thus ρ(x) > ρ(y) + ∣xy∣, and if ε > 0 is chosen in such a way that ρ(x) > ρ(y) + 2ε + ∣xy∣, then Bρ(y)+ε (y) ⊂ Bρ(x)−ε (x), however, Bρ(x)−ε (x) is compact, thus Bρ(y)+ε (y) is compact as well, a contradiction with definition of ρ(y). Lemma 3.19. Under the assumptions made, the ball Bρ(x) (x) is compact for every x. Proof. Since the ball Bρ(x) (x) is a closed subset of the complete space X, this ball is also complete. Therefore, by Theorem 2.24, it suffices to prove that for each ε > 0 this ball contains a finite ε-net. To do that, we choose 0 < r < ρ(x) such that δ := ρ(x) − r < ε/2, then the ball Br (x) is compact, thus it contains a finite (ε/2)-net S. By Corollary 3.13, the set S is (ε/2 + ε/2)-net for the ball Br+δ (x) = Bρ(x) (x). Since ρ is a continuous function, its restriction to the compact set Bρ(x) (x) attains its minimum and, thus, this minimum is positive. We denote this minimum by ε, then, by Lemma 3.19, all the balls Bε (y), y ∈ Bρ(x) (x) are compact. Let S be a finite (ε/2)-net in Bρ(x) (x), and 0 < δ < ε/2, then, by Corollary 3.13, the set S is (ε/2 + ε/2)-net for the ball Bρ(x)+δ (x). In particular, Bρ(x)+δ (x) is contained in the set ∪s∈S Bε (s) which is compact as a finite union of compact sets. Therefore, Bρ(x)+δ (x) is compact that contradicts to definition of the function ρ. Definition 3.20. A metric space in which every closed ball is compact is called proper or boundedly compact. Corollary 3.21. A metric space with an intrinsic metric is boundedly compact if and only if it is locally compact and complete. Problem 3.11. Show that a metric spaces is boundedly compact if and only if its compact subsets are exactly those subsets that are closed and bounded.

3.5

Lipschitz, convergence and uniform convergence

In this section, we state and prove a few useful technical results regarding the convergence of Lipschitz mappings. Let fn : X → Y be a family of arbitrary mappings from a set X to a metric space Y . We say that the sequence fn converges pointwise to a mapping f : X → Y if for each x ∈ X the sequence fn (x) converges to f (x). The sequence fn converges uniformly to a mapping f : X → Y if for any ε > 0 there exists N such that for every n ≥ N the inequality | | |f (x)fn (x)| < ε holds for all x ∈ X. Remark 3.22. Let us introduce the convergences described above by representing the mappings fn as points of the ∏ set x∈X Y , which, recall, we defined as the family Y X of all mappings from X to Y . Now we model the pointwise and uniform convergences by means of some topologies. ∏ We start with the case of pointwise convergence. We define on Y X = x∈X Y the Tychonoff’s topology, see Construction 1.3. Problem 3.12. Show that convergence in the Tychonoff topology of points fn to a point f is equivalent to pointwise convergence of the mappings fn to the mapping f . To model uniform convergence, we first give some definitions. A mapping f : X → Y is called bounded if its image f (X) is a bounded subset of Y . The family of all bounded |mappings| from X to Y we denote by ℬ(X, Y ). We define the following distance function on ℬ(X, Y ): ∣f g∣ = supx∈X |f (x)g(x)|. Problem 3.13. Prove that the distance function defined above on ℬ(X, Y ) is a metric, and that the convergence in this metric of a sequence fn ∈ ℬ(X, Y ) to a point f ∈ ℬ(X, Y ) is equivalent to the uniform convergence of the mappings fn to the mapping f . | | For arbitrary fn and f we can also define such convergence by considering ∣f g∣ = supx∈X |f (x)g(x)| as generalized metric.

3.6. Arc-length and uniform curves

37

| | Problem 3.14. Prove that the generalized distance function ∣f g∣ = supx∈X |f (x)g(x)| defined on Y x is a generalized metric, and that the convergence in this generalized metric of a sequence fn ∈ Y X to a point f ∈ Y X is equivalent to the uniform convergence of the mappings fn to the mapping f . Proposition 3.23. Let X be compact, and Y be arbitrary metric spaces, and let fn : X → Y be a sequence of CLipschitz mappings converging pointwise to some mapping f : X → Y . Then f is a C-Lipschitz mapping, and the sequence fn converges to f uniformly. | | Proof. To verify that the mapping f is C-Lipschitz, it is sufficient to pass to the limit in the inequality |fn (x)fn (x′ )| ≤ C · ∣xx′ ∣ for arbitrary fixed x, x′ ∈ X. We now prove uniform convergence. | |Choose an arbitrary ε > 0 and show that there exists N such that for all n > N and all x ∈ X we have |f (x)fn (x)| < ε. and let {xi } ⊂ X be a finite δ-net. We choose N such that for all n > N and all i the inequality | Put δ = ε/(3C), | |f (xi )fn (xi )| < ε/3 holds. x ∈ X. There is i such| that ∣xxi ∣| < δ. Since fn and f are C-Lipschitz, we conclude that | Fix an arbitrary | |fn (x)fn (xi )| ≤ C · ∣xxi ∣ < ε/3 and, similarly, |f (x)f (xi )| < ε/3, therefore | | | | | | | | |f (x)fn (x)| ≤ |f (x)f (xi )| + |f (xi )fn (xi )| + |fn (xi )fn (x)| < ε/3 + ε/3 + ε/3 = ε, as required. The following version of the previous statement is useful in studying curves. Corollary 3.24. Let X be a metric space, and γn : [a, b] → X be a sequence of C-Lipschitz curves converging pointwise to a mapping γ : [a, b] → X. Then γ is a C-Lipschitz curve, and the sequence γn converges to γ uniformly. The proposition below can be proved similarly to Proposition 3.23. Proposition 3.25. Let X be compact, Y be an arbitrary metric spaces, and fn : X → Y be a sequence of C-Lipschitz mappings. Suppose that for some everywhere dense subset Z ⊂ X the sequence fn ∣Z converges pointwise. Then the sequence fn converges pointwise to some mapping f : X → Y and, therefore, by Proposition 3.23, this convergence is uniform, and the mapping f is C-Lipschitz. Corollary 3.26. Let γn : [a, b] → X be a sequence of C-Lipschitz curves in a metric space X. Suppose that for some everywhere dense subset Z ⊂ [a, b] the sequence of mappings γn ∣Z converges pointwise. Then the sequence of curves γn converges pointwise to some curve γ : [a, b] → X and, therefore, by virtue of Corollary 3.24, this convergence is uniform, and the curve γ is C-Lipschitz.

3.6

Arc-length and uniform curves

Definition | 3.27. | A curve γ(s) and its parameter s ∈ [a, b] are called natural or arc-length, if for any a ≤ s1 ≤ s2 ≤ b it holds |γ∣[s1 ,s2 ] | = s2 − s1 . A curve| γ(t) and | its parameter t ∈ [a, b] are called uniform, if there exists λ ≥ 0 such | that for any a ≤ t1 ≤ t2 ≤ b it holds γ∣[t1 ,t2 ] | = λ(t2 − t1 ); the value λ is called the velocity or the speed of uniform γ. Remark 3.28. Let γ(t), t ∈ [a, b], be uniform curve. (1) If a ̸= b, then its velocity λ is uniquely determined by the equation ∣γ∣ = λ(b − a). In particular, ∣γ∣ = 0, i.e., γ is a constant mapping if and only if λ = 0. (2) If a = b, then λ can be arbitrary. However, it is natural to consider this case as the limiting one for constant mappings γ, where λ = 0. So, to be definite, we make the following agreement: if a = b then λ = 0. Thus, under the above agreement, the velocity of a uniform curve γ vanishes if and only if γ is a constant mapping. We call such curves degenerate, and all the remaining curves nondegenerate. So, each degenerate curve is uniform. Remark 3.29. The following simple observations concern the relations between arc-length and uniform curves. (1) Each arc-length curve is uniform. (2) A degenerate curve γ(s), s ∈ [a, b], is arc-length if and only if a = b; in this case the arc-length curve has zero velocity.

3.6. Arc-length and uniform curves

38

(3) A degenerate curve γ(t), t ∈ [a, b], with a < b is uniform but not arc-length. (4) A nondegenerate uniform curve is arc-length if and only if its velocity equals 1. Proposition 3.30. If a curve γ(t), t ∈ [a, b], is uniform with the velocity λ, then the mapping γ is λ-Lipschitz. Proof. For any a ≤ t1 ≤ t2 ≤ b we have | | | |γ(t1 )γ(t2 )| ≤ |γ∣[t

1 ,t2 ]

| | = λ(t2 − t1 ).

Remark 3.31. The following types of curves γ : [a, b] → X cannot be reparameterized to arc-length curves: (1) not rectifiable γ (otherwise, b = ∞); (2) γ = const when a ̸= b; (3) more general, γ containing stops, i.e., when there exists [α, β] ⊂ [a, b], α ̸= β, such that γ∣[α,β] = const. Curves that do not contain stops are called non-stop ones. Problem 3.15. Let γ be a curve in a metric space. Prove that (1) nondegenerate γ can be reparameterized to an arc-length or, more generally, to a uniform one if and only if γ is rectifiable and non-stop; (2) degenerate γ can be reparameterized to an arc-length one if and only if its domain is singleton (indeed, such γ is arc-length itself and, thus, it need not a reparametrization); (3) degenerate γ is always uniform. In [1] it is proposed to extend the class of reparametrizations, namely, to consider monotone (not necessarily strictly monotone) surjective mappings between the domains of the curves. It turns out that with this definition of reparametrization, it is possible to introduce an arc-length parameter on any rectifiable curve. Definition 3.32. We say that curves γ : [a, b] → X and γ¯ : [c, d] → X are obtained from each other by a monotone reparametrization if either there exists a monotone surjective mapping φ : [c, d] → [a, b] such that γ¯ = γ ◦ φ, or there exists a monotone surjective mapping ψ : [a, b] → [c, d] such that γ = γ¯ ◦ ψ. Remark 3.33. It is easy to see that monotone reparametrization does not change the length. Problem 3.16. Prove that a curve γ in a metric space can be monotonically reparameterized to an arc-length or, more generally, a uniform one if and only if γ is rectifiable. The reparameterized curve is unique upto the choice of its domain and direction. In arc-length case one can choose any segment of the length ∣γ∣. In the uniform case the domain can be arbitrary nondegenerate segment for nondegenerate γ, and arbitrary segment for degenerate γ. Remark 3.34. An instructive example is a curve that is a parametrization of the segment [0, 1] ⊂ ℝ by the Cantor staircase. The Cantor staircase is a graph of a function f : [0, 1] → [0, 1], the construction of which we will now describe. At the points 0 and 1, we set the value of the function f equal to 0 and 1, respectively. Next, we divide the segment [0, 1] into three equal parts and on the middle interval we set f equal to 1/2. The remaining two segments are again divided into three equal parts each, and on the middle intervals we assume that the function f is equal to the arithmetic mean of its values at the nearest intervals where it is defined. Thus, on the left-most interval, the function f is equal to 1/4, and on the right-most interval it is equal to 3/4. Continuing this process to infinity, we define a function f on an everywhere dense subset of the segment [0, 1], which is the complement to the Cantor set. Extend f to the remaining points of the segment [0, 1] by continuity (make sure that this can be done). We now consider the Cantor staircase as a curve f : [0, 1] → ℝ on the Euclidean line. Note that the subset of the segment [0, 1], on which the point of this curve changes its position, is the Cantor set that has measure zero. So, this curve stops almost everywhere, however, its length equals 1 and it can be reparameterized to an arc-length curve.

3.7. Arzela-Ascoli Theorem

3.7

39

Arzela-Ascoli Theorem

Developing the ideas from Section 3.5, we formulate and prove a variant of the famous Arzela–Ascoli theorem. First we give necessary definitions. Definition 3.35. Let γn : [an , bn ] → X be a sequence of curves in a metric space X. We say that this sequence converges (uniformly converges) to a curve γ : [a, b] → X if there exist curves γ¯n : [c, d] → X and γ¯ : [c, d] → X obtained from γn and γ, respectively, by monotone reparametrization, such that the mappings γ¯n converge (uniformly converge) to the mapping γ¯ . Theorem 3.36 (Arzela–Ascoli). Let X be a compact metric space, and γn be a sequence of curves in X. Suppose that the lengths of the curves γn are uniformly bounded, i.e., there exists a real number C such that ∣γn ∣ ≤ C for all n. Then in this sequence there is a subsequence that converges uniformly to a curve whose length is at most C. Proof. In virtue of Problem 3.16, the curves γn can be monotonically reparameterized to uniform curves γ¯n : [0, 1] → X with speeds at most C. It follows from Remark 3.33 and Proposition 3.30 that all the curves γ¯)n are C-Lipschitz. ( ∞ Choose a countable everywhere dense subset Z ⊂ [0, 1], Z = {zi }∞ . The sequence γ ¯ (z ) has a convergent n 1 i=1 n=1 ( 1 ( 1 ( 2 )∞ )∞ )∞ subsequence γn (z1 ) n=1 ; the sequence γn (z2 ) n=1 has a convergent subsequence γn (z2 ) n=1 , etc. Then the sequence ( )∞ ( n )∞ γn (zk ) n=1 = γ¯ni (zk ) i=1 convergence for any k (Cantor diagonal process). Let us put f (zk ) = limi→∞ γ¯ni (zk ), then γ¯ni ∣Z → f . By Corollary 3.26, the mappings γ¯ni : [0, 1] → X converge uniformly to some C-Lipschitz curve γ¯ : [0, 1] → X. By γ ∣ ≤ lim inf ni →∞ ∣¯ γni ∣ ≤ C, as required. Item (6) of Proposition 3.2, we have ∣¯

3.8

Existence of shortest curves

We apply the previous results to investigation of curves of smallest length. Definition 3.37. A rectifiable curve in a metric space is called shortest if its length is equal to the infimum of the lengths of all the curves joining its ends. Remark 3.38. If X is a space with an intrinsic metric, then a curve γ in X joining x and y is shortest if and only if ∣xy∣ = ∣γ∣. The following proposition is obvious. Proposition 3.39. A curve in a metric space is shortest if and only if each of its parts is a shortest curve. Problem 3.17. Prove that an arc-length curve γ : [a, b] → X in a space X with an intrinsic metric is shortest if and only if γ is an isometric embedding. Definition 3.40. A curve γ : [a, b] → X in a metric space X is called locally shortest if for each t ∈ [a, b] there exists an interval (α, β) ⊂ ℝ containing t such that γ∣[a,b]∩[α,β] is a shortest curve. Definition 3.41. A uniform locally shortest curve is called a geodesic. Arzela-Ascoli theorem, together with a few other previous propositions, implies the following result. Corollary 3.42. Any two points x and y of a compact metric space X that are joined by a rectifiable curve are also joined by a shortest curve. Proof. Let ℓ be the infimum of the lengths of the curves joining x and y. There is a sequence γn for which ∣γn ∣ → ℓ and, thus, the lengths of γn are uniformly bounded. Theorem 3.36 implies that the sequence γn contains a subsequence γni which uniformly converging to some curve γ. By Item (6) of Proposition 3.2, we have ∣γ∣ ≤ lim inf i→∞ ∣γni ∣ = ℓ, however, by the minimality of ℓ, it holds ∣γ∣ ≥ ℓ, therefore ∣γ∣ = ℓ and, thus, γ is a shortest curve. Remark 3.43. Corollary 3.42 remains true if we change compact X to a boundedly compact one (verify this). Definition 3.44. A metric on X is called strictly intrinsic if any two points in X are joined by a curve whose length is equal to the distance between these points. A metric space with strictly intrinsic metrics is called strictly intrinsic or geodesic. Taking into account the Hopf–Rinow theorem, we obtain the following Corollary 3.45. Each complete locally compact space with an intrinsic metric is a geodesic space.

3.9. Shortest curves and midpoints

3.9

40

Shortest curves and midpoints

Definition 3.46. A point z of a metric space is called a midpoint between or for points x and y of this space if ∣xz∣ = ∣yz∣ = 21 ∣xy∣. Theorem 3.47. Let X be a complete metric space. Suppose that for each pair of points x, y ∈ X there is a midpoint. Then X is a geodesic space. Proof. Choose two arbitrary points x and y from X. We show that these points can be joined by a curve γ : [0, 1] → X, for which ∣γ∣ = ∣xy∣. We will sequentially determine the map γ for various points of the segment [0, 1]. Put γ(0) = x and γ(1) = y. Next, let γ(1/2) be a midpoint between x and y; γ(1/4) be a midpoint between γ(0) and γ(1/2), and γ(3/4) be a midpoint between γ(1/2) and γ(1). Continuing this process, we define γ at all binary rational points of the segment [0, 1], i.e., at all points of the form m/2n , where 0 ≤ m ≤ 2n is an integer, and n = 0, 1, . . .. Note that the set of all binary rational points of the segment [0, 1] is everywhere dense in [0, 1]. In addition, it is easy to show that the constructed mapping γ is ∣xy∣-Lipschitz. The proof of the following technical lemma is left as an exercise. Lemma 3.48. Let Z be an everywhere dense subset of a metric space X, and f : Z → Y be some C-Lipschitz mapping into a complete metric space Y . Then there exists a unique continuous mapping F : X → Y extending f . Moreover, the mapping F is also C-Lipschitz. Problem 3.18. Prove Lemma 3.48. So, using Lemma 3.48, we extend by continuity the mapping γ onto the entire segment [0, 1], and we again denote the resulting ∣xy∣-Lipschitz curve by γ. As noted in Example 3.1, it holds ∣γ∣ ≤ ∣xy∣(1 − 0) = ∣xy∣, from where, by virtue of Item (1) of Proposition 3.2, we have ∣γ∣ = ∣xy∣ and, therefore, γ is a shortest curve. Remark 3.49. In a complete metric space, the property of a metric to be intrinsic is not sufficient for midpoints and shortest curves between any points to exist. Consider a countable family of segments [0, 1 + 1/n], n ∈ ℕ, each with the standard metric, and glue all their zeros at one point A, and at another point B we glue all the other ends 1 + 1/n. If x and y belong to different segments, say to [0, 1 + 1/n] and [0, 1 + 1/m], then we set the distance between x and y equal to min(x + y, 1 − x + 1/n + 1 − y + 1/n) (i.e., the intrinsic circle metric is considered on each pair of glued segments). Then the distance between A and B is 1 and is not reached on any curve. In addition, there is no midpoints between A and B. Definition 3.50. A point | | z of a metric | space X is called an ε-midpoint between or for points x and y of this space | if |∣xz∣ − 21 ∣xy∣| ≤ ε and |∣yz∣ − 12 ∣xy∣| ≤ ε. Theorem 3.51. Let X be a complete metric space. Suppose that for each pair of points x, y ∈ X and each ε > 0, there is an ε-midpoint. Then the metric of X is intrinsic. Proof. The proof is similar with the one of Theorem 3.47, however, now we find not strict midpoints, but approximate ∑∞ ones, making sure that the total “spread” is not large (we use the fact that i=1 ε/2i = ε). There are also converse obvious statements, even without assuming the completeness of the ambient space. Proposition 3.52. In a space with an intrinsic (strictly intrinsic) metric, for any two points and any ε > 0 there is an ε-midpoint (a midpoint), respectively. Problem 3.19. Prove Proposition 3.52.

References to Chapter 3 [1] D.Burago, Yu.Burago, S.Ivanov, A Course in Metric Geometry. Graduate Studies in Mathematics, vol.33, A.M.S., Providence, RI, 2001.

41

Exercises to Chapter 3.

42

Exercises to Chapter 3 Exercise 3.1. Let X be an arbitrary metric space and Ω(X) the family of all curves in X. Verify that (1) if γ ∈ Ω(X) joins the points x, y ∈ X, then ∣γ∣ ≥ ∣xy∣; (2) if γ = γ1 · γ2 is the gluing of curves γ1 , γ2 ∈ Ω(X) then ∣γ∣ = ∣γ1 ∣ + ∣γ2 ∣; (3) for each γ ∈ Ω(X), γ : [a, b] → X, and reparametrization φ : [c, d] → [a, b], it holds ∣γ∣ = ∣γ ◦ ψ∣; (4) for each x ∈ X, ε > 0, y ∈ X \ Uε (x) and the curve γ ∈ Ω(X) joining x and y, ∣γ∣ ≥ ε holds; | | (5) is it true that for any γ ∈ Ω(X), γ : [a, b] → X, the function f (t) = |γ∣[a,t] | is continuous? (6) is it true that for any sequence γn ∈ Ω(X) converging pointwise to some γ ∈ Ω(X), we have ∣γ∣ ≤ lim inf ∣γn ∣? n→∞

Exercise 3.2. Show that the piecewise smooth curve in ℝn is Lipschitzian with a Lipschitz constant equal to the maximum modulus of the velocity vector of the curve, therefore each such curve is rectifiable. Exercise 3.3. Let X be a metric space in which any two points are connected by a rectifiable curve. (1) Prove that din is a metric. (2) Denote by τ the metric topology of X w.r.t. the initial metric on X, by τin the metric topology w.r.t. din , by X∈ the set X with metric din and topology τin . Show that τ ⊂ τin . In particular, if a mapping γ : [a, b] → Xin is continuous, then the mapping γ : [a, b] → X is continuous as well. (3) Construct an example when τ ̸= τin . (4) Prove that for each rectifiable curve γ : [a, b] → X the mapping γ : [a, b] → Xin is continuous. (5) Denote by ∣γ∣in the length of a curve γ : [a, b] → Xin . Show that for each curve γ : [a, b] → X which is also a curve in Xin , it holds ∣γ∣ = ∣γ∣in . Thus, the sets of rectifiable curves for X and Xin coincide, and each non-rectifiable curve in X is either a non-rectifiable one in Xin , or the mapping γ : [a, b] → Xin is discontinuous. (6) Construct an example of continuous mapping γ : [a, b] → X such that the mapping γ : [a, b] → Xin is not continuous. Notice that the curve γ : [a, b] → X can not be rectifiable. Exercise 3.4. Let X be a metric space in which any two points are connected by a rectifiable curve. Prove that the metric din is intrinsic. Exercise 3.5. Let ρ1 ≤ ρ2 be generalized pseudometrics on a set X, and Y be a topological space. Prove that each mapping f : Y → X, continuous w.r.t. ρ2 , is also continuous w.r.t. ρ1 , in particular, if γ is a curve in (X, ρ2 ), then γ is also a curve in (X, ρ1 ); moreover, if ρ′1 and ρ′2 denote the corresponding generalized intrinsic pseudometrics, then ρ′1 ≤ ρ′2 . Exercise 3.6. Let X be an arbitrary set covered by a family {Xi }i∈I of generalized pseudometric spaces. Denote the distance function on Xi by ρi , and consider the set 𝒟 of all generalized pseudometrics d on X such that for any i and x, y ∈ Xi it holds d(x, y) ≤ ρi (x, y). Extend each ρi to the whole X by setting ρ′i (x, y) = ∞ if at least one of x, y does not belong to Xi , and ρ′i (x, y) = ρ(x, y) otherwise (it is easy to see that each ρ′i is a generalized pseudometric). Denote by 𝒟′ the set of all such ρ′i . Prove that sup 𝒟 = inf 𝒟′ , and if all ρi are intrinsic, then sup 𝒟 is intrinsic as well. Exercise 3.7. Let 𝒟 be a collection of generalized pseudometrics defined on the same set X, and Xd for d ∈ 𝒟 denote the generalized pseudometric space (X, d). Put W = ⊔d∈𝒟 Xd and denote by ρ the generalized pseudometric of W . Define on W an equivalence relation ∼ by identifying those points xd ∈ Xd and xd′ ∈ Xd′ which correspond to the same point x of the set X. The equivalence class of these points xd and xd′ we denote by [x]. Denote by ρ∼ the quotient generalized pseudometric on W/∼. Define the mapping φ : W/∼→ X as φ : [x] → x, then φ is bijective, and ρ∼ can be considered as a generalized pseudometric on X. Prove that ρ∼ = inf 𝒟.

Exercises to Chapter 3.

43

Exercise 3.8. Let ρ1 and ρ2 be intrinsic metrics on a set X. Suppose that these metrics generate the same topology, and that each x ∈ X has a neighborhood U x such that the restrictions of ρ1 and ρ2 to U x coincide. Prove that ρ1 = ρ2 . Show that the condition “ρ1 and ρ2 are intrinsic” is essential. Exercise 3.9. Prove that a metric space X is locally compact if and only if for each point x ∈ X there exists a neighborhood with compact closure. Exercise 3.10. Show that a metric spaces is boundedly compact if and only if its compact subsets are exactly those subsets that are closed and bounded. Exercise { set} and Y an arbitrary metric space. Consider the collection of sets of the ∏ 3.11. Let∏X be an arbitrary form x∈X V (x) ⊂ x∈X Y , where V (x) x∈X is the family of nonempty open subsets of Y such that for all x ∈ X, except for their finite number, V (x) = Y . Show that the family defined in this way forms a basis of a topology, and the convergence in this topology of points fn to a point f is equivalent to pointwise convergence of the mappings fn to the mapping f . Exercise 3.12. Let X be an arbitrary set and Y an arbitrary metric space. A mapping f : X → Y is called bounded if its image f (X) is a bounded subset of Y . The family of all bounded mappings | |from X to Y we denote by ℬ(X, Y ). We define the following distance function on ℬ(X, Y ): ∣f g∣ = supx∈X |f (x)g(x)|. Prove that the distance function defined above is a metric, and that the convergence in this metric of a sequence fn ∈ ℬ(X, Y ) to some f ∈ ℬ(X, Y ) is equivalent to uniform convergence of the mappings fn to the mapping f . Exercise 3.13. Let X be an arbitrary set | | and Y an arbitrary metric space. Define the following generalized distance function on Y X : ∣f g∣ = supx∈X |f (x)g(x)|. Prove that the generalized distance function defined above is a generalized metric, and that the convergence in this generalized metric of a sequence fn ∈ Y X to some f ∈ Y X is equivalent to uniform convergence of the mappings fn to the mapping f . Exercise 3.14. Let γ be a curve in a metric space. Prove that (1) nondegenerate γ can be reparameterized to an arc-length or, more generally, to a uniform one if and only if γ is rectifiable and non-stop; (2) degenerate γ can be reparameterized to an arc-length one if and only if its domain is singleton (indeed, such γ is arc-length itself and, thus, it need not a reparametrization); (3) degenerate γ is always uniform. Exercise 3.15. Prove that a curve γ in a metric space can be monotonically reparameterized to an arc-length or, more generally, a uniform one if and only if γ is rectifiable. The reparameterized curve is unique upto the choice of its domain and direction. In arc-length case one can choose any segment of the length ∣γ∣. In the uniform case the domain can be arbitrary nondegenerate segment for nondegenerate γ, and arbitrary segment for degenerate γ. Exercise 3.16. Prove that an arc-length curve γ : [a, b] → X in a space X with an intrinsic metric is shortest if and only if γ is an isometric embedding. Exercise 3.17. Let Z be an everywhere dense subset of a metric space X, and f : Z → Y be some C-Lipschitz map into a complete metric space Y . Then there exists a unique continuous mapping F : X → Y extending f . Moreover, the mapping F is also C-Lipschitz. Exercise 3.18. Show that in a space with an intrinsic (strictly intrinsic) metric, for any two points and any ε > 0 there is an ε-midpoint (a midpoint), respectively.

Chapter 4

Extreme graphs and networks. Schedule. Simple graphs, finite graphs, vertices, edges, isomorphism of graphs, adjacency, incidence, neighborhood of a vertex, subgraph, spanning subgraph, complete graph, subgraph generated by vertices, subgraph generated by edges, walk, degenerate and non-degenerated walks, open and closed walks, trail, path, circuit, cycle, connected graph, components of a graph, forest, tree, weighted graph, the weight of a subgraph, the weight of trail, the weight of walk, operations on graphs, union, disjoint union, intersection, difference, deleting edges, deleting vertices, quotient graphs, quotient by an edge, splitting a vertex, splitting of a vertex, graphs in metric spaces, the length of an edge, the length of a graph, the length of minimum spanning tree, minimum spanning tree, the length of Steiner minimal tree, Steiner minimal tree, the length of minimal filling, minimal filling, mst-spectrum of finite metric space, calculation of mst-spectrum in terms of partitions, graphs with boundaries, boundary (fixed) vertices, interior (movable) vertices, networks, parameterizing graphs of networks, boundary of a network, the length of a network, splitting and splitting off for networks, full Steiner tree, Steiner minimal trees existence in boundedly compact metric spaces.

In this section, we collect information about various kinds of extreme graphs and networks. We will consider two types of such graphs: minimal spanning trees and shortest trees, also called Steiner minimal trees.

4.1

Necessary information from graph theory

We will consider only simple graphs, so in what follows by a graph we mean a pair G = (V, E) consisting of two sets V and E, respectively called the set of vertices and the set of edges of the graph G; the elements from V are called vertices, and from E are called edges of the graph G. The set E is a subset of the family of two-element subsets of V . If V and E are finite sets then the graph G is called finite. It is convenient to use the following notation: • if {v, w} ∈ E is an edge of the graph G, then we will write it in the form vw or wv; we will also say that the edge vw joins the vertices v and w, and that v and w are the vertices of the edge vw; • if the sets V and E are not explicitly indicated, and only the notation for the graph G is introduced, then the set of vertices of this graph is usually written as V (G), and the set of edges is denoted by E(G). Recall some concepts from the graph theory. Graphs G = (V, E) and H = (W, F ) are called isomorphic if there exists a bijective map f : V → W such that uv ∈ E if and only if f (u)f (v) ∈ F . Such a mapping f is called an isomorphism of the graphs G and H. Isomorphic graphs are often identified and, therefore, are not distinguished. Two vertices v, w ∈ V (G) are called adjacent if vw ∈ E(G). Two different edges e1 , e2 ∈ E(G) are called adjacent if they have a common vertex, i.e., if e1 ∩ e2 ̸= ∅. Each edge vw ∈ E(V ) and its vertex, i.e., v or w, are incident to each other. The set of vertices of a graph G adjacent to a vertex v ∈ V is called the neighborhood of the vertex v and denoted by Nv . The cardinal number of edges incident to a vertex v is called the degree of the vertex v and is denoted by deg v, so deg v = #Nv . A subgraph of a graph G = (V, E) is each graph H = (W, F ) provided that W ⊂ V and F ⊂ E. The fact that a graph H is a subgraph of a graph G will be written as H ⊂ G. If W = V then the subgraph H ⊂ G is called spanning. On the set of all graphs whose vertex sets lie in a given set V , the inclusion relation ⊂ defines a partial order. The smallest element in this order is the empty graph (∅, ∅); the greatest one is called the complete graph on V , which we denote by K(V ): in K(V ) each pair of vertices are joined by an edge. This partial order induces the one on the set of all subgraphs of a graph G = (V, E): now the smallest element is again the empty graph (∅, ∅), but the greatest one is G. For each W ⊂ V we define the subgraph G(W ) of the graph G generated by W : its set of vertices coincides with W , and its set of edges consists of all e ∈ E that connect the vertices from W . In other words, G(W ) is maximal among subgraphs of G whose vertex sets coincides with W . 44

4.1. Necessary information from graph theory

45

Now, we define a similar construction by interchanging vertices and edges. Namely, for F ⊂ E we define the subgraph G(F ) of the graph G generated by F : its set of edges coincides with F , and its set of vertices is the collection of all vertices of G incident to edges from F . In what follows we also apply this construction. A finite sequence γ = (v0 = v, v1 , . . . , vk = w) of vertices of a graph G is called a walk of length k joining v and w if for every i = 1, . . . , k the vertices vi−1 and vi are adjacent, and the edges ei = vi−1 vi are called the edges of the walk γ. A walk containing at least one edge is called nondegenerate, and not containing is called degenerate. The walt is called closed if v0 = vn , and it is called open otherwise. A trail is a walk with no repeated edges. A path is an open trail with no repeated vertices. A circuit is a closed trail. A cycle is a circuit with no repeated vertices. A graph G is called connected, if each pair of its vertices are joined by a walk. Maximal (by inclusion) connected subgraphs of a graph G are called components of G. A graph without cycles is called a forest, and a connected forest is called a tree. A weighted graph is a graph G = (V, E) equipped with a weight function ω : E → [0, ∞) (sometimes it is useful to consider more general weight functions, for instance, with possibility of negative values or ∞). Sometimes we denote such weighted graph as (V, E,∑ ω) or (G, ω). The weight ω(H) of a subgraph H ⊂ G is the sum of the weights of edges from this subgraph: ω(H) = e∈E(H) ω(e). We can extend this definition to trails, in particular, to paths, circuits and cycles, considering them as subgraphs of G. In the case of walk γ = (v0 = v, v1 , . . . , vk = w), its weight is ∑the n defined as the sum of weights of its consecutive edges: ω(γ) = i=1 ∈ (vi−1 vi ). For graphs without weight functions these notions are defined as well by assigning the weight 1 to each edge by default. Remark 4.1. As in the case of metric spaces, we sometimes won’t explicitly denote the weight function. Instead of that, when we speak about weighted graph G, the weights of all the objects x related to such G we denote by ∣x∣, for example, for e ∈ E by ∣e∣ we mean the weight of this edge, and H ⊂ G by ∣H∣ we mean the weight of H, etc.

4.1.1

Some operations on graphs

Let H1 = (W1 , F1 ) and H2 = (W2 , F2 ) be subgraphs of a graph G. Then the following subgraphs are defined: • the union H1 ∪ H2 = (W1 ∪ W2 , F1 ∪ F2 ); • the disjoint union: if W1 ∩ W2 = ∅, then to emphasize this fact, instead of H1 ∪ H2 we write H1 ⊔ H2 ; • the intersection H1 ∩ H2 = (W1 ∩ W2 , F1 ∩ F2 ); • the difference H1 \ H2 = H1 (W1 \ W2 ). Remark 4.2. We can define the operations described above on any graphs Hi , not only on subgraphs of a graph. To reduce these definitions to the previous ones, we consider Hi as subgraphs of the graph K(W1 ∪ W2 ). Problem 4.1. Show that each forest is the disjoint union of trees that are components of this forest. If G = (V, E) is a graph, and F is a set possibly intersecting E, then the operation of deleting the set of edges F from the graph G produces the graph G \e F := (V, E \ F ). If F = {e} then instead of G \e {e} we will write G \e e. The operation of deleting a set of vertices W from the graph G produces the graph G \v W := G(V \ W ). If W = {w} then instead of G \v {w} we will write G \v w. If it is clear that F refers to edges, or W refers to vertices, we write simplified G \ F or G \ W , respectively. Using the operation of deleting edges, we define the complement of a graph G = (V, E) or, in other words, the ¯ = K(V ) \ E. Thus, the dual graph G ¯ has the same set of vertices V , and its edges graph dual to G to be the graph G are exactly those edges of the complete graph on V that were absent in the original graph G. Another useful operation for us produces a quotient graph G = (V, E): let ∼ be an equivalence relation on V , and V = ⊔i∈I Vi the partition into classes of this equivalence. We put V /∼ = {Vi }, and as E/∼ we take the set of pairs Vi Vj , Vi ̸= Vj for which there exist vi ∈ Vi , vj ∈ Vj , such that vi vj ∈ E. By the quotient graph G/∼ we call the graph (V /∼ , E/∼ ). An important particular case of this operation creates the quotient of G by an edge e = vw ∈ E: the result is the graph G/∼ for the equivalence relation identifying the vertices v and w. We denote this quotient graph by G/e. The following notation and concepts are also useful: the equivalence class containing a given vertex v will be denoted by [v]; the mapping π : V → V /∼ , π : v ⊦→ [v], is called the canonical projection. Problem 4.2. Let G = (V, E) be a connected graph, and ∼ an arbitrary equivalence relation on the set V . Show that the graph G/∼ is connected.

4.2. Graphs and optimization problems

46

Problem 4.3. Let G = (V, E) be an arbitrary tree, and ∼ be an equivalence relation on the set V such that for each class Vi of this equivalence the subgraph G(Vi ) is a tree. Show that then G/∼ is a tree. In some cases, the following operations are inverse to the quotient by an edge. We define two such operations: splitting a vertex of degree greater than or equal to 4, and splitting off some vertex of degree 1 from a vertex of degree greater than or equal to 2. So, let G = (V, E) be a graph, v ∈ V , deg v ≥ 4. We partition the neighborhood Nv of the vertex v into two sets V1 and V2 , each of which contains at least two vertices. Consider the graph G \ v, add to its vertex set V \ {v} two elements w1 and w2 not contained in V \ {v}, and, to the set of edges, all pairs of the form w1 v1 , v1 ∈ V1 , w2 v2 , v2 ∈ V2 , as well as the pair w1 w2 . We call the obtained graph the result of splitting the vertex v, and the edge w1 w2 the splitting edge. It is clear that the graph obtained from G by the composition of splitting a vertex and the quotient by the corresponding splitting edge, is isomorphic to G (just in this sense, the splitting is inverse to the quotient operation). To determine the splitting off a vertex of degree 1 from v, add to V an element w not contained in V , and add the edge vw to E. The obtained graph is called the result of splitting off the vertex w from the vertex v, and the edge wv is called the splitting edge. It is clear that the quotient by the splitting edge is isomorphic to the original graph (in this sense, splitting off is also inverse to the quotient operation).

4.2

Graphs and optimization problems

Let G = (V, E) be an arbitrary graph. We say that the graph G is defined in a metric space X if V ⊂ X. For every such graph, the length ∣e∣ of its edge e = vw is defined as the distance ∣vw∣ between the ending vertices v and w of these edge, as well as the length ∣G∣ of the graph G itself as the sum of all its edges. More generally, one ( of the lengths ) can replace the metric space X with a weighted complete graph K(X), ω ; another possibility — to consider some weighted graph (H, ω) with V (H) = X (not necessarily the complete one), such that G is a subgraph of) H. Let us ( note that each metric space (X, ρ) can be considered as a weighted complete graph, namely, as K(X), ρ .

4.2.1

Minimum spanning tree problem

Let M be a metric space. We consider M as a weighted complete graph K(M ), and denote by 𝒯 (M ) the set of all spanning trees in K(M ). Then we put mst(M ) = inf ∣T ∣ T ∈𝒯 (M )

and call it the length of minimum spanning tree on M . Each T ∈ 𝒯 (M ) with ∣T ∣ = mst(M ) is call a minimum spanning tree on M . The set of all minimum spanning trees on M is denoted by MST(M ). Remark 4.3. If M is finite, them MST(M ) ̸= ∅. For infinite M the situation is rather more difficult, see [1] and [2]. Example 4.4. If all nonzero distances in M are the same, then every spanning tree in K(M ) is minimal, so MST(M ) = 𝒯 (M ). If #M = 3, then each minimum spanning tree is obtained from the complete graph K(M ) by deleting the longest edge (if there are several, then any of them). Problem 4.4. Let M be a finite metric space. Partition M into nonempty subsets M1 and M2 , and let vi ∈ Mi were chosen in such a way that ∣v1 v2 ∣ = ∣M1 M2 ∣. Prove that there exists a minimum spanning tree T ∈ MST(M ) such that v1 v2 ∈ E(T ). Remark 4.5. The problem of finding a minimum spanning tree can be naturally generalized. Let M be a set. Consider a connected weighted graph H with V (H) = M , and denote by 𝒯 (H) the set of all spanning subtrees of H. Then we put mst(H) = inf ∣T ∣ T ∈𝒯 (H)

and call it the weight of minimum spanning tree in H. If there exists T ∈ 𝒯 (H) such that ∣T ∣ = mst(H) then we call such T a minimum spanning tree in H. The set of all minimum spanning trees in H is denoted by MST(H). If M is a metric space, and H = K(M ) the corresponding weighted complete graph, then mst(H) = mst(M ) and MST(H) = MST(M ). Note that there are a number of fast algorithms that solve the problem of finding a minimum spanning tree in a finite weighted connected graph. The most popular of them are Kruskal [5] and Prim [6] algorithms.

4.2. Graphs and optimization problems

4.2.2

47

Steiner minimal tree problem

Now we generalize the notion of minimum spanning tree. To do that, we consider M as a subset of another metric spaces X, then we will minimize mst(V ) over all M ⊂ V ⊂ X. Namely, we put { } smtX (M ) = inf mst(V ) : M ⊂ V ⊂ X and call it the length of Steiner minimal tree on M . Each T ∈ 𝒯 (V ) for M ⊂ V ⊂ X is called a shortest tree on M or a Steiner minimal tree on M if ∣T ∣ = smtX (M ). The set of all Steiner minimal trees on M is denoted by SMTX (M ). If it is clear or not important what X the set M belongs to, we simply write smt(M ) and SMT(M ) omitting X. Remark 4.6. The following terminology is convenient when we study Steiner minimal trees or minimum spanning trees: if G = (V, E) is a graph such that V ⊂ X, then we say that G is a graph in the space X. If M ⊂ V then we say that G joins M ; if M = V then we say that G spans M . Thus, looking for minimum spanning trees we minimize the length of the trees spanning M , and for Steiner minimal trees we deal with the trees in the space X joining M . Remark 4.7. The classical problem of finding a shortest tree is formulated for the Euclidean plane X = ℝ2 . The case #M = 3 arose as early as 1643 in works of Fermat [7]. For an arbitrary finite number of points on the Euclidean plane, the problem was posed by Jarn´ık and K¨ossler in 1934 [8]. Courant and Robbins [9] mistakenly called the problem of finding a shortest tree on the Euclidean plane the Steiner problem. Due to popularity of the book [9], this title has been fixed. The Steiner problem can be solved by Melzak’s algorithm [10] and its many improvements, see for example [11] and [12]. As shown in [13], the Steiner problem is algorithmically complex (N P -complete). Remark 4.8. (1) Generally speaking, the set SMT(M ) can be empty, also for finite M , however, the value smt(M ) is always defined. (2) The set SMTX (M ) and the value smtX (M ) depend not only on the distances between points from M , but also on the geometry of the ambient space X: isometric M lying in different metric spaces X can be joined by Steiner minimal trees of different lengths. Some details on the theory of Steiner minimal trees can be found, for example, in [3] or [4]. Problem 4.5. Find all Steiner minimal trees for 3-point boundaries in the Euclidean plane. How many such trees exist for different boundaries? Problem 4.6. Find all Steiner minimal trees for the vertices of a square in the Euclidean plane. How many such trees exist? Problem ‖ 4.7. Find all Steiner minimal trees for 3-point boundaries in the plane with ℓ1 -metric defined by the norm ‖ ‖(x, y)‖ = ∣x∣ + ∣y∣. How many such trees exist for different boundaries? Problem 4.8. Construct an example of a complete metric space and of some its finite subset M , such that there is no a Steiner minimal tree joining M .

4.2.3

One-dimensional minimal filling problem

We now fix a finite metric space M . We will embed it isometrically into various metric spaces X, and minimize smtX (M ) over all such embeddings. To overcome the Cantor paradox, we put { } ( ) mf(M ) = inf r ∣ there exists an isometric embedding ν : M → X with smtX ν(M ) ≤ r ( ) and call it the length of minimal filling of M . Each tree G ∈ SMTX ν(M ) such that ∣G∣ = mf(M ) is called a minimal filling of M . The set of all minimal fillings of M is denoted by MF(M ). Remark 4.9. Each graph G in a metric space X can be naturally considered as a weighted graph with the weight function assigning to the edges their lengths. Thus, each minimal filling is a weighted graph. The triangle inequality in X leads to the fact that the distances between points in M are majorized by the length of pathes in G connecting these points.

4.3. mst-spectrum of a finite metric space

48

All this motivates an alternative equivalent definition of minimal fillings. Namely, let G = (V, E, ω) be a weighted connected graph. Recall that in Construction 2.7 we introduced the corresponding pseudometric dω on V as follows: for arbitrary v, w ∈ W we put { } dω (v, w) = inf ω(γ) : γ is a walk joining v and w . Let M be a metric space. A connected weighted graph G = (V, E, ω) joining M is called a filling of M if for any v, w ∈ M we have ∣vw∣ ≤ dω (v, w). Problem 4.9. Prove that for any metric space M it holds { } mf(M ) = inf ω(G) : G is a filling of M . The following results were obtained in [15]. Problem 4.10. Prove that for any finite metric space M there exists a minimal filling. Problem 4.11. Let M be a finite metric space with equal non-zero distances. Describe all minimal filling of M . Remark 4.10. The multidimensional problem on minimal fillings was formulated by M.Gromov [14]. One-dimensional minimal filling as a stratified version of the Gromov’s problem was studied by Ivanov and Tuzhilin [15].

4.3

mst-spectrum of a finite metric space

In this section we consider only finite metric spaces M , i.e., #M < ∞. To start with, we note that the minimum spanning tree, generally speaking, is not uniquely defined. For G ∈ MST(M ), by σ(G) we denote the vector whose elements are the lengths of the edges of the tree G sorted in descending order. The following result is well known, however, we present its proof for completeness. Proposition 4.11. For any G1 , G2 ∈ MST(M ) it holds σ(G1 ) = σ(G2 ). Proof. Recall the standard algorithm for converting one minimum spanning tree to another [5]. Let G1 ̸= G2 , Gi = (M, Ei ), then E1 ̸= E2 and #E1 = #E2 , therefore, there exists e ∈ E2 \ E1 . The graph G1 ∪ e has a cycle C containing the edge e. There is no longer edge in the C cycle than e, because otherwise G1 ̸∈ MST(M ). The forest G2 \ e consists of two trees whose vertex sets we denote by V ′ and V ′′ . Clearly, M = V ′ ⊔ V ′′ . The cycle C contains an edge e′ ̸= e joining a vertex from V ′ with a vertex from V ′′ . This edge does not lie in E2 , otherwise G2 would contain a loop. Therefore, e′ ∈ E1 \ E2 . The graph G2 ∪ e′ also contains some cycle C ′ . By the choice of e′ , the cycle C ′ also has the edge e. Similarly to the above, the length of the edge e is less than or equal to the length of the edge e′ , otherwise G2 ̸∈ MST(M ). Therefore, ∣e∣ = ∣e′ ∣. Replacing the edge e′ in G1 with e, we get a tree G′1 of the same length, i.e., it is a minimum spanning tree as well, and G′1 and G2 have one common edge more than the trees G1 and G2 . Thus, in a finite number of steps, we rebuild the tree G1 into the tree G2 , passing through minimum spanning trees. It remains to notice that σ(G′1 ) = σ(G1 ), therefore, σ(G1 ) = σ(G2 ). Proposition 4.11 motivates the following definition. Definition 4.12. For any finite metric space M , by σ(M ) we denote σ(G) for an arbitrary G ∈ MST(M ) and call it the mst-spectrum of the space M . Construction 4.1. For a set M and a cardinal number k ≤ #M , by 𝒟k (M ) we denote the family of all possible partitions of the{ set M into k } of its nonempty subsets. Now let M be a metric space and D = {Mi }i∈I ∈ 𝒟k (M ). Put α(D) = inf ∣Mi Mj ∣ : i ̸= j . Theorem 4.13. Let M be a finite metric space and σ(M ) = (σ1 , . . . , σn−1 ). Then { } σk = max α(D) : D ∈ 𝒟k+1 (M ) . Proof. Let G = (M, E) ∈ MST(M ) and the set E be ordered so that ∣ei ∣ = σi . Denote by D = {M1 , . . . , Mk+1 } the partition of the set M into the sets of vertices of the trees G \ {ei }ki=1 .

4.4. Networks

49

Lemma 4.14. We have α(D) = ∣ek ∣. Proof. Indeed, we choose arbitrary Mi and Mj , i ̸= j, in them we take points Pi and Pj , respectively, and let γ be the unique path in G, joining Pi and Pj . Then γ contains some edge ep , 1 ≤ p ≤ k. However, due to the minimality of the tree G, we have ∣Pi Pj ∣ ≥ ∣ep ∣ ≥ mini ∣ei ∣ = ∣ek ∣, thus ∣Mi Mj ∣ ≥ ∣ek ∣, so α(D) ≥ ∣ek ∣. On the other hand, if i and j are chosen so that ek joins Mi and Mj , then we get α(D) ≤ ∣Mi Mj ∣ = ∣ek ∣. ′ Now consider an arbitrary partition D′ = {M1′ , . . . , Mk+1 }.

Lemma 4.15. We have α(D′ ) ≤ α(D). Proof. By virtue of Lemma 4.14, it suffices to show that α(D′ ) ≤ ∣ek ∣. Denote by E ′ the set consisting of all edges ep ∈ E, for each of which there are Mi′ and Mj′ , i ̸= j, such that ep joins Mi′ and Mj′ . Since G is connected, the set E ′ consists of at least k edges; otherwise, the set of indices {1, . . . , k + 1} is split into two nonempty subsets I and J such that the sets ∪i∈I Mi′ and ∪j∈J Mj′ that generate the partition M are not joined by any edge from E. On the other hand, if some Mi′ and Mj′ , i ̸= j, are joined by an edge e′ ∈ E ′ , then ∣Mi′ Mj′ ∣ ≤ ∣e′ ∣, hence α(D′ ) = min ∣Mi′ Mj′ ∣ ≤ mine′ ∈E ′ ∣e′ ∣ ≤ ∣ek ∣. Lemma 4.15 completes the proof of the theorem.

4.4

Networks

To study Steiner minimal trees and minimal fillings, it is sometimes more convenient to work with so-called networks instead of the graphs in metric spaces. For example, if we investigate deformations of such graphs perturbing the positions of some their vertices, it may happen that after such a perturbation some vertices coincide, however we would like to preserve the structure of the graph by considering the coinciding vertices as different ones. To achieve this, we suppose that the graph is given apart of the metric space, and “the positions of its vertices in the space” are provided by a mapping from the vertex set of the graph to this space. Such mappings are called networks. Remark 4.16. Let us discuss three additional observations. (1) In the optimization problems we usually deal with connected graphs, thus the domain of each network will be the vertex set of a connected graph, more often, of a tree. (2) We usually investigate boundary-value problems, that is why we need to partition the vertices into boundary and all remaining (nonboundary) ones. (3) We usually minimize the length of a graph. If a graph G in a metric space contains a nonboundary vertex of degree 1 or 2, we can simplify G preserving its connectedness, boundary, and not increasing its length. In the case of degree 1 nonboundary vertex, we can simply remove the edge incident to this vertex. In the case of degree 2 nonboundary vertex, we can change the both edges incident to this vertex by the unique edge joining the remaining vertices of these edges. That is why we usually assume that each boundary contains all the vertices of degree 1 and 2. Remark 4.17. Indeed, we could also destroy possible cycles in the graph we optimize, and thus we might restrict ourselves with trees. However, in what follows we will see that such restriction leads us to some inconvenience, that is why we do not limit ourselves with trees only, but develop the corresponding theory for general connected graphs. Now we are ready to give formal definitions. We will assume that in each graph G there is a certain set of vertices ∂G ⊂ V (G) containing all vertices of degree 1 and 2 (the set ∂G can be empty), which is called the boundary of the graph G, and the vertices from ∂G are called boundary ones. The remaining vertices of the graph G are called interior ones. Sometimes the boundary vertices are also called fixed, while the interior vertices are called movable. Let G = (V, E) be a connected graph with some boundary ∂G. A network of the type G in a metric space X is an arbitrary mapping Γ : V → X. The G is also called the parameterizing graph of Γ. In what follows, we will transfer to the networks all the terminology from the graph theory related to their parameterizing graphs. Let Γ : V → X be a network parameterized by a connected graph G = (V, E) with a boundary ∂G ⊂ V . Then (1) the restrictions of Γ to the vertices and the edges of G are called the vertices and the edges of Γ, respectively;

4.4. Networks

50

(2) the restriction of Γ to ∂G is called the boundary of Γ and is denoted by ∂Γ; | | (3) for each vw ∈ E the value |Γ(v)Γ(w)| is called the length of the edge Γ : {v, w} → X; (4) the sum of the lengths of all edges of Γ is called the length of Γ and is denoted by ∣Γ∣. Example 4.18. Let G = (V, E) be a connected graph in a metric space X joining M ⊂ X, i.e., M ⊂ V ⊂ X, such that M contains all the vertices of G of degree 1 and 2. Put ∂G = M , and define a network Γ : V → X as the inclusion mapping Γ : v ⊦→ v. Thus, Γ is a network in X of the type G, and ∣Γ∣ = ∣G∣. Example 4.19. Let X be a metric space, G = (V, E) a connected graph with ∂G = M ⊂ X. Let Γ : V → X be a network whose restriction to M is the inclusion: Γ(v) = v for all v ∈ M . Suppose that the mapping Γ is injective. By identifying each vertex v ∈ V with its image Γ(v) ∈ X, we can consider G as a graph in X, so the length ∣G∣ of G is defined. Then, with this identification, we have ∣Γ∣ = ∣G∣. Definition 4.20. As in Example 4.19, given a metric space X, let G = (V, E) be a connected graph with ∂G = M ⊂ X, and Γ : V → X be a network whose boundary ∂Γ : M → X is the inclusion: ∂Γ(v) = v for all v ∈ M . For such Γ we say that Γ joins the subset M of the space X (we do not assume here that Γ is injective outside M ).

4.4.1

Networks and quotients

Now let V ′ be a set, ∼ an equivalence relation on V ′ , and suppose that V = V ′ /∼ . Denote by π : V ′ → V the canonical projection. Let G′ = (V ′ , E ′ ) be a connected graph such that G = G′ /∼ and ∂G = π(∂G′ ). Let Γ : V → X be a network of the type G with the boundary ∂Γ : ∂G → X, then the composition Γ′ = Γ ◦ π : V ′ → X is correctly defined and is a network in X of the type G′ with the boundary ∂Γ′ = ∂Γ ◦ π∣∂G′ . Problem 4.12. Show that (1) ∣Γ′ ∣ ≥ ∣Γ∣; (2) if G′ is a tree, V = {Vi′ }, and G′ (Vi′ ) ⊂ G′ is a tree for each i, then G is also a tree and ∣Γ′ ∣ = ∣Γ∣; (3) give an example in which G′ is a tree, G is not a tree, and ∣Γ′ ∣ = ∣Γ∣; (4) give an example in which G′ and G are trees, and ∣Γ′ ∣ > ∣Γ∣.

4.4.2

Splitting and splitting off for networks

In this section we extend the operations of “splitting a vertex” and “splitting off from a vertex” which we defined above, to the case of graphs with boundaries and the corresponding networks. These operations enable us to simplify the structures of networks in consideration. In Section 4.1 we defined splitting a vertex of degree greater than or equal to 4, and splitting off a vertex of degree 1 from a vertex of degree greater than or equal to 2. For graphs with boundary, we refine these definitions. We will split only interior vertices, while the resulting vertices will again be classified as interior; we will only split off from boundary vertices, and if we split off a vertex w from the boundary vertex v, then we assign the vertex v to interior one, and w to boundary one. The both these operations can be naturally defined for networks. A graph G with a boundary is called non-splittable if no vertex can be split off from any boundary vertex, and no interior vertex can be split. For a finite graph G, we define the degree p(G) of non-splitting, setting it equal to the sum of deg v − 3 over all interior vertices v of G. It is easy to see that the G, for which the degrees of all boundary vertices are 1, is not splittable if and only if p(G) = 0. The next lemma will be the key point in the proof of Theorem 4.23. Lemma 4.21. For each finite graph G = (V, E) with a boundary ∂G there exists a finite graph G′ = (V ′ , E ′ ) with a boundary ∂G′ , and an equivalence relation ∼ on V ′ , such that G = G′ /∼ and the following properties hold : (1) all boundary vertices of G′ have degree 1; (2) all interior vertices of G′ have degree 3; (3) for connected G, the graph G′ is connected ;

4.5. Steiner minimal trees existence

51

(4) for a tree G, the graph G′ is a tree; (5) the ∼-class of each boundary vertex of G′ is a singleton; (6) the canonical projection mapping π : V ′ → V corresponding to ∼ is a bijection between ∂G′ and ∂G; (7) if G (and G′ ) is a tree, then for each v ∈ V , W = π −1 (v), the graph G′ (W ) ⊂ G′ is a tree. Proof. To start with, we split off a vertex of degree 1 from each boundary vertex of G. As a result, we obtain a graph with all boundary vertices of degree 1. In this graph, we will successively split all its interior vertices of degree greater than or equal to 4. It is easy to see that the degree of non-splitting of this graph decreases by 1 for each splitting, therefore, in a finite number of steps, we arrive to a non-splittable graph G′ = (V ′ , E ′ ). We denote by ∼ the equivalence relation on V ′ , which is obtained from the trivial equivalence relation on V (whose all equivalence classes are singletons) according to the following rule: for each splitting off and splitting, the resulting pair of vertices is equivalent to all those ones to which the original vertex was equivalent, and to each other. It is clear that G′ /∼ = G. It remains to notice that those splittings preserve the connectivity and do not create cycles.

4.5

Steiner minimal trees existence

As we already mentioned in Problem 4.8, some metric spaces, also under assumption of completeness, may contain finite subsets which can not be joined by a Steiner minimal tree. In this section we prove that Steiner minimal trees always exist in boundedly compact metric spaces. To do that, we first need to reduce this problem to minimization of a finite number of continuous functions. To construct these functions, we show that it suffices to minimize the lengths of networks whose types can be chosen from some finite collection. A tree G, and a network Γ of the type G in a metric space X, both joining a set M ⊂ X, are called full Steiner trees if all their boundary vertices have degree 1 and all their interior vertices have degree 3. Let us stress that the boundary ∂Γ of this network is the inclusion M ⊂ X. Remark 4.22. If the graph G from Lemma 4.21 is a tree then the graph G′ from this lemma is a full Steiner tree. Theorem 4.23. Let X be an arbitrary metric space and M be a finite subset of X. Then smt(M ) is equal to the infimum of the lengths of all full Steiner trees Γ joining M . Proof. Recall that

{ } smt(M ) = inf ∣G∣ : G is a tree in X with ∂G = M .

As we mentioned in Remark 4.16, it suffices to consider only trees G whose vertices of degree 1 and 2 belong to M . In what follows, we will minimize over such trees only. Problem 4.13. Prove that any finite tree G with n ≥ 2 boundary vertices contains at most n − 2 interior vertices. The equality holds exactly in the case when G is a full Steiner tree. Problem 4.13 states that in calculation of smt(M ) we can consider only finite trees G with the boundary M . This enables us to use Lemma 4.21, according to which for every G there exists a full Steiner tree G′ and an equivalence relation ∼ on V (G′ ) such that G = G′ /∼ . Let π : V (G′ ) → V (G) be the corresponding canonical projection, then this π, being considered as a mapping from V (G′ ) to X, is a full Steiner tree joining M . Since G and G′ are trees, then by Lemma 4.21 each equivalence class generates a subtree of G′ , thus, by Problem 4.12, we have ∣π∣ = ∣G∣, that completes the proof. To list, up to a natural isomorphism, all the full Steiner trees from Theorem 4.23, we construct a model set of such trees. By a model full Steiner tree we mean a full Steiner tree G = (V, E) with V = {1, 2, . . . , 2n − 2} and ∂G = {1, . . . , n}. Two model full Steiner trees are called equivalent if there is an isomorphism between them that is identical on the boundary {1, . . . , n}. Thus, equivalent trees differ from each other by the numbering of their interior vertices. We denote by ℬn the set of all model full Steiner trees with n boundary vertices considered up to the introduced equivalence. In other words, we construct ℬn by choosing in each equivalence class an arbitrary representative. Now let X be a metric space and M be a finite subset of X consisting of n points. We enumerate the points from M in an arbitrary way, i.e., consider some bijection φ : {1, . . . , n} → M . Choose an arbitrary G ∈ ℬn , and consider a network Γ of the type G for which ∂Γ = φ. Then all such networks differ only in the “positions” of their interior vertices. The set of such networks is denoted by [G, φ].

4.5. Steiner minimal trees existence

52

It is clear that all networks from Theorem 4.23 are obtained from the networks just described by identification, concordant with φ, of the set {1, . . . , 2n−2} with the sets of vertices of the graphs parameterizing the former networks. Thus, we have proved the following result. Corollary 4.24. Let X be an arbitrary metric space and M ⊂ X be a finite subset of X. Then { } smt(M ) = inf ∣Γ∣ : Γ ∈ [G, φ], G ∈ ℬn . Remark 4.25. Note that the set ℬn by which we minimize in Corollary 4.24 is finite, and the set [G, φ], by which minimization is also carried out, can be infinite. In some cases, it is easy to prove that the infimum in [G, φ] is attained for every G, which immediately implies that SMT(M ) ̸= ∅ because ℬn is finite. Recall that a metric space is called boundedly compact if each of its closed balls is compact. Equivalent condition: a subset is compact if and only if it is closed and bounded. We will present a technical result that is rather simple, but necessary in the future. Let f : X → ℝ be some function defined on a metric space. We fix an arbitrary point p ∈ X and for r ≥ 0 we set Fp (r) = inf x∈X\Br (p) f (x). Problem 4.14. Suppose that Fp (r) → ∞ as r → ∞. Prove that at any point q ∈ X it holds Fq (r) → ∞ as r → ∞. From Problem 4.14, the correctness of the following definition immediately follows. We say that a function f : X → ℝ, defined on a metric space, blows up at infinity if the corresponding function Fp (r) tends to infinity as r → ∞ for some and, therefore, for every choice of the point p ∈ X. Proposition 4.26. Let f : X → ℝ be a continuous function defined on a boundedly compact metric space X. Suppose that f blows up at infinity, then f is bounded below and attains its infimum. Proof. Indeed, since the corresponding function Fp (r) blows up at infinity, for some r0 it holds f (x) ≥ 0 for all x ∈ X \ Br0 (p). On the other hand, the function f is bounded below on the closed ball Br0 (p) due to its compactness. Thus, the lower boundedness of the function f is proved. Let f0 = inf x∈X f (x), and r be such that for all x ∈ X \ Br (p) we have f (x) ≥ f0 + 1. This means that the infimum of the function f is attained on the ball Br (p), and, due to the compactness of this ball, there exists a point x0 ∈ Br (p) for which f (x0 ) = f0 . Theorem 4.27. Let X be a boundedly compact metric space. Then for every nonempty finite M ⊂ X we have SMT(M ) ̸= ∅. Proof. We use Corollary 4.24. If #M = n, then we choose an arbitrary enumeration φ : {1, . . . , n} → M , as well as an arbitrary model full Steiner tree G ∈ ℬn . Then each network Γ ∈ [G, ( ) φ] is uniquely determined by the positions of its interior vertices, i.e., by the “vector” z = Γ(n + 1), . . . , Γ(2n − 2) ∈ X n−2 . The function ℓ(z) = ∣Γ∣ is continuous as the sum of continuous functions. In addition, this function blows up at infinity, therefore, by virtue of Proposition 4.26, it attains its infimum. Also, there are a finite number of such functions in the formula from Corollary 4.24, so the infimum from this formula is attained at a minimum point of one of these functions.

References to Chapter 4 [1] Ivanov A.O., Nikonov I.M., Tuzhilin A.A. Sets admitting connection by graphs of finite length. Sbornik: Mathematics, 2005, 196:6, 845–884. [2] Ivanov A.O., Nikonov I.M., Tuzhilin A.A. Minimal Spanning Trees on Infinite Sets. 2014, ArXiv:1403.383. [3] Ivanov A.O., Tuzhilin A.A. Minimal Networks. Steiner Problem and Its Generalizations. CRC Press, 1994. [4] Hwang F.K., Richards D.S., Winter P. The Steiner Tree Problem. Annals of Discrete Mathematics 53, NorthHolland: Elsevier, 1992. ISBN 0-444-89098-X. [5] Kruskal J.B. On the shortest spanning subtree of a graph and the traveling salesman problem, Proceedings of the American Mathematical Society, 1956, v. 7, 48–50. [6] Prim R.C. Shortest connection networks And some generalizations, Bell System Technical Journal, 1957, v. 36, N 6, pp. 1389–1401. [7] Fermat P. de, Ed. H.Tannery, ed., Oeuvres, v. 1, Paris 1891, Supplement: Paris 1922, p. 153. ˆ [8] Jarn´ık V., K¨ossler O. O minim´ aln´ıch grafech obsahuj´ıc´ıch n dan´ych bodu, Cas, Pˆestovˆan´ı Mat. (Essen) v. 63, pp. 223–235. [9] Courant R., Robbins H. What Is Mathematics?, Oxford University Press, 1941. [10] Melzak Z.A. On the problem of Steiner, Canad. Math. Bull., 1960, N 4, pp. 143–148. [11] Hwang F.K. A linear time algorithm for full Steiner trees, Oper. Res. Letter, 1986, N 5, pp. 235–237. [12] Warme D.M., Winter P., Zachariasen M. Exact Algorithms for Plane Steiner Tree Problems. A Computational Study, Technical Report DIKU-TR-98/11, Department of Computer Science, University of Copenhagen, DENMARK, 1998. [13] Garey M.R., Graham R.L., Johnson D.S. The complexity of computing Steiner minimal trees, SIAM J. Appl. Math., 1977, v. 32, N 4, pp. 835–859. [14] Gromov M. Filling Riemannian manifolds, J. Diff. Geom., 1983, v. 18, N 1, pp. 1–147. [15] Ivanov A.O., Tuzhilin A.A. One-dimensional Gromov minimal filling. arXiv:1101.0106v2 [math.MG] (http://arxiv.org).

53

Exercises to Chapter 4.

54

Exercises to Chapter 4 Exercise 4.1. Show that each forest is a disjoint union of trees that are components of this forest. Exercise 4.2. Let G = (V, E) be a connected graph, and ∼ an arbitrary equivalence relation on the set V . Show that the graph G/∼ is connected. Exercise 4.3. Let G = (V, E) be an arbitrary tree, and ∼ be an equivalence relation on the set V such that for each class Vi of this equivalence the subgraph G(Vi ) is a tree. Show that then G/∼ is a tree. Exercise 4.4. Let M be a finite metric space. Partition M into nonempty subsets M1 and M2 , and let vi ∈ Mi were chosen in such a way that ∣v1 v2 ∣ = ∣M1 M2 ∣. Prove that there exists a minimum spanning tree T ∈ MST(M ) such that v1 v2 ∈ E(T ). Exercise 4.5. Find all Steiner minimal trees for 3-point boundaries in the Euclidean plane. How many such trees exist for different boundaries? Exercise 4.6. Find all Steiner minimal trees for the vertices of a square in the Euclidean plane. How many such trees exist? Exercise ‖ ‖ 4.7. Find all Steiner minimal trees for 3-point boundaries in the plane with ℓ1 -metric defined by the norm ‖(x, y)‖ = ∣x∣ + ∣y∣. How many such trees exist for different boundaries? Exercise 4.8. Construct an example of a complete metric space and of some its finite subset M , such that there is no a Steiner minimal tree joining M . 1 Hint. Consider on the set X = {0, 1, 2, . . .} the distance function ∣mn∣ = 1 + m+n , m ̸= n. Prove that it is 3 a complete metric. Consider the space X with the complete metric generated by the norm ‖ · ‖∞ . Put M = {(1, 0, 0), (0, 1, 0), (0, 0, 1)}. Prove that SMT(M ) = ∅.

Exercise 4.9. Prove that for any metric space M it holds { } mf(M ) = inf ω(G) : G is a filling of M . Exercise 4.10. Prove that for any finite metric space M there exists a minimal filling. Exercise 4.11. Let M be a finite metric space with equal non-zero distances. Describe all minimal fillings of M . Exercise 4.12. Let G = (V, E) be a connected graph in a metric space X joining M ⊂ X, i.e., M ⊂ V ⊂ X. Put ∂G = M , and define a network Γ : V → X as the embedding mapping Γ : v ⊦→ v. Let V ′ be a set, ∼ an equivalence relation on V ′ , and suppose that V = V ′∼ . Denote by π : V ′ → V the canonical projection. Let G′ = (V ′ , E ′ ) be a connected graph such that G = G′ /∼ . Then the composition Γ′ = Γ ◦ π : V ′ → X is a network in X of the type G′ . Show that (1) ∣Γ′ ∣ ≥ ∣Γ∣; (2) if G′ = (V ′ , E ′ ) is a tree, and for each class Vi′ of the equivalence ∼ the subgraph G′ (Vi′ ) ⊂ G′ is a tree, then G is also a tree and ∣Γ′ ∣ = ∣Γ∣; (3) give an example in which G′ is a tree, G is not a tree, and ∣Γ′ ∣ = ∣Γ∣; (4) give an example in which G′ and G are trees, and ∣Γ′ ∣ > ∣Γ∣. Exercise 4.13. Prove that any finite tree G with n ≥ 2 boundary vertices contains at most n − 2 interior vertices. The equality holds exactly in the case when G is a full Steiner tree. Exercise 4.14. Let f : X → ℝ be some function defined on a metric space. We fix an arbitrary point p ∈ X and for r ≥ 0 we set Fp (r) = inf x∈X\Br (p) f (x). Suppose that Fp (r) → ∞ for r → ∞. Prove that at any point q ∈ X it holds Fq (r) → ∞ as r → ∞.

Chapter 5

Hausdorff distance. Schedule. Hausdorff distance, equivalence of three definitions, triangle inequality for Hausdorff distance, Hausdorff distance is a metric on the set of all closed bounded nonempty subsets, coincidence of Vietoris topology and metric topology generated by Hausdorff distance on the set of all compact subsets, limits theory for nonempty subsets, definition of lim sup and some its properties (equivalent definitions), definition of lim inf and some its properties (equivalent definitions), convergence w.r.t. Hausdorff distance (Hausdorff convergence) and calculating lim inf, Hausdorff convergence of singletons, definition of lim, Hausdorff convergence implies existence of lim, the cases of decreasing and increasing sequences that are Hausdorff converging, equivalence of Hausdorff convergence and existence of lim in compact spaces, the cases of decreasing and increasing sequences in compact spaces, convergence in complete metric spaces, simultaneous completeness (total boundness, compactness) of the original space and the hyperspace of all closed bounded nonempty subsets, inheritance of the property to be geodesic for compact space and the hyperspace of all its closed nonempty subsets.

Let X be an arbitrary metric space. For arbitrary nonempty sets A, B ⊂ X we put ( { } { }) d1H (A, B) = max sup ∣aB∣ : a ∈ A , sup ∣Ab∣ : b ∈ B , (5.1) { } (5.2) d2H (A, B) = inf r ∈ [0, ∞] : A ⊂ Br (B) & Br (A) ⊃ B , { } d3H (A, B) = inf r ∈ [0, ∞] : A ⊂ Ur (B) & Ur (A) ⊃ B . (5.3) Proposition 5.1. For nonempty subsets A, B ⊂ X of a metric space X we have d1H (A, B) = d2H (AB) = d3H (A, B). { } Proof. Put ri = diH (A, B). First, let r1 = ∞. Without loss of generality, we assume that sup ∣aB∣ : a ∈ A = ∞, but then neither A ⊂ Ur (B), nor A ⊂ Br (B) holds for any finite r > 0, therefore r2 = r3 = ∞. Now suppose that r1 < ∞, then for any r > r1 all inclusions in the definitions of r2 and r3 take place, therefore r2 and r3 are also finite. So, we have shown that either all three ri are infinite at the same time, or all of them are finite. It remains to analyze the case of finite ri . First, we show that r1 = r2 and r1 = r3 . Let us note that r2 ≤ r3 because A ⊂ Ur (B) implies A ⊂ Br (B) (and the same for A and B swaped). By definition of r1 , we have ∣aB∣ ≤ r1 for all a ∈ A, and ∣Ab∣ ≤ r1 for all b ∈ B, therefore for all r > r1 it holds A ⊂ Ur (B) and B ⊂ Ur (A), hence r3 ≤ r. Since r > r1 is arbitrary, we have r2 ≤ r3 ≤ r1 . On the other hand, for any r > r2 we have A ⊂ Br (B) and B ⊂ Br (A), so for any a ∈ A and b ∈ B it holds ∣aB∣ ≤ r and ∣Ab∣ ≤ r, therefore, r1 ≤ r and since r > r2 is arbitrary, we get r1 ≤ r2 ≤ r3 . The value diH (A, B) from Proposition 5.1 is denoted by dH (A, B). It is easy to see that dH is non-negative, symmetric, and dH (A, A) = 0 for any nonempty A ⊂ X, thus, dH is a generalized distance on the family 𝒫0 (X) of all nonempty subsets of a metric space X. The function dH is called the Hausdorff distance. Proposition 5.2. For an arbitrary metric space X, the function dH is a generalized pseudometric on 𝒫0 (X). Proof. It remains to prove the triangle inequality. Choose arbitrary A, B, C ∈ 𝒫0 (X) and set c = dH (A, B), a = dH (B, C), b = dH (A, C). We have to show that b ≤ c + a. If either c = ∞, or a = ∞, then the equality holds. Suppose now that the both c and a are finite. Choose arbitrary ( finite)r > c and s > a, then A ⊂ Ur (B) and B ⊂ Us (C) implies, by virtue of Item (4) of Problem 2.2, that A ⊂ Ur Us (C) ⊂ Ur+s (C). Similarly, Ur+s (A) ⊃ C. Thus, b ≤ r + s. Since r > c and s > a are arbitrary, we obtain what is required. Denote by ℋ(X) ⊂ 𝒫0 (X) the set of all nonempty closed bounded subsets of a metric space X. Theorem 5.3. For an arbitrary metric space X, the generalized pseudometric dH is a metric on ℋ(X).

55

5.1. Vietoris topology and Hausdorff metric

56

Proof. Choose arbitrary A, B ∈ ℋ(A). Since they are bounded, for some r > 0 we have A ⊂ Ur (B) and Ur (A) ⊃ B, hence dH (A, B) < ∞. Thus, dH is finite. If A ̸= B, then without loss of generality we can assume that there exists a ∈ A \ B, but since the set X \ B is open, there exists r > 0 such that Ur (a) ∩ B = ∅, in particular, ∣aB∣ ≥ r and, therefore, dH (A, B) ≥ r. Thus, dH is non-degenerate and, therefore, positively defined. Recall that by 𝒦(X) we denoted the collection of all nonempty compact subsets of a topological space X. Since each compact subset of a metric space is closed and bounded, we have 𝒦(X) ⊂ ℋ(X) and, thus, we get Corollary 5.4. For an arbitrary metric space X, the function dH is a metric on 𝒦(X). In what follows, when speaking about the distance in ℋ(X), we will always have in mind the Hausdorff metric, and for topology consider the corresponding metric one. Note that different authors use different notations for this hyperspace. We introduced the notation ℋ(X) by virtue of the fact that this is the largest natural set of subsets of a metric space on which the Hausdorff distance is defined. We present a few Hausdorff distance properties in the next exercise. Problem 5.1. Prove the following statements for an arbitrary metric space X. (1) Let f : X → 𝒫0 (X) be given by the formula f : x ⊦→ {x}, then f is an isometric embedding. ¯ = dH (A, ¯ B) = dH (A, ¯ B). ¯ (2) For any A, B ∈ 𝒫0 (X) we have dH (A, B) = dH (A, B) ¯ (3) For any A, B ∈ 𝒫0 (X) we have dH (A, B) = 0 if and only if A¯ = B. (4) If Y ⊂ X is an ε-net in A ⊂ X, then dH (A, Y ) ≤ ε. Proposition 5.5. Let X be an arbitrary metric space, and A, B ∈ 𝒫0 (X), r = dH (A, B). Then A ⊂ Br (B), B ⊂ Br (A), and for less r one of these inclusions fails. Thus, for A, B ∈ 𝒫0 (X) we can change inf to max in equality (5.2). Proof. The fact that smaller r do not fit follows directly from the definition of the Hausdorff distance. Let us prove the first part. Suppose the contrary, and let, say, B ̸⊂ Br (A). This means that there exists b ∈ B for which R := ∣bA∣ > r, thus for r < s < R we have B ̸⊂ Bs (A), which contradicts the definition of dH (A, B). Problem 5.2. Prove that for A, B ∈ 𝒦(X) there exist a ∈ A and b ∈ B such that dH (A, B) = ∣ab∣. Is it possible to change 𝒦(X) with ℋ(X)? Problem 5.3. Let X be an arbitrary metric space and A, B, A′ , B ′ ∈ ℋ(X) such that A′ ⊂ A and B ′ ⊂ B. Prove that dH (A ∪ B ′ , B ∪ A′ ) ≤ dH (A, B). Problem 5.4. Let X be an arbitrary metric space and A, B, C ∈ ℋ(X) such that C ⊂ B. Prove that dH (A, A ∪ C) ≤ dH (A, B).

5.1

Vietoris topology and Hausdorff metric

In Construction 1.5 we defined the Vietoris topology on the set of all nonempty subsets of a topological space X. Recall that a base of this topology is the family of sets ⟨U1 , . . . , Un ⟩ = {Y ⊂ X : Y ⊂ ∪ni=1 Ui , and Y ∩ Ui ̸= ∅ for all i = 1, . . . , n} over all possible finite families U1 , . . . , Un of open subsets of X. Theorem 5.6. Let (X, d) be an arbitrary metric space, then the metric topology on 𝒦(X) defined by the Hausdorff metric dH coincides with the Vietoris topology. In particular, for a boundedly compact X we have ℋ(X) = 𝒦(X), thus the above result holds if we change 𝒦(X) with ℋ(X).

5.1. Vietoris topology and Hausdorff metric

57

Proof. We use Problem 1.5 to prove that each open set in metric topology generated by dH is also open in Vietoris topology. To do that, it suffices to take arbitrary A ∈ 𝒦(X), r > 0, and an open neighborhood of A in { to construct } Vietoris topology that belongs to UrdH (A). Consider the family 𝒞 = Ur/3 (a) a∈A , then 𝒞 is an open cover of the compact set A, thus we can extract from 𝒞 a finite subcover {U1 , . . . , Un }. Notice that A ∈ ⟨U1 , . . . , Un ⟩ because A ⊂ ∪ni=1 Ui and A ∩ Ui ̸= ∅ for each i = 1, . . . , n. Further, we claim that ⟨U1 , . . . , Un ⟩ ⊂ UrdH (A). Indeed, take an arbitrary A′ ∈ ⟨U1 , . . . , Un ⟩, then A′ ⊂ Ur/3 (A). Further, for every a ∈ A there exist Ui ∋ a and a′ ∈ Ui ∩ A′ , hence ∣aa′ ∣ < 2r/3, so A ⊂ U2r/3 (A′ ), hence dH (A′ , A) ≤ 2r/3 < r. Now, consider an arbitrary ⟨U1 , . . . , Un ⟩ and any A ∈ ⟨U1 , . . . , Un ⟩. Let C = X \ ∪ni=1 Ui , then C is a closed set that does not intersect A. By Item (2) of Problem 2.2, the function x ⊦→ ∣xC∣ is continuous, thus its restriction onto the compact A is bounded and attains its minimal value at some point a ∈ A. Since C is closed and a ̸∈ C, we have ∣aC∣ > 0 and, therefore, r := ∣AC∣ > 0. Further, since A ∩ Ui ̸= ∅ for each i = 1, . . . , n, we can choose some point ai in it. By the definition of the topology on X, for every i there is ri such that Uri (ai ) ⊂ Ui . Put ρ = min{r, r1 , . . . , rn }. Then for each B ∈ UρdH (A), that is, for each B ∈ 𝒦(X) satisfying dH (A, B) < ρ, we have (1) B ⊂ ∪a∈A Uρ (a) ⊂ ∪ni=1 Ui , because ρ ≤ r, and (2) for each i = 1, . . . , n there is bi ∈ B such that bi ∈ Uρ (ai ) ⊂ Ui (because ρ ≤ ri ), therefore B intersects all Ui . It follows that UρdH (A) ⊂ ⟨U1 , . . . , Un ⟩, thus, < U1 , . . . , Un ⟩ is open in the metric topology generated by dH . The proof is over. Corollary 5.7. If two metrics on a set X induce the same topology on X, then also the same metric topology is induced on the corresponding spaces 𝒦(X). In other words, the metric topology of the space 𝒦(X) does not depend on the specific form of the metric on X, but only on the topology defined by this metric on X. For boundedly compact X the same is true for ℋ(X). The next example shows that the Vietoris and the metric topologies on ℋ(X) can be different also if X is bounded and locally compact metric space. Example 5.8. Let X = (−1, 0) ∪ (0, 1) ⊂ ℝ endowed with the metric induced from the standard metric on ℝ. Then A = (−1, 0) is a closed bounded subset of X, thus A ∈ ℋ(X). Since A is also an open set, then 𝒰 = ⟨A⟩ is a neighborhood of A in Vietoris topology consisting of all closed nonempty subsets of X which belong to A. However, for any ε > 0 the ball UεdH (A) ⊂ ℋ(X) contains the closed subset B = A ∪ {ε/2} such that B ̸∈ 𝒰, therefore UεdH (A) ̸⊂ 𝒰 and, thus, 𝒰 is not open in the metric topology generated by the Hausdorff distance. Example 5.9. In Example 5.8 the space X was not complete. Now we present an example of complete metric space where the Vietoris topology and the metric topology on ℋ(X) are different. ∑∞ Let X = ℓ2 be the space of all sequences (x1 , ∑ x2 , . . .) of real numbers such that i=1 x2 < ∞. For ξ = (x1 , x2 , . . .) ∈ ∞ ℓ2 and ξ ′ = (x′1 , x′2 , . . .) ∈ ℓ2 we put d(ξ, ξ ′ )2 = i=1 (xi − x′i )2 . Problem 5.5. Prove that d is a metric on ℓ2 , and that (ℓ2 , d) is a complete metric space. Denote by ei ∈ ℓ2 the sequence (x1 , x2 , . . .) such that xi = 1 and xj = 0 for all j ̸= i. Problem 5.6. Prove that A = {ei }∞ i=1 is a closed subset of ℓ2 , thus A ∈ ℋ(ℓ2 ). Now, we put Ui = U 2i1 (ei ) and let U = ∪∞ i=1 Ui , then U is an open neighborhood of A. Let 𝒰 = ⟨U ⟩ ⊂ ℋ(ℓ2 ) be the corresponding neighborhood of A ∈ ℋ(ℓ2 ) in the Vietoris topology. Choose an arbitrary ε > 0. We show that 1 UεdH (A) ̸⊂ 𝒰, thus, 𝒰 is not open in the metric topology generated by dH . Consider an arbitrary i such that 2i < ε, 1 and choose any 2i < δ < min{1, ε}. Put ξ = (1 − δ)ei and B = A ∪ {ξ} ∈ ℋ(ℓ2 ), then d(ξ, ei ) = δ < ε, therefore, √ 1 dH (A, B) < ε. On the other hand, since 2i < δ, then ξ ̸∈ Ui ; also, for any j ̸= i we have d(ξ, ej ) = 1 + (1 − δ)2 > 1 , thus ξ ̸∈ Uj , and, in account, ξ ̸∈ 𝒰 . 1 > 2j | | Problem 5.7. Let X = ℕ, and define two metrics on X: d1 (x, y) = 1 for any x ̸= y, and d2 (x, y) = | x1 − y1 | for any x, y. Then the corresponding metric topologies are both discrete. Prove that the corresponding Hausdorff metric generates non-homeomorphic topologies on CL(X).

5.2. Limits Theory

5.2

58

Limits Theory

For the future, we need one technical result. Let A1 , A2 , . . . be a sequence of nonempty subsets of a metric space X. Definition 5.10. Put

lim sup Ak = ∩∞ n=1 An ∪ An+1 ∪ · · ·

and call it the upper limit of the sequence Ak . Remark 5.11. Since lim sup Ak is equal to the intersection of closed sets, it is always closed (possibly empty). Proposition 5.12. We have { } lim sup Ak = x ∈ X : ∀ε > 0 it holds #{k : Uε (x) ∩ Ak ̸= ∅} = ∞ . ∞ ¯ Proof. Put Bn = ∪∞ k=n Ak and A = lim sup Ak = ∩n=1 Bn . Choose an arbitrary ε > 0. Let x ∈ A, then x is adherent point of each set Bn , so Bn ∩ Uε (x) ̸= ∅. If the condition Ak ∩ Uε (x) ̸= ∅ were satisfied only for a finite number k, then for some n they would have Bn ∩ Uε (x) = ∅, so x would not be the adherent point of the set Bn , a contradiction. Conversely, suppose that for every ε > 0 it holds #{k : Uε (x) ∩ Ak ̸= ∅} = ∞. Then for every n ∈ ℕ we have ¯n for any n, therefore, x ∈ A. Uε (x) ∩ Bn ̸= ∅, so x ∈ B

Corollary 5.13. We have lim sup Ak = {x ∈ X : there exists a sequence aik ∈ Aik that converges to x}. Proof. Let x ∈ lim sup Ak . By Proposition 5.12, for each k ∈ ℕ one can find aik ∈ Aik such that ∣aik x∣ < 1/k, and there are infinitely many such ik (for fixed k). Thus, we can compose an increasing sequence i1 < i2 < · · · for which aik ∈ Aik , ∣aik x∣ < 1/k, and therefore, aik → x. Conversely, let some sequence aik ∈ Aik converge to some x ∈ X. This means that for every ε > 0 there is n such that for any k ≥ n we have Uε (x) ∩ Aik ̸= ∅, therefore x ∈ lim sup Ak by virtue of Proposition 5.12. Remark 5.14. The upper limit may be empty. Example 5.15. Take as X the interval (0, 1) with the standard metric, and put Ak = {1/k}, k ∈ N , then one cannot select a convergent subsequence aik ; therefore, lim sup = ∅. A similar situation concerns the space X = ℝ and the sequence Ak = {k}. Thus, the space X can be totally bounded or complete, but it can contain sequences of subsets Ak ⊂ X for which lim sup Ak is empty. The construction of upper limit can be extended as follows. Definition 5.16. Put { } lim inf Ak = x ∈ X : ∀ε > 0 it holds #{k : Uε (x) ∩ Ak = ∅} < ∞ and call it the lower limit of the sequence Ak . Remark 5.17. Note that lim inf Ak ⊂ lim sup Ak . Proposition 5.18. We have lim inf Ak = {x ∈ X : there exists a sequence ak ∈ Ak that converges to x}. Proof. Let x ∈ lim inf Ak . By Definition 5.16, for each n ∈ ℕ one can find kn ∈ ℕ such that for all k ≥ kn there exists ank ∈ Ak for which ∣ank x∣ < 1/n. Without loss of generality, we assume that the sequence kn is strictly monotone, and construct a sequence a′k ∈ Ak , starting with arbitrary a1 , . . . , ak1 −1 , and then adding a1k1 , . . . , a1k2 −1 , a2k2 , . . . , a2k3 −1 , etc. It is clear that the sequence a′k ∈ Ak constructed in this way converges to x. Conversely, let some sequence ak ∈ Ak converge to some x ∈ X. This means that for every ε > 0 there is n such that for any k ≥ n we have Uε (x) ∩ Ak ̸= ∅, therefore x ∈ lim inf Ak by Definition 5.16. Problem 5.8. Prove that lim inf Ai is a closed subset of X.

5.2. Limits Theory

59

Definition 5.19. If X is a metric space, Ak ∈ 𝒫0 (X), k ∈ ℕ, A ∈ 𝒫0 (X), and Ak converges to A w.r.t. the Hausdorff d

distance, then we write this as Ak −−H → A. d Proposition 5.20. If Ak , A ∈ 𝒫0 (X) and Ak −−H → A, then A¯ = lim inf Ak , where A¯ is the closure of A.

¯ thus, without loss of generality, we can assume that A is a Proof. By Item (2) of Problem 5.1, we have Ak → A, closed subset of X. Now, we show that A always contains the limit point of each convergent sequence ak ∈ Ak , i.e., that lim inf Ak ⊂ A. Let ak → a. By definition, for any ε > 0 there exists n such that for all k ≥ n we have dH (A, Ak ) < ε/2 and ∣aak ∣ < ε/2. Hence, for such k we have Ak ⊂ Uε/2 (A), in particular, ak ∈ Uε/2 (A). The latter means that for such k there exist a′k ∈ A for which ∣a′k ak ∣ < ε/2 and, therefore, ∣a′k a∣ < ε. Thus, a′k → a and therefore, since A is closed, we have a ∈ A. To complete the proof, it remains to verify that each point a ∈ A is the limit point for some sequence ak ∈ Ak , i.e., that lim inf Ak ⊃ A. For each n there exists kn such that for all k ≥ kn it holds dH (Ak , A) < 1/n. For such k we have A ⊂ U1/n (Ak ), in particular, a ∈ U1/n (Ak ). It is clear that the sequence kn can be chosen strictly monotonic. For such kn , we take ak as follows: for 1 ≤ k ≤ k1 − 1 we choose ak ∈ Ak arbitrarily; for k1 ≤ k < k2 we select ak ∈ Ak so that ∣ak a∣ < 1; for k2 ≤ k < k3 , select ak ∈ Ak so that ∣ak a∣ < 1/2; etc. Such sequence converges to a, as required. Corollary 5.21. Let Ak = {ak } for all k. Then the sequence Ak converges w.r.t. dH if and only if the sequence ak d

converges. Moreover, if ak → a, then Ak −−H → {a}. Proof. By virtue of Item (1) of Problem 5.1, the mapping f : X → ℋ(X), f : x ⊦→ {x}, is isometric, therefore the d

convergence of ak → a implies the convergence of Ak −−H → {a}. dH

Conversely, if Ak −−→ A, then A ̸= ∅ by definition. Also, A cannot contain two different points. Indeed, if a, a′ ∈ A, a ̸= a′ , then for 0 < ε < ∣aa′ ∣/2 and sufficiently large k we have {a, a′ } ⊂ A ⊂ Uε (Ak ) = Uε (ak ), that is impossible. Thus, A = {a} and for any ε > 0 there exists n such that for any k ≥ n we have Ak ∈ Uε (A), i.e., ak ∈ Uε (a), thus ak → a. Definition 5.22. If the upper and lower limits of a sequence A1 , A2 , . . . are nonempty and equal to each other, then the sequence A1 , A2 , . . . is said to have a limit, which is denoted by lim Ak . We write this as Ak → A. Let us discuss what the existence of the limit lim Ak means. d Theorem 5.23. If Ak , A ∈ 𝒫0 (X) and Ak −−H → A, then there exists lim Ak and A¯ = lim Ak .

Proof. By Proposition 5.20 and Remark 5.17, we have A¯ = lim inf Ak ⊂ lim sup Ak . Therefore, it suffices to show that ¯ lim sup Ak ⊂ A. ¯ To do this, in turn, it is Let x ∈ lim sup Ak . We show that x is an adherent point for A and, therefore, x ∈ A. enough to show that for an arbitrary ε > 0 the ball Uε (x) intersects A. d

Since Ak −−H → A, there exists n such that for every k ≥ n we have Ak ⊂ U (A, ε/2). By Proposition 5.18, there exists k ≥ n for which Uε/2 (x) ∩ Ak ̸= ∅, i.e., there exists ak ∈ Ak such that ∣xak ∣ < ε/2. Since Ak ⊂ U (A, ε/2), there exists a ∈ A for which ∣ak a∣ < ε/2, thus ∣xa∣ < ε and, hence Uε (x) ∩ A ̸= ∅. Is the converse statement to that of Theorem 5.23 true as well? The following example demonstrates that this is not the case in general, also when Ak , A ∈ ℋ(X). Example 5.24. Let X be the interval (0, 3) ⊂ ℝ with the standard distance function. Put Ak = {2, 1/k}. Then d

→ A, lim sup Ak = lim inf Ak = {2}, so that lim Ak exists and is equal to {2}. If for some A ∈ ℋ(X) we have Ak −−H then, by virtue of Theorem 5.23, we would have A = {2}. However, for ε = 1 there is no Ak that belongs to Uε (A), so the sequence Ak diverges. The case of X = ℝ and Ak = {0, k}, for which lim inf Ak = lim sup Ak = {0}, but the sequence Ak diverges, is similarly analyzed. Thus, the existence of a limit does not imply the convergence of the sequence Ak ∈ ℋ(X) for either totally bounded or complete X. We give some corollaries. d → A, and for all k we have Ak ⊃ Ak+1 , then A¯ = ∩k A¯k . Corollary 5.25. If Ak , A ∈ 𝒫0 (X), Ak −−H

5.2. Limits Theory

60

Proof. By Theorem 5.23, we have ∞ ¯ A¯ = lim Ak = lim sup Ak = ∩∞ k=1 Ak ∪ Ak+1 ∪ · · · = ∩k=1 Ak .

d Corollary 5.26. If Ak , A ∈ 𝒫0 (X), Ak −−H → A, and for all k we have Ak ⊂ Ak+1 , then A¯ = ∪k Ak .

Proof. By Theorem 5.23, we have A¯ = lim sup Ak = A1 ∪ A2 ∪ · · ·, since all the sets Bk = Ak ∪ Ak+1 ∪ · · · coincide.

5.2.1

Limits for compact X

Example 5.15 shows that a upper limit can be empty. However, for compact X this is no longer the case. Proposition 5.27. For any compact metric space X and any sequence of nonempty Ak ⊂ X we have lim sup Ak ̸= ∅. Proof. Choose an arbitrary sequence ak ∈ Ak , then it contains a subsequence that converges to some x ∈ X. By Corollary 5.13, we have x ∈ lim sup Ak , so the upper limit of this sequence is not empty. We will need the following technical result in the future. Proposition 5.28. For any compact metric space X and any sequence of nonempty Ak ⊂ X the following statement holds: for any ε > 0 there exists n such that for any k ≥ n Ak ⊂ Uε (lim sup Ak ) and lim inf Ak ⊂ Uε (Ak ). Proof. To start with, we prove the first inclusion. Put A = lim sup Ak , then, by Proposition 5.27, we have A ̸= ∅. Now, we suppose to the contrary that for some sequence Ak there exists an ε > 0 and a subsequence Aik such that Aik ̸⊂ Uε (A). The latter means that in every Aik there is a point aik for which ∣aik A∣ ≥ ε. Since X is compact, there exists a subsequence in the sequence aik that converges to some x ∈ X. Then, by Corollary 5.13, we have x ∈ A, which implies that for some sufficiently large k it holds ∣aik A∣ < ε, a contradiction. We now prove the second inclusion. Put A = lim inf Ak and assume the contrary, i.e., that there exists ε > 0 and a sequence i1 < i2 < · · · for which A ̸⊂ Uε (Aik ), in particular, A ̸= ∅. The latter is equivalent to the existence of a′ik ∈ A such that a′ik ̸∈ Uε (Aik ). By Problem 5.8, the set A is closed, thus A is compact as a closed subset of a compact metric space, therefore, in the sequence ai′ k one can choose a subsequence converging to some a ∈ A. Without loss of generality, we assume that this subsequence coincides with the entire sequence ai′ k . Since A = lim inf Ak , by Proposition 5.18 there exists a sequence ak ∈ Ak converging to a, thus ∣a′ik aik ∣ → 0, so for sufficiently large k we have a′ik ∈ Uε (Aik ), a contradiction. Corollary 5.29. For any compact metric space X and any sequence of nonempty Ak ⊂ X, the Ak converges w.r.t. d dH if and only if there exists lim Ak . Moreover, if Ak −−H → A, then A¯ = lim Ak . Proof. By Theorem 5.23, the convergence of the sequence Ak to some A implies the existence of lim Ak and that A¯ = lim Ak . Thus, it remains to prove the converse. d → A. By Proposition 5.28, for any ε > 0 there exists n such that So, let lim Ak =: A exist. We will show that Ak −−H d → A, as required. for any k ≥ n we have Ak ⊂ Uε (lim sup Ak ) = Uε (A) and A = lim inf Ak ⊂ Uε (Ak ), hence Ak −−H Let us demonstrate how the theory of limits works in the compact case. Corollary 5.30. For an arbitrary, not necessarily compact metric space X, and each decreasing sequence of nonempty d → A. sets A1 ⊃ A2 ⊃ · · · starting from compact A1 , we have A := ∩k A¯k ̸= ∅ and Ak −−H Proof. Without loss of generality, we assume that X = A1 , then X is a compact space. By Proposition 5.27, we have lim sup Ak ̸= ∅, however, by definition of lim sup, we have lim sup Ak = A, thus A is not empty. On the other hand, each point a ∈ A is the limit of the constant sequence ak = a ∈ A¯k , and for each k there exists a′k ∈ Ak such that ∣ak a′k ∣ < 1/k, thus a′k → a as well and, hence, a ∈ lim inf Ak . Since a ∈ A is arbitrary then lim sup Ak ⊂ lim inf Ak . However, lim inf Ak ⊂ lim sup Ak , thus lim inf Ak = A = lim sup Ak , and it remains to use Corollary 5.29.

5.2. Limits Theory

61

Remark 5.31. In Corollary 5.30 it may happen that ∩∞ k=1 Ak = ∅. Let X = ℝ, A1 = [0, 1], Ak = (0, 1/k) for k > 1, then ∩∞ A = ∅, however, k=1 k ∞ ¯ lim sup Ak = ∩∞ k=1 Ak = ∩k=1 [0, 1/k] = {0} ̸= ∅. Corollary 5.32. For an arbitrary, not necessarily compact, space X, if for an increasing sequence of nonempty sets dH A1 ⊂ A2 ⊂ · · · the set A = ∪∞ −→ A. k=1 Ak is compact, then Ak − Proof. Again, without loss of generality, we assume that X = A. By definition of lim sup, we have lim sup Ak = A, therefore, by Corollary 5.13, for every a ∈ A there exists a sequence aik ∈ Aik , i1 < i2 < · · · , converging to a. Notice that each aik also belongs to all Ap for p > ik , therefore this sequence can be extended to a sequence ai ∈ Ai converging to a, so, by Proposition 5.18, we have a ∈ lim inf Ak , hence lim inf Ak = lim sup Ak = A, and it remains to use Corollary 5.29.

5.2.2

Limits for complete X

We give a series of results for fundamental sequences of subsets of complete metric spaces. It will be convenient for us to use the following construction. Construction 5.1. Let Y be an arbitrary generalized pseudometric space and ω = (y1 , y2 , . . .) be some sequence in it. Put dn (ω) = sup ∣yp yq ∣. p,q≥n

Note that dn (ω) is a non-negative non-increasing sequence, therefore, it has a limit d(ω). Proposition 5.33. The sequence ω = (y1 , y2 , . . .) is fundamental if and only if d(ω) = 0. Proof. If y is fundamental, then for every ε > 0 there exists n such that for any p, q ≥ n we have ∣yp yq ∣ < ε, so dn (ω) ≤ ε and, therefore, d(ω) = 0. Conversely, if d(ω) = 0, then for any ε > 0 starting from some n it holds dn (ω) < ε, so for such n and any p, q ≥ n we have ∣yp yq ∣ < ε, which means that the sequence ω is fundamental. Now we present an important technical result on the fundamental sequences of an arbitrary metric space X. Proposition 5.34. Let Ω = (A1 , A2 , . . .), Ak ∈ 𝒫0 (X), be a fundamental sequence. Choose arbitrary a1 ∈ A1 and ε > 0. Then there exists a sequence i1 = 1 < i2 < i3 < · · · of natural numbers and a fundamental sequence ω = (ai1 , ai2 , ai3 , . . .), aik ∈ Aik , such that d1 (ω) < d1 (Ω) + ε. Proof. Proposition 5.33 implies dk (Ω) → 0, so there exists i2 ∈ ℕ, i2 > i1 , such that di2 (Ω) < ε/8. Since dH (Ai1 , Ai2 ) ≤ d1 (Ω), there exists ai2 ∈ Ai2 for which ∣ai1 ai2 ∣ < d1 (Ω) + ε/4. Next, we find a natural i3 > i2 such that di3 (Ω) < ε/16, as well as ai3 ∈ Ai3 for which ∣ai2 ai3 ∣ < ε/8. Continuing this process, we construct a sequence ω = (ai1 = a1 , ai2 , ai3 , . . .), aik ∈ Aik , for which ∣aik−1 aik ∣ < ε/2k for k ≥ 3. By the triangle inequality, for any p, q ∈ ℕ, 2 ≤ p ≤ q, we have ∣aip aiq ∣ < ε/2p , therefore, the sequence ω is fundamental. On the other hand, for any p, q ∈ ℕ we have ∣aip aiq ∣ < d1 (Ω) + ε/2, hence d1 (ω) ≤ d1 (Ω) + ε/2 < d1 (Ω) + ε. Example 5.15 demonstrates that for a complete X a upper limit can be empty. However, for a fundamental sequence Ak this is no longer the case. Proposition 5.35. Let X be a complete space and Ak ∈ 𝒫0 (X) be a fundamental sequence, then lim sup Ak ̸= ∅. Proof. We choose an arbitrary a1 ∈ A1 , then, by Proposition 5.34, there exists a fundamental sequence (ai1 = a1 , ai2 , ai3 , . . .) such that aik ∈ Aik . Since X is a complete space, this sequence converges to some x ∈ X. Proposition 5.12 implies that x ∈ lim sup Ak , therefore lim sup Ak ̸= ∅. Now we will give an analogue of Proposition 5.28. Theorem 5.36. Let X be an arbitrary metric space, Ak ∈ 𝒫0 (X) a fundamental sequence, and A = lim sup Ak (possibly empty). Then for any ε > 0 there exists n such that for any k ≥ n (1) A ⊂ Uε (Ak ); (2) if the space X is complete, then A ̸= ∅ and Ak ⊂ Uε (A).

5.2. Limits Theory

62 d

Therefore, in complete X we have Ak −−H → A and, hence, there exists lim Ak . Proof. Since the sequence Ω′ = (A1 , A2 , . . .) is fundamental, Proposition 5.33 implies that dk (Ω′ ) → 0. Choose n such that dn (Ω′ ) < ε/2. Consider an arbitrary k ≥ n and put Ω = (Ak , Ak+1 , . . .). Since the sequence di (Ω′ ) is monotonic, we have d1 (Ω) < ε/2. (1) If A = ∅, then the inclusion is proved. Now let A ̸= ∅. We choose an arbitrary a ∈ A, then, by Corollary 5.13, there exists a sequence ω = (ai1 , ai2 , . . .) converging to a so that for some p ∈ ℕ we have ∣aip a∣ < ε/2. On the other hand, since d1 (Ω) < ε/2, then dH (Ak , Aip ) < ε/2, therefore there exists ak ∈ Ak for which ∣ak aip ∣ < ε/2. By the triangle inequality, ∣ak a∣ < ε, therefore, because a is arbitrary, we have A ⊂ Uε (Ak ). (2) Let us apply Proposition 5.34 to the sequence Ω. By this proposition, each ak ∈ Ak is included in some fundamental sequence ω for which d1 (ω) < d1 (Ω) + ε/2 < ε. Due to the completeness of the space X and Proposition 5.12, the sequence ω converges to some element a ∈ A, hence, taking into account the previous estimate on d1 (ω), we get ∣ak a∣ < ε, therefore Ak ⊂ Uε (A). The existence of the limit follows from Theorem 5.23. Corollary 5.37. A metric space X is complete if and only if ℋ(X) is complete. Proof. The completeness of ℋ(X) for complete X is proved in Theorem 5.36. Conversely, let ℋ(X) be a complete space. Consider an arbitrary fundamental sequence ak in X; then, due to the isometry of the mapping x ⊦→ {x}, the sequence Ak = {ak } ∈ ℋ(X) is also fundamental. Since ℋ(X) is complete, the sequence Ak converges to some A ∈ ℋ(X). However, by Corollary 5.21, the sequence ak also converges.

5.2.3

Inheritance of total boundedness and compactness

The purpose of this section is to prove the following theorem. Theorem 5.38. Let X be an arbitrary metric space. Then the following properties are simultaneously present or not in both X and ℋ(X): (1) completeness (Khan), (2) total boundedness, (3) compactness (Hausdorff, Blaschke). Proof. (1) This is Corollary 5.37. (2) Let X be totally bounded. The total boundedness of ℋ(X) immediately follows from the following lemma and problem. Lemma 5.39. Let W be an arbitrary metric space, ε > 0, and Y ⊂ W be some ε-net, then for any δ > ε the set 𝒫0 (Y ) is a δ-net in 𝒫0 (W ). Proof. Indeed, we choose an arbitrary F ∈ 𝒫0 (W ) and put M = {y ∈ Y : ∣yF ∣ < ε}, thus for every y ∈ M there exists w ∈ F such that ∣yw∣ < ε, therefore, M ⊂ Uε (F ). Further, since Y is an ε-net for W , then for every w ∈ F there exists y ∈ Y for which ∣yw∣ < ε, hence ∣yF ∣ ≤ ∣yw∣ < ε and, therefore, y ∈ M . In particular, M ̸= ∅, i.e., M ∈ 𝒫0 (W ). Since w is arbitrary, we get F ⊂ Uε (M ), therefore, dH (F, M ) ≤ ε < δ. Problem 5.9. If W is a metric space, Y ⊂ W is an ε-net, and Z ⊂ W is not empty. Then Z contains an (2ε)-net S such that #S ≤ #Y . Conversely, let ℋ(X) be totally bounded. The total boundedness of X follows from the following lemma. Lemma 5.40. Let W be an arbitrary metric space, ε > 0, and 𝒴 ⊂ ℋ(W ) be some ε-net. Denote by M ⊂ W the set obtained by choosing at each element Y ∈ 𝒴 any one point p(Y ). Then M is an ε-net in W (of the same cardinality as 𝒴). ( ) Proof. We| choose| an arbitrary w ∈ W , then there exists Y ∈ 𝒴 such that dH {w}, Y < ε. Hence Y ⊂ Uε (w) and, therefore, |p(Y )w| < ε. (3) This follows from the previous section and Theorem 2.24.

5.3. Inheritance of geodesic

5.3

63

Inheritance of geodesic

In this section we present the results from [4]. Theorem 5.41. Let X be a compact metric space, then ℋ(X) is geodesic if and only if X is geodesic. Proof. First, let X be a geodesic space. To prove that ℋ(X) is geodesic, we use Theorems 5.38 and 3.47. The first of them claims that ℋ(X) is compact and, therefore, complete; the second is that ℋ(X) is geodesic if for any its elements there exists a midpoints. That is what we will prove. Consider arbitrary A, B ∈ ℋ(X), r := dH (A, B), put C = Br/2 (A) ∩ Br/2 (B), and show that C is a midpoint between A and B. Indeed, C is closed as the intersection of closed sets. We show that C ̸= ∅. Choose an arbitrary a ∈ A. By Proposition 5.5, A ⊂ Br (B), therefore, since B is compact, there exists b ∈ B such that ∣ab∣ ≤ r. Since the space X is geodesic, for the points a and b there is a midpoint c, then c ∈ C, which proves the nonemptiness of C and, at the same time, that A ⊂ Br/2 (C). In addition, C ⊂ Br/2 (A) by construction, hence dH (A, C) ≤ r/2. Similarly, dH (B, C) ≤ r/2. Since r = dH (A, B) ≤ dH (A, C) + dH (C, B), then dH (A, C) = dH (C, B) = r/2, so C is a midpoint between A and B. We again use Theorem 3.47 and conclude that the space ℋ(X) is geodesic. Conversely, let the space ℋ(X) be geodesic. We choose arbitrary points a, b ∈ X, r := ∣ab∣, then, due to the geodesicity of ( ) ℋ(X), it contains ( )C, which ( is a )midpoint between {a} and {b}. By Items (1) of Problem 5.1, we have dH {a}, {b} = r, so dH {a}, C = dH C, {b} = r/2, hence C ⊂ Br/2 (a) ∩ Br/2 (b). We choose an arbitrary point c ∈ C, then ∣ac∣ ≤ r/2 and ∣cb∣ ≤ r/2; since r = ∣ab∣ ≤ ∣ac∣ + ∣cb∣, then ∣ac∣ = ∣cb∣ = r/2 and, therefore, c ∈ X is a midpoint between a and b. It remains to use Theorem 3.47. We give a few examples to demonstrate that for the space ℋ(X) was geodesic, neither the total boundedness of X, nor its completeness is sufficient individually. Problem 5.10. Denote by X a subset of the space ℝ3 , which in the standard Cartesian coordinates has the form X = {x2 + y 2 < 1, z ≤ 1} ∪ {x2 + y 2 = 1, z = −1, x ∈ ℚ} ∪ {x2 + y 2 = 1, z = 1, x ∈ ℝ \ ℚ}. Next, put A = {x2 + y 2 = 1, z = −1, x ∈ ℚ} and B = {x2 + y 2 = 1, z = 1, x ∈ ℝ \ ℚ}. Show that X is a geodesic space, but A and B are not connected by a shortest curve (there is no midpoint between them), so ℋ(X) is not geodesic. Problem 5.11. Consider the square X = [−1, 1] × [−1, 1] and introduce the following distance function on X: ( ) d (x, y), (x′ , y ′ ) = ∣x − x′ ∣ + min{(1 − y) + (1 − y ′ ), (y + 1) + (y ′ + 1)}. Show that (X, d) is a complete geodesic space, A = [−1, 1] × −1 and B = [−1, 1] × {1} are compacts such that there is midpoint between them in 𝒦(X), i.e., 𝒦(X) is not geodesic. Problem 5.12. Consider two segments I0 = [0, 1] × 0 and I1 = [0, 1] × {1} in the Euclidean plane ℝ2 , and let A ⊂ I0 be the subset of all points with rational abscissae, and B ⊂ I1 the subset of all points with irrational abscissae. For each a ∈ A and b ∈ B we denote by r(a, b) the distance between a and b in ℝ2 . Now, for each a ∈ A, b ∈ B consider a circle S(a, b) or the length 3, and denote by the same a and b some points of S(a, b) bounding an arc α(a.b) of the length r(a, b). Let C(a, b) ∈ S(a, b) be the middle point of the act complement to α(a, b). Denote by X the quotient space obtained from all S(a, b) by identifying their points C(a, b). Also denote by A and B the subsets of X consisting of all points a and all points b, respectively. Prove that X is complete and geodesic, A, B ∈ ℋ(X), however, there is no a midpoint between A and B, thus, ℋ(X) is complete but not geodesic.

References to Chapter 5 [1] D.Burago, Yu.Burago, S.Ivanov, A Course in Metric Geometry. Graduate Studies in Mathematics, vol.33, A.M.S., Providence, RI, 2001. [2] S.Nadler, Hyperspaces of Sets, 1978. [3] A.Illanes, S.Nadler, Hyperspaces, 1999. [4] Bryant V.W. The convexity of the subset space of a metric space, Compositio Mathematica, 1970, vol. 22, N 4, p. 383–385.

64

Exercises to Chapter 5.

65

Exercises to Chapter 5 Exercise 5.1. Prove the following statements for an arbitrary metric space X. (1) Let f : X → ℋ(X) be given by the formula f : x ⊦→ {x}, then f is an isometric embedding. ¯ = dH (A, ¯ B) = dH (A, ¯ B). ¯ (2) For any A, B ∈ 𝒫0 (X) we have dH (A, B) = dH (A, B) ¯ (3) For any A, B ∈ 𝒫0 (X) we have dH (A, B) = 0 if and only if A¯ = B. (4) If Y ⊂ X is an ε-net in A ⊂ X, then dH (A, Y ) ≤ ε. Exercise 5.2. Prove that for A, B ∈ 𝒦(X) there exist a ∈ A and b ∈ B such that dH (A, B) = ∣ab∣. Is it possible to change 𝒦(X) with ℋ(X)? Exercise 5.3. Let X be an arbitrary metric space and A, B, A′ , B ′ ∈ ℋ(X) such that A′ ⊂ A and B ′ ⊂ B. Prove that dH (A ∪ B ′ , B ∪ A′ ) ≤ dH (A, B). Exercise 5.4. Let X be an arbitrary metric space and A, B, C ∈ ℋ(X) such that C ⊂ B. Prove that dH (A, A ∪ C) ≤ dH (A, B). Exercise 5.5. Prove that lim inf Ai is a closed subset of X. ∑∞ Exercise 5.6. Let ℓ2 denote the space of all sequences (x1 , x2 , ∑ . . .) of real numbers such that i=1 x2 < ∞. For ∞ ξ = (x1 , x2 , . . .) ∈ ℓ2 and ξ ′ = (x′1 , x′2 , . . .) ∈ ℓ2 we put d(ξ, ξ ′ )2 = i=1 (xi − x′i )2 . Prove that d is a metric on ℓ2 , and that (ℓ2 , d) is a complete metric space. Exercise 5.7. Denote by ei ∈ ℓ2 the sequence (x1 , x2 , . . .) such that xi = 1 and xj = 0 for all j ̸= i. Prove that A = {ei }∞ i=1 is a closed subset of ℓ2 , thus A ∈ ℋ(ℓ2 ). | | Exercise 5.8. Let X = ℕ, and define two metrics on X: d1 (x, y) = 1 for any x ̸= y, and d2 (x, y) = | x1 − y1 | for any x, y. Then the corresponding metric topologies are both discrete. Prove that the corresponding Hausdorff metric generates non-homeomorphic topologies on CL(X). Exercise 5.9. Suppose that in a sequence Ai ∈ ℋ(X) all Ai are connected, Ai → A ∈ ℋ(X), and lim inf Ai ̸= ∅. Prove that lim sup Ai is connected. Exercise 5.10. Denote by X a subset of the space ℝ3 , which in the standard Cartesian coordinates has the form X = {x2 + y 2 < 1, z ≤ 1} ∪ {x2 + y 2 = 1, z = −1, x ∈ ℚ} ∪ {x2 + y 2 = 1, z = 1, x ∈ ℝ \ ℚ}. Next, put A = {x2 + y 2 = 1, z = −1, x ∈ ℚ} and B = {x2 + y 2 = 1, z = 1, x ∈ ℝ \ ℚ}. Show that X is a geodesic space, but A and B are not connected by a shortest curve (there is no midpoint between them), so ℋ(X) is not geodesic. Exercise 5.11. Consider the square X = [−1, 1] × [−1, 1] and introduce the following distance function on X: ( ) d (x, y), (x′ , y ′ ) = ∣x − x′ ∣ + min{(1 − y) + (1 − y ′ ), (y + 1) + (y ′ + 1)}. Show that (X, d) is a complete geodesic space, A = [−1, 1] × −1 and B = [−1, 1] × {1} are compacts such that there is no midpoint between them in 𝒦(X), i.e., 𝒦(X) is not geodesic. Exercise 5.12. Consider two segments I0 = [0, 1] × 0 and I1 = [0, 1] × {1} in the Euclidean plane ℝ2 , and let A ⊂ I0 be the subset of all points with rational abscissae, and B ⊂ I1 the subset of all points with irrational abscissae. For each a ∈ A and b ∈ B we denote by r(a, b) the distance between a and b in ℝ2 . Now, for each a ∈ A, b ∈ B consider a circle S(a, b) or the length 3, and denote by the same a and b some points of S(a, b) bounding an arc α(a.b) of the length r(a, b). Let C(a, b) ∈ S(a, b) be the middle point of the act complement to α(a, b). Denote by X the quotient space obtained from all S(a, b) by identifying their points C(a, b). Also denote by A and B the subsets of X consisting of all points a and all points b, respectively. Prove that X is complete and geodesic, A, B ∈ ℋ(X), however, there is no a midpoint between A and B, thus, ℋ(X) is complete but not geodesic.

Chapter 6

Gromov–Hausdorff Distance. Schedule. Realization of a pair of metric space, Gromov–Hausdorff distance, admissible metric on disjoin union of metric spaces, calculation of Gromov–Hausdorff distance in terms of admissible metrics, triangle inequality for Gromov–Hausdorff distance, positive definiteness of Gromov– Hausdorff distance for isometry classes of compact spaces, counterexample for boundedly compact spaces, Gromov–Hausdorff distance for separable spaces in terms of their isometric images in ℓ∞ , relations, distortion of a relation between metric spaces, correspondences, Gromov–Hausdorff distance in terms of correspondences, ε-isometries and Gromov–Hausdorff distance, irreducible correspondences between sets, existence of irreducible correspondences, irreducible correspondences as bijections of partitions of sets, Gromov–Hausdorff distance in terms of irreducible correspondences, examples: Gromov–Hausdorff distance between 2- or 3-point metric spaces, simple general properties of Gromov–Hausdorff distance, Gromov– Hausdorff convergence, inheritance of metric and topological properties while Gromov–Hausdorff convergence.

In this section, we will study the Gromov–Hausdorff distance, see [1] and [2] for more details. All metric spaces are supposed to be nonempty. Construction 6.1. Let X and Y be metric spaces. A triple (X ′ , Y ′ , Z) consisting of a metric space Z and its two subsets X ′ and Y ′ , which are isometric respectively to X and Y , will be called a realization of the pair (X, Y ). We put { } dGH (X, Y ) = inf r ∈ ℝ : there exists a realization (X ′ , Y ′ , Z) of (X, Y ) such that dH (X ′ , Y ′ ) ≤ r . Remark 6.1. The value dGH (X, Y ) is evidently non-negative, symmetric, and dGH (X, X) = 0 for any metric space X. Thus, dGH is a distance function on each set of metric spaces. Definition 6.2. The value dGH (X, Y ) from Construction 6.1 is called the Gromov–Hausdorff distance between the metric spaces X and Y . Problem 6.1. Prove that for any metric spaces X and Y there exists a realization of (X, Y ). Remark 6.3. In some monographs the Gromov–Hausdorff distance dGH (X, Y ) is defined as the infimum of the values dH (X ′ , Y ′ ) over all realizations (X ′ , Y ′ , Z) of the pair (X, Y ). However, we give such, at first glance, a more technically complicated definition of dGH , to avoid the Cantor paradox, because the family of all realizations is no longer a set. Introducing r and talking about the existence of a realization, we thereby get rid of the need to consider all the realizations. Notation. In what follows, it will sometimes be necessary for us to explicitly indicate the space X in which a particular metric is considered, as well as the corresponding Hausdorff metric. Thus, the distance between points x, x′ ∈ X will sometimes be denoted by ∣xx′ ∣X , and the corresponding Hausdorff distance between nonempty subsets A and B of the space X by dX H (A, B). In addition, if ρ is some metric on X, then the Hausdorff distance generated by this metric will sometimes be denoted by ρH . It turns out that, to define the Gromov–Hausdorff distance, it suffices to consider only metric spaces of the form (X ⊔ Y, ρ), where ρ extends the original metrics of X and Y . Such ρ will be called an admissible metric for X and Y , and the set of all admissible metrics for given X and Y will be denoted by 𝒟(X, Y ). Problem 6.2. Prove that for any metric spaces X and Y there exists at least one admissible metric, i.e., the set 𝒟(X, Y ) is not empty. Theorem 6.4. For any metric spaces X and Y , we have { } (6.1) dGH (X, Y ) = inf ρH (X, Y ) : ρ ∈ 𝒟(X, Y ) . 66

Chapter 6. Gromov–Hausdorff Distance.

67

Proof. Denote by d′GH (X, Y ) the right hand side of the equation (6.1). Then dGH (X, Y ) ≤ d′GH (X, Y ), because for every ρ ∈ 𝒟(X, Y ) and Z = (X ⊔ Y, ρ), the triple (X, Y, Z) is a realization of the pair (X, Y ) with the same Hausdorff distance ρH (X, Y ). We now prove the inverse inequality. By Construction 6.1, for any ε > 0 there exists a realization (X ′ , Y ′ , Z) of the pair (X, Y ) such that dH (X ′ , Y ′ ) ≤ dGH (X, Y ) + ε. If X ′ and Y ′ do not intersect each other, then we restrict the metric from Z to X ′ ⊔ Y ′ , and, after identifying X ′ and Y ′ with X and Y , respectively, we obtain an admissible metric ρ on| X ⊔ Y for which ρH (X, Y ) ≤ dGH (X, Y ) + ε. | If X ′ ∩ Y ′ ̸= ∅, we replace Z by Z × ℝ with the metric |(z, t)(z ′ , s)| = ∣zz ′ ∣ + ∣ts∣, also replace X ′ and Y ′ by the sets X ′′ = X ′ × {0} and Y ′′ = Y ′ × {ε}, respectively, then (X ′′ , Y ′′ , Z × ℝ) is a realization of the (X, Y ) such that X ′′ ∩ Y ′′ = ∅, dH (X ′′ , Y ′′ ) ≤ dGH (X, Y ) + 2ε, and aging we can restrict the metric of Z × ℝ to X ′′ ⊔ Y ′′ , identify X with X ′′ , Y ′′ with Y , and obtain an admissible metric ρ with ρH (X, Y ) ≤ dGH (X, Y ) + 2ε. Since ε is arbitrary, we get d′GH (X, Y ) ≤ dGH (X, Y ). Remark 6.5. If X and Y are subsets of some metric space, then dGH (X, Y ) ≤ dH (X, Y ). In particular, if dH (X, Y ) = 0, then dGH (X, Y ) = 0, so the Gromov–Hausdorff distance, like the Hausdorff distance, is not positively defined: for example, the Gromov–Hausdorff distance between the segment [0, 1] and the interval (0, 1) vanishes. However, if we restrict ourselves to compact metric spaces, then dGH (X, Y ) = 0 if and only if X and Y are isometric (see below for the proof). Proposition 6.6. The function dGH satisfies the triangle inequality. Proof. Choose arbitrary metric spaces X, Y , and Z. We have to show that dGH (X, Z) ≤ dGH (X, Y ) + dGH (Y, Z). Choose any admissible metrics μ ∈ 𝒟(X, Y ) and ν ∈ 𝒟(Y, Z). Recall that in Example 2.15 we have defined the gluing U ⊔f V of metric spaces U and V over a mapping f : W → V , where W ⊂ U . Also, Problem 2.15 states that if f is isometric, then the restrictions of the metric of U ⊔f V to U and V coincides with the original ones. We apply this result to U = X ⊔ Y and V = Y ⊔ Z with the metrics μ and ν, respectively, and to W = Y and f = id : Y → Y ⊂ Y ⊔ Z. Consider the metric space A = (X⊔Y )⊔f (Y ⊔Z) with the corresponding quotient metric ρ := ρ∼ . By Problem 2.15, the restrictions of the metric ρ from A to X and Z coincide with the original ones, thus (X, Z, A) is a realization of (X, Z). Also, by definition of ρ, for any x ∈ X and z ∈ Z we have ρ(x, z) ≤ μ(x, y)+ν(y, z). Let us put a = μH (X, Y ), b = νH (Y, Z), and choose some δ > 0. Then for any x ∈ X there exists y ∈ Y such that μ(x, y) < a + δ/2; also, for any y ∈ Y there exists z ∈ Z such that ν(y, z) < b + δ/2, thus for any x ∈ X there exists z ∈ Z and, similarly, for any z ∈ Z there exists x ∈ X such that ρ(x, z) ≤ μ(x, y) + ν(y, z) < a + b + δ, thus ρH (X, Z) ≤ a + b + δ. Since δ is arbitrary, we get ρH (X, Z) ≤ μH (X, Y ) + νH (Y, Z). Thus, ( ) dGH (X, Z) = inf dH (X, Z) ≤ inf ρH (X, Z) ≤ inf μH (X, Y ) + νH (Y, Z) = d∈𝒟(X,Z)

ρ

μ∈𝒟(X,Y ), ν∈𝒟(Y,Z)

=

inf μ∈𝒟(X,Y )

μH (X, Y ) +

inf ν∈𝒟(Y,Z)

νH (Y, Z) = dGH (X, Y ) + dGH (Y, Z),

where the second equality holds because the values we minimize over μ and ν are independent. Thus, we have shown that on every set of metric spaces, the function dGH is a generalized pseudometric. If the diameters of all spaces in the family are bounded by the same number, then dGH is a (finite) pseudometric. As we already noted, dGH is not a metric in general. However, if we restrict ourselves to compact metric spaces considered upto isometry, then dGH will already be a metric. Proposition 6.7. For compact metric spaces X and Y , it holds dGH (X, Y ) = 0 if and only if X and Y are isometric. Proof. If X is isometric to Y then we can take Z = X ′ = Y ′ = X, thus (Z, X ′ , Y ′ ) is a realization of (X, Y ), and dH (X ′ , Y ′ ) = 0. Now we prove the converse statement. Consider a sequence of admissible metrics dk ∈ 𝒟(X, Y ) such that dkH (X, Y ) < 1/k, k = 1, 2, . . ., then for each x ∈ X ⊂ X ⊔ Y there exists y ∈ Y ⊂ X ⊔ Y such that dk (x, y) < 1/k. We choose any such y and put Ik (x) = y. Thus, we have constructed a mapping Ik : X → Y (possibly discontinuous). Similarly, we define a mapping Jk : Y → X. From the triangle inequality it follows that for any x, x′ ∈ X and y, y ′ ∈ Y we have ( ( ) 2 ) 2 dk Ik (x), Ik (x′ ) < + dk (x, x′ ), dk Jk (y), Jk (y ′ ) < + dk (y, y ′ ), k k (6.2) ( ) 2 ( ) 2 k k d x, Jk ◦ Ik (x) < , d y, Ik ◦ Jk (y) < . k k

Chapter 6. Gromov–Hausdorff Distance.

68

Similarly to what was done in the proof of the Arzela–Ascoli theorem, we construct the “limit” mappings I : X → Y and J : Y → X. Namely, we choose in X a countable everywhere dense subset S = {x1 , x2 , . . .}; using the Cantor diagonal process, we construct a subsequence {Ik1 , Ik2 , . . .} such that for every i the sequence Ikp (xi ) converges to some I(xi ) ∈ Y (here we use the compactness of Y ). The mapping I is 1-Lipschitz because for every xi , xj ∈ S we have | | ( ) |I(xi )I(xj )| = dk I(xi ), I(xj ) ≤ 2 + dk (xi , xj ) = 2 + ∣xi xj ∣, k k | | | | and since k is arbitrary, we have I(xi )I(xj ) ≤ ∣xi xj ∣. Since Y is complete, by Lemma 3.48 we can extend I onto the whole X by continuity to a 1-Lipschitz mapping (we denote this mapping by the same symbol I). We proceed similarly with the sequence Jk . Passing to the limit in the last two inequalities (6.2) as k → ∞, we conclude that for any x ∈ X and y ∈ Y we have | | | | |x (J ◦ I)(x)| = 0, |y (I ◦ J)(y)| = 0. (6.3) The relations (6.3) indicates that I and J are mutually inverse bijections, and the 1-Lipschitz condition for the both I and J guarantees that them preserve the distance, i.e., I and J are isometric. The following exercise gives one possible generalization of Proposition 6.7. Problem 6.3. Prove that if a metric space X is compact, a metric space Y is complete, and dGH (X, Y ) = 0, then X is isometric to Y . Solution. By Theorem 2.24, it suffices to prove that Y is a totally bounded space. Choose an arbitrary ε > 0. Since dGH (X, Y ) = 0 then there exists an admissible metric d ∈ 𝒟(X, Y ) such that dH (X, Y ) < ε/3. Since X is compact, there exists a finite (ε/3)-net S ⊂ X. Since dH (X, Y ) < ε/3, for each s ∈ S there exists ys ∈ Y for which d(s, ys ) < ε/3, and for each y ∈ Y there exists x ∈ X such that ∣xy∣ < ε/3. Since S is an (ε/3)-net in X, there exists s ∈ S for which ∣xs∣ < ε/3. Thus ∣yys ∣ ≤ ∣yx∣ + ∣xs∣ + d(s, ys ) < ε, so the set {ys }s∈S is a finite ε-net in Y . Remark 6.8. Even if both metric spaces X and Y are boundedly compact, Proposition 6.7 may not hold. To describe the corresponding example, we denote by X and Y the subsets of the Euclidean plane ℝ2 constructed as follows. Each of these spaces is obtained from the abscissa by adding vertical (m, 0) and (m, ℓm ), m ∈ ℤ. In the | | segments between case of X we put ℓm = ∣ sin m∣, and in the case of Y let ℓm = |sin(m + 1/2)|. For the distances on X and Y we take the corresponding internal metrics. Notice that the spaces X and Y can be considered as graphs whose points (m, 0) correspond to the vertices of degree 3, and the points (m, ℓm ) to the vertices of degree 1. It is easy to see that for any ε > 0 there exists n ∈ ℕ such that for all m ∈ ℤ it holds || | | ||| || | sin(m + n)| − |sin(m + 1/2)|| < ε. Therefore, the subset X can be shifted along the abscissa by the vector (n, 0) in such a way that the resulting subset X ′ , which is isometric to X, satisfies dH (X ′ , Y ) < ε. Thus, dGH (X, Y { ) = 0. | |} { } On the other hand, it is easy to see that the sets ∣ sin m∣ m∈ℤ and |sin(m + 1/2)| do not intersect. Suppose m∈ℤ

that X and Y are isometric, and f : X → Y is an isometry. Since each isometry is a homeomorphism, then(f takes the) vertices of degree 1 and 3 to the vertices of the same degree, thus ( each segment |in ) X between (m, 0) and m, ∣ sin m∣ is mapped by f onto a segment of Y between some (n, 0) and n, ∣ sin(n + 1/2)| . However, since f is an isometry, it has to preserve the lengths of these segments, a contradiction. As follows from Theorem 6.4, the Gromov–Hausdorff distance between X and Y measures the least “discrepancy” for all possible “alignments” of these spaces inside metric spaces constructed on X ⊔ Y . A natural question arises: can all these alignments, perhaps for some special classes of metric spaces, be realized inside the same ambient metric space? The following result answers the question for the class of separable spaces. Consider the metric space ℓ∞ of all bounded sequences introduced in Example 2.10. Recall that, by Theorem 2.26, each separable metric space is isometrically embedded into ℓ∞ . Proposition 6.9. Let X and Y be separable metric spaces. Then ( ) dGH (X, Y ) = inf dℓH∞ φ(X), ψ(Y ) , where the infimum is taken over all isometric embeddings φ : X → ℓ∞ and ψ : Y → ℓ∞ .

Chapter 6. Gromov–Hausdorff Distance.

69

Proof. The space X ⊔ Y with an admissible metric d ∈ 𝒟(X, Y ) is also separable, therefore, by Theorem 2.26, it can be isometrically embedded into ℓ∞ . For specific calculations of the Gromov–Hausdorff distance, other equivalent definitions of this distance are useful. Recall that a relation between sets X and Y is a subset of the Cartesian product X × Y . Recall that 𝒫0 (X × Y ) denotes the set of all nonempty subsets of X × Y , i.e., the set of all nonempty relations between X and Y . Similarly to the case { of mappings, for} each σ ∈ 𝒫0 (X × Y ) and for { every x ∈ X and} y ∈ Y , there are defined the image σ(x) := y ∈ Y : (x, y) ∈ σ and the preimage σ −1 (y) = x ∈ X : (x, y) ∈ σ . Also, for A ⊂ X and B ⊂ Y their image and preimage are defined as the union of the images and preimages of their elements, respectively. Let πX : X × Y → X and πY : X × Y → Y be the canonical projections πX (x, y) = x and πY (x, y) = y. We denote in the same way the restrictions of these mappings to each relation σ ⊂ X × Y . A relation R between X and Y is called a correspondence if the restrictions of the canonical projections πX and πY to R are surjective. In other words, for every x ∈ X there exists y ∈ Y , and for every y ∈ Y there exists x ∈ X, such that (x, y) ∈ R. Thus, the correspondence can be considered as a surjective multivalued mapping. The set of all correspondences between X and Y is denoted by ℛ(X, Y ). If X and Y are metric spaces, then for each relation σ ∈ 𝒫0 (X × Y ) we define its distortion dis σ as follows {| } | dis σ = sup |∣xx′ ∣ − ∣yy ′ ∣| : (x, y), (x′ , y ′ ) ∈ σ . Problem 6.4. Prove that for any σ1 , σ2 ∈ 𝒫0 (X × Y ) such that σ1 ⊂ σ2 , we have dis σ1 ≤ dis σ2 . Problem 6.5. Prove that for R ∈ ℛ(X, Y ) it holds dis R = 0 if and only if R is an isometry. The next two constructions establish a link between correspondences from ℛ(X, Y ) and admissible metrics on X ⊔Y. To start with, we consider an arbitrary admissible metric and use it to construct a specific correspondence. Let ρ ∈ 𝒟(X, Y ) be an arbitrary admissible metric for metric spaces X and Y , and suppose that ρH (X, Y ) < ∞. Choose arbitrary r ≥ ρH (X, Y ) for which the set Rrρ = {(x, y) : ρ(x, y) ≤ r} is a correspondence between X and Y . Notice that we always can take arbitrary r > ρH (X, Y ). Sometimes, for example, for compact X and Y , we can take r = ρH (X, Y ). Proposition 6.10. Under above notations, it holds dis Rrρ ≤ 2r. Proof. For any (x, y), (x′ , y ′ ) ∈ Rrρ we have | ′ | | | |∣xx ∣ − ∣yy ′ ∣| = |ρ(x, x′ ) − ρ(y, y ′ )| ≤ ρ(x, y) + ρ(x′ , y ′ ) ≤ 2r. It remains to pass to supremum in definition of dis Rrρ . Now we start from a correspondence and construct a specific admissible metric. Consider arbitrary correspondence R ∈ ℛ(X, Y ). Suppose that dis R < ∞. Extend the metrics of X and Y upto a symmetric function ρR defined on pairs of points from X ⊔ Y : for x ∈ X and y ∈ Y put { } 1 ρR (x, y) = ρR (y, x) = inf ∣xx′ ∣ + ∣yy ′ ∣ + dis R : (x′ , y ′ ) ∈ R . 2 Proposition 6.11. Under above notations, let dis R > 0, then ρR is an admissible metric, and ρR H (X, Y ) =

1 2

dis R.

Proof. To simplify notation, we put ρ := ρR . Since dis R > 0 then ρ is positively defined. Now we verify the triangle inequality. It suffices to consider the case x1 , x2 ∈ X, y ∈ Y and to prove the inequalities for the triangle x1 x2 y. Due to symmetry reasons, we prove only two inequalities ρ(x1 , y) + ρ(x2 , y) ≥ ∣x1 x2 ∣ and ρ(x2 , y) + ∣x1 x2 ∣ ≥ ρ(x1 , y). Let us start from the first one. Choose arbitrary (x′1 , y1′ ), (x′2 , y2′ ) ∈ R, then ∣x1 x′1 ∣ + ∣y1′ y∣ + ∣x2 x′2 ∣ + ∣y2′ y∣ + dis R ≥ ∣x1 x2 ∣ − ∣x′1 x′2 ∣ + ∣y1′ y2′ ∣ + dis R ≥ ∣x1 x2 ∣, where the last inequality holds because ∣y1′ y2′ ∣ − ∣x′1 x′2 ∣ ≥ − dis R by definition. Thus, } { ρ(x1 , y) + ρ(x2 , y) = inf ∣x1 x′1 ∣ + ∣y1′ y∣ + ∣x2 x′2 ∣ + ∣y2′ y∣ + dis R : (x′1 , y1′ ), (x′2 , y2′ ) ∈ R ≥ ∣x1 x2 ∣.

Chapter 6. Gromov–Hausdorff Distance.

70

Now we prove the second inequality. We have { } 1 ρ(x2 , y) + ∣x1 x2 ∣ = inf ∣x2 x′2 ∣ + ∣y2′ y∣ + dis R + ∣x1 x2 ∣ : (x′2 , y2′ ) ∈ R ≥ 2 { } 1 ≥ inf ∣x1 x′2 ∣ + ∣y2′ y∣ + dis R : (x′2 , y2′ ) ∈ R = ρ(x1 , y). 2 It remains to prove that ρH (X, Y ) = 12 dis R. Since R ∈ ℛ(X, Y ) then for each x ∈ X there exists y ∈ Y , and for each y ∈ Y there exists x ∈ X, such that (x, y) ∈ R and, thus, ρ(x, y) = 12 dis R, therefore ρH (X, Y ) ≤ 12 dis R. Besides that, for any x ∈ X and y ∈ Y it holds ρ(x, y) ≥ 12 dis R, thus ρH (X, Y ) ≥ 21 dis R, and the proof is complete. Theorem 6.12. For any metric spaces X and Y we have dGH (X, Y ) =

{ } 1 inf dis R : R ∈ ℛ(X, Y ) . 2

Proof. Denote by I(X, Y ) the right-hand side of the equality from the statement of the theorem. We prove first that dGH (X, Y ) ≥ I(X, Y ). If dGH (X, Y ) = ∞, then the inequality holds. Now, suppose that dGH (X, Y ) < ∞. Choose an arbitrary r > dGH (X, Y ), then, by Theorem 6.4, there exists ρ ∈ 𝒟(X, Y ) for which ρH (X, Y ) < r. Consider the correspondence Rrρ constructed above. Then, by Proposition 6.10, we have dis Rrρ ≤ 2r, thus I(X, Y ) ≤ r. Since r is arbitrary, we get I(X, Y ) ≤ dGH (X, Y ). We now prove that dGH (X, Y ) ≤ I(X, Y ). If I(X, Y ) = ∞, then the inequality holds. Now, suppose that I(X, Y ) < ∞. If there exists R ∈ ℛ(X, Y ) such that dis R = 0, then, by Problem 6.5, we have dGH (X, Y ) = 0, and the equality holds. Now suppose that for all R ∈ ℛ(X, Y ) we have dis R > 0. Choose an arbitrary R ∈ ℛ(X, Y ) such that dis R < ∞ and consider ρR ∈ 𝒟(X, Y ) constructed above. By Proposition 6.11, we 1 have ρR H (X, Y ) = 2 dis R, thus { } dGH (X, Y ) = inf ρH (X, Y ) ≤ inf ρR H (X, Y ) : R ∈ ℛ(X, Y ), dis R < ∞ = I(X, Y ). ρ∈𝒟(X,Y )

Recall that for a relation σ between X and Y , and a relation θ between Y and Z, the composition θ ◦ σ is defined by the following condition: (x, z) ∈ θ ◦ σ if and only if there exists y ∈ Y such that (x, y) ∈ σ and (y, z) ∈ θ. Problem 6.6. Let X, Y , and Z be metric spaces, R1 ∈ ℛ(X, Y ), R2 ∈ ℛ(Y, Z). Prove that (1) R2 ◦ R1 ∈ ℛ(X, Z); (2) dis(R2 ◦ R1 ) ≤ dis R1 + dis R2 ; (3) derive from the previous items the triangle inequality for the Gromov–Hausdorff distance. Solution. (1) Let x be an arbitrary point in X. Since R1 is a correspondence, there exists y ∈ Y such that (x, y) ∈ R1 . Since R2 is a correspondence, there exists z ∈ Y such that (y, z) ∈ R2 . Therefore, (x, z) ∈ R2 ◦ R1 . Similarly, for any z ∈ Z there exists x ∈ X such that (x, z) ∈ R2 ◦ R1 . Thus, R2 ◦ R1 is a correspondence. (2) For any (x, z), (x′ , z ′ ) ∈ R2 ◦ R1 there exist y, y ′ ∈ Y such that (x, y), (x′ , y ′ ) ∈ R1 and (y, z), (y ′ , z ′ ) ∈ R2 , hence | ′ | | | | | | | |∣zz ∣ − ∣xx′ ∣| = |∣zz ′ ∣ − ∣yy ′ ∣ + ∣yy ′ ∣ − ∣xx′ ∣| ≤ |∣zz ′ ∣ − ∣yy ′ ∣| + |∣yy ′ ∣ − ∣xx′ ∣|, and, passing to the suprema, we obtain what is required. (3) Denote by R(X, Y, Z) the subset of R(X, Z) consisting of all correspondences represented in the form R2 ◦ R1 , where R1 ∈ R(X, Y ) and R2 ∈ R(Y, Z). Then, by Theorem 6.12 and the previous item, we have dGH (X, Z) =

1 1 1 inf dis R ≤ inf dis R = inf dis(R2 ◦ R1 ) ≤ 2 R∈ℛ(X,Z) 2 R∈ℛ(X,Y,Z) 2 R1 ∈ℛ(X,Y ) R2 ∈ℛ(Y,Z)

1 1 1 ≤ (dis R1 + dis R2 ) ≤ inf inf dis R1 + inf dis R2 = dGH (X, Y ) + dGH (Y, Z), 2 R1 ∈ℛ(X,Y ) 2 R1 ∈ℛ(X,Y ) 2 R2 ∈ℛ(Y,Z) R2 ∈ℛ(Y,Z)

as required.

6.1. Irreducible correspondences

71

We show one more approach to the study of the Gromov–Hausdorff distance. Definition 6.13. A mapping f : X → Y of metric spaces is called an ε-isometry if dis f < ε and f (X) is an ε-net in Y. Theorem 6.14. Let X and Y be arbitrary metric spaces and ε > 0. Then (1) if dGH (X, Y ) < ε then there exists a (2ε)-isometry f : X → Y ; (2) if there exists an ε-isometry f : X → Y then it holds dGH (X, Y ) < 2ε. Proof. (1) By Theorem 6.12, there exists a relation R ∈ ℛ(X, Y ) such that dis R < 2ε. For each x ∈ X we choose an arbitrary y ∈ R(x) and put f (x) = y. Thus, we determined a mapping f : X → Y , and since f ⊂ R (here we identify f with its graph), by Problem 6.4 it holds dis f ≤ dis R < 2ε. Now we choose an arbitrary y ′ ∈ Y , an arbitrary ′ x ∈ R−1 (y ′ ), and let y = f (x). Since { diam R(x) ≤ dis R } < 2ε, it follows that ∣yy ∣ < 2ε, therefore f (X) is a (2ε)-net. (2) Consider the relation R = (x, y) : ∣f (x)y∣ < ε . Since f (X) is an ε-net, R is a correspondence. To estimate the distortion of R, we choose arbitrary (x, y), (x′ , y ′ ) ∈ R, then | | ′ | | | || || | | | | | |∣xx ∣ − ∣yy ′ ∣| ≤ ||∣xx′ ∣ − |f (x)f (x′ )||| + |||f (x)f (x′ )| − ∣yy ′ ∣|| ≤ ε + |f (x)y | + |y ′ f (x′ )| < 3ε, therefore, dis R ≤ 3ε < 4ε and dGH (X, Y ) ≤

6.1

1 2

dis R < 2ε.

Irreducible correspondences

For arbitrary nonempty sets X and Y , a correspondence R ∈ ℛ(X, Y ) is called irreducible if it is a minimal element of the set ℛ(X, Y ) w.r.t. the order given by the inclusion relation. The set of all irreducible correspondences between X and Y is denoted by ℛ0 (X, Y ). The following result is evident. Proposition 6.15. A correspondence R ∈ ℛ(X, Y ) is irreducible if and only if for any (x, y) ∈ R it holds { } min #R(x), #R−1 (y) = 1. Theorem 6.16. Let X, Y be arbitrary nonempty sets. Then for every R ∈ ℛ(X, Y ) there exists R0 ∈ ℛ0 (X, Y ) such that R0 ⊂ R. In particular, ℛ0 (X, Y ) ̸= ∅. Remark 6.17. One could use the standard technique based on the Zorn lemma, but then we need to guarantee that each chain R1 ⊃ R2 ⊃ · · · has a lower bound, i.e., there exists a correspondence that belongs to all Ri . However, { } this, generally speaking, is not true. As an example, consider X = Y = ℕ and set Rk = (i, j) : max(i, j) ≥ k . It is clear that every Rk belongs to ℛ(X, Y ), and that these Rk form a decreasing chain. However, ∩Rk = ∅, since for any i and j there is k for which i < k and j < k, therefore (i, j) ̸∈ Rk . Proof of Theorem 6.16. For each x ∈ X, choose an arbitrary y ∈ R(x) and define a mapping f : X → Y by setting y = f (x). Let us note that f ⊂ R. Put Y1 = f (X) and Y2 = Y \ Y1 . Now for each y ∈ Y2 we choose an arbitrary x ∈ R−1 (y) and define a mapping g : Y1 → X by setting x = g(y). In this case we have g −1 ⊂ R. Put X2 = g(Y2 ) and X1 = X \ X2 . Let Y3 = f (X2 ). It is clear that Y3 ⊂ Y1 . Using f and g, we define another relation: h = f ∪ g −1 . Lemma 6.18. We have h ∈ ℛ(X, Y ).

( ) Proof. By the definition of f , for each x ∈ X it holds x, f (x) ∈ f ⊂ h. Now consider an arbitrary y ∈ Y . If y ∈ Y1 , then, since Y1 =( im f , there exists x ∈ X such that y = f (x), therefore ) (x, y) ∈ f ⊂ h. If y ∈ Y2 , then, by the definition of g, we have g(y), y ∈ g −1 ⊂ h. Now we define the relation R0 by removing from h some (x, y) for each y ∈ Y3 according to the following rule: ( ) (1) if h−1 (y) ∩ X1 ̸= ∅, then we remove h−1 (y) ∩ X2 × {y}; (2) if h−1 (y) ∩ X1 = ∅, i.e., h−1 (y) ⊂ X2 , then we remove all the elements from h−1 (y) × {y}, except any one.

6.1. Irreducible correspondences

72

Lemma 6.19. We have R0 ∈ ℛ(X, Y ). Proof. For every y ∈ Y \ Y3 we remove nothing, therefore for such y there always exists x ∈ X such that (x, y) ∈ R0 . Now let y ∈ Y3 . If h−1 (y) ∩ X1 ̸= ∅, then we do not remove (x, y) with x ∈ h−1 (y) ∩ X1 , thus (x, y) ∈ R0 for such x. If h−1 (y) ∩ X1 = ∅, then we do not remove some x ∈ h−1 (y), thus (x, y) ∈ R0 for such x. Now let us deal with x ∈ X. If x ∈ X1 , then we remove nothing, therefore (x, y) ∈ R0 for some y ∈ Y . If x ∈ X2 , then, since X2 = im g, there exists y ∈ Y2 for which x = g(y), but we remove nothing for such y, thus (x, y) ∈ R0 . Lemma 6.20. We have R0 ∈ ℛ0 (X, Y ). Proof. It is sufficient to show that for each pair((x, y) ∈) R0 either x, or y do not belong to other pairs. If y ∈ Y2 , then y is included in the g(y), y ∈ R0 . If y ∈ Y1 \ Y3 , then y = f (x) for some x ∈ X1 , however ( only pair ) such x is included in the only pair x, f (x) ∈ R0 . Finally, now let y ∈ Y3 . If h−1 (y) ∩ X1 ̸= ∅, then we removed (all the pairs of the form (x′ , y), x′ ∈ X2 , so x ∈ X1 , ) however, each such x is included in exactly one pair, namely, in x, f (x) . If h−1 (y) ∩ X1 = ∅, then we removed all the pairs (x′ , y), x′ ∈ h−1 (y) ⊂ X2 , except some one, so y is included in exactly one such pair. This lemma completes the proof of the theorem. Theorems 6.16 and 6.12, together with Problem 6.4, implies Corollary 6.21. For any metric spaces X and Y we have dGH (X, Y ) =

{ } 1 inf dis R ∣ R ∈ ℛ0 (X, Y ) . 2

Now we give another useful description of irreducible. Proposition 6.22. For any nonempty sets X, Y , and each R ∈ ℛ0 (X, Y ), there exist and unique partitions RX = {Xi }{i∈I and}RY = {Yi }i∈I{ of the } sets X and Y , respectively, such that R = ∪i∈I Xi × Yi . Moreover, RX = ∪y∈Y R−1 (y) , RY := ∪x∈X R(x) , {Xi × Yi }i∈I = ∪(x,y)∈R {R−1 (y) × R(x)}, and for each i it holds min{#Xi , #Yi } = 1. Conversely, each set R = ∪i∈I Xi × Yi , where {Xi }i∈I and {Yi }i∈I are partitions of nonempty sets X and Y , respectively, such that for each i it holds min{#Xi , #Yi } = 1, is an irreducible correspondence between X and Y . { } { } Proof. First, let R ∈ ℛ0 (X, Y ). Put RX = ∪y∈Y R−1 (y) , RY = ∪x∈X R(x) , and show that RX and RY are partitions. We show it for RY (for RX the proof is the same). Notice first that RX and RY are covers because R is a correspondence. Now, suppose to the contrary that RY is not a partition, i.e., there exist two different elements of RY , say R(x) and R(x′ ), such that R(x) ∩ R(x′ ) ̸= ∅. Since R(x) ̸= R(x′ ), one of them, say R(x), contains two different elements y and y ′ , such that one of them, say y, belongs to R(x) ∩ R(x′ ). This means that for the pair (x, y) it holds #R(x) ≥ 2 and R−1 (y) ≥ 2, a contradiction to Proposition 6.15. Now, we put RX = {Xi }i∈I . Note that for any x, x′ ∈ Xi we have R(x) = R(x′ ). Indeed, by definition, Xi = R−1 (y) for some y ∈ Y , thus R(x) ∩ R(x′ ) ̸= ∅ and, therefore, R(x) = R(x′ ) because RY is a partition. Choose arbitrary i ∈ I, x ∈ Xi , and put Yi = R(x). Then, this definition is correct (does not depend on the choice of x). We show that the mapping φ : Xi ⊦→ Yi is a bijection between RX and RY . If φ is not injective, then there exist x, x′ ∈ X lying in different elements of the partition RX for which R(x) = R(x′ ). Thus x, x′ ∈ R−1 (y) ∈ RX for y ∈ R(x), a contradiction. −1 Finally, φ is surjective, since for any Yi , y ∈ Yi , the set R (y) is) an element of the partition RX . Choose an ( −1 −1 arbitrary x ∈ R (y), then R(x) ∈ RY contains y, therefore φ R (y) = Yi . Since for any x, x′ ∈ Xi we have R(x) = R(x′ ) = Yi , then Xi × Yi ⊂ R. On the other hand, since RX is a partition of X, for any x ∈ X there exists Xi ∈ RX such that x ∈ Xi , therefore each (x, y) ∈ R is contained in some Xi × Yi . To prove the equality min{#Xi , #Yi } = 1, let us note that for any (x, y) ∈ R, x ∈ Xi , and y ∈ Yi , we have Xi = R−1 (y) and Yi = R(x), thus, {Xi × Yi }i∈I = ∪(x,y)∈R {R−1 (y) × R(x)}. It suffices to apply Proposition 6.15. The uniqueness of the partitions is the standard fact from the set theory. The converse is trivial.

6.1. Irreducible correspondences

73

Let X be an arbitrary set different from singleton, and m a cardinal number, 2 ≤ m ≤ #X. By 𝒟m (X) we denote the family of all possible partitions of the set X into m nonempty subsets. Now let X be a metric space. Then for each D = {Xi }i∈I ∈ 𝒟m (X) we put diam D = sup diam Xi . i∈I

{ } Further, for {any nonempty A, B }⊂ X, we have already defined ∣AB∣ as inf ∣ab∣ : (a, b) ∈ A × B . We also need ∣AB∣′ := sup ∣ab∣ : (a, b) ∈ A × B . Further, for each D = {Xi }i∈I ∈ 𝒟m (X) we put { } { } α(D) = inf ∣Xi Xj ∣ : i ̸= j and β(D) = sup ∣Xi Xj ∣′ : i ̸= j . Also notice that ∣Xi Xi ∣ = 0, ∣Xi Xi ∣′ = diam Xi and hence diam D = supi∈I ∣Xi Xi ∣′ . The next result follows easily from the definition of distortion, as well as from Proposition 6.22. Proposition 6.23. Let X and Y be arbitrary metric spaces, DX = {Xi }i∈I , DY = {Yi }i∈I , #I ≥ 2, be some partitions of the spaces X and Y , respectively, and R = ∪i∈I Xi × Yi ∈ ℛ(X, Y ). Then { } dis R = sup ∣Xi Xj ∣′ − ∣Yi Yj ∣, ∣Yi Yj ∣′ − ∣Xi Xj ∣ : i, j ∈ I = { } = sup diam DX , diam DY , ∣Xi Xj ∣′ − ∣Yi Yj ∣, ∣Yi Yj ∣′ − ∣Xi Xj ∣ : i, j ∈ I, i ̸= j ≤ { } ≤ max diam DX , diam DY , β(DX ) − α(DY ), β(DY ) − α(DX ) . In particular, if R ∈ ℛ0 (X, Y ), then in the previous formula we can take RX and RY defined in Proposition 6.22 for DX and DY , respectively. It will also be convenient for us to represent a relation σ ∈ 𝒫0 (X × Y ) as a bipartite graph. Then the degree deg of each vertex is defined: degσ (x) = #σ(x) and degσ (y) = #σ −1 (y) . Problem 6.7. Let R ∈ ℛ0 (X, Y ), x ∈ X, degR (x) > 1. Prove that for each x′ ∈ X, x′ ̸= x, it holds R(x) ∩ R(x′ ) = ∅. Problem 6.7 immediately implies Corollary 6.24. Let #X ≥ 2 and #Y ≥ 2, then for any R ∈ ℛ0 (X, Y ) there is no x ∈ X such that {x} × Y ⊂ R. Example 6.25. Let X = {x1 , x2 } and Y = {y1 , y2 }, then, by Corollary 6.24, the set ℛ0 (X, Y ) consists only of bijections, therefore | | 2dGH (X, Y ) = |∣x1 x2 ∣ − ∣y1 y2 ∣|. Thus, the set of isometric classes of two-point metric spaces endowed with the Gromov–Hausdorff distance is a metric space isometric to the open ray x > 0 of the real line ℝ with coordinate x. Example 6.26. Consider two three-point metric spaces X = {x1 , x2 , x3 } and Y = {y1 , y2 , y3 }. Put ρij = ∣xi xj ∣ =: ρk and νij = ∣yi yj ∣ =: νk , where {i, j, k} = {1, 2, 3}. Without loss of generality, we assume that ρ1 ≤ ρ2 ≤ ρ3 and ν1 ≤ ν2 ≤ ν3 . We will show that dGH (X, Y ) =

{ } 1 max ∣ρ1 − ν1 ∣, ∣ρ2 − ν2 ∣, ∣ρ3 − ν3 ∣ . 2

By Corollary 6.21, it suffices to describe all irreducible correspondences Rm ∈ ℛ0 (X, Y ). We will use Proposition 6.22. There are three types of partitions of a three-point space: (1) into one-point subsets, (2) into one two-point subset and one-point subset, and (3) into one three-point subset. By Proposition 6.22, an irreducible correspondence defines a bijection between partitions, therefore both spaces have to be partitioned in the same way. Case (3) is not realized by Corollary 6.24, in case (1) we have a bijection, and in case (2) one-point and two-point subsets have to correspond to each other. Thus, we have two types of correspondences: if {i, j, k} = {p, q, r} = {1, 2, 3}, then { } (1) R1 = (xi , yp ), (xj , yq ), (xk , yr ) , { } (2) R2 = (xi , yq ), (xi , yr ), (xj , yp ), (xk , yp ) .

6.2. A few more examples

74

We have { } dis R1 = max ∣ρij − νpq ∣, ∣ρik − νpr ∣, ∣ρjk − νqr ∣ , { } dis R2 = max ∣ρij − νpq ∣, ∣ρik − νpr ∣, ρjk , νqr , ∣ρij − νpr ∣, ∣ρik − νpq ∣ . Note that max{ρjk , νqr } ≥ ∣ρjk − νqr ∣, therefore dis R2 ≥ dis R1 , so it is enough to consider only the bijections R1 . We { also note } that each bijection R1 defines a bijection between the sets {ρa } and {νa }. We show that the bijection ∪3i=1 (ρi , νi ) is optimal, i.e., if we replace this |bijection with any other bijection ψ, we cannot get a lower value than | M = maxi ∣ρi − νi ∣. Put M ′ = maxi |ρi − ψ(ρi )|. We need to show that M ′ ≥ M . Let M = ∣ρ1 − ν1 ∣, and suppose, without loss of generality, that ρ1 ≤ ν1 , thus M = ν1 − ρ1 . Since ψ(ρ1 ) ≥ ν1 , then M ′ ≥ ψ(ρ1 ) − ρ1 ≥ M . Similarly we deal with the case M = ∣ρ3 − ν3 ∣. Now, let M = ∣ρ2 − ν2 ∣, and ρ2 ≤ ν2 , i.e., M = ν2 − ρ2 . If ψ(ρ2 ) ̸= ν1 , them M ′ ≥ M . Suppose now that ψ(ρ2 ) = ν1 , then ψ(ρ1 ) equal either ν2 , or ν3 . Thus M ′ ≥ ψ(ρ1 ) − ρ1 ≥ ν2 − ρ1 ≥ ν2 − ρ2 = M . Thus, the set of isometry classes of three-point metric spaces endowed with the Gromov–Hausdorff distance is a metric space isometric to the polyhedral cone { } (x, y, z) : 0 < x ≤ y ≤ z ≤ x + y ‖ ‖ in the space ℝ3∞ , where the latter denotes the space ℝ3 with the metric generated by the norm ‖(x, y, z)‖∞ = { } max ∣x∣, ∣y∣, ∣z∣ . The isometry is given by the formula {x1 , x2 , x3 } ⊦→

6.2

1 (ρ1 , ρ2 , ρ3 ). 2

A few more examples

The following statement immediately follows from the definition of the Gromov–Hausdorff distance and Item (4) of Problem 5.1. Example 6.27. Let Y be an arbitrary ε-net of a metric space X. Then dGH (X, Y ) ≤ dH (X, Y ) ≤ ε. Thus, every compact metric space is approximated (according to the Gromov-Hausdorff metric) with any accuracy by finite metric spaces. Example 6.28. Denote by Δ1 a one-point metric space. Then for any metric space X we have dGH (Δ1 , X) =

1 diam X. 2

Indeed, ℛ(Δ1 , X) consists of exactly one correspondence R, namely, of R = Δ1 × X, thus, dis R = diam X, and it remains to use Theorem 6.12. Example 6.29. Let X and Y be some metric spaces, and the diameter of one of them is finite. Then dGH (X, Y ) ≥

1 ∣ diam X − diam Y ∣. 2

Indeed, if diam X < ∞ and diam Y = ∞, then for any ρ ∈ 𝒟(X, Y ) we have ρH (X, Y ) = ∞, otherwise Y ⊂ Ur (X) for some finite r and, therefore, diam Y < ∞. Now let the both diam X and diam Y are finite. Then it suffices to use the triangle inequality (Proposition 6.6) for the triple X, Y , Δ1 , and Example 6.28. Example 6.30. Let X and Y be some metric spaces, then dGH (X, Y ) ≤

1 max{diam X, diam Y }, 2

in particular, if X and Y are bounded metric spaces, then dGH (X, Y ) < ∞. Indeed, if the diameter of one of the spaces X, Y is infinite, then the inequality holds. If both spaces are singletons, then everything is also obvious. Now let 0 < d := max{diam X, diam Y } < ∞. Then for R = X × Y ∈ ℛ(X, Y ) we 1 have dis R = d, thus, by Proposition 6.11, it holds ρR H (X, Y ) = 2 dis R = d/2, therefore, dGH (X, Y ) ≤ d/2.

6.3. GH-convergence and GH-limits

75

Recall that for an arbitrary metric space X and a real number λ > 0, by λX we denote the metric space obtained from X by multiplying all distances by λ. For λ = 0 we set λX = Δ1 . Example 6.31. For any bounded metric space X and any λ ≥ 0, μ ≥ 0, we have dGH (λX, μX) = 12 ∣λ − μ∣ diam X, in particularly, for any 0 ≤ a < b the curve γ(t) := t X, t ∈ [a, b], is shortest. Indeed, for the identity correspondence R ∈ ℛ(λX, μX) we have dis R = ∣λ − μ∣ diam X, hence dGH (λX, μX) ≤ 1 ∣λ − μ∣ diam X by Theorem 6.12. On the other hand, by Example 6.29 we have 2 dGH (λX, μX) ≥

| 1 1 || diam(λX) − diam(μX)| = ∣λ − μ∣ diam X. 2 2

It remains to note that ∣γ∣ =

sup

n ∑

dGH (ti−1 X, ti X) =

a=t0 0. Then for each correspondence R ∈ ℛ(X, Y ) and the correspondence Rλ ∈ ℛ(λX, λY ), which coincides with R as a set, we have dis Rλ = λ dis R. It remains to use Theorem 6.12. We now verify that under the condition dGH (X, Y ) < ∞ the equality holds for λ = 0 as well. With this λ, we have λX = λY = Δ1 , therefore dGH (λX, λY ) = 0. Since dGH (X, Y ) < ∞, the value λ dGH (X, Y ) vanishes too.

6.3

GH-convergence and GH-limits

For a sequence Xk of metric spaces that converges w.r.t. the Gromov–Hausdorff distance to a metric space X, we say GH for short that the sequence GH-converges and write this as Xk −−→ X. We call X the GH-limit and denote it by GH-limk→∞ Xk . Let us present a few simple observations. (1) Let X = GH-limk→∞ Xk , Y = GH-limk→∞ Xk , one of X and Y be compact, and the other one be complete, then X and Y are isometric. Indeed, due to the triangle inequality, we have dGH (X, Y ) = 0. It remains to use Problem 6.3. (2) Hausdorff convergence implies GH-convergence. (3) Each compact metric space is a GH-limit of finite metric spaces (of its finite 1/k-nets). GH

Theorem 6.33. Let Xk −−→ Y . If all Xk , starting from some k, have one of the properties listed below, then this property is inherited by the space Y : (1) the diameter equals infinity; (2) the diameter is bounded by a certain number D (in fact, diam Xk → diam Y ); (3) separability; (4) total boundedness; (5) if Y is complete, then bounded compactness; (6) if Y is complete, then the intrinsicness. Proof. Without loss of generality, we assume that each of there properties holds for all k, and that dGH (Xk , Y ) < 1/k. By Theorem 6.12, for each k ∈ ℕ there exists Rk ∈ ℛ(Xk , Y ) such that dis Rk < 2/k. In addition, by Theorem 6.14, for each k ∈ ℕ there exist (2/k)-isometries fk : Xk → Y and gk : Y → Xk . (1) If diam Y < ∞, then, by Example 6.29, we have dGH (Xk , Y ) = ∞, a contradiction.

6.3. GH-convergence and GH-limits

76

| | ||| | (2) Since gk is (2/k)-isometry, then dis gk < 2/k and, hence, for any y, y ′ ∈ Y we have |∣yy ′ ∣ − |gk (y)gk (y ′ )|| < 2/k, thus | | ∣yy ′ ∣ < |gk (y)gk (y ′ )| + 2/k ≤ diam Xk + 2/k. Since y, y ′ ∈ Y are arbitrary, we get diam Y ≤ diam Xk + 2/k. Swapping gk and fk , we get diam Xk ≤ diam Y + 2/k, i.e., ∣ diam Xk − diam Y ∣ < 2/k, therefore, diam Xk → diam Y and, thus, diam Y ≤ D. (3) In each Xk we choose a| countable | everywhere dense subset Sk . Since fk is (2/k)-isometry, for any y ∈ Y there exists xk ∈ Xk such that |yfk (xk )| < 2/k. Since Sk is everywhere dense in Xk , there | exists |sk ∈ Sk such that ∣xk sk ∣ < 1/k. Now the condition dis fk < 2/k implies that ∣fk (sk )fk (xk )∣ < 3/k, therefore, |fk (sk )y | < 5/k, and hence fk (Sk ) is a countable (5/k)-net in Y . It remains to note that ∪∞ k=1 fk (Sk ) is a countable everywhere dense subset of Y. (4) In each Xk we choose a finite (1/k)-net Sk . Repeating word-by-word the reasoning from Item (3), we obtain that fk (Sk ) is a finite (5/k)-net. Since k is arbitrary, we get what is required. (5) Choose an arbitrary closed bounded Z ⊂ Y . We have to show that it is compact. Since dis fk < 2/k, then fk−1 (Z) is bounded. Let W be the closure of fk−1 (Z), then W is compact, in particular, it contains a finite (1/k)-net Sk . As was discussed above, fk (Sk ) is a finite (3/k)-net in f (W ) ⊃ Z and, thus, there exists a finite (6/k)-net Zk in Z. Since k is arbitrary, we get that Z is totally bounded. Since Z is closed, it is compact. (6) By Theorem 3.51, it suffices to show that for any y, y ′ ∈ Y and any ε > 0 there exists an ε-midpoint between y and y ′ . Choose k such that 4/k < ε, and arbitrary xk ∈ Rk−1 (y), x′k ∈ Rk−1 (y ′ ). Since Xk is intrinsic, there exists a (1/k)-midpoint s between xk |and x′k . Choose|arbitrary| w ∈ Rk (s).| Since s| is (1/k)-midpoint, and | | | | dis Rk < |2/k, we ′ | | |∣x′ s∣−∣xk x′ ∣/2| < 1/k, |∣xk s∣−∣yw∣| < 2/k, |∣x′ s∣−∣y ′ w∣| < 2/k, |∣xk x′ ∣−∣yy∣| < 2/k, have |∣xk s∣−∣xk xk ∣/2 < 1/k, k k k k | | | thus |∣yw∣ − ∣yy ′ ∣/2| < 4/k < ε and |∣y ′ w∣ − ∣yy ′ ∣/2| < 4/k < ε, what is required. GH

Corollary 6.34. Let Xk −−→ Y , all Xk are boundedly compact and intrinsic (such Xk are strictly intrinsic by Corollary 3.45), Y is complete. Then Y is strictly intrinsic. Moreover, if we abandon the bounded compactness property, then Y may be not strictly intrinsic, namely, the GH-limit of strictly intrinsic metric spaces may be not strictly intrinsic. Proof. By Item (5) and Item (6) of Theorem6.33, the space Y is boundedly compact and intrinsic. It remains to apply Corollary 3.45. To verify the second statement of the corollary, let us consider a metric graph Y obtained by gluing the ends of the segments [0, 1 + 1/n], n ∈ ℕ (all 0 are glued together, and all ends 1 + 1/n are also glued together). Then Y is complete, intrinsic, but not strictly intrinsic. For Xk we take the space obtained from Y by replacing the segment [0, 1+1/k] with the segment [0, 1]. Then all Xk are complete and strictly intrinsic, however, all they are not boundedly compact. Problem 6.8. Let X = {x1 , . . . , xn } be a finite metric space. Prove that a sequence Xk of metric spaces GH-converges to X if and only if for sufficiently large k there exist partitions {Xki }ni=1 ∈ 𝒟n (Xk ) such that ∣Xki Xkj ∣ → ∣xi xj ∣ for any 1 ≤ i, j ≤ n. Let dk be a sequence of metrics on a nonempty set X. We say that the metric spaces Xk = (X, dk ) uniformly converges to a metric space Y = (X, d) if the functions dk uniformly converges to d, i.e., if | | sup |dk (x, x′ ) − d(x, x′ )| → 0. x,x′ ∈X

Problem 6.9 (Uniform convergence). Prove that if metric spaces Xk uniformly converges to a metric space Y then GH Xk −−→ Y . Recall that for any Lipschitz mapping f : X → Y of metric spaces we defined the dilatation dil f as the minimal Lipschitz constant for f , see Section 2.2. Also, we defined bi-Lipschitz mapping between metric spaces as a bijective mapping such that it and its inverse as Lipschitz ones. Now we define the Lipschitz distance dL between metric spaces X and Y as the following value: ( ) dL (X, Y ) = inf log max{dil f, dil f −1 } , f : X→Y

where infimum is taken over all bi-Lipschitz mappings (if there is no such mappings then dL (X, Y ) = ∞).

6.3. GH-convergence and GH-limits

77

Problem 6.10 (Lipschitz convergence). Prove that if metric spaces Xk converges to a bounded metric space Y w.r.t. GH the Lipschitz distance, then Xk −−→ Y . Does it remain true without the boundedness assumption? Problem 6.11. Let the metric space Xk be obtained from the standard sphere S 2 ⊂ ℝ3 by removing a ball of radius GH 1/k. Prove that Xk −−→ S 2 . Prove that if we change S 2 to the circle S 1 , then the similar statement does not hold. In the both cases we consider the interior metrics.

References to Chapter 6 [1] Burago D., Burago Yu., Ivanov S. A Course in Metric Geometry. Graduate Studies in Mathematics, vol.33, A.M.S., Providence, RI, 2001. [2] Ghanaat P. “Gromov-Hausdorff distance and applications”. In: Summer school “Metric Geometry”, Les Diablerets, August 25–30, 2013, https://math.cuso.ch/fileadmin/math/document/gromov-hausdorff.pdf

78

Exercises to Chapter 6.

79

Exercises to Chapter 6 Exercise 6.1. Prove that for any metric spaces X and Y there exists a realization of (X, Y ). Exercise 6.2. Prove that for any metric spaces X and Y there exists at least one admissible metric, i.e., the set 𝒟(X, Y ) is not empty. Exercise 6.3. Prove that for arbitrary metric spaces X and Y , any σ1 , σ2 ∈ 𝒫0 (X × Y ) such that σ1 ⊂ σ2 , we have dis σ1 ≤ dis σ2 . Exercise 6.4. Prove that for arbitrary metric spaces X and Y , any R ∈ ℛ(X, Y ) it holds dis R = 0 if and only if R is an isometry. Exercise 6.5. Let X and Y be arbitrary sets, R ∈ ℛ0 (X, Y ), x ∈ X, degR (x) > 1. Prove that for each x′ ∈ X, x′ ̸= x, it holds R(x) ∩ R(x′ ) = ∅. Exercise 6.6. Let X = {x1 , . . . , xn } be a finite metric space. Prove that a sequence Xk of metric spaces GH-converges to X if and only if for sufficiently large k there exist partitions {Xki }ni=1 ∈ 𝒟n (Xk ) such that ∣Xki Xkj ∣ → ∣xi xj ∣ for any 1 ≤ i, j ≤ n. Exercise 6.7 (Uniform convergence). Prove that if metric spaces Xk uniformly converges to a metric space Y then GH Xk −−→ Y . Exercise 6.8 (Lipschitz convergence). Prove that if metric spaces Xk converges to a bounded metric space Y w.r.t. GH the Lipschitz distance, then Xk −−→ Y . Does it remain true without the boundedness assumption? Exercise 6.9. Let the metric space Xk be obtained from the standard sphere S 2 ⊂ ℝ3 by removing a ball of radius GH 1/k. Prove that Xk −−→ S 2 . Prove that if we change S 2 to the circle S 1 , then the similar statement does not hold. In the both cases we consider the interior metrics.

Chapter 7

Gromov–Hausdorff space. Schedule. Gromov–Hausdorff space (GH-space), distortions of a correspondence and its closure, calculating GH-distance in terms of closed correspondences, compactness of the set of all closed correspondences for compact metric spaces, continuity of distortion for compact metric spaces, optimal correspondences, existence of closed optimal correspondences for compact metric spaces, GH-space is geodesic, cover number and packing number, there relations, total boundness of families of compact metric spaces in terms of cover and packing numbers, isometric embedding of all compact metric spaces from a totally bounded family to the same compact subset of ℓ∞ , completeness of GH-space, separability of GH-space, mst-spectrum in terms of GH-distances to simplexes, Steiner problem in GH-space.

This section describes some geometrical and topological properties of the space consisting the isometry classes of compact metric spaces, endowed with the Gromov–Hausdorff metric. We denote by ℳ the set of all compact metric spaces considered up to isometry (in other words, the set of isometry classes of compact metric spaces). From Propositions 6.6 and 6.7 it follows that the distance dGH is a metric on ℳ. The metric space (ℳ, dGH ) is called the Gromov–Hausdorff space. Recall that by Δ1 we denoted a one-point metric space. Note that ℳ contains all finite metric spaces, in particular, Δ1 ∈ ℳ. The results presented in Examples 6.28–6.32 lead to the following geometric model of ℳ, see Figure 7.1.

Figure 7.1: Gromov–Hausdorff space: general properties.

Indeed, according to Example 6.32, the operation of multiplying the metric by a number λ > 0 is a homothety of the space ℳ, centered in the one-point metric space Δ1 , so ℳ is a cone with the vertex Δ1 . By Example 6.31, the curves γ(t) := t X, X ∈ ℳ, X ̸= Δ1 , are shortest between any of their points, so they are the generators of the cone. Figure 7.1 also illustrates Examples 6.28 and 6.30.

80

7.1. Existence of optimal correspondences between compact metric spaces

7.1

81

Existence of optimal correspondences between compact metric spaces

Let X and Y be arbitrary metric spaces. Agreement 7.1. In what follows, when we work with the metric space X × Y , we always suppose that its distance function is | | { } |(x, y)(x′ , y ′ )| = max ∣xx′ ∣, ∣yy ′ ∣ , and just this metric generates the Hausdorff distance on 𝒫0 (X × Y ) and on all its subspaces, e.g., on ℛ(X, Y ). Problem 7.1. Prove that the product topology on X × Y coincides with the one generated by the metric from Agreement 7.1. Proposition 7.2. For arbitrary metric spaces X and Y , if σ ¯ is the closure of σ ∈ 𝒫0 (X × Y ), then dis σ ¯ = dis σ. Proof. Since σ ⊂ σ ¯ then we have dis σ ≤ dis σ ¯ , hence in the case dis σ = ∞ the result holds. Now let dis σ < ∞. It remains to prove that dis σ ≥ dis σ ¯. { } ′ ′ By definition, for each ε > 0 and any (¯ x , y ¯ ), (¯ x , y ¯ ) ∈ σ ¯ there exist (x, y), |(x′ , y ′ ) ∈ σ such that max ∣¯ xx∣, ∣¯ y y∣ < | | | { ′ ′ } ε/6 and max ∣¯ x x ∣, ∣¯ y ′ y ′ ∣ < ε/6, thus |∣¯ xx ¯′ ∣ − ∣xx′ ∣| < ε/3 and |∣¯ y y¯′ ∣ − ∣yy ′ ∣| < ε/3. Therefore, | ′ | | | |∣¯ xx ¯ ∣ − ∣¯ y y¯′ ∣| < |∣xx′ ∣ − ∣yy ′ ∣| + 2ε/3 ≤ dis σ + 2ε/3. Passing to the supremum in definition of dis σ ¯ , we conclude that dis σ ¯ ≤ dis σ + 2ε/3. Since ε is arbitrary, we have dis σ ¯ ≤ dis σ. Denote by ℛc (X, Y ) the subset of ℛ(X, Y ) consisting of all closed correspondences R. Clearly that for any ¯ is also a correspondence, i.e., R ¯ ∈ ℛc (X, Y ). This fact, together with Proposition 7.2 and R ∈ ℛ(X, Y ) its closure R Theorem 6.12, immediately implies Corollary 7.3. For any metric spaces X and Y we have dGH (X, Y ) =

{ } 1 inf dis R : R ∈ ℛc (X, Y ) . 2

Now, let X and Y be compact metric spaces. Then X × Y is compact as well, and ℛc (X, Y ) ⊂ ℋ(X × Y ). By Corollary 5.37, ℋ(X × Y ) is compact. Proposition 7.4. For X, Y ∈ ℳ the set ℛc (X, Y ) is closed in ℋ(X × Y ), thus, ℛc (X, Y ) is compact. Proof. It suffices to show that for each σ ∈ ℋ(X × Y ) \ ℛc (X, Y ) some its neighborhood in ℋ(X × Y ) does not intersect ℛc (X, Y ). Notice that σ ̸∈ ℛ(X, Y ) because ℋ(X × Y ) consists of all closed nonempty subsets of X × Y , and ℛc (X, Y ) equals to the set of all closed R ∈ ℛ(X, Y ). Then either πX (σ) ̸= X, or πY (σ) ̸= Y , where πX and πY are the canonical projections. To be definite, suppose that the first condition holds, i.e., there exists x ∈ X \ πX (σ). Since σ is a closed subset of the compact X × Y , then it is compact itself, and therefore πX (σ) is compact in X, thus, πX (σ) is closed. The latter implies that there exists an open ball Uε (x) such that Uε (x) ∩ πX (σ) = ∅. Let ℋ(X×Y ) U = Uε/2 (σ), then for each σ ′ ∈ U we have dH (σ, σ ′ ) < ε/2, thus for any (a′ , b′ ) ∈ σ ′ there exists (a, b) ∈ σ such | | | | ′ ′ | | that (a, b), (a , b ) < ε/2. Since a ∈ πX (σ) then ∣xa∣ ≥ ε. On the other hand, ∣aa′ ∣ ≤ |(a, b), (a′ , b′ )| < ε/2, therefore, ∣xa′ ∣ > ε/2, i.e., a′ ̸∈ Uε/2 (x) and hence πX (σ ′ ) ∩ Uε/2 (x) = ∅. This implies that σ ′ is not a correspondence, thus σ ′ ̸∈ ℛc (X, Y ) and U ∩ ℛc (X, Y ) = ∅. | | Define a function f : (X × Y ) × (X × Y ) → ℝ as f (x, y, x′ , y ′ ) = |∣xx′ ∣ − ∣yy ′ ∣|. Clearly that f is continuous. Notice that for each σ ∈ 𝒫0 (X × Y ) we have { } dis σ = sup f (x, y, x′ , y ′ ) : (x, y), (x′ , y ′ ) ∈ σ = sup f ∣σ×σ . Proposition 7.5. If X, Y ∈ ℳ, then the function dis : ℋ(X × Y ) → ℝ is continuous. Proof. Since (X × Y ) × (X × Y ) is compact, then the function f is uniformly continuous, i.e., for any { } ε > 0 there exists δ > 0 such that for every (x1 , y1 , x′1 , y1′ ) and (x2 , y2 , x′2 , y2′ ) with max ∣x1 x2 ∣, ∣y1 , y2 ∣, ∣x′1 x′2 ∣, ∣y1′ y2′ ∣ < δ it holds | | |f (x1 , y1 , x′1 , y1′ ) − f (x2 , y2 , x′2 , y2′ )| < ε.

7.2. The Gromov–Hausdorff space is geodesic

82

Thus, for any σ ∈ ℋ(X × Y ) and any ε > 0 there exists δ > 0 such that for the open δ-neighborhood U = UδX×Y (σ) ⊂ X × Y of σ it holds sup f ∣U ×U ≤ sup f ∣σ×σ + ε. ℋ(X×Y )

By V we denote the open ball Uδ σ ′ ⊂ U , then it follows that

(σ) ⊂ ℋ(X × Y ) of radius δ centered at σ. Since for any σ ′ ∈ V we have

dis σ ′ = sup f ∣σ′ ×σ′ ≤ sup f ∣U ×U ≤ sup f ∣σ×σ + ε = dis σ + ε. Swapping σ and σ ′ , we get ∣ dis σ − dis σ ′ ∣ ≤ ε, and hence, the function dis is continuous. Definition 7.6. A correspondence R ∈ ℛ(X, Y ) is called optimal if dGH (X, Y ) = the set of all optimal correspondences between X and Y .

1 2

dis R. By ℛopt (X, Y ) we denote

Theorem 7.7. For any X, Y ∈ ℳ we have ℛopt (X, Y ) ∩ ℛc (X, Y ) ̸= ∅. Proof. By Proposition 7.5, the function dis : ℛc (X, Y ) → ℝ is continuous, and by Proposition 7.4 the space ℛc (X, Y ) is compact, thus the function dis attains its least value. The half of this value equals dGH (X, Y ), and each correspondence R which this least value is attained at is optimal. From Theorem 7.7 and Propositions 6.11 and 6.11, we immediately get Corollary 7.8. For each nonisometric X, Y ∈ ℳ there exist (1) a correspondence R ∈ ℛc (X, Y ) such that dGH (X, Y ) =

1 2

dis R;

(2) an admissible metric ρ ∈ 𝒟(X, Y ) such that dGH (X, Y ) = ρH (X, Y ); (3) a metric space Z and isometric embeddings of X and Y into Z such that the Hausdorff distance between their images equals dGH (X, Y ).

7.2

The Gromov–Hausdorff space is geodesic

Let X and Y be arbitrary metric spaces. We assume that the space X × Y is by default endowed with the metric | | { } |(x, y)(x′ , y ′ )| = max ∣xx′ ∣, ∣yy ′ ∣ . Also, we introduce on X × Y the following 1-parametric family of metrics dt , t ∈ (0, 1): ( ) dt (x, y), (x′ , y ′ ) = (1 − t)∣xx′ ∣ + t∣yy ′ ∣. Problem 7.2. Prove that the topologies generated by all dt , and by ∣ · ∣ as well, coincide with the product topology of X × Y . Now we suppose that dGH (X, Y ) < ∞, and R ∈ ℛ(X, Y ) is any correspondence with dis R < ∞. By Rt , t ∈ [0, 1], we denote the metric space (R, dt ) if t ∈ (0, 1), and we put R0 = X, R1 = Y . Proposition 7.9. The mapping t ⊦→ Rt is Lipschitzian with the Lipschitz constant Hausdorff distance).

1 2

dis R (w.r.t. the Gromov–

Proof. We have to show that 2dGH (Rs , Rt ) ≤ ∣s − t∣ dis R for arbitrary s, t ∈ [0, 1]. Let us start with the case 0 < s, t < 1. As always, we denote by id the identical mapping. Then {| } ) ( )| | ( ′ ′ ′ ′ ′ ′ | 2dGH (Rs , Rt ) ≤ dis id = sup |ds (x, y), (x , y ) − dt (x, y), (x , y ) | : (x, y), (x , y ) ∈ R = } {| | = ∣s − t∣ sup |∣xx′ ∣ − ∣yy ′ ∣| : (x, y), (x′ , y ′ ) ∈ R = ∣s − t∣ dis R. Now we consider the case s = 0, 0 < t < 1. As a correspondence we take St ∈ ℛ(R0 , Rt ) of the form ⋃ {( )} St = x, (x, y) . (x,y)∈R

7.3. Cover Number and Packing Number

83

Notice that such St does not depend on t as a set. We have {| } ( )| ( ) ( ′ ′ ′ ) | ′ ′ ′ | 2dGH (R0 , Rt ) ≤ dis St = sup |∣xx ∣ − dt (x, y), (x , y ) | : x, (x, y) , x , (x , y ) ∈ St = {| } | ( ) ( ) = t sup |∣xx′ ∣ − ∣yy ′ ∣| : x, (x, y) , x′ , (x′ , y ′ ) ∈ St = t dis R = ∣s − t∣ dis R. All remaining cases can be proved similarly. Now, let X and Y be compact metric spaces, and R ∈ ℛc (X, Y ). Then R is compact as well, and by Problem 7.2, all metric spaces Rt are compact too. Thus, by Proposition 7.9, the mapping t ⊦→ Rt is a Lipschitz curve in ℳ, and it joins X and Y . Theorem 7.10. Given X, Y ∈ ℳ and R ∈ ℛopt (X, Y ) ∩ ℛc (X, Y ), the curve γ : [0, 1] → ℳ, γ : t ⊦→ Rt , is a shortest geodesic with the speed dGH (X, Y ). In particular, the length of γ equals to dGH (X, Y ), thus, the space ℳ is geodesic. Proof. Choose arbitrary 0 ≤ s ≤ t ≤ 1, then, by Proposition 7.9, we have dGH (Rs , Rt ) ≤

(7.1)

t−s dis R = (t − s) dGH (X, Y ), 2

thus, taking into account the triangle inequality, we get dGH (X, Y ) ≤ dGH (X, Rs ) + dGH (Rs , Rt ) + dGH (Rt , Y ) ≤ dGH (X, Y ), therefore, dGH (X, Rs ) + dGH (Rs , Rt ) + dGH (Rt , Y ) = dGH (X, Y ). If for some s and t the inequality in Formula (7.1) is strict, then the previous equality is not satisfied, thus dGH (Rs , Rt ) = (t − s) dGH (X, Y ). It remains to note that ∣γ∣ =

sup 0=t0 0 it holds cov(X, ε) < ∞ (similarly, pack(X, ε) < ∞); (2) a metric space X is finite if and only if there exists n such that cov(X, ε) ≤ n for all ε > 0 (similarly, for pack(X, ε)); (3) the functions f (ε) = cov(X, ε) and g(ε) = pack(X, ε) are monotonically decreasing.

7.4. Totally bounded families of compact metric spaces

84

Proposition 7.12. For any metric space X and any number ε > 0 we have cov(X, ε) ≤ pack(X, ε) ≤ cov(X, ε/4). Proof. First we prove the first inequality. If pack(X, ε) = ∞, then the inequality is automatically satisfied. Now let pack(X, ε) < ∞ and x1 , . . . , xn , n = pack(X, ε), be a largest set of points in X for which the balls Uε/2 (xi ) are disjoint. Since this family is maximal, for any x ∈ X there exists xk such that Uε/2 (x) ∩ Uε/2 (xk ) ̸= ∅, i.e., ∣xxk ∣ < ε. Then the family {Uε (xi )}ni=1 covers X, so cov(X, ε) ≤ n = pack(X, ε). Let us prove the second inequality. Again, if cov(X, ε/4) = ∞, then the inequality holds. Now let cov(X, ε/4) < ∞ and x1 , . . . , xm , m = cov(X, ε/4), be the smallest set of points in X for which the balls Uε/4 (xi ) cover X. Suppose that pack(X, ε) > cov(X, ε/4), then there exist x′1 , . . . , x′n , n > cov(X, ε/4), such that the balls Uε/2 (x′i ) are pairwise disjoint. On the other hand, for some i ̸= j there exists k such that x′i , x′j ∈ Uε/4 (xk ), therefore, xk ∈ Uε/2 (x′i ) ∩ Uε/2 (x′j ), so this intersection is not empty, a contradiction. Corollary 7.13. Let X be an arbitrary metric space, then (1) if pack(X, ε) < ∞, then cov(X, ε) < ∞; (2) if cov(X, ε) < ∞, then pack(X, 4ε) < ∞. Thus, cov(X, ε) < ∞ for all ε > 0, if and only if pack(X, ε) < ∞ for all ε > 0. Proposition 7.14. Let X be an arbitrary metric space. Then the following statements are equivalent: (1) cov(X, ε) < ∞ for any ε > 0; (2) pack(X, ε) < ∞ for any ε > 0; (3) the space X is totally bounded. Proof. (1) ⇔ (2) This follows from Corollary 7.13. (1) ⇔ (3) The condition cov(X, ε) < ∞ is equivalent to the existence of a finite cover {Uε (xi )}ni=1 , which is equivalent to the existence of a finite ε-net {xi }ni=1 . Thus, the condition of Item (1) is equivalent to the total boundedness of the space X. Proposition 7.15. Let X, Y be metric spaces, δ > 0, and dGH (X, Y ) < δ, then (1) cov(X, ε) ≥ cov(Y, ε + 2δ), (2) pack(X, ε) ≥ pack(Y, 2ε + 4δ).

{ }m Proof. (1) The case cov(X, ε) = ∞ is obvious. Now let m := cov(X, ε) < ∞ and Uε (xi ) i=1 be a cover of X. By Theorem 6.12, there exists R ∈ ℛ(X, Y ) such that dis R < 2δ. For each i, we choose an arbitrary yi ∈ R(xi ) and show that the set {yi }m i=1 is an (ε + 2δ)-net, thus cov(Y, ε + 2δ) ≤ m = cov(X, ε). So, we take arbitrary y ∈ Y and choose any x ∈ R−1 (y). Then for some j we have ∣xxj ∣ < ε. Since dis R < 2δ, then ∣yyj ∣ < ε + 2δ, as { required. } n (2) Since the case pack(Y, 2ε+4δ) = ∞ is trivial, we assume that n := pack(Y, 2ε+4δ) < ∞ and let Uε+2δ (yi ) i=1 be a disjoint family of open balls in Y . Then for any i ̸= j we have ∣yi yj ∣ ≥ ε + 2δ. { For each }ni, we choose an arbitrary xi ∈ R−1 (yi ). Since dis R < 2δ, we have ∣xi xj ∣ > ε, therefore the family Uε/2 (xi ) i=1 is disjoint and, thus, pack(X, ε) ≥ n = pack(Y, 2ε + 4δ).

7.4

Totally bounded families of compact metric spaces

We will be interested in when a particular family of compact metric spaces is totally bounded. We begin with the following auxiliary statement, which will be needed below. For n ∈ ℕ we denote by ℳn ⊂ ℳ (ℳ[n] ⊂ ℳ) the set of all metric spaces with at most (respectively, exact) n points. For D ≥ 0, by ℳ(D) ⊂ ℳ we denote the set of all compact metric spaces whose diameters do not exceed D. We also put ℳn (D) = ℳn ∩ ℳ(D) and ℳ[n] (D) = ℳ[n] ∩ ℳ(D). It is clear that ℳn = ∪k≤n ℳ[k] and ℳn (D) = ∪k≤n ℳ[k] (D). Proposition 7.16. The space ℳ[n] (D) ⊂ ℳ is totally bounded.

7.4. Totally bounded families of compact metric spaces

85

Proof. Given X ∈ ℳ[n] (D), we consider all possible bijections ν : X| → {1, . . . , n}, and for every such ν we construct | the distance matrix f (X, ν) = ρ = (ρij ), where ρij = |ν −1 (i)ν −1 (j)|. Let T be the set of all such matrices. It is clear that the mapping g : T → ℳ[n] (D) such that g : f (X, ν) ⊦→ X is surjective. 2 We define on T the distance function generated by the ℓ∞ -norm, so T will be considered as a subset of ℝn∞ . Since 2 for every i, j we have ∣ρij ∣ ≤ D, the set T is bounded and, therefore, totally bounded as a subset of ℝn∞ . ′ ′ ′ ′ ′ −1 If X, X ∈ ℳ[n] (D), ρ = f (X, ν) and ρ = f (X , ν ), then R = (ν ) ◦ ν is a bijective correspondence between X and X ′ , and ∣ρρ′ ∣∞ = dis R ≥ 2dGH (X, X ′ ). Thus, the surjection g is Lipschitzian, therefore, ℳ[n] (D) = g(T ) is also totally bounded. Corollary 7.17. The space ℳn (D) ⊂ ℳ is totally bounded. Problem 7.4. Prove that the set ℳn (D) is compact, while ℳ[n] (D) for n > 1 is not. Theorem 7.18. Let 𝒞 be a nonempty subset of ℳ. Then the following statements are equivalent. (1) There is a number D ≥ 0 and a function N : (0, ∞) → ℕ such that for all X ∈ 𝒞 we have diam X ≤ D and pack(X, ε) ≤ N (ε). (2) There is a number D ≥ 0 and a function N : (0, ∞) → ℕ such that for all X ∈ 𝒞 we have diam X ≤ D and cov(X, ε) ≤ N (ε). (3) The space 𝒞 with the metric dGH is totally bounded. Proof. (3) ⇒ (1). Fix an arbitrary ε > 0. We have to find the corresponding D and N (ε). Since 𝒞 is totally bounded, for any δ > 0 there exists a finite δ-net 𝒞 ′ ⊂ 𝒞. Choose δ such that 4δ < ε. Since all the spaces lying in 𝒞 ′ are totally bounded, by Proposition 7.14, their packing numbers are finite. In addition, their diameters are finite. Put D′ = maxX ′ ∈𝒞 ′ diam X ′ and N ′ (ε) = maxX ′ ∈𝒞 ′ pack(X ′ , ε). For an arbitrary X ∈ 𝒞 there exists X ′ ∈ 𝒞 ′ such that dGH (X, X ′ ) < δ. It is easy to see that diam X ≤ diam X ′ + 2δ ≤ D′ + 2δ, so that we can put D = D′ + 2δ. In addition, by Proposition 7.15, it holds pack(X, ε) ≤ pack(X ′ , ε/2 − 2δ) ≤ N ′ (ε/2 − 2δ), so we can put N (ε) = N ′ (ε/2 − 2δ). (1) ⇔ (2). This immediately follows from Proposition 7.12. (2) ⇒ (3). Fix some ε > 0, and for each X ∈ 𝒞 consider a finite cover of the space X by at most n = N (ε) open ε ε ε ∈ ℳn (D), ) ≤ ε. In addition, FX we denote the set of centers of these balls, then dGH (X, FX balls of radius ε. By FX ε ε therefore, by Corollary 7.17, the family ℱ = {FX }X∈𝒞 ⊂ ℳn (D) is totally bounded. Since for any X ∈ 𝒞 and any ℳ ε ′ ε ε′ > ε we have X ∈ Uεℳ ′ (FX ), then 𝒞 ⊂ Uε′ (ℱ ). Since ε and ε are arbitrary, we conclude that 𝒞 is also totally bounded (verify that). The following theorem allows us to realize all metric spaces from a totally bounded subset of ℳ as subsets of some compact subset of ℓ∞ . Theorem 7.19 (Gromov). For each totally bounded family 𝒞 ⊂ ℳ there exists a compact K ⊂ ℓ∞ such that every X ∈ 𝒞 is isometrically embedded into K. Proof. The compact K is constructed as follows. By Theorem 7.18, there exist D ≥ 0 and N : (0, ∞) → ℕ such that for all X ∈ 𝒞 we have diam X ∑ ≤ D and cov(X, ε) ≤ N (ε). Choose an arbitrary decreasing sequence of positive ∞ numbers E = {ε1 , ε2 , . . .} such that i=1 εi < ∞. This sequence and the function N (ε) generate a sequence of natural numbers Ni = N (εi ). These two sequences, together with the number D, define the set FD,E ⊂ ℓ∞ as follows. ( ) Construction 7.1. Put A = ∪∞ a countable set. Let j=1 {1, . . . , N1 } × · · · × {1, . . . , Nj } . It is clear that A is ( ) ℓ∞ (A) = {f : A → ℝ : sup ∣f ∣ < ∞}, then ℓ∞ (A) is isometric to ℓ∞ . For brevity, instead of f (n1 , . . . , nj ) we will write f (n1 , . . . , nj ). We now define the set FD,E , composing it from all f : A → ℝ that satisfy the following conditions: (1) 0 ≤ f (n1 ) ≤ D for all 1 ≤ n1 ≤ N1 ; | | (2) |f (n1 , . . . , nj , nj+1 ) − f (n1 , . . . , nj )| ≤ εj for all elements (n1 , . . . , nj , nj+1 ) ∈ A. Lemma 7.20. The set FD,E defined above is a compact subset of ℓ∞ (A).

7.5. Some other properties of Gromov–Hausdorff space

86

| | ∑∞ Proof. First, note that for each function f ∈ FD,E it holds supa∈A |f (a)| ≤ D + i=1 εi < ∞, so f ∈ ℓ∞ (A). Further, since all the inequalities defining FD,E are non-strict, the set FD,E is closed in ℓ∞ (A). Since ℓ∞∑ (A) is complete, FD,E ∞ is also complete. In addition, the diameter of F is finite (it bounded by the number D + 2 D,E i=1 εi ). { } k Put A[k] = (n1 , . . . , nk ) ∈ A and Ak = ∪j=1 Aj . We denote by πk : ℓ∞ (A) → ℓ∞ (Ak ) the canonical projection that maps each function f : A → ℝ to its restriction on Ak ⊂ A, and let Fk = πk (FD,E ). Note that Fk is a closed and bounded subset of the finite-dimensional vector space ℓ∞ (Ak ), therefore Fk is compact. Define the mapping ν : Fk → ℓ∞ (A) by extending each function fk ∈ Fk to the entire set A as follows: fk (n1 , . . . , nk , nk+1 , . . .) = fk (n1 , . . . , nk ). It is clear that ν is isometric, therefore Fk′ = ν(Fk ) is also a compact set. ℓ (A) Put ek = εk + εk+1 + . . ., then ek → ∅ as k → ∞. By Condition (2), we have FD,E ⊂ Uek∞ (Fk′ ) for all k ≥ 2, which implies the total boundedness of FD,E (verify that). We now take the set FD,2E as K and show that each space X ∈ 𝒞 can be isometrically embedded into this K. We consider points of the form xa , a ∈ A, and again, for brevity, instead of x(n1 ,...,nj ) we write xn1 ···nj . { } 1 Take an arbitrary X ∈ 𝒞. Since cov(X, ε1 ) ≤ N (ε1 ) = N1 , then X contains an ε1 -net ∪N n1 =1 xn1 , i.e., the family { }N1 Uε1 (xn1 ) n1 =1 forms a cover of X. Note that some points xn1 may coincide. { ′ } { }N2 ′ 2 Further, since cov(X, ε2 ) ≤ N (ε2 ) = N2 , then X contains an ε2 -net ∪N n2 =1 xn2 , i.e., the family Uε2 (xn2 ) n2 =1 forms a cover of X. Fix n1 and choose only those balls Uε2 (x′n2 ) that satisfy ∣xn1 x′n2 ∣ < ε1 + ε2 . In this way we have got at most N2 balls. Enumerate them and add some copies of them to obtain exactly N2 balls which we denote by { }N2 Uε2 (xn1 n2 ), n2 = 1, . . . , N2 . So, we have got a cover Uε2 (xn1 n2 ) n2 =1 of the ball Uε1 (xn1 ). By construction, it holds ∣xn1 xn1 n2 ∣ < ε1 + ε2 < 2ε1 . { } Continuing this process, at the j-th step we get the family of balls Uεj (xa ) a∈A with ∣xn1 ···nj xn1 ···nj nj+1 ∣ < 2εj . [j]

It is easy to see that the set {xa }a∈A of centers of these balls is a countable everywhere dense subset of X (some xa may coincide with each other). By Theorem 2.26, the space X can be isometrically embedded into ℓ∞ (A) by associating with each point x the function fx : A → ℝ defined as follows: fx (a) = ∣xxa ∣. Lemma 7.21. For every x ∈ X we have fx ∈ FD,2E . Proof. It is clear that 0 ≤ fx ≤ D, so that Item (1) from the definition of the set FD,2E is satisfied. Further, for each (n1 , . . . , nj , nj+1 ), the point xn1 ···nj nj+1 lies in U2εj (xn1 ···nj ), so for every x ∈ X we have | | | | |fx (n1 , . . . , nj , nj+1 ) − fx (n1 , . . . , nj )| = |∣xn1 ···nj nj+1 x∣ − ∣xn1 ···nj x∣| ≤ ∣xn1 ···nj nj+1 xn1 ···nj ∣ < 2εj , therefore, Item (2) from the definition of the set FD,2E is also fulfilled. Thus, the mapping x ⊦→ fx isometrically embeds X into K.

7.5

Some other properties of Gromov–Hausdorff space

In this section we apply the previous results to prove a few more properties of the Gromov–Hausdorff space ℳ.

7.5.1

Completeness of Gromov–Hausdorff space

Theorem 7.19 implies the following result. Theorem 7.22. The space ℳ is complete. ∞ Proof. Consider an arbitrary fundamental sequence {Xi }∞ i=1 ⊂ ℳ. Then {Xi }i=1 is a totally bounded subset of ℳ. By Theorem 7.19, there exists a compact set K ⊂ ℓ∞ into which all Xi can be isometrically embedded. Denote by Yi the image of Xi . By Theorem 5.38, the space ℋ(K) of all closed bounded subsets of K is also compact, therefore the sequence Yi ∈ ℋ(K) contains a convergent subsequence Yni . Let Y be the limit of this subsequence. Then Y is a nonempty compact metric space and

dGH (Xni , Y ) = dGH (Yni , Y ) ≤ dH (Yni , Y ) → 0 as i → ∞, GH

GH

therefore, Xni −−→ Y and, since the sequence Xi is fundamental, we have Xi −−→ Y .

7.6. Calculating mst-spectrum by means of Gromov–Hausdorff distances

7.5.2

87

Separability of Gromov–Hausdorff space

Theorem 7.23. The space ℳ is separable. Proof. By Corollary 7.17, each space ℳn (D) is totally bounded and, therefore, separable. Since ℳn = ∪∞ k=1 ℳn (k), then all ℳn , as well as their union ∪∞ n=1 ℳn , are separable. This last union is the set of all finite metric spaces, which, as noted in Example 6.27, is an everywhere dense subset of ℳ, so that ℳ is separable. Recall that a complete separable metric space is called Polish. Thus, the following result holds. Corollary 7.24. The space ℳ is Polish. By Problem 1.31, for a metric space, the separability is equivalent to having a countable base. Corollary 7.25. The space ℳ has a countable base.

7.6

Calculating mst-spectrum by means of Gromov–Hausdorff distances

Recall that by Δn we denoted n-point metric space such that all its nonzero distances equal 1. Also, given λ > 0 and any metric space X, if we multiply by λ all the distances in X, then the resulting metric space we denote by λX. In the present section we show that the mst-spectrum of an arbitrary n-point metric space X can be represented as a linear function on the Gromov–Hausdorff distances from this space to the λΔ2 , . . . , λΔn for λ ≥ 2 diam X. Theorem 7.26. Let X be a finite metric space, σ(X) = (σ1 , . . . , σn−1 ), λ ≥ 2 diam X. Then σk = λ − 2dGH (λΔk+1 , X). Proof. Choose any 1 ≤ k ≤ n − 1 and arbitrary irreducible correspondence R ∈ ℛ0 (λΔk+1 , X). By Proposition 6.22, there exists partitions RλΔk+1 = {Zi }pi=1 and RX = {Xi }pi=1 of λ Δk+1 and X, respectively, such that R = ∪pi=1 Zi ×Xi , and min{#Zi , #Xi } = 1 for all i. By Proposition 6.23, it holds dis R ≥ max{diam RλΔk+1 , diam RX }. Thus, if for some i we have #Zi > 1, then dis R ≥ λ ≥ 2 diam X. Since k + 1 ≤ n, there exists R such that #Zi = 1 for all i. For such R, again by Proposition 6.23, we have dis R ≤ diam X. Therefore, inf R∈ℛ0 (λΔk+1 ,X) dis R is achieved on a correspondences of the latter type. The set of these correspondences we denote by ℛ. Now, if R ∈ ℛ, then p = k + 1 and RX ∈ 𝒟k+1 (X). By Proposition 6.23, we have { } dis R = sup diam RX , ∣Xi Xj ∣′ − λ, λ − ∣Xi Xj ∣ : 1 ≤ i < j ≤ k + 1 = { } = sup λ − ∣Xi Xj ∣ : 1 ≤ i < j ≤ k + 1 = λ − α(RX ), where the second equality holds because { } max ∣Xi Xj ∣′ − λ, diam RX ≤ diam X ≤ λ − diam X ≤ λ − ∣Xi Xj ∣ for any 1 ≤ i < j ≤ k + 1. Corollary 6.21, together with above considerations, gives us ( ) 2dGH (λΔk+1 , X) = min dis R = min λ − α(RX ) = λ − max R∈ℛ

R∈ℛ

D∈𝒟k+1 (X)

where the last equality holds because each D generates some R ∈ ℛ. It remains to use Theorem 4.13 which states that { } σk = max α(D) : D ∈ 𝒟k+1 (X) , thus, 2dGH (λΔk+1 , X) = λ − σk . Corollary 7.27. Let X be a finite metric space and λ ≥ 2 diam X, then mst X = λ(#X − 1) − 2

#X−1 ∑ k=1

dGH (λΔk+1 , X).

α(D),

7.7. Steiner problem in Gromov–Hausdorff space

7.7

88

Steiner problem in Gromov–Hausdorff space

In this section we prove the following Theorem 7.28. Let M ⊂ ℳ be a finite set such that each X ∈ M is a finite metric space. Then SMT(M, ℳ) ̸= ∅, i.e., for such M the Steiner problem has a solution. Remark 7.29. For arbitrary finite M ⊂ ℳ the problem is still open. Proof. Let n = #M . In Section 4.5 we introduced model full Steiner trees, and we have shown how to use them for calculating the length of a Steiner minimal tree. Recall the corresponding definitions in our case. A full Steiner tree has the vertices of two types only: the ones of degree 1 which we call boundary, and the ones of degree 3 which we call interior. In model full Steiner trees which we use to treat the problem for such M , the vertex set is {1, . . . , 2n − 2}, where {1, . . . , n} are reserved for the boundary vertices. We called two model full Steiner tree equivalent if there is an isomorphism between them fixed on the boundary. By ℬn we denoted the set of all model full Steiner trees with n boundary vertices considered up to this equivalence. Enumerate the points from M in an arbitrary way, i.e., we consider a bijection φ : {1, . . . , n} → M . Choose an arbitrary G ∈ ℬn , and consider a network Γ of the type G for which ∂Γ = φ. Then all such networks for given G differ from each other only in the “positions” of their interior vertices, thus the set [G, φ] of such networks can be identified with ℳn−2 . Then we proved (Corollary 4.24) that { } smt(M, ℳ) = inf ∣Γ∣ : Γ ∈ [G, φ], G ∈ ℬn . Choose an arbitrary G ∈ ℬn and any Γ ∈ [G, φ]. We put Xi = Γ(i), then M = {X1 , . . . , Xn }. For each −1 k ij ∈ E(G) we choose an arbitrary Rij ∈ ℛopt (Xi , Xj ) in such a way that Rji = Rij . Let Xk = {xik }ni=1 , then for any 2n−2 i k ∈ {1, . . . , n} and any 1 ≤ i ≤ nk we construct a network Γk : {1, . . . , 2n − 2} → ⊔j=1 Xj as follows: in each Xj we r choose one point xj j =: Γik (j) such that (1) xrkk = xik ; r

r

(2) for any pq ∈ E(G) we have (xpp , xqq ) ∈ Rpq (verify that it is always possible to do). Consider the set 𝒯 = {Γik } consisting of all Γik over all possible k and i. For any j ∈ {1, . . . , 2n − 2} we put Vj = ∪T ∈𝒯 {T (j)}, i.e., we gather in each Xj all ∑npoints that are the images of the vertices of the constructed networks. Let us note that all Vj have at most N := k=1 nk points, and for each j ∈ {1, . . . , n} it holds Vj = Xj . t the set of all pairs (xp , xp ) such that for some Further, for each p and q such that pq ∈ E(G) we denote by Rpq T ∈ 𝒯 we have xp = T (p) and xq = T (q), i.e., we gather all pairs forming the images of the edges of the constructed t t networks. Thus, we obtained correspondences Rpq ∈ ℛ(Vp , Vq ) such that Rpq ⊂ Rpq , hence t dGH (Vp , Vq ) ≤ dis Rpq ≤ dis Rpq = dGH (Xp , Xq ).

Denote by Γt the network Γt : p ⊦→ Vp . Since Γt ∣{1,...,n} = φ and Γt has the type G, then Γt ∈ [G, φ]. Denote by [G, φ]t the set of all such Γt . Since ∣Γt ∣ ≤ ∣Γ∣, then { } smt(M, ℳ) = inf ∣Γt ∣ : Γt ∈ [G, φ]t , G ∈ ℬn . t However, all Xi belong to ℳN , therefore, smt(M, ℳ) = smt(M, ℳ ∑N ). Moreover, if we choose′ Γ such that all Vi , i ≥ n + 1, coincide with Vk for some k ≤ n, then smt(M, ℳN ) ≤ 1≤p,q≤n dGH (Xp , Xq ) =: D . Thus, it suffices to consider only Γt with ∣Γt ∣ ≤ D′ , in particular, for such Γt and any pq ∈ E(G) we have dGH (Vp , Vq ) ≤ D′ . Let us put d = max{diam Xk : k = 1, . . . , n}. Since for any X, Y ∈ ℳ we have dGH (X, Y ) ≥ 21 ∣ diam X − diam Y ∣, then for each Xp we have diam Xp ≤ d + 2(n − 2)D′ =: D (all these estimates are rather rough, however, we do not need exact ones here). In account, we proved that all Xp belongs to ℳN (D). By Problem 7.4, the space ℳN (D) is compact, that is why the continuous function ℓ : ℳN (D)n−2 → ℝ, ℓ : (Xn+1 , . . . , X2n−2 ) ⊦→ ∣Γt ∣, attains its minimum at some Γt0 . It remains to notice that ℬn is finite.

References to Chapter 7 [1] Burago D., Burago Yu., Ivanov S. A Course in Metric Geometry. Graduate Studies in Mathematics, vol.33, A.M.S., Providence, RI, 2001.

89

Exercises to Chapter 7.

90

Exercises to Chapter 7 Exercise 7.1. Prove that the product topology on X × Y coincides with the one generated by the metric from Agreement 7.1. Exercise 7.2. Prove that the topologies generated by all dt , and by ∣ · ∣ as well, coincide with the product topology of X × Y . Exercise 7.3. Prove that (1) a metric space X is bounded if only for some ε > 0 it holds cov(X, ε) < ∞ (similarly, pack(X, ε) < ∞); (2) a metric space X is finite if and only if there exists n such that cov(X, ε) ≤ n for all ε > 0 (similarly, for pack(X, ε)); (3) the functions f (ε) = cov(X, ε) and g(ε) = pack(X, ε) are monotonically decreasing. Exercise 7.4. Prove that the set ℳn (D) is compact, while ℳ[n] (D) for n > 1 is not.

Chapter 8

Calculating GH-distances to simplexes and some applications. Schedule. GH-distance to simplexes with more points, GH-distance to simplexes with at most the same number of points, generalized Borsuk problem, solution of generalized Borsuk problem in terms of GH-distances, clique covering number and chromatic number of simple graphs, their dualities, calculating these numbers in terms of GH-distances.

By simplex we mean a metric space in which all non-zero distances equal to each other. If m is an arbitrary cardinal number, a simplex contain m points, and all its non-zero distances equal 1, then we denote this simplex by Δm . Thus, λΔm , λ > 0, is a simplex whose non-zero distances equal λ. Also, for arbitrary metric space X and λ = 0, the space λX coincides with Δ1 .

8.1

Gromov–Hausdorff distance to simplexes with more points

The next result generalizes Theorem 4.1 from [1]. Theorem 8.1. Let X be an arbitrary metric space, m > #X a cardinal number, and λ ≥ 0, then 2dGH (λΔm , X) = max{λ, diam X − λ}. Proof. If X is unbounded, then 2dGH (λΔm , X) = ∞ by Example 6.29, and we get what is required. Now, let diam X < ∞. If #X = 1, then diam X = 0, and, by Example 6.28, we have 2dGH (λΔ, X) = diam λΔ = λ = max{λ, diam X − λ}. If λ = 0, then, by Example 6.28, we have 2dGH (Δ1 , X) = diam X = max{λ, diam X − λ}. Let #X > 1 and λ > 0. Choose an arbitrary R ∈ ℛ(λΔm , X). Since #X < m and λ > 0, then there exists x ∈ X such that #R−1 (x) ≥ 2, thus, dis R ≥ λ and 2dGH (λΔm , X) ≥ λ. Consider an arbitrary sequence (xi , yi ) ∈ X × X such that ∣xi yi ∣ → diam X. If it contains a subsequence (xik , yik ) such that for each ik there exists zk ∈ λΔ, (zk , xik ) ∈ R, (zk , yik ) ∈ R, then dis R ≥ diam X and 2dGH (λΔm , X) ≥ max{λ, diam X} ≥ max{λ, diam X − λ}. If such subsequence does not exist, then there exists a subsequence (xik , yik ) such that for any ik there exist distinct zk , wk ∈ λΔm , (zk , xik ) ∈ R, (wk , yik ) ∈ R, and, therefore, { } 2dGH (λΔm , X) ≥ max λ, ∣ diam X − λ∣ ≥ max{λ, diam X − λ}. { } Thus, in the both cases we have 2dGH (λΔ, X) ≥ max λ, diam X − λ .

91

8.2. Gromov–Hausdorff distance to simplexes with at most the same number of points

92

Choose an arbitrary x0 ∈ X, then, by assumption, #X > 1, and, thus, the set X \ {x0 } is not empty. Since #X < m, then λΔm contains a subset λΔ′ of the same cardinality with X \ {x0 }. Let g : λΔ′ → X \ {x0 } be an arbitrary bijection, and λΔ′′ = λΔm \ λΔ′ , then #λΔ′′ > 1. Consider the following correspondence: {( } ( ) ) R0 = z ′ , g(z ′ ) : z ′ ∈ λΔ′ ∪ λΔ′′ × {x0 } . Then we can apply Proposition 6.23, thus we have dis R0 = sup{λ, ∣x1 x′1 ∣ − λ, λ − ∣x2 x′2 ∣ : x1 , x′1 , x2 , x′2 ∈ X, x1 ̸= x′1 , x2 ̸= x′2 } = max{λ, diam X − λ}, therefore, 2dGH (λΔ, X) = max{λ, diam X − λ}, what is required.

8.2

Gromov–Hausdorff distance to simplexes with at most the same number of points

Let X be an arbitrary set different from singleton, 2 ≤ m ≤ #X a cardinal number, and λ > 0. Under notations from Section 6.1, consider an arbitrary D ∈ 𝒟m (X), any bijection g : λΔm → D, and construct the correspondence RD ∈ ℛ(λΔm , X) in the following way: ⋃ RD = {z} × g(z). z∈λΔm

Clearly that each correspondence RD is irreducible. From Proposition 6.23 we get Proposition 8.2. Let X ̸= Δ1 be an arbitrary metric space, 2 ≤ m ≤ #X a cardinal number, and λ > 0. Then for any D ∈ 𝒟m (X) it holds dis RD = max{diam D, λ − α(D), β(D) − λ}. Proof. If X is unbounded, then dis R = ∞ for any R ∈ ℛ(λΔm , X). Since m ≥ 2, for any D = {Xi }i∈I ∈ 𝒟m (X) we have either diam D = ∞, or β(D) = ∞. Indeed, if diam D < ∞ and β(D) < ∞ then for any x, y ∈ X either x, y ∈ Xi , thus ∣xy∣ ≤ diam D, or x ∈ Xi , y ∈ Xj , i ̸= j, and ∣xy∣ ≤ ∣Xi Xj ∣ ≤ β(D), therefore X is bounded. Thus, for unbounded X the right-hand side of the considered equation is infinite as well, thus we get what is required. Now, let diam X < ∞. By Proposition 6.23, we have { } dis RD = sup diam D, λ − ∣Xi Xj ∣, ∣Xi Xj ∣′ − λ : i, j ∈ I, i ̸= j = max{diam D, λ − α(D), β(D) − λ}, that completes the proof. Corollary 8.3. Let X ̸= Δ1 be an arbitrary metric space, 2 ≤ m ≤ #X a cardinal number, and λ > 0. Then for any D ∈ 𝒟m (X) it holds dis RD = max{diam D, λ − α(D), diam X − λ}. Proof. Again, for unbounded X the equation evidently holds. Consider now the case of bounded X. Notice that diam D ≤ diam X and β(D) ≤ diam X. In addition, if diam D < diam X, and (xi , yi ) ∈ X × X is a sequence such that ∣xi yi ∣ → diam X, then, starting from some i, the points xi and yi belong to different elements of D, therefore, in this case we have β(D) = diam X, and the formula is proved. Now, let diam D = diam X, then β(D) − λ ≤ diam X and diam X − λ ≤ diam X, thus max{diam D, λ − α(D), β(D) − λ} = max{diam X, λ − α(D)} = max{diam D, λ − α(D), diam X − λ}, that completes the proof. Proposition 8.4. Let X ̸= Δ1 be an arbitrary metric space, and 2 ≤ m ≤ #X a cardinal number, and λ > 0. Then 2dGH (λΔm , X) =

inf D∈𝒟m (X)

dis RD .

8.3. Generalized Borsuk problem

93

Proof. The case of unbounded X is trivial, so, let X be bounded. By Corollary 6.21, 2dGH (λΔm , X) =

inf

R∈ℛ0 (λΔm ,X)

dis R,

thus it suffices to prove that for any irreducible correspondence R ∈ ℛ0 (λΔm , X) there exists D ∈ 𝒟m (X) such that dis RD ≤ dis R. Let us choose an arbitrary R ∈ ℛ0 (λΔm , X) such that it cannot be represented in the form RD , then the partition R DλΔm is not pointwise, i.e., there exists x ∈ X such that #R−1 (x) ≥ 2, therefore, dis R ≥ λ. R Define a metric on the set DλΔ to be equal λ between any its distinct elements, then this metric space is isometric m ′ to a simplex λΔn , n ≤ m. The correspondence R generates naturally another correspondence R′ ∈ ℛ(λΔ′n , X), R R R namely, if DλΔ = {Δj }j∈J , and fR : DλΔ → DX is the bijection generated by R, then m m R′ =



{Δj } × fR (Δj ).

j∈J R It is easy to see that dis R = max{λ, dis R′ }. Moreover, R′ is generated by the partition D′ = DX , i.e., R′ = RD′ , thus, by Corollary 8.3, we have

dis R′ = max{diam D′ , λ − α(D′ ), diam X − λ}, and hence,

dis R = max{λ, diam D′ , λ − α(D′ ), diam X − λ} = max{λ, diam D′ , diam X − λ}.

Since n ≤ m, the partition D′ has a subpartition D ∈ 𝒟m (X). Clearly, diam D ≤ diam D′ , therefore, dis RD = max{diam D, λ − α(D), diam X − λ} ≤ max{diam D′ , λ, diam X − λ} = dis R, q.e.d. Considering separately the case λ = 0, we get the following Corollary 8.5. Let X ̸= Δ1 be an arbitrary metric space, 2 ≤ m ≤ #X a cardinal number, and λ ≥ 0. Then 2dGH (λΔm , X) = For any metric space X put

inf D∈𝒟m (X)

max{diam D, λ − α(D), diam X − λ}.

{ } ε(X) = inf ∣xy∣ : x, y ∈ X, x ̸= y .

Notice that ε(X) ≤ diam X, and for a bounded X the equality holds, if and only if X is a simplex. Corollary 8.5 immediately implies the following result that is proved in [1]. Theorem 8.6 ([1]). Let X ̸= Δ1 be a finite metric space, m = #X, and λ ≥ 0, then { } 2dGH (λΔm , X) = max λ − ε(X), diam X − λ .

8.3

Generalized Borsuk problem

Classical Borsuk Problem deals with partitions of subsets of Euclidean space into parts having smaller diameters. We generalize the Borsuk problem to arbitrary bounded metric spaces and partitions of arbitrary cardinality. Let X be a bounded metric space, m a cardinal number such that 2 ≤ m ≤ #X, and D = {Xi }i∈I ∈ 𝒟m (X). We say that D is a partition into subsets having strictly smaller diameters, if there exists ε > 0 such that diam Xi ≤ diam X − ε for all i ∈ I. By Generalized Borsuk problem we call the following one: Is it possible to partition a bounded metric space X into a given, probably infinite, number of subsets, each of which has a strictly smaller diameter than X? We give a solution to the Generalized Borsuk problem in terms of the Gromov–Hausdorff distance. Theorem 8.7. Let X be an arbitrary bounded metric space and m a cardinal number such that 2 ≤ m ≤ #X. Choose an arbitrary number 0 < λ < diam X, then X can be partitioned into m subsets having strictly smaller diameters if and only if 2dGH (λΔm , X) < diam X.

8.4. Calculating clique covering and chromatic numbers of a graph

94

Proof. For the λ chosen, due Corollary 8.5, we have 2dGH (λΔm , X) ≤ diam X, and the equality holds if and only if for each D ∈ 𝒟m (X) we have diam D = diam X. The latter means that there is no partition of the space X into m parts having strictly smaller diameters. Corollary 8.8. Let d > 0 be a real number, and m ≤ n cardinal numbers. By ℳn we denote the set of isometry classes of bounded metric spaces of cardinality at most n, endowed with the Gromov–Hausdorff distance. Choose an arbitrary 0 < λ < d. Then the intersection Sd/2 (Δ1 ) ∩ Sd/2 (λΔm ) of the spheres, considered as the spheres in ℳn , does not contain spaces, whose cardinality is less than m, and consists exactly of all metric spaces from ℳn , whose diameters are equal to d and that cannot be partitioned into m subsets of strictly smaller diameters. Proof. Let X belong to the intersection of the spheres, then diam X = d in accordance with Example 6.28. If m > #X, then, due to Theorem 8.1, we have 2dGH (λΔ, X) = max{λ, diam X − λ} < d, therefore X ̸∈ Sd/2 (λΔm ), that proves the first statement of Corollary. Now let m ≤ #X. Since diam X = d and 2dGH (λΔm , X) = d, then, due to Theorem 8.7, the space X cannot be partitioned into m subsets of strictly smaller diameters. Conversely, each X of the diameter d, such that m ≤ #X and which cannot be partitioned into m subsets of strictly smaller diameter, lies in the intersection of the spheres by Theorem 8.7.

8.4

Calculating clique covering and chromatic numbers of a graph

Recall that a subgraph of an arbitrary simple graph G is called a clique, if any its two vertices are connected by an edge, i.e., the clique is a subgraph which is a complete graph itself. Notice that each single-vertex subgraph is also a clique. For convenience, the vertex set of a clique is also referred as a clique. On the set of all cliques, an ordering with respect to inclusion is naturally defined, and hence, due to the above remarks, a family of maximal cliques is uniquely defined; this family forms a cover of the graph G in the following sense: the union of all vertex sets of all maximal cliques coincides with the vertex set V (G) of the graph G. If one does not restrict himself by maximal cliques, then, generally speaking, one can find other families of cliques covering the graph G. One of the classical problems of the Graph Theory is to calculate the minimal possible number of cliques covering a finite simple graph G. This number is referred as the clique covering number and is often denoted by θ(G). It is easy to see that the value θ(G) is also equal to the least number of cliques whose vertex sets form a partition of V (G). Another popular problem is to find the least possible number of colors that is necessary to color the vertices of a simple finite graph G in such a way that adjacent vertices have different colors. This number is denoted by γ(G) and is referred as the chromatic number of the graph G. For a simple graph G, by G′ we denote its dual graph, i.e., the graph with the same vertex set and the complementary set of edges (two vertices of G′ are adjacent if and only if they are not adjacent in G). Problem 8.1. For any simple finite graph G it holds θ(G) = γ(G′ ). Let G = (V, E) be an arbitrary finite graph. Fix two real numbers a < b ≤ 2a and define a metric on V as follows: the distance between adjacent vertices equals a, and nonadjacent vertices equals b. Then a subset V ′ ⊂ V has diameter a if and only if G(V ′ ) ⊂ G is a clique. This implies that each clique covering number equals to the least cardinality of partitions of the metric space V onto subsets of (strictly) smaller diameter. However, this number was calculated in Theorem 8.7. Thus, we get the following Corollary 8.9. Let G = (V, E) be an arbitrary finite graph. Fix two real numbers a < b ≤ 2a and define a metric on V as follows: the distance between adjacent vertices equals a, and nonadjacent vertices equals b. Let m be the greatest positive integer k such that 2dGH (aΔk , V ) = b (in the case when there is no such k, we put m = 0). Then θ(G) = m + 1.

8.4. Calculating clique covering and chromatic numbers of a graph

95

Problem 8.2. Consider simple finite graphs G = (V, E) for which the clique covering numbers θ(G) are known, and get the Gromov–Hausdorff distances between the corresponding metric spaces V and simplexes λΔm with m ≤ θ(G). Verify explicitly that for k > m these distances are less than diam V . Because of the duality between clique and chromatic numbers, we get Corollary 8.10. Let G = (V, E) be an arbitrary finite graph. Fix two real numbers a < b ≤ 2a and define a metric on V as follows: the distance between adjacent vertices equals b, and nonadjacent vertices equals a. Let m be the greatest positive integer k such that 2dGH (aΔk , V ) = b (in the case when there is no such k, we put m = 0). Then γ(G) = m + 1. Problem 8.3. Consider simple finite graphs G = (V, E) for which the chromatic numbers γ(G) are known, and get the Gromov–Hausdorff distances between the corresponding metric spaces V and simplexes λΔm with m ≤ γ(G). Verify explicitly that for k > m this distances are less than diam V .

References to Chapter 8 [1] A.O.Ivanov, S.Iliadis, and A.A.Tuzhilin, Geometry of Compact Metric Space in Terms of Gromov-Hausdorff Distances to Regular Simplexes. ArXiv e-prints, arXiv:1607.06655, 2016.

96

Exercises to Chapter 8.

97

Exercises to Chapter 8 Exercise 8.1. For any simple finite graph G it holds θ(G) = γ(G′ ). Exercise 8.2. Consider simple finite graphs G = (V, E) for which the clique covering numbers θ(G) are known, and get the Gromov–Hausdorff distances between the corresponding metric spaces V and simplexes λΔm with m ≤ θ(G). Verify explicitly that for k > m these distances are less than diam V . Exercise 8.3. Consider simple finite graphs G = (V, E) for which the chromatic numbers γ(G) are known, and get the Gromov–Hausdorff distances between the corresponding metric spaces V and simplexes λΔm with m ≤ γ(G). Verify explicitly that for k > m this distances are less than diam V .