Metric Embeddings: Bilipschitz and Coarse Embeddings into Banach Spaces 9783110264012, 9783110263404

Embeddings of discrete metric spaces into Banach spaces recently became an important tool in computer science and topolo

233 18 3MB

English Pages 384 Year 2013

Report DMCA / Copyright

DOWNLOAD PDF FILE

Table of contents :
Preface
1 Introduction: examples of metrics, embeddings, and applications
1.1 Metric spaces: definitions and main examples
1.2 Types of embeddings: isometric, bilipschitz, coarse, and uniform
1.2.1 Isometric embeddings
1.2.2 Bilipschitz embeddings
1.2.3 Coarse and uniform embeddings
1.3 Probability theory terminology and notation
1.4 Applications to the sparsest cut problem
1.5 Exercises
1.6 Notes and remarks
1.6.1 To Section 1.1
1.6.2 To Section 1.2
1.6.3 To Section 1.3
1.6.4 To Section 1.4
1.6.5 To exercises
1.7 On applications in topology
1.8 Hints to exercises
2 Embeddability of locally finite metric spaces into Banach spaces is finitely determined. Related Banach space theory
2.1 Introduction
2.2 Banach space theory: ultrafilters, ultraproducts, finite representability
2.2.1 Ultrafilters
2.2.2 Ultraproducts
2.2.3 Finite representability
2.3 Proofs of the main results on relations between embeddability of a locally finite metric space and its finite subsets
2.3.1 Proof in the bilipschitz case
2.3.2 Proof in the coarse case
2.3.3 Remarks on extensions of finite determination results
2.4 Banach space theory: type and cotype of Banach spaces, Khinchin and Kahane inequalities
2.4.1 Rademacher type and cotype
2.4.2 Kahane-Khinchin inequality
2.4.3 Characterization of spaces with trivial type or cotype
2.5 Some corollaries of the theorems on finite determination of embeddability of locally finite metric spaces
2.6 Exercises
2.7 Notes and remarks
2.8 Hints to exercises
3 Constructions of embeddings
3.1 Padded decompositions and their applications to constructions of embeddings
3.2 Padded decompositions of minor-excluded graphs
3.3 Padded decompositions in terms of ball growth
3.4 Gluing single-scale embeddings
3.5 Exercises
3.6 Notes and remarks
3.7 Hints to exercises
4 Obstacles for embeddability: Poincaré inequalities
4.1 Definition of Poincaré inequalities for metric spaces
4.2 Poincaré inequalities for expanders
4.3 Lp-distortion in terms of constants in Poincaré inequalities
4.4 Euclidean distortion and positive semidefinite matrices
4.5 Fourier analytic method of getting Poincaré inequalities
4.6 Exercises
4.7 Notes and remarks
4.8 A bit of history of coarse embeddability
4.9 Hints to exercises
5 Families of expanders and of graphs with large girth
5.1 Introduction
5.2 Spectral characterization of expanders
5.3 Kazhdan’s property (T) and expanders
5.4 Groups with property (T)
5.4.1 Finite generation of SLn(ℤ)
5.4.2 Finite quotients of SLn(ℤ)
5.4.3 Property (T) for groups SLn(ℤ)
5.4.4 Criterion for property (T)
5.5 Zigzag products
5.6 Graphs with large girth: basic definitions
5.7 Graph lift constructions and ℓ1-embeddable graphs with large girth
5.8 Probabilistic proof of existence of expanders
5.9 Size and diameter of graphs with large girth: basic facts
5.10 Random constructions of graphs with large girth
5.11 Graphs with large girth using variational techniques
5.12 Inequalities for the spectral gap of graphs with large girth
5.13 Biggs’s construction of graphs with large girth
5.14 Margulis’s 1982 construction of graphs with large girth
5.15 Families of expanders which are not coarsely embeddable one into another
5.16 Exercises
5.17 Notes and remarks
5.17.1 Bounds for spectral gaps
5.17.2 Graphs with very large spectral gaps
5.17.3 Some more results and constructions
5.18 Hints to exercises
6 Banach spaces which do not admit uniformly coarse embeddings of expanders
6.1 Banach spaces whose balls admit uniform embeddings into L1
6.2 Banach spaces not admitting coarse embeddings of expander families, using interpolation
6.3 Banach space theory: a characterization of reflexivity
6.4 Some classes of spaces whose balls are not uniformly embeddable into L1
6.4.1 Stable metric spaces and iterated limits
6.4.2 Non-embeddability result
6.5 Examples of non-reflexive spaces with nontrivial type
6.6 Exercises
6.7 Notes and remarks
6.8 Hints to exercises
7 Structure properties of spaces which are not coarsely embeddable into a Hilbert space
7.1 Expander-like structures implying coarse non-embeddability into L1
7.2 On the structure of locally finite spaces which do not admit coarse embeddings into a Hilbert space
7.3 Expansion properties of metric spaces not admitting a coarse embedding into a Hilbert space
7.4 Exercises
7.5 Notes and remarks
7.6 Hints to exercises
8 Applications of Markov chains to embeddability problems
8.1 Basic definitions and results on finite Markov chains
8.2 Markov type
8.3 First application of Markov type to embeddability problems: Euclidean distortion of graphs with large girth
8.4 Banach space theory: renormings of superreflexive spaces, q-convexity and p-smoothness
8.4.1 Definitions and duality
8.4.2 Pisier theorem on renormings of uniformly convex spaces
8.5 Markov type of uniformly smooth Banach spaces
8.6 Applications of Markov type to lower estimates of distortions of embeddings into uniformly smooth Banach spaces
8.7 Exercises
8.8 Notes and remarks
8.9 Hints to exercises
9 Metric characterizations of classes of Banach spaces
9.1 Introduction
9.2 Proof of the Ribe theorem through Bourgain’s discretization theorem
9.2.1 Proving Bourgain’s discretization theorem. Preliminary step: it suffices to consider spaces with differentiable norm
9.2.2 First step: picking the system of coordinates
9.2.3 Second step: construction of a Lipschitz almost-extension
9.2.4 Third step: further smoothing of the map using Poisson kernels
9.2.5 Poisson kernel estimates and proofs of Lemmas 9.14 and 9.15
9.3 Test-space characterizations
9.3.1 More Banach space theory: superreflexivity
9.3.2 Characterization of superreflexivity in terms of diamond graphs
9.4 Exercises
9.5 Notes and remarks
9.5.1 Another test-space characterization of superreflexivity: binary trees
9.5.2 Further results on test-spaces
9.5.3 Further results on the Ribe program
9.5.4 Non-local properties
9.6 Hints to exercises
10 Lipschitz free spaces
10.1 Introductory remarks
10.2 Lipschitz free spaces: definition and properties
10.3 The case where dX is a graph distance
10.4 Lipschitz free spaces of some finite metric spaces
10.5 Exercises
10.6 Notes and remarks
10.7 Hints to exercises
11 Open problems
11.1 Embeddability of expanders into Banach spaces
11.2 Obstacles for coarse embeddability of spaces with bounded geometry into a Hilbert space
11.2.1 The main problem
11.2.2 Comments
11.3 Embeddability of graphs with large girth
11.4 Coarse embeddability of a Hilbert space into Banach spaces
Bibliography
Author index
Subject index
Recommend Papers

Metric Embeddings: Bilipschitz and Coarse Embeddings into Banach Spaces
 9783110264012, 9783110263404

  • 0 0 0
  • Like this paper and download? You can publish your own PDF file online for free in a few minutes! Sign Up
File loading please wait...
Citation preview

De Gruyter Studies in Mathematics 49 Editors Carsten Carstensen, Berlin, Germany Nicola Fusco, Napoli, Italy Fritz Gesztesy, Columbia, USA Niels Jacob, Swansea, United Kingdom Karl-Hermann Neeb, Erlangen, Germany

Mikhail I. Ostrovskii

Metric Embeddings Bilipschitz and Coarse Embeddings into Banach Spaces

De Gruyter

Mathematical Subject Classification 2010: Primary: 46B85; Secondary: 05C12, 30L05, 46B20, 54E35. Author Prof. Dr. Mikhail I. Ostrovskii St. John’s University St. John’s College of Liberal Arts and Sciences Department of Mathematics and Computer Science 8000 Utopia Parkway Queens NY 11439 United States [email protected]

ISBN 978-3-11-026340-4 e-ISBN 978-3-11-026401-2 Set-ISBN 978-3-11-916622-5 ISSN 0179-0986 Library of Congress Cataloging-in-Publication Data A CIP catalog record for this book has been applied for at the Library of Congress. Bibliographic information published by the Deutsche Nationalbibliothek The Deutsche Nationalbibliothek lists this publication in the Deutsche Nationalbibliografie; detailed bibliographic data are available in the internet at http://dnb.dnb.de. © 2013 Walter de Gruyter GmbH, Berlin/Boston Typesetting: P TP-Berlin Protago-TEX-Production GmbH, www.ptp-berlin.de Printing and binding: Hubert & Co. GmbH & Co. KG, Göttingen Printed on acid-free paper Printed in Germany www.degruyter.com

Preface

Embeddings of discrete metric spaces (such as graphs with graph distances, finitely generated groups with word distances) into Banach spaces (the most important are the Hilbert space and the space L1 ) recently became an important tool in computer science and topology. The purpose of the book is to present some of the most important techniques and results, mostly on bilipschitz and coarse embeddings. The topics include: (1) Embeddability of locally finite metric spaces into Banach spaces is finitely determined; (2) Constructions of embeddings; (3) Distortion in terms of Poincaré inequalities; (4) Constructions of families of expanders and of families of graphs with unbounded girth and lower bounds on average degrees; (5) Banach spaces which do not admit coarse embeddings of expanders; (6) Structure of metric spaces which are not coarsely embeddable into a Hilbert space; (7) Applications of Markov chains to embeddability problems; (8) Metric characterizations of properties of Banach spaces; (9) Lipschitz free spaces. A substantial part of the book is devoted to a detailed presentation of relevant results of Banach space theory and graph theory. The final chapter contains a list of open problems. An extensive bibliography is also included. Each chapter, except the open problems chapter, contains exercises and a Notes and Remarks section containing references, discussion of related results, and suggestions for further reading. The book will help readers to enter into and to work in a very rapidly developing area having many important connections with different parts of mathematics and computer science. I also hope that the readers will become interested in the open problems listed in this book. I plan to maintain a web site containing information on progress on problems listed in Chapter 11 (at http://facpub.stjohns.edu/ostrovsm/). I would be very thankful for information on errors in this book and on progress on the open problems presented here, my e-mail: [email protected]. It should be mentioned that the area currently develops in numerous different directions and employs very diverse techniques. For this reason it is not possible to present even the most important results and techniques in a book of moderate size. I suggest readers consult [333], [334], and [345] for some of the directions which are underrepresented in this book, more references are given in the Notes and Remarks sections. The selection of topics for this book is strongly influenced by my interests and expertise. I am very grateful to all of my colleagues who helped me with useful comments, helpful discussions, and constructive criticism. These include Florent Baudier, Florin Catrina, Genady Grabarnik, Hamed Hatami, William B. Johnson, the late Nigel J. Kalton, Jerome Kaminker, Gilles Lancien, Nathan Linial, Calvin Mittman, Assaf Naor,

vi

Preface

Piotr W. Nowak, Mikhail M. Popov, Doron Puder, Matias Raja, Beata Randrianantoanina, David Rosenthal, Mark Sapir, Gideon Schechtman, and Romain Tessera. Also I would like to thank my home institution, St. John’s University, for supporting me during my work on this book by providing a research leave in Fall 2012. I was also partially supported by NSF Grant DMS-1201269. Queens, February 2013

Mikhail I. Ostrovskii

Contents

Preface

v

1

Introduction: examples of metrics, embeddings, and applications

1

1.1 Metric spaces: definitions and main examples . . . . . . . . . . . . . . . . . . .

1

1.2 Types of embeddings: isometric, bilipschitz, coarse, and uniform . . . . 6 1.2.1 Isometric embeddings . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 1.2.2 Bilipschitz embeddings . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9 1.2.3 Coarse and uniform embeddings . . . . . . . . . . . . . . . . . . . . . . . . 17 1.3 Probability theory terminology and notation . . . . . . . . . . . . . . . . . . . . . 22 1.4 Applications to the sparsest cut problem . . . . . . . . . . . . . . . . . . . . . . . . 25 1.5 Exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 26 1.6 Notes and remarks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1.6.1 To Section 1.1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1.6.2 To Section 1.2 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1.6.3 To Section 1.3 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1.6.4 To Section 1.4 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1.6.5 To exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

27 27 28 29 30 30

1.7 On applications in topology . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 30 1.8 Hints to exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 32 2

Embeddability of locally finite metric spaces into Banach spaces is finitely determined. Related Banach space theory

34

2.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 34 2.2 Banach space theory: ultrafilters, ultraproducts, finite representability . 2.2.1 Ultrafilters . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2.2.2 Ultraproducts . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2.2.3 Finite representability . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

35 35 37 40

2.3 Proofs of the main results on relations between embeddability of a locally finite metric space and its finite subsets . . . . . . . . . . . . . . . . . . . 2.3.1 Proof in the bilipschitz case . . . . . . . . . . . . . . . . . . . . . . . . . . . 2.3.2 Proof in the coarse case . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2.3.3 Remarks on extensions of finite determination results . . . . . . .

44 44 52 53

viii

Contents

2.4 Banach space theory: type and cotype of Banach spaces, Khinchin and Kahane inequalities . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2.4.1 Rademacher type and cotype . . . . . . . . . . . . . . . . . . . . . . . . . . 2.4.2 Kahane–Khinchin inequality . . . . . . . . . . . . . . . . . . . . . . . . . . . 2.4.3 Characterization of spaces with trivial type or cotype . . . . . . .

53 53 57 66

2.5 Some corollaries of the theorems on finite determination of embeddability of locally finite metric spaces . . . . . . . . . . . . . . . . . . . . 75 2.6 Exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 76 2.7 Notes and remarks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 77 2.8 Hints to exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 79 3

Constructions of embeddings

80

3.1 Padded decompositions and their applications to constructions of embeddings . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 80 3.2 Padded decompositions of minor-excluded graphs . . . . . . . . . . . . . . . . 84 3.3 Padded decompositions in terms of ball growth . . . . . . . . . . . . . . . . . . 90 3.4 Gluing single-scale embeddings . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 93 3.5 Exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 102 3.6 Notes and remarks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 102 3.7 Hints to exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 104 4

Obstacles for embeddability: Poincaré inequalities

105

4.1 Definition of Poincaré inequalities for metric spaces . . . . . . . . . . . . . . 105 4.2 Poincaré inequalities for expanders . . . . . . . . . . . . . . . . . . . . . . . . . . . . 107 4.3 Lp -distortion in terms of constants in Poincaré inequalities . . . . . . . . . 112 4.4 Euclidean distortion and positive semidefinite matrices . . . . . . . . . . . . 114 4.5 Fourier analytic method of getting Poincaré inequalities . . . . . . . . . . . 116 4.6 Exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 127 4.7 Notes and remarks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 128 4.8 A bit of history of coarse embeddability . . . . . . . . . . . . . . . . . . . . . . . . 129 4.9 Hints to exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 130 5

Families of expanders and of graphs with large girth

131

5.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 131 5.2 Spectral characterization of expanders . . . . . . . . . . . . . . . . . . . . . . . . . 132 5.3 Kazhdan’s property (T) and expanders . . . . . . . . . . . . . . . . . . . . . . . . . 137

Contents

5.4 Groups with property (T) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5.4.1 Finite generation of SLn.Z/ . . . . . . . . . . . . . . . . . . . . . . . . . . . 5.4.2 Finite quotients of SLn.Z/ . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5.4.3 Property (T) for groups SLn.Z/ . . . . . . . . . . . . . . . . . . . . . . . . 5.4.4 Criterion for property (T) . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

ix 142 143 144 145 145

5.5 Zigzag products . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 146 5.6 Graphs with large girth: basic definitions . . . . . . . . . . . . . . . . . . . . . . . 155 5.7 Graph lift constructions and `1 -embeddable graphs with large girth

156

5.8 Probabilistic proof of existence of expanders . . . . . . . . . . . . . . . . . . . . 164 5.9 Size and diameter of graphs with large girth: basic facts . . . . . . . . . . . 167 5.10 Random constructions of graphs with large girth . . . . . . . . . . . . . . . . . 169 5.11 Graphs with large girth using variational techniques . . . . . . . . . . . . . . 170 5.12 Inequalities for the spectral gap of graphs with large girth . . . . . . . . . . 174 5.13 Biggs’s construction of graphs with large girth . . . . . . . . . . . . . . . . . . . 175 5.14 Margulis’s 1982 construction of graphs with large girth . . . . . . . . . . . . 177 5.15 Families of expanders which are not coarsely embeddable one into another . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 178 5.16 Exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 181 5.17 Notes and remarks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5.17.1 Bounds for spectral gaps . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5.17.2 Graphs with very large spectral gaps . . . . . . . . . . . . . . . . . . . . 5.17.3 Some more results and constructions . . . . . . . . . . . . . . . . . . . .

183 187 187 188

5.18 Hints to exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 189 6

Banach spaces which do not admit uniformly coarse embeddings of expanders

191

6.1 Banach spaces whose balls admit uniform embeddings into L1 . . . . . . 192 6.2 Banach spaces not admitting coarse embeddings of expander families, using interpolation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 195 6.3 Banach space theory: a characterization of reflexivity . . . . . . . . . . . . . 200 6.4 Some classes of spaces whose balls are not uniformly embeddable into L1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 204 6.4.1 Stable metric spaces and iterated limits . . . . . . . . . . . . . . . . . . 204 6.4.2 Non-embeddability result . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 206 6.5 Examples of non-reflexive spaces with nontrivial type . . . . . . . . . . . . . 208 6.6 Exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 215

x

Contents

6.7 Notes and remarks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 215 6.8 Hints to exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 217 7

Structure properties of spaces which are not coarsely embeddable into a Hilbert space 218 7.1 Expander-like structures implying coarse non-embeddability into L1

218

7.2 On the structure of locally finite spaces which do not admit coarse embeddings into a Hilbert space . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 220 7.3 Expansion properties of metric spaces not admitting a coarse embedding into a Hilbert space . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 223 7.4 Exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 226 7.5 Notes and remarks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 226 7.6 Hints to exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 227 8

Applications of Markov chains to embeddability problems

228

8.1 Basic definitions and results on finite Markov chains . . . . . . . . . . . . . . 228 8.2 Markov type . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 230 8.3 First application of Markov type to embeddability problems: Euclidean distortion of graphs with large girth . . . . . . . . . . . . . . . . . . . 232 8.4 Banach space theory: renormings of superreflexive spaces, q-convexity and p-smoothness . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 233 8.4.1 Definitions and duality . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 233 8.4.2 Pisier theorem on renormings of uniformly convex spaces . . . 239 8.5 Markov type of uniformly smooth Banach spaces . . . . . . . . . . . . . . . . 253 8.6 Applications of Markov type to lower estimates of distortions of embeddings into uniformly smooth Banach spaces . . . . . . . . . . . . . . . . 259 8.7 Exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 261 8.8 Notes and remarks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 262 8.9 Hints to exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 264 9

Metric characterizations of classes of Banach spaces

265

9.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 265 9.2 Proof of the Ribe theorem through Bourgain’s discretization theorem 9.2.1 Proving Bourgain’s discretization theorem. Preliminary step: it suffices to consider spaces with differentiable norm . . . . . . . 9.2.2 First step: picking the system of coordinates . . . . . . . . . . . . . . 9.2.3 Second step: construction of a Lipschitz almost-extension . . . .

266 268 269 271

xi

Contents

9.2.4 9.2.5

Third step: further smoothing of the map using Poisson kernels 276 Poisson kernel estimates and proofs of Lemmas 9.14 and 9.15 283

9.3 Test-space characterizations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 287 9.3.1 More Banach space theory: superreflexivity . . . . . . . . . . . . . . . 289 9.3.2 Characterization of superreflexivity in terms of diamond graphs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 297 9.4 Exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 302 9.5 Notes and remarks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9.5.1 Another test-space characterization of superreflexivity: binary trees . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9.5.2 Further results on test-spaces . . . . . . . . . . . . . . . . . . . . . . . . . . 9.5.3 Further results on the Ribe program . . . . . . . . . . . . . . . . . . . . . 9.5.4 Non-local properties . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

303 305 306 307 307

9.6 Hints to exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 307 10 Lipschitz free spaces

308

10.1 Introductory remarks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 308 10.2 Lipschitz free spaces: definition and properties . . . . . . . . . . . . . . . . . . . 308 10.3 The case where dX is a graph distance . . . . . . . . . . . . . . . . . . . . . . . . . 312 10.4 Lipschitz free spaces of some finite metric spaces . . . . . . . . . . . . . . . . 317 10.5 Exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 326 10.6 Notes and remarks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 326 10.7 Hints to exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 327 11 Open problems

328

11.1 Embeddability of expanders into Banach spaces . . . . . . . . . . . . . . . . . . 328 11.2 Obstacles for coarse embeddability of spaces with bounded geometry into a Hilbert space . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 330 11.2.1 The main problem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 330 11.2.2 Comments . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 331 11.3 Embeddability of graphs with large girth . . . . . . . . . . . . . . . . . . . . . . . 332 11.4 Coarse embeddability of a Hilbert space into Banach spaces . . . . . . . . 333 Bibliography

335

Author index

361

Subject index

367

Chapter 1

Introduction: examples of metrics, embeddings, and applications 1.1 Metric spaces: definitions and main examples The notions of being “close” or “far” are used in many different contexts and are understood in many different ways. In this book we consider one of the most standard ways for mathematics to look at these notions, the notion of a metric space. Definition 1.1. A metric space is a set X endowed with a function d : X  X ! RC (where RC is the set of all nonnegative real numbers) satisfying the conditions: (a)

Triangle inequality: 8x, y, z 2 X

(b)

Symmetry: 8x, y 2 X

(c)

Separation axiom: 8x, y 2 X

(d)

8x 2 X

d.x, z/  d.x, y/ C d.y, z/.

d.x, y/ D d.y, x/. x ¤ y ) d.x, y/ ¤ 0.

d.x, x/ D 0.

The function d is called a metric on X. Example 1.2 (Normed spaces). Any normed and any Banach space .X, k  k/ is a metric space with respect to the metric d.x, y/ D kx  yk. Usually we shall consider real Banach spaces (that is, the case where the field of scalars is R). In some results and proofs it will be natural to consider complex Banach spaces (the case where the field of scalars is C). Example 1.3 (Banach spaces `p , `pn , 1  p  1, n 2 N). Their definitions (in all of these spaces addition and multiplication by scalars are defined componentwise): 1=p ³ ² X n `pn D ¹xi ºniD1 : xi 2 R, k¹xi ºniD1k D , 1  p < 1. jxi jp i D1

`n1

² ³ n n D ¹xi ºi D1 : xi 2 R, k¹xi ºi D1k D max jxi j . 1i n

1=p ² X ³ 1 1 1 p jxi j 0 for all x 2 X except x D 0. (b) h0, 0i D 0. (c) 8˛1, ˛2 2 K 8x1, x2 , x 2 X h˛1 x1 C ˛2 x2, xi D ˛1 hx1 , xi C ˛2hx2 , xi. (d) 8x, y 2 X hx, yi D hy, xi, where the bar denotes the complex conjugation (the bar can be omitted if K D R).

Section 1.1 Metric spaces: definitions and main examples

3

We say that a norm k  k on a linear space X is induced by an inner product if 8x 2 X kxk2 D hx, xi for some inner product h, i on X. Among the Banach spaces mentioned above the following spaces are Hilbert spaces: `2 , `n2 and L2 ., †, /. The corresponding inner products are: 1 X i D1

xi yi ,

n X i D1

Z xi yi ,

x.t /y.t /d.t /. 

Example 1.8 (Graphs with graph distances). Let G D .V .G/, E.G// be a graph, so V .G/ is a set of objects called vertices and E.G/ is some set of unordered pairs of vertices called edges. We denote an unordered pair consisting of vertices u and v by uv or by ¹u, vº and say that u and v are ends or end vertices of uv. A walk in G is a finite sequence of the form W D v0 , e1 , v1 , e2, : : : , ek , vk whose terms are alternately vertices and edges such that, for 1  i  k, the edge ei has ends vi 1 and vi . We say that W starts at v0 and ends at vk , and that W is a v0vk -walk. The number k is called the length of the walk. A graph G is called connected if for each u, v 2 V .G/ there is a uv-walk in G. If G is connected, we endow V .G/ with the metric dG .u, v/ D the length of the shortest uv-walk in G. The metric dG is called the graph distance or the shortest path metric. When we say “graph G with its graph distance (shortest path metric)” we mean the metric space .V .G/, dG /. Example 1.9 (Groups with word metrics). Let G be a group and S be a finite subset of G. We say that G is generated by the set S, or that S is a generating set of G, if each g 2 G can be written as a finite product of elements from S (elements from S can be repeated in this product arbitrarily many times). A group is called finitely generated if it has a finite generating set. Now let G be a finitely generated group and S be a generating set in G. We assume that S does not contain the identity and is symmetric, that is, contains g if and only if it contains g 1 . The Cayley graph of G corresponding to the generating set S is the graph whose vertices are elements of G, elements g1 2 G and g2 2 G are connected by an edge if and only if g11g2 2 S. The graph distance of this graph is called the word metric because the distance between group elements g and h is the shortest representation of g 1 h in terms of elements of S, such representations are called words in the alphabet S. Definition 1.10. A semimetric is like a metric except that the separation axiom is not required. Example 1.11 (Cut semimetrics (sometimes called elementary cut metrics)). Let S N is called a cut in A be a subset of a set A, SN be the complement of S. The pair .S, S/ and S, SN are called parts of this cut. The cut semimetric on A corresponding to the cut

4

Chapter 1 Introduction: examples of metrics, embeddings, and applications

N is defined by .S, S/ ´ 0 dS .u, v/ D 1

if u and v are in the same part if u and v are in different parts

Example 1.12 (Weighted graphs). A weighted graph is a graph G D .V , E/ in which each edge e has been assigned a nonnegative real number w.e/. This number is called the weightPof e. The weight of a walk W D v0, e1 , v1 , e2, : : : , ek , vk is defined as w.W / D jkD1 w.ej /. We endow the vertex set V with the (semi)metric dG,w .u, v/ D inf¹w.W / : W is a uv-walk in Gº. This metric is called a weighted graph distance or weighted shortest path semimetric (semi is omitted if we know that we get a metric). Sometimes we use the term unweighted graph to emphasize that we consider a graph with no weights, or, what is the same, with all weights equal to 1. Observe that for infinite graphs the weighed graph distance can fail the separation axiom even in the case where all weights are strictly positive. Remark 1.13. Each finite semimetric space .X, dX / can be obtained as a weighted graph with its weighted graph distance. In fact, consider the complete graph with the vertex set X. (Recall that a graph is called complete if each pair of vertices in it is joined by one edge and there are no loops, that is, edges joining a vertex to itself.) We introduce weights of edges by w.xy/ D dX .x, y/. It is clear that the weighted graph distance of the obtained weighted graph coincides with dX . Example 1.14 (Semimetrics in combinatorial optimization). Semimetrics arise naturally in construction of approximation algorithms in computer science. We are going to present one of the most important and well-known examples of this type. The sparsest cut problem is the following problem of combinatorial optimization. We are given a connected graph G D .V , E/, with a positive weight (called a capacity) c.e/ associated to each edge e 2 E, and a nonnegative number (called a demand) D.u, v/ associated to each (unordered) pair of vertices u, v 2 V . By a cut of G we N mean a partition of the vertex set V into two disjoint sets: S and its complement S. N The sparsity of the cut .S, S/ is defined as P c.uv/ N u2S,v2S,uv2E P , (1.1) u2S,v2SN D.u, v/ that is, the sparsity is the ratio between the capacities and the demands which “cross” the cut. The P sparsest cut problem is: find a cut of minimum sparsity among cuts with nonzero u2S,v2SN D.u, v/. This problem is known to be computationally hard (see

Section 1.1 Metric spaces: definitions and main examples

5

the “Notes and Remarks” section), for this reason the following version of the sparsest cut problem is also of interest: to approximate the minimum sparsity. One of the approaches to this approximate version of the sparsest cut problem starts with writing the quantity (1.1) in terms of a cut semimetric dS : P c.uv/dS .u, v/ P uv2E . D.u, v/dS .u, v/ u,v2V

(1.2)

(The quantities in (1.1) and (1.2) are not equal, their quotient is equal to 2, but this does not change the problem.) Then, as is easy to see, the minimum decreases if instead of the minimum over cut semimetrics d on V (by a nonmetrics dS we consider the minimum over all nontrivial P trivial semimetric here we mean a semimetric for which u,v2V D.u, v/d.u, v/ ¤ 0). The point is that the problem of minimization of P c.uv/d.u, v/ P uv2E . u,v2V D.u, v/d.u, v/

(1.3)

over the set of all nontrivial semimetrics d on V belongs to the class of so-called linear programming (LP) problems for which reasonably fast algorithms are known (see Notes and Remarks section for references). InPmore detail, by homogeneity, we may restrict our attention to the metrics satisfyingP u,v2V D.u, v/d.u, v/ D 1. Then we can write the problem as: Minimize the sum uv2E c.uv/d.u, v/ over the set of all collections ¹d.u, v/ºu,v2V satisfying the conditions X

D.u, v/d.u, v/ D 1

u,v2V

8u, v, w 2 V 8u, v 2 V 8u, v 2 V 8u 2 V

d.u, w/  d.u, v/ C d.v, w/ d.u, v/ D d.v, u/ d.u, v/  0 d.u, u/ D 0.

(1.4)

We get a problem of minimization of a linear form subject to finitely many linear inequalities, such a problem is called a linear programming problem. Of course, it is not immediately clear how to estimate the sparsest cut from above in terms of the minimum computed for this LP problem (and the corresponding semimetric dmin). Later we shall see (Section 1.4 in this chapter) that this can be done in terms of the possible quality of embeddings of the semimetric space .V , dmin / into the Banach space `1 .

6

Chapter 1 Introduction: examples of metrics, embeddings, and applications

1.2 Types of embeddings: isometric, bilipschitz, coarse, and uniform By an embedding of a set X into Y we mean any (not necessarily injective or surjective) map of X into Y . We are interested in the case where X and Y are metric (or at least semimetric) spaces and we are interested in embeddings which do not “distort too much” the metric structure. We start with embeddings which preserve distances.

1.2.1 Isometric embeddings Definition 1.15. A map f : X ! Y between two metric spaces is called an isometric embedding if it preserves distances, that is dY .f .u/, f .v// D dX .u, v/ for all u, v 2 X. If there exists an isometric embedding of X into Y we say that X is isometric to a subset (subspace) of Y . If an isometric embedding of X into Y is a bijection of X and Y , we say that X and Y are isometric. Example 1.16. The space `pn is isometric to a linear subspace of `pm if n < m, and to a linear subspace of `p . In fact, the map which extends a sequence ¹xi ºniD1 2 `pn with the corresponding amount of zeros is an isometric embedding of `pn onto a linear subspace. (In the described situations there are many other isometric embeddings.) The theory of isometric embeddings is a very rich theory which was developed from several different perspectives (we provide some references in the Notes and Remarks section). In many cases, in particular, in many important applications of embeddings there is no hope for existence of isometric embeddings (see e.g. Example 1.32 below), for this reason many weaker types of embeddings have been introduced and studied. We are going to consider three of them: bilipschitz, coarse, and uniform. Our main interest in this book is the theory of these wider classes of embeddings. We do not intend to present much of the theory of isometric embeddings. The purpose of this section is to present some simple but useful facts about isometric embeddings (much more general results are known and can be found in references provided in the Notes and Remarks section). Proposition 1.17 (Fréchet embedding). Each countable metric space embeds isometrically into `1 . Each metric space with n elements embeds isometrically into `n1 . Proof. Let X D ¹ui º1 i D0 be a countable metric space. We introduce a map f : X ! `1 by f .v/ D ¹d.v, ui /  d.ui , u0 /º1 i D1 . Observe that kf .v/  f .w/k D sup jd.v, ui /  d.w, ui /j. i 2N

Section 1.2 Isometric, bilipschitz, coarse, and uniform embeddings

7

The triangle inequality implies sup jd.v, ui /  d.w, ui /j  d.v, w/. i 2N

On the other hand, if v ¤ w, then at least one of v, w is among ¹ui º1 i D1 . Suppose that . We get v 2 ¹ui º1 i D1 sup jd.v, ui /  d.w, ui /j  jd.v, v/  d.w, v/j D d.v, w/. i 2N

This proves the first statement. The second statement can be proved similarly. We can make its proof simpler if we observe that for a bounded metric space X D ¹ui º (that is, for a space X for which supu,v2X d.u, v/ is finite) the definition of f .v/ can be simplified to f .v/ D ¹d.v, ui /ºi . In particular, if X D ¹ui ºniD1, then f .v/ D ¹d.v, ui /ºniD1 defines an isometric embedding into `n1 . Remark 1.18. One can isometrically embed any n-element metric space into `n1 1 (see Exercise 1.56). For large n further lowering of the dimension is possible, see the Notes and Remarks section and Exercise 1.60. Proposition 1.19. `p .1  p < 1/ is isometric to a subspace of Lp .0, 1/. Proof. Consider a sequence ¹i º1 i D1 of disjoint measurable subsets of Œ0, 1 with nonzero Lebesgue measures. Let m.i / denote the Lebesgue measure of i . Straightforward computation of the norm shows that the embedding T : `p ! Lp .0, 1/ defined by 8 xi < if t 2 i 1 .T .¹xi ºi D1//.t / D .m.i //1=p :0 if t … [1 i D1 i is an isometric embedding of `p into Lp .0, 1/. We are going to use also the following result (which we do not prove, see the Notes and Remarks section for references). Fact 1.20. Any separable subspace of Lp ., †, / .1  p < 1/ is isometric to a subspace of Lp .0, 1/. This fact shows that the space Lp .0, 1/ is the “largest” Lp ., †, / which we need to consider if we restrict our attention to separable spaces. We denote Lp .0, 1/ simply by Lp . Proposition 1.21. If a finite metric space is isometric to a subset of `2 , then it is isometric to a subset of `1 .

8

Chapter 1 Introduction: examples of metrics, embeddings, and applications

Proof. The result follows from Lemmas 1.22 and 1.24 stated below. If M is an nelement metric space isometric to a subset of `2 , then its image is contained in an ndimensional subspace of `2 . As is well known, an n-dimensional subspace of a Hilbert space is isometric to `n2 . By Lemma 1.22, M is isometric to a subset of L1.S n1 , /, where S n1 is the unit sphere in `n2 and  is a Lebesgue measure on S n1 (these notions are well-defined because `n2 can be identified with the n-dimensional Euclidean space Rn ). By Lemma 1.24, a finite subset of L1 .S n1 , / is isometric to a subset of `1 . Lemma 1.22. The space `n2 is isometric to a subspace of L1 .S n1 , /. Proof. Since  is rotationally invariant, the value of the integral Z jhe, ij d./ S n1

n1

is the same for all e 2 S . Let ˛ be the common value of all such integrals. Define T : `n2 ! L1 .S n1 , / by hx, i , x 2 `n2 ,  2 S n1 . ˛ Linearity of T is obvious. We check that T is distance-preserving by writing Eˇ ˇD ˇ x , ˇ ˇ ˇ Z Z ˇ kxk`n ˇ ˇ hx, i ˇ ˇ ˇ 2 ˇd./ D ˇ kT xkL1 D ˇ ˇ kxk`n2 d./ D kxk`n2 . ˇ ˇ ˇ ˇ n1 n1 ˛ ˛ S S ˇ ˇ .T x/./ D

Observation 1.23. Similar argument shows that `n2 embeds isometrically into the space Lp .S n1 , / .1  p < 1/. Lemma 1.24. Each finite subset of L1 ., †, / is isometric to a subset of `1 . Proof. Let M D ¹m1, : : : , mn º be a finite subset of L1 ., †, /. Consider all possible permutations ˛ .˛ D 1, : : : , nŠ/ of the set ¹1, : : : , nº. We are going to construct an isometric embedding of M into `nŠ 1 . Define the measurable sets (not necessarily disjoint, and possibly empty) ˛ D ¹x 2  : m˛ .1/ .x/  m˛ .2/ .x/      m˛ .n/ .x/º. Note that [nŠ ˛D1 ˛ D . Let

  ƒ1 D 1 , ƒ2 D 2 n 1 , ƒ˛ D ˛ n [i˛1 D1 i , 3  ˛  nŠ.

nŠ We have [nŠ ˛D1 ƒ˛ D , and ƒ˛ \ ƒˇ D ; for ˛ ¤ ˇ. Let T : M ! `1 be given by ²Z ³nŠ mi d . T .mi / D ƒ˛

˛D1

Section 1.2 Isometric, bilipschitz, coarse, and uniform embeddings

9

The following computation shows that T is an isometry: Z kmi  mj kL1.,†,/ D



nŠ Z X ˇ ˇ ˇmi  mj ˇ d D

nŠ ˇZ X ˇ ˇ D ˇ ˛D1

˛D1 ƒ˛

ƒ˛

ˇ ˇ ˇmi  mj ˇ d

ˇ  ˇ  .mi  mj / dˇˇ D T mi  T mj `nŠ . 1

The first equality in the second line is justified by the fact that mi mj does not change sign on ƒ˛ . Proposition 1.25. There exist 4-element metric spaces which are isometric to subsets in `1 , but cannot be embedded isometrically into `2 . Proof. The complete bipartite graph Km,n is a graph with m C n vertices in which the vertex set consists of two classes: one class containing m vertices and the other class containing n vertices. Two vertices in Km,n are joined by an edge if and only if they belong to different classes. We consider the graph K1,3 with its graph distance. Let w be the vertex in the oneelement class, and v1 , v2 , v3 be vertices in the 3-element class. Embedding into `1 . We denote by ¹ei º1 i D1 the unit vectors in `1 , that is, ei is the vector whose i-th coordinate is 1 and all other coordinates are 0. (We are going to use the same definition of unit vectors in the spaces `p and `pn for all values of p 2 Œ1, 1, as well as for c0 .) With this notation the map vi 7! ei ; w 7! 0 is an isometric embedding of K1,3 into `1 . On the other hand, the space `2 has the following property: if a, b, c 2 `2 are such that ka  bk C kb  ck D ka  ck, then a, b, c lie on the same line and b is between a and c. Now, assume that there is an isometric embedding of K1,3 into `2 . By the previous observation the image of w should be the midpoint of three line segments: the line segment joining the image of vi and vj for i ¤ j , i, j 2 ¹1, 2, 3º. This is clearly impossible.

1.2.2 Bilipschitz embeddings Definition 1.26. Let C < 1. A map f : .X, dX / ! .Y , dY / between two metric spaces is called C -Lipschitz if 8u, v 2 X

dY .f .u/, f .v//  C dX .u, v/.

A map f is called Lipschitz if it is C -Lipschitz for some C < 1. For a Lipschitz map f we define its Lipschitz constant by Lip.f / :D

dY .f .u/, f .v// . dX .u, v/ dX .u,v/¤0 sup

10

Chapter 1 Introduction: examples of metrics, embeddings, and applications

A map f : X ! Y is called a C -bilipschitz embedding if there exists r > 0 such that (1.5) 8u, v 2 X rdX .u, v/  dY .f .u/, f .v//  r C dX .u, v/. A bilipschitz embedding is an embedding which is C -bilipschitz for some C < 1. The smallest constant C for which there exist r > 0 such that (1.5) is satisfied is called the distortion of f . (It is easy to see that such smallest constant exists.) Remark 1.27. For metric spaces bilipschitz embeddings can be defined as injective maps f for which both f and f 1 , defined as a function on f .X/, are Lipschitz maps. In such a case the distortion of f is Lipf  Lip.f 1 /. It is clear that each finite metric space X admits a bilipschitz embedding into each at least 1-dimensional normed space and into each metric space Y which has more elements than X. In the study of families of finite metric spaces the following definition is important. Definition 1.28. A family ¹f˛ º of embeddings is called uniformly bilipschitz if they have uniformly bounded distortions. Remark 1.29. If we consider linear mappings of normed spaces, one into another, a different terminology is used: Definition 1.30. A linear bilipschitz embedding T : X ! Y of one normed space into another is called an isomorphic embedding. This definition can be restated as: T is an isomorphic embedding if T is injective, kT k < 1 and kT 1 k < 1, where T 1 is considered as a linear map T 1 : T .X/ ! X. The notion of a bilipschitz embedding is important because the notion of an isometric embedding is too restrictive for many applications. We illustrate this statement with the following three examples. Definition 1.31 (Some graph theory definitions). A complete graph with n vertices in which any two distinct vertices are joined by exactly one edge is denoted Kn . A path with n vertices is a graph whose vertices form a sequence ¹vi ºniD1 and edges are determined by the following: vk , k D 2, : : : , n  1 is joined by exactly one edge with vk1 and vkC1 . The vertex v1 is joined with v2 only and the vertex vn is joined with vn1 only. The path with n vertices is denoted Pn . It is somewhat confusing, but it is customary to use the term path also for a walk in which no vertex is repeated. (See page 3 for definition of a walk.) If we add to Pn an edge joining v1 and vn , we get a graph called a cycle of length n and denoted by Cn. A graph is called simple if any two vertices in it are joined by at most one edge and there are no loops. The degree of a vertex is the number of edges incident to it. Vertices joined by an edge are called neighbors.

Section 1.2 Isometric, bilipschitz, coarse, and uniform embeddings

11

Example 1.32. A finite simple connected graph G admits an isometric embedding into `2 if and only if it is either a complete graph Kn or a path Pn for some n. be Proof. It is easy to find isometric embeddings of Kn and Pn into `2 . Let ¹ek º1 kD1 ek the unit vector basis in `2 . For Kn we map vk 7! p . For Pn we map vk 7! ke1. It is 2 easy to see that both maps are isometric embeddings. To prove the “only if” part of the statement we assume that G is a finite simple connected graph, which is not a path, but is such that .V .G/, dG / is isometric to a subset of `2 , and denote the isometric embedding by f . Our goal is to show that these conditions imply that G is a complete graph. The fact that G is not a path immediately implies that G is either a cycle or has a vertex of degree at least 3. In the case where G is a cycle we observe that the cycle C3 is simultaneously a complete graph K3, and we are done in this case. As for longer cycles, we prove that they do not admit isometric embeddings into `2 in the following way. Since vertices vk1 , vk , vkC1 in a cycle with at least 4 vertices satisfy dG .vi 1 , vi C1/ D dG .vi 1 , vi / C dG .vi , vi C1 /, we get that f .vk1 /, f .vk /, and f .vkC1/ should be on the same line, with f .vk / being a midpoint of the line segment joining f .vk1 / and f .vkC1 /. Since this observation is applicable also to vn , v1 , and v2 , we get a contradiction. Now let v 2 V .G/ be a vertex of degree  3, and let u1 , u2 , u3 be its neighbors. We show that ui are pairwise adjacent. If two pairs of them (say u1 , u2 and u2 , u3 ) are not adjacent, we get a contradiction because f .v/ should be simultaneously a midpoint of the line segment joining f .u1 / and f .u2 / and a midpoint of the line segment joining f .u2 / and f .u3 /. If only one edge, say u1 u3 , is missing then both f .u2 / and f .v/ should be midpoints of the line segment joining f .u1 / and f .u3 /. Therefore v and all of its neighbors should form a complete subgraph in G. Since the same should hold for each of the neighbors of v, we get that G should be a complete graph. Example 1.33. There are no isometric embeddings of the unit circle S in the Euclidean plane, with the metric inherited from the plane, into the Banach space c0 . On the other hand, for each 1 < C < 1 there is n 2 N such that S admits a C -bilipschitz embedding into `n1 . Proof. Assume that there is an isometric embedding f : S ! c0 . Since the norm in c0 is max jxi j, the definition S1 of an isometric embedding implies that the set S  S is covered by the union i D1 Ti , where Ti is the set of pairs which are ‘taken care of’ by the i-th coordinate in the following sense. Denote by ei : c0 ! R the functional whose value is the i-th coordinate (or i-th term of the sequence). We say that a pair .u, v/ 2 S  S is taken care of by ei if jei .f .u//  ei.f .v//j D dS .u, v/. By continuity, each set Ti is closed in the standard product topology on S  S.

12

Chapter 1 Introduction: examples of metrics, embeddings, and applications

Out next purpose is to show that Ti does not contain interior points. Assume the contrary. Then there is a pair .u, v/ 2 S such that for some neighborhoods Nu and Nv and any x 2 Nu , y 2 Nv we have dS .x, y/ D jei .f .x//  ei.f .y//j. This assumption can be translated into geometric language in the following way: we can find a map P of Nu and Nv onto the line L passing through u and v, such that P .u/ D u, P .v/ D v and dL .P .x/, P .y// D dS .x, y/ for all x 2 Nu and y 2 Nv . Here dL is the standard Euclidean distance on the line L. Consider x0 2 Nu and y0 2 Nv on the shorter circular arc of S joining u and v (any arc if the arcs are of the same length) such that dS .u, x0 / D dS .v, y0 /. Since dL.P .x0 /, v/ D dS .x0 , v/, the point P .x0 / is closer to u than the image PL .x0 / of the orthogonal projection onto L. For the same reason P .y0 / is closer to v than PL .y0 /. But then dL.P .x0 /, P .y0 // > dL .PL .x0 /, PL .y0 // D dS .x0, y0 / (the last equality follows from the assumption dS .u, x0 / D dS .v, y0 /), and we get a contradiction. Hence Ti does not contain interior points. Therefore our assumption that f is an isometric embedding contradicts the Baire category theorem (its statement: A complete metric space cannot be covered by a sequence of closed subsets which do not have interior points.) Hence S does not admit isometric embeddings into c0 . As for C -bilipschitz embeddings into `n1 , we can find them not only for S, but for the whole space `22 , (the construction below is a standard Banach-space-theoretical construction which can be easily extended to the case where `22 is replaced with an arbitrary finite-dimensional normed space). We find a finite "-net ¹vi ºniD1 on S (if needed, see the definition of an "-net on page 34). Let ¹viºniD1 be linear functionals on `22 satisfying kvik D vi .vi / D 1, i D 1, : : : , n. We introduce an embedding f : `22 ! `n1 by f .x/ D ¹vi.x/ºniD1. The condition kvik D 1 implies that f is 1-Lipschitz. To estimate the Lipschitz constant of the inverse map consider any x, y 2 S. There  xy   exists i such that kxyk  vi  ". Therefore jvi .x  y/j  .1  "/kx  yk and kf .x/  f .y/k  .1  "/kx  yk. Example 1.34. There is no isometric embedding of a metric space consisting of n equidistant points into `m 2 with m < n  1. Proof. We may restrict our attention to the case where distances are equal to 1 and to maps which map one of the equidistant points to 0 2 `m 2 . With these assumptions it suffices to prove that any vectors ¹xi ºkiD1 in `2 satisfying kxi k D 1 and kxi  xj k D 1 for i ¤ j are linearly independent. This can be done by observing that under these assumptions we have hxi , xj i D 12 if i ¤ j , and therefore the matrix .hxi , xj i/ki,j D1 (this matrix is called the Gram matrix

13

Section 1.2 Isometric, bilipschitz, coarse, and uniform embeddings

of vectors x1, : : : , xk ) is of the form 0 1 B1 B2 B B1 B2 B. @ .. 1 2

1 2

1 1 2

.. .

1 2

1 2 1 2

 

11 2 1C 2C C 1C . 2C

1  .. . . .. C . .A . 1  1 2

It is easy to verify that this matrix is invertible. Therefore any equality of the form k X

ai xi D 0

i D1

implies a1 D a2 D    D ak D 0. (Instead of proving the last statement one can also apply the well-known fact that a Gram matrix of a set of vectors is invertible if and only if the vectors are linearly independent.) On the other hand, our next result (Theorem 1.35) shows that we can find bilipschitz embeddings of the metric space of Example 1.34 (observe that this space can be identified with the complete graph Kn with its graph distance) into `k2 with k much smaller than n and distortion very close to 1. The value of Theorem 1.35 goes far beyond proving this. The Johnson–Lindenstrauss lemma is one of the most frequently used in applications results of the theory of embeddings. Theorem 1.35 (Johnson–Lindenstrauss lemma). For each " 2 .0, 1/ there is a constant K D K."/, 0 < K < 1, such that for each n 2 N .n  2/, each n-element subset V D ¹v1, : : : , vn º  `n2 , and each k  K ln n there is a mapping f : `n2 ! `k2 satisfying .1  "/kvi  vj k  kf .vi /  f .vj /k  .1 C "/kvi  vj k

(1.6)

for all i, j D 1, 2, : : : , n. Proof. We are going to proceed according to the following scheme: For given n, ", and an appropriate k we find a probability space ., †, P/ (see Section 1.3 for probability theory terminology and notation which we are going to use) and a map ! 7! T! from  to the set of all linear operators `n2 ! `k2 satisfying for each x 2 `n2 the following condition P ¹! : .1  "/kxk  kT! .x/k  .1 C "/kxkº  1 

1 . n2

(1.7)

  If we find such a map, we can finish the proof in the following way. There are n2 different pairs of distinct points in V . By (1.7), the probability that a random linear map

14

Chapter 1 Introduction: examples of metrics, embeddings, and applications

does not satisfy the condition (1.6) for a given pair .vi , vj / is at most n12 . Therefore the probability that a random linear map does not satisfy condition (1.6) for at least one n 1 of the pairs .vi , vj / does not exceed 2  n2 < 12 . Therefore there exist linear maps satisfying the condition (1.6) for all pairs .vi , vj /. We construct T! using real-valued random variables of the following type. Let X be a symmetric random variable satisfying Var .X/ D 1 and Ee tX  e ct

2

(1.8)

for some 0 < c < 1 and all t  0. Recall that Var .X/ D E.X  EX/2 . In our case EX D 0 since X is assumed to be symmetric, so the condition Var .X/ D 1 can be written as E.X 2 / D 1. As an example of such a variable one can consider a Bernoulli random variable which has values ˙1 with the probability 12 each. In fact, in this case Var .X/ D 1 and Ee tX D 12 .e t Ce t /, and an easy analysis with power series implies that 12 .e t Ce t /  2 et . Let ¹Xij ºjnD1,ik D1 be independent random variables having the same distribution as X. We introduce T! as the operator `n2 7! `k2 whose matrix A.!/ in the unit vector bases of `n2 and `k2 has entries aij .!/ D p1 Xij .!/. k To get (1.7) we need to show that for fixed x 2 `n2 , kxk D 1, and a suitably chosen k 2 N we have 1 , 2n2 (1.9) 1 P ¹! : kT! .x/k  1  "º  2 . 2n Pn The vector T! .x/ has components p1 j D1 Xij xj . Since Xij are independent rank dom variables with the same distribution, all of the components have the same distribution. To simplify the notation we introduce ¹Xj ºjnD1 as independent random variables with the samePdistribution as X. All of the components of T! .x/ have the same n distribution as p1 j D1 xj Xj . P ¹! :

kT! .x/k  1 C "º 

k

Let x 2 `n2 be such that kxk D 1. Our next purpose is to compute E! kT! .x/k2. We have 2  X k  n 1 X 2 EkT! .x/k D E . p Xij xj k j D1 i D1 We have

X 2  n n X n 1 1 X Xij xj D E Xij Xi m xj xm . E p k k j D1 j D1 mD1 

15

Section 1.2 Isometric, bilipschitz, coarse, and uniform embeddings

Symmetry and independence of ¹Xij º imply that E.Xij Xi m / D 0 if j ¤ m. The condition E.X 2/ D 1 implies E.Xij Xij / D 1. Therefore 2  n n 1 X 2 1 1 X Xij xj D xj D E p k k k j D1 j D1 and EkT! .x/k2 D 1. The purpose of the rest of the proof is to show that the random variable kT! P .x/k2 is concentrated near its expectation. Let Ui D jnD1 Xij xj . We need to estimate from above P1 D P¹! : kT! .x/k  1 C "º and P2 D P¹! : kT! .x/k  1  "º. We have ³ ² k 1X 2 2 Ui .!/  .1 C "/ P1 D P ! : k i D1

³ ² k X 2 Ui .!/  k.1 C "/ . P !: i D1





1 , where c is the constant in (1.8). We have We consider s 2 0, 8c

Ee s

Pk

2 iD1 Ui

 P1  e sk.1C"/.

Therefore P1  e sk.1C"/Ee s

Pk

2 iD1 Ui

.by independence/

D

e sk.1C"/

k Y

2

Ee sUi .

(1.10)

i D1

Pn

We introduce a random variable U D j D1 xj Xj . It is clear that each of the random variables Ui has the same distribution as U . Our next step is to show  that there 1 1 is a constant 0 < D < 1 depending on c only such that for s 2  8c , 8c we have 2

2

Ee sU  e sCDs .

(1.11)

As a preliminary step we show that for a (standard) normal random variable g.ˇ/ (also known as a Gaussian random variable) and a real number u we have Eg e ug D 2 e u =2 . (Here we did not specify the probability space, writing Eg we mean the expectation with respect to the probability space on which g is a normal random variable.) In fact,

Z 1 Z 1 2 2 2 2 1 1  t2 ut C u2 C u2 ug ut  t2 e e dt D p e dt Eg e D p 2 1 2 1 Z 1 .tu/2 u2 u2 1 Dp e2 e  2 dt D e 2 . 2 1

16

Chapter 1 Introduction: examples of metrics, embeddings, and applications

Therefore the random variable U (which we assume to be defined on a probability space different from the one on which g is defined) satisfies for each real number t  0 (below E denotes the expectation with respect to the probability measure on which U is defined and Eg denotes the expectation with respect to the probability measure on which g is defined) Ee

tU 2

D EEg e

p

2t gU

D Eg Ee

p Pn 2t g j D1 xj Xj .by independence/

D

Eg

n Y

Ee

p 2t gxj Xj

j D1 (1.8)

 Eg

n Y

e c2tg

2x 2 j

.since kxkD1/

D

2

Eg e 2t cg .

j D1

p 2 1 Now we observe that Eg e 2t cg  2 if t  8c . In fact, in such a case Z 1 1 2 2 2 2 e t =4 e t =2 dt Eg e 2t cg  Ee g =4 D p 2 1 Z 1 Z 1 p p p 1 2 .t D 2 s/ 1 t 2 =4 e dt D p e 2s =4 2 ds D 2. Dp 2 1 2 1 Now we continue our proof of (1.11). Since E.U 2 / D 1, using the power series representation of e x we get  1 m 2m X X s m U 2m U sU 2 m 8c Ee D1CsCE D1CsCE .8cs/ mŠ mŠ m2 m2  1 m 2m X U 2 m2 8c D 1 C s C .8cs/ E  1 C s C Ds 2 .8cs/ mŠ m2 p where D D .8c/2 2. To get the last inequality we use j8csj  1 and   X 1 m U 2m p 1 2 8c E  Ee 8c U  2, mŠ m2

the latter inequality has been proved above. We use the well-known inequality 1 C x  e x (which follows immediately from the convexity of e x ) and get 1 1 sU 2 sCDs 2 Ee e for s 2  , . 8c 8c Now we can continue the estimate started in (1.10). We have 2

2

P1  e sk.1C"/e k.sCDs / D e k.s"Ds / . " , and is equal We maximize s"  Ds 2 over s. The maximum is attained when s D 2D "2 1 " is suitable to 4D . We recall that we consider 0  s  8c only, so the choice s D 2D

17

Section 1.2 Isometric, bilipschitz, coarse, and uniform embeddings

" 1 only if 2D  8c , but one can see that this is not a real obstacle because we may assume that " < 1 and 2D  8c. Thus we have

P1  e k"

2 =4D

.

We need to find the condition under which the right-hand side is  if k  "2 =4D  2 ln n  ln 2 or

1 . 2n2

This happens

4D.2 ln n C ln 2/ . "2 Obviously the right-hand side of the last inequality is  K."/ ln n for a suitably chosen K."/ and n  2. To estimate ² ³ k X 2 P2 D P ¹! : kT! .x/k  1  "º  P ! : Ui .!/  k.1  "/ k

i D1

we consider s 2

1 , 0 Œ 8c

and use a similar argument. We have Ee s

Pk

2 iD1 Ui

 P2 e sk.1"/.

Using the same computation as above we get 2

2

P2  e sk.1"/ e k.sCDs / D e k.s"Ds / . " and is We maximize s"  Ds 2 over s. The maximum is attained when s D  2D 2 " equal to 4D . The rest of the argument is the same as for P1.

1.2.3 Coarse and uniform embeddings Definition 1.36. Let f : .X, dX / ! .Y , dY / be a map between two (semi)metric spaces. We introduce two real-valued functions of a real variable related to such a map: !f .t / D sup¹dY .f .u/, f .v// : u, v 2 X, dX .u, v/  t º,

t  0,

'f .t / D inf¹dY .f .u/, f .v// : u, v 2 X, dX .u, v/  t º,

t  0,

and so that 8u, v 2 X

'f .dX .u, v//  dY .f .u/, f .v//  !f .dX .u, v//.

(1.12)

We say that f is a coarse embedding and X coarsely embeds into Y if !f .t / < 1 for all t and lim t !1 'f .t / D 1. We say that f is a uniform embedding and X uniformly embeds into Y if 'f .t / > 0 for all t > 0 and lim t !0 !f .t / D 0. Remark 1.37. To make 'f .t / well-defined in the case where the metric space X is bounded, we agree that the infimum over an empty set is 1. It should be mentioned

18

Chapter 1 Introduction: examples of metrics, embeddings, and applications

that the definitions of 'f .t / and of a coarse embedding are useful only for unbounded metric spaces. The definitions above are closely related to the following classical definitions: Definition 1.38. The function !f .t / is called the modulus of continuity of f . The map f is called uniformly continuous if lim t !0 !f .t / D 0. Observe that for metric spaces the definition of a uniform embedding can be restated as: An injective map f : X ! Y between two metric spaces is called a uniform embedding if it is uniformly continuous and its inverse f 1 , defined as a map from f .X/ to X, is also uniformly continuous. This definition makes sense and is used also for bounded metric spaces. Observation 1.39. If X is a Banach space and Y is any metric space, then for any map f : X ! Y , the modulus !f is a subadditive function in the sense that !f .s C t /  !f .s/ C !f .t /. If s and t are integers, the same inequality holds if X is an unweighted graph with its graph distance. Proof. In fact, if X is a Banach space, then for any pair u, v 2 X satisfying ku  vk  s C t there is a vector w satisfying ku  wk  s and kw  vk  t . By the triangle inequality we have dY .f .u/, f .v//  dY .f .u/, f .w//CdY .f .w/, f .v//  !f .s/C !f .t /. Therefore !f .s C t /  !f .s/ C !f .t /. The same proof works in the second case. Example 1.40. For 1  p  q < 1 there exists an embedding T : Lp .R/ ! Lq .R  R/ which is simultaneously coarse and uniform. Proof. Our proof is simpler for p D 1, so we start with this case. We define a map T : L1.R/ ! L1 .R  R/ by: 8 ˆ < 1 if 0 < s  f .t /, def T .f /.t , s/ D 1 if f .t / < s < 0, ˆ : 0 otherwise. For all f , g 2 L1.R/ we have:

²

jT .f /.t , s/  T .g/.t , s/j D Thus, for all q  1 we have q

kT .f /  T .g/kL

q .RR/

1 if g.t / < s  f .t / or f .t / < s  g.t /, 0 otherwise.

Z Z D Z D

R

R

 ds dt .g.t /,f .t /[.f .t /,g.t /

jf .t /  g.t /jdt D kf  gkL1 .R/.

(1.13)

19

Section 1.2 Isometric, bilipschitz, coarse, and uniform embeddings

It is clear that this equality implies that the embedding T : L1 .R/ ! Lq .R  R/ is both coarse and uniform. Our proof in the case p > 1 is more complicated. First we consider the complexvalued space Lq .R  R/. We comment on the real-space-case later. Let 0 < ˛ < 2ˇ. It is easy to see that the integral Z

1 1

.1  cos.tx//ˇ dt jt j˛C1

converges for each x 2 R. Observe that if we replace x with sx, s > 0, in the integral, we get Z

1 1

.1  cos.t sx//ˇ dt jt j˛C1

.uDst /

Z

D

1 1

Z D s˛

1 1

.1  cos.ux//ˇ du juj˛C1 =s ˛C1 s .1  cos.ux//ˇ du. juj˛C1

Also the value of the integral does not change if we replace x with x. Therefore there exists a constant c˛,ˇ > 0 such that Z

1 1

.1  cos tx/ˇ dt D c˛,ˇ jxj˛ . jt j˛C1

Define a map T : Lp .R/ ! Lq .R  R/ by T .f /.s, t / D

1  e i tf .s/ . jt j.pC1/=q

We need to show that this function is in Lq .R  R/. We use the following elementary trigonometric formula ..1  cos w/2 C .sin w/2/1=2 D

p 2 .1  cos w/1=2 .

(1.14)

Using this formula we get “

“ q

jT .f /.s, t /j dt ds D “

j1  e i tf .s/jq dt ds jt jpC1

2q=2 .1  cos.tf .s///q=2 dt ds jt jpC1 Z p D 2q=2 cp,q=2 jf .s/jp ds D 2q=2 cp,q=2 kf kL

D

p .R/

.

20

Chapter 1 Introduction: examples of metrics, embeddings, and applications

If f , g 2 Lp .R/, using a similar computation we get:    1  e i tf .s/  .1  e i tg.s// q q   kT .f /  T .g/kLq .RR/ D   jt j.pC1/=q Lq .RR/ “ j1  e i t .f .s/g.s//jq D dt ds jt jpC1 p D 2q=2cp,q=2 kf  gkL

p .R/

,

so T is the required embedding. Returning to the real case: considering real and imaginary parts of a complex-valued function we may identify a complex-valued space Lq ./ with the real-valued space Lq . [ /. The norms of a complex-valued function on  and the corresponding real-valued function on  [  are not the same, but the map is a bilipschitz map. Using Fact 1.20 we complete the proof in the real case. Remark 1.41. If q D 1, even an isometric embedding f : Lp .R/ ! Lq .0, 1/ exists. In fact, let ¹.ai , bi /º1 i D1 be a sequence of disjoint subintervals of .0, 1/. It is easy to see that the subspace of L1 .0, 1/ consisting of functions f which are constant on each of .ai , bi / (that is, x, y 2 .ai , bi / ) f .x/ D f .y/) is isometric to `1 . Since Lp .R/, 1  p < 1 is separable, the conclusion follows from Exercise 1.54. Remark 1.42. The proof in the case p D 1 presented above can be used to find a simultaneously coarse and uniform embedding of L1./ into Lq .  R/ for an arbitrary measure space . In the case q D 2 we get 1=2

Corollary 1.43. The metric space .L1., /, kf  gkL .,/ / admits an isometric 1 embedding into a Hilbert space. Definition 1.44. A metric space X is called uniformly discrete if there exists a constant ı > 0 such that 8u, v 2 X .u ¤ v/ ) .dX .u, v/  ı/. For uniformly discrete metric spaces the notion of a uniform embedding is not useful because any bijection between a uniformly discrete metric space and a uniformly discrete subset of another metric space is a uniform embedding. The definition of a coarse embedding is usually given in the form: Definition 1.45. A map f : .X, dX / ! .Y , dY / between two metric spaces is called a coarse embedding if there exist non-decreasing functions 1, 2 : Œ0, 1/ ! Œ0, 1/ (observe that this condition implies that 2 has finite values) such that lim t !1 1.t / D 1 and 8u, v 2 X 1 .dX .u, v//  dY .f .u/, f .v//  2.dX .u, v//.

Section 1.2 Isometric, bilipschitz, coarse, and uniform embeddings

21

As in the case of bilipschitz embeddings we need the following definition for families. Definition 1.46. Let ¹.X˛ , d˛ /º˛2A and ¹.Y˛ , D˛ /º˛2A be two families of metric spaces. A family f˛ : X˛ ! Y˛ of embeddings is called uniformly coarse if there exist two non-decreasing functions 1, 2 : Œ0, 1/ ! Œ0, 1/ such that 

8˛ 2 A 8u, v 2 X˛

1 .d˛ .u, v//  D˛ .f˛ .u/, f˛ .v//  2.d˛ .u, v//.



2 takes finite values only.



lim t !1 1.t / D 1.

Remark 1.47 (Warning). M. Gromov [175, p. 211] used the term uniform embedding for what we call a coarse embedding and defined it a bit differently, but for all problems which we are going to consider his definition is equivalent to Definition 1.36. We prefer to use the term coarse embedding because in nonlinear functional analysis the term uniform embedding is used in the sense of Definition 1.36. The main purpose of this remark is to alert the reader that in some publications the term uniform embedding is used for the class of embeddings which we call coarse. Example 1.48. Embeddings of an infinite binary tree. We define the infinite binary tree T1 as the graph whose vertices are in bijective correspondence with all finite sequences of 0 and 1, including an empty sequence, two vertices are adjacent if the lengths of the corresponding sequences differ by 1, and one of them is an extension of the other. (In more detail: a sequence ¹ai ºm i D1 is an extension n of ¹bi ºi D1 if m > n and a1 D b1, a2 D b2, : : : , an D bn .) The infinite binary tree with its graph distance admits an isometric embedding into `1 . To get such an embedding we consider a bijection between edges of T1 and the unit vectors of `1 . We map the vertex v0 corresponding to the empty sequence to 0. We map any other vertex v of T1 to the sum of unit vectors corresponding to edges of the unique path from v0 to v in T1 . The fact that this defines an isometric embedding can be verified in a straightforward way. Using Example 1.40 we get the existence of a coarse embedding of T1 into `2 . It is worth mentioning that one can get a coarse embedding of T1 into `2 much easier, just by using the same construction as above with the unit vectors of `2 instead of the unit vectors of `1 . Remark 1.49. It is known (see Section 9.5.1) that there are no bilipschitz embeddings of T1 into `2 . Remark 1.50. If the metric space M is an unweighted graph with its graph distance, then a coarse embedding f : M ! X is a Lipschitz map with Lipschitz constant  2.1/.

22

Chapter 1 Introduction: examples of metrics, embeddings, and applications

Proof. Let x, y 2 M . There exists a sequence x0, : : : , xn such that n D dM .x, y/, x0 D x, xn D y and dM .xi , xi C1/ D 1. Then dX .f .xi /, f .xi C1//  2.1/; therefore dX .f .x/, f .y//  n2.1/.

1.3 Probability theory terminology and notation By a probability space we mean a triple ., †, P/, where  is a set and † is a collection of subsets of  satisfying the conditions (i)

If A 2 †, then so is its complement: A0 2 †

(ii)

If ¹An º is any finite or countable collection of sets in †, then .[n An / 2 † and .\n An / 2 †.

(iii) ; 2 † and  2 †. A collection † of subsets satisfying the conditions (i)–(iii) is called a -algebra. Elements of † are called events. The last element of the triple P is a real-valued function on † satisfying the conditions: 

P.A/  0 for each A 2 †.



P./ D 1.



If ¹An º is anyPfinite or countable collection of non-overlapping sets in †, then P .[n An / D n P.An /. (This condition implies P.;/ D 0.)

A function P satisfying these conditions is called a probability measure on †, the number P.A/ is called the probability of the event A. The set of all subsets of any set is always a -algebra. If X is a metric space, the smallest -algebra of subsets of X containing all closed sets of X is called the Borel algebra of X and the sets in it are called Borel sets. Let ., †, P/ be a probability space. A function f on  with values in a metric space X is called measurable if the set f 1 .I / is in † for each Borel set I in X (it is easy to check that in the case of real-valued functions it suffices to require that f 1 .I / is in † for each interval I of the real line). A random variable on a probability space ., †, P/ is a measurable real-valued, complex-valued, or Banach space valued function on . An X-valued random variable is called symmetric if for each Borel set I  X the equality P¹! : X.!/ 2 I º D P¹! : X.!/ 2 I º holds. For some of the random variables we define the expectation. We introduce the indicator function of a set S by ´ 1 if x 2 S 1S .x/ D (1.15) 0 if x … S.

Section 1.3 Probability theory terminology and notation

23

First we define expectations for functions with values in a Banach space X (which can be R or C) presented as finite sums of the form X ai 1Ai , (1.16) i

where ai 2 X and Ai 2 †. Such random variables are called simple. P We define the expectation E.F / of a simple random variable F D i ai 1Ai by X E.F / D ai P.Ai /. i

One can check that this sum does not depend on the choice of representation. To introduce expectations of other random variables we observe that E.kF k/ is a norm on the linear space of all simple X-valued (X is a fixed Banach space) random variables. If a random variable F can be represented as a limit of a sequence of simple random variables ¹Fi º with respect to this norm, we define E.F / D limi !1 E.Fi /. It can be checked that this definition does not depend on the choice of the sequence ¹Fi º1 i D1. It is easy to verify that for a random variable F the expectation E.F / is defined if and only if the expectation E.kF k/ is defined, and kE.F /k  E.kF k/. The space of all X-valued random variables F for which E.F / and E.kF k/ are defined is denoted L1., †, P, X/. We use the notation L1.X/ in the case where  is the interval Œ0, 1, † is the -algebra of Borel subsets of Œ0, 1, and P is the usual (Lebesgue) measure restricted to Œ0, 1. We also introduce the Banach space Lp ., †, P, X/ .1 < p < 1/ as linear subspaces of L1 ., †, P, X/ for which E.kXkp / < 1 is defined. We use the notation Lp .X/ in the case of Œ0, 1. The variance Var .F / of a real-valued random variable F is defined by   Var .F / D E .F  EF /2 . Let A, B 2 † be two events. The conditional probability of A with respect to B is defined as P.AjB/ D P.A\B/ (defined only if P.B/ ¤ 0). Events A, B 2 † are called P.B/ independent if P.A \ B/ D P.A/  P.B/. If A and B are independent and P.B/ ¤ 0, then P.AjB/ D P.A/. Random variables F1 :  ! X1 and F2 :  ! X2 are called independent if for any two Borel sets I1  X1 and I2  X2 the events F11 .I1 / and F21 .I2 / are independent. Random variables ¹Fi ºniD1 with values in ¹Xi ºniD1, respectively, are called independent if n n

Y

\ Fi1 .Ii / D P Fi1 .Ii / P i D1

i D1

24

Chapter 1 Introduction: examples of metrics, embeddings, and applications

for any collection ¹Ii ºniD1 of Borel sets satisfying Ii  Xi . An infinite sequence ¹Fi º1 i D1 of random variables is called independent if each finite subset of it is independent in the sense of the previous definition. It is not difficult to prove that if F1 is a real-valued random variable on , F2 is a Banach space valued random variable on , and F1 and F2 are independent, then E.F1 F2 / D E.F1 /E.F2 /. Now we introduce some basic definitions and facts about martingales. Let ., †, P/ be a probability space, and F be a -subalgebra of † (that is, a -algebra of sets of  satisfying F  †). Let X be a Banach space and f :  ! X be a map. The map is called measurable with respect to F (or F-measurable) if for each Borel set A  X the pre-image f 1 .A/ is an element of F. For F 2 L1., †, P, X/ we define the conditional expectation E.F jF/ with respect to the -subalgebra F as an F-measurable X-valued random variable Fz for which Z Z Fz d P (1.17) F dP D R

A

A

for each A 2 F, where A F d P is defined as E.F  1A /. We emphasize that (1.17) is required only for A 2 F. The proof of the existence of the conditional expectation for general real-valued random variables requires the Radon–Nikodým theorem on integral representation of absolutely continuous measures. As for Banach space valued random variables, the existence of conditional expectation in this case is proved in the following way: it is easy to see that for a variable of the form f  x, where f 2 L1., †, P/ and x 2 X, we have E.f  xjF/ D E.f jF/  x. On the other hand, it can be shown that linear combinations of functions of the form f  x, where f 2 L1 ., †, P/ and x 2 X, are dense in L1., †, P, X/. For applications in this book we are going to use the conditional expectations with respect to finite -subalgebras F only. In this case the existence is obvious: It is easy to check that if F is finite, it is generated by its minimal subsets, which form a partition of . Let F be generated by disjoint sets ¹A1 , : : : , An º partitioning , and F be an X-valued random variable. Then the random variable FQ which takes the value R Ai F d P P.Ai /

at each point of Ai is the conditional expectation of F with respect to F. Let ., †, P/ be a probability space. A (finite or infinite) sequence of random variables ¹M0, M1 , : : : , Mn , : : : º in L1., †, P, X/ is called a martingale if there exists an increasing sequence F0  F1      Fn  : : : of -subalgebras of † (such sequence of -subalgebras is called a filtration) such that for each n  0 the random variable Mn is Fn -measurable and satisfies Mn D E.MnC1 jFn /.

Section 1.4 Applications to the sparsest cut problem

25

1.4 Applications to the sparsest cut problem We return to the discussion of the sparsest cut problem (see page 4). Let dmin be the semimetric which solves the minimization problem (1.4). Observation 1.51. PIf the semimetric space .V , dmin/ is isometric to a subset of L1, then the quantity uv2E c.uv/dmin .u, v/ coincides with the sparsity of the sparsest cut. To prove this observation we need the following result which is useful in many other contexts. Proposition 1.52. Let .X, dX / be a finite semimetric space. The space X admits an isometric embedding into L1 if and only if its metric is a positive linear combination of cut semimetrics, in the sense that there are nonnegative numbers ˛1 , : : : , ˛m and cuts .S1, SN1 /, : : : , .Sm , SNm / such that 8u, v 2 X

dX .u, v/ D ˛1dS1 .u, v/ C    C ˛m dSm .u, v/.

Proof. If a metric dX is given by a linear combination of the described type we define an embedding f : X ! `m 1 whose coordinates are given by ´ 0 if x 2 Si .f .x//i D ˛i if x … Si . It is easy to check that 8u, v 2 X

kf .u/  f .v/k D ˛1dS1 .u, v/ C    C ˛m dSm .u, v/,

so that f is an isometric embedding. On the other hand, if X admits an isometric embedding into L1 we may assume (see Lemma 1.24) that X is a finite subset of some `k1 . / Let ¹ˇi ,j ºjp.i D1 be the set of all possible values of the i-th coordinate of elements of X, i D 1, : : : , k, arranged in an increasing order, so ˇi ,j C1 > ˇi ,j . Let Si ,j be the set of elements whose i-th coordinate is  ˇi ,j , and di ,j be the corresponding cut semimetric. Since the coordinates of each element u 2 X have the form ˇ1,j.1/ , ˇ2,j.2/ , : : : , ˇk,j.k/ , one can easily check that dX .u, v/ D

/1 k p.i X X

.ˇi ,j C1  ˇi ,j /di ,j .u, v/.

i D1 j D1

Now we return to the proof of Observation 1.51. We start by proving, for any nonnegative numbers 1, : : : , m and z1 , : : : , zm , the inequality ³ ² 1 C    C m 1 m . (1.18)  min ,:::, z1 C    C zm z1 zm

26

Chapter 1 Introduction: examples of metrics, embeddings, and applications

This can be done by induction starting with the inequality ² ³ 1 2 1 C 2  min , . z1 C z2 z1 z2 To get this inequality we may assume that

1 z1



2 z2

or

2 z1  1 z2 In such a case we need to show

1 C2 z1 Cz2



1 z1

(1.19)

or

1z1 C 2z1  1z1 C 1z2 .

(1.20)

We see that (1.20) follows immediately from (1.19). By Proposition 1.52, if .V , dmin/ is isometric to a subset of L1 , then dmin D ˛1 dS1 C    C ˛m dSm . By (1.18) we get P P c.uv/dSi .u, v/ uv2E c.uv/dmin .u, v/ P  min P uv2E . 1i m u,v2V D.u, v/dmin .u, v/ u,v2V D.u, v/dSi .u, v/ The conclusion of the observation follows. An immediate consequence of the proof of Observation 1.51 is that if the metric space .V , dmin/ is not isometric to a subset of L1 , but there is a bilipschitz embedding of this space into L1 with distortion  C , then the sparsity of the sparsest cut does not exceed C  .the sparsity of dmin/.

1.5 Exercises Recall that a metric space is called separable if it has a countable dense set. Exercise 1.53. Prove that any separable metric space admits an isometric embedding into `1 . By a linear isometric embedding of one normed linear space into another we mean an isometric embedding which is simultaneously a linear map. Exercise 1.54. Prove that any separable normed linear space admits a linear isometric embedding into `1 . If needed, see a hint in the Hints to exercises section below. m m Exercise 1.55. We say that vectors a D ¹ai ºm i D1 and b D ¹bi ºi D1 in `p are disjoint if the sets ¹i : ai ¤ 0º and ¹i : bi ¤ 0º are disjoint. Let 1 < p < 1, p ¤ 2, and n  m. Show that each linear isometric embedding T : `pn ! `pm is given by T .ei / D di , where ¹ei ºniD1  `pn are the unit vectors and ¹di ºniD1 are disjoint vectors of norm 1 in `pm .

If needed, see a hint in the Hints to exercises section below.

Section 1.6 Notes and remarks

27

Exercise 1.56. Decrease the dimension in Proposition 1.17 to n  1. The definition of a uniformly convex normed linear space is given on page 31. Exercise 1.57. Show that `2 is uniformly convex. The definition of bounded geometry is given on page 31. Exercise 1.58. Show that each finitely generated group with the word distance has bounded geometry. Exercise 1.59. Find a sharp estimate from below of the distortion of embeddings of K1,3 into `2 (see Proposition 1.25). If needed, see a hint in the Hints to exercises section below. Exercise 1.60. Show that for n  4 any metric space with n elements admits an isometric embedding into `n2 1 . In the Hints to exercises section we suggest an approach to Exercise 1.60. Exercise 1.61. Let 1 < p < q < 1. Show that there exists an embedding T : `p ! `q which is simultaneously coarse and uniform. In the Hints to exercises section we suggest an approach to Exercise 1.61. Exercise 1.62. Consider two different finite sets of generators, S1 and S2, in a finitely generated group G. Let d1 and d2 be the corresponding word metrics on G. Show that the map .G, d1 / ! .G, d2/ which maps each element to itself is a (bijective) bilipschitz embedding.

1.6 Notes and remarks 1.6.1 To Section 1.1 Metric geometry first appeared in the first half of the 20th century in the works of Fréchet, Menger, Blumenthal, and Schoenberg. Expositions of results obtained in this period can be found in [65], [66]. In recent years metric geometry has experienced explosive growth, as demonstrated by expositions in the following books, surveys, and lecture notes: [55], [82], [87], [177], [196], [197], [198], [237], [289], [333], [334], [345], [394], [443]. Each of the examples of the metric space examples presented in Section 1.1 corresponds to a well-developed area of mathematics. We shall use some results of the

28

Chapter 1 Introduction: examples of metrics, embeddings, and applications

corresponding areas without proofs, here we mention the corresponding sources. Banach spaces are among the most important objects in functional analysis. As a general introduction to functional analysis we recommend [130]. Recommended sources for basic Banach space theory are [9], [50], and [141]. Extremely useful advanced monographs on Banach space theory are [55], [120], [287], and [288]. Another highly recommended source is the two-volume handbook [221], [222]. We are going to follow the graph theory terminology and notation of a very comprehensive textbook [71], another relevant text is [122]. An extremely useful source in graph theory is the collection of problems [297] containing hints and complete solutions for all of its problems. A famous essay on geometric group theory is [175]. The books [195] and [323] can be used to enter the subject. Cut semimetrics and their applications in combinatorial optimization are the subject of the book [118]. The sparsest cut problem is one of the most important problems in the field of approximation algorithms. The problem of computing the sparsest cut is known to be NP hard [409]. The sparsest cut problem is used as a subroutine in many approximation algorithms for NP-hard problems; see the survey article [411], as well as the references in [280],[25],[26],[98],[431],[438]. See [408] for information on algorithms available for linear programming problems.

1.6.2 To Section 1.2 Different aspects of the theory of isometric embeddings are presented in [65], [118], [151], [152], [181], [220], [252], [262], [436]. Note on Fréchet. In 1910 Fréchet [155, p. 161–162] defined `1 as a metric space (he denoted it D) and proved that each separable metric space can be isometrically embedded into `1 . Some authors use the term Kuratowski embedding or Hausdorff– Kuratowski embedding for what we call the Fréchet embedding. This is because in 1935 Kuratowski [258, p. 543] proved that a bounded metric space X can be isometrically embedded into the space C.X/ of bounded continuous functions on X with the supremum norm using the map: p 7! fp .x/ :D dX .x, p/. In [259, p. 224] the unbounded case is also treated using the embedding similar to the one in our proof of Proposition 1.17; at this point Kuratowski refers to the paper by Kunugui [257], where the proof of the mentioned above Fréchet result is reproduced. Hausdorff is mentioned in this context in [257] where it is observed that by embedding an arbitrary metric space into some Banach space we get a new proof of the Hausdorff theorem on existence of a completion of a metric space. After this investigation the present author decided that the term Fréchet embedding is the most suitable for embeddings of the type considered in Proposition 1.17. The mentioned Fact 1.20 can be derived from the following results:

Section 1.6 Notes and remarks

29

Lemma 1.63 ([130, Lemma III.8.5]). For each separable subset G of Lp ., †, /, 1  p < 1, there is a set S1 2 † and a -subalgebra of †-restricted-to-S1 , such that the restriction 1 of  to †1 has the following properties 

Lp .S1 , †1, 1 / is separable.



G  Lp .S1 , †1 , /.

The pair .†1 , 1 / is called a separable measure algebra if Lp .S1 , †1 , 1/, 1  p < 1, is separable. (The standard definition, see [396, p. 321] is different, but is equivalent.) The Carathéodory theorem (see [396, p. 321]) states that each separable measure algebra is isomorphic to some algebra of measurable subsets of Œ0, 1 endowed with the Lebesgue measure. This isomorphism induces an isometric embedding of Lp .S1 , †1 , 1 / into Lp .0, 1/. Proposition 1.21 and Lemmas 1.22 and 1.24 apparently should be considered as folklore. The exact estimates for the dimension of `k1 containing an n-element subset of L1 were found in [440] and [35]. Examples 1.32 and 1.33 were suggested by the present author, although with high probability they were known before. Example 1.32 can be easily generalized to an arbitrary strictly convex Banach space. The Baire category theorem can be found in many sources, see e.g. [130, p. 20] or [396, p. 139]. Example 1.34 is well known. Theorem 1.35 is from [219]. As a well-known and very valuable tool in computer science, the Johnson–Lindenstrauss lemma has been proven in various different ways, each method characterized by different algorithmic features, see [4],[6],[109],[154],[209],[250],[316] and references therein. The sharpness of the Johnson–Lindenstrauss lemma was studied in [12],[13]. Our proof of the Johnson–Lindenstrauss lemma is based on the lecture of Assaf Naor given at Workshop in Analysis and Probability at Texas A & M University, July 2006, see also [316]. A significant amount of effort was put into finding analogues of the Johnson– Lindenstrauss lemma for other spaces, see [54],[83],[93],[172],[223],[226],[276], see also k-dimensional versions of the lemma in [124]. Example 1.40 is from [325, Remark 5.10] and [333, Section 3]. The fact that p=q .Lp , kx  ykLp / admits an isometric embedding into Lq when 1  p  q  2 is due to Bretagnolle, Dacunha-Castelle, and Krivine [80, p. 251]

1.6.3 To Section 1.3 In this section we list some standard definitions of probability theory. Our summary is similar to the one given in [233]. Our recommended source on probability theory is [146],[147]. For more information on Banach space valued martingales we recommend [383]. This book also contains basic material on vector-valued random variables and conditional expectations.

30

Chapter 1 Introduction: examples of metrics, embeddings, and applications

1.6.4 To Section 1.4 The application of low-distortion embeddings described in this section appears in [290] and [33]. Distortion estimates for finite metric spaces have been studied extensively, a survey of this study appears in [208] and [314, Chapter 15]. It is worth mentioning that an updated version of [314, Chapter 15] is available on the internet, see [315]. In my opinion [314, Chapter 15] (both in 2002 and in 2005 versions) is very nice reading with a nice selection of exercises. Applications of low-distortion bilipschitz embeddings in computer science are so numerous that I do not even try to survey them. Information on some of the developments in the area can be found in [2],[3],[142],[207],[333].

1.6.5 To exercises Exercise 1.53 is taken from [155, pp. 161–162]. Exercise 1.55 is from [193] and [367, Lemma 3]. Exercise 1.60 is from [441]. The problem of estimating the least dimension d.n/ such d.n/ that all n-element metric spaces are isometric to subsets of `1 was further studd.n/ ied in [35] and [371]. The problem obtained if we replace `1 with “some d.n/dimensional normed space” was studied in [34]. Exercise 1.61 is a discrete version of Example 1.40. See [8] and [47] for closely related results. See [344] and [354, Theorem 5.1] for a different approach to construction of coarse embeddings of `2 into other Banach spaces.

1.7 On applications in topology Usage of coarse embeddings of uniformly discrete metric spaces into sufficiently good Banach spaces in topology was initiated by Gromov. In this section we cite the corresponding paper of Gromov, and state the results which could be considered as very important steps in the realization of Gromov’s program. This section requires much more background than the rest of the book. For this reason readers who do not have such background are advised to skip this section and to read only the following summary of it: Some important recent results in topology are proved in two steps. (1) Certain uniformly discrete metric spaces related to the problem in question (for example, homotopy groups of topological spaces with their word metrics) are coarsely embedded into a Hilbert space or a uniformly convex Banach space (see Definition 1.65 below). (2) This embeddability is used to answer the problem.

Section 1.7 On applications in topology

31

This approach has been already mentioned in the essay [175], and on September 9, 1993 Gromov suggested the following problems (the enclosed is a citation from [176]): (4)

Does every finitely generated or finitely presented group admit a uniformly metrically proper Lipschitz embedding into a Hilbert space? Even such an embedding into a reflexive uniformly convex Banach space would be interesting. This seems hard.

(5)

Can one give a new proof using the above philosophy (of mapping to Euclidean or Hilbert space) of the Strong Novikov conjecture (injectivity of the assembly map for the K-theory of the group C  -algebra) or the Baum–Connes Conjecture for uniformly discrete subgroups of SO.n, 1/, SU.n, 1/?

Remark 1.64. The notion of a ‘uniformly metrically proper Lipschitz embedding’ coincides with the notion of ‘coarse embedding’ introduced in Definition 1.36 Somewhat later G. Yu [442] and G. Kasparov and G. Yu [243] obtained strong positive results in the direction of Problem (5). Because of this the problems of embeddability of uniformly discrete metric spaces into well-structured Banach spaces became important for topology. So far only embeddings into uniformly convex spaces, or better into `2 , are useful. Definition 1.65. A Banach space is called uniformly convex if for every " > 0 there is some ı > 0 so that for any two vectors with kxk  1 and kyk  1, the inequality kx C yk > 2  ı implies kx  yk < ". Intuitively, the center of a line segment inside the unit ball must lie deep inside the unit ball unless the segment is short. Uniformly convex Banach spaces form a rather wide and very complicated (from the Banach-space-theoretical point of view) class of Banach spaces. Metric spaces which are the most interesting for topology are finitely generated infinite groups with their word metrics (see the definitions on page 3). In problems related to the Baum–Connes conjecture, the following class of metric spaces is also of importance: Definition 1.66. A uniformly discrete metric space A is said to have a bounded geometry if for each r > 0 there exist a positive integer M.r / such that each ball in A of radius r contains at most M.r / elements.

32

Chapter 1 Introduction: examples of metrics, embeddings, and applications

In the problems cited above M. Gromov suggested to use uniform (coarse) embeddings into a Hilbert space or into a uniformly convex space as a tool for approaching some of the well-known problems. G. Yu [442] and G. Kasparov and G. Yu [243] have shown that this is indeed a very powerful tool. They proved (we warn the reader that the term “uniform embedding” in the theorems below is used for what we call a “coarse embedding”): Theorem 1.67 (G. Yu, [442]). Let be a discrete metric space with bounded geometry. If admits a uniform embedding into Hilbert space, then the coarse Baum–Connes conjecture holds for . Theorem 1.68 (G. Kasparov and G. Yu, [243]). Let be a discrete metric space with bounded geometry. If is uniformly embeddable into a uniformly convex Banach space, then the coarse geometric Novikov conjecture holds for , i.e., the index map from limd !1 K .Pd . // to K .C  . // is injective, where Pd . / is the Rips complex of and C  . / is the Roe algebra associated to . Theorems 1.67 and 1.68 made Gromov’s problem (4) (cited above) and its generalization for spaces with bounded geometry very important. In most of the known applications of embeddings in topology coarse embeddings are used. However, recently Chen, Wang, and Yu [99] showed that a strictly weaker notion of embedding (fibered coarse embedding) can be used as well. In the negative direction, Gromov [179] constructed finitely generated groups which do not admit coarse embeddings into a Hilbert space. Sapir [403] proved that there is a 4-dimensional closed aspherical manifold M such that the fundamental group 1.M / contains weakly a family of expanders (see Definition 7.2), and so 1.M / is not coarsely embeddable into a Hilbert space.

1.8 Hints to exercises To Exercise 1.54. Use the argument similar to the one used at the end of Example 1.33 to construct a bilipschitz embedding of `22 into `m 1. To Exercise 1.55. For 1  p < 2 and x, y 2 R prove that jx C yjp C jx  yjp  2jxjp C 2jyjp and the equality holds if and only if at least one of the numbers x and y is equal to 0. For 2 < p < 1 prove that jx C yjp C jx  yjp  2jxjp C 2jyjp and the equality holds if and only if at least one of the numbers x and y equals 0. Then apply these inequalities to each of the coordinates of the images of two unit vectors, in order to show that they are disjoint. To Exercises 1.57 and 1.59. Use the parallelogram identity: 8x, y 2 `2

kx C yk2 C kx  yk2 D 2kxk2 C 2kyk2.

Section 1.8 Hints to exercises

33

To Exercise 1.60. Do it in three steps: (a) Start with n D 4, let .X, d / be a 4-element metric space. We say that a realvalued function f on X takes care of the distance between u and v .u, v 2 X/ if Lip.f / D 1 and jf .u/f .v/j D d.u, v/. Consider the bijective correspondence between distances in X and edges of K4. Observe that we need to split the set of edges of K4 into two subsets, S1 , and S2 , such that there exist real-valued functions f1 and f2 with f1 taking care of distances from S1 and f2 taking care of distances from S2 . We can find sets S1 and S2 for which this is possible in the following way: We partition edges of K4 into three pairs of edges without common end vertices, and compute the sums of distances corresponding to the pairs. Then we split the pair with the largest sum, and distribute its edges among the remaining pairs. Finally we prove that the obtained triples of edges can serve as S1 and S2 . (b) Prove: If Y  X, then any function f on Y with Lip.f / D 1 may be extended to a function fz on X with Lip.fz/ D 1. (c) We need to embed isometrically a .k C 4/-element metric space X into `kC2 1 . Let Y be a 4-element subset of X and f1 and f2 be functions producing an isometric embedding of Y into `21 (according to (a)). Show that their extensions to X, existing according to (b), together with the functions d., v/, v 2 XnY produce an isometric embedding into `kC2 1 . To Exercise 1.61. One of the approaches is to consider, for 0 < ˛ < 2ˇ, the series P1 .1cos.2n x//ˇ and to mimic steps of the proof in Example 1.40 to the extent nD1 2n˛ to which it is possible.

Chapter 2

Embeddability of locally finite metric spaces into Banach spaces is finitely determined. Related Banach space theory 2.1 Introduction Definition 2.1. A uniformly discrete metric space X is called locally finite if for every x 2 X and every r > 0 the set ¹u 2 X : dX .x, u/  r º is finite. Remark 2.2. Some authors do not require uniform discreteness of locally finite metric spaces (see in this connection Remark 2.35 and the corresponding comment in Notes and Remarks section). Most of our results hold in this more general setting. We invite readers to check in each case whether the uniform discreteness is needed. Examples 2.3. 1. Unweighted graphs with finite degrees, that is, with finitely many edges incident with each vertex, are locally finite metric spaces with their graph distances. 2. Important examples of uniformly discrete but not locally finite metric spaces are nets in infinite-dimensional Banach spaces. Let us recall the definition: Definition 2.4. Let ı > 0. By a ı-net in a metric space X we mean a collection U of elements of X satisfying the conditions: (a) 8x 2 X 9u 2 U dX .u, x/  ı. (b) 8u, v 2 U dX .u, v/  ı. The existence of nets can be derived from the following result: Lemma 2.5 (Zorn’s lemma). A partially ordered set has a maximal element if every totally ordered subset of it has an upper bound. In this lemma we use the following definitions: A partially ordered set is called totally ordered if any two elements in it are comparable (that is, satisfy a  b or b  a). An upper bound of a subset A in a partially ordered set B is an element u 2 B satisfying a  u for each a 2 A. We are going to consider Zorn’s lemma as a basic set-theoretical principle. In the Notes and Remarks section we provide references to sources where Zorn’s lemma is derived from other set-theoretical axioms and results. To get the existence of a ı-net in an arbitrary metric space X we consider the partially ordered by inclusion set P of all subsets U  X satisfying the second condition

Section 2.2 Banach space theory: ultrafilters, ultraproducts, finite representability

35

of Definition 2.4. This set P satisfies the assumptions of Zorn’s lemma because if ¹U º2 is totally ordered by inclusion, then [2 U is also in P and is an upper bound for ¹U º2 . By Zorn’s lemma there is a maximal element M in P . It should satisfy the first condition of Definition 2.4 because otherwise we would be able to find x 2 X such that M [ ¹xº still satisfies the second condition of Definition 2.4 contrary to the maximality of M . This chapter has two main purposes. The first main purpose is to prove the following theorems (we use Definitions 1.28 and 1.46): Theorem 2.6. Let A be a locally finite metric space whose finite subsets admit uniformly bilipschitz embeddings into a Banach space X. Then A admits a bilipschitz embedding into X. Theorem 2.7. Let A be a locally finite metric space whose finite subsets admit uniformly coarse embeddings into a Banach space X. Then A admits a coarse embedding into X. Proofs are given in Section 2.3 of this chapter. The second main purpose of this chapter is to develop two important tools of Banach space theory. The first is the theory of ultraproducts and finite representability. The second tool is the theory of type and cotype.

2.2 Banach space theory: ultrafilters, ultraproducts, finite representability In order to make some of our arguments more readable (both in this chapter and in further chapters) it is convenient to use the limiting procedure based on the notions of an ultrafilter and an ultraproduct.

2.2.1 Ultrafilters Definition 2.8. Let I be an infinite set. A filter on I is a subset F of P.I / (where P.I / is the set of all subsets of I ) satisfying the following conditions (a) ; … F. (b) If A  B and A 2 F, then B 2 F. (c) If A, B 2 F, then A \ B 2 F. Let R be a collection of subsets of I satisfying the conditions: 

; … R.



If A, B 2 R, then 9C 2 R such that C  A \ B.

36

Chapter 2 Locally finite metric spaces

Then the set S :D ¹A  I : 9B 2 R, A Bº is a filter on I called the filter generated by R. Example 2.9. Let I be an infinite set. The set of all subsets of I with finite complements is a filter on I . Example 2.10. Let I be any set, x 2 I . Then the set of all subsets of I containing x is a filter on I . Let Z be a topological space, f : I ! Z be a function. We say that f converges to z 2 Z through F and write limF f .x/ D z, if f 1 .U / 2 F for every open set U containing z. This notion of convergence applied to Example 2.9 in the case where I D N leads to the standard notion of convergence for sequences. Definition 2.11. An ultrafilter U (on I ) is a maximal filter (on I ) with respect to inclusion, that is, a filter which is not properly contained in any larger filter. Lemma 2.12. Every filter is contained in an ultrafilter. Proof. By Zorn’s lemma (see Lemma 2.5 in this section) it suffices to show that for each totally ordered set of filters there is an upper bound. But this immediately follows from the observation that the union of an increasing family of filters is a filter. It is easy to see that each filter of the type described in Example 2.10 is an ultrafilter. We are interested mostly in ultrafilters with empty intersection. Definition 2.13. An ultrafilter is called free if the intersection of all sets of the ultrafilter is empty. (Some authors use nonprincipal or nontrivial instead of ‘free’.) Observation 2.14. If U is an ultrafilter on I and E  I , then either E 2 U or .I nE/ 2 U. The converse is also true: If a filter U on I is such that for each E  I either E 2 U or .I nE/ 2 U, then U is an ultrafilter. Proof. In fact, if E … U, then all intersections A \ .I nE/, A 2 F, are nonempty. It is easy to verify that these sets generate a filter containing U, hence this filter coincides with U. Hence .I nE/ 2 U. To prove the converse observe that if a filter U satisfies the condition of Observation 2.14 and is such that E … U, then the set U [ ¹Eº cannot be a subset of any filter because I nE 2 U and .I nE/ \ E D ;. Thus U is a maximal filter, that is, an ultrafilter. Lemma 2.15. Let U be an ultrafilter on I , K be a compact set, and f : I ! K be a function, then f converges to some point k 2 K through U.

Section 2.2 Ultrafilters, ultraproducts, finite representability

37

Proof. Assume the contrary. Then each point k 2 K has a neighborhood Nk such that f 1 .Nk / is not in U. By Observation 2.14 .I nf 1 .Nk // 2 U. Since K is compact there are finitely points k1 , : : : , kn such that [niD1 Nki K, but then \niD1 .I nf 1 .Nki // 2 U. On the other hand, this set is empty. We get a contradiction. Corollary 2.16. Let I be a set, U be an ultrafilter on I , and f be a bounded real- or complex-valued function on I . Then f converges to some number through U. Note. The limit whose existence is claimed in Lemma 2.15 and Corollary 2.16 is denoted by limU f .i/.

2.2.2 Ultraproducts Definition 2.17. Given a family .Xi /i 2I of Banach spaces, the `1 direct sum of .Xi /i 2I is defined as the space of all bounded collections .xi /i 2I , xi 2 Xi with the vector operations .xi /i 2I C .yi /i 2I D .xi C yi /i 2I , ˛.xi /i 2I D .˛xi /i 2I , and the norm given by k.xi /i 2I k1 D sup kxi kXi . i 2I

The `1 direct sum is denoted by .˚i 2I Xi /1 . It is easy to check that .˚i 2I Xi /1 is a Banach space. Remark 2.18. We introduce `p direct sums for 1  p < 1 in Definition 3.6. We introduce `1 direct sums separately because it is convenient and customary to use a slightly different notation here. Let U be a free ultrafilter on I . By Corollary 2.16 the limit limU kxi kXi exists for each .xi /i 2I 2 .˚i 2I Xi /1 . It is easy to see that limU kxi kXi is a seminorm on .˚i 2I Xi /1 . (Recall that a seminorm is like norm except that kxk D 0 ) x D 0 is not required.) Let NU be the subspace of .˚i 2I Xi /1 on which this seminorm is equal to 0. Lemma 2.19. NU is a closed subspace of .˚i 2I Xi /1 . Proof. We need to show that each vector .zi /i 2I for which limU kzi kXi D ˛ > 0 has a neighborhood satisfying limU kxi kXi ¤ 0. It is clear that the neighborhood ¹.xi /i 2I : kxi  zi k < ˛=2 for all i 2 I º satisfies this condition. Since NU is a closed subspace, the seminorm limU kxi kXi induces a norm on the quotient space .˚i 2I Xi /1 =NU .

38

Chapter 2 Locally finite metric spaces

Definition 2.20. The obtained Banach space Q is called the ultraproduct of .Xi /i 2I with Q respect to the ultrafilter U. We denote it by . i 2I Xi /U or . Xi /U . If all Xi are the same, the corresponding ultraproduct is also called an ultrapower and is denoted X U . The element corresponding to .xi /i 2I in the ultraproduct will be written .xi /U . As we know its norm is given by k.xi /U k D limU kxi k. We also use the i /kU Q notation k.x U for this norm. From our definitions we see that each element of . Xi /U or X is of the form .xi /U , but this representation is not unique. The notion of an ultraproduct is of importance for us because it provides a way of “pasting” bilipschitz and coarse embeddings out of uniformly bilipschitz and uniformly coarse pieces. We mean the following result which will be used in this chapter. Proposition 2.21. Let A be a metric space which is represented as a union of metric 1 subspaces ¹Ai º1 i D1 satisfying A1  A2  A3  : : : . Suppose that ¹Ai ºi D1 admit uniinto Banach spaces formly bilipschitz (uniformly coarse) embeddings fi : Ai ! XiQ ¹Xi º1 . Then A admits a bilipschitz (coarse) embedding into . Xi /U for any free i D1 ultrafilter U. Proof. We pick a point O 2 A1 . We may assume without loss of generality that fi .O/ D 0 (in fact, the conditions in the definitions of bilipschitz and coarse embeddings are shift-invariant). Next, in the bilipschitz case we may assume that all embeddings ¹fi º1 i D1 satisfy the inequalities: 8u, v 2 Ai

dA .u, v/  kfi .u/  fi .v/k  C dA .u, v/,

(2.1)

where C is any number exceeding distortions of all fi . This can be achieved by multiplying all of the maps by r1 , where r is the number in the definition of a bilipschitz embedding (see (1.5) in Definition 1.26). We claim that the embedding which maps a 2 A onto .fi .a//U has the desired property. First we need to fix the following problem with this definition: if a … Ai , then fi .a/ is not defined. The point is that for each a 2 A this can happen only for finitely many indices, and the definition of the norm in the ultraproduct is such that no change in finitely many terms affects the norm (for free ultrafilters). So we may assume, for example, that fi .a/ D 0 2 Xi if a … Ai . It remains to estimate k.fi .u//i 2I  .fi .v//i 2I kU . In the uniformly bilipschitz case we get immediately from (2.1) and the definition of a convergence through an ultrafilter that 8u, v 2 A

dA .u, v/  k.fi .u//i 2I  .fi .v//i 2I kU  C dA .u, v/.

Section 2.2 Ultrafilters, ultraproducts, finite representability

39

In the uniformly coarse case the assumption (2.1) is replaced with 8u, v 2 Ai

1 .dA .u, v//  kfi .u/  fi .v/k  2.dA .u, v//,

and we get 8u, v 2 A

1.dA .u, v//  k.fi .u//i 2I  .fi .v//i 2I kU  2.dA .u, v//.

Q Proposition 2.22. If the sequence ¹dim Xi º1 Xi /U i D1 is bounded, then the space . is also finite-dimensional. We start by proving a result which is very interesting and important by itself. Let ¹xi ºniD1 be a basis in an n-dimensional space X, its biorthogonal functionals are defined by xi .xj / D ıij (Kronecker delta). The basis ¹xi ºniD1 is called an Auerbach basis if kxi k D kxi k D 1 for all i 2 ¹1, : : : , nº. Lemma 2.23 (Auerbach’s lemma). Every finite-dimensional Banach space has an Auerbach basis. Proof. For a vector x 2 X by Œx, x we denote the line segment joining x and x. For x1, : : : , xk 2 X by M.¹xi ºkiD1/ we denote the Minkowski sum of the corresponding line segments, that is, M.¹xi ºkiD1/ D ¹x : x D y1 C    C yk for some yi 2 Œxi , xi , i D 1, : : : , kº. Let n D dim X. Consider the set N.D N.X// consisting of all subsets ¹xi ºniD1  X satisfying kxi k D 1, i 2 ¹1, : : : , nº. It is a compact set in its natural topology; and the n-dimensional volume of M.¹xi ºniD1/ is a continuous function on N . Hence it attains its maximum on N . Let U  N be the set of n-tuples on which the maximum is attained. It is easy to see that each ¹xi ºniD1 2 U is a basis (for linearly dependent sets the volume is zero). Another important observation is that M.¹xi ºniD1/ BX if ¹xi ºniD1 2 U (where BX D ¹x 2 X : kxk  1º). In fact, if there is y 2 BX nM.¹xi ºniD1/ then (since the volume of a parallelepiped is the product of the length of its height and the .n  1/-dimensional volume of its base), there is i 2 ¹1, : : : , nº such that replacing xi by y we get a parallelepiped whose volume is strictly greater the volume of M.¹xi ºniD1/. Since we may assume that kyk D 1, this leads to a contradiction with the definition of U . The following claim shows that each basis in U is an Auerbach basis. Claim 2.24. A system ¹xi ºniD1 2 N is an Auerbach basis if and only if M.¹xi ºniD1/ BX . Proof. It is easy to see that M.¹xi ºniD1/ D ¹x : jxi .x/j  1 for i D 1, : : : , nº

40

Chapter 2 Locally finite metric spaces

for each basis ¹xi ºniD1. Hence M.¹xi ºniD1/ BX if and only if kxi k  1 for each i. It remains to observe that the equality kxi k D 1 implies kxi k  1, i D 1, : : : , n. This completes the proof of Lemma 2.23. dim X

Proof of Proposition 2.22. Let ¹ej ,i ºj D1 i , i 2 I , be Auerbach bases in Xi . We may consider each Xi as a linear subspace of `m 2 , where m D maxi dim Xi . Also we may assume that ej ,i is identified with ej , where ¹ej ºjmD1 is an orthonormal basis in `m 2 . (Of as a normed space.) course, in general Xi is not a subspace of `m 2 Under these conditions a bounded sequence .xi /i 2I corresponds to a bounded sequence in `m 2 . In fact, since ¹ej ,i º are Auerbach bases, the decomposition xi D P a e implies jaj ,i j  kxi k, and therefore the norm of the natural image of xi j ,i j ,i j p m in `m is  mkx k. Thus, by Lemma 2.15, limU xi exists as an element i 2 Q of `2 . Itmis easy to see that the map .xi /U 7! limU xi is a linear embedding of . Xi /U into `2 . This embedding is injective. In fact, limU xi D 0 implies limU kxi k`m D 0. Therefore 2 limU aj ,i D 0 for each jQD 1, : : : , m and limU kxi kXi D 0. The injectivity of the embedding implies that . Xi /U is a finite-dimensional space. Remark 2.25. In the case where all Xi have the same dimension m, the ultraproduct Q . Xi /U is also m-dimensional. In fact, in this case each Xi is identified (as a linear space) with `m bases as above), and it is clear that each stationary 2 (using Auerbach Q sequence .x, x, : : : , x, : : : / is in . Xi /U and is identified with the corresponding x 2 Q . Thus . X / is m-dimensional. `m i U 2 We also use the following important theorem whose proof is beyond the scope of the present book (see Notes and Remarks for references and a proof sketch). Theorem 2.26. If X D Lp .0, 1/ for some p 2 Œ1, 1/ and U is any ultrafilter, then X U is isometric to Lp ., †, / for some measure space ., †, /, and each separable subspace of X U is isometric to a subspace of X.

2.2.3 Finite representability Definition 2.27. The standard notion of distance between two Banach spaces is the Banach–Mazur distance which is defined by d.X, Y / D inf¹kT kkT 1 k : T : X ! Y is an isomorphismº. If X and Y are not isomorphic, we put d.X, Y / D 1. This notion is especially useful for pairs of finite-dimensional spaces of the same dimension. The Banach–Mazur distance is not a metric in the sense of the definition given at the beginning of Chapter 1, but it is easy to verify that it satisfies multiplicative analogues of the corresponding conditions:

41

Section 2.2 Ultrafilters, ultraproducts, finite representability

(a)

Multiplicative triangle inequality: 8X, Y , Z

(b)

Symmetry: 8X, Y

(c)

d.X, X/ D 1.

(d)

d.X, Y /  1.

d.X, Z/  d.X, Y /  d.Y , Z/.

d.X, Y / D d.Y , X/.

Therefore, the restriction of log d.X, Y / to any set of isomorphic Banach spaces is a semimetric. It is not difficult to prove using compactness that in the finite-dimensional case it is a metric, if we identify linearly isometric spaces. Definition 2.28. Let X and Y be two Banach spaces. The space X is said to be finitely representable in Y if for any " > 0 and any finite-dimensional subspace F  X there exists a finite-dimensional subspace G  Y such that d.F , G/ < 1 C ". The space X is said to be crudely finitely representable in Y if there exists 1  C < 1 such that for any finite-dimensional subspace F  X there exists a finitedimensional subspace G  Y such that d.F , G/ < C . Important example: Proposition 2.29. The space Lp is finitely representable in `p . Proof. Let F be a finite-dimensional subspace of Lp , dim F D n. Let ¹ei ºniD1 be a basis in F . Using compactness we get that there exists a constant c > 0 such that   n n X  X  t e jti j. 8¹ti ºniD1  R   c i i  i D1

i D1

(The constant might depend on the subspace, the existence of such a constant is immediate for Auerbach bases.) Recall that a measurable function is called simple if it takes only finitely many different values. It is well known (and is easy to see from the definition of Lp D Lp .0, 1/) that simple functions are dense in Lp . (Even the set of all those simple functions for which pre-images of all values are half-open intervals is dense.) Let " > 0 be arbitrary, pick ! > 0 satisfying .1 C !/=.1  !/ < 1 C ". Let fi 2 Lp be simple functions satisfying kfi  ei k  c!. There is a partition ¹P1, : : : , Pm º of .0, 1/ such that each of the functions fi is constant on each of the elements of the partition. Consider the subspace of Lp spanned by the indicator functions ¹1Pi ºm i D1 . It is easy to see that this space is isometric to `pm (see the proof of Proposition 1.19; as we have mentioned, we may assume that each Pi is an interval). It remains to estimate the norm of the operator T : F ! Lp given by T ei D fi and the norm of its inverse. We have     X X  X  X X  ti ei  ti fi  jti jkei  fi k  !c jti j  !  ti ei   .   i

i

i

i

i

42

Chapter 2 Locally finite metric spaces

Thus .1  !/kxk  kT xk  .1 C !/kxk and kT k  kT 1 jT .F / k < 1 C ". The following theorem is one of the most famous results on finite representability. We do not prove it because we shall use it in some examples only, and also because proofs of this theorem are presented in numerous books and surveys. Theorem 2.30 (Dvoretzky theorem). The space `2 is finitely representable in each infinite-dimensional Banach space. The notion of an ultraproduct is closely related to finite representability. One of the results establishing such relations is the following proposition. Proposition 2.31. Let X be any infinite-dimensional Banach space and U be a free ultrafilter. Then X U is finitely representable in X. Proof. Let F be a finite-dimensional subspace of X U . We pick a basis ¹xj ºjnD1 in F . Let .xj ,i /i 2I be a representative of xj in .˚i 2I Xi /1 . It is clear that we can extend the map xj 7! .xj ,i /i 2I to a linear map on R : F ! .˚i 2I Xi /1 which maps each vector of F to an element in .˚i 2I Xi /1 representing it. Now we pick ı > 0 and ! > 0 satisfying the condition 1C! < 1 C ". 1  !  2ı

(2.2)

Let ¹dk ºm be a ı-net in the unit sphere SF of F (the unit sphere of a normed space kD1 F is defined as SF D ¹x 2 F : kxk D 1º). Let .dk,i /i 2I D Rdk . Since kdk k D limU kdk,i k, for each k there is a subset Uk 2 U such that 8i 2 Uk

1  !  kdk,i k  1 C !

U is nonempty. Let g 2 \m U . Then Since U is an ultrafilter, the intersection \m kD1 k kD1 k the map given by dk 7! dk,g is a linear map and it satisfies 1  !  kdk,g k  1 C !

(2.3)

for each k D 1, : : : , m. We denote this linear map by T . Let us estimate the norm of T and T 1 jT .F / . We use the following general lemma. Lemma 2.32. Let ¹dk ºm be a ı-net in the unit sphere of a finite-dimensional normed kD1 space F and T : F ! X be a linear map satisfying 1  !  kT dk k  1 C !. Then kT k 

1C! 1ı

and kT 1 jT .F / k 

1ı . 1!2ı

43

Section 2.2 Ultrafilters, ultraproducts, finite representability

Proof. Let f 2 SF be such that kTf k D kT k. Let d 2 ¹dk ºm be such that kf  kD1 d k  ı. We have kT k D kTf k  kT d k C kT .f  d /k  1 C ! C kT kı. Therefore kT k 

1C! . 1ı

Now let f be any element in SF and d 2 ¹dk ºm be such that kf  d k  ı. Then kD1 kTf k  kT d k  kT .f  d /k  1  !  ıkT k  1  !  ı and therefore kT 1 jT .F / k 

1C! 1ı

1ı . 1  !  2ı

Combining estimates of Lemma 2.32 with the assumption (2.2) we get that the embedding T:F !X U satisfies the conditions from the definition of finite representability. Now we prove a result which is a kind of a converse to Proposition 2.31. Proposition 2.33. Let Z be a Banach space which is finitely representable in a Banach space X. Then there exists a free ultrafilter U such that the ultrapower X U contains a subspace isometric to Z. Proof. Denote by J the set of all finite-dimensional subspaces of Z. Consider the set I D J  .0, 1/ as an ordered set: .j1 , "1/ .j2 , "2 / if and only if j1 j2 and "1  "2 . Consider an ultrafilter U on I containing the filter generated by sets of the form ¹.j , "/ : .j , "/ .j0, "0 /º, where j0 2 J , "0 2 .0, 1/. Since Z is finitely representable in X, for each pair .j , "/ 2 J  .0, 1/ we can pick a linear operator T.j ,"/ : j ! X satisfying kzk  kT.j ,"/.z/k  .1 C "/kzk. It remains to observe that the maps ´ T.j ,"/.z/ z 7! 0

if z 2 j if z … j

(parameterized by pairs .j , "/ 2 I ) induce an isometric embedding of Z into X U .

44

Chapter 2 Locally finite metric spaces

2.3 Proofs of the main results on relations between embeddability of a locally finite metric space and its finite subsets 2.3.1 Proof in the bilipschitz case Proof of Theorem 2.6. We pick a point O in A and let Ai D ¹a 2 A : dA .O, a/  2i º. By the assumption there are uniformly bilipschitz maps fi : Ai ! X. We may and shall assume that fi .O/ D 0 and that there is a constant 1  C < 1 such that 8u, v 2 Ai

dA .u, v/  kfi .u/  fi .v/k  C dA .u, v/.

(See the proof of Proposition 2.21.) U Let U be a free ultrafilter on N. The maps ¹fi º1 i D1 induce a map f : A ! X 1 z defined by f .u/ D ¹fi .u/ºi D1 , where ´ fi .u/ if u 2 Ai fzi .u/ D 0 if u … Ai . The definition of an ultraproduct immediately implies that f : A ! X U is a bilipschitz embedding (the argument is the same as in Proposition 2.21). Let N D f .A/. Since the composition of two bilipschitz embeddings is a bilipschitz embedding, it suffices to find a bilipschitz embedding of N (with the metric induced from X U ) into X. Note. This passage from A to its image in X U is not essential for the proof of Theorem 2.6, it just simplifies some formulas in our proof. A similar step is more essential for other classes of embeddings. Remark 2.34. We would like to emphasize that the rest of the proof of Theorem 2.6 consists of establishing the fact that a locally finite metric subspace of X U admits a bilipschitz embedding into X. Remark 2.25 implies that the case where X is finite-dimensional is trivial. In this connection in the rest of the proof we assume that X is infinite-dimensional. Let Ni D ¹u 2 N : kukX U  2i º. It is clear that Ni are finite sets. By Proposition 2.31 there exist maps si : Ni ! X such that si .0/ D 0 and   1 ku  vk. (2.4) 8u, v 2 Ni ku  vk  ksi .u/  si .v/k  1 C i (The direct proof of this statement is simpler than the proof of Proposition 2.31, since we do not need the argument with nets.)

45

Section 2.3 Finitely determined

Since the sets Ni form an increasing sequence, any subsequence ¹sni º1 i D1 of the 1 sequence ¹si º1 maps ¹N º into X and satisfies (2.4). We are going to construct i i D1 i D1 a bilipschitz embedding of N into X using such subsequences. Note. We are going to pass to a subsequence in ¹si º1 i D1 several times. Each time we 1 keep the notation ¹si ºi D1 for the subsequence. Recall that a subspace M  X  is called 1-norming if 8x 2 X sup¹jf .x/j : f 2 M , kf k  1º D kxk. It is clear that may assume that X is separable (replacing it by the closure of the Swe 1 linear span of i D1 si .Ni /, if necessary). For a separable Banach space X there exists a separable 1-norming subspace M  X  . It can be constructed as follows: Let ¹xi º1 i D1 be a dense sequence in the unit sphere SX D ¹x 2 X : kxk D 1º. Let fi 2 SX  be such that fi .xi / D 1. It is easy to check that the closed linear span M of the sequence ¹fi º1 i D1 is 1-norming.  Let M  X be a separable 1-norming subspace. Then the natural embedding of X into M  is an isometry. We identify X with its image under this embedding. Since is M is separable, there is a subsequence in ¹si º such that the sequence ¹si .a/º1 i Dk convergent in the weak topology of M  for each a 2 Nk . We denote the weak limit of this sequence by m.a/. We need to select further subsequences of ¹si º. We do this in the following two steps. Step 1. If m.a/ ¤ m.b/ for a, b 2 Nj , we find and fix f D fa,b 2 SM satisfying f .m.a/  m.b// 

99 km.a/  m.b/k. 100

(2.5)

We select a subsequence of ¹si º satisfying 1 km.a/  m.b/k (2.6) 100 for k  j . This can be achieved because Nj is finite and the sequence ¹sk .a/º converges to m.a/ in the weak topology of M  . jf ..sk .a/  sk .b//  .m.a/  m.b///j 

Step 2. If for some a, b 2 N` and some j  ` the vector .sj .a/  sj .b//  .m.a/  m.b// is nonzero, we find and fix f D fj ,a,b 2 SM such that f ..sj .a/  sj .b//  .m.a/  m.b/// 99 k.sj .a/  sj .b//  .m.a/  m.b//k.  100 In such a case we assume that for all k > j the condition jf ..sk .a/  sk .b//  .m.a/  m.b///j 

1 ka  bk 1000

(2.7)

(2.8)

46

Chapter 2 Locally finite metric spaces

holds. This goal can be achieved because there are finitely many a, b 2 N` and because sk .a/ converges to m.a/ in the weak topology of M  . We introduce a map ' : N ! X by '.a/ D

kak  2i 1 2i  kak s .a/ C si C1 .a/ i 2i 1 2i 1

(2.9)

if 2i 1  kak  2i . This formula gives two definitions of '.a/ if kak D 2i , but one can easily check that they coincide. We start by considering the case where X is isomorphic to X ˚ R. To make this condition more clear we need to specify the norm on X ˚ R; we do this by k.x, t /k D kxkX C jt j. In this case we show that the embedding 'z : N ! X ˚ R given by '.a/ z D .'.a/, kak/ is a bilipschitz embedding. Recall that a hyperplane in a Banach space X is defined as the subspace ¹x 2 X : f .x/ D 0º, where f is a linear functional. A closed hyperplane is defined in the same way with the additional condition that f is continuous. It is easy to see that a Banach space X is isomorphic to X ˚ R if and only if X is isomorphic to its closed hyperplane. Gowers and Maurey (1993) proved that Banach spaces which are not isomorphic to their closed hyperplanes exist. Because of this we have to consider also the case where X is not isomorphic to X ˚ R. This is done in Section 2.3.1. Now we estimate the Lipschitz constants of 'z and .'/ z 1 . We consider three cases. Case 1. 2i 1  kbk  kak  2i In this case we have '.a/  '.b/ 2i  kak kak  2i 1 si .a/ C si C1 .a/ i 1 2 2i 1 kbk  2i 1 2i  kbk s .b/  si C1 .b/  i 2i 1 2i 1 kak  2i 1 2i  kak .s .a/  s .b// C .si C1 .a/  si C1 .b// D i i 2i 1 2i 1 kbk  kak kak  kbk C si .b/ C si C1 .b/. i 1 2 2i 1 D

Using (2.4) we get k'.a/  '.b/k 

2i  kak kak  2i 1 2ka  bk C 2ka  bk 2i 1 2i 1 C 4j kbk  kak j C 4j kak  kbk j,

therefore k'.a/ z  '.b/k z  C ka  bk for some absolute constant C .

(2.10)

47

Section 2.3 Finitely determined

Now we estimate k'.a/ z 'z.b/k from below. So we need to estimate k'.a/'.b/k from below. Observe that '.a/  '.b/ D m.a/  m.b/ 2i  kak .si .a/  si .b/  .m.a/  m.b/// 2i 1 (2.11) kak  2i 1 .s .a/  s .b/  .m.a/  m.b/// C i C1 i C1 2i 1 si .b/ si C1.b/ C .kbk  kak/  i 1 C .kak  kbk/ i 1 . 2 2 1 First we consider the case where km.a/  m.b/k  100 ka  bk. Let fa,b be the corresponding functional (see Step 1 above). We have C

k'.a/  '.b/k  fa,b .'.a/  '.b// D fa,b .m.a/  m.b//  i  2  kak C fa,b .si .a/  si .b/  .m.a/  m.b/// 2i 1   kak  2i 1 .s .a/  s .b/  .m.a/  m.b/// C fa,b i C1 i C1 2i 1 (2.12) fa,b .si C1 .b// fa,b .si .b// C .kak  kbk/ C .kbk  kak/  2i 1 2i 1 (2.5) & (2.6) 99 1 km.a/  m.b/k  km.a/  m.b/k  100 100  4j kbk  kak j  4j kak  kbk j 98 ka  bk  8.kak  kbk/.  10000 1 ka  bk, we get an estimate for k'.a/  '.b/k In the case where kak  kbk < 1000 (and thus for k'.a/ z  '.b/k) z from below of the form cka  bk, where c > 0 is an absolute constant. 1 The estimate k'.a/ z 'z.b/k from below in the case where kak kbk  1000 ka bk is immediate, we just recall that '.a/ z  '.b/ z D .'.a/  '.b// ˚ .kak  kbk/.

(2.13)

Hence, to finish the lower estimate for k'.a/ z '.b/k z in Case 1 it remains to consider 1 ka  bk. In this case we consider two subcases: the case where km.a/  m.b/k < 100 2i  kak ksi .a/  si .b/  .m.a/  m.b//k  2i 1 2i  kak ksi .a/  si .b/  .m.a/  m.b//k < 2i 1

1 ka  bk 10 1 ka  bk. 10

(2.14) (2.15)

48

Chapter 2 Locally finite metric spaces

We start with subcase (2.14). Let fi ,a,b be the functional found in Step 2. We get k'.a/  '.b/k  fi ,a,b .'.a/  '.b//  fi ,a,b .m.a/  m.b//  i  2  kak .si .a/  si .b/  .m.a/  m.b/// C fi ,a,b 2i 1   kak  2i 1 .s .a/  s .b/  .m.a/  m.b/// C fi ,a,b i C1 i C1 2i 1 fi ,a,b .si .b// fi ,a,b .si C1 .b// C .kbk  kak/  C .kak  kbk/ 2i 1 2i 1 (2.7),(2.8), & (2.14) 99 1 1 ka  bk  ka  bk  ka  bk > 1000 1000 100  4j kbk  kak j  4j kak  kbk j 88 ka  bk  8.kak  kbk/. D 1000

(2.16)

1 Now, as above, we consider the case where kak  kbk < 1000 ka  bk separately, and complete the argument in the same way as above. We turn to subcase (2.15). Recall (see (2.4)) that ksi .a/  si .b/k  ka  bk. Com1 bining this with (2.15) and with the inequality km.a/  m.b/k < 100 ka  bk, we get i

99 ksi .a/  si .b/  .m.a/  m.b//k  100 ka  bk and 2 2kak < 10 . (In the same way i1 99 99 we get the inequality ksi C1.a/  si C1.b/  .m.a/  m.b//k  100 ka  bk which we

use below.) Therefore

kak2i1 2i1

>

89 . 99

Applying the triangle inequality we get

   kak  2i 1   k'.a/  '.b/k   .si C1 .a/  si C1.b/  .m.a/  m.b///  i 1 2   i   2  kak    2i 1 .si .a/  si .b/  .m.a/  m.b/// (2.17)  km.a/  m.b/k  8.kak  kbk/ 1 1 89 99  ka  bk  ka  bk  ka  bk  8.kak  kbk/ > 99 100 10 100 78 ka  bk  8.kak  kbk/. D 100 1 kabk Now, as was done already twice, we consider the case where kakkbk < 100 separately, and complete the argument in the same way as above. This completes the argument in Case 1.

49

Section 2.3 Finitely determined

Case 2. 2i 1  kbk  2i  kak  2i C1 We have 2i  kbk si .b/ 2i 1 kbk  2i 1 2i C1  kak s .a/  si C1 .b/ C i C1 2i 2i 1 kak  2i si C2.a/. C 2i

'.a/  '.b/ D 

Estimate from above: The first and the last terms have norms  4.kak  kbk/. The norm of the two remaining terms can be estimated as follows:  i C1  2  kbk  2i 1  kak   s .a/  s .b/ i C1 i C1   2i 2i 1   i   2  .kak  2i / .2i  kbk/  2i 1   s .a/ C s .b/ D i C1 i C1  2i 2i 1 (2.18)   i i   .kak  2 / .2  kbk/ si C1 .a/ C si C1.b/ D  .si C1 .a/  si C1 .b//  2i 2i 1  2ka  bk C 4.kak  2i / C 4.2i  kbk/  6ka  bk. Now we turn to estimates from below. Rewriting and estimating some of the terms as in (2.18) we get k'.a/  '.b/k     .kak  2i / .2i  kbk/  si C1 .a/ C si C1 .b/  .si C1 .a/  si C1 .b//   i i 1 2 2 kak  2i 2i  kbk ks .b/k  ksi C2 .a/k i 2i 1 2i  ksi C1.a/  si C1.b/k  12.kak  kbk/ 

(2.19)

 ka  bk  12.kak  kbk/, where in the last line we used (2.4). We complete the proof in this case as three times 1 before. If kak  kbk < 20 ka  bk, we get an estimate using (2.19). Otherwise we use (2.13). Case 3. 2i 1  kbk  2i < 2k1  kak  2k In this case we have z  '.b/k z  kak  kbk  2k1  2i . 3.2k C 2i /  3.kak C kbk/  k'.a/

50 Since

Chapter 2 Locally finite metric spaces

3  2kC1 3.2k C 2i /   24 2k1  2i 2k2

and kak C kbk  ka  bk  kak  kbk, it follows that 'z is bilipschitz. Completion of the proof for spaces non-isomorphic to their closed hyperplanes We have proved Theorem 2.6 in the case where X is isomorphic to its closed hyperplanes. To prove Theorem 2.6 in the general case we find a Lipschitz map : RC ! X such that the map 'y : N ! X given by '.a/ y D .kak/ C '.a/ works just in the same way as '. z It is easy to see that for this to be true we need the inequality k'.a/ y  '.b/k y  ˛.kak  kbk/ to hold for some ˛ > 0. We rewrite this inequality as k .kak/  .kbk/ C .'.a/  '.b//k  ˛.kak  kbk/.

(2.20)

Let Ti D ¹'.u/ : u 2 N , kuk  3i C1 º. It is clear that all these sets are finite. We construct inductively a sequence ¹Fi º1 i D1 of finite-dimensional subspaces of X and a sequence ¹pi º1 of vectors. We let F 1 D lin.T1 /. (Here and below lin.T / denotes i D1 the linear span of a set T in a vector space.) Recall that we have assumed that X is infinite-dimensional. Therefore there is p1 2 SX such that dist.p1 , F1 / D 1. Let F2 D lin.T2 [¹p1 º/ and p2 2 SX be such that dist.p2 , F2 / D 1. Let F3 D lin.T3 [¹pi º2i D1/, we continue in an obvious way. We introduce the map : RC ! X in the following way: 8 ˆ if 0  t  31 tp ˆ ˆ 1 ˆ ˆ ˆ if 31  t  32 ˆ31 p1 C .t  31/p2 ˆ ˆ ˆ ˆ ˆ31 p1 C .32  31 /p2 C .t  32 /p3 if 32  t  33 ˆ < ::: .t/ D : : : ˆ ˆ 1 2 1 ˆ ˆ 3 p1 C .3  3 /p2 C    ˆ ˆ ˆ ˆ ˆ C.3k  3k1/pk C .t  3k /pkC1 if 3k  t  3kC1 ˆ ˆ ˆ ˆ :: : : ::: Since kpi k D 1, the map is 1-Lipschitz. It remains to show that the inequality (2.20) holds. We consider three cases: (a) 3i  kbk  kak  3i C1. The argument used in this case can be used also in the case 0  kbk  kak  31 . Minor adjustments of the other cases are needed if 0  kbk  31  kak.

51

Section 2.3 Finitely determined

(b) 3i 1  kbk  3i  kak  3i C1 (c) 3k1  kbk  3k  3i  kak  3i C1, where k < i. In case (a) we have k .kak/  .kbk/ C .'.a/  '.b//k D k.kak  kbk/pi C1 C .'.a/  '.b//k  kak  kbk. The last inequality follows from dist.pi C1 , Fi C1 / D kpi C1k and '.a/, '.b/ 2 Ti , therefore '.a/  '.b/ 2 Fi  Fi C1 . In case (b) we consider two subcases: 1 kak  3i  .kak  kbk/. 3 1 kak  3i < .kak  kbk/. 3

(2.21) (2.22)

In subcase (2.21) we get k .kak/  .kbk/ C .'.a/  '.b//k   D .kak  3i /pi C1 C .3i  kbk/pi C .'.a/  '.b// 1  kak  3i  .kak  kbk/, 3 where we use (2.21), dist.pi C1 , Fi C1 / D kpi C1k, and pi , '.a/, '.b/ 2 Fi C1 . In subcase (2.22) we have k .kak/  .kbk/ C .'.a/  '.b//k   D .kak  3i /pi C1 C .3i  kbk/pi C .'.a/  '.b//    .3i  kbk/pi C .'.a/  '.b//  .kak  3i / 1  .3i  kbk/  .kak  3i /  .kak  kbk/. 3 In this chain of inequalities we use the fact that '.a/, '.b/ 2 Ti , dist.pi , Fi / D kpi k, in the last line we use the inequality kbk  3i  kak and (2.22). Now we consider case (c). In this case we consider two subcases: 1 kak  3i  .3i  3i 1 /. 3 1 kak  3i < .3i  3i 1 /. 3

(2.23) (2.24)

52

Chapter 2 Locally finite metric spaces

In subcase (2.23) our argument is very close to the argument above: k .kak/  .kbk/ C .'.a/  '.b//k  D .kak  3i /pi C1 C .3i  3i 1/pi

 C    C .3k  kbk/pk C .'.a/  '.b//

 kak  3i 1  .3i  3i 1 / 3 2 D 3i C1 27 2 .kak  kbk/.  27 Now we consider subcase (2.24). In this subcase we get (from the previous formula) .kak/  .kbk/ D .kak  3i /pi C1 C .3i  3i 1/pi C r , where r is a vector contained in Fi . Thus k .kak/  .kbk/ C .'.a/  '.b//k    .3i  3i 1/pi C r C .'.a/  '.b//  k.kak  3i /pi C1k 1 4 4 .kak  kbk/,  .3i  3i 1 /  .3i  3i 1/ D  3i 1  3 3 27 where we use the fact that r and '.a/  '.b/ are in Fi and dist.pi , Fi / D kpi k. This completes the proof of (2.20) and thus Theorem 2.6.

2.3.2 Proof in the coarse case Proof of Theorem 2.6 contains almost everything we need for the proof of Theorem 2.7, we need just to modify the beginning of the proof. Proof of Theorem 2.7. We pick a point O in A and let Ai D ¹a 2 A : dA .O, a/  2i º. By the assumption there are uniformly coarse maps fi : Ai ! X. We may and shall assume that fi .O/ D 0. Let U be a free ultrafilter on N. The maps ¹fi º1 i D1 induce a map f : A ! X U defined by f .u/ D ¹fzi .u/º1 , where i D1 ´ fi .u/ if u 2 Ai fzi .u/ D 0 if u … Ai . The definition of an ultraproduct immediately implies that f : A ! X U is a coarse embedding (see the proof of Proposition 2.21). Let N D f .A/, it is easy to check that in N with the metric induced from X U all balls of finite radius have finite cardinality.

Section 2.4 Type and cotype

53

The argument of the proof of Theorem 2.6 shows that there is a bilipschitz embedding of N into X (see Remarks 2.34 and 2.35). Since the composition of a coarse and a bilipschitz embedding is a coarse embedding, the proof is completed.

2.3.3 Remarks on extensions of finite determination results Theorems 2.6 and 2.7 can be extended in two directions: Remark 2.35. In the proofs of Theorems 2.6 and 2.7 we never used the condition that the metric space A is uniformly discrete. For this reason these results remain true if the only assumption on A is that for every a 2 A and every r > 0 the set ¹u 2 A : dX .a, u/  r º is finite. Remark 2.36. It is worth mentioning that our argument implies that a result similar to Theorems 2.6 and 2.7 holds for any class E of embeddings provided that (a)

There is a notion of being uniformly in E for a collection of maps of finite metric spaces into a Banach space.

(b)

The notion in (a) is such that if all finite subspaces of a metric space A admit uniformly-in-E embeddings into a Banach space X, then there is an embedding of the class E of A into an ultraproduct X U where U is a free ultrafilter (see the construction below).

(c)

The image of a locally finite metric space under an embedding of the class E is locally finite.

(d)

A composition of an embedding of the class E and a bilipschitz embedding is in E.

2.4 Banach space theory: type and cotype of Banach spaces, Khinchin and Kahane inequalities 2.4.1 Definitions: Rademacher type and cotype, parallelogram identity One of the most important directions of research in Banach space theory is to find embeddings of Banach spaces, one into another; and to find obstructions for such embeddings. Of course the results depend heavily on the class of “admissible” embeddings. In classical linear theory the most important class of embeddings is the class of linear isomorphic embeddings, that is, linear maps T : X ! Y satisfying ckxkX  kT xkY  C kxkX for some 0 < c  C < 1. Such a T is called Kisomorphic, where K D Cc . Sometimes linear isomorphic embeddings are called just isomorphic embeddings. One of the important tools and outstanding achievements in the linear isomorphic theory is the type-cotype theory for Banach spaces.

54

Chapter 2 Locally finite metric spaces

Definition 2.37. Rademacher functions ¹rk .t /º1 on Œ0, 1 are defined by kD0 ´ 1 for all t 2 Œ0, 1, if k D 0 rk .t / D k sign.sin 2  t / if k  1. A Banach space X is said to have Rademacher type p, 1  p  2 if there exists a constant Tp .X/ < 1 such that for every n and for every x1 , : : : , xn 2 X, p 1=p Z 1X 1=p X n  n  p   ri .t /xi  dt  Tp .X/ kxi kX , (2.25)  0

X

i D1

i D1

where ¹ri º are Rademacher functions. The Banach space X is said to have Rademacher cotype q, 2  q  1 if there exists a constant 0 < Cq .X/ < 1 such that for every n and for every x1, : : : , xn 2 X, q 1=q Z 1X 1=q X n  n  1 q   ri .t /xi  dt  kxi kX . (2.26)  Cq .X/ 0 X i D1

i D1

In the case where q D 1 we understand inequality (2.26) as  n  X  1  max kxi kX . r .t /x max  i i   0t 1 C 1 .X/ i X i D1

Let us explain the restrictions on p and q in this definition and give some simple examples. Observation 2.38. The triangle inequality implies that each Banach space has type 1 with constant 1, and cotype 1 with constant 1. In fact, to show the first statement we observe that the triangle inequality implies  n  n X X    r .t /x  kxi kX i i  i D1

X

i D1

for each t 2 Œ0, 1. The second statement can be proved using induction on n and the following two observations. We may assume that maxi kxi k D kx1k. For every z 2 X and i D 1, : : : , n we have z D 12 ..z C xi / C .z  xi //. Therefore max¹kz C xi k, kz  xi kº  kzk. Definition 2.39. If a Banach space X has some type p > 1, we say that X has nontrivial type. If a Banach space X has some cotype q < 1, we say that X has nontrivial cotype. Proposition 2.40. Any Hilbert space (finite- or infinite-dimensional) has type 2 with constant 1 and cotype 2 with constant 1.

55

Section 2.4 Type and cotype

Proof. Let x1, : : : , xn be elements of a Hilbert space H . Then  n 2

X  X 1 n n 1 X X X    r .t /x D r .t /x , r .t /x ri .t /rj .t /hxi , xj i. D i i i i j j   H

i D1

i D1

j D1

i D1 j D1

Now, when we integrate with respect to t , we use the fact that Z 1 ri .t /rj .t /dt D ıi ,j (Kronecker delta), 0

and get

 n 1 X

Z

 

0

i D1

2 1=2  X 1=2 n  2  ri .t /xi  dt D kxi kH . H

(2.27)

i D1

This proves the proposition. To develop some theory of type-cotype we need an important inequality showing that we can replace inequalities (2.25) and (2.26) in the definitions of Rademacher type and cotype with  n 1 X

Z 0

and

 

i D1

 n 1X

Z 0

 

i D1

2 1=2 1=p X n  p zp .X/ ri .t /xi  dt  T kx k i X  X

2 1=2  ri .t /xi    dt X

(2.28)

i D1

1 Czq .X/

X n

q

kxi kX

1=q ,

(2.29)

i D1

respectively, and this will affect the constants only. Even more, we can replace 2 in the left-hand sides of these inequalities with any number in Œ1, 1/, and again, this change will affect the constants only: the obtained inequalities will characterize the same class of Banach spaces. This goal will be achieved in the next section. Remark 2.41. Now we can explain why in the definition of type we restrict to p  2 and in the definition of cotype to q  2. The reason is that any space which has dimension at least 1 does not have any type p > 2 and does not have any cotype q < 2. Since type and cotype are obviously inherited by subspaces and each Banach space, except for the space ¹0º, contains a one-dimensional subspace, it suffices to show that a one-dimensional space (which can be assumed to be a Hilbert space) does not have any type p > 2 and does not have any cotype q < 2. To see this we combine equality (2.27) with the equivalent definitions of type and cotype in (2.28) and (2.29). In fact for a sequence ¹xi ºniD1 consisting of vectors of norm 1 (it can be the same vector repeated p p n times) in the space X D R1 , we get n  Tzp .X/  n1=p and n  z 1 n1=q . For p > 2 or q < 2 we get a contradiction for sufficiently large n.

Cq .X/

56

Chapter 2 Locally finite metric spaces

The converse to Proposition 2.40 is also true, spaces having type 2 with constant 1 and cotype 2 with constant 1 are isometric to a Hilbert space. It is sufficient even to require the corresponding inequalities for n D 2 only. Definition 2.42. We say that vectors x and y in a Banach space satisfy the parallelogram identity if kx C yk2 C kx  yk2 D 2kxk2 C 2kyk2

(2.30)

Observe that the inequality kx1 C x2k2 C kx1  x2 k2  2kx1k2 C 2kx2k2 can be written as 12 .kx1 C x2 k2 C kx1  x2k2 /  kx1k2 C kx2k2 and so it is just the type-2 inequality (2.25) with constant 1 and n D 2. Theorem 2.43. A Banach space is a Hilbert space if and only if any two vectors in it satisfy the parallelogram identity. The corresponding inner product is uniquely determined by the norm. Proof. Properties of inner product give immediately kx C yk2 C kx  yk2 D hx C y, x C yi C hx  y, x  yi D hx, xi C hy, xi C hx, yi C hy, yi C hx, xi  hy, xi  hx, yi C hy, yi D 2hx, xi C 2hy, yi D 2kxk2 C 2kyk2. Therefore any pair of vectors in a Hilbert space satisfies the parallelogram identity. Now we assume that any two vectors in a Banach space X satisfy the parallelogram identity. Observe that any two vectors in a Hilbert space satisfy also the following identity: kx C yk2  kx  yk2 D hx C y, x C yi  hx  y, x  yi D hx, xi C hy, xi C hx, yi C hy, yi  hx, xi C hy, xi C hx, yi  hy, yi D 2hx, yi C 2hy, xi. We continue our proof in the real case only (see Notes and Remarks for the complex case). In the real case hx, yi D hy, xi, and we get that the only possibility to define the inner product corresponding to the norm is 1 .kx C yk2  kx  yk2/. (2.31) 4 It remains to check that under the assumption that the parallelogram identity holds, this formula defines an inner product on X, and that the norm of X corresponds to this inner product. Observe that hx, yi D hy, xi, kxk2 D hx, xi, and hx, 0i D 0 are obvious. So we need to verify that hx, yi given by (2.31) satisfies hx, yi D

(a) hx1 C x2, yi D hx1 , yi C hx2 , yi (b) 8˛ 2 R h˛x, yi D ˛hx, yi

57

Section 2.4 Type and cotype

Observe that the parallelogram identity implies that for all u, v, y 2 X the following equality holds (the first and the last equalities come from our definition of the inner product, the second from the parallelogram identity): hu C v, yi C hu  v, yi 1 D .ku C v C yk2  ku C v  yk2 C ku  v C yk2  ku  v  yk2/ 4 1 D .ku C yk2 C kvk2  ku  yk2  kvk2/ 2 D 2hu, yi. In particular, h2u, yi D 2hu, yi. Replacing u C v by x1 and u  v by x2 , we get Dx C x E 1 2 , y D hx1 C x2, yi. hx1 , yi C hx2 , yi D 2 2 Using this equality and induction we get that hmx, yi D mhx, yi for all m 2 N. 1 1 Letting u D mx, we get m hu, yi D h m u, yi for all m 2 N. Since it is also clear that (2.31) implies hx, yi D hx, yi, we get 8˛ 2 Q h˛x, yi D ˛hx, yi. Now item (b) above follows because it can be written as ˛ 1 .k˛x C yk2  k˛x  yk2/ D .kx C yk2  kx  yk2/, 4 4 the identity is known to hold for rational ˛, and both sides are continuous with respect to ˛. We add some more observations related to Theorem 2.43 in Exercise 2.68. In this connection we also would like to mention the following very interesting result (we do not prove it because we are not going to use it in this book). Theorem 2.44 (Kwapie´n theorem). A Banach space is isomorphic to a Hilbert space if and only if it has both type 2 and cotype 2.

2.4.2 Kahane–Khinchin inequality Theorem 2.45 (Kahane–Khinchin inequality). For each 1  p < 1 there exists a constant Cp such that, for any Banach space X and any finite sequence ¹xi ºniD1 in X the inequality  p 1=p Z 1 X Z 1 X  n   n     ri .t /xi  dt  ri .t /xi     dt 0

i D1

0

X

i D1

Z  Cp 0

holds.

X

 n  1X   ri .t /xi    dt i D1

X

(2.32)

58

Chapter 2 Locally finite metric spaces

Note. This inequality in the case where X D R is called the Khinchin inequality and in the vector-valued case the inequality is called the Kahane inequality. It will be convenient to rewrite and to prove inequality (2.32) using probabilistic language, that is, considering instead of the sequence of Rademacher functions a sequence ¹"i º1 i D1 of independent random variables on a probability space ., †, P/, such that each "i takes values 1 and 1, each with probability 12 . The name Bernoulli random variables is historically more appropriate for such random variables. However, we shall follow the Banach-space-theoretical tradition, and call the sequence ¹"i º1 i D1 a Rademacher sequence. With this notation inequality (2.32) becomes:  p 1=p X  X    n  n    E "i .!/xi   E "i .!/xi   X

i D1

X

i D1

  n  X   Cp E "i .!/xi  

.

(2.33)

X

i D1

We make some remarks before proving (2.33). To be more precise, only the righthand side inequalities in (2.32) and (2.33) are attributed to Khinchin and Kahane. The left-hand side inequalities are easy consequences of the Hölder inequality which we recall now. Theorem 2.46 (Hölder inequality). Suppose that 1 < p < 1 and p1 C f 2 Lp ., †, P/ and g 2 Lq ., †, P/. Then fg 2 L1., †, P/, and 1=p  1=q  Ejgjq jE.fg/j  Ejfgj  Ejf jp .

1 q

D 1. Let

(2.34)

Remark 2.47. A similar inequality holds for f 2 Lp .0, 1/ and g 2 Lq .0, 1/. In this case the inequality is written as ˇZ 1 ˇ Z ˇ ˇ ˇ ˇ .f .t /g.t //dt ˇ ˇ 0

1 0

1=p  Z

1

jf .t /jp dt

1=q jg.t /jq dt

.

(2.35)

0

A similar inequality holds for f 2 Lp ., †, / and g 2 Lq ., †, /, where ., †, / is any measure space: ˇZ ˇ Z 1=p  Z 1=q ˇ ˇ p q ˇ .f .t /g.t // dˇ  jf .t /j d jg.t /j d . ˇ ˇ 



(2.36)



To getthe  of (2.33) from (2.34) we apply the latter to the functions P left-hand side f .!/ D  niD1 "i .!/xi X and g.!/ D 1 for each ! (observe that the function f is

59

Section 2.4 Type and cotype

bounded and hence satisfies the condition f 2 Lp ., †, P/). We get  p 1=p X  X    n  n  1=q  Ej1jq "i .!/xi  "i .!/xi  E   E   X

i D1

i D1

X

i D1

X

p 1=p  X   n  D E "i .!/xi  . 

So we turn to the right-hand side inequality of (2.33) (the main point in the proof). Recall that a random variable f :  ! X is called symmetric if P.f 2 B/ D P.f 2 B/ for each Borel set B  X. Lemma 2.48. Let f :  ! X be a symmetric random variable, and x 2 X. Then 1 P¹! : kf .!/ C xk  kxkº  . 2 Proof. Observe that for each ! kx C f .!/ C x  f .!/k  max¹kx C f .!/k, kx  f .!/kº. kxk D 2 Therefore   .¹! : kx C f .!/k  kxkº [ ¹! : kx  f .!/k  kxkº/. If we introduce the notation B D ¹y 2 X : kx C yk  kxkº, we can write   .¹! : f .!/ 2 Bº [ ¹! : f .!/ 2 Bº/. Therefore P.¹! : f .!/ 2 Bº/  12 or P.¹! : f .!/ 2 Bº/  12 . It remains to observe that since the random variable f is symmetric, these probabilities are the same. For given n 2 N and x1 , : : : , xn 2 X we introduce random variables Ym :  ! X .1  m  n/ given by m X Ym .!/ D "i .!/xi . i D1

Lemma 2.49. P¹! : maxmn kYm .!/k > r º  2P¹! : kYn.!/k > r º for all r > 0. Proof. Let B D ¹! : kYn .!/k > r º A D ¹! : max kYm .!/k > r º mn

A1 D ¹! : A2 D ¹! : A3 D ¹! : ::: An D ¹! :

kY1.!/k > r º kY1.!/k  r , kY2.!/k > r º kY1.!/k  r , kY2.!/k  r , kY3.!/k > r º kY1.!/k  r , : : : , kYn1 .!/k  r , kYn.!/k > r º.

60

Chapter 2 Locally finite metric spaces

It is clear that A D [nmD1 Am , that P the sets Am , m D 1 : : : , n, are disjoint, and that B  [nmD1 Am . Therefore P.A/ D nmD1 P.Am / and n X

P.B/ D

P.Am \ B/.

(2.37)

mD1

Notice that [

Am D

¹! 2 Am : "j .!/ D j , j 2 ¹1, : : : , mºº.

1 ,:::,m 2¹1,1º

(We partition Am according to the values of 1, : : : , m .) On the other hand, by Lemma 2.48  X   X n X  m   m  1    P  j xj C "j .!/xj    j xj   2 j D1

j DmC1

j D1

for each collection 1 , : : : , m of ˙1. By independence of ¹"i ºniD1 we get   P ¹! 2 Am : "j .!/ D j , j 2 ¹1, : : : , mºº \ ¹! : kYn.!/k  kYm .!/kº  D P ¹! 2 Am : "j .!/ D j , j 2 ¹1, : : : , mºº  X ³ X ² n X   m   m    \ !:  j xj C "j .!/xj    j xj   j D1

j DmC1

j D1

 1   P ¹! 2 Am : "j .!/ D j , j 2 ¹1, : : : , mºº . 2 Summing these inequalities over all collections 1 , : : : , m of ˙1, we get P .Am \ ¹! : kYn.!/k  kYm .!/kº/  Therefore P .Am \ B/ 

1 P.Am /. 2

1 P.Am /. 2

Summing these inequalities over m and using (2.37) we get P .B/  12 P .A/, which is the desired inequality. We use the notation introduced above. Lemma 2.50. P¹! : kYn .!/k > 2r º  4.P¹! : kYn.!/k > r º/2 for all r > 0. P Proof. For each m 2 ¹1, : : : , nº, the random variables k niDm "i xi k, "1, : : : , "m are independent. We would like to emphasize that this is true despite the fact that "m is

61

Section 2.4 Type and cotype

included into the first random variable, too. One of the ways of showing the indepenP dence is to observe that the first random variable is equal to kxm C niDmC1 "i "m xi k, and that the variables P "1 , : : : , "m , "mC1 "m , "mC2"m , : : : , "n"m are independent. Hence the events ¹! : k niDm "i xi k > r º and Am are independent. Observe that if ! 2 Am satisfies kYn .!/k > 2r , then kYn.!/  Ym1 .!/k > r , where we let Y0.!/ D 0 for all ! 2 . Observe that Lemma 2.49 implies  X ² ³   n  " x P !:  > r  2P¹! : kYn.!/k > r º. i i  i Dm

In fact, one can see that the result of Lemma 2.49 does not depend on the ordering of indices i. Hence we have   n  X P.Am \ ¹! : kYn.!/k > 2r º/  P.Am \ ¹! :  " i xi   > r º/  i Dm

  n  X " i xi  D P.Am /P¹! :   > rº  i Dm

 2P.Am /P¹! : kYn .!/k > r º. Summing in m and using the inclusion ¹! : kYn.!/k > 2r º  A and Lemma 2.49 again, we get P¹! : kYn.!/k > 2r º  2P.A/P¹! : kYn .!/k > r º  4.P¹! : kYn .!/k > r º/2. Proof of (2.33) and of Theorem 2.45. Let 1  p < 1 and let ¹xi ºniD1 be a finite set of vectors in X. We may assume without loss of generality that  X   n  " i xi  E  D EkYn k D 1. mD1

Therefore P¹! : kYn .!/k > 8º 

1 8

Using Lemma 2.50 repeatedly we get  2 1 , P¹! : kYn .!/k > 2  8º  4 8  4 1 P¹! : kYn .!/k > 22  8º  43 , 8  8 3 7 1 P¹! : kYn .!/k > 2  8º  4 , 8

(2.38)

62

Chapter 2 Locally finite metric spaces

and so on. Now we use the inequality which is easy to get from the definition of expectation:   E kYn kp  8p  P¹! : kYn.!/k  8º C .2  8/p  P¹! : kYn .!/k > 8º C .22  8/p  P¹! : kYn .!/k > 2  8º C    2j 1 X p j C1 p 2j 1 1 .2  8/  4 . 8 C 8 j D0

It remains to observe that this series is convergent. Denoting its sum by .Cp /p , we get (2.33). This completes the proof of Theorem 2.45. Usually the Khinchin inequality is written in an equivalent but slightly different form than (2.32): Theorem 2.51 (Khinchin inequality). For each 1  p < 1 there exist constants Ap and Bp such that for any finite sequence ¹xi ºniD1 of scalars the inequality Ap

X n

1=2 2

jxi j

Z 

ˇ n 1ˇX ˇ ˇ

0

i D1

i D1

ˇp 1=p X 1=2 n ˇ 2 ˇ ri .t /xi ˇ dt  Bp jxi j

(2.39)

i D1

holds. Theorem 2.51 is an immediate consequence of Theorem 2.45 because the Rademacher sequence is orthonormal and therefore ˇ n 1 ˇX

Z 0

ˇ ˇ

i D1

ˇ2 n X ˇ ˇ ri .t /xi ˇ dt D jxi j2 . i D1

We shall also use the Khinchin inequality in probabilistic notation: Ap

X n i D1

1=2 2

jxi j

ˇp 1=p X 1=2  ˇX n ˇ ˇ n 2 ˇ ˇ  Eˇ "i .!/xi ˇ  Bp jxi j . i D1

(2.40)

i D1

Remark 2.52. Theorem 2.51 (that is, the Khinchin inequality) implies that the space `2 admits an isomorphic embedding into any of the spaces Lp .1  p < 1/ (see Observation 4.12 for more details). For L1 this result also holds, but the reason is different: first we partition Œ0, 1 into infinitely many measurable subsets of nonzero measure and show that functions which are constant on each of the subsets form a subspace isometric to `1 . After that we prove a linear analogue of Proposition 1.17,

63

Section 2.4 Type and cotype

namely we prove that each separable Banach space X admits a linear isometric embedding into `1 and that this embedding can be constructed as follows. Let ¹xi º be a dense sequence in X. Let ¹fi º be a sequence of functionals satisfying the conditions kfi k D 1 and fi .xi / D kxi k. Then the mapping X 7! `1 given by x 7! ¹fi .x/º1 i D1 is a linear isometric embedding. Theorem 2.53. The space Lp D Lp .0, 1/, 1  p < 1 has type min¹p, 2º and cotype max¹p, 2º. Remark 2.54. The same proof works for Lp ., †, /, where ., †, / is an arbitrary measure space. Proof. First we consider the case 1  p  2. In this case we need to show that Lp has cotype 2. Let x1, : : : , xn 2 Lp . By the Kahane inequality (2.33), it suffices to prove the inequality p 1=p  n 1=2  X   n 1 X 2   "i .!/xi   kxi k , E Czp i D1 i D1 where Czp is a constant which does not depend on the choice of n and ¹xi ºniD1, but, possibly, depends on p. We have p 1=p   Z 1 ˇ n ˇp 1=p  X  n  ˇX ˇ ˇ E "i .!/xi  D E "i .!/xi .t /ˇˇ dt   ˇ 0

i D1

Z

1

D 0

i D1

ˇ n ˇp 1=p ˇX ˇ Eˇˇ "i .!/xi .t /ˇˇ dt i D1 n 1X

Z  Ap 0

p=2 jxi .t /j2

(2.41)

1=p dt

,

i D1

where we used the Khinchin inequality (2.40). Comparing the right-hand side of the inequality (2.41) with the desired result, we see that it suffices to prove the inequality n 1 X

Z 0

p=2 2

jxi .t /j

1=p dt



X n Z

i D1

i D1

2=p 1=2

1

p

jxi .t /j dt

.

0

To prove this inequality we use the fact that r :D 2=p  1. Therefore there is a supporting functional g D ¹gi ºniD1 2 `nr0 , where r 0 is determined by r1 C r10 D 1, such that kgk D 1 (the norm in `nr0 ) and X n Z i D1

1 0

2=p p=2 jxi .t /jp dt

D

n Z X i D1

1 0

 jxi .t /jp dt gi .

64

Chapter 2 Locally finite metric spaces

We have X n Z

2=p 1=2

1

p

jxi .t /j dt

D

X n Z

0

i D1

n 1 X

D 0

n X

p

jxi .t /j gi 

i D1

p i D1 jxi .t /j gi

 k¹jxi .t /jp ºniD1kr  kgkr 0 , and get

p=2 2

jxi .t /j

 1=p jxi .t /j gi dt . p

i D1

Pn

X n

p

0

i D1

Z

Now we apply the inequality

 1=p jxi .t /j dt gi

1

kgk D

X n

i D1

p=2 2

jxi .t /j

,

i D1

the desired inequality follows. Our next purpose is to show that the space Lp , 1  p  2, has type p. Let x1, : : : , xn 2 Lp , we have p 1=p   Z  X   n  E "i .!/xi  D E  i D1

Z

1

D 0

ˇ n 1 ˇX 0

ˇ ˇ

ˇp 1=p ˇ "i .!/xi .t /ˇˇ dt

i D1

ˇX ˇp 1=p ˇ n ˇ ˇ Eˇ "i .!/xi .t /ˇˇ dt i D1 n 1 X

Z  Bp 0

p=2 2

jxi .t /j

(2.42)

1=p dt

.

i D1

At this point we use the fact that for p  2 the following inequality holds: X n

1=2 2

jxi .t /j



X n

i D1

1=p p

jxi .t /j

.

(2.43)

i D1

This inequality P can be proved in the following way. It is clear that it suffices to consider the case where niD1 jxi .t /j2 D 1. In this case jxi .t /j  1 for each i and t . Therefore jxi .t /j2  jxi .t /jp , and the proof of (2.43) is completed. Combining (2.42) and (2.43) we get p 1=p Z  X   n  " .!/x  B E i i p  i D1

D Bp

n 1X

1=p jxi .t /jp dt

0 i D1

X n i D1

kxi kp

1=p .

65

Section 2.4 Type and cotype

Now we turn to Lp with p  2. First we prove the cotype inequality. We have p 1=p   Z 1 ˇ n ˇp 1=p  X  ˇX ˇ  n   ˇ "i .!/xi  D E "i .!/xi .t /ˇˇ dt E ˇ 0

i D1

Z

1

D 0

i D1

ˇX ˇp 1=p ˇ n ˇ ˇ Eˇ "i .!/xi .t /ˇˇ dt i D1 n 1 X

Z  Ap 0

p=2 2

jxi .t /j

1=p dt

.

i D1

At this point we use the fact that for p  2 the inequality 1=2  X 1=p X n n 2 p jxi .t /j  jxi .t /j i D1

i D1

holds (it can be proved in the same way as (2.43)). We get p 1=p Z 1 X  X 1=p n  n  p   E "i .!/xi   Ap jxi .t /j dt 0 i D1

i D1

D Ap

X n

kxi k

1=p p

.

i D1

Now we turn to proving that Lp .2  p < 1/ has type 2. Let x1 , : : : , xn 2 Lp . By the Kahane inequality (2.33), it suffices to prove the inequality p 1=p X 1=2  X n   n 2   z "i .!/xi   Tp kxi k , E i D1

i D1

where Tzp is a constant which does not depend on the choice of n and ¹xi ºniD1, but, possibly, depends on p. Using the Khinchin inequality as in (2.42) we get p 1=p  X Z 1 X p=2 1=p n  n  2   E "i .!/xi   Bp jxi .t /j dt i D1

0

i D1

We use the same trick as in the first part of the proof, now with r D p=2  1, and with respect to the interval Œ0, 1 and not the set ¹1, : : : , nº. There is a supporting functional g D g.t / 2 Lr 0 , where r 0 is determined by r1 C r10 D 1, such that kgk D 1 (the norm is in the space Lr 0 ) and Z 1 X p=2 2=p Z 1  X  n n jxi .t /j2 dt D jxi .t /j2 g.t / dt . 0

i D1

0

i D1

66

Chapter 2 Locally finite metric spaces

We have Z

n 1 X 0

1=p

p=2 jxi .t /j2

Z

n 1 X

D

dt

0

i D1

D

X n Z

 1=2 jxi .t /j2 g.t / dt

i D1 1

1=2

jxi .t /j2 g.t / dt

.

i D1 0

Now we apply the inequality Z

1

R1 0

jxi .t /j2 g.t / dt  k.jxi .t /j2 /kr  kgkr 0 , and get

jxi .t /j2 g.t / dt  kxi k2kgk D kxi k2,

0

the desired inequality follows.

2.4.3 Characterization of spaces with trivial type or cotype The following result shows an important relation between type, cotype, and finite representability. Theorem 2.55 (Maurey–Pisier theorem). Let X be an infinite-dimensional Banach space, p.X/ D sup¹p : X has type pº and q.X/ D inf¹q : X has cotype qº. Then the spaces `p.X/ and `q.X/ are finitely representable in X. Theorem 2.55 is beyond the scope of this book. We are going to prove only the following special cases of it (only these cases will be used in this book). Theorem 2.56. A Banach space X has no nontrivial type if and only if `1 is finitely representable in X. A Banach space X has no nontrivial cotype if and only if `1 is finitely representable in X. Proof. Let X be an infinite-dimensional Banach space, and ¹"i ºniD1 be a Rademacher sequence. For each n 2 N, define ˛n .X/ to be the least constant ˛ so that 21=2  X 1=2 X n   n 2   E "i .!/xi  ˛ kxi k i D1

i D1

for all sequences ¹xi ºniD1 in X. We define ˇn .X/ to be the least constant ˇ so that X n i D1

1=2 kxi k

2

2 1=2  X   n   ˇ E "i .!/xi   i D1

p p for all sequences ¹xi ºniD1 in X. We have 1  ˛n .X/  n, 1  ˇn .X/  n. The lower bounds should hold because the inequalities should be satisfied for sequences

67

Section 2.4 Type and cotype

with exactly one nonzero vector. The upper bounds follow from the following inequality: 2 1=2  X n X   n   max kxi k  E "i .!/xi   kxi k. i i D1

i D1

The right-hand-side inequality is an immediate consequence of the triangle inequality. The left-hand-side inequality follows from convexity of the function k  k2 , which implies kx  yk2 C kx C yk2  2kxk2. It is also easy to see that ˛n .X/ and ˇn .X/ are non-decreasing. Lemma 2.57. The parameters ˛n .X/ and ˇn .X/ are submultiplicative in the sense that they satisfy (2.44) 8m, n 2 N ˛mn .X/  ˛m .X/˛n .X/ and 8m, n 2 N

ˇmn .X/  ˇm .X/ˇn .X/.

(2.45)

Proof. Let ¹"i ,j .!/ : i D 1, : : : , m; j D 1, : : : , nº be a Rademacher sequence on a probability space ., † , P/, numbered as a double-parametric sequence (but still being an independent set of random variables with values 1 and 1, taking each of the values with probability 12 ). Let ¹"i ./ : i D 1, : : : , mº be a Rademacher sequence on a probability space .‚, †‚ , P‚/. The products ¹"i ./"i ,j .!/ : i D 1, : : : , m; j D 1, : : : , nº

(2.46)

form a Rademacher sequence on the probability space .  ‚, †prod, Pprod/, where   ‚ is the Cartesian product of the sets  and ‚, †prod is the smallest with respect to the inclusion -algebra of subsets of ‚ containing all subsets of the form AB with A 2 † and B 2 †‚ ; the probability Pprod is determined by Pprod.A  B/ D P .A/  P‚ .B/. Let us denote E! , E , and E! the expectations corresponding to the spaces ., † , P/, .‚, †‚ , P‚/, and .  ‚, †prod, Pprod/, respectively. Since the sequence (2.46) is a Rademacher sequence, we have X 2 X 2 n X    m     " .!/x D E " ./ " .!/x E!  i ,j i ,j  i i ,j i ,j  !   i ,j

i D1

j D1

for any collection ¹xi ,j ºi ,j in X. We use the well-known fact: E! .f / D E! .E .f // for every function on   ‚. It is worth mentioning that in our case if suffices to consider finite probability spaces. In this case we just use the fact that the finite sum does not depend on the order of summation.

68

Chapter 2 Locally finite metric spaces

By the definition of ˛m .X/ we have  m 2  2 n X XX X   2    "i ./ "i ,j .!/xi ,j   .˛m .X// "i ,j .!/xi ,j  E    . i D1

j D1

i

j

Thus X 2 2 n X X   m  X  2    " ./ " .!/x  .˛ .X// E " .!/x E! E  i i ,j i ,j  m ! i ,j i ,j  .  i D1

j D1

i

j

Now we observe that for each fixed i the sequence ¹"i ,j .!/ºjnD1 is a Rademacher sequence. Therefore X 2 n X   2   "i ,j .!/xi ,j   .˛n .X// kxi ,j k2, E!  j

j D1

and we get  2 X X  2  " .!/x kxi ,j k2. E!  i ,j i ,j   .˛m .X/˛n .X//  i ,j

i ,j

Therefore ˛mn .X/  ˛m .X/˛n .X/. Similarly we get the results for ˇmn .X/ (we do not repeat all steps of the argument):  n 2 m X X X  2 2  kxi ,j k  .ˇn .X// E!  "i ,j .!/xi ,j   i ,j

i D1

j D1

 2 X    .ˇm .X/ˇn .X// E!  "i ,j .!/xi ,j   . 2

i ,j

Therefore ˇmn .X/  ˇm .X/ˇn .X/. To prove Theorem 2.56 for spaces with no nontrivial type we need the following lemma. Lemma 2.58. A Banach space X has nontrivial type if and only if ˛n .X/ < some n 2 N.

p

n for

Proof. Suppose that X has type p > 1 (as we know p  2). Combining the definition of type with the Kahane inequality we get 2 1=2 1=p X  X n   n p  " .!/x  C kx k . (2.47) E i i i  i D1

i D1

69

Section 2.4 Type and cotype

Now we use the inequality X n

1=p kxi k

p

n

1 1 p 2

X n

i D1

1=2 kxi k

2

.

i D1

This inequality can be obtained by using n X

ai bi  k¹ai ºkr  k¹bi ºkr 0

i D1

with ai D kxi kp , r D

2 , p

bi D 1, and r 0 given by

1 r

C

1 r0

D 1.

p . Since p > 1, this inequality implies that ˛n .X/ < n We get ˛n .X/  C n for some n. p Now we prove the other direction. Suppose that ˛n0 .X/ < n0 for some n0 2 N. 1 1 p2

1

1

Then ˛n0 .X/ D n0s 2 for some s > 1. Using the submultiplicativity of ˛n.X/ we get 1 1 ˛n .X/  n s  2 , for each n which is a positive integer power of n0 . Now we recall the fact that ˛n .X/ is a non-decreasing sequence and use the observation that the interval Œn, n0 n always contains a positive integer power of n0 . Therefore 1

1

1

1

˛n .X/  .nn0 / s  2 D Dn s  2

(2.48)

for all n 2 N, where D is some constant. Now we show that the estimate (2.48) implies that X has type p for each p < s. By the Kahane inequality, it suffices to prove (2.47) for some constant C . To do so, we Passume that kx1 k  kx2k      kxnk. With this assumption we have (writing 1 i D1 xi we mean xi D 0 for i > n) k 2 1=2 2 1=2  X 1   2X X  1  n      E E "i .!/xi   "i .!/xi  

i D1

kD1

  

1 X kD1 1 X kD1 1 X

i D2k1

D.2

k1

/

1 1 s2

1=2 kxi k

2

i D2k1 1

(2.49)

D.2k1 / s kx2k1 k 1

1

D.2k1 / s .2k1/ p

kD1

C

k 1  2X

X n i D1

i D1

1=p kxi kp

X n

.

1=p kxi kp

70

Chapter 2 Locally finite metric spaces

(We used the inequality

1 s



1 p

< 0, which implies that the series

1 X

1

1

D.2k1 / s .2k1 / p

kD1

is convergent.) We have proved (2.47). The last step in the proof of the statement of Theorem 2.56 regarding type is the following. Lemma 2.59. Let X be a Banach space. The space `1 is finitely representable in X p if and only if ˛n .X/ D n for all n 2 N. p Proof. First we observe that ˛n .`1 / D n for all n. In fact, if we let x1, : : : , xn be vectors of the unit vector basis of `1 , then 2 1=2  X  n   E "i .!/xi  Dn  i D1

 Pn

 2 1=2

p k D n. Therefore if `1 is finitely representable in X, we easily and i D1 kx pi p get ˛n.X/  n=.1 C "/ for every " > 0, therefore ˛n .X/ D n. p The assumption ˛n .X/ D n implies that for each k 2 N we can find a sequence ¹xk,j ºjnD1 in X such that X n

1=2 kxk,j k

2

D

p

n

j D1

and

2 1=2  X n X   n 1   "j .!/xj ,k   kxk,j k  n. n   E k j D1

j D1

Let U be a free ultrafilter on N. Consider elements ¹xj ºjnD1 represented in the ultra. We have product X U by the sequences ¹xk,j º1 kD1 X n

1=2 kxj k2

D

p

n,

j D1

2 1=2  X   n  "j .!/xj  D n, E  j D1

and

n X j D1

kxj k D n.

71

Section 2.4 Type and cotype

It is easy to see that these equalities imply that kxj k D 1 for each j and that  n  X    " .!/x j j D n  j D1

for almost each ! 2  (that is, for each collection of "j .!/ D ˙1). Now let ¹aj ºjnD1 be a sequence of real numbers. Assume that 0 < jaj j  1 for each j . We have X  X  X  n X  n   n   n       jaj j   aj xj    sign.aj /xj    sign.aj /.1  jaj j/xj   j D1

j D1

j D1

 n  .n 

n X

j D1 n X

jaj j/ D

jaj j.

j D1

j D1

It follows immediately that the subspace spanned by ¹xj ºjnD1 in X U is isometric to `n1 . We combine this statement with the following results. 

X U is finitely representable in X (Proposition 2.31).



For each finite-dimensional subspace F of `1 and each " > 0 there exists n 2 N and an operator T : F ! `n1 such that kT k  1 and kT 1 k  1 C ", where T 1 is the operator defined on T .F /. (The necessary argument is contained in the proof of Proposition 2.29.)



Composition S of two operators satisfying the inequalities of the previous item satisfies kSk  1 and kS 1k  .1 C "/2 . We conclude that `1 is finitely representable in X.

To prove Theorem 2.56 for spaces with no nontrivial cotype we need the following lemma. p Lemma 2.60. A Banach space X has nontrivial cotype if and only if ˇn .X/ < n for some n 2 N. Proof. Suppose that X has cotype q < 1 (as we know q  2). Combining the definition of cotype with the Kahane inequality we get 2 1=2  X 1=q X n  n  q   E "i .!/xi  C kxi k . (2.50) i D1

Now we use the inequality X n i D1

i D1

1=2 kxi k2

1

1

 n2q

X n i D1

1=q kxi kq

.

72

Chapter 2 Locally finite metric spaces

This inequality can be obtained by using the inequality n X

ai bi  k¹ai ºkr  k¹bi ºkr 0

i D1

with ai D kxi k2 , r D q2 , bi D 1, and r 0 given by

1 r

1 r0

C

D 1.

p . Since q < 1, this inequality implies that ˇn .X/ < n We get ˇn .X/  n for some n. p Now we prove the other direction. Suppose that ˇn0 .X/ < n0 for some n0 2 N. 1 1 2q

1 C

1

1

Then ˇn0 .X/ D n02 t for some 2  t < 1. Using the submultiplicativity of ˇn .X/ 1 1 we get ˇn .X/  n 2  t for each n which is a positive integer power of n0 . Now we recall the fact that ˇn .X/ is a non-decreasing sequence. Since each interval of the form Œn, nn0  contains a positive integer power of n0 , we get 1

1

1

1

ˇn .X/  .nn0 / 2  t D Dn 2  t

(2.51)

for all n 2 N, where D is some constant. Now we show that the estimate (2.51) implies that X has cotype q for each q > t . By the Kahane inequality, it suffices to prove (2.50) for some constant C . To do so, we assume that kx1k  kx2k      kxn k. With this assumption we have k 1  2X

1=2 kxi k

2

 D.2

i D2k1

k1

/

1 1 2t

k 2 1=2   2X   1  " .!/x E i i 

i D2k1

2 1=2  X  n  k1 12  1t  E  D.2 / "i .!/xi  .  i D1

Since k 2X 1

q

kxi k  kx2k1 k

q2

i D2k1

k 2X 1

kxi k2,

i D2k1

we have n X

q

kxi k  D

2

X 1

i D1

DD

2

.2

kD1 X 1

k1 1 2t

/

kx2k1 k

q2

2  X  n   E "i .!/xi   i D1

.2

kD1

k1 1 2t .1 q2 /

/

.2

k1 1 q2

/

kx2k1 k

q2

2  X   n  E "i .!/xi   . i D1

73

Section 2.4 Type and cotype

bi D

P1

kD1 ak bk  k¹ak ºkr  k¹bk ºkr 0 , 2 k1 1 q .2 / kx2k1 kq2 , and r 0 given by r1 C r10

Now we apply

n X

q

kxi k  D

2

X 1

i D1

.2

X 1

D 1, so r 0 D

2k1kx2k1 kq

 D1

i D1

q q2 .

2  q2  X q   n  E " .!/x i i 

q , 2

We get

(2.52)

i D1

kD1

X n

2

  q2  2  2t q2 q /

k1

kD1



2

with ak D .2k1 / q  t , r D

2  q2  X q   n q  kxi k E "i .!/xi   . i D1

We use the inequalities 1 X

2

k1

q

kx2k1 k  2

2 q



2 t

kxi kq

i D1

kD1

and

n X

< 0. The latter inequality implies that the series 1 X

.2

k1

  2  2t q2 q /

kD1

converges. Inequality (2.52) can be easily transformed into the desired inequality. The last step in the proof of the statement of Theorem 2.56 regarding cotype is the following. Lemma 2.61. Let X be a Banach space. The space `1 is finitely representable in X p if and only if ˇn .X/ D n for all n 2 N. p Proof. First we observe that ˇn .`1 / D n for all n. In fact, let x1, : : : , xn be vectors of the unit vector basis of `1 , then 2 1=2  X   n  E "i .!/xi  D1  i D1

 p 2 1=2 D and n. As for X, since `1 is finitely representable in X, we i D1 kxi k p p "/ for every " > 0, therefore ˇn .X/ D n. easily get ˇn .X/  n=.1 Cp The assumption ˇn .X/ D n implies that for each k 2 N we can find a sequence ¹xk,j ºjnD1 in X such that  Pn

X n j D1

1=2 kxk,j k

2

D

p

n

74 and

Chapter 2 Locally finite metric spaces

2 1=2  X   n 1  1  E "j .!/xj ,k   1C .  k j D1

Let U be a free ultrafilter of N. Consider elements ¹xj ºjnD1 represented in the ultra. We have product X U by the sequences ¹xk,j º1 kD1 X n

1=2 kxj k

2

D

p

n

(2.53)

j D1

and

2 1=2  X   n  " .!/x D 1. E j j  j D1

Now we prove that these inequalities imply that kxj k D 1 for each j and that P  n "j .!/xj  D 1 for almost each ! 2  (that is, for each collection of "j .!/ D j D1 ˙1). In fact, it is clear that (2.53) implies that kxj0 k  1 for some j0 . If  n  X    (2.54) " .!/x j j  D 1  ı.!/  j D1

for some ! and ı.!/  0, applying the triangle inequality we get  n   X  "j .!/xj  2"j0 .!/xj0    1 C ı.!/,  j D1

so each ! 2  satisfying (2.54) can be coupled with ! z 2  such that  X   n  "j .!/x z j   1 C ı.!/.  j D1

This shows that ı.!/ is always 0, because otherwise 2 X   n  " .!/x E j j  > 1.  j D1

The equalities kxi k D 1 also follow. Now let ¹aj ºjnD1 be a sequence of real numbers satisfying jaj j  1 for all j D 1, : : : ,P n. We have P (1) jnD1 aj xj is a convex combination of jnD1 j xj , where j D ˙1. Therefore  n  X   aj xj     1. j D1

Section 2.5 Some corollaries

75

(2) Assume  other coefficients are 1.  is to show  Paj0 D 1 and  that in POur purpose such a case  jnD1 aj xj  D 1. As we have shown,  jnD1 aj xj   1 and 2aj0 xj0   Pn  j D1 aj xj  1 (because the absolute values of the coefficients of the latter vector are also  1). Hence     Pn Pn     j D1 aj xj C 2aj0 xj0  j D1 aj xj . 1 D kxj0 k  2  P Therefore  jnD1 aj xj  D 1. It follows that the subspace spanned by ¹xj ºjnD1 in X U is isometric to `n1 . We combine this statement with the following results. 

X U is finitely representable in X (Proposition 2.31).



For each finite-dimensional normed space F and each " > 0 there exist n 2 N and an operator T : F ! `n1 such that kT k  1 and kT 1 k  1 C ", where T 1 is the operator defined on T .F /. We sketched proofs of similar results in Example 1.33 and Remark 2.52.



Composition S of two operators satisfying the inequalities of the previous item satisfies kSk  1 and kS 1k  .1 C "/2 . We conclude that `1 is finitely representable in X. This completes the proof of Theorem 2.56.

2.5 Some corollaries of the theorems on finite determination of embeddability of locally finite metric spaces Theorem 2.62. Let M be a locally finite subset of a Hilbert space. Then M admits a bilipschitz embedding into an arbitrary infinite-dimensional Banach space. Follows from the Dvoretzky theorem (Theorem 2.30) and Theorem 2.6. Theorem 2.63. Each locally finite metric space admits a bilipschitz embedding into any Banach space with no nontrivial cotype. Follows from the result of Fréchet (Proposition 1.17), characterization of spaces with no nontrivial cotype (Theorem 2.56), and Theorem 2.6.

76

Chapter 2 Locally finite metric spaces

2.6 Exercises Exercise 2.64. (Applying Zorn’s lemma) Let X be a linear space. A set A  X is called a Hamel basis for X if each vector x 2 X has a unique (up to an order of terms) representation x D ˛1 a1 C    C ˛n an where n 2 N, ˛1 , : : : , ˛n are nonzero scalars, and a1 , : : : , an 2 A. Use Zorn’s lemma to show that each linear space has a Hamel basis. Recall that a metric space is called nonseparable if it does not have a countable dense subset. Exercise 2.65. Show that for any infinite-dimensional Banach space X and any free ultrafilter U on N the ultrapower X U is a nonseparable space. If needed, see a hint in the Hints to exercises section below. Exercise 2.66. Give an example of two isomorphic but not isometric Banach spaces X and Y with d.X, Y / D 1, where d is the Banach–Mazur distance. There is a hint to this exercise. A Banach space X is called strictly convex if its unit sphere does not contain line segments. (We also say that the norm of the space X is strictly convex.) Exercise 2.67. Prove that a Banach space Y is uniformly convex if and only if each Banach space X which is finitely representable in Y is strictly convex. Exercise 2.68. (a) Show that each Banach space having type 2 with constant 1 is a Hilbert space. (b) Show that each Banach space having cotype 2 with constant 1 is a Hilbert space. There is a hint to this exercise. Let A  S`n1 (S`n1 is the unit sphere of `n1 ). For a Banach space X we define the index h.X, A/ of A in X by h.X, A/ D

sup ¹xi ºn iD1 BX

X   n   inf  ai xi  . ¹a º2A i

i D1

Exercise 2.69. Let ˛ 2 .0, 1/. Prove h.X, S`n2 /  ˛ ) h.X, S`n1 /  1

p

˛.

Section 2.7 Notes and remarks

77

2.7 Notes and remarks Zorn’s lemma is from [445]. See [130, Chapter 1, Sections 2 and 15] for relations of Zorn’s lemma with the axiom of choice and other statements of set theory, and for related references. Theorems 2.6 and 2.7 were proven in [360] and our presentation follows this paper. The results of [360] provide a common generalization of different theorems showing that one can paste an embedding of a locally finite metric space into a Banach space from embeddings of finite pieces of the metric space. Such results were obtained in [45], [46], [48], [352], [353], [354]. Some of these results, in turn, were proven as generalizations and analogues of the result of [85] stating that each space of bounded geometry admits a coarse embedding into a reflexive Banach space. A generalization of the mentioned result of [85] in a different direction was obtained in [236, Theorem 2.1]. Formula (2.9) for pasting pieces was suggested in [48] in the proof of Theorem 2.63. The idea of using norming subspaces for selection of good-behaving sequences in Banach spaces goes back to [230]. The result of Gowers and Maurey on the existence of Banach spaces which are not isomorphic to their closed hyperplanes can be found in [173]. The extended notion of a locally finite space mentioned in Remark 2.35 is used in some papers, e.g. [247]. Our definition of a locally finite metric space follows [345]. Filters and ultrafilters were introduced by H. Cartan [89],[90]. A detailed presentation of Cartan’s results on filters and ultrafilters was given in [73]. Our summary of basic results on ultrafilters is similar to the summary in [130, pp. 30–31]. Ultraproducts were introduced to Banach space theory by Dacuhna-Castelle and Krivine [107]. An important contribution to the development and dissemination of these ideas is the paper by Heinrich [199]. Our presentation of the basic results on ultraproducts is similar to those of [120] and [383]. Theorem 2.26 has its roots in [107]. It was explicitly formulated in [199], where the following proof was suggested: to show that an ultraproduct of Banach lattices (see the definition of a Banach lattice in [288]) is a Banach lattice. As Banach lattices, Lp -spaces admit a characterization [288, Theorem 1.b.2] which goes back to Kakutani [234]. It is not difficult to check that this characterization passes from .Xi /i 2I to their ultraproducts, see [199] or [120, p. 172]. The first part of Theorem 2.26 follows. The second part follows if we combine the first part with Fact 1.20 mentioned in Chapter 1. According to [40, Remarks to Chapter VII] Auerbach proved the existence of Auerbach bases for each finite-dimensional Banach space X. The book [40] does not contain any proofs of the existence of Auerbach bases. The two-dimensional case of Auerbach’s result was proved in [32]. During the Second World War Herman Auerbach (1901–1942) fell victim to the Nazi regime (see [373, p. 590]), and his original proof of the general case seems to have been lost. Apparently the first published proofs of Auerbach’s lemma appeared in [116] and [424]. The paper [385] contains interesting

78

Chapter 2 Locally finite metric spaces

results on the relations between Auerbach bases obtained using different constructions, and some related references, see also [358]. The notion of finite representability was introduced in [213]. The Dvoretzky theorem is from [132]. It preceded the introduction of the notion of finite representability and was proved in terms of finite-dimensional normed spaces: for each n 2 N and " > 0 a normed space of sufficiently high dimension contains an n-dimensional subspace whose Banach–Mazur distance to `n2 is  1 C ". The Dvoretzky theorem and its ramifications form a very important part of Banach space theory, see [332] and [381]. The precursors to the type and cotype theory first appear in papers [348],[349]. Various versions of the notions of type and cotype appear in the papers [129], [204], [376], [395], all of which were published in the early 1970’s. Apparently the term ‘type’ appeared for the first time in [376]. One of the most important contributions to the type-cotype theory and the first paper where this theory was systematically developed is [319]. This paper contains the Maurey–Pisier Theorem 2.55. Theorem 2.56 is a special case of Theorem 2.55, its first part was proved earlier in [375]. Our presentation of the proof of Theorem 2.56 is based on [9, Chapters 7 and 11] and [375]. Proofs of the full version of Maurey–Pisier theorem (Theorem 2.55) and of other important results about type and cotype can be found in [332]. See [318] for a historical account of the type-cotype theory written by one of its main contributors. Among predecessors which were not mentioned in [318] we would like to mention [133] (nonlinear versions of type) and [365] (Fourier-analytic versions of type-cotype). Theorem 2.43 is from [227]. Jordan and von Neumann proved the result in the complex case too. Many other isometric geometric characterizations of Hilbert spaces are known, interested readers should consult the book [18]. The Kwapie´n theorem 2.44 is from [260] (another recommended source is the book [380, Chapter 3]). The part of Theorem 2.53 concerning cotype appears in [348],[349] and the part concerning type appears in [343]. However these papers were written without the type-cotype terminology. The Khinchin inequality goes back to [248], in the present form it was stated and proved in Littlewood [295], apparently Littlewood was unaware of [248]. The Kahane inequality goes back to [232] and [233]. None of these sources contains an explicit statement of the Kahane inequality, the closest statement is [233, Theorem 4 in Chapter II]. Because of its importance several different proofs of the Kahane inequality were found and presented in many texts and monographs. Here is the list of some of the proofs of the Kahane inequality. Probabilistic proofs: Most of them are close to the original original argument sketched in [232] and presented in detail in [233]. See [9, Section 6.2] and [160, Section 12.8] for similar proofs. Our presentation is very close to the one given in [9]. Somewhat different are the tail estimates proofs, see [272, pp. 91, 100–101]. Geometric proof: A proof based on the Brunn–Minkowski inequality, see [332, pp. 134–136]. Analytic proof: See [72], [288, pp. 74–77], [378], and [383]. Latała–Oleszkiewicz [268] suggested a very elegant proof in the case p D 2 with the best constant, see also [160, Section 13.3].

Section 2.8 Hints to exercises

79

The Hölder inequality is presented in numerous analysis books, see [194, Sections 2.7-2.8], [160, Section 5.4]. Historical studies show that it would be more natural to attribute this inequality to L. J. Rogers, see the footnote to Theorem 13 in [194] and [414, Chapter 9] and references therein for more on this matter. Lemma 2.49 is attributed to P. Levy in [233, Chapter II, Section 3]. Theorem 2.62 is from [354], Theorem 2.63 is from [48]. Some more results in the same spirit can be found in [46]. Exercise 2.69 is the well-known Giesy lemma (see [161, Lemmas I.4 and I.6]), it is a more precise finite-dimensional version of the result of James [211, Lemma 2.1]. It is worth mentioning that there exists a version of the Giesy lemma related to superreflexivity which we are going to study in Section 9.3.1. To state it we denote the subset of S`n1 consisting of all sequences with at most one change of signs by Jn . With this notation one of the results of James [213] can be stated as Theorem 2.70. For a Banach space X the following three conditions are equivalent. 

The space X is non-superreflexive.



infn h.X, Jn / > 0.



h.X, Jn / D 1 8n 2 N.

Using the standard ultraproduct argument it can be shown that Theorem 2.70 implies the following result. Theorem 2.71. For each n 2 N and every real number ˛, ˇ satisfying 1 > ˛ > ˇ > 0 there is N D N.n, ˛, ˇ/ 2 N such that h.X, JN /  ˇ implies h.X, Jn /  ˛. It turns out, however that in this case the dependence of N D N.n, ˛, ˇ/ on n, ˛, ˇ is much worse than in the Giesy lemma, see [350], [437].

2.8 Hints to exercises To Exercise 2.65. Show that X contains a bounded infinite sequence ¹xk º1 satiskD1 fying kxk  xm k  1 for k ¤ m. Show that this sequence contains uncountably many subsequences with any two of them having only finitely many common elements. Consider the elements of X U corresponding to these subsequences. To Exercise 2.66. Consider the `2 direct sums of two-dimensional ¹`p2 n º1 nD1 and 1 1 ¹`2qn º1 , where ¹p º and ¹q º are two dense subsets of Œ1, 2, but only one of n nD1 n nD1 nD1 them contains 1. (The definition of an `2 direct sum is similar to the definition of the `1 direct sum given in this chapter, see Definition 3.6 in the next chapter.) To Exercise 2.68. Show that each of the assumptions, used for different pairs of vectors, implies the parallelogram identity.

Chapter 3

Constructions of embeddings

Let .X, dX / and .Y , dY / be metric spaces. The infimum of distortions of bilipschitz embeddings of X into Y is denoted cY .X/. We let cY .X/ D 1 if there are no bilipschitz embeddings of X into Y . When Y D Lp we use the notation cY ./ D cp ./ and call this number the Lp -distortion of X. The parameter c2 .X/ is called the Euclidean distortion of X. We are going to study some constructions of low-distortion embeddings of finite metric spaces .X, d / into `p .1  p < 1/. The embeddings which we are going to construct actually will be into `pn with large n. To construct such an embedding means to construct a finite or infinite sequence of Lipschitz real-valued functions on X, so that the embedding will be u 7! ¹fi .u/ºniD1 2 `pn or u 7! ¹fi .u/º1 i D1 2 `p . We start with embeddings called Fréchet embeddings. They are defined as embeddings into `pn or `p for which the functions fi are defined as scalar multiples of functions of the form d.u, A/ where A is a subset of X and d.u, A/ D inf¹d.u, v/ : v 2 Aº. Some of the constructions of suitable functions of this form are based on a very useful notion of a padded decomposition of a metric space X.

3.1 Padded decompositions and their applications to constructions of embeddings Definition 3.1. A decomposition of a metric space .X, d / is a partition of X into disjoint subsets. Given a decomposition P D ¹C1, : : : , Cm º of X, we refer to the sets Ci as clusters. We write PX for the set of all partitions of X. For x 2 X and a partition P 2 PX we denote by P .x/ the unique cluster of P containing x. Recall that the diameter of a set A in a metric space X is supu,v2A d.u, v/. We are interested in collections of decompositions for which the diameters of clusters are bounded by some number , but each x 2 X is, on average over the collection, reasonably far from the complement of the cluster P .x/. In this connection we consider the set DX of all probability measures on PX . We are interested mostly in the case where X is a finite set. In such a case PX is also finite, each element of DX is a measure on a finite set. Support of such a measure is defined as the set of decompositions having nonzero probability.

81

Section 3.1 Padded decompositions

Definition 3.2. A stochastic decomposition of a finite metric space .X, d / is a probability measure ‰ 2 DX . Let " : X ! .0, 1 and > 0. The measure ‰ is called an "-padded -bounded decomposition of X (or an .", /-padded decomposition) if it satisfies the following two conditions: (a) Diameter condition: diam.C / < for all P 2 supp.‰/ and all C 2 P . (b) Pad condition: For all x 2 X, ‰¹P : d.x, XnP .x//  ".x/ º  12 . Remark 3.3. The number 12 in the pad condition can be replaced by any other number in the interval .0, 1/. This will change the class of .", /-padded decompositions, but the theory could be developed in the same way. Now we describe a way in which padded decompositions can be used to construct bilipschitz embeddings with relatively small distortions. (Later we shall prove stronger results, but some of the main ideas are more transparent in the proof of Theorem 3.4.) Theorem 3.4. Let X be a finite metric space with diam.X/ < 2t .t 2 N/ and d.u, v/  1 for all u, v 2 X, u ¤ v. Suppose that there is ˛ > 0 such that the metric space X has an .", /-padded decomposition for each 2 L D ¹1, 2, 22 : : : , 2t º with p 4 2.t C1/1=2 . ".x/ D ˛ for each x 2 X. Then c2 .X/  ˛ Proof. For each 2 L let ‰. / be an .", /-padded decomposition of X with ".x/ D ˛ for each x 2 X. Let P D ¹C1, : : : , Cm.P / º 2 supp.‰. //. Denote by U1 , : : : , U2m.P / unions of all possible collections of clusters of P . For each P 2 supp.‰. // and each j 2 ¹1, 2, 3, : : : , 2m.P / º we introduce the function fP ,j : X ! R by ´ min¹d.u, XnP .u//, º, if u 2 Uj (3.1) fP ,j .u/ D 0, if u … Uj . Remark 3.5. Here we assume that d.u, XnP .u// D 1 if XnP .u/ D ;. We use minimum with in (3.1) for two reasons: (1) To avoid infinite values if the decomposition consists of one cluster; (2) To achieve suitable bounds on the functions which we construct. Although we do not need these bounds for Theorem 3.4, we shall need them in Section 3.4. Let fP be the embedding of X into `22

m.P /

defined by the formula m.P /

fP .u/ D ¹2m.P /=2 fP ,j .u/ºj2D1 . Let us show that Lip.fP /  1. We start by showing that jfP ,j .u/  fP ,j .v/j  d.u, v/.

82

Chapter 3 Constructions of embeddings

In the case where P .u/ D P .v/ this inequality follows from the well-known simple fact: the function d.u, A/, as a function of u, has Lipschitz constant 1. In the case where P .u/ ¤ P .v/ we have 0  d.u, XnP .u//  d.u, v/ and 0  d.v, XnP .v//  d.u, v/, and the conclusion follows in this case also. Therefore we have kfP .u/  fP .v/k 

m.P /  2X

1=2 2m.P / .d.u, v//2

D d.u, v/.

j D1

On the other hand, the construction does not imply any estimates for kfP .u/fP .v/k from below. To get an embedding admitting estimates from below we concatenate the embeddings fP . To define concatenation we need the following definition. Definition 3.6. Let ¹X º2 be a family of Banach spaces. Their `p direct sum, where 1  p < 1, is defined as the Banach space F : ! [2 X P of all functions p satisfying the conditions F . / 2 X and 2 kF . /kX < 1. The sum and multiplication by scalars of such functions are defined pointwise, the norm is defined by   P p 1=p . The `p direct sum is denoted ˚2 X p . The funckF k D 2 kF . /kX tion F is also denoted ˚2 F . / or .F . //2 .   Remark 3.7. Observe that each F 2 ˚2 X p , 1  p < 1, can satisfy F . / ¤ 0 only for countably many elements 2 . Remark 3.8. `1 direct sums were introduced in Definition 2.17. In that definition we used slightly different notation. Observation 3.9. Any `2 direct sum of Hilbert spaces (finite or infinite-dimensional) is a Hilbert space of the corresponding dimension. In fact, in such a case the norm P 1=2 P 2 is induced by the inner product 2 hF . /, G. /iH , where 2 kF . /kH h, iH is the inner product of H . Now we return to the proof of Theorem 3.4. First we concatenate the embeddings fP for P 2 supp.‰. // in the following way: We consider an embedding f of X into the direct sum  m.P /  H D ˚P 2supp.‰. // `22 2 defined by

p f .u/ D ˚P 2supp.‰. // ‰.P /fP .u/.

83

Section 3.1 Padded decompositions

The choice of coefficients

p ‰.P / implies Lip.f /  1. In fact, 

X

kf .u/  f .v/k D 

1=2 ‰.P /kfP .u/  fP .v/k2

P 2supp.‰. //

X



(3.2)

1=2 ‰.P /.d.u, v//2

D d.u, v/.

P 2supp.‰. //

The pad condition can be used to estimate kf .u/  f .v/k from below if d.u, v/  . In fact, if d.u, v/  , the diameter condition implies that P .u/ ¤ P .v/ for each P 2 supp.‰. //. Therefore a quarter of all sets Uj (corresponding to such P ) satisfies the conditions u 2 Uj and v … Uj . Thus

kfP .u/  fP .v/k D

m.P /  2X

1=2 2

m.P /

2

jfP ,j .u/  fP ,j .v/j

j D1

  D

 1=2 1 m.P / m.P / 2 min¹d.u, XnP .u//, º2 2 4

1 min¹d.u, XnP .u//, º. 2

Now we use the pad condition (see also (3.2)) and get (since ˛  1) kf .u/  f .v/k 

p

‰¹P : d.u, XnP .u//  ˛ º 

˛ 1 ˛  p . 2 2 2

(3.3)

Remark 3.10. The embedding f : X ! H has the following properties (a) Lip.f /  1. (b) d.u, v/ 

)

kf .u/  f .v/k 

˛ p 2 2



(c) kf .u/k  for each u 2 X. (This property can be verified one-by-one for maps fP ,j , fP , f .) Later (in Section 3.4 of this chapter) we introduce a class of embeddings sharing similar properties, and present a way of combining them which is in a certain sense optimal. Now we finish the proof of Theorem 3.4 using a naïve concatenation. Now we concatenate f over 2 L, that is, we consider f : X ! .˚ 2L H /2 given by f .u/ D ˚ 2Lf .u/. This concatenation is a bilipschitz embedding and it

84

Chapter 3 Constructions of embeddings

is easy to estimate its distortion. In fact, on the one hand X 1=2 kf .u/  f .v/k2 kf .u/  f .v/k D 2L 1=2

 jLj

d.u, v/

D .t C 1/1=2  d.u, v/. On the other hand, since X is such that 1  d.u, v/ < 2t .8u, v 2 X, u ¤ v/, for each pair u, v we can find 2 L such that  d.u, v/ < 2 . By (3.3), for that ˛ p . Thus kf .u/  f .v/k > p d.u, v/ value of we have kf .u/  f .v/k  ˛ 2 2

and kf .u/  f .v/k >

˛ p

4 2

d.u, v/. We conclude that c2 .X/ 

4 2 p 4 2.t C1/1=2 . ˛

Remark 3.11. The proof of Theorem 3.4 can be easily modified to prove, under the same assumptions on X, the inequality cp .X/  2.8.t C 1//1=p =˛. It is worth mentioning that for 1  p < 2 we get a better inequality if use the embedding into `2 and the Dvoretzky theorem 2.30.

3.2 Padded decompositions of minor-excluded graphs Now we are going to construct a padded decomposition of the type needed for Theorem 3.4. We need the notion of a minor of a graph. Definition 3.12. Let H be a graph with the vertex set V .H / D ¹v1, : : : , vk º. We say that H is a minor of a graph G if there are disjoint connected sets of vertices Vi  V .G/, i D 1, : : : , k, such that for each edge e 2 E.H / with end vertices vi and vj there is a corresponding edge ez 2 E.G/ with end vertices in Vi and Vj . The sets Vi are called branch sets. Remark 3.13. Minors play a significant role in the structure theory of graphs. One of the most well-known results of this type is the Kuratowski theorem: a graph G is planar (that is, can be sketched in the plane without edge crossings) if and only if the complete graph K5 and the complete bipartite graph K3,3 are not minors of G. Theorem 3.14. Suppose that an unweighted connected graph G is such that the complete graph Kr is not a minor of G. Then G, considered as a metric space with its graph distance, admits an .", /-padded decomposition for each with ".x/ D c.r /, where c.r / > 0 depends on r only (and does not depend on G, x, or ). 1 . We do not claim that this bound Proof. We prove the theorem with c.r / D 16r .8r C2/ is close to being optimal. Observe that there is nothing to prove in the case where c.r /  1, because in this case all the conditions of Definition 3.2 on .", /-padded decomposition are satisfied

Section 3.2 Padded decompositions of minor-excluded graphs

85

for the decomposition of the vertex set of G onto 1-element sets. (This decomposition can be considered as a stochastic decomposition for which the probability measure is supported on one decomposition.) In the case where c.r / > 1, that is, where 8r C2 > 16r , the desired decomposition ˘  is constructed in the following way. We let ı D 8r C2 and do the following: We pick a vertex v.G/ in G and call it the center of G. We pick r1 2 ¹0, 1, : : : , ı 1º and let S1  E.G/ be the set of all edges uv in G for which dG .u, v.G// D r1 .mod ı/ and dG .v, v.G// D r1 C 1 .mod ı/, where dG is the shortest path metric of G. We call the components of the graph G2 D GnS1 (the graph obtained from G after the deletion of edges of S1 ) components of level 2. For uniformity we call G a component of level 1. We split the set of all components of level 2 into two classes which will be handled differently. We say that a component C of level k satisfies the FC-condition (far-from-centers condition) if there are vertices in C which are at dG -distance at least 4r ı from the centers of all components of lower levels containing C . Components satisfying the FC-condition are called FC-components. In application to components of level 2 the FC-condition means that the component is not contained in a ball of radius < 4r ı centered at v.G/. It is clear that in this case non-FC-components already satisfy the diameter condition, the diameter of each of them is < 8r ı  . In general, one can check that as soon as we get a component which satisfies the diameter condition of Definition 3.2, we do not need to decompose it further; but we do this for the uniformity of presentation. We continue decomposing FC-components. In each FC-component C of level 2 we pick a vertex v.C / satisfying dG .v.C /, v.G//  4r ı and call it the center of C . We pick r2 2 ¹0, 1, : : : , ı  1º and let S2 be the set of all edges uv (in all FC-components of level 2) for which dC .u, v.C // D r2 .mod ı/ and dC .v, v.C // D r2 C1 .mod ı/. We call the components of the graph G3 D G2nS2 components of level 3. It is important that in this step we use the distance dC , that is, the shortest path metric of C (the FC-component of level 2 which we partition into components of level 3). Observe that non-FC-components of level 3 do not have to have diameters < 8r ı, and we have to decompose them further to be sure that the diameter condition is satisfied. We are going to do this somewhat later. Now we work on decomposition of FC-components. FC-components of level 3 and all further levels are decomposed in the same way as in the case of level 2. Here is the general description for level k. In each FC-component C of level k we pick a vertex v.C / satisfying the condition dG .v.C /, v.Cz //  4r ı for each component Cz of one of the preceding levels containing C . We pick a number rk 2 ¹0, 1, : : : , ı1º and let Sk be the set of all edges uv in all FCcomponents C of level k for which dC .u, v.C // D rk .mod ı/ and dC .v, v.C // D

86

Chapter 3 Constructions of embeddings

rk C 1 .mod ı/. (Here in each component C of level k the distance dC is its shortest path metric.) We let GkC1 D Gk nSk . We call components of the graph GkC1 components of level k C 1. The set of edges uv in an FC-component C of level k for which dC .u, v.C // D r2 C sı and dC .v, v.C // D r2 C 1 C sı for some s 2 N [ ¹0º will be called a cut of level k and denoted Tks . The following observation is simple but crucial: Observation 3.15. A shortest path joining a vertex u in a component C of level k with the center v.C / cannot contain more than ı vertices in any of the components of level k C 1 contained in C . Proof. Let P be a shortest path joining a vertex of C with v.C /. When we form components of level k C 1, one out of each ı consecutive edges of the path P is deleted. The conclusion follows. Claim 3.16. If G has no Kr -minors, then there are no FC-components of level r . Proof. Assume the contrary. Let Cr be an FC-component of level r in G and let Cj .j D 1, : : : , r  1/ be the component of level j containing Cr . Denote by aj .j D 1, : : : , r / the center of Cj . We are going to use ¹a1, : : : , ar º to find a Kr -minor in G. The branch sets of the Kr -minor which we construct contain a1 , : : : , ar , respectively. Observe that our construction implies dG .ai , aj /  4r ı. The most natural idea is to join each pair ai , aj by a shortest path in G, and then to cut these paths somewhere for each pair, attaching each part to the corresponding branch set. However, this straightforward construction does not always work: no matter how we cut the paths, we cannot claim the disjointness of the obtained sets. Our construction can be regarded as a more complicated modified version of this straightforward construction, the purpose of the modification is to achieve disjointness. We construct inductively Kj -minors .j D 2, : : : , r / with branch sets Br j C1.j /, : : : , Br .j / containing vertices ar j C1, : : : , ar , respectively. These branch sets get some additional vertices in each step of the induction. One of the important ideas is to use Cr j -shortest paths (that is, paths which are shortest in the graph distance of Cr j ) instead of shortest paths in G. This helps because in this case the branch sets Br j C1.j /, : : : , Br .j / are subsets of Cr j C1. On the other hand, by Observation 3.15, each Cr j -shortest path can contain at most ı  1 edges in Cr j C1. We need to start this path at such a vertex of Bm .j / .m D r  j C 1, : : : , r / that within this .ı  1/-piece the Cr j -shortest path does not intersect components of the collection ¹Br j C1.j /, : : : , Br .j /ºn¹Bm.j /º. This is the main idea behind the formal description of the construction which we present now.

Section 3.2 Padded decompositions of minor-excluded graphs

87

Remark 3.17. K2 -minors are easy to find in each graph having some edges. Our construction is more complicated than is necessary because we are going to use it in the inductive construction of Kr -minors for larger r . Constructing a K2-minor. By construction, the component Cr is contained between sC1 two cuts of level r  1, say Trs1 and TrsC1 1 (the cut Tr 1 can be empty). We join ar and ar 1 with a Cr 1 -shortest path Pr .1/. We cut this path into two pieces: one of the pieces consists of vertices which are closer to ar than the 4th cut of level r  1 crossed by the path on the way to ar 1 , that is, than the cut Trs3 1 . The vertices of this piece form the branch set Br .2/, and all of the remaining vertices of the path form the branch set Br 1 .2/ (we use this terminology and notation for these sets because they are going to be the branch sets for the K2 -minor). We pick a vertex on Pr .1/ between the cuts s1 Trs2 1 and Tr 1 and denote it cr .2/, we may assume that dG .cr .2/, ar /  2ı. We let cr 1 .2/ D ar 1 . It is clear that Br 1 .2/ and Br .2/ are branch sets for a K2 -minor. It is also clear (and will be used below) that Br 1 .2/ and Br .2/ are subsets of Cr 1. (We have much more freedom in the first step. We have chosen the described construction of Br 1 .2/ and Br .2/ to have uniformity with further steps in the construction.) Constructing a K3 -minor (assuming r  3). By construction, the component Cr 1 is contained between two cuts of level r  2, say Trs2 and TrsC1 2 (the letter s here denotes, in general, a different number than in the previous step, we just do not want to use a more complicated notation). We join cr 1 .2/ and cr .2/ with ar 2 using Cr 2shortest paths Pr 1 .2/ and Pr .2/, respectively. The conditions dG .cr .2/, ar /  2ı and dG .ai , aj /  4ır imply that each of these paths is of length  4ır  2ı. We decompose each of the paths Pr 1 .2/ and Pr .2/ into two pieces. One of the pieces is determined by the condition: it consists of vertices which are closer to cr 1 .2/ and cr .2/, respectively, than the 4th cut crossed by the path on the way to ar 2, that is, than the cut Trs3 2 . We add those vertices of Pr 1 .2/ and Pr .2/ which are closer s3 than the cut Tr 2 to cr 1 .2/ and cr .2/, respectively, to the sets Br 1.2/ and Br .2/, and denote the obtained sets Br 1.3/ and Br .3/. All of the remaining vertices of the paths, including ar 2 , form the set Br 2 .3/, which we create in this step. s2 We pick vertices between the cuts Trs1 2 and Tr 2 on each of the paths Pr 1 .2/ and Pr .2/ and denote them cr 1 .3/ and cr .3/, respectively. We may assume that dG .cr 1 .3/, ar 1 /  2ı and dG .cr .3/, ar /  4ı. We let cr 2 .3/ D ar 2 . Let us show that Br 2 .3/, Br 1 .3/, and Br .3/ are branch sets of a K3 -minor. It is clear that only disjointness requires verification. The first step in the verification is easy: The set Br 2 .3/ cannot intersect any of the other sets because it is separated from them by the cut Trs3 2 . The added to Br 1 .2/ and Br .2/ pieces of paths Pr 1 .2/ and Pr .2/ cannot intersect each other by the triangle inequality: the dG -distance between them is  dG .cr 1 .2/, cr .2//  2  4ı  dG .ar 1 , ar /  2  4ı  2ı > 0. In addition, by Observation 3.15, the path Pr .2/ leaves Cr 1 before it can intersect Br 1 .2/ because the Cr 1-distance between cr .2/ and Br 1.2/ is > ı. A similar state-

88

Chapter 3 Constructions of embeddings

ment holds for the path Pr 1 .2/ and Br .2/. We emphasize that this statement holds because Pr 1 .2/ and Pr .2/ are Cr 2-shortest paths; this is one of the main ideas in the proof. We complete the inductive construction in the following way. When we construct a Kj -minor we introduce the following objects: 

Branch sets Br j C1.j /, : : : , Br .j /.



Connection vertices cr j C1.j /, : : : , cr .j / satisfying cr j C1.j / 2 Br j C1.j /, . . . , cr .j / 2 Br .j /. (Let us mention the role of these vertices: they are sufficiently distant from other branch sets, and therefore not-too-large portions of Cr j -shortest paths joining connection vertices with the center of Cr j do not intersect other branch sets.) These objects satisfy the following conditions:

(a) All of them are contained in Cr j C1. (b) The Cr j C1-distance between cm .j / and Bn .j / .m ¤ n/ is > ı. (c) dG .am , cm .j //  2.j 1/ı, and the inequality is strict for all m, except, possibly m D r. Now we describe the inductive step. By construction, the component Cr j C1 is contained between two cuts of level r  j , say Trsj and TrsC1 j (the number s here can be different from the numbers denoted s in the previous steps). We consider Cr j -shortest paths Pr j C1 .j /, . . . , Pr .j / connecting ar j and the vertices cr j C1.j /, : : : , cr .j /, respectively. We add the vertices of pieces of these paths which are closer to cr j C1 .j /, . . . , cr .j / than the cut Trs3 j to Br j C1 .j /, . . . , Br .j /, respectively. In this way we form the branch sets Br j C1.j C 1/, : : : , Br .j C 1/. The remaining vertices of the paths Pr j C1.j /, . . . , Pr .j / form a branch set Br j .j C1/, which is created in this step. Let cr j C1.j C 1/, : : : , cr .j C 1/ be vertices in the branch sets Br j C1.j C 1/, : : : , Br .j C 1/ satisfying the condition: they belong to the pieces of the paths Pr j C1 .j /, . . . , Pr .j / which are between the cuts Trs1 j and s2 Tr j , and satisfy the conditions dG .cm .j C 1/, cm .j //  2ı.

(3.4)

The existence of such vertices is obvious. We let cr j .j C 1/ D ar j . We need to show that Br j .j C 1/, : : : , Br .j C 1/ form a collection of branch sets for a Kj C1-minor. The only property which is not obvious is their disjointness. We also have to check that analogues of the conditions in items (b) and (c) for j replaced by j C 1 are satisfied. The condition in (a) follows from the construction. The branch set Br j .j C 1/ does not intersect the other branch sets because they are separated by the cut Trs3 j .

Section 3.2 Padded decompositions of minor-excluded graphs

89

By Observation 3.15, the paths Pr j C1 .j /, : : : , Pr .j / have at most ı vertices in Cr j C1. Therefore, by condition (b), Pm .j / cannot intersect Bn .j / for m ¤ n. The pieces of Pm .j / and Pn .j /, m ¤ n cannot intersect each other because their lengths are  4ı, and therefore, by condition (c) and the triangle inequality, the closest vertices on them are at distance > dG .am , an /  2  2.j  1/ı  2  4ı D dG .am , an /  4.j C 1/ı, and the last number is assumed to be nonnegative. Finally, we check the validity of conditions (b) and (c) with j replaced by j C 1. Condition (b) follows because cn .j C 1/ is separated from Bn .j C 1/ by two cuts, Trsj and Trs1 j . Condition (c) follows immediately from (3.4) and the definition of cr j .j C 1/. Claim 3.16 implies that the graph G (which has no Kr -minors) does not have FCcomponents of level r . Therefore to complete the proof of Theorem 3.14 we need to show how to decompose non-FC-components in such a way that the overall decomposition satisfies the diameter and pad conditions of Definition 3.2. Handling the non-FC-components. For each non-FC-component we define the set of at most r 1 relevant centers, as the set of centers of FC-components containing it. We pick independently r  1 integers s1 , : : : , sr 1 from the set ¹4r ı, 4r ı C 1, : : : , 4r ı C ı  1º. Let N be a non-FC-component, c1 , : : : , ck be relevant centers; where ci is a center of an FC-component of level i containing N . We consider the following sets of edges in N : R1 D ¹uv : dG .u, c1 / D s1 , dG .v, c1 / D s1 C 1º R2 D ¹uv : dG .u, c2 / D s2 , dG .v, c2 / D s2 C 1º ::: Rk D ¹uv : dG .u, ck / D sk , dG .v, ck / D sk C 1º. We remove these sets of edges from the edge set of N . The components of the obtained graph form the final decomposition. It is clear that the dG -diameters of the obtained components are  2.4r ı C ı  1/ < .8r C 2/ı  . Now we assume that each of the choices r1 , : : : , rr 1 and s1 , : : : , sr 1 was a random choice and that the probabilities of choosing each of the 2.r  1/-tuples of numbers are the same. We get a stochastic decomposition in which each of the decomposi 2.r 1/ (if it is not repeated tions obtained in the described way has probability ı1 for different choices of the integers r1, : : : , rr 1 and s1 , : : : , sr 1 , and the probability  1 2.r 1/ (the number of repetitions) otherwise). It seems worthwhile to add some ı justification to this statement. In fact, the resulting decomposition depends not only

90

Chapter 3 Constructions of embeddings

on the choice of numbers r1, : : : , rr 1 and s1 , : : : , sr 1 , but also on the choice of centers. However we can make the choice of centers uniquely determined by the choice of r1 , : : : , rr 1. This can be done as follows. We label vertices of G using positive integers 1, 2, : : : . Now we let each of the centers to be the vertex with the smallest label satisfying the corresponding conditions. Verifying the pad property. As we mentioned at the beginning of the proof, it suffices 1 to consider the case where 8r C2 > 16r . It is clear that in this case c.r /  16r .ı C1/. 1 Consider a vertex v in G. Let A be the event that v is dG -closer than 16r .ı C 1/ to any vertex from which v was separated by removal of one of the sets Si or one 2.r 1/ of the sets Ri . We can write A D [i D1 Ai , where Ai is the event that the first separation happened when the i-th removal was made (we order the edge sets as S1, : : : , Sr 1 , R1 , : : : , Rr 1 ).  C1 ˘ 1  4r Let us show that the probability of A1 is  ı2  ı16r . Let dv be the residue of the number d.v.G/, v/ modulo ı. It is easy to see that the separation described in the definition of A1 can happen only if r1 is the residue of one of the numbers     ıC1 ıC1 , : : : , dv  1, dv , dv C 1, : : : , dv C 1 dv  16r 16r modulo ı (otherwise each of the deleted edges is too far from v to separate a vertex which is close to v). The claim about probability of A1 follows immediately. Similar argument works for each of the events Ak , k D 2, : : : , 2.r  1/. Therefore the 1 probability of A is  4r .2r  1/  12 .

3.3 Padded decompositions in terms of ball growth As we have already seen in the proof of Theorem 3.4, collections of padded decompositions with the diameter bound ranging over all integral powers of 2 (of course, 2 can be replaced by any other positive number different from 1, but we are going to use 2) can be used to construct embeddings into `p . (Later we shall see that some of the embeddings constructed in this way are close to being optimal, that is, having the distortion close to the minimal possible.) In this connection it is natural to introduce the following definition: Definition 3.18 (Decomposition bundle). Let X be a metric space. Given a function " : XZ ! .0, 1, an "-padded decomposition bundle on X is a function ˇ : Z ! DX , where for every ` 2 Z, ˇ.`/ is a 2` -bounded "., `/-padded decomposition of X. Our next purpose is to construct a reasonably good (as we shall see later) decomposition bundle in an arbitrary finite metric space. Let .X, d / be a finite metric space, we use the notation B.x, r / for the set ¹y 2 X : d.x, y/  r º which is called a closed

Section 3.3 Padded decompositions in terms of ball growth

91

ball in X. In the case where no confusion can occur, we call B.x, r / a ball in X. We use the notation jB.x, r /j to denote the cardinality of the ball. Theorem 3.19. On an arbitrary finite metric space X there exists an "-padded decomposition bundle, where  1   jB.x, 2` /j . (3.5) ".x, `/ D 16 C 16 ln jB.x, 2`3 /j Proof. Let D 2` for some ` 2 Z. The corresponding stochastic decomposition ˇ.`/ is constructed in the following way. Choose, uniformly at random, a permutation    of X and a value ˛ 2 14 , 12 . Denote by P the corresponding probability measure (the product of two the uniform measure on all permutations and the Lebesgue   1 measures: 1 measure on 4 , 2 ). We may assume without loss of generality that X D ¹1, : : : , nº, where n is the cardinality of X, and so the inequality .y/ < .z/ for y, z 2 X makes sense. For each pair ., ˛/ in the constructed probability space we define the following decomposition of X. The number of clusters in each decomposition is n, but some (maybe many) of the clusters are empty. Clusters are labeled using points of X. For every point y 2 X, define a cluster   [ Cy D B.y, ˛ / n B.z, ˛ / . z: .z/< .y/

In words: a point x 2 X is assigned to Cy where y is the minimal point according to  that is within distance ˛ from x. Clearly P D ¹Cy ºy2X constitutes a decomposition of X. Furthermore, Cy  B.y, ˛ /, thus diam.Cy / < , so the diameter condition (a) in Definition 3.2 is satisfied for every decomposition P arising from this process. It remains to prove that the pad condition (b) in Definition 3.2 is also satisfied. Fix a point x 2 X and some value t  =8. We need to estimate from above the probability P of the event Bx,t consisting of (bad) decompositions P obtained in the described way and satisfying B.x, t / ª P .x/. It is clear that the event Bx,t occurs if and only if the decomposition ¹Cy ºy2X contains a cluster which intersects B.x, t /, but does not contain B.x, t /. Observe that clusters Cy cannot intersect B.x, t / if y … B.x, /. Let a D jB.x, =8/j, b D jB.x, /j. We arrange the points w1 , : : : , wb 2 B.x, / in order of increasing distance to x, resolving ties arbitrarily. Let Ek be the event consisting of decompositions for which the following two conditions are satisfied: (a) wk is the element with the minimal .wk / among those satisfying ˛  d.x, wk /  t . (b) ˛ < d.x, wk / C t .

92

Chapter 3 Constructions of embeddings

Condition (a) means that Cwk is the first cluster which could intersect B.x, t /, in the sense that there is no contradiction for the existence of such a nonempty intersection with the distance between the points, the values of t and ˛, and the triangle inequality. (Note that the intersection can still be empty.) Condition (b) is necessarily satisfied if Cwk does not contain B.x, t /. So informally (b) means that Cwk does not necessarily contain B.x, t /. To show that Bx,t is contained in [biDk Ek it suffices to establish that all decompositions in the complement of [biDk Ek are “good”. This immediately follows from the following description of this complement: it consists of partitions for which the set Cwk with the minimal .wk / among those which could intersect B.x, t / contains B.x, t /. Note that if wk 2 B.x, =8/, then PŒEk  D 0 since in this case d.x, wk / C t  =8 C t  =4 < ˛ . Therefore letting Ik D Œd.x, wk /  t , d.x, wk / C t / and denoting by PŒEk j ˛ 2 Ik  the corresponding conditional probabilities, we get PŒBx,t  

b X

PŒEk  D

kDaC1

b X kDaC1



b X kDaC1

PŒ˛ 2 Ik   PŒEk j ˛ 2 Ik    2t 1 8t b   1 C ln , =4 k a

(3.6)

where we used the fact that the event Ek does not contains pairs ., ˛/ for which  is such that .wj / < .wk / for some j < k. Therefore PŒEk j ˛ 2 Ik  does not exceed the ratio of the number of permutations of ¹w1, : : : , wb º which place numbers of the set ¹w1, : : : , wk1 º above .wk / and the number of all permutations. Since each of the numbers ¹.w1 /, : : : , .wk /º has equal chances of being the least, the ratio is equal to 1=k. Recall that D 2` . Setting t D ".x, `/  =8, where ".x, `/ is as in (3.5), the right-hand side of (3.6) is at most 12 , proving the pad condition (b) of Definition 3.2. Corollary 3.20. There exists an absolute constant 0 < C < 1 such that for every ` 2 Z there is a 1-Lipschitz map f` : X ! `2 satisfying kf` .x/k  2` for all x 2 X and kf` .x/  f` .y/k 



d.x, y/ `

jB.x,2 /j C 1 C ln jB.x,2 `3/j



(3.7)

for all x, y 2 X with d.x, y/ 2 Œ2` , 2`C1. In this proof one can recognize a modification of the proof of Theorem 3.4. Proof. Let ‰.`/ be an .".x, `/, 2` /-padded decomposition of X, where ".x, `/ is as in (3.5). Let P D ¹C1, : : : , Cm.P / º 2 supp.‰.`//. Denote by U1 , : : : , U2m.P / unions

93

Section 3.4 Gluing single-scale embeddings

of all possible collections of clusters of P . For each P 2 supp.‰.`// and each j 2 ¹1, 2, 3, : : : , 2m.P / º we introduce the function fP ,j : X ! R by ´ min¹d.u, XnP .u//, 2` º, if u 2 Uj fP ,j .u/ D 0, if u … Uj . Let fP be the embedding of X into `22

m.P /

defined by the formula m.P /

fP .u/ D ¹2m.P /=2 fP ,j .u/ºj2D1 . In the same way as in Theorem 3.4 we check that Lip.fP /  1. We concatenate the embeddings fP for P 2 supp.‰. // in the following way: We consider an embedding f` of X into the direct sum  m.P /  H` D ˚P 2supp.‰.`// `22 2 defined by

p f` .u/ D ˚P 2supp.‰.`// ‰.P /fP .u/.

As in Theorem 3.4 we get Lip.f` /  1 and kf` .x/k  2` . Let x, y 2 X be such that d.x, y/ 2 Œ2` , 2`C1 . For the same reason as in Theorem 3.4, the pad condition implies q 1 kf` .x/  f` .y/k  ‰¹P : d.x, XnP .x//  ".x, `/2` º  ".x, `/2` 2 ".x, `/2`  p 2 2 2`

 p jB.x,2` /j 2 2 16 C 16 ln jB.x,2 `3/j 

d.x, y/

. p jB.x,2` /j 64 2 1 C ln jB.x,2 `3/j

Remark 3.21. The same proof works to show the existence of a 1-Lipschitz map f` : X ! `p .1  p  1/ satisfying the conditions of Corollary 3.20.

3.4 Gluing single-scale embeddings In this section we are going to show that the problem of constructing a nontrivial embedding of a finite metric space to some extent can be reduced to that of handling a single scale in the following sense. Let .X, dX / and .Y , dY / be metric spaces, and let be a positive real number.

94

Chapter 3 Constructions of embeddings

Definition 3.22. If f : X ! Y is a 1-Lipschitz map such that for every x, y 2 X with d.x, y/ 2 Œ , 2 / the inequality dY .f .x/, f .y// 

K

holds, then we will call f a scale- embedding with deficiency K. Mappings constructed using padded decompositions, see (3.3) and (3.7), are of this type. Theorem 3.23 (Gluing lemma). Let X be a metric space with n elements. Suppose that for each m 2 Z, there exists a map 'm : X ! `p , 1  p < 1, satisfying k'm k  2m , which is a scale-2m embedding with deficiency K. Then cp .X/  1 n for some absolute constant C . C K 1 p log1=p 2 Remark 3.24. At the end of this section we shall show that the boundedness condition k'm k  2m is not needed in the Hilbert space case p D 2. Remark 3.25. It is worth mentioning that mappings constructed in the proofs of Theorem 3.4 and Corollary 3.20 satisfy the boundedness condition. Remark 3.26. In this section we use c, C , C1 , C2, : : : to denote absolute constants contained in .0, 1/. The constants denoted in the same way could be different in different places, sometimes we emphasize the fact that the constants are different by adding subscripts. Remark 3.27. Observe that for a finite metric space X the condition of being a scale2m embedding with deficiency K is restrictive only for finitely many values of m. We mean values of m for which there exist pairs x, y such that d.x, y/ 2 Œ2m , 2mC1 /. We call such values of m (or of 2m ) relevant scales. For other values of m any map, even having the same value at all points of X, is a scale-2m embedding with deficiency K. Let R be the number of relevant scales for X. Then the naïve concatenation used at the end of the proof of Theorem 3.4 (see page 83), applied to 'm corresponding to relevant scales, produces a map showing that cp .X/  CK  R1=p . This estimate is not satisfactory for our purposes in two respects (here we restrict our attention to the Euclidean distortion): (a) One can easily observe that the amount of relevant scales can be much larger than log2 n (see Example 3.28 below and Exercise 3.37 for more on this). So replacement of R by log2 n is essential. p (b) We would like to have K rather than K in our estimate for c2 .X/.

95

Section 3.4 Gluing single-scale embeddings

Example 3.28. Consider the following n-element subset of R: ¹0, 1, 1 C 2, 1 C 2 C 4, : : : , 1 C 2 C    C 2n2º, endowed with the usual metric inherited from R. It is easy to see that there are n  1 relevant scales. To deal with the issues mentioned above (in items (a) and (b)) we use the following two tools: (a) Instead of naïve (straightforward) concatenation used in the proof of Theorem 3.4, we form dlog2 ne  1 different weighted concatenations ¹ t º of ¹'m º. By a weighted concatenation we mean a concatenation of maps in which maps are multiplied by some coefficients (usually between 0 and 1) which are functions of x (x is an element of X). The weights are distributed in such a way that 

Each of the weighted concatenations solute constant.



The weighted concatenations k

t .x/ 

t

t

is C -Lipschitz where C is an ab-

are such that

t .y/k

 k'm .x/  'm .y/k

if d.x, y/ 2 Œ2m , 2mC1 / and t is in the intersection of the intervals 

\  log2 jB.y, 2m3 /j, log2 jB.y, 2mC3 /j . log2 jB.x, 2m3/j, log2 jB.x, 2mC3/j

Remark 3.29. One can easily check that the intersection is nonempty, but simple examples show that it can contain no integers. Such situations are handled in Lemma 3.31. (b) We form similar concatenation for the mappings constructed in Corollary 3.20. 1 This allows us to replace K by K 1 p in the estimate for the distortion. As we see from the last item, we plan to construct suitable weighted concatenations for maps which are not necessarily scale-2m embeddings with deficiency K. This is the purpose of our next lemma. For x, y 2 X, define ´ x if jB.x, 2m /j  jB.y, 2m /j m .x, y/ D y otherwise. Lemma 3.30. Let 1  p < 1. Given, for every m 2 Z a 1-Lipschitz map hm : X ! `p satisfying khm .x/k  2m for each x 2 X, there exists a map H : X ! `p which satisfies 1

(a) Lip.H /  C.log n/ p .

96

Chapter 3 Constructions of embeddings

(b) For every m 2 Z and every x, y 2 X with d.x, y/ 2 Œ2m , 2mC1 /, we have   p1 jB. m3 .x, y/, 2mC1 /j kH.x/  H.y/k  log2  khm .x/  hm .y/k. jB. m3 .x, y/, 2m3 /j (3.8) Proof. For each t 2 ¹1, 2, : : : , dlog2 ne  1º (recall that n is the cardinality of the metric space X) define R.x, t / D sup¹R : jB.x, R/j  2t º. Let  : R ! RC be a Lipschitz map satisfying (i) supp./  Œ24 , 24 , (ii) . / D 1 for 2 Œ23 , 23 , and (iii) 0  . /  1 for each . We may assume that  is a piecewise linear map with Lipschitz constant 24 . Define   R.x, t / . m,t D  2m Consider the `p direct sum ˚m2Z `p (we mean that Xm D `p for each m 2 Z), and define the weighted concatenation t : X ! ˚m2Z `p by t .x/

D ˚m2Z m,t .x/hm .x/,

and the map H as the straightforward concatenation of ¹ 1ºº.

t

: t 2 ¹1, 2, : : : , dlog2 ne

Bounding the Lipschitz constant. Observe that for every t the map x 7! R.x, t / is 1-Lipschitz. Since the Lipschitz constant of a composition does not exceed the product of Lipschitz constants, we have Lip.m,t /  24m . We also have X k t .x/  t .y/kp D km,t .x/hm .x/  m,t .y/hm .y/kp . (3.9) m2Z

° For a± fixed t the value of R.x, t / is just a number and therefore the numbers R.x,t / are in the support of  for at most 9 different values of m. Therefore 2m m2Z for each t there are at most 18 nonzero terms in the sum (3.9). To bound a nonzero summand in (3.9) we use km,t .x/hm .x/  m,t .y/hm .y/k  khm .x/k  jm,t .x/  m,t .y/j C jm,t .y/j  khm .x/  hm .y/k  2m  24m d.x, y/ C d.x, y/  17d.x, y/. Since there are at most 18 nonzero terms in the sum (3.9) we get k 1 C d.x, y/. The estimate Lip.H /  C.log2 n/ p follows.

t .x/ 

t .y/k



Proving (3.8). Let x, y 2 X be such that d.x, y/ 2 Œ2m , 2mC1 /. Notice that if m,t .x/ D m,t .y/ D 1, then k

t .x/



t .y/k

 khm .x/  hm .y/k.

So to get a lower estimate for kH.x/  H.y/k we need to estimate from below the number of values t 2 ¹1, 2, : : : , dlog2 ne  1º for which m,t .x/ D m,t .y/ D 1.

97

Section 3.4 Gluing single-scale embeddings

Observe that m,t .x/ D 1 if R.x, t / 2 Œ2m3 , 2mC3 , and this happens if   t 2 log2 jB.x, 2m3 /j, log2 jB.x, 2mC3 /j . Similarly, m,t .y/ D 1 if   t 2 log2 jB.y, 2m3 /j, log2 jB.y, 2mC3 /j (the right ends of the intervals are not included). Assume without loss of generality that x D m3 .x, y/ ( is defined in the paragraph preceding the statement of Lemma 3.30), so that jB.x, 2m3 /j  jB.y, 2m3 /j. Then   t 2 log2 jB.x, 2m3 /j, log2 jB.x, 2mC1 /j implies m,t .x/ D m,t .y/ D 1. To see this recall that R., t / is 1-Lipschitz and d.x, y/ < 2mC1 . Therefore R.x, t /  2mC1 implies R.y, t / < 2mC3 (and even R.y, t / < 2mC2 /. Hence the number of values of t 2 ¹1, : : : , dlog2 ne  1º for which m,t .x/ D m,t .y/ D 1 is at least 

 jB.x, 2mC1 /j . log2 jB.x, 2m3 /j

We conclude that  p

kH.x/  H.y/k 

 jB.x, 2mC1 /j log2 khm .x/  hm .y/kp . jB.x, 2m3 /j

It is clear that inequality (3.8) is of no significance in the case where  log2

jB.x, 2mC1 /j jB.x, 2m3 /j

 < 1,

and in this case we have to do something different. We do this under a weaker assump m1  /j < 1 in our next lemma. tion log2 jB.x,2 jB.x,2m2/j Lemma 3.31. For each p 2 Œ1, 1/ there exists a map G : X ! `p such that 1

(a) Lip.G/  dlog2 ne p . (b) The condition kG.x/  G.y/k  cd.x, y/ holds for all m 2 Z and all x, y 2 X satisfying d.x, y/ 2 Œ2m , 2mC1 / and  m1/j  < 1, log2 jB.x,2 jB.x,2m2/j

98

Chapter 3 Constructions of embeddings

Proof of Lemma 3.31. For each t 2 ¹1, 2, : : : , dlog2 neº, consider a random subset of X which contains each point of X independently with probability 2t . So the  jXjk probability of a subset W  X with k elements is 2t k 1  2t . Denote by .P.X/,  t / the probability space, where P.X/ is the set of all subsets of X and  t is the measure described above. For each t 2 ¹1, 2, : : : , dlog2 neº we introduce a map g t : X ! Lp .P.X/,  t / given by .g t .x//.W / D d.x, W / for a subset W 2 P.X/. We concatenate all of these maps, that is, we consider the map G : X ! .˚ t Lp .P.X/,  t //p given by G.x/ D ˚ t g t .x/. Straightforward verification shows that Lip.g t /  1 and 1 Lip.G/  dlog2 ne p .  m1  /j Fix x, y 2 X such that d.x, y/ 2 Œ2m , 2mC1 / and log2 jB.x,2 < 1. Let jB.x,2m2 /j s m1 sC1 /j  2 . s 2 ¹0, 1, 2, : : : , dlog2 ne  1º be such that 2  jB.x, 2 We introduce the following four subsets of P.X/: 

x Efar D ¹W 2 P.X/ : d.x, W /  2m1 º



x Eclose D ¹W 2 P.X/ : d.x, W /  2m2 º.



Eclose D ¹W 2 P.X/ : d.y, W / 



Efar D P.X/nEclose .

y y

3 2

 2m2 º.

y

x x Observe that for each of the probability measures  t , each of the events Efar , Eclose y x x is independent of the event Eclose , because the events Efar , Eclose depend on presence of y elements of balls B.x, 2m1 / and B.x, 2m2 / in the subset W , the event Eclose depends 3 on the presence of elements of the ball B.y, 2  2m2 / in the subset W , and the mentioned balls centered at x are disjoint with B.y, 32  2m2 / because of the assumption d.x, y/  2m .  m1 /j  < 1 implies that jB.x, 2m2 /j > Observe that the assumption log2 jB.x,2 jB.x,2m2 /j n ¯ ® 2s1 . Therefore (using the well-known fact that the sequence 1 n1 is increasing n 1 and tends to e ) r  x  1 s 2s1 s Eclose  1  .1  2 / , >1 e  x 1 sC1 s Efar  .1  2s /2 .  16

Section 3.4 Gluing single-scale embeddings

99

We get kG.x/  G.y/kp D

XZ Z



jg t .x/  g t .y/jp d t

t

y

x Eclose \Efar

jgs .x/  gs .y/jp ds

Z

C

y x Efar \Eclose

jgs .x/  gs .y/jp ds

  y y x x  .2m3 /p s .Eclose /s .Efar / C .1  s .Eclose //s .Eclose /  c.2m /p . Proof of Theorem 3.23. We apply Lemma 3.30 to two collections of maps: The collection ¹'m º whose existence is assumed in Theorem 3.23 and the collection ¹f` º whose existence was proved in Corollary 3.20, see also Remark 3.21. We denote the obtained maps ˆ : X ! `p and F : X ! `p , respectively. We concatenate the maps ˆ, F , and G, and get a map ‰ : X ! .`p ˚ `p ˚ `p / given by ‰.x/ D ˆ.x/ ˚ F .x/ ˚ G.x/. Using property (a) in Lemma 3.30 and 1 property (a) in Lemma 3.31 we get Lip.‰/  C.log2 n/ p . Now we turn to the lower bound. Fix x, y 2 X, let m be such that d.x, y/ 2 m Œ2 , 2mC1 /. Assume without loss of generality that x D m3 .x, y/ and set  A D log2

 jB.x, 2mC1 /j . jB.x, 2m3 /j

We have k‰.x/  ‰.y/kp D kF .x/  F .y/kp C kˆ.x/  ˆ.y/kp C kG.x/  G.y/kp . In the case where A < 1 we have, by Lemma 3.31, kG.x/  G.y/k  c2m . In the case where A  1 we use inequality (3.8) and get:   bAc 1 p  2m . kF .x/  F .y/k  p C .1 C A/p The constant C here is different from the constant in (3.7), because we use logarithms to different bases (here we recall our agreement in Remark 3.26).   bAc  2m . kˆ.x/  ˆ.y/kp  Kp

100

Chapter 3 Constructions of embeddings bAc C .1CA/p 1 K, the second term  K p1 .

To get the desired estimate it remains to observe that for A  1 we have bAc Kp



c K p1

for some absolute constant c > 0. (If bAc  ± ° K 1 If 1  bAc  K, the first term  min .2CK/ p , .2C1/p ). Thus k‰.x/  ‰.y/kp 

c K p1

2m ,

and the conclusion follows. Theorem 3.23 has the following important consequence, due to Bourgain. Historically this consequence preceded Theorem 3.23, which is an outgrowth of it. Corollary 3.32. c2 .X/  C log2 n for each n-element metric space X. Proof. Corollary 3.20 implies that for each n-element metric space there exists and each ` 2 Z there exists a scale-2` embedding f` into `2 with deficiency  C.1 C log2 n/ satisfying kf` .x/k  2` for all x 2 X. Combining this with Theorem 3.23 we get the desired result. We shall see in the next chapter that the estimate of Corollary 3.32 is sharp, up to a value of the constant. Now we show how to prove the statement made in Remark 3.24. It follows immediately if we combine Theorem 3.23 with the next lemma. Given a metric space M and D > 0, we denote by M D the metric space M with a new distance dM D .x, y/ D min¹dM .x, y/, Dº. q   q e D e  . In fact, ` Lemma 3.33. For every D > 0, c2 `D 2 2 e1 e1 -embeds into an `2 -sphere. Here we sketch a proof which looks like a trick. In Notes and Remarks we describe how this lemma can be almost immediately derived from the theory of embeddability into a Hilbert space developed by Schoenberg. Proof. Let ¹gi º1 i D1 be independent identically distributed standard Gaussian random variables defined on some probability space . Let H D L2./ be the Hilbert space consisting of all complex valued square integrable functions on . Let K > 0. Define F : `2 ! H by: 

 1 i X xj gj .!/ , .F .x1 , x2, : : ://.!/ D K exp K j D1

! 2 .

101

Section 3.4 Gluing single-scale embeddings

Clearly kF .x/k2 D K for every x 2 `2 . Observe that for every x, y 2 `2 , j.F .x//.!/  .F .y//.!/j2 D ˇ   X ˇ2  X 1 1 ˇ ˇ i i 2ˇ xj gj .!/  exp yj gj .!/ ˇˇ D K ˇ exp K K j D1

j D1

ˇ   X  ˇ2  X 1 1 ˇ ˇ i i D K 2 ˇˇ exp yj gj .!/ exp .xj  yj /gj .!/  1 ˇˇ K K j D1

j D1

ˇ ˇ2   X 1 ˇ ˇ i 2ˇ D K ˇ exp .xj  yj /gj .!/  1ˇˇ K D 2K

2

j D1



 1 1 X 1  cos .xj  yj /gj .!/ . K j D1

Now,

P1

j D1 .xj  yj /gj has the same distribution as g1

qP

1 j D1 .xj

 yj /2 . Hence:

h g

i 1 kx  yk2 . EjF .x/  F .y/j2 D 2K 2 1  E cos K   g1 Observe that by symmetry, E sin K kx  yk2 D 0, so that: E cos

g

1

K

kx  yk2

  g

kx  yk22 1 D E exp i kx  yk2 D exp  , K 2K 2 2

where we use the fact that Ee i ag1 D e a =2 (this fact can be derived using our computation on page 15). Putting it all together, we have shown that: r kxyk22 p kF .x/  F .y/k2 D 2K 1  e  2K 2 . Using the elementary inequality: e1 min¹1, aº  1  e a  min¹1, aº a > 0, e we deduce that: r p p e1 min¹ 2K, kx  yk2 º  kF .x/  F .y/k2  min¹ 2K, kx  yk2º. e p Letting K D D= 2 we get the desired result.

102

Chapter 3 Constructions of embeddings

3.5 Exercises Exercise 3.34. Introduce graphs Rn,k as graphs whose vertex sets are subsets in Rn with all n coordinates in the set ¹1, : : : , kº and the edge set is determined by the condition: u and v are joined by an edge if and only if ku  vk1 D 1 (where k  k1 is the norm of `n1 ). So it is natural to call Rn,k n-dimensional cubic grids. Show that Kr is a minor of R3,r for each r 2 N (and so 3-dimensional grids are not minor-excluded). See the Hints to exercises section if needed. An infinite graph is called locally finite if degrees of all vertices are finite. Exercise 3.35. Use the techniques developed in Theorems 3.4 and 3.14 to prove the following result: Let r 2 N and let G be an infinite locally finite unweighted connected graph which does not have Kr -minors, let dG be the graph distance on G. Then .G, dG / embeds coarsely into L1 . See the Hints to exercises section if needed. Exercise 3.36. If X is an unweighted graph with its graph distance and with n vertices. Show that the number of relevant scales in the sense of Remark 3.27 does not exceed blog2 nc. Exercise 3.37 (Possibly the exact evaluation is an open problem). Let X be an n-element metric space. How large can the number of relevant scales be? The general case of Exercise 3.37 could be difficult. Therefore I suggest to start by trying to get some partial results: (a) The number of relevant scales cannot reach the for nontrivially large values of n. (b) The number of relevant scales can value n.n1/ 2 be more than n. One of the ways of proving (a) and one of the ways of proving (b) are outlined in the Hints to exercises section. Exercise 3.38. Prove the statement made in Remark 3.29 and give an example whose existence is claimed in the mentioned remark.

3.6 Notes and remarks The embeddings of the type described at the beginning of this chapter are named after Fréchet, who introduced them in [155, pp. 161–162] to prove Proposition 1.17. The first result on embeddability of general n-element metric space into a Hilbert space is due to Bourgain [75]. The upper estimates for distortion found in [75] turned out to be optimal, up to a multiplicative constant. Bourgain’s ideas were further elaborated on by Matoušek in [311]. The book [314] (see also an updated version [315] of

Section 3.6 Notes and remarks

103

the chapter on metric spaces) contains a nice presentation of Bourgain’s embeddability theorem. For this reason I decided not to present Bourgain’s proof of cp .X/  O.ln jXj/ here. We derive Bourgain’s estimate as a corollary of a theory which has grown out of [75]. (It it worth mentioning that there is a slight error in the proof of [314, Lemma 15.7.2]: rj should be defined as the minimum of rq and of the current definition. See [315] for a corrected proof.) Padded decompositions. The concept of a padded decomposition first emerges in the paper of Rao [389]. Later this concept was used and studied by several other authors, see [143], [189], [254], [255], [275], [277]. The term padded decomposition was suggested in (the conference version (2003) of) the paper of Krauthgamer and Lee [254]. Theorem 3.4 goes back to Rao [389]. The theory of graph minors is a very important part of graph theory, an introduction to this theory can be found in [122]. Padded decompositions of minor-excluded graphs were discovered in [389]. They were based on the decomposition procedure suggested by Klein, Plotkin, and Rao [251]. Rao’s estimates for parameters of padded decompositions of minor-excluded graphs were improved by Fakcharoenphol and Talwar [143]. Our presentation is very close to the proof in [143]. Padded decompositions in terms of ball growth are constructed using an approach originally suggested by Calinescu, Karloff, and Rabani [88], the analysis of which was improved on by Fakcharoenphol, Rao, and Talwar [142]. Our presentation follows the work of Krauthgamer, Lee, Mendel, and Naor [255], and the extension to general measures was observed in [277]. Section 3.4 is based on the paper of Lee [273]. The corresponding part of [273] is a development of ideas of [255]. See [94] and [25] for related developments. Lemma 3.31 is a piece of Bourgain’s argument [75]. Lemma 3.33 and its proof are from [325, Lemma 5.2]. It is worth mentioning that Lee [273, Section 1.2] conjectured that one p can achieve, in the Euclidean case, the estimate .K C log n/ in Theorem 3.23 instead p of K log n, but this conjecture was disproved by Jaffe, Lee, and Moharrami [210] p who have proved that the bound K log n is tight. The proof that we mentioned before the proof of the truncation Lemma 3.33 is the following: The main of Lemma 3.33 is to show that if we endow p step in the proof 2 kxyk 2 , the obtained metric space embeds isometrically `2 with the metric 1  e in Hilbert space. This follows from a classical characterization of Schoenberg [407] of isometric embeddability into a Hilbert space in terms of negative definite kernels, see [55, Chapter 8, Section 1] for a compact and accessible presentation of the corresponding results. Corollary 3.32 is from [75], see the beginning of these Notes and Remarks for related information.

104

Chapter 3 Constructions of embeddings

3.7 Hints to exercises To Exercise 3.34. Use the following description of minors. Let us introduce the k-th level in R3,r as the set of vertices whose third coordinate is k. Let us introduce the k-th slice in R3,r as the set of vertices whose second coordinate is k. We split the r -th slice into r vertical paths and introduce the branch sets B1 , : : : , Br as follows: B1 consists of vertices of first level, except vertices of the r -th slice, and of vertices of the first vertical path. B2 consists of vertices of second level, except vertices of the r -th slice, and of vertices of the second vertical path. We continue in the obvious way: Bk , 3  k  r consists of vertices of level k, except vertices of the r -th slice, and of vertices of the k-th vertical path. Show that the the obtained sets B1 , : : : , Br are branch sets of a Kr -minor. To Exercise 3.35. See the arXiv version of [355] if you have difficulties (the journal version does not contain the proof based on the suggested techniques). To Exercise 3.37. (a) Assume that there are n.n1/ different distances. Consider the 2 metric space as a weighted graph Kn . Pick in this graph a subgraph isomorphic (as an unweighted graph) to K1,n1 . Pick in the class containing n  1 vertices two vertices with the smallest distances to the only vertex in the other class. Analyze the possible values of the distances between these two vertices and the other vertices in the class containing .n  1/ vertices. (b) Consider the graph K1,n1 with the following weights: 1, 22  ", 24  ", 26  ", : : : , 22n4  ", where " 2 .0, 1/.

Chapter 4

Obstacles for embeddability: Poincaré inequalities

The purpose of this chapter is to develop some techniques for estimates of distortion cY .X/ from below.

4.1 Definition of Poincaré inequalities for metric spaces We start with a simple example: consider a 4-cycle C4 and label its vertices in the cyclic order: v1, v2 , v3 , v4 . We are going to show that the Euclidean distortion of C4 can be estimated using the following inequality kf .v1/  f .v3 /k2 C kf .v2 /  f .v4 /k2  kf .v1/  f .v2 /k2 C kf .v2 /  f .v3 /k2 2

(4.1) 2

C kf .v3 /  f .v4 /k C kf .v4/  f .v1 /k , which holds for an arbitrary collection f .v1 /, f .v2 /, f .v3 /, f .v4 / of elements of a Hilbert space. Proof of (4.1). We use the identity ka  bk2 D kak2  2ha, bi C kbk2 for each of the terms in (4.1). Then we move everything to the right-hand side and observe that the obtained inequality can be written in the form 0  kf .v1/  f .v2 / C f .v3 /  f .v4 /k2. We postpone the computation of c2 .C4/ slightly, introducing some terminology first. Inequality (4.1) can be considered as one of the simplest Poincaré inequalities for embeddings of metric spaces. Definition 4.1. Let .X, dX / and .Y , dY / be metric spaces, ‰ : Œ0, 1/ ! Œ0, 1/ be a non-decreasing function, au,v , bu,v .u, v 2 X/ be finitely nonzero arrays of nonnegative real numbers (this condition definitely holds if X is finite). If for an arbitrary function f : X ! Y the inequality X X au,v ‰.dY .f .u/, f .v///  bu,v ‰.dY .f .u/, f .v/// (4.2) u,v2X

u,v2X

holds, we say that Y -valued functions on X satisfy the Poincaré inequality (4.2).

106

Chapter 4 Obstacles for embeddability: Poincaré inequalities

Observe that in this inequality the structure of X plays no role, we use X just as a set of labels for elements f .u/ 2 Y . On the other hand it is worth mentioning that the metric structure of X usually plays a significant role in our choice of the arrays au,v , bu,v . The inequality (4.2) is useful for the theory of embeddings only if a similar inequality does not hold for the identity map on X, that is, if X X au,v ‰.dX .u, v// < bu,v ‰.dX .u, v//. (4.3) u,v2X

u,v2X

In such a case we get immediately that X is not isometric to a subset of Y . Definition 4.2. We call the quotient P

bu,v ‰.dX .u, v// u,v2X au,v ‰.dX .u, v//

Pu,v2X

the Poincaré ratio of the metric space X corresponding to the Poincaré inequality (4.2) and denote it Pa,b,‰.t / .X/. Having more information on the values of sides of (4.3) and on the function ‰, we can get an estimate for the distortion cY .X/ and some information on the functions 1 and 2 from Definition 1.45 for any coarse embedding of X into Y . The corresponding estimate of cY .X/ is quite simple if ‰.t / D t p for some p > 0. Proposition 4.3. If Y -valued functions on X satisfy the Poincaré inequality (4.2) with ‰.t / D t p , then  1=p . (4.4) cY .X/  Pa,b,t p .X/  1=p Proof. In fact, assume the contrary. Then there exists D < Pa,b,t p .X/ , an embedding f : X ! Y , and r > 0 such that 8u, v 2 X,

rdX .u, v/  dY .f .u/, f .v//  rDdX .u, v/.

(4.5)

We get X

(4.5)

bu,v .dX .u, v//p 

u,v2X

1 X bu,v .dY .f .u/, f .v///p rp u,v2X

1 X  p au,v .dY .f .u/, f .v///p r

(4.2)

u,v2X

(4.5)

 Dp

X

au,v .dX .u, v//p .

u,v2X

1=p  we get a contradiction with the Combining this inequality with D < Pa,b,t p .X/ definition of the Poincaré ratio.

Section 4.2 Poincaré inequalities for expanders

107

Example 4.4. Now we are ready to estimate c2 .C4 /. It is clear that (4.1) is a Poincaré inequality for `2 -valued functions on C4 (more precisely: for `2 -valued functions on V .C4/). The corresponding Poincaré ratio is: dC4 .v1 , v3 /2 C dC4 .v2 , v4/2 D 2. dC4 .v1 , v2 /2 C dC4 .v2 , v3 /2 C dC4 .v3 , v4/2 C dC4 .v4 , v1 /2 p By Proposition 4.3 we get c2 .C4 /  2. This estimate is sharp, as it can be shown by considering an embedding whose image is the set .0, 0/, .0, 1/, .1, 1/, .1, 0/ 2 `22 . An important example of a Poincaré inequality is given in the next section. A very interesting fact is that the Lp -distortion can be characterized in terms of Poincaré ratios, see Section 4.3 in this chapter.

4.2 Poincaré inequalities for expanders In this section we introduce a very important notion of a family of expander graphs (or expanders), establish a very useful Poincaré inequality for L1-valued functions on expander graphs, and derive corollaries on the L1-distortion and coarse embeddability of families of expander graphs. Definition 4.5. For a graph G with vertex set V and a subset F  V by @F we denote the set of edges connecting F and V nF . The expanding constant (also known as Cheeger constant and as conductance) of G is ³ ² j@F j : F  V , 0 < jF j < C1 h.G/ D inf min¹jF j, jV nF jº (where jAj denotes the cardinality of a set A). A sequence ¹Gn º of graphs is called a family of expanders if all of Gn are finite, connected, k-regular for some k 2 N (this means that each vertex is incident with exactly k edges), their expanding constants h.Gn / are bounded away from 0 (that is, there exists " > 0 such that h.Gn /  " for all n), and jV .Gn /j ! 1 as n ! 1. Remark 4.6. It is far from being obvious that families of expanders exist. We will present several constructions of families of expanders in Chapter 5. In this chapter and in Chapter 7 we explain why families of expanders are important for metric geometry. The following is a Poincaré inequality for L1-valued functions on a vertex set of a graph. We denote the adjacency matrix of a graph G D .V , E/ by ¹au,v ºu,v2V , that is ´ 1 if u and v are adjacent au,v D 0 otherwise. Let h be the expanding constant of G.

108

Chapter 4 Obstacles for embeddability: Poincaré inequalities

Theorem 4.7. The following Poincaré inequality holds for L1-valued functions on V : X

X

au,v kf .u/  f .v/k 

u,v2V

h kf .u/  f .v/k. jV j

u,v2V

(4.6)

Lemma 4.8. Let G D .V , E/ be a connected graph with the expanding constant h, and f : V ! R be a real-valued function on V . Then X

jf .v/  M j 

v2V

1 X jf .u/  f .v/j, h

(4.7)

uv2E

where M is a median of the set ¹f .v/ºv2V . Recall that a median of a finite set of real numbers is a number M such that at least half of the numbers are  M and at least half the numbers are  M ; note that the median may be non-unique in some cases. Proof. Replacing f by fz D f  M , we may assume that M D 0. Also we assume (for simplicity) that the number of vertices is odd. (Only a slight modification is needed when the number of vertices is even.) Let f1  f2      fk  0 D fkC1  fkC2      f2kC1 be the values of the function. Then X

jf .v/j D

v2V

2kC1 X

jfi j.

i D1

We introduce level sets of the function f as L i :D ¹v : f .v/  fi º,

i D 1, : : : , k,

and LC i D ¹v : f .v/  fi º,

i D k C 2, : : : , 2k C 1.

We define also fi :D fi C1  fi , i D 1, : : : , k and fir :D fi  fi 1 , i D k C 2, : : : , 2k C 1. We have 2kC1 X i D1

jfi j D

k X i D1

jL i jfi C

2kC1 X i DkC2

r jLC i jfi .

(4.8)

109

Section 4.2 Poincaré inequalities for expanders

C Cardinalities of the sets L i and Li do not exceed k. Observe that the definition of the expanding constant implies j@F j  h.G/jF j for each F with jF j  jV j=2. Hence we have k X

jL i jfi C

i D1

2kC1 X

r jLC i jfi 

k X 1 i D1

i DkC2

1 D h 1 D h

h

[email protected] i /jfi C

X k

2kC1 X i DkC2

[email protected] i /jfi

i D1

C

1 r [email protected] i /jfi h

2kC1 X

 r [email protected] i /jfi

(4.9)

i DkC2

X

jf .u/  f .v/j.

uv2E.G/

To show the last equality, we need to observe that the contribution of the edge uv to the sum X  k 2kC1 X  C r [email protected] /jfi C [email protected] /jfi i D1

i DkC2

is equal to jf .u/  f .v/j. Proof of Theorem 4.7. Since continuous functions are dense in L1.0, 1/, it suffices to prove the inequality (4.6) in the case where the functions f .u, t / are continuous as functions of t , and so f .u, t / is well-defined for all t 2 Œ0, 1. For each t 2 Œ0, 1 we let M.t / be a median of the set ¹f .u, t /ºu2V . It is easy to show that the medians can be selected in such a way that M.t / is a continuous function on Œ0, 1. Applying Lemma 4.8 for each value of t , we get X X jf .u, t /  f .v, t /j  h jf .v, t /  M.t /j. uv2E

v2V

Integrating this inequality over Œ0, 1 we get X X kf .u/  f .v/k  h kf .v/  M k. uv2E

(4.10)

v2V

By the triangle inequality we have kf .u/  f .v/k  kf .u/  M k C kf .v/  M k. Therefore X u,v2V

X X h kf .u/  M k C h kf .v/  M k. kf .u/  f .v/k  h jV j u2V

v2V

Combining this inequality with (4.10) and the definition of the adjacency matrix we get (4.6).

110

Chapter 4 Obstacles for embeddability: Poincaré inequalities

The Poincaré inequality (4.6) can be used to get an estimate for L1-distortion of a k-regular graph with expanding constant h. In fact, to estimate such distortion from below we need to estimate from below the corresponding Poincaré ratio: P h u,v2V jV j dG .u, v/ P . (4.11) u,v2V au,v dG .u, v/ The denominator of the ratio is equal to 2jEj, where jEj is the number of edges in G. Since the graph is k-regular, we have 2jEj D kjV j. On the other hand, the number of vertices at distance D to a given vertex in a kregular graph is at most 1 C k C k.k  1/ C    C k.k  1/D1  k D C 1.

Let D D logk jV2 j  1 . Then there are at most jV2 j vertices with distance  D to a given vertex. Therefore   X h jV j h jV j2 dG .u, v/    logk 1 jV j jV j 2 2 u,v2V

and the Poincaré quotient (4.11) is h logk  2k



 jV j  1 D .ln jV j/. 2

(Here and in some other places we use the standard for computer science notation: f D .g/ means g D O.f /.) We get that distortions of members of a family of expanders grow as logarithms of their sizes. So the estimates for distortions obtained in Chapter 3 (see, for example, Corollary 3.32) have the best possible dependence on the size of the metric space. The Poincaré inequality (4.6) can be used to prove the following result on coarse non-embeddability of expanders. Theorem 4.9. Families of expanders are not uniformly coarsely embeddable into L1. Proof. Let G D .V , E/ be a finite graph and h be its expanding constant. Suppose that an embedding f : V ! L1 satisfies 8u, v 2 V

1.dG .u, v//  kf .u/  f .v/k  2.dG .u, v//

Combining this inequality with (4.6) we get X u,v2V

X h 1.dG .u, v//  au,v kf .u/  f .v/k  kjV j2.1/. jV j u,v2V

Section 4.2 Poincaré inequalities for expanders

111

Now we recall that 1 is non-decreasing and that we have already proved that for 2 at least jV2j out of jV j2 terms in the left-hand side of the last inequality we have   dG .u, v/  logk jV2 j  1 . We get jV j2 h   1 .logk 2 jV j



 jV j  1 /  kjV j2 .1/ 2

or  1 .logk

 jV j 2k2.1/ 1 /  . 2 h

(4.12)

Now we consider a family ¹Gnº1 nD1 of expanders, Gn D .Vn, En /. It is clear that a function 1 satisfying lim t !1 1 .t / D 1 cannot satisfy the inequality (4.12) for a sequence ¹jVnjº1 nD1 with jVn j ! 1 (if we plug each jVn j instead of jV j). Corollary 4.10. A metric space containing isometric copies of all graphs of some family of expanders with their graph distances does not admit a coarse embedding into L1 . It is not difficult to construct spaces satisfying this condition provided we have a family of expanders. In fact, we can just build an infinite connected graph .V , E/ out of Gn D .Vn , En / by letting V to be the disjoint union of Vn and letting E to be the union of En plus a sequence ¹enº1 nD1 of edges, where en is an edge joining a vertex of Vn with a vertex of VnC1. An important property of this space is that it has bounded geometry in the sense of Definition 1.66 (see page 31). Remark 4.11. Since (as we know from Proposition 1.21) finite subsets of `2 are isometric to finite subsets of L1 , Theorem 4.9 implies that families of expanders are not uniformly coarsely embeddable into `2 . We use repeatedly the following observation. Observation 4.12. A separable metric space is coarsely embeddable into L1 if and only if it is coarsely embeddable into a Hilbert space. Proof. The “only if” part follows from the fact that L1 is coarsely embeddable into `2 (see Example 1.40). The “if” part follows from the fact that `2 admits a linear bilipschitz (= isomorphic), and even a linear isometric, embedding into L1 . The isometric version requires some knowledge of probability theory and measure theory. Its proof: we consider a sequence ¹gi º1 i D1 of independent normal (Gaussian) random variables with 0 mean and the same variance. They belong to L1 . Well-known properties of Gaussian random variables imply that the linear span of ¹gi º1 i D1 in L1 is isometric to `2 .

112

Chapter 4 Obstacles for embeddability: Poincaré inequalities

The existence of an isomorphic embedding of `2 into L1 (and actually any Lp .1  p < 1/) follows from the Khinchin inequality (Theorem 2.51). The Khinchin inequality immediately implies that the linear extension of the mapping ek 7! rk , where ¹ek º1 is the unit vector basis in `2 and ¹rk º are Rademacher functions conkD1 sidered as elements of L1, to the linear span of unit vectors in `2 admits a continuous extension to `2 . This extension is an isomorphic embedding of `2 to L1. Later we shall need the following corollary which can be proved using the techniques of this section. Corollary 4.13 (Of the proof of Theorem 4.7). Let G be a k-regular graph with expanding constant h and f : V .G/ ! L1 be an embedding. Then ˇ° ±ˇ 3jV .G/j 2k ˇ ˇ 9x 2 L1 ˇ s 2 V .G/ : kf .s/  xk  Lip.f / ˇ  . (4.13) h 4 Proof. Observe that kf .u/  f .v/k  Lip.f / if uv 2 E.G/. Hence the left-hand side of (4.10) does not exceed jV2jk Lip.f /. By (4.10) we have X v2V

kf .v/  M k 

jV jk Lip.f /. 2h

Hence the amount of vertices v for which kf .v/  M k > The conclusion follows with x D M .

2k h

Lip.f / is less than

jV j 4 .

4.3 Lp -distortion in terms of constants in Poincaré inequalities The purpose of this section is to show that the Lp -distortion cp .X/ of a finite metric space can be characterized in terms of Poincaré inequalities of the form (4.2) with ‰.t / D t p . Theorem 4.14. Let .X, dX / be a finite metric space. Then the Lp -distortion cp .X/ is equal to the supremum of constants C for which there exist arrays au,v and bu,v .u, v 2 X/ of nonnegative and not-all-zero real numbers such that X X bu,v .dX .u, v//p  C p au,v .dX .u, v//p , (4.14) u,v2X

u,v2X

and for any map f : X ! Lp the inequality X X au,v kf .u/  f .v/kp  bu,v kf .u/  f .v/kp u,v2X

holds.

u,v2X

(4.15)

Section 4.3 Lp -distortion in terms of constants in Poincaré inequalities

113

Proof of Theorem 4.14. We need to show that cp .X/  C for each C satisfying the condition above and that for each C < cp .X/ we can find au,v and bu,v satisfying (4.14) and (4.15). The first part is completely straightforward: if cp .X/ < C , then there exist " > 0 and a map f : X ! Lp such that 8u, v 2 X

dX .u, v/  kf .u/  f .v/k  .C  "/dX .u, v/

Combining this inequality with (4.14) and (4.15), and using non-negativity of au,v and bu,v we get X X X au,v .dX .u, v//p  bu,v .dX .u, v//p  bu,v kf .u/  f .v/kp Cp u,v2X

u,v2X



X

u,v2X

au,v kf .u/  f .v/kp

u,v2X

 .C  "/p

X

au,v .dX .u, v//p ,

u,v2X

a contradiction. Now let C < cp .X/, we have to find arrays au,v and bu,v such that inequalities (4.14) and (4.15) hold. Let P be the set of all unordered pairs of distinct elements of X, we denote the linear space of all real-valued functions on P by RP . It is clear that each semimetric on X can be regarded as an element of RP . Let Lp  RP be the cone of the p-th powers of Lp -semimetrics on X, that is ¯ ® Lp D .kf .u/  f .v/kp /uv2P : f : X ! Lp  RP . Observe that Lp is a convex cone, it is closed with respect to sums because we can concatenate embeddings. We mean the following: if f1 : X ! Lp .1 / and f2 : X ! Lp .2 / are two embeddings, we define f : X ! Lp .1 [ 2 / (where 1 and 2 are assumed to be disjoint) by f .u/j1 D f1 .u/ and f .u/j2 D f2 .u/. It is clear that kf .u/  f .v/kp D kf1.u/  f1 .v/kp C kf2 .u/  f2.v/kp . (Here it is worthwhile to recall that each separable space Lp ., †, / is isometric to a subspace of Lp .0, 1/, see Fact 1.20.) Since C < cp .X/ the space .X, dX / does not admit a C -bilipschitz embedding into Lp . We introduce the set K D ¹.xuv /uv2P 2 RP : 9r > 0 8u, v

r .dX .u, v//p  xuv  r C p .dX .u, v//p º.

The set K includes the p-th powers of all semimetrics arising by C -bilipschitz embeddings of .X, dX /. On the other hand, not all elements of K are semimetrics, since the triangle inequality may be violated. Since there is no C -bilipschitz embedding of .X, dX / into Lp , we have K \ Lp D ;. Both K and Lp are convex sets in RP , and so they can be separated by a hyperplane. Moreover, since Lp and K [ ¹0º are cones, the

114

Chapter 4 Obstacles for embeddability: Poincaré inequalities

separating hyperplane should be of the form ¹x : hs, xi D 0º for some s 2 RP , s ¤ 0. So let s 2 RP be such that hs, xi  0 for all x 2 K, and hs, xi  0 for all x 2 Lp . Using the vector s, we define the desired arrays au,v and bu,v as follows ² if suv > 0, suv bu,v D 0 otherwise; ² if suv < 0, suv au,v D 0 otherwise. It remains to show that these arrays satisfy the inequalities (4.14) and (4.15). The condition hs, xi  0 for all x 2 Lp means that X suv kf .u/  f .v/kp for each f : X ! Lp . 0 uv2P

Moving those terms for which suv < 0 to the left-hand side, we get (4.15). To get (4.14) we apply the condition “hs, xi  0 for all x 2 K” to the following sequence in K: ´ if suv  0, .dX .u, v//p xuv D p p if suv < 0. C .dX .u, v// We get

X

suv .dX .u, v//p C C p

suv >0

X

suv .dX .u, v//p  0.

suv 0 such that for each N 2 N there exists a finite metric space X of cardinality at least N satisfying c1 .X/  c ln jXj. Proof. The metric space will be constructed as the set of equivalence classes of Fn2 , two elements are in the same equivalence class if and only if their difference belongs to C ? , where C is a “good code”, whose construction we are going to describe now. Theorem 4.28. For each n 2 N there exists a linear code C  Fn2 whose minimum distance and dimension satisfy d.C /  ın and dim.C /  n4 , respectively, where ı > 0 is an absolute constant. Proof. Such codes can be obtained using the following greedy construction: fix k  n=4 and let V be a k-dimensional subspace of Fn2 with d.V / > ın (it is clear that such subspaces exist for k D 1). Then V contains 2k points. Let us count the number of vectors x 2 Fn2 for which the linear span Vz D lin.V [ ¹xº/ does not satisfy d.Vz /  ın. To prove the theorem it suffices to show that for small enough ı and for k < n4 the number of such vectors is strictly smaller than the cardinality of Fn2 . The number of vectors satisfying the condition described in the previous paragraph can be estimated from above in the following way. We count, for each v 2 V , the number of vectors wt.v C x/  ın (observe that x C v D x  v in Fn2 ). n P x satisfying This number is `ı n l , the resulting upper estimate is ! X n k 2 . (4.36) l `ı n

We assume that ı  12 . In such a case (as is easy to check and is well known) the last term in the sum in (4.36) is the largest one. Therefore (4.36) can be estimated from above by nŠ . (4.37) 2k bınc  bıncŠ  .n  bınc/Š z for some 0  ız  ı  1 . Now we use the well-known It is clear that bınc D ın 2 Stirling formula: p nn nŠ D 2 n n  e ˛.n/ , (4.38) e where 1=.12n C 1/ < ˛.n/ < 1=.12n/.

124

Chapter 4 Obstacles for embeddability: Poincaré inequalities

We get the following formula for (4.37) p z z 2 n nn e ˛.n/ e ın e .1ı/n kz 2 ın p q z z z z .ın/ z ın z .1ız/n e ˛..1ı/n/ e n 2 ın e ˛.ın/ 2.1  z ı/n ..1  ı/n/ e ˛.n/ z p D 2k ın q z z z z z .1  ı/ z .1ı/n z .ı/ z ın e ˛.ın/ 2.1  ı/ e ˛..1ı/n/ ın   n  p 1 e ˛.n/ k z  2 ın  D q . z ız.1  ı/ z 1ız z z .ı/ z e ˛..1ı/n/ e ˛.ın/ 2.1  ı/

(4.39)

The first term in the last line looks complicated, but it is easy to see that it is bounded from above by an absolute constant K. Let us recall that our purpose is to show that if we pick ı sufficiently small and if k < n=4, then the number in (4.39) is < 2n . We can estimate the number in (4.39) from above by n

1

z

z

1

z

z

z

2log2 KC 4 C 2 log2 ıC 2 log2 nn.ı log2 ıC.1ı/ log2 .1ı// .

(4.40)

z z z z Since limı#0 z ı log2 ı C .1  ı/ log2 .1  ı/ D 0, it is clear that we can pick ı > 0 such that the exponent in (4.40) is < n. We need the following Fourier analytic consequence of the fact that C is “good”: Lemma 4.29. Assume that f : Fn2 ! L2 satisfies f .x C y/ D f .x/ for every x 2 Fn2 and y 2 C ? . Then for every nonempty A  ¹1, : : : , nº with jAj < d.C /, fy.A/ D 0. Proof. We use the notation 1A for the vector ¹xi º 2 Fn2 whose components xi are equal to 1 if and only if i 2 A. Since 1A … C and .C ? /? D C (by Lemma 4.26), we have 1A … .C ? /? , implying that there exists v 2 C ? such that h1A , vi D 1. Now, Z y f .x/WA .x/d.x/ f .A/ D Fn 2 Z D f .x C v/WA .x/d.x/ Fn 2 Z D f .x/WA .x  v/d.x/ Fn 2 Z h1A ,vi f .x/WA .x/d.x/ D .1/ Fn 2

D fy.A/. So fy.A/ D 0.

Section 4.5 Fourier analytic method of getting Poincaré inequalities

125

As we already mentioned, the metric space X which we are going to use for the proof of Theorem 4.27 is the space of equivalence classes of Fn2 . Two elements are in the same class if their difference is in C ? . Our next purpose is to prove a Poincaré inequality for L1-valued maps on Fn2 =C ?. Lemma 4.30. Let C be a linear code, and f : Fn2 =C ? ! L1 be a map. We define fz : Fn2 ! L1 by fz.x/ D f .x C C ? /. Then Z

2 X d.C / n

n Fn 2 F2

kfz.x/  fz.y/k1 d.x/d.y/ 

Z

n j D1 F2

k@j fz.x/k1 d.x/

(4.41)

Remark 4.31. The Poincaré inequality (4.41) is of somewhat unconventional form, because the integration (summation) is not over the metric space X, but over a “basic” set for which X is a quotient space. However, it is clear that one can write the inequality in terms of X by counting how many times each of the terms kfz.x/  fz.y/k1 and k@j fz.x/k1 is repeated. We prefer not to do that because it is more convenient to estimate the Poincaré ratio in terms of Fn2 (rather than Fn2 =C ?), too. Proof of Lemma 4.30. As was proved in Example 1.40 (see also Corollary 1.43), there exists a mapping T : L1 ! L2 such that for all x, y 2 L1, p kT .x/  T .y/k2 D kx  yk1. Define h : Fn2 ! L2 by h D T ı fz. We have Z kfz.x/  fz.y/k1 d.x/d.y/ n Fn F 2 2 Z D kh.x/  h.y/k22 d.x/d.y/ n Fn 2 F2

2

.Lemma 4.24/



.Lemma 4.29/



D

n Z X

y min¹jAj : A ¤ ;, h.A/ ¤ 0º j D1 Z n 2 X k@j h.x/k22 d.x/ d.C / Fn 2

2 d.C /

j D1 Z n X

n j D1 F2

Fn 2

k@j h.x/k22 d.x/

k@j fz.x/k1 d.x/.

Remark 4.32. It is worth mentioning that the way in which we used Example 1.40 in the proof of Lemma 4.30 has much wider applicability. It can be used in some other cases when we would like to get an Lp -valued Poincaré inequality from an Lq -valued Poincaré inequality.

126

Chapter 4 Obstacles for embeddability: Poincaré inequalities

Now we estimate from below the Poincaré ratio (see Definition 4.2) corresponding to the Poincaré inequality (4.41). That is, we estimate from below the ratio: R n Fn =C ? .x, y/d.x/d.y/ Fn 2 2 F2 . (4.42) R P n 1 ? .x, x C ej /d.x/ j D1 Fn Fn d.C / 2 =C 2

Here, of course, we need to specify the definition of the metric Fn2 =C ? .x, y/. The definition is: the distance between two equivalence classes X and Y is inf¹.x, y/ : x 2 X, y 2 Y º. One can check that because of the special form of equivalence classes this formula produces a metric. The reason is that for these classes one of the elements x and y could be fixed, that is, inf¹.x, y/ : x 2 x0 C C ? , y 2 y0 C C ? º D inf¹.x0, y/ : y 2 y0 C C ? º. (4.43) To estimate the denominator from above we use the trivial estimate Fn2 =C ? .x, x C ej /  1, therefore 1 X d.C / n

j D1

Z Fn 2

Fn2 =C ? .x, x C ej /d.x/ 

n . d.C /

To estimate the numerator, we use almost the same computation as in the proof of Theorem 4.28. Namely, we show that there exists a positive constant ˛ > 0 such that for each x 2 Fn2 the number of elements y 2 Fn2 , for which Fn2 =C ? .x, y/  ˛n is  2n =2. We observe that we can estimate from above the number of elements y for which Fn2 =C ? .x, y/ < ˛n in the following way. First we count the number of elements v satisfying wt.v  x/ < ˛n. Then we consider all equivalence classes containing all such v. The observation above (see (4.43)) implies that all elements y outside the union U of these equivalence classes satisfy Fn2 =C ? .x, y/  ˛n. So it remains to show that for sufficiently small ˛ the number of elements in all the equivalence classes < 2n1 . To P achieve this  goal we observe that the number of elements in U does not exceed 23n=4 `˛n n` . Observe that the sum is exactly of the same form as in (4.36) with 3n=4 instead of k and ˛ instead of ı. We proceed in the same way as in the proof of Theorem 4.28 and get the existence of the desired ˛. From here we get the following estimate for the numerator of (4.42): Z Fn2 =C ? .x, y/d.x/d.y/  ˛n=2. n Fn 2 F2

˛d.C /  ˛ı2 n . As we know 2 ? ˛ı log2 jFn 2 =C j . c1 .Fn2 =C ?/  ˛ı2 n  2

Therefore the value of (4.42) is  this implies the estimate

(see Proposition 4.3),

127

Section 4.6 Exercises

4.6 Exercises p Exercise 4.33. Prove the inequality c2 .Fn2 /  n using (4.21) with the matrix T whose entries are 8 ˆ 0. Exercise 4.36 (Poincaré inequalities for Lp -valued functions on expanders). Let 1  p < 1, x 2 R. We use the notation x p for jxjp  signx. Use the inequality (4.7) for .f .v//p and apply the following lemma (you are welcome to prove it if you wish): Lemma 4.37. For any real numbers a, b, and any p  1, we have jap  b p j  pja  bj.jajp1 C jbjp1 / to get the following version of (4.7): X v2V

p

jf .v/  M j

 p1

 cp

 X uv2E

p

jf .u/  f .v/j

 p1 ,

(4.44)

128

Chapter 4 Obstacles for embeddability: Poincaré inequalities

where c is a constant which depends only on the expanding constant and the degree of the graph, but not on p. Use inequality (4.44) to get the fact that the distortion of embeddings of a family ¹Gnº of expanders into Lp satisfies cp .Gn /  C p1 ln jV .Gn /j, where C depends only on the expanding constants and the degree of the family ¹Gn º. Exercise 4.38. If a k-regular graph G with n vertices is embedded into `d1 with distortion ˛, then d  nc=˛ , where the constant c > 0 depends on the expanding constant of the graph and k. In the Hints to exercises section we suggest an approach to Exercise 4.38.

4.7 Notes and remarks The method of Poincaré inequalities for lower distortion estimates was implicit in the pioneering paper [133] where the first estimate of this type was proved. Inequalities which could be called Poincaré inequalities appear in the papers [290] and [312] dealing with lower estimates of distortion for embeddings of expanders. The term “Poincaré inequality” in this context was systematically used in [292] and later papers. As previously stated, in Chapter 5 we present some of the known constructions of expanders. The Poincaré inequality for `2 -valued functions on expanders can be obtained using the spectral characterization of expanders (due to [10] and [125], we present the spectral characterization in Section 5.2). The use of spectral characterization for getting a Poincaré inequality for expanders is demonstrated, for example, in [314, Section 15.5]. It is well known that the L1 -Poincaré inequality can be obtained using a direct argument, see, for example, the 1993 edition of [297, Problem 11.30] (however this argument is absent in the 1979 edition). The general Lp Poincaré inequality (Theorem 4.14) is from [314]; see Proposition 15.5.2 (where the proof contains some errors, corrected in [315]) and Exercise 4 on p. 383. The idea goes back to [290], Matoušek writes [314, p. 380] that the result of Theorem 4.14 was communicated to him by Rabinovich. The formula (4.21) for Euclidean distortion first appears in [290, Corollary 3.5]. It was applied in many different situations in [291] and [292]. The material covered in Section 4.5 appears in [249]. The paper [249] contains many other interesting applications of the Fourier-analytic approach developed there. The famous Poincaré inequality (4.31) was first proven by Enflo in [133] via a geometric argument, he also proved Theorem 4.23. Using Matoušek’s extrapolation lemma for Poincaré inequalities [312], [42] (see Exercise 4.36 in this chapter and the corresponding references below), it is possible to prove that for a code C as in Theorem 4.27, for every p  1, cp .Fn2 =C ? /  c.p/d .

Section 4.8 A bit of history of coarse embeddability

129

The coding theory specific terminology used in this book corresponds to that which appears in [307], however the definition of a linear code presented in the latter is more general than that which we present. Theorem 4.28 is well known in coding theory, see [307, Chapter 1]. A proof of the Stirling formula can be found in [146, Chapter II, §9]. The quotient metrics were studied in [44],[82],[177],[325]. Newman and Rabinovich [339] have shown that the metric spaces Fn2 =C ? can be obtained as Cayley graphs of finite commutative groups with respect to sets of generators whose size is the logarithm of the size of the group. These Cayley graphs can, in turn, be regarded as unbounded degree expanders. Exercises 4.33 and 4.34 are taken from [291]. Exercise 4.35 is a result of [338]. Exercise 4.36 is a result of [312]. Exercise 4.38 is a result of [274, Proposition 4.1]. In connection with Exercise 4.35 it is worth mentioning that combining the Kuratowski theorem (mentioned in Remark 3.13), Theorem 3.14 on padded decompositions of minor excluded graphs, Theorem 3.4, and Exercise 3.36 we p get that the Euclidean distortion of any unweighted planar graph with n vertices is O. log n/. This result is originally due to Rao [389]. Exercise 4.35 shows that the estimate of Rao is sharp up to a value of the constant. In connection with Exercise 4.36 it is worth mentioning that Matoušek’s extrapolation argument (which is described in Exercise 4.36) was later used in [42, p. 691] to show that even a reasonably large part of an expander does not admit a “good” embedding into Lp . The extrapolation argument was also used in [351] to get graphtheoretical versions of some Sobolev inequalities.

4.8 A bit of history of coarse embeddability Gromov [175, Remark (b), p. 218] wrote: “There is no known geometric obstruction for uniform embeddings into infinite dimensional Banach spaces”. (By uniform embeddings here Gromov means what we call coarse embeddings.) At the time Gromov was unaware of Enflo’s work, [134], in which it was shown that there are no uniform (in the sense: uniformly continuous with the uniformly continuous inverse) embeddings of the Banach space c0 into a Hilbert space. Dranishnikov, Gong, Lafforgue, and Yu observed [127, Section 6] that the construction of [134] can be used to prove that there exist locally finite metric spaces which are not coarsely embeddable into Hilbert spaces. (Later the examples of [134] and [127] were combined into one example in [247].) But the spaces constructed in [127] do not have bounded geometry. Spaces with bounded geometry which are not coarsely embeddable into a Hilbert space were suggested by Gromov after he became acquainted with the results of [127]. Gromov’s observation was published in Gromov [178, p. 158]. His example is based on the notion of the expander. I would like to mention that the paper of Matoušek [312] contains

130

Chapter 4 Obstacles for embeddability: Poincaré inequalities

inequalities which imply immediately that expanders are not coarsely embeddable into `p for 1  p < 1. In [179] Gromov wrote the following in reference to the aforementioned citation: “There is no known . . . ”, should have been replaced with “There is a well-known example due to P. Enflo . . . ”, who constructed a sequence of finite graphs (of growing degrees) admitting no uniform embedding into the Hilbert space (see [134]).

4.9 Hints to exercises To Exercise 4.33. Show that Walsh functions form a set of eigenfunctions of T with nonnegative eigenvalues. (The positive semi-definiteness of the matrix in question can also be derived using the standard Enflo-type argument about the diagonals of the cube.) To Exercise 4.34. One of the ways of doing this is to use the formula (4.21) and the matrix ¹sij º given by 8   ˆ if i D j 2 cos2 n ˆ ˆ ˆ k. Since A1V D k1V , we have 1 D k. (b) If G is disconnected and U is a vertex set of a connected component, then the function 1U 2 `2 .V / satisfies A1U D k1U . Thus A has two linearly independent eigenvectors corresponding to eigenvalue k, and 2 D k. Conversely, suppose that G is connected. It suffices to show that each eigenvector corresponding to the eigenvalue k is a multiple of 1V . So let f 2 `2 .V / be such that Af D k  f . Let v be such that f .v/ is maximal. The condition Af D k  f means X f .u/ D kf .v/. u2V ,uv2E.G/

Since the summation is over the k-element set, our assumption that f .v/ is maximal implies that f .u/ D f .v/ for all vertices u adjacent to v. Now we can apply the same reasoning to f .u/. Thus for all vertices u contained in the same connected component as v we have f .v/ D f .u/. Since G is assumed to be connected, this implies that f is a multiple of 1V . (c) Suppose that G contains a connected component H which is a bipartite graph. Let X [ Y be the bipartition of the vertex set of H . Consider the function f D 1X  1Y 2 `2 .V /. We claim that Af D k  f . In fact, let v 2 X, then all k neighbors of v are in Y , therefore X .Af /.v/ D f .u/ D k D k  f .v/. u2X, uv2E.G/

Similar verification works for v 2 Y . Finally, it is clear that for v 2 V n.X [ Y / we have f .v/ D .Af /.v/ D 0. Conversely, suppose that k is an eigenvalue of G and f is such that Af D k f . Let v be such that jf .v/j is maximal. Then X kf .v/ D .Af /.v/ D f .u/. u2V , uv2E.G/

Since jf .v/j is maximal, this equality implies that f .u/ D f .v/ for each u adjacent to v. Thus jf .u/j is also maximal for each such u and we can use the same argument for u. Therefore we get that for each u in the connected component containing v we have either f .u/ D f .v/ or f .u/ D f .v/, and for adjacent vertices the signs are the opposite. This clearly implies that the set X of vertices u in this connected component with positive f .u/ and the set Y of vertices with negative f .u/ form a bipartition of the connected component containing v.

134

Chapter 5 Families of expanders and of graphs with large girth

The following theorem is called the spectral characterization of expanders. Theorem 5.5. A family ¹Gnº of k-regular graphs with jV .Gn /j ! 1 is a family of expanders if and only if the differences k  2 .Gn / are uniformly bounded away from zero. Definition 5.6. If G is a k-regular graph, the difference k  2 , where 2 is the second (from above) eigenvalue of the adjacency matrix of G, is called the spectral gap of G. Theorem 5.5 is an obvious consequence of the following more precise result. Theorem 5.7. For each finite k-regular graph G the inequality p k  2  h.G/  2k.k  2 / 2

(5.3)

holds. Proof. We start with the left-hand side inequality which is easier. The main ideas here are the following: (a) If U and W are two subsets of V , then hA1U , 1W i is the number of edges joining U and W . (b) Thus hA1U , 1V i D kjU j, and in an expander, hA1U , 1V nU i  h.G/jU j if jU j  jV j 2 ; and therefore hA1U , 1U i  .k  h.G//jU j. The converse is also true: If hA1U , 1U i  .k  s/jU j for all jU j  jV2 j , then h.G/  s. (c) To prove the left-hand side of (5.3) we use the fact that for each function f 2 ,f i  2.G/  k holds, and show that `2 .V / with zero mean the inequality hAf kf k2 a similar inequality holds for the quotient

hA1F ,1F i k1F k2

in the case where jF j 

jV j . 2

(d) This leads to the following idea: we decompose 1F into an orthogonal sum of a multiple of 1V and a vector with zero average, and use this decomposition to estimate hA1F , 1F i from above. This approach has a chance to lead to the desired conclusion since in the case where jF j  jV2 j the first term of the decomposition (that is, multiple of 1V ) is not dominating. To get the stated formula we just compute. We have 1F D

jF j 1V C f , jV j

where f has zero mean. For the norms we have    jF j 2 jF j2 2  1 , k1F k2 D jF j,  D  jV j V  jV j 2

135

Section 5.2 Spectral characterization of expanders

therefore kf k22 D jF j 

jF j2 . jV j

Since 1V is an eigenvector of A corresponding to the eigenvalue k and f is an orthogonal sum of eigenvectors corresponding to eigenvalues  2 .G/, we get 

jF j jF j 1V , 1V C hAf , f i hA1F , 1F i D A jV j jV j   jF j2 jF j2 k C 2.G/ jF j  jV j jV j 2 jF j D .k  2.G// C 2 .G/jF j. jV j Of course, somewhere in the proof we have to use jF j  jV2 j . It is time to do so now. We continue the chain of inequalities above in the following way  .k  2.G//

jF j k C 2.G/ C 2.G/jF j D jF j. 2 2

By the comment in item (b), we get h.G/  k  kC 22 .G/ D k 22 .G/ . Proof of the right-hand side inequality in (5.3) is more difficult. Let g 2 `2 .V / be a mean zero (real-valued) eigenvector of A associated with the eigenvalue 2 .G/. Let V C D ¹x 2 V : g.x/ > 0º and let g C and g  be functions defined by g C .x/ D max¹g.x/, 0º and g  .x/ D min¹g.x/, 0º. We may assume without loss of generality that jV Cj  jV j=2: we replace g by g otherwise. We claim that hAg C , g Ci  2 hg C , g Ci.

(5.4)

In fact, Ag D 2g, therefore Ag C C Ag  D 2g C C 2 g  and hAg C , g Ci C hAg  , g Ci D 2 hg C , g Ci. It remains to observe that hAg  , g C i  0. The inequality (5.4) can be rewritten as h.kI  A/g C , g Ci  .k  2/hg C , g C i.

(5.5)

This rewriting is useful because it turns out that both sides of (5.5) admit suitable estimates in terms of the following discrete analogue of the gradient.

136

Chapter 5 Families of expanders and of graphs with large girth

We introduce the space P `2 .E/ of real-valued functions on the edge set E D E.G/ with the norm k k D . e2E j .e/j2 /1=2 . So `2 .E/ is a Hilbert space with the inner product X h 1 , 2i D 1 .e/ 2 .e/. e2E

We also pick an orientation of the graph G, that is, for each edge e assign one of its ends to be a head of e and the other to be a tail of e. We denote the head of e by e C and the tail of e by e . Now, for each function f 2 `2 .V / we define its gradient as a function rf 2 `2 .E/ given by (5.6) rf .e/ D f .e C/  f .e  /. The following formula is very useful (it can be obtained by writing detailed representations of both sides): h.kI  A/f , f i D krf k2 .

(5.7)

This relation is similar to the relation of the Laplace operator and the gradient in the classical analysis, and some of the results below were derived by using this analogy with the classical case. To get an estimate of the right-hand side of (5.5) in terms of the gradient we use the following inequality proved in Chapter 4. Lemma (Lemma 4.8). Let G D .V , E/ be a connected graph with the expanding constant h, and f : V ! R be a real-valued function on V . Then X

jf .v/  M j 

v2V

1 X jf .u/  f .v/j, h

(5.8)

uv2E

where M is a median of the set ¹f .v/ºv2V . We apply (5.8) to the function f .v/ D .g C .v//2 . Observe that the assumption jV C j  jV j=2 implies that 0 is a median of this function. We get kg Ck2 

1 h.G/

X uv2E.G/

j.g C .u//2  .g C.v//2 j.

(5.9)

137

Section 5.3 Kazhdan’s property (T) and expanders

Now we estimate the right-hand side of (5.9) in terms of r.g C /. We have X X j.g C .u//2  .g C .v//2 j D jg C.e C /  g C.e  /j  jg C .e C/ C g C.e /j e2E

uv2E.G/



X

 12  X  12 C C C  2 jg .e /  g .e /j  jg .e / C g .e /j C

e2E C

 kr.g /k 

C

C

X

 2

e2E

1 2 2.jg .e /j C jg .e /j / C

C 2

C

 2

e2E

C

 kr.g /k 

p

2k  kg Ck.

Combining this inequality with (5.9) we get h.G/kg Ck 

p 2k kr.g C/k.

(5.10)

On the other hand, combining (5.5) and (5.7) we get kr.g C /k 

p k  2 kg Ck.

(5.11)

Combining (5.10) and (5.11) we get the right-hand side inequality of (5.3). Remark 5.8. 1. The lower bound in (5.3) is tight. One of the examples is the Hamming cube Fk2 . Its eigenvalues are k, k  2, : : : , k (see Exercise 5.82 at the end of this chapter), so k  2 D 2. On the other hand it is not difficult to show that h.Fk2 / D 1, and the bound is achieved on the set A of all vertices for which the first coordinate is 0. 2. The upper bound in (5.3) is close to being tight. Consider the cycle Cn with even n. It is easy to check that h.Cn / D n4 . On the other hand k  2 in this case is   2  2 cos 2

n (see Exercise 5.81 at the end of this chapter). Therefore p 4 . 2k.k  2/ n

5.3 Kazhdan’s property (T) and expanders In this section we are going to use some results on group representations to construct some classes of expanders. The expanders which we construct here are Cayley graphs of finite groups. We already introduced the corresponding definition (see page 3). However, it will be convenient to introduce it again in a slightly different form. Definition 5.9. Let F be a group, and let S be a finite subset of F . We assume that S satisfies the condition .s 2 S/ , .s 1 2 S/ and does not contain the identity. We call such a subset symmetric. The right-invariant Cayley graph Cay.F , S/ is defined to be the graph with vertex set F and edge set defined by the condition: uv is an edge

138

Chapter 5 Families of expanders and of graphs with large girth

if and only if u D sv for some s 2 S. The left-invariant Cayley graph (we do not introduce a special notation for it, reasons are explained below) is defined to be the graph with vertex set F and edge set defined by the condition: uv is an edge if and only if u D vs for some s 2 S. Remark 5.10. The term right-invariant Cayley graph is chosen because the multiplication of all group elements by a fixed element g on the right is an isomorphism of Cay.F , S/. (The reason for the choice of the term left-invariant Cayley graph is similar.) The right-invariant Cayley graph and the left-invariant Cayley graph of the same group with the same set S are isomorphic, the map f 7! f 1 is one of the isomorphisms. This is the reason why we are going to study only right-invariant Cayley graphs and call them just Cayley graphs. It is also clear that for commutative groups there is no difference between the right-invariant Cayley graph and the left-invariant Cayley graph. Observation 5.11. 1. It is easy to see that the graph Cay.F , S/ is jSj-regular. 2. The graph Cay.F , S/ is connected if and only if S is a generating set (that is, if and only if each element of F can be written as a finite product of elements of S). Example 5.12. The Hamming cube (see pages 118 and 181) can be regarded as the Cayley graph of the group Fn2 with respect to the set S D ¹.1, 0, : : : , 0/, .0, 1, : : : , 0/, : : : , .0, 0, : : : , 1/º. Observe that each element of S is an inverse of itself in the group Fn2 . Expansion of Cayley graphs can be characterized in terms of set-products in groups. If A and B are two subsets in a group F with operation , their product is defined by A  B D ¹a  b : a 2 A, b 2 Bº. Proposition 5.13. Let ¹Fn º1 nD1 be a family of groups of growing sizes, and ¹Snº be symmetric subsets Sn  Fn such that the cardinality jSn j is the same for each n. The graphs ¹Cay.Fn , Sn /º1 nD1 form a family of expanders if and only if there is a constant c > 0 independent of n such that jAn [ .Sn  An /j  .1 C c/jAn j

(5.12)

for all subsets An  Fn with jAn j  jFn j=2. Proof. Observe that Sn  An is the set of vertices which are adjacent to vertices of An in Cay.Fn , An /. The inequality (5.12) implies that j.Sn  An /nAn j  cjAn j. Therefore j@An j  cjAn j, and the expanding constant of Cay.Fn , An / is at least c. In the other direction. Let k be the cardinality of Sn (for each n). Suppose that j@An j  hjAn j for some h > 0 and each subset An  Fn with jAn j  jFn j=2.

Section 5.3 Kazhdan’s property (T) and expanders

139

Observe that this implies that the number of elements in Fn which are adjacent to elements of An , but are not in An is at least j@Ak n j . The set of all such elements and the set An are disjoint and both of them are contained in the union An [ .Sn  An /. Hence   h j@An j  1C jAn j. jAn [ .Sn  An /j  jAn j C k k Examples of Cayley graphs which are expanders can be easily constructed from any infinite finitely generated group satisfying the two conditions, one of which we are going to discuss now (we discuss the second condition and complete the construction in Section 5.4). We start by recalling some standard definitions. Definition 5.14. Let H be a complex Hilbert space with inner product h, i and T be a bounded linear operator in H. The set L.H/ of all such operators forms an algebra. It has an identity, which is the identity operator I of H. We define an involution T 7! T  on L.H/ by 8x, y 2 H hT  x, yi D hx, T yi. An operator U : H ! H is called unitary if U U  D U  U D I or, equivalently, if hUx, Uyi D hx, yi for all x, y 2 H and if U is onto. All unitary operators in L.H/ form a group (with respect to the product of operators) which is called the unitary group, denoted U.H/. A unitary representation of a group G on a Hilbert space H is a group homomorphism of G into U.H/. Remark 5.15. Studying unitary representations of topological groups it is natural and customary to introduce certain continuity conditions (strong continuity) as a part of the definition of unitary representation. Now we introduce one of the most important definitions of the theory of infinitedimensional representations of groups, restricted to the case of finitely generated groups because it is the only case which we are going to consider. Definition 5.16. Let G be a finitely generated group and  : G ! U.H/ be a unitary representation of G on a Hilbert space H. If " > 0 and K is a finite subset of G, then a norm-one vector x in H is called an .", K/-invariant vector if k.g/x  xk < " for all g 2 K. An invariant vector of the representation  is a vector x 2 H such that .g/x D x for each g 2 G. We say that G has Kazhdan’s property (T), or just property (T) if (and only if) there exists an " > 0 and a finite subset K of G such that every unitary representation having an .", K/-invariant vector has a nonzero invariant vector. We also introduce the following numerical parameter which is defined for all groups (and can be introduced for infinite subsets K as well).

140

Chapter 5 Families of expanders and of graphs with large girth

Definition 5.17. Let  : G ! U.H/ be a unitary representation of a finitely generated group G, and let K be a finite subset of G. The Kazhdan constant K.G, K, / is defined as the supremum of all " > 0 for which sup k.g/x  xk  "kxk g2K

for all x 2 H. The Kazhdan constant K.G, K/ is defined as inf K.G, K, /, where  ranges over all unitary representations of G with no nontrivial invariant vectors. Remark 5.18. 1. It is easy to see that a finitely generated group has Kazhdan’s property (T) if and only if K.G, K/ > 0 for at least one finite set K  G. 2. The constant K.G, K, / vanishes whenever the representation contains nontrivial invariant vectors (such representations exist for all groups, for example the representation which maps all elements of the group onto the identity operator). 3. It is easy to show that each finitely generated group has a unitary representation with no nontrivial invariant vectors: for finite groups see Observation 5.21, for infinite groups one of the examples is the analogue of the left regular representation introduced in Definition 5.20. Remark 5.19. If kxk D 1, the identity k.g/.x/  xk2 D 2  2Reh.g/.x/, xi holds. Using this identity we can restate the definition of the Kazhdan constant in the following way: p p inf sup 2  1  Reh.g/.x/, xi, (5.13) K.G, K/ D inf

x2H,kxkD1 g2K

where the infimum is over unitary representations of G with no nontrivial invariant vectors. At first glance it could be unclear how property (T) of groups and the Kazhdan constant are related to expanders. But if we recall the spectral characterization of expanders (Theorem 5.5) and compare it with (5.13), we see some connection because K.G, K/ > 0 implies that there is ˛ D ˛.K.G, K// > 0 such that for each unitary representation  without nontrivial invariant vectors and for each normalized vector x 2 H there is g 2 K such that Reh.g/.x/, xi  1  ˛.

(5.14)

Inequality (5.14) is already somewhat similar to the inequality which we would like to get: jhAy, yij  k  ",

Section 5.3 Kazhdan’s property (T) and expanders

141

where A is the adjacency matrix and k is the vertex degree, and y is a normalized vector with zero mean. Soon we shall see (Observation 5.21) that the subspace of vectors with zero mean has the representation-theoretical property which we need. It remains to see how .g/ can be related to A. For this we need to recall some basic notions of finite-dimensional representations of finite groups. At this point we need one of the most straightforward constructions of a representation. Definition 5.20. Let F be a finite group. The right regular representation of F is the representation of F on the space `2 .F / (in this context considered as a space over C) given by .g/.f .x// D f .xg/. The left regular representation of F is the representation of F on the space `2 .F / given by .g/.f .x// D f .g 1 x/. Observation 5.21. The restriction of a (right or left) regular representation to the subspace consisting of functions with zero mean does not have invariant vectors. In fact, one can see that all of the coordinates of an invariant vector of a left (right) regular representation have to be equal, and a vector with equal coordinates has zero mean if and only if it is 0. Now we are ready to prove the following crucial observation: Theorem 5.22. The spectral gap of the graph Cay.F , S/ can be estimated from below in terms of a function of K.F , S/, and this function is strictly positive for strictly positive K.F , S/. Proof. We start with the following observation: the adjacency matrix .Au,v /u,v2F of Cay.F , S/ can be written as a sum of matrices of operators of the left regular representation. In fact, the adjacency matrix of Cay.F , S/ is the sum of matrices M.g/ D .Mu,v .g//u,v2F corresponding to g 2 S, where M.g/ (defined for any g 2 F ) is the matrix given by ´ 1 if uv 1 D g Mu,v .g/ D 0 otherwise. It is also easy to check that M.g/ is the matrix of .g 1 /. Therefore the quantity hAx, xi, where A is the adjacency matrix of Cay.F , S/ can be written as

X  M.g/x, x . g2S

We need to estimate supx jhAx, xij where the supremum is over x 2 `2 .F / with norm one and zero average. Since A is self-adjoint, we have sup jhAx, xij D sup RehAx, xi. x

x

142

Chapter 5 Families of expanders and of graphs with large girth

This leads to the desired connection with the Kazhdan constant:

X  X M.g/x, x  sup RehM.g/x, xi  jSj  ˛.K.F , S//. sup Re x

g2S

g2S

x

This theorem shows that in order to construct a sequence of expanders we need to find a family of groups Fn with jFn j ! 1 and generating subsets Sn  Fn with jSn j D const such that the Kazhdan constants K.Fn , Sn / are bounded away from zero (that is, there exists c > 0 such that K.Fn , Sn /  c for all n). Remark 5.23. Proof of Theorem 5.22 gives a bit more, namely it shows that the se1 quence ¹Cay.Fn , Sn /º1 nD1 is a family of expanders, if the family of groups ¹Fn ºnD1 with jFn j ! 1 and generating subsets Sn  Fn with jSn j D const are such that the Kazhdan constants K.Fn , Sn , n/ are bounded away from zero (that is, there exists c > 0 such that K.Fn , Sn , n /  c for all n), where n is the restriction of the left regular representation of Fn to the space of vectors with zero average. By now many examples of such sequences ¹Fn , Sn º are known, for example, it is known that the Kazhdan constants are bounded away from zero if Fn is the alternating group on n elements and Sn is a suitably chosen subset in Fn (with jSn j D const ). Historically the first examples of this type were obtained by considering infinite finitely generated groups with property (T) which have finite quotients of growing sizes providing the desired examples. Our next section is devoted to this construction.

5.4 Groups with property (T) One of the ways to complete the construction of expanders presented in the previous section is to find an infinite finitely generated group G with property (T) having finite quotient groups ¹Gi º1 i D1 with indefinitely increasing cardinalities jGi j and such that the restrictions of the quotient maps 'i : G ! Gi to one of the finite sets K  G satisfying K.G, K/ > 0 are injective. Proposition 5.24. The graphs ¹Cay.Gi , 'i .K//º1 i D1 form a family of expanders. Proof. Since the restrictions of 'i to K are injective, all of the obtained graphs are jKj-regular. To estimate the spectral gap of the obtained graphs we denote by i the restriction of the left regular representation of Gi to the space of vectors with zero average and observe that Definition 5.17 implies K.G, K/  K.G, K, i ı 'i / D K.Gi , 'i .K/, i /. Application of Remark 5.23 completes the proof.

Section 5.4 Groups with property (T)

143

Remark 5.25. The injectivity of the restrictions of 'i to K can be dropped if we agree to consider expanders with loops and multiple edges. Some of the well-known groups have property (T). Possibly the most well-known groups which have property (T) are the groups SLn.Z/, n  3. Definition 5.26. The special linear group SLn.Z/ is the group of all n  n matrices with integer coefficients and determinant equal to 1. Remark 5.27. The same definition is used to define SLn.R/ for any commutative ring R with unit.

5.4.1 Finite generation of SLn .Z/ Proposition 5.28. The group SLn .Z/ is finitely generated for each n and is an infinite group if n  2. Proof. The second statement is trivial. The fact that SLn .Z/ is finitely generated with rather weak and depending on n bound on the number of generators is an easy consequence of the basic techniques of linear algebra known as Gaussian elimination. This bound can be obtained in the following way. Let A 2 SLn .Z/. We start by observing that the greatest common divisor of entries of the first column of A is equal to 1 (otherwise the determinant cannot be equal to one). Therefore, by Euclid’s algorithm for getting the greatest common divisor, subtracting rows one from another, and adding rows, we can get a matrix A1 whose first column has exactly one nonzero entry, and the entry is ˙1. Observe that the addition of row #j to row #i in a matrix A is equivalent to multiplication of the matrix A from the left by one of the so-called elementary matrices of a special form. Definition 5.29. An n  n matrix is called elementary if all of its diagonal entries are equal to 1, and it has exactly one nonzero non-diagonal entry equal to t . We denote such a matrix with .i, j /-entry equal to t by Eij .t /. The addition of a row mentioned before Definition 5.29 is equivalent to multiplication on the left by Eij .1/, the subtraction of row #j from row #i in a matrix A is equivalent to multiplication of the matrix A from the left by Eij .1/. Suppose that the ˙1 in the first column of A1 is the .i1 , 1/-entry. It is easy to see that the greatest common divisor of all entries of the second column except .i1 , 2/-entry is equal to 1. Repeating the procedure of multiplication on the left by matrices Eij .1/ and Eij .1/, we get a matrix A2 whose first column is the same as the first column of A1 and whose second column has at most two nonzero entries. One of them is the

144

Chapter 5 Families of expanders and of graphs with large girth

.i1 , 2/-entry, which can be zero. The other is an .i2 , 2/-entry, i2 ¤ i1 , which is equal to ˙1. Now we can subtract/add (repeatedly, if needed) the i2t h row from/to the i1t h row and make the .i1 , 2/-entry equal to 0. We continue in an obvious manner (in the third step we use the fact that the greatest common divisor of all entries of the third column except the .i1 , 3/-entry and .i2 , 3/entry is equal to 1). At the end of this computation we get a signed permutation matrix. Definition 5.30. A one-to-one mapping of the set ¹1, : : : , nº onto itself P is called a permutation. A (n by n) permutation matrix is a matrix of the form niD1 ei , .i /, where  : ¹1, : : : , nº ! ¹1, : : : , nº is a permutation and eij is the matrix with only one nonzero entry, which is an .i, j /-entry, and which is equal to 1. By a signed permutation matrix we mean a matrix which is obtained from a permutation matrix if we replace some (may be all) of its nonzero entries (which are equal to 1) by 1. This completes the proof of the theorem. We have proved that each A 2 SLn .Z/ is a product of elementary matrices of the form Eij .˙1/ (it is important that all of these matrices are in SLn.Z/) and some signed permutation matrix (which should be SLn.Z/ since A 2 SLn.Z/ and all elementary matrices are in SLn .Z/). Therefore the set consisting of (1) all matrices of the form Eij .˙1/, and (2) all signed permutation matrices with determinant 1; is a finite generating set of SLn.Z/. Remark 5.31. The groups SLn.Z/ have much smaller generating sets, see references in Notes and Remarks. See also Exercise 5.91.

5.4.2 Finite quotients of SLn .Z/ Finite quotients of SLn .Z/ satisfying the conditions mentioned at the beginning of Section 5.4 are obtained by reduction modulo p. This means the following. We consider the fields Zp of residues modulo p, where p is a prime number (the field Z2 can be identified with the field F2 which we introduced at the beginning of Section 4.5; in some papers notation Fp is used for the fields Zp ). After that we introduce groups SLn.Zp / as groups of matrices with entries in Zp with determinants equal to 1 (considered as an element of Zp ). It is easy to see that the natural map SLn.Z/ 7! SLn.Zp /, which maps a matrix ¹aij º to the matrix ¹aij .mod p/º is a surjective homomorphism. This homomorphism is injective on a finite generating set unless the set contains two matrices which are the same if considered mod p. It never happens if p  3 and the entries of the matrices are 0 and ˙1 (observe that it is the case for all sets generating SLn.Z/, which we have considered so far).

145

Section 5.4 Groups with property (T)

5.4.3 Property (T) for groups SLn .Z/ Theorem 5.32 (without proof). SLn.Z/, n  3, has property (T). The group SL3.Z/ is historically the first example of a group with property (T). Property (T) for this and many other groups is proved in the following way. The definition of property (T) is generalized to the case of locally compact groups. The difference between the definition of property (T) for locally compact groups and Definition 5.16 is that we consider a compact subset K instead of finite subset K. In the next step property (T) is established for SLn.R/ (or continuous analogues of some other groups). And finally there is a general result showing that in many cases the property (T) passes from locally compact groups to their discrete subgroups. Detailed presentation of this approach requires substantial amount of material from representation theory, in Notes and Remarks we provide references to sources containing the proof of the fact that SLn .Z/, n  3, has property (T) on these lines. It is worth mentioning that SL2.Z/ does not have property (T). There is another, in a certain sense, more algebraic and direct proof of the fact that SLn.Z/ has property (T). In this approach the property (T) is established for SLn.Z/ directly, without using SLn .R/. In Notes and Remarks we refer to a source containing a nice account of this approach.

5.4.4 Criterion for property (T) All proofs mentioned in the previous section are rather complicated. Their complexity contrasts, to a certain extent, with the existence of a relatively simple sufficient criterion for property (T), which we are going to present now. Let G be a group generated by a finite symmetric set S (we remind that according to our definition of a symmetric set, S does not contain the identity of G). We define a finite graph Z whose vertex set is S and whose edge set E.Z/ is the set of all pairs uv satisfying the conditions u, v, u1 v 2 S. We assume that S is chosen in such a way that the graph Z is connected. It is easy to see that a symmetric generating set S for which Z is connected can be chosen in each finitely generated group. For a vertex u in Z we denote by deg u its degree in Z. Let `2 .S, deg/ be a weighted Hilbert space of real-valued functions on S. The Hilbert norm of a function f : S ! R in this space is given by X .f .u//2 deg u. kf k D u2S

Let be a discrete Laplace operator on `2 .S, deg/ given by . f /.u/ D f .u/ 

1 deg u

X v, uv2E.Z/

f .v/

146

Chapter 5 Families of expanders and of graphs with large girth

The operator is a nonnegative, self-adjoint operator on `2 .S, deg/. If Z is connected then zero is a simple eigenvalue of . Let 1.Z/ be the smallest nonzero eigenvalue of considered as an operator on `2 .S, deg/. Remark 5.33. In the case where Z is a k-regular graph, the smallest eigenvalue of the discrete Laplace operator is closely related to the spectral gap (introduced in Definition 5.6). In fact, one can check that in such a case DI 

1 A, k

1.Z/ D

k  2.Z/ . k

˙ Theorem 5.34 (Zuk’s criterion). Let G be a group generated by a finite symmetric subset S. If the graph Z is connected and 1.Z/  12 then G has Kazhdan’s property (T). Moreover   1 2 . K.G, S/  p 2  1.Z/ 3 This criterion leads to many new examples of groups with property (T). Unfortunately, all explicit examples known to me of groups to which this criterion is directly applicable are rather complicated (even to describe). The criterion can be used to show that some classes of random groups have property (T), but such examples are not interesting in the context of explicit constructions of expander families. (The proof of Theorem 5.34 is relatively simple, we do not present it because we are not going to apply Theorem 5.34.) There exist more complicated criteria which generalize and strengthen Theorem 5.34 (improve the value of Kazhdan’s constant), and which are applicable to relatively simple examples of groups, but their study is beyond the scope of this book.

5.5 Zigzag products Our purpose in this section is to present a combinatorial construction of families of expanders based on the so-called zigzag product of graphs. The construction starts with a slightly different definition of a spectral gap than the one introduced in Definition 5.6 (page 134). P Recall that a nonnegative matrix A D .aij /ni,j D1 is called stochastic if jnD1 aij D 1 for each i 2 ¹1, : : : , nº. We are going to study symmetric (that is, satisfying aij D aj i ) stochastic matrices. A typical example which is of interest for us in the present context is the normalized adjacency matrix of a k-regular graph, where “normalized” means that each entry is divided by k. We are going to introduce (Definitions 5.35 and 5.38) and study quantities closely related to the spectral gap introduced in Definition 5.6. After that we show (Theorem 5.42) that these quantities behave nicely with respect to the combinatorial construction called zigzag product and thus lead to a construction of families of expanders.

147

Section 5.5 Zigzag products

Definition 5.35. Let A be an n  n symmetric stochastic matrix, and let 1 D 1 .A/ 

2.A/     n.A/  1 be its eigenvalues. Let .A/ :D max2i n j i .A/j. The quantity 1  .A/ is called the absolute spectral gap of A. Remark 5.36. Comparing Definitions 5.35 and 5.6 and recalling the spectral characterization of expanders (Theorem 5.5) we see that any family of k-regular graphs with indefinitely growing numbers of vertices and with absolute spectral gaps of their normalized adjacency matrices bounded away from zero is a family of expanders. Proposition 5.37. The reciprocal of the absolute spectral gap of A is the smallest constant C > 0 such that for all x1 , : : : , xn , y1 , : : : , yn 2 L2 we have n n n n C X X 1 XX 2 kxi  yj k2  aij kxi  yj k22. n2 n i D1 j D1

(5.15)

i D1 j D1

Proof. It is clear that it suffices to prove (5.15) in the case where x1 , : : : , xn , y1, : : : , yn are scalars. We consider ¹xi ºniD1 and ¹yi ºniD1 as vectors of `n2 . Subtracting from P each of yj their average, and adding this average to each of xi we may assume that jnD1 yj D 0. With this assumption we have: n n n n 1 XX 2 1 XX 2 .x  y / D .xi C yj2  2xi yj / i j n2 n2 i D1 j D1 i D1 j D1   n X n X 1 D 2 nkxk2 C nkyk2  2 xi yj n

 1 D kxk2 C kyk2 . n

i D1 j D1

As for the right-hand side, we get n n  C   C X X  aij xi2 C aij yj2  2aij xi yj D kxk2 C kyk2  2hAy, xi . n n i D1 j D1

After multiplying by n the inequality (5.15) becomes   kxk2 C kyk2  C.kxk2 C kyk2  2hAy, xi/, or

  1 kxk2 C kyk2 . hAy, xi  1  C 2

148

Chapter 5 Families of expanders and of graphs with large girth

Now it remains to observe that the smallest constant C for which kxk2 C kyk2 2 holds, is .A/. In fact, since A is a symmetric stochastic matrix, theP eigenvalue 1 corresponds to the vector with all coordinates equal to 1, the condition jnD1 yj D 0 implies that y is orthogonal to this vector. Therefore jhAy, xij  .A/kykkxk 

.A/.kxk2 C kyk2/=2. The equality is attained because we may choose ¹xi ºniD1 and ¹yi ºniD1 in such a way that kxk D kyk and y is an eigenvector of A whose respective eigenvalue has absolute value .A/. 8x, y 2 `n2

hAy, xi  C

Definition 5.38. Let X be an arbitrary set. A kernel on X is a symmetric function K : X  X ! Œ0, 1/. The reciprocal absolute spectral gap of A with respect to the kernel K, denoted C.A, K/, is defined as the infimum of numbers C  0 satisfying the condition: for all x1, : : : , xn , y1 , : : : , yn 2 X the inequality n n n n 1 XX C XX K.x , y /  aij K.xi , yj / i j n2 n i D1 j D1

(5.16)

i D1 j D1

holds. Remark 5.39. Observe that the quantity C defined in Proposition 5.37 is the reciprocal absolute spectral gap of A with respect to the kernel K : L2  L2 ! Œ0, 1/ given by K.x, y/ D kx  yk22. We introduce the very general Definition 5.38 because one of the important combinatorial results which we are going to prove (Theorem 5.42) holds in this generality. In what follows we deal with regular graphs, which in this section are allowed to have loops and multiple edges. (Recall that two or more edges are called parallel edges or multiple edges if they have the same pair of end vertices.) We allow loops and parallel edges because in the constructions presented below it is too difficult to get rid of them, on the other hand loops and multiple edges do not cause any problems for the most important applications of expanders. We use the convention that each loop contributes 1 to the degree of a vertex (this convention agrees with the algebraic results on adjacency matrices which we are going to use, but does not agree with the well-known identity: the sum of all degrees = twice the number of edges). Definition 5.40. The normalized adjacency matrix of a (not necessarily simple) kregular graph G D .V , E/, denoted AG , is the matrix whose rows and columns are labeled by elements of V , and whose uv-entry is the number of edges joining u and v (u, v 2 V ) divided by k. When discussing reciprocal absolute spectral gaps we will interchangeably identify G with AG . Thus, for example, we define C .G, K/ as C.AG , K/.

149

Section 5.5 Zigzag products

The quantities C .G, K/ behave nicely under the zigzag product of graphs. This product is defined only for pairs of graphs satisfying a certain matching condition. Definition 5.41. Let G1 D .V1, E1 / be an n1 -vertex graph which is d1 -regular and let G2 D .V2 , E2 / be a d1 -vertex graph which is d2 -regular. Since the number of vertices in G2 is the same as the degree in G1, we can identify V2 with the edges emanating from a given vertex u 2 V1. Formally, we fix for every u 2 V1 a bijection u : ¹e 2 E1 : u is incident with eº ! V2. We also fix for every a 2 V2 a bijection between Œd2  :D ¹1, : : : , d2º and the multiset of the vertices adjacent to a in G2 (multiset is a set in which elements are allowed to have positive integer multiplicities, we have to consider multisets here because we allow multiple edges). We denote this bijection by z G2 is the graph whose a : Œd2  ! ¹b 2 V2 : ¹a, bº 2 E2 º. The zigzag product G1  vertex set is V1  V2 and two vertices .u, a/, .v, b/ 2 V1  V2 are joined by an edge if and only if there exist i, j 2 Œd2  such that: uv 2 E1 ,

a D  u .uv/ .i/, and b D  v .uv/ .j /.

(5.17)

z G2 as a We can view this construction as follows: think of the vertex set of G1  disjoint union of “clouds” which are copies of V2 D ¹1, : : : , d1 º indexed by V1. Thus .u, a/ is the point indexed by a in the cloud labeled by u. Every edge ¹.u, a/, .v, b/º z G2 can be regarded as the result of a three-step walk: of G1  

A “zig” step in G2 from a vertex a belonging to the cloud of the vertex u 2 V1 to u .¹u, vº/ in the same cloud (for this to be defined a and u .¹u, vº/ should be adjacent in G2 ).



A “zag” step in G1 from u’s cloud to v’s cloud along the edge ¹u, vº.



A final “zig” step in G2 , in the cloud of the vertex v, from v .¹u, vº/ to b.

z G2 is a d22-regular graph with n1 d1 vertices. The zigzag product deThe graph G1  pends on the choice of the bijections ¹uºu2V1 , and in fact different bijections (which are also called labelings) of the same graphs can produce non-isomorphic zigzag products. (The choice of bijections (labelings) ¹a ºa2V2 does not affect the structure of the zigzag product but these bijections are useful in the subsequent analysis.) However, all of our results below will be independent of the actual choice of the labelings, so while our notation should formally depend on the labelings, for the sake of simplicity, we do not mention the choice of labelings in our notation. z G2 , K/ is related to C .G1 , K/ and Let us now examine how the constant C.G1  C.G2 , K/, where K : X  X ! Œ0, 1/ is an arbitrary kernel. Theorem 5.42 (Sub-multiplicativity of the reciprocal absolute spectral gap). Let G1, G1 D .V1 , E1 /, be an n1 -vertex graph which is d1-regular and let G2 D .V2, E2 / be a d1 -vertex graph which is d2 -regular. Then for every kernel K : X  X ! Œ0, 1/, z G2 , K/  C .G1 , K/  C .G2 , K/2 . C.G1 

(5.18)

150

Chapter 5 Families of expanders and of graphs with large girth

Proof. Take any f , g : V1 V2 ! X and note that the definition of C .G1 , K/ implies that for any pair a, b 2 V2 we have: 1 X C .G1 , K/ X K.f .u, a/, g.v, b//  K.f .u, a/, g.v, b//. (5.19) n1 d1 n21 u,v2V .u,v/2E1

1

We would like to explain the notation used in the right-hand side. It is adjusted to the matrix form of the definition of the reciprocal absolute spectral gap. In this notation we mean that for each edge ¹u, vº which is not a loop there are two elements in E1 : .u, v/ and .v, u/, and so the sum in the right-hand side contains two terms: K.f .u, a/, g.v, b// and K.f .v, a/, g.u, b//. P For a loop ¹u, uº there is one element .u, u/ in E1 . Division by d1 makes the sum d1 .u,v/2E1 K.f .u, a/, g.v, b// the same 1 P as the sum u,v2V1 au,v K.f .u, a/, g.v, b//, where au,v are entries of the normalized adjacency matrix of G1 . Using (5.19) we get the following inequality X 1 K.f .u, a/, g.v, b// 2 jV1  V2 j .u,a/,.v,b/2V1 V2

D

1 X 1 X K.f .u, a/, g.v, b// d12 n21 u,v2V a,b2V2



C .G1 , K/ X n1 d13

1

X

(5.20)

K.f .u, a/, g.v, b//.

a,b2V2 .u,v/2E1

Now we show that the definition of C .G2 , K/ implies that for all u 2 V1 and b 2 V2 we have 1 X X K.f .u, a/, g.v, b// (5.21) d12 a2V v2V : 2

1

.u,v/2E1



C .G2 , K/ d1 d2

X

X

  K f .u,  u .¹u,vº/ .i//, g.v, b/ .

v2V1 : i 2Œd2  .u,v/2E1

To see that this inequality follows from the definition of C .G2 , K/ we observe the following: 

For fixed u, f .u, a/ is an X-valued function of a.



For fixed u and b, and for v satisfying .u, v/ 2 E1 , we may regard g.v, b/ as a function of u .¹u, vº/, which is a vertex in the u’s cloud; and each vertex in the u’s cloud is of this form.



Hence the sum in the left-hand side can be regarded as a sum over all pairs of points of the u’s cloud.

151

Section 5.5 Zigzag products 

Therefore we may use the definition of C.G2 , K/ to estimate from above the sum in the left-hand side.



After using the definition of C .G2 , K/ we get the sum shown in the right-hand side because (as we have already mentioned) the term g.v, b/ can be regarded as a value of an X-valued function at u .¹u, vº/, and thus the sum X X   K f .u,  u .¹u,vº/ .i//, g.v, b/ v2V1 : i 2Œd2  .u,v/2E1

is actually the sum over all pairs .u .¹u, vº/, neighbor of u.¹u, vº//, that is, over the set of edges of G2 . Summing (5.21) over u 2 V1 and b 2 V2 and plugging it into (5.20), yields the bound: 1 jV1  V2 j2 

X

K.f .u, a/, g.v, b//

.u,a/,.v,b/2V1 V2

C .G1 , K/ 1 X X X  2 n1 d1 d1 u2V a2V b2V2



1

2

X v2V1 : .u,v/2E1

C .G1 , K/ X X C .G2, K/ n1 d1 d1 d2 b2V2 u2V1

D

C .G1, K/ C.G2 , K/ X X n1 d12 d2 v2V 1

i2Œd2 

X

K.f .u, a/, g.v, b// X

  (5.22) K f .u,  u .¹u,vº/ .i//, g.v, b/

v2V1 : i2Œd2  .u,v/2E1

X

X

  K f .u,  u .¹u,vº/ .i//, g.v, b/ .

u2V1 : b2V2 .u,v/2E1

Another application of the definition of C .G2 , K/ implies that for all v 2 V1 and i 2 Œd2  we have: 1 d12

X

X

  K f .u,  u .¹u,vº/ .i//, g.v, b/

u2V1 : b2V2 .u,v/2E1



C.G2 , K/ d1 d2

X

X

  K f .u,  u .¹u,vº/ .i//, g.v,  v .¹u,vº/ .j // .

u2V1 : j 2Œd2  .u,v/2E1

(5.23) Here we use the following reasoning: for fixed v, g.v, b/ might be considered as a function of b. On the other hand, for fixed i the function f .u,  u .¹u,vº/ .i// might be considered as a function of v .¹u, vº/. Hence we get a function on the Cartesian product of v’s cloud with itself. Now we use the fact that ¹ v .¹u,vº/ .j /ºj 2Œd2 is just the list of neighbors of v .¹u, vº/.

152

Chapter 5 Families of expanders and of graphs with large girth

Summing (5.23) over v 2 V1 and i 2 Œd2 , and combining the resulting inequality with (5.22) yields the desired inequality: 1 jV1  V2j2 

X

K.f .u, a/, g.v, b//

.u,a/,.v,b/2V1 V2

C.G1 , K/ C .G2 , K/ X X n1 d12 d2 v2V 1



i 2Œd2 

X

X

  K f .u,  u .¹u,vº/ .i//, g.v, b/

u2V1 : b2V2 .u,v/2E1

C.G1 , K/ C .G2 , K/2 n1 d1 d 2 X X 2X X    K f .u,  u .¹u,vº/ .i//, g.v,  v .¹u,vº/ .j // v2V1 i 2Œd2  u2V1 : j 2Œd2  .u,v/2E1

D

C .G1 , K/ C.G2 , K/2 n1 d1d22

X ¹.u,a/,.v,b/º2E.G1

K .f .u, a/, g.v, b/// .  z G2 /

Combining Theorem 5.42 and Proposition 5.37 we get Corollary 5.43. In the special case X D R and K.x, y/ D .x  y/2, the inequality (5.18) becomes: 1 1 1  .  z G2 / 1  .G1/ .1  .G2//2 1  .G1 

(5.24)

Now we use Corollary 5.43 to construct expander families. Definition 5.44. For a graph G D .V , E/ and for t 2 N, let G t be the graph with the vertex set V in which an edge between u, v 2 V is drawn for every walk in G of length t which starts at u and terminates at v. We call G t the t -th power of the graph G. For our next observation it is important that loops contribute 1 to the degree of a vertex. Observation 5.45. It is easy to see that AG t D .AG /t , therefore .G t / D . .G//t . Also it is easy to see that G t is d t -regular if G is d -regular. Theorem 5.46. The zigzag product construction leads to a construction of families of expanders if there is a regular graph G satisfying the following conditions: (a) The number n0 of its vertices and its degree d0 satisfy n0 D d02t0 , where t0 2 N is a number satisfying the following condition.   (b) There exists a 2 .0, 1/ satisfying 1  .1  a/t0 .1  .G//2  a.

153

Section 5.5 Zigzag products

Proof. In fact, these conditions allow to construct expanders inductively as follows: (a) We start with G1 D G 2. (We square G because Definition 5.41 implies that all further graphs in our construction are d02 -regular, so we need G1 also to be d02 regular.) z G. This graph is well-defined because Git0 is d02t0 -regular (b) We let G2 D G1t0  (=n0 -regular) and G has n0 vertices. By Definition 5.41, G2 is d02-regular and has n20 -vertices. (c) We repeat this construction in all of the further steps, namely, we let Gi C1 D z G for i  2. It is easy to verify that each Gn has ni0 vertices and is d02 Git0  regular. As for the absolute spectral gaps, combining inequality (5.24) and Observation 5.45 we get (5.25) 1  .Gi C1/  .1  . .Gi //t0 /.1  .G//2. Now we use induction to prove that absolute spectral gaps of all graphs Gi are at least a. The spectral gap of G1 is 1  .G 2 / D 1  . .G//2  .1  .G//2 > a. If we assume that the spectral gap of Gi is at least a, for the spectral gap of Gi C1 we get 1  .Gi C1/  .1  . .Gi //t0 /.1  .G//2  .1  .1  a/t0 /.1  .G//2  a. Definition 5.47. A graph G satisfying the assumptions of Theorem 5.46 is called a base graph. Our next purpose is to present one of the known constructions of a base graph. Let r , s 2 N and q D 2s . Let Fq be the field with q elements and let FrqC1 be the .r C 1/-dimensional vector space over this field. We introduce a q 2 -regular graph G.q, r / whose vertex set is FrqC1. The edge set of this graph is described as follows. For each a D .a0 , a1 , : : : , ar / 2 FrqC1 the set of all vertices adjacent to a is the set of vertices of the form a C y  .1, x, x 2 , : : : , x r /, where .x, y/ is any pair of elements of Fq . Theorem 5.48. For any q D 2s and any r < q the obtained graph satisfies

2.G.q, r //  qr . Proof. The vector space FrqC1 has n D 2s.r C1/ elements. Let M be the nn adjacency matrix of the graph G.q, r /. We need to recall the easy fact that Fq might be considered as a vector space over the field F2, and so there exists a surjective and linear over F2 map L : Fq ! F2.

154

Chapter 5 Families of expanders and of graphs with large girth

Let us describe eigenvectors of M considered as an operator on Rn . Both the eigenvectors and their entries will be labeled by elements of FrqC1. So, for every a D vector whose b-th entry, where b D .b0 , : : : , br / 2 .a0 , : : : , ar / 2 Fr C1, let va be the Pr Fr C1, satisfies va .b/ D .1/L. iD0 ai bi / . Here .1/` , where ` 2 F2 is understood as .1/0 D 1, .1/1 D 1. Let us show that the vectors ¹va ºa2FrC1 are pairwise orthogonal. In fact, X

hva , vc i D

q

Pr

.1/L.

b2FrC1 q

X

D

iD0 ai bi /

Pr

.1/L.

Pr

.1/L.

iD0 .ai Cci /bi /

iD0 ci bi /

(5.26) .

rC1

b2Fq

P P is clear that if ai D ci for all i, then L. riD0.ai C ci /bi / D L. riD0 ai bi C PIt r s.r C1/ . i D0 ai bi / D 0 since L is linear over F2 . Therefore hva , va i D 2 Now we consider the case where there is j 2 ¹0, : : : , r º such that aj ¤ cj . We can rewrite the last line in (5.26) as Pr X X .1/L..aj Ccj /bj / .1/L. iD0,i¤j .ai Cci /bi / . b2Frq

bj 2Fq

The first sum in this product is equal to 0 because the mapping L is surjective and thus is equal to 0 on half of the elements of Fq and is equal to 1 on the other half. On the other hand, since cj C aj ¤ 0 each element of Fq has a unique representation in the form .aj C cj /bj . Let us show that ¹va ºa2FrC1 are eigenvectors of M . Observe that entries of M are q

naturally labeled by pairs of elements of FrqC1. We have .M va /.b/ D

X

Mbc va .c/ D

X

va .b C y.1, x, : : : , x r //

x,y2Fq

rC1

c2Fq

D

 X

 va .y, yx, : : : , yx /  va .b/, r

x,y2Fq

where we used the observation: va .b C c/ D va .b/  va .c/ for any b, c 2 FrqC1. Therefore va is an eigenvector, and the corresponding eigenvalue is equal to X va .y, yx, : : : , yx r /.

a D x,y2Fq

We have

a D

X

.1/L.a0 yCa1yxCCar yx

x,y2F

r/

D

X

.1/L.ypa .x// ,

x,y2F

155

Section 5.6 Graphs with large girth: basic definitions

where pa .x/ D

a D

Pr

i D0 ai x

X

i

. Therefore .1/L.ypa .x// C

x,y2Fq , pa .x/D0

X

.1/L.ypa .x//.

x,y2Fq , pa .x/¤0

From here it is clear that if a D .0, 0, : : : , 0/, then a D q 2 . Now consider the case where a ¤ .0, 0, : : : , 0/. In this case there are at most r elements x 2 Fq such that pa .x/ D 0 and therefore the first integer P sum is a positive L.yz/ D 0 for  qr . As for the second sum, it is equal to zero because y2Fq .1/ each z ¤ 0. (This can be proved using the same argument which we used above: Since L is surjective it takes value 0 on exactly half of the elements of Fq ; since z ¤ 0, y 7! y  z is a bijection of Fq .) This completes the proof of the theorem. It remains to check that the family of graphs whose spectral gaps are estimated in Theorem 5.48 contains a suitable base graph. The graphs constructed in Theorem 5.48 have q r C1 vertices and are q 2 -regular. So the relation n0 D d02t0 of Theorem 5.46(a) is satisfied for some t0 if and only if r C 1 is divisible by 4, that is, r C 1 D 4t0, so q r C1 D q 4t0 D .q 2 /2t0 . Observe that by Theorem 5.48, .AG /, where AG is the normalized adjacency matrix of G.q, r / (see Definitions 5.35 and 5.40), is equal qr . Therefore the condition (b) of Theorem 5.46 becomes: there exists a 2 .0, 1/ satisfying    rC1 r 2  a. 1 1  .1  a/ 4 q This condition is satisfied for an arbitrary a 2 .0, 1/ if we pick q and r in a suitable way. One of the ways is the following: 

First we pick r to be large enough so that

p rC1 1  .1  a/ 4  a.



Then we pick q large enough so that   r 2 p  a. 1 q

5.6 Graphs with large girth: basic definitions Definition 5.49. The girth of a graph is the length of a shortest cycle in it. (We define girth for graphs having some cycles only.) We denote the girth of a graph G by g.G/. We would like to mention that a loop (an edge joining a vertex to itself) is considered as a cycle of length 1, and that a pair of parallel edges (that is, edges with the same

156

Chapter 5 Families of expanders and of graphs with large girth

pair of end vertices) are considered as forming a cycle of length 2. Thus, a graph has girth at least 3 if and only if it is simple. As with expanders, talking about graphs with large girth we usually mean families ¹Gnº1 nD1 of graphs with g.Gn / ! 1. The simplest such family is a family of cycles with growing length. For metric theory this family is not very interesting because of its simplicity (only precise isometric results could be difficult). Our main interest is the following types of families of graphs with large girth: (a) k-regular, k  3, families ¹Gn º1 nD1 of graphs with g.Gn / ! 1. (b) Families ¹Gn º1 nD1 of graphs with g.Gn / ! 1 and all degrees  3. (c) Families ¹Gn º1 nD1 of graphs with g.Gn / ! 1 and average degrees of Gn at least 2 C " for some " > 0. As we shall see in Chapter 8 such families do not admit uniformly bilipschitz embeddings into a Hilbert space and even into any uniformly convex Banach space. However, as we shall see in this chapter, some such families admit uniformly bilipschitz embeddings into `1 . There are still several important intriguing open problems about families of graphs with growing girth, see Section 11.3 and Problem 11.4 in Chapter 11. The described types of families of graphs with large girth are such that it is far from being clear that such families exist, especially families of type (a). However, it is easy to find k-regular graph of girths 3 and 4: The complete graph KkC1 is a k-regular graph of girth 3 and the Hamming cube Fk2 (see the definition in Exercise 5.82 on page 181) is a k-regular graph of girth 4. As we shall see these simple examples could serve as a basis for construction of k-regular graphs with arbitrarily large girth. In further sections of this chapter we present some of the known constructions of families of graphs with growing girth. Important parameters of such constructions are: order (defined as the number of vertices) and diameter of the graphs. One of the goals of some of the constructions is to minimize the order (or diameter) for a given girth.

5.7 Graph lift constructions and `1-embeddable graphs with large girth Very simple constructions of k-regular graphs with arbitrarily large girth are the constructions based on the notion of a lift of a graph, also known as graph covering. We prefer the term graph lift because the term graph covering is also used for many other purposes. z of a graph G D .V .G/, E.G// is a Definition 5.50. Let L be a finite set. A lift G z D V .G/  L. The edge set G z is the union of matchings graph with vertex set V .G/

Section 5.7 Graph lift constructions and `1-embeddable graphs with large girth

157

corresponding to edges of E.G/. The matching corresponding to an edge uv matches all vertices of ¹uº  L with vertices of ¹vº  L. Observe that to be more formal we need to consider edges uv as directed edges and to specify to each of such edges a permutation of L, and the permutation corresponding to the direction for which v is the head and u is the tail is the inverse of the permutation corresponding to the direction for which u is the head and v is the tail. Definition 5.50 immediately implies that there are well-defined projections z ! E.G/ and V .G/ z ! V .G/ : E.G/ edges of the matching corresponding to uv are projected onto uv and vertices of ¹uº  L are projected onto u. We denote both of the projections by . It is clear from the z whose projection in G is u are the same definition that the degrees of all vertices in G as the degree of u. In particular, any lift of a k-regular graph is k-regular. Remark 5.51. It is easy to see that for each walk ¹ei ºniD1 in G and each vertex u z2 z of the form u V .G/ z D .u, `/ with ` 2 L and u being the initial vertex of the walk z for which .z ei ºniD1 in G ei / D ei ¹ei ºniD1; there is a uniquely determined lifted walk ¹z and u z is the initial vertex. It is important for us that if G is connected, this remark z we consider a connected component of it. remains true if instead of G Remark 5.52. It is clear that if a walk in G has an edge e which is backtracked (that is, the walk contains two consecutive edges e), then the corresponding edge in the lifted walk is also backtracked. z to G cannot be such that Remark 5.52 implies that the projection of a cycle in G its edges induce in G a subgraph having vertices of degree 1. In particular, the graph z contains cycles in G. This immeinduced by edges of the projection of a cycle in G z  g.G/. diately implies g.G/ Remark 5.52 can be used to find relatively simple graph-lift-based constructions of k-regular graphs k  3 with arbitrarily large girth. Example 5.53. We start with an arbitrary simple k-regular graph G (example of such a graph: KkC1). We consider L D ¹0, 1ºE.G/, that is, L is the set of all ¹0, 1º-valued functions on the edge set of G. The permutation of L corresponding to e 2 E.G/ maps each function f on E.G/ to the function h, which has the same values as f everywhere except the edge e, and on the edge e its value is the other one. (Recall that we consider ¹0, 1º-valued functions. Observe that in this construction we do not have a need to consider directed edges: the change of “level” in L is the same in both directions.) z is  2g.G/. Claim 5.54. The girth of the obtained graph lift G

158

Chapter 5 Families of expanders and of graphs with large girth

z does not have to be connected (and usuRemark 5.55. (1) The obtained graph lift G ally is disconnected). This is not a problem: if we need a connected k-regular graph z of girth  2g.G/, we consider a connected component of G. z (2) One can check that actually g.G/ D 2g.G/. z The comment after Remark Proof of Claim 5.54. Consider a shortest cycle C in G. 5.52 implies that its projection .C / to G contains a cycle C0 in G. This cycle has length  g.G/. We consider C as a closed walk: we initialize and terminate it at an arbitrary vertex of C . Then .C / is a closed walk in G containing each edge of C0 . It z is also a closed is clear that the lift of the walk .C / is C . Since the lift of .C / to G walk, each of the edges of C0 should be contained in .C / an even number of times (otherwise, the walk would end on a different “level”, see the description of Example 5.53), so at least twice. Hence the length of C is  2g.G/. A similar graph lift construction can be used to find an example of a sequence ¹Gnº of connected k-regular graphs with g.Gn / ! 1 and uniformly bounded `1 distortions. Before proceeding to this construction we need to mention the following observation on minimum-diameter k-regular graphs of girth  g (we know, by Example 5.53, that k-regular graphs of girth  g exist for all values of k and g). Proposition 5.56. Let H be a k-regular graph with girth  g and such that H has the minimal amount of vertices among all such graphs. Then the diameter of H is  g. Proof. Suppose the diameter of H is > g, and let u and v be two vertices in H with dH .u, v/  g C 1. Let U D ¹u1 , : : : , uk º and V D ¹v1, : : : , vk º be the sets of all vertices adjacent to u and v, respectively. We delete u and v as well as all edges incident to u and v from H , and denote the obtained graph D. Next we add edges y. u1 v1 , : : : uk vk to D and denote the obtained graph H Observe that dH .ui , vj /  g  1 8i, j 2 ¹1, : : : , kº. (5.27) y is a simple k-regular graph. It is clear that H y has fewer vertices than H . Therefore H y /  g, we get a contradiction and complete the proof of the Thus if we prove that g.H proposition. To achieve this goal we need to show that dHy nu

i vi

.ui , vi /  g  1,

y from which we deleted the edge ui vi . To show this we y nui vi is the graph H where H y nui vi should contain assume the contrary. Then, by (5.27), the shortest ui vi -path in H one of the edges uj vj .i ¤ j /. Let a ui w1 -path be the beginning of this path until the first edge of it which is not in D and let a w2 vi -path be the end of this path after the last edge of it which is not in D. Now we observe that for i ¤ j both dD .ui , uj /  g  2 and dD .vi , vj /  g  2 (we

Section 5.7 Graph lift constructions and `1-embeddable graphs with large girth

159

use g.H /  g and the fact that we deleted paths of length 2 joining ui with uj , and vi with vj ). Together with (5.27) this implies that the length of the ui vi -path which we consider is at least 2.g  2/ C 1 D 2g  3 which is at least g if g  3. Now we turn to a construction of a family of graphs with indefinitely growing girths and uniformly bounded `1 -distortions. It is worth mentioning that it will be proved in Chapter 8 that families of k-regular, k  3 graphs with indefinitely growing girths do not admit uniformly bilipschitz embeddings into a uniformly convex Banach space. z nº1 of k-regular graphs Theorem 5.57. For each k  3 there exists a family ¹G nD1 zn / ! 1 and uniformly bounded `1 -distortions. with g.G (Our notation hints that the desired graphs will be obtained using the graph lifts.) Remark 5.58. There exist many parallels between families of graphs of two types: families of expanders and families of k-regular graphs of indefinitely growing girth. We shall see in Chapter 8 that both types of families do not admit uniformly bilipschitz embeddings into any (fixed) uniformly convex Banach space. There are many other parallels which one can see in many parts of this book. These parallels even led to a hope (which seems to never be mentioned in print) that each family of k-regular graphs with indefinitely growing girths contain some family of expanders in some weak but meaningful sense, and so this parallel extends forever. Comparing Theorem 5.57 with Theorem 4.9 we see that such hopes are not justified. Proof. We start with a sequence of k-regular connected graphs ¹Gn º with indefinitely increasing girths g.Gn /, such that g.Gn /  ˛ diam.Gn /

(5.28)

for some constant ˛ > 0. We know, by Example 5.53 and Proposition 5.56, that such families exist for ˛ D 1. We apply the graph lift construction to the graphs ¹Gn º. The fact that we get kregular graphs with indefinitely increasing girths follows immediately from the observations which we made immediately after the definition of a graph lift (see page 156). It remains to specify lifts for which there are suitable estimates for `1 distortions of connected components of the obtained graphs. The bounds for the distortions which we get are in terms of the constant ˛ in (5.28). E.G/ For each G 2 ¹Gnº1 , so each nD1 we do the following. Let L be the set ¹0, 1º element of L can be regarded as a ¹0, 1º-valued function on E.G/. For each uv 2 E.G/ we need to specify a perfect matching of ¹uº  L and ¹vº  L. To specify the perfect matching it suffices, for each edge in E, to pick a bijection of the set L. We do this as follows. The bijection corresponding to e 2 E.G/ maps each function f on E.G/ to the function h, which has the same values as f everywhere except the edge e, and on the edge e its value is the other one. (Recall that we consider ¹0, 1º-valued

160

Chapter 5 Families of expanders and of graphs with large girth

functions. Observe that in this construction we do not have a need to consider directed edges: the change of “level” in L is the same in both directions.) yn º. Observe that the We denote the graphs obtained from ¹Gn º using such lifts by ¹G y graphs ¹Gnº are disconnected. To see this consider two vertices: .v, f1/ and .v, f2 /, where v 2 V .Gn /, f1 , f2 2 ¹0, 1ºE.Gn/ , f1 ¤ f2 . If .v, f1 / and .v, f2 / are connected y n by a path P , it is clear that the projection .P / of this path is a closed walk in in G G which contains each of the edges corresponding to different values of f1 and f2 an odd number of times. Each of the other edges contained in .P / is contained there an even number of times. This implies that degrees of all vertices in the subgraph of Gn spanned by edges at which f1 and f2 have different values are even (we give a detailed proof of a more general statement in Lemma 5.61 below). Therefore .v, f1 / yn if the condition of the last sentence does not hold. and .v, f2 / are not connected in G For this reason (`1 -distortions are well-defined for connected graphs only) we pick z n . It is easy to see that a conyn a connected component which we denote G in each of G nected component of a k-regular graph with girth g is a k-regular graph with girth  g. zn º satisfy the conditions of Theorem 5.57. We are going to show that the graphs ¹G The main steps in our proof are presented as lemmas, where G is one of the ¹Gn º. Recall that an edge cut in a graph is any set of all edges joining two complementary nonempty sets of vertices in the graph. z for which Lemma 5.59. For each edge e 2 E.G/ the set of all edges ez 2 E.G/ z .z e / D e forms an edge cut in G. Proof. Recall that G is assumed to be connected. Combining this with Remark 5.51 z : .z we get that for each e 2 E.G/ and each v 2 V .G/ the sets ¹z e 2 E.G/ e / D eº z and ¹z v 2 V .G/ : .z v / D vº are nonempty. Now let e 2 E.G/. We can just describe the sets separated by the set of edges ¹z e2 z : .z z and .V .G/Ae,1 /\V .G/ z E.G/ e / D eº: they are the sets .V .G/Ae,0 /\V .G/ E.G/ where Ae,0 and Ae,1 are the sets of functions in ¹0, 1º whose values on e are equal to 0 and 1, respectively. z and The only thing which is not obvious is that the sets .V .G/  Ae,0 / \ V .G/ z z .V .G/  Ae,1 / \ V .G/ are nonempty. Let e D uv and let vz 2 V .G/ be such that z or .V .G/Ae,1/\V .G/. z .z v / D v. Then vz is in one of the sets .V .G/Ae,0 /\V .G/ z starting at vz. The other end of this walk is in the other We lift the one-edge-walk e to G z and .V .G/  Ae,1 / \ V .G/. z set of the pair .V .G/  Ae,0 / \ V .G/ Let `1 .E.G// be the space of real-valued functions on E.G/ with its `1 -norm. By the `1 -norm of a function H : E.G/ ! R we mean the norm kH k1 D

X e2E.G/

jH.e/j.

Section 5.7 Graph lift constructions and `1-embeddable graphs with large girth

161

It is easy to see that the space `1 .E.G// is isometric to a subspace of `1 and therefore z into `1 .E.G// with to prove Theorem 5.57 it suffices to find an embedding F of V .G/ distortion bounded from above by a universal constant. z satisfying .z For each edge cut R.e/ defined by the set of edges ez in G e / D e, we call one of the sides of the cut R.e/ the 0-side, and the other side the 1-side, and define z ! `1 .E.G// by a function F : V .G/ ´ 1 if x is in the 1-side of R.e/ .F .x//.e/ D 0 if x is in the 0-side of R.e/. The Lipschitz constant of this embedding is 1. In fact, the cuts R.e/ are disjoint and z is in exactly one of the cuts. Therefore kF .x/  F .y/k1 D 1 if x and each edge of G z y are adjacent vertices of G. z denote by To estimate the Lipschitz constant of F 1 we consider x, y 2 V .G/, z dGz .x, y/ their distance in G, and observe the following: z then d z .x, y/  length.P / and kF .x/ Observation 5.60. If P is an xy-walk in G, G F .y/k1 is the number of edges in the walk .P / which are repeated in the walk an odd number of times. Let us denote by D.P / the number of edges repeated in .P / an odd number of times. Observation 5.60 shows that to complete the proof of Theorem 5.57 it suffices, z to find an xy-walk P in G z for which for each x, y 2 V .G/, length.P /  ˇD.P /

(5.29)

for some absolute constant ˇ. This is our next goal. z where u, v 2 V .G/ and Let x D .u, f / and y D .v, g/ be two vertices of G, E.G/ . Let S  E.G/ be the subset on which the functions f and g differ f , g 2 ¹0, 1º (recall that we consider f and g as ¹0, 1º-valued functions on E.G/). Denote by H the subgraph of G induced by edges of S. Lemma 5.61. If u ¤ v, then degrees of all vertices of H , except u and v, are even. If u D v, then degrees of all vertices of H are even. z is connected, so there is an xy-walk Q in G. z Proof. We use the assumption that G We claim that the walk .Q/ has to contain each of the edges of S an odd number of times and each of the other edges an even number of times (possibly 0). To see this we recall our construction of the lift of G and our definition of a lifted walk (see Remark 5.51). It is obvious that Q is a lifted walk of .Q/. Our definitions are such that the change in the L-coordinate in each step (when we walk along the lifted walk) is made in exactly one value of the corresponding ¹0, 1º-valued function on E.G/, the choice of this coordinate depends only on the -projection of the edge which we are passing, and not on the direction in which we pass it, or on the L-coordinate

162

Chapter 5 Families of expanders and of graphs with large girth

of the vertex we are at (this is a very important property of the graph lift which we consider). Also, we need an obvious observation that if we change some value of a ¹0, 1º-valued function twice, it returns to its original value. We denote the subgraph of G induced by edges of .Q/ by I.Q/ (.Q/ and I.Q/ are slightly different objects: .Q/ is a sequence of edges in which some edges can be repeated, I.Q/ is the subgraph of G induced by edges which are listed in .Q/ at least once). Now we introduce a non-simple graph N.Q/ having I.Q/ as its underlying simple graph and having as many parallel edges for each edge of I.Q/, as many times the edge is repeated in .Q/. It is clear that the graph N.Q/ contains an Euler trail which is a uv-walk (to get it, we can just follow the walk .Q/, each time using a different parallel edge for edges repeated in .Q/). Therefore, if u ¤ v degrees of all vertices of N.Q/, except u and v, are even. If u D v degrees of all vertices of N.Q/ are even. It is clear that if we delete from N.Q/ an even number of parallel edges, this property continues to hold. In particular, it continues to hold if we delete all edges parallel to edges repeated in .Q/ an even number of times, and leave one copy of each edge repeated in .Q/ an odd number of times. It is clear that what we get after this deletion is the graph H . To complete the proof of Theorem 5.57 it remains to construct an xy-walk P satisfying (5.29). We use the graph H introduced above. Lemma 5.61 implies that all components of H , except possibly the one that contains u and v, contain cycles. Therefore each of them has at least g.G/ edges. It is also clear that the component containing u and v (if it exists) has an Euler trail whose initial vertex is u and whose terminal vertex is v; and all other components have Euler tours (that is, closed Euler trails). Let H1, : : : , H t be the components of H . We assume that H1 contains u and v. Observe that in the case where u D v such a component does not have to exist. In this case we introduce H1 as a trivial component containing one vertex u D v. Observe that we may assume that this trivial component contains an Euler trail joining u and v, it is just a trail with no edges. Observe that any two components of H can be joined by a path in G of length  diam.G/. Let M1 , : : : , M t 1 be paths of length  diam.G/ each, such that (a) Mi joins Hi and Hi C1. (b) The terminal vertex of Mi coincides with the initial vertex of Mi C1 . Now we form the following uv-walk M in G: 

It starts at u and follows an Euler trail of H1 to the initial vertex of M1 .



It follows M1 to H2.



It follows the Euler tour of H2.

Section 5.7 Graph lift constructions and `1-embeddable graphs with large girth

163



It follows M2 to H3.



It continues in an obvious way to H t .



It follows the Euler tour of H t .



It follows M t 1 back to H t 1 .



It follows M t 2 back to H t 2 .



It continues in an obvious way until it reaches H1 .



It follows the final part of the Euler trail of H1 (the initial part of that Euler trail was followed in the first step) and completes it at v.

We lift the walk M taking x D .u, f / as the initial vertex of the lifted walk. Denote the obtained walk by P . It is clear that all edges of S are used in M an odd number of times and that all other edges are used in M an even number of times. Therefore the lifted walk P has y D .v, g/ as its terminal vertex (to see this we use observations made in the proof of Lemma 5.61). So P is an xy-walk. Also it is clear that D.P / D jSj. Counting the number of edges in P , we get that its length does not exceed 2.t  1/diam.G/ C jE.H1 /j C jE.H2 /j C    C jE.H t /j D 2.t  1/diam.G/ C jSj. (5.30) In the case where t D 1, we use the right-hand side of (5.30) and get length.P /  D.P /, so (5.29) holds with ˇ D 1. In the case where t > 1 we use the left-hand side of (5.30) to get t X length.P /  jE.H1 /j C .jE.Hi /j C 2 diam.G//. i D2

After that we combine the fact that jE.Hi /j  g.G/ for i  2 with the assumption that g.G/  ˛ diam.G/, and get   t X 2 jE.Hi /j 1 C length.P /  jE.H1 /j C ˛ i D2   2 jSj  1C ˛   2 D 1C D.P /. ˛   Thus (5.29) holds with ˇ D ˛2 C 1 . This completes the proof of Theorem 5.57.

164

Chapter 5 Families of expanders and of graphs with large girth

5.8 Probabilistic proof of existence of expanders Historically the first proof of existence of expanders was probabilistic. Until now the simplest known proofs of existence of expanders are probabilistic. We present a version of such a proof which uses graph lifts as a tool for achieving k-regularity. For simplicity, we prove the result only for sufficiently large k, although a similar proof (with more complicated details) works for each k  3. In Notes and Remarks we describe other probabilistic proofs and provide references. The expander family which we construct is a sequence of random lifts of the graph KkC1 (the complete graph with k C 1 vertices). The number of edges in KkC1 is k.kC1/ , we denote this number by d . 2 For convenience of the reader we give a detailed definition of a random lift. Let Ln D ¹1, 2, 3, : : : , nº, we use the standard notation Sn for the set of all permutations of Ln. We consider the following probability space. Let

Ÿ

n,k D Sn  Sn      Sn  Sn , d times 1 with the probability measure Pn,k defined by Pn,k .!/ D .nŠ/ d for each d -tuple ! of permutations, so ! D ¹ e ºe2E.KkC1 / , where e 2 Sn (we label permutation by edges of KkC1). For each ! 2 n,k we introduce the corresponding graph Gn,k .!/ in the following way. The vertex set of Gn,k .!/ is the Cartesian product V .KkC1/  Ln (it is the same for all !). The edge set of Gn,k .!/ is defined as follows. We pick an orientation of the graph KkC1, that is, for each edge e assign one of its ends to be a head of e and the other to be a tail of e. We denote the head of e by e C and the tail of e by e . The edge set E.Gn,k .!// contains n edges per each edge of KkC1. They are the edges joining .e  , m/ with .e C, e .m// for each m 2 ¹1, : : : , nº. It is easy to check that for each d -tuple ! 2 n,k of permutations we get a k-regular graph Gn,k .!/, just because KkC1 is k-regular. The following theorem immediately implies that the constructed collection of graphs contains families of expanders.

Theorem 5.62. There exist k 2 N and  > 0 such that lim Pn,k ¹! : h.Gn,k .!//  º D 1, n!1

where h.Gn,k .!// is the expanding constant. Proof. We need to show that for sufficiently large k and n the graphs Gn,k .!/ have expanding constants   for the vast majority of ! 2 n,k . This means that for such jV .Gn,k .!//j satisfies ! each A  V .Gn,k .!// such that jAj  2 (5.31) j@Gn,k .!/.A/j  jAj, where we indicate the graph with respect to which we take the boundary in the subscript.

Section 5.8 Probabilistic proof of existence of expanders

165

To achieve this goal we estimate the probability of the complement of this event, that is the probability of the set of those ! for which there is A  V .Gn,k .!// satisjV .Gn,k .!//j which violates condition (5.31). fying jAj  2 Following the notation introduced in Section 5.7 we denote, for each v 2 V .KkC1/, the set ¹vº  Ln  V .Gn,k .!// by  1 .v/. Among the intersections A \  1 .v/ there is one of maximal cardinality, we denote it by S and let s 2 V .KkC1/ be the vertex for which S D A \  1.s/. There is also an intersection of the least cardinality, we use the notation T D A \  1.t / for it. It is clear that jAj  jT j. jSj  kC1 We estimate j@Gn,k .!/ .A/j from below in the following way. Observe that there are k edge-disjoint paths joining s and t in KkC1. According to Remark 5.51 (page 157), for each ! and each vertex .s, m/ 2 S, each such path can be lifted to the graph Gn,k .!/ starting from .s, m/. If such a lifted path ends outside T , then it contains an edge in @Gn,k .!/.A/. Note also that the lifted paths are edge-disjoint, because the original paths are edge-disjoint. Hence the cardinality of @Gn,k .!/.A/ can be estimated from below by the number of lifted paths which, originating in S, terminate outside T . If the inequality (5.31) does not hold, then the number of such paths is < jAj and thus there exists a set T 0 2  1 .t / of cardinality < jAj such that all paths obtained after lifting the k edge-disjoint paths joining s and t originating at vertices of S terminate in Tz D T [ T 0 . So for each set A and graph Gn,k .!/ satisfying j@Gn,k .!/ .A/j < jAj there exist sets S and Tz such that all lifted to Gn,k .!/ paths mentioned above which originate at S terminate in Tz and jAj jAj jAj , jTz j < C jAj D .1 C .k C 1//. jSj  kC1 kC1 kC1 Observe that we consider A satisfying jAj  .kC1/n and that the condition above 2 will still be satisfied if we delete some elements of the set S. Also, for simplicity, let us restrict our attention to even n. It is easy to see that to complete the proof of the theorem it suffices to show that for sufficiently large k we can choose  in such a way that the probability Pn,k of the following event goes to 0 as n ! 1. (a) There exist s, t 2 V .KkC1/, s ¤ t , and S   1 .s/ and Tz   1.t / satisfying jSj  n2 , jTz j  .1 C .k C 1//jSj and such that lifts to Gn,k .!/ starting at vertices of S of all k edge-disjoint paths from s to t in KkC1 terminate in Tz . This observation allows us to proceed in the following way. We make the following choice: after the selection of k, the value of  is determined by 1 1 D  2 kC1 (the number 12 could be replaced by any fixed number in the interval .0, 1/, but we stick with the choice of 12 ). We consider all possible pairs of subsets S and Tz , satisfying

166

Chapter 5 Families of expanders and of graphs with large girth

S   1.s/, Tz   1 .t /, s, t 2 V .KkC1/, s ¤ t , satisfying jSj  n2 , jTz j D b 32 jSjc and estimate from above the probability Pn,k of the event that lifts to Gn,k .!/ starting at vertices of S of all k edge-disjoint paths from s to t in KkC1 terminate in Tz . If we add the probabilities of all such events, over all such choices of S and Tz , we get an upper estimate for the probability of the event described above. Our next observation is crucial. It is the fact that the probability that one of the paths originating at s and ending at t will have all of its lifts starting at S terminate in Tz is the same as the probability of a permutation of Ln to map the set of Ln -coordinates of S into set of Ln-coordinates of Tz . Using this and the fact that the choices of such permutations for different paths are independent, we get the following upper estimate for the probability associated with S and Tz : R !k nr 

,

r

where r D jSj and R D jTz j. Therefore the probability of the event described in (a) can be estimated from above by k.k C 1/

X 1r  n2

n r

!

n R

! R !k nr 

,

(5.32)

r

where R D b 32 r c. Now we observe that R  n implies R      r R Rr C1 R r n D     . n nr C1 n r As for the remaining part of the product, we use the Stirling formula (which we reminded the reader of on page 123) and the inequality R  2r , to get an estimate ! !   ! n n nr nRCr nRCr e RCr 2r nR   r R  . r R r R rŠ RŠ RRCr r R e e

Now we estimate the probability in (5.32) by k.k C 1/

X nRCr e RCr 2r  R r k n RRCr n

1r  2

 k.k C 1/

X 1r  n2

 k.k C 1/

X 1r  n2

 r kRr R n  k3 !r R 4 . e  n

e 4r

(5.33)

Section 5.9 Size and diameter of graphs with large girth: basic facts

To get the inequality in the last line we used the inequalities we pick k in such a way that

R n

167

 1 and R  2r . Now

 k3 3 < 1. e  4 4

(5.34)

We fix such a value of k, denote the corresponding number in the left-hand side of 1 (5.34) by ˛, and let  D 2.kC1/ (as we agreed). Observe that each of the terms in the sum in the last line of (5.33) does not exceed ˛ r because R  b 32  n2 c  34 n. It remains only to show that now, if we let n ! 1, the sum in (5.33) goes to 0. This is a standard argument: since P the rr-th term in the sum (5.33) does not exceed k.k C 1/˛ r , the series k.k C 1/ 1 r D1 ˛ is convergent, and the r -th term in the sum (5.33) approaches 0 as n ! 1, by the Lebesgue dominated convergence theorem, the sum in (5.33) approaches 0 as n ! 1.

5.9 Size and diameter of graphs with large girth: basic facts In this section we consider not necessarily regular graphs with large girth. For a graph G we denote the minimum degree of vertices of G by ı.G/. Denote by n.g, ı/ the minimum number of vertices in a graph of girth  g and minimum degree  ı. As we have already mentioned, we are interested in the case where ı > 2. Theorem 5.63.

8 ˆ .ı  1/.g1/=2  1 ˆ .ı  1/N , and girth at least g. Therefore the average degree of this graph is > 2.ı  1/. A useful observation is: If we delete from G1 all vertices whose degree is  .ı  1/, the average degree of the obtained graph will still be > 2.ı 1/. In fact, denote the number of deleted vertices by D, so the number of deleted edges is  .ı  1/D. The average degree of the obtained graph is 2

M  .ı  1/D .ı  1/N  .ı  1/D >2 D 2.ı  1/. N D N D

Of course, after this deletion new vertices with degrees  .ı  1/ may appear. We repeat the deletion of all such vertices if needed. This procedure will terminate eventually because we consider finite graphs and it will terminate when the remaining graph is nonempty because of the bound on average degree. Thus the result will be a nonempty graph G with minimum degree  ı girth  g and at most .2ı/g vertices.

5.11 Graphs with large girth using variational techniques The purpose of this section is to prove the existence of graphs with girth g and minimum degree ı and to estimate from above the number of vertices in such graphs. Our first result (Theorem 5.67) proves the existence of even ı-regular graphs satisfying the mentioned conditions. Our second result (Theorem 5.69) is somewhat weaker, but is algorithmic. Theorem 5.67. Let ı, g 2 N, g  3, ı  2, and let X

g2

m

i D0

.ı  1/i D .if ı ¤ 2/ D

.ı  1/g1  1 ı2

(5.36)

be an integer. Then there exists a ı-regular graph of order 2m and girth at least g. Remark 5.68. A useful consequence of this is that for even values of n there exist ı-regular graphs with n vertices and girth  c logı n. It is also worthwhile to mention that the main difference between (5.36) and the inequalities in Theorem 5.63 is that the power in (5.36) is twice larger: g1 vs..g1/=2. Proof of Theorem 5.67. We fix m D m0 satisfying (5.36) for some ı D ı0 . We use induction on ı 2 ¹2, : : : , ı0º to show that for each ı 2 ¹2, : : : , ı0 º there is a ı-regular graph with 2m0 vertices and girth at least g. If ı D 2 the result is trivial because a cycle of length 2m0 is the desired graph, so suppose ı > 2. Induction hypothesis: assume that for k 2 ¹2, : : : , ı0  1º we have

171

Section 5.11 Graphs with large girth using variational techniques

proved the existence of a k-regular graph with 2m0 vertices and girth at least g. In remains to show that under this hypothesis we can find a .k C 1/-regular graph with 2m0 vertices and girth at least g. We are going to use .G/ to denote the maximum degree of vertices of a graph G. Let G be a graph with 2m0 vertices, girth  g, k  ı.G/  .G/  k C 1 (at least one graph exists by the induction hypothesis) and the maximal possible number of edges. To prove the theorem it suffices to show that it is .k C 1/-regular. Assume the contrary, then G has at least two vertices with degree k. Let u and v have degree k. If dG .u, v/  g  1, we add the edge uv to the graph, and get a contradiction with the assumption that G has the maximal possible number of edges. So we assume dG .u, v/  g  2 and that any pair of vertices of degree k in G is at distance  g  2. Let us show first that if we find an adjacent pair of vertices, y1 , y2 2 V .G/, such that dG .y1 , u/  g  1 and dG .y2 , v/  g  1, we can complete the argument. In fact, in such a case we delete the edge y1 , y2 and add edges y1 u and y2 v. It is clear that doing so we increase the total number of edges and do not create cycles of length  g  1. (Here we need to proceed with some care because by adding two edges for pairs at distance  g  1 we can create short cycles. So we add them one by one. After the addition of the edge y1 u the distance between y2 and v is still  g  1 because a y2 v-path of length < g  1 cannot go through y1 . In fact, the distance between y1 and y2 after the deletion of y1 y2 is at least g  1.) Thus we get a contradiction with the assumption that G has the maximal possible number of edges. To find a pair y1 , y2 we consider the balls of radii g  2 centered at u and v: B1 :D ¹w : dG .w, u/  g  2º,

B2 :D ¹w : dG .w, v/  g  2º.

Since the degrees of u and v are equal to k, we have jB1 j 

g2 X i D0

ki 

g2 X

.ı0  1/i  m0 ,

i D0

jB2 j 

g2 X i D0

ki 

g2 X

.ı0  1/i  m0 .

i D0

The inequality dG .u, v/  g  2 implies that B1 and B2 intersect. Since jV .G/j D 2m0, the complement of the union of the balls contains at least one vertex y 2 V .G/n.B1 [ B2 /. Now we analyze: how can it happen that a vertex y 2 V .G/n.B1 [ B2 / does not have a neighbor which is at distance  g  1 to u or to v? (If y has such a neighbor z, then y and z form the desired pair y1 , y2 .) An easy analysis shows that this can happen only if all neighbors of y are in the intersection S1 \ S2 , where S1 :D ¹w : dG .w, u/ D g  2º,

S2 :D ¹w : dG .w, v/ D g  2º.

So it remains to show that each element of V .G/n.B1 [ B2 / cannot have all of its neighbors in S1 \ S2.

172

Chapter 5 Families of expanders and of graphs with large girth

Observe that the degree of each vertex in V .G/n.B1 [ B2 / is k C 1 (otherwise such a vertex y together with u or v would form a pair of vertices of degree k with distance  g  1 between them, contrary to our assumption). Our next observation is: jV .G/n.B1 [ B2/j > jS1 \ S2j.

(5.37)

In fact, we have jV .G/n.B1 [ B2 /j D jV .G/j  jB1 [ B2 j D jV .G/j  jB1 j  jB2 j C jB1 \ B2 j. On the other hand, jB1 \ B2 j  jS1 \ S2 j C 2, because, in addition to S1 \ S2 , the set B1 \ B2 contains at least two vertices u and v. Since degrees of all vertices of V .G/n.B1 [B2/ are k C1 and degrees of all vertices of S1 \ S2 are at most k C 1, inequality (5.37) implies that it cannot happen that all neighbors of V .G/n.B1 [ B2/ are in S1 \ S2 . Our next purpose is to show that if we do not insist on having a ı-regular graph and are content with a graph G satisfying ı  1  ı.G/  .G/  ı C 1, we can make the construction of such graph algorithmic. For simplicity we consider the case where the number of vertices is even. Theorem 5.69. Let n 2 N be even, n D 2k, and let V D ¹1, : : : , nº. For each ı  2 there is a graph G.ı/ with vertex set V for which the average degree is ı,  the  minimum degree  ı  1, the maximum degree  ı C 1 and the girth is > logı n2 C 1. Proof. We construct such graphs algorithmically and inductively. In the case ı D 2 we let G.ı/ D G.2/ be a cycle with n vertices. It is clear that all of the conditions are satisfied. We produce G.3/ using the following simple algorithm. We add k D n2 edges to G.2/, one by one. Each time we pick one of the ends of the edge which we are adding to be a vertex u of the least degree among the vertices of the (intermediate) graph G which we modify, and the second vertex to be the vertex which is the farthest in G from u among vertices whose degree does not exceed 3. It is clear that in this way we get a graph whose average degree is 3 (average degree increases by 1 just because we add n2 edges), the maximum degree is  4 and the minimum degree is  2. It remains to estimate the girth. To do this we need to estimate from below distances between vertices which we join by edges. When we add an edge uv to G, it means that all vertices w satisfying dG .u, w/ > dG .u, v/

(5.38)

have degree 4 (otherwise we would add a different edge). Since the average degree of G is < 3 (we reach average degree 3 only at the end) and the minimum degree is 2, we get that the number of vertices w satisfying (5.38) is < n2 . Since the maximum degree of G is 4 and the degree of u is 2, estimating the number of vertices w which do not satisfy (5.38) we get

173

Section 5.11 Graphs with large girth using variational techniques

1 C 2 C 2  3 C 2  32 C    C 2  3dG .u,v/1 >

n . 2

This inequality can be rewritten as 3dG .u,v/ >

n . 2

Therefore dG .u, v/ > log3 and the girth of the obtained graph is

n

n

2

C 1, 2 which is the inequality claimed in Theorem 5.69. Now we turn to an induction step. At the beginning we have a graph G.ı/ with average degree degree  ı  1, the maximum degree  ı C 1 and  n ı, the minimum n girth  logı 2 C 1. We add 2 edges to G.ı/, one by one. Each time we pick one of the ends of the edge which we are adding to be a vertex u of the least degree among the vertices of the (intermediate) graph G which we modify, and the second vertex to be the vertex which is the farthest in G from u among vertices whose degree does not exceed ı C 1. It is clear that in this way we get a graph G.ı C 1/ whose average degree is ı C 1 (average degree increases by 1 just because we add n2 edges), the maximum degree is  ı C 2. We need to estimate the minimum degree and the girth. The fact that the minimum degree becomes  ı follows from the following argument. It is clear that G.ı/ cannot have more than n2 vertices of degree ı  1. Since we add n2 edges and each time pick one of the ends to be a vertex with minimum degree of the intermediate graph G, all of the vertices of degree ı  1 in G.ı/ will increase their degree when we complete our construction of G.ı C 1/. To estimate the girth we need to estimate from below distances between vertices which we join by edges. When we add an edge uv to G, it means that all vertices w satisfying dG .u, w/ > dG .u, v/ have degree ı C2 (otherwise we would add a different edge). The number of such vertices is < n2 because initially (when we started to add edges to G.ı/) there were no such vertices, and in each step we could increase the number of such vertices at most by 1. Since the maximum degree of G is ı C 2 and the degree of u is  ı, we get that the number of vertices w which do not satisfy (5.38) is n 1 C ı C ı  .ı C 1/ C ı  .ı C 1/2 C    C ı  .ı C 1/dG .u,v/1 > . 2 This inequality can be rewritten as n .ı C 1/dG .u,v/ > . 2 > log3

174

Chapter 5 Families of expanders and of graphs with large girth

Therefore dG .u, v/ > logı C1

n

2

and the girth of the obtained graph is > logı C1

n

2

C 1.

This proves Theorem 5.69.

5.12 Inequalities for the spectral gap of graphs with large girth There exists a very elegant argument imposing restrictions on the spectrum of adjacency matrices of graphs with large girth whose number of vertices is very close to the lower bound of Theorem 5.63. Let A be the adjacency matrix of a k-regular graph G with n vertices of girth g D 2r C 1. Let n.g, k/ be the lower bound of Theorem 5.63 on the number of vertices of this graph, that is k n.g, k/ D 1 C ..k  1/r  1/. k2 The number n  n.g, k/ is called the excess of G. Let us denote by Aj the following generalization of the adjacency matrix: the u, ventry of this matrix is 1 if dG .u, v/ D j and is 0 otherwise. In this notation the usual adjacency matrix is A1 , and A0 is the identity matrix. It is easy to find a recursive polynomial expression for the matrix Fr .A/ D A0 C    C Ar with rows and columns indexed by vertices of G and such that the entry is 1 if the distance between the vertices is  r . More precisely we have: A0 D I ,

A1 D A,

A2 D A2  kI ,

AkC1 D A  Ak  .k  1/Ak1 if 3  k  r  1. In fact, the u, v-entry of the matrix A  Ak is the number of uv-walks of length k C 1 satisfying the following condition: the walk goes to a vertex which is at distance k from u, and then reaches v using just one edge. Now we use the fact that in a graph of girth 2r C 1 there is just one shortest wz-walk for any two vertices w and z with dG .w, z/  r . This implies that the u, v-entry of the matrix A  Ak is 1 if dG .u, v/ D k C 1 and is k  1 if dG .u, v/ D k  1. Theorem 5.70. Let 2 be the second eigenvalue of the adjacency matrix A. Then jFr . 2/j  n  n.g, k/.

(5.39)

Section 5.13 Biggs’s construction of graphs with large girth

175

Proof. Observe that the number of ones in the column (or row) of the matrix Fr .A/ corresponding to a vertex u is the number of vertices in G which are at distance  r from u in G. It is easy to see that this number is n.g, k/. Therefore the eigenvalue of the matrix Fr .A/ corresponding to all-ones vector 1V .G/ is n.g, k/. On the other hand, the matrix J (all-ones matrix) is equal to A0 C    C AD , where D is the diameter of the graph. Observe that the matrix J  Fr .A/ has the same amount of ones in each column and each row, and this amount is n  n.g, k/, the excess of the graph G. Observe that both Fr .A/ and J are symmetric real matrices and both have 1V .G/ as an eigenvector. Also the restriction of J to the orthogonal complement of 1V .G/ is a zero-operator. Therefore Fr .A/ and J have a common basis of eigenvectors. Observe that all eigenvalues of the difference J  Fr .A/ do not exceed n  n.g, k/ (the argument is the same as in Proposition 5.4) and that J has one eigenvalue equal to n and all other eigenvalues equal to 0. Thus jFr . /j  n  n.g, k/ for each eigenvalue of A except the first one.

5.13 Biggs’s construction of graphs with large girth Now we turn to more algebraic constructions of families of graphs with fixed degree and growing girths. Definition 5.71. We say that a group G acts on a set X if there is a map G  X ! X, .g, x/ 7! g.x/ such that 

e.x/ D x for all x 2 X, where e is the identity of G.



f .g.x// D .fg/.x/ for all f , g 2 G and all x 2 X.

Definition 5.72. Let G be a finite group which acts on a finite set X. Let S be a symmetric subset of G (this means that e … S and .s 2 S/ , .s 1 2 S/) which acts freely on X in the sense that s.x/ ¤ x for all x 2 X and s 2 S, and that s1 .x/ ¤ s2 .x/ for all s1 , s2 2 S with s1 ¤ s2 and all x 2 X. Then the Schreier graph Sch.X, S/ is defined as the graph with vertex set X and edge set defined by the condition uv is an edge if and only if u D s.v/ for some s 2 S. Biggs’s construction produces Schreier graphs of subgroups of symmetric groups. First we produce the corresponding set X. It consists of labelings of vertices of a d -regular rooted tree T of depth r . This means the following: T has 1 C d C d.d  1/ C    C d.d  1/r 1 vertices which are partitioned into generations having 1, d , d.d  1/, : : : , d.d  1/r 1 vertices, respectively. The generation with d.d  1/j 1 vertices, j 2 ¹2, : : : , r º, is called the generation j ; the generation with d vertices is called the generation 1; the

176

Chapter 5 Families of expanders and of graphs with large girth

generation with 1 vertex is called the generation 0. The only vertex in the generation 0 is called the root. The edge set of T is (up to an isomorphism) determined by the following rules: Each vertex of generation j , 1  j  r , is adjacent to one vertex of the generation j  1, each vertex of generation j , 1  j  r  1, is adjacent to d  1 vertices of generation j C 1, and the root is adjacent to d vertices of generation 1. It is easy to see that all vertices in T except vertices of generation r have degree d . Vertices of generation r have degree 1. We denote by V the set of vertices of T , and let L be some set of labels satisfying jLj D jV j. We consider some initial labeling of V with elements of L. The set X consists of all labelings of V obtained from this initial labeling in the way described below. We consider a proper d -coloring of edges of T . We mean that we color edges using d different colors in such a way that edges having the same color do not have a common end. (It is easy to construct such coloring starting from edges incident with the root.) For each of the d colors we introduce the corresponding permutation of labels: The permutation i corresponding to the color i transposes pairs of labels which are at the ends of all edges having color i (more details: if an edge has color i and its ends have labels `1 and `2 before the action of i , after the action they have labels `2 and `1 , respectively). Let X be the set of all labelings obtained from the original labeling using all possible combinations of permutations i , i D 1, : : : , d . It is clear that i1 D i . Therefore S D ¹i : i D 1, : : : , d º is a symmetric set. Let G D Sch.X, S/ be the corresponding Schreier graph. It is clear that the graph G is d -regular. Now we estimate the girth of G from below. We claim that g.G/  2r C 1. It is not difficult to see that to prove this claim we need to show that for k < 2r C 1 the composition nk nk1 : : : n2 n1 cannot satisfy the following two conditions simultaneously: (a) Its product (in the given order) is the identity permutation. (b) There is no i such that ni D niC1 . To show this we apply nk nk1 : : : n2 n1 to some labeling x in X. We see that the condition (b) implies that the label of the root in x will be moved away (in the graph distance of T ) from its original position by at least first r permutations: in order to move the label in the direction of the root a permutation of the set n1 , n2 , : : : , nr has to coincide with the immediately preceding one. This shows that the girth is at least 2r C 1.

177

Section 5.14 Margulis’s 1982 construction of graphs with large girth

5.14 Margulis’s 1982 construction of graphs with large girth Recall that the special linear group SL2.Z/ is the group of all 2  2 matrices with integer coefficients and determinant equal to 1. Let a and b be two elements in a group G. We say that a and b generate a free subgroup in G if none of the nonempty words in a and b which do not contain any of the subwords from the list aa1 , a1 a, bb 1, b 1 b

(5.40)

represent the identity of G. Theorem 5.73. The matrices



1 2 aD 0 1

 and

  1 0 bD 2 1

(5.41)

generate a free subgroup of SL2 .Z/. Proof. These matrices act in a natural way on R2 . We prove the theorem by studying the action of these matrices on the regions X D ¹.x, y/ 2 R2 : jxj > jyjº and Y D ¹.x, y/ 2 R2 : jyj > jxjº. Straightforward verification shows that any nonzero power of a (positive or negative) maps Y into X and that any nonzero power of b (positive or negative) maps X into Y . Observe that any word in a and b which does not contain subwords from the list (5.40) can be written in one of the forms: (a) an1 b n2 an3 : : : ank2 b nk1 ank (k is odd). (b) b n1 an2 b n3 : : : b nk2 ank1 b nk (k is odd). (c) an1 b n2 an3 : : : ank1 b nk (k is even). (d) b n1 an2 b n3 : : : b nk1 ank (k is even). The elements of the form (a) cannot represent the identity because they map Y into X. The elements of the form (b) cannot represent the identity because they map X into Y . To show that words of the forms (c) and (d) cannot represent the identity we use the following trick to show that if a product of powers of even length represents the identity, then there is a product of powers of odd length word representing the identity. We do this for words of the form (c) (the (d) case is similar). Observation: If an1 b n2 an3 : : : ank1 b nk is equal to the identity and n1 > 0, then an1 C1 b n2 an3 : : : ank1 b nk a1 is also equal to the identity. If n1 < 0, we consider an1 1 b n2 an3 : : : ank1 b nk a. In both cases we get a product of powers of odd length representing the identity.

178

Chapter 5 Families of expanders and of graphs with large girth

We consider the natural quotient mapping of SL2.Z/ onto SL2.Zp /, where p is a prime number (see Section 5.4.2 for more details on this quotient) and the Cayley graph of the group SL2.Zp / with respect to the set S of generators which are the natural images of the matrices (5.41) and their inverses. The main result of this section is that these Cayley graphs have large girth. Theorem 5.74. g.Cay.SL2.Zp /, S//  logp2C1 .p  1/. Proof. We estimate the cycle length from below in the following way: the length of the shortest cycle is the length of the shortest word of one of the forms (a)–(d) which is equal to the identity of SL2.Zp /. We consider the corresponding word in SL2.Z/. Since the matrices (5.41) generate a free subgroup in SL2.Z/, the word cannot be the identity in SL2.Z/, so it contains entries whose differences with the respective entries of the identity matrix are multiples of p. Thus it contains at least one entry whose absolute value is  p  1. To estimate the number of terms in the product of matrices (5.41) in which at least one of the entries has absolute value  p  1 we use the submultiplicativity of matrix norms. This means the following: we consider each of the matrices as an operator on `22 and estimate its operator norms. It is easy to see that the norm of a matrix containing an entry with absolute valuep p  1 is  p  1. On the other hand, it is easy to compute that kak D kbk D 2 C 1 (using either Lagrange multipliers or some linear algebra). Hence each cycle in Cay.SL2.Zp /, S/ should contain at least logp2C1 .p1/ edges.

5.15 Families of expanders which are not coarsely embeddable one into another In this section we allow expanders to have loops and assume that each loop contributes 1 to the degree of the vertex it is attached to. Definition 5.75. Saying that a family F1 of metric spaces is uniformly coarsely embeddable into a family F2 of metric spaces we mean that there exist 1 and 2 such that each metric space in F1 is coarsely embeddable into some metric space of the family F2 , and the corresponding inequality (see Definition 1.45) holds for each of the embeddings with the same 1 and 2 . Theorem 5.76. There exist families F1 , F2 of expanders such that F1 is not uniformly coarsely embeddable into F2 . We are going to prove this theorem picking the family F2 consisting of expanders with indefinitely growing girth, and the family F1 consisting of expanders containing

Section 5.15 Not coarsely embeddable one into another

179

plenty of cycles of different lengths (satisfying an additional condition). Existence of families of the first type is well known (see Notes and Remarks). Families of the second type can be easily constructed as is shown in the next proposition. The main idea of this construction is: if we consider a union of an expander and a regular graph with the same set of vertices, we get an expander. Recall that a subset A in a metric space .X, d / is called ı-separated, ı > 0, if d.u, v/  ı for any u, v 2 A, u ¤ v. Proposition 5.77. Let ¹Pn º1 nD1 be a d1 -regular family of expanders. It is clear that 1 we can find non-decreasing sequences of positive integers ¹an º1 nD1 and ¹bnºnD1 with limn!1 an D limn!1 bn D 1, and such that Pn contains an an -separated set of vertices of cardinality bn . Let ¹Snº1 nD1 be graphs satisfying .Sn /  d2 ( .Sn / is the maximum degree of Sn ), jV .Sn /j  bn and diamSn  an . Then there is a .d1 C d2/-regular family ¹Gnº of expanders such that Gn contains an isometric copy of Sn . Proof. We just add to these an -separated sets of vertices of cardinality bn the edge structure of Sn. The condition diamSn  an implies that it is an isometric embedding of Sn into Pn . The maximum degree of the obtained graph is  d1 C d2. We add the necessary amount of loops to get a .d1 C d2/-regular graph. It is clear that the expanding constant of the graph obtained in this way from Pn cannot be less that the expanding constant of Pn . Proof of Theorem 5.76. Let Sn be a square grid graph defined as the graph with the vertex set ¹1, 2, 3, : : : , nº  ¹1, 2, 3, : : : , nº; vertices .k1 , m1 / and .k2 , m2/ in Sn are adjacent if and only if jk1  k2j C jm1  m2 j D 1. Applying Proposition 5.77 to an arbitrary family of expanders with ¹Snº1 nD1 being square grid graphs we get a family of expanders which we denote ¹Gnº1 nD1 . We may omit some graphs in this sequence and assume without loss of generality that that Gn contains a subgraph isometric to Sn . We claim that the family ¹Gn º1 nD1 does not admit uniformly coarse embeddings into a family ¹Hn º of expanders with indefinitely growing girths. In fact, assume the contrary, let fn : Gn ! Hm.n/ be a family of uniformly coarse embeddings, 1 and 2 be the corresponding functions. Consider restrictions of ¹fn º to ¹Snº. It is clear that the graph Sn contains cycles of lengths 4, 8, : : : , 4n. Furthermore, these cycles can be selected in such a way that vertices in a cycle of length 4k, which are at distance in the (graph distance of the) cycle are at distance  min¹k, º in Sn (we can just pick the cycles to be naturally defined “squares” in Sn ). The vertex set of such a cycle of length 4k is mapped onto a sequence ¹wi º4k i D1 in Hm.n/ satisfying dHm.n/ .wi , wi C1 /  2.1/ (we let i C 1 D 1 for i D 4k). We join wi and wi C1 with a shortest path in Hm.n/ and get closed walks of lengths  42.1/, : : : , 4n2.1/ in Hm.n/ . It is clear that for k satisfying 4k2.1/ < g.Hm.n/ /, where g.Hm.n/ / is the girth of Hm.n/ , the subgraph Rk of Hm.n/ induced by the introduced above walk of length  4k2.1/ is a tree. Let k0 be the largest k satisfying 4k2.1/ < g.Hm.n/ /. Now we use:

180

Chapter 5 Families of expanders and of graphs with large girth

Proposition 5.78. Let T be a thickening of a graph-theoretical tree, that is, a metric space obtained from a graph-theoretical tree if we identify each of its edges with a line segment of length 1, with the metric defined as the length of the shortest curve joining the points. Then, for any continuous map f : S ! T , where S is the unit circle in the plane with its geodesic distance dS , there exist points x, y 2 S such that dS .x, y/  2=3 and f .x/ D f .y/. Proof. Divide S into three equal parts of length 2=3 each, let a, b, c be the ends of the parts. Then the images of the parts contain continuous images of line segments joining each of the pairs .f .a/, f .b//, .f .a/, f .c// .f .b/, f .c//. By a well-known simple fact such images of line segments in a thickening of a tree contain a common point p. Considering pre-images of p in all three parts of the circle S we get a triple of points with distance  2=3 at least between one of the pairs. We apply this proposition to the map constructed in the following way. As is clear from our construction, a cycle of length 4k0d2 .1/e admits a 1-Lipschitz map onto Rk0 which is an extension of the restriction of fn to the 4k0-cycle: we replace each edge of the cycle by a path of length d2 .1/e and map this path in a 1-Lipschitz way onto the shortest path joining the corresponding wi and wi C1 in Hm.n/ (we do not require this 2 .1/e map to be injective or a coarse embedding). We use this map to construct a 4k0 d 2

Lipschitz map of S onto the thickening Tk0 of Rk0 in the most natural way: we identify vertices of the 4k0 d2.1/e-cycle with the points of S forming a regular convex polygon with 4k0 d2.1/e vertices and extend the map to the intervals between these vertices by mapping them onto the corresponding shortest paths in the thickening. By Proposition 5.78, there are two points x, y 2 S with dS .x, y/  2=3 with the same image. We consider the nearest to x and y points xS and yS in S corresponding 2

2

to images of vertices of the 4k0 -cycle. Then dS .x, xS /  4k and dS .y, yS /  4k . 0 0 Therefore dHm.n/ .fn .xS /, fn .yS //  dTk0 .fn .xS /, fn .yS // 

4 4k0 d2 .1/e D 2d2.1/e.  4k0 2

It is also easy to estimate that the distance between xS and yS in the 4k0 -cycle is at least b 43 k0  2c. Recalling the way in which the distance on a 4k0 -cycle compares with the distance in square grid graph Sn , as well as the definition of a coarse embedding, we get  ²  ³ 4 k0  2  2d2 .1/e, 1 min k0 , 3 where k0 is the largest integer satisfying 4k2.1/ < g.Hm.n/ /. This shows that limn!1 g.Hm.n/ / D 1 contradicts limt !1 1 .t / D 1.

Section 5.16 Exercises

181

5.16 Exercises Exercise 5.79. The complete bipartite graph Kn,n is n-regular. Show that its eigenvalues are 1 D n, 2 D    D 2n1 D 0 and 2n D n. Exercise 5.80. The complete graph Kn is .n  1/-regular. Show that its eigenvalues are 1 D n  1, 2 D    D n D 1. Exercise 5.81. For each n, let Cn be the cycle of length n, that is, the graph whose vertex set is ¹1, : : : , nº for which two vertices are adjacent if and only if either the difference between the corresponding number is ˙1, or one of the numbers is 1 and

2 j the other is n. Show that the eigenvalues of Cn are equal to 2 cos n , j D 1, : : : , n. If needed, see a hint in the Hints to exercises section below. Exercise 5.82. The Hamming cube defined on page 118 can be introduced as a graph with its graph distance. In fact, one can check that Fn2 is naturally isometric to the graph whose vertex set is the set of all n-element sequences consisting of 0 and 1 and whose edge set consists of edges joining sequences with exactly one distinct entry. Show that the eigenvalues of the adjacency matrix of this graph are n, n  2, n  4, : : : , n C 2, n. If needed, see a hint in the Hints to exercises section below. Exercise 5.83. Let G be a k-regular graph on n vertices with expanding constant h. Show that there is a constant ˛ > 1 depending only on k and h such that for every vertex v 2 V .G/ and each radius r 2 N, we have the following estimate for the cardinality of the ball jB.v, r /j  min¹˛ r , jV .G/jº. (The conclusion is that the diameter of G does not exceed log˛ jV .G/j.) If needed, see a hint in the Hints to exercises section below. Exercise 5.84. Let G be a k-regular graph on n vertices with expanding constant h, and let f : V .G/ ! R (since the graph is finite the function is necessarily Lipschitz). Let M be a median of f . Show that ¹v 2 V .G/ : jf .v/  M j  t Lip.f /ºj  C ne ct for all t > 0 and some constants 0 < c, C < 1 depending only on h. The next two exercises are related to Proposition 5.13.

(5.42)

182

Chapter 5 Families of expanders and of graphs with large girth

Exercise 5.85. Assuming that you already know that there exist a family ¹Fn º1 nD1 of finite groups with limn!1 jFn j D 1 and symmetric subsets Sn  Fn such that jSn j D k for all k and ¹Cay.Fn , Sn /º1 nD1 is a family of expanders, show that the condition (5.12) cannot be replaced by jSn  An j  .1 C c/jAn j.

(5.43)

If needed, see a hint in the Hints to exercises section below. Exercise 5.86. Let ¹Fn º be a sequence of finite commutative groups with lim jFn j D 1

n!1

and Sn  Fn be symmetric subsets of Fn such that the cardinality jSn j does not depend on n. Then the sequence ¹Cay.Fn , Sn /º1 nD1 cannot be a sequence of expanders. If needed, see a hint in the Hints to exercises section below. Exercise 5.87. Let G be a 2n -regular graph, n 2 N. Then the edge set of G can be represented as a union of edge sets of 2-regular spanning subgraphs of G. If needed, see a hint in the Hints to exercises section below. Exercise 5.88. (This exercise if substantially more difficult than Exercise 5.87. This result is due to Petersen.) Let G be a 2k-regular graph .k 2 N/. Then the edge set of G can be represented as a union of edge sets of 2-regular spanning subgraphs of G. If needed, see a hint in the Hints to exercises section below. Exercise 5.89. (The purpose of this exercise is to show a very interesting application of Exercise 5.88.) Let G be a 2k-regular graph .k 2 N/. Then G is a Schreier graph. If needed, see a hint in the Hints to exercises section below. Exercise 5.90. Show that there exist families of k-regular graphs for which the absolute spectral gaps (see Definitions 5.35 and 5.40) are not bounded away from 0, but the usual spectral gaps (see Definition 5.6) are bounded away from zero. If needed, see a hint in the Hints to exercises section below. Exercise 5.91. Show that the following three matrices, together form a symmetric generating set of SLn.Z/, n  3. 2 3 2 3 2 0 1 0 ::: 0 0 1 0 1 0 ::: 0 0 6 1 0 0 : : : 0 0 7 6 0 0 1 ::: 0 0 7 6 1 6 7 6 7 6 6 0 0 1 ::: 0 0 7 6 0 0 0 ::: 0 0 7 6 0 6 . . . . 6 7 6 .. .. .. . . .. .. 7 6 .. .. .. . . ... ... 7 , 6 7 , 6 ... . . . . . . 6 7 6 7 6 4 0 0 0 ::: 1 0 5 4 0 0 0 ::: 0 1 5 4 0 0 0 0 ::: 0 1 0 .1/n1 0 0 : : : 0 0 If needed, see a hint in the Hints to exercises section below.

with their inverses, 0 1 0 .. . 0 0

0 0 1 .. . 0 0

::: ::: ::: .. . ::: :::

0 0 0 .. . 1 0

0 0 0 .. . 0 1

3 7 7 7 7. 7 7 5

Section 5.17 Notes and remarks

183

Exercise 5.92. Let G be a finitely generated group. Show that one can find in G a symmetric generating set (see Definition 5.9), such that the corresponding graph Z (see the beginning of Section 5.4.4) is connected. Exercise 5.93. Give an example of graphs G1 and G2 such that the zigzag product z G2 is defined, and for different choices of bijections u (see Definition 5.41) G1  we get non-isomorphic zigzag products. Exercise 5.94. Prove the statement (2) in Remark 5.55. Exercise 5.95. Let k  3, t  3, k, t 2 N, and let c3 , c4 , : : : , cm 2 .N [ ¹0º/. Prove that there exists a k-regular graph having exactly cj cycles of length j , 3  j  m. (There are no restrictions on longer cycles.) If needed, see a hint in the Hints to exercises section below.

5.17 Notes and remarks General references: Recommended sources for readers willing to learn more about expanders and their applications in different areas are [110], [205], [299], [301], [423], see also references therein. The existence of expanders was first proven by Kolmogorov–Barzdin in [253] and then later by Pinsker in [374]. Both proofs are probabilistic and while Pinkser was the first to refer to the objects as expanders, his definition deviates slightly from the one presented in Definition 5.1. Gromov and Guth recently published a paper, [180], containing multidimensional generalizations of the Kolmogorov–Barzdin results. A probabilistic construction of expanders which is very close to Pinsker’s is given in Sarnak [404, Section 3.1.2] (it was also reproduced in [299, p. 6]). It is the following: We consider two sets of cardinality n, denote them I and O, we consider five permutations 1 , 2 , 3 , 4, 5 in the symmetric group Sn . We connect each element i of I with 1 .i/, : : : , 5 .i/ in O (with some abuse of notation we assume that both I and O are identified with ¹1, : : : , nº). Theorem 5.96. There exists a collection 1 , 2, 3 , 4 , 5 such that for each subset A  I with jAj  n=2 the number of neighbors of elements of A in O is at least 32 jAj. Remark 5.97. If we consider such graphs for all n and “paste” sets I and O according to one of the possible bijections, we get an expanding family of graphs, but such graphs may have parallel edges and loops, so we have to widen our notion of an expander family to include graphs which are not simple. Other probabilistic constructions were suggested and it was later discovered that such constructions could be used to determine expander families consistent with the

184

Chapter 5 Families of expanders and of graphs with large girth

definition provided in Definition 5.1. For more information see [70] and [297]. We are going to provide more details on the construction contained in the problem book of Lovasz [297, Problem 11.33], where 3-regular expanders, without loops and parallel edges were constructed. Here is the description of this construction. Definition 5.98. Let G be a graph. A 1-factor in G is a collection F of edges satisfying the condition: each vertex of G is incident to exactly one edge in F . Let G be a union of three random 1-factors on the set ¹1, : : : , 2nº. The proof goes in two steps: Theorem 5.99. (a) If n is large enough, the probability that G is a 3-regular simple graph is between 0.1 and 0.9. (b) The probability that G has expanding constant 

1 1000

is at least 95%.

To prove the statement (a) one is expected to use inclusion-exclusion formulas developed in [297]. A similar construction with less technical estimates, but also with a weaker result (only k-regular expanders for sufficiently large k) is presented in the lecture course [423]. Very strong results on probabilistic expanders were obtained by Friedman [156]. He proved that for an even d  4, a random d -regular graph model formed from d=2 uniform, independent permutations on ¹1, : : : , nº has a very large spectral gap k  2. Namely, p he proved that for any " > 0 all eigenvalues aside from p 1 D d are bounded by 2 d  1 C " with probability 1  O.n /, where D d. d  1 C 1/=2e. The work [156] contains results for other models of random graphs and is indispensable for everyone interested in random expanders. Amit and Linial introduced another approach to the construction of random expanders in [21]. The notion of graph lift, which we introduce in Section 5.7, serves as the cornerstone of their method. A version of their construction can be found in Section 5.8 and for more general constructions, which include the 3-regular case, refer to [21]. Section 5.2 contains a very short introduction to spectral graph theory, which is a well-developed area with various applications in other areas of mathematics. Interested readers are referred to [61], [100], [106], [170], and [297, §11] for additional information and references. We strongly recommend the elegant exposition in Biggs [61] and the rather small collection of problems in Lovasz [297, §11], which nevertheless develops many useful and important techniques. The right-hand side inequality in the spectral characterizations of expanders (Theorem 5.7) is due to Alon [10] and Dodziuk [125]. The left-hand side inequality is due to Alon [10], a close inequality was proved by Tanner [421]. Alon’s proof of the spectral characterization is based on an inequality proved in [15]. These results can be considered as discrete versions of results of Cheeger [95]. See [402] for an interesting and accessible introduction to Poincaré and Sobolev inequalities in a continuous setting.

Section 5.17 Notes and remarks

185

The method, first mentioned in the introduction, in which the expanding properties of randomly-generated expanders are checked using spectral characterization, was developed by Alon [10]. Results on algorithms for computing eigenvalues can be found in [388, Chapter 10]. The Cheeger constant itself is computationally hard. More precisely, [64] proved that its computation is co-NP-complete (see the paper for explanation of the term). They proved the statement in somewhat different setting, see [10] for relations between the notions used in [64] and the Definition 5.1. The property (T) was introduced by Kazhdan [246] and actively studied since then. See [53] for a recent accessible account. Property (T) was used by Margulis [308] to find the first deterministic construction of expanders, Section 5.3 is mostly based on his ideas. These ideas were further developed by Alon and Milman in [15], the source which we also use, as well as the lecture notes [423]. The estimates of Kazhdan constants of alternating groups which were mentioned in Section 5.3 are due to Kassabov [244]. Proposition 5.28 is a classical result whose history I did not investigate. We refer interested readers to [306, Section 22], [104, Section 7.2], [340, Section 5 in Chapter VII], [439], [195, Chapter III], and references therein both for the history of the study of finite generation of SLn.Z/ and related groups, and the smallest known symmetric sets generating SLn.Z/. Currently the smallest symmetric set known to generate SLn .Z/ has cardinality 4, see [340]. The first proof of the fact that SLn.Z/ has property (T) mentioned in Section 5.4.3 is the original proof of Kazhdan [246]. The second proof is due to Shalom [410]. Both proofs are presented in [53]. The first proof and some related results and approaches are presented in [423]. ˙ [446]. Mentioned generalizations and improvements of Theorem 5.34 is due to Zuk Theorem 5.34, as well as examples showing applicability of the criteria, can be found in [53, Chapter 5]. Construction of families of expanders based on zigzag products was suggested by Reingold, Vadhan, and Wigderson [392]. (A variant of this construction was analyzed earlier by Gromov [174].) This construction was developed in many different directions, see, for example, [14], [397]. Mendel and Naor [329] suggested a new approach to the proof of the main step of the construction of [392]. In our presentation we use the approach suggested in [329]. It is worth mentioning that inequality 5.24 implies the main result of [392]. Actually it leads to the later bound of Reingold–Trevisan–Vadhan [391]. Our construction of a base graph follows the paper [17], this construction goes back to the paper [16]. We refer to [307, Chapter 4] for the construction and properties of the fields Fq , where q is a power of a prime number. The existence and uniqueness of Fq , where q is a power of prime is shown in [307, Chapter 4, §3]; the fact which we use is shown in the introductory remarks of [307, Chapter 4, §2]. Graph lifts also known as covering graphs and graph covers are classical objects in topology and topological graph theory. They were studied from different perspectives in [20], [21], [22], [23], [27], [43], [128], [140], [183], [184], [185], [205, Section 6],

186

Chapter 5 Families of expanders and of graphs with large girth

[279], [337], [359], [417]. In some works these notions are studied using such terms as voltage graph and voltage assignment. Example 5.53 is a version of [68, Exercise 2, p. 161], it can also be found in [140], where an interesting strengthening of it is also proved. Proposition 5.56 was proved in [139, p. 254]. Theorem 5.57 was proved in [359], the construction was inspired by [27]. Theorem 5.57 solves the problem posed by Linial, Magen, and Naor [292], and repeated by Linial in [289, Open Problem 7] and [317, Problem 2.3]. See, for example, [71, Section 3.3] for well-known results on Euler trails and tours which we use in the proof of Theorem 5.57. Theorem 5.62 is a weakened and simplified version of a result of Amit and Linial [21, Theorem 2.1]. Some pieces of the argument are borrowed from the notes of Tao [423]. Theorems 5.63 and 5.64 are from [139]. Theorem 5.66 is from [68, Theorem 1.1, p. 104], its proof is close in spirit to the argument of Erd˝os in [138]. Theorem 5.67 goes back to Erd˝os and Sachs [139], it was refined by Walther [433, 434], and by Sauer [405], whose bounds are the strongest, see [68, Chapter III, §1] and [69, §3] in this connection. Theorem 5.69 is due to Chandran [92], where the readers can also find the modification of the proof needed in the case where the number of vertices is odd. The main result of Section 5.12 is due to Biggs [58]. See [19] for related results. The construction presented in Section 5.13 is due to Biggs [60, p. 57], who mentions the fact that it was inspired by the group-theoretical results of [49]. The Biggs’s construction was presented in a more general and explicit form in [11, p. 1752]. Biggs also mentions [60, p. 57] that is seems that the actual girth of the constructed graphs is much larger than 2r C 1 and that this was verified for some special cases in [203]. See [59] for related results and discussions. The construction presented in Section 5.14 is due to Margulis [309]. Our presentation is influenced by the presentation in [110, Appendix] (where one can find more detailed arguments and ideas which can be used to improve the estimates). The better girth parameters were achieved by a modification due to Imrich [206]. Lubotzky observed (see [300, Proposition 2.2.2, p. 164]) that Margulis’s graphs [309] presented in Section 5.14 are expanders. It is worth mentioning that if we consider an analogue of Margulis’s construction with matrices     1 3 1 0 , (5.44) 0 1 3 1 instead of (5.41), the expansion of the obtained graphs does not follow from the argument of [300] and Lubotzky suggested the corresponding open problem [300, Open Problem 2.2.3, p. 165] and [304, Problem 5.18, p. 90]. This problem was solved in the positive by Bourgain and Gamburd [78, Theorem 3]. The main result of Section 5.15 is due to Mendel and Naor [329, Section 9.1]. The topological part of the argument (Proposition 5.78) goes back to Rabinovich and Raz

Section 5.17 Notes and remarks

187

[387]. As for expanders with large girth, which we need for Theorem 5.75, one of the examples is provided by Margulis’s graphs presented in Section 5.14 (see the previous paragraph). A simpler proof of existence of such families can be obtained by doing some more work on the examples of random graphs presented in Section 5.10. See [175, Section 1.B] for the general notion of thickening of a graph. The result of Exercise 5.83 was proved in [15]. Results of the type of Exercise 5.84 establish the so-called concentration of measure phenomenon. The first phenomenon of this type goes back to the result of Levy, the importance of such phenomena was recognized by V. Milman and developed by him and many other authors, see [177, Chapter 3 12 ], [271], and [332] for accounts on the obtained theory. Exercise 5.84 can be easily derived from results of [15] and basic results of this theory. It is difficult to find out who first observed that commutative groups do not lead to expander families (Exercise 5.86). Lubotzky and Weiss [303] discovered several classes of families of finite groups which are not expanders with respect to any families of generators (of constant size). For example they proved that an infinite family of quotient groups of a finitely generated amenable group is not a family of expanders. The result of Exercise 5.88 was proved by Petersen [370]. See also [63, Chapter 10]. The hint to Exercise 5.88 is borrowed from [185, pp. 41–42]. The result stated in Exercise 5.89 is due to Gross [182]. The result of Exercise 5.95 is due to Sachs [398],[399].

5.17.1 Bounds for spectral gaps We would like to finish this section by mentioning some more important constructions of expanders and graphs with large girth. The following theorem bounds from above the spectral gap of a k-regular graph. Theorem 5.100 (Alon-Boppana). If ¹Gn º1 nD1 is a family of k-regular graphs with jV .Gn /j ! 1, then p lim inf 2.Gn /  2 k  1. (5.45) n!1

This result is mentioned in [10, p. 96]. To the best of my knowledge this result was never published by its authors. Nilli [341] obtained (and published) a more precise estimate. See [110, Sections 1.3–1.4] for a presentation of Theorem 5.100 and related estimates.

5.17.2 Graphs with very large spectral gaps Graphs attaining the bound (5.45) in a precise (non-asymptotic) way and for absolute values of all eigenvalues except the first one have a special name:

188

Chapter 5 Families of expanders and of graphs with large girth

Definition 5.101. A finite, connected, k-regular graph G is called a Ramanujan graph if for every eigenvalue j .G/, except 1.G/, the inequality p j j .G/j  2 k  1 holds. The existence of families of Ramanujan graphs with indefinitely growing orders (numbers of vertices) was proved independently and almost simultaneously by Lubotzky–Phillips–Sarnak [302] and Margulis [310]. The families of expanders constructed in these papers are Cayley graphs and have very large girth. More precisely, their girths satisfy   4 C o.1/ logk1 jV .Gn /j. g.Gn /  3 These constructions of Ramanujan graphs are explained and partially presented in an accessible text [110, Section 4.3], see also [299]. All the known families of Ramanujan graphs of indefinitely growing orders have essentially large girth, that is, for all L 2 N, we have lim

n!1

cL .Gn / D 0, jV .Gn /j

where cL .Gn / denotes the number of cycles of length L in Gn . Recently, it was proved [1] that this is not a coincidence. Theorem 5.102 ([1]). Let be a group generated by a finite symmetric set S and let ¹Hnº1 nD1 be a sequence of subgroups of finite index with indices ! 1 as n ! 1, and such that the Schreier graphs Sch. =Hn , S/ are Ramanujan for all n. Then the sequence ¹Sch. =Hn , S/º1 nD1 has essentially large girth.

5.17.3 Some more results and constructions 

There are many other recent constructions of expander families and families of graphs with growing girth: [78], [81], [157], [245]. One can find an overview of some of these constructions in [301] and [423]. The notes of Tao [423] contain a nice presentation of some of them.



Ajtai [7] presented an algorithm which in n3 .log n/3 constructs a 3-regular graph with “large” expanding constant on n vertices. In each step of the algorithm one pair of edges in the graph is replaced by another pair of edges so that the total number of cycles of length bc log nc decreases (for some fixed absolute constant c). The graph has a “large” expanding constant when a local minimum in the number of cycles of length bc log nc is reached.

189

Section 5.18 Hints to exercises 

The papers [158] and [413] contain results on connections between girth and expansion.



Lubotzky [298] constructed a sequence of graphs with indefinitely growing girths using the theory of Coxeter groups.



The paper [419] is a relatively recent contribution to the rather old study of the set of possible cycle lengths in a graph of average degree d and girth g.

5.18 Hints to exercises To Exercise 5.81. The corresponding matrix can be written as a sum of two permutation matrices. If we consider the matrices in the corresponding complex space Cn , each of them is diagonalizable in the basis consisting of the following vectors ¹e

2ikm n

ºnmD1 ,

k D 1, : : : , n.

See [297, Problem 11.1] for a detailed solution and related problems. To Exercise 5.82. Consider the basis consisting of the Walsh functions. See [297, Problem 11.9] for a detailed solution. To Exercise 5.83. First use the definition of the Cheeger constant to show that there is a number ˇ > 1 such that jB.v, r /j  min¹ˇ r ,

jV .G/j º. 2

Then use this fact to estimate the diameter of V .G/. Finally, combine the steps to get the desired estimate, possibly, with ˛ somewhat smaller than ˇ from the first step. To Exercise 5.84. We may assume that Lip.f / D 1 and M D 0. The set of vertices satisfying (5.42) with t > 1 consists of two parts: where f .v/  t and where f .v/  t . Use the definition of the Cheeger constant to show that cardinalities of both sequences of sets are exponentially decaying as t D 1, 2, 3, : : : . To Exercise 5.85. Consider products of the groups Fn with the group F2 consisting of two elements. To Exercise 5.86. Combine Proposition 5.13 with the result of Exercise 5.83 and the study of the growth in cardinality of products Sn  Sn      Sn . To Exercise 5.87. Use Euler circuits.

190

Chapter 5 Families of expanders and of graphs with large girth

To Exercise 5.88. Allow graphs to have loops (assuming that each loop contributes 2 to the degree of the vertex it is incident to) and use the backward induction of the number of loops in a 2k-regular graph with n vertices. To prove the result for graphs with fewer loops do the following: let uv and vw be two adjacent edges in G. Consider the graph G 0 obtained from G by deletion of uv and vw and addition of the edge uw and the loop at v. Use the result of Exercise 5.87 for 4-regular graphs. To Exercise 5.89. 2-regular graphs are unions of cycles. Consider the permutations of V .G/ corresponding to these unions of cycles. To Exercise 5.90. You are expected to use Proposition 5.4(c), the existence of expander families, their spectral characterization and the fact that they can be chosen to be bipartite. To Exercise 5.91. Use the fact that the permutation of ¹1, : : : , nº, which transposes the first two elements, and the cyclic permutation, which acts as 1 7! 2, 2 7! 3, . . . , .n  1/ 7! n, n 7! 1, generate the group of all permutations of ¹1, : : : , nº. It is also useful to compute the square of the first matrix. To Exercise 5.95. Try to follow the argument of Theorem 5.67, starting with a family of disjoint cycles with prescribed lengths and a large cycle.

Chapter 6

Banach spaces which do not admit uniformly coarse embeddings of expanders Since each n-element metric space is isometric to a subset of `n1 (Proposition 1.17) and each Banach space without nontrivial cotype admits, by the Maurey–Pisier theorem (Theorem 2.56), .1 C "/-bilipschitz embeddings of ¹`n1 º1 nD1 , we get that any family of finite metric spaces and in particular, any family of expanders, admits uniformly bilipschitz embeddings into an arbitrary Banach space without nontrivial cotype. It turns out that families of expanders are resistant to uniformly coarse and uniformly bilipschitz embeddings into all other Banach spaces (“resistant” in the sense that no such embeddings are known so far). So far there are no examples of a Banach space X with nontrivial cotype for which there is a family ¹Gn º1 nD1 of expanders admitting uniformly coarse embeddings into X, so it is still possible that such Banach spaces X do not exist. Such a negative answer is not only known for uniformly coarse, but also for uniformly bilipschitz embeddings. In Chapter 11 we list some more specific problems of this type. The purpose of this chapter is to present known non-embeddability results for families of expanders as well as to describe some classes of Banach spaces and some particular Banach spaces for which the problem of embeddability of families of expanders into them is open. It is known that wide classes of Banach spaces admit no uniformly coarse embeddings of families of expanders into them. For even wider classes of Banach spaces it is known that there are no uniformly bilipschitz embeddings of expanders into them, we discuss such results for uniformly convex Banach spaces in Chapter 8. There is also a wide class of Banach spaces for which it is known that there are families of expanders which are not uniformly coarsely embeddable into them. For bilipschitz embeddings the last version of the problem is relatively easy and a complete answer is known: there exist families ¹Gnº of expanders which admit uniformly bilipschitz embeddings into a Banach space X if and only if X has no nontrivial cotype. The idea of constructions of such families of expanders is the following: to construct expanders containing uniformly bilipschitz images of nets of balls of `n1 (See Exercise 10.21 in Chapter 10).

192

Chapter 6 Coarse embeddings of expanders

6.1 Banach spaces whose balls admit uniform embeddings into L1 It turns out that Theorem 4.9, stating that families of expanders do not admit uniformly coarse embeddings into L1, can be transferred to any Banach space whose unit ball admits a uniform embedding into L1 . The unit ball of a Banach space X is the set BX D ¹x 2 X : kxk  1º. We use the term “uniform embedding” in the way in which it was introduced in Definition 1.36. Theorem 6.1. If a Banach space X is such that BX admits a uniform embedding into L1 and ¹Gn º is a family of expanders with their graph distances, then ¹Gn º are not uniformly coarsely embeddable into X. Proof. Let G be a k-regular graph and h be its Cheeger constant (saying h is a Cheeger constant of G we mean h.G/  h). We are going to prove Theorem 6.1 on the same lines as the similar result for L1 (Theorem 4.9). We are going to present our proof in such a way that it can be read independently, using two versions of Poincaré type inequalities for L1 proved in Section 4.2. The main step is to prove that for each (finite) k-regular graph with Cheeger constant h and each mapping f : V .G/ ! X there is a constant D D D.k, h, X/ such that for some x 2 X j¹s 2 V .G/ : kf .s/  xkX  D Lip.f /ºj 

3jV .G/j . 4

(6.1)

Let D > 0 be the least number satisfying 9x 2 X j¹s 2 V .G/ : kf .s/  xk  D Lip.f /ºj 

3jV .G/j . 4

(6.2)

At this point we allow the number D to depend on the graph G and on the embedding f . (We need to modify the argument slightly if the infimum is not attained.) We are going to find an inequality for D in terms of k, h, !' and !' 1 ; where ' : BX ! L1 is a uniform embedding, and !' and !' 1 are the moduli of continuity of ' and ' 1 . The inequality which we get leads to an estimate of the desired type. Lemma 6.2. Let G be a k-regular graph with Cheeger constant h. Let f be a map of the vertex set of G into a Banach space X, x 2 X. Then j¹s 2 V .G/ : kf .s/  xk  D1 Lip.f /ºj  implies

 j¹s 2 V .G/ : kf .s/  xk 

jV .G/j , 2

 3jV .G/j 2k C D1 Lip.f /ºj  . h 4

(6.3)

(6.4)

Section 6.1 Banach spaces whose balls admit uniform embeddings into L1

193

Proof. It is clearly enough to consider functions f having Lipschitz constant 1. Consider the function r .s/ D kf .s/  xk. It is also 1-Lipschitz. The inequality (6.3) implies that its median m satisfies m  D1. Applying Lemma 4.8 we get X

jkf .s/  xk  mj 

s2V .G/

kjV .G/j . 2h

Therefore

ˇ® ¯ˇ ˇ s 2 V .G/ : jkf .s/  xk  mj  2k ˇ  3jV .G/j , h 4 and we get (6.4). In the proof of the theorem it also suffices to consider the case where Lip.f / D 1. We map the smallest ball B.x, R/ (we define balls by B.x, R/ D ¹z 2 X : kz  xk  Rº) containing f .V .G// into L1 in the following way: 

First we project all points of f .V .G// which are not in B.x, D/ onto the sphere S.x, D/ (where S.x, D/ D ¹z 2 X : kz  xk D Dº). We mean the following projection P : 8 0, P 1 ˛ D 1 we have i DnC1 i  X 1 X   n  ˛i xi  ˛i xi    i D1

and

Pn

i D1 ˛i

D 1, and

D1

1

i DnC1

  n 1 X  X  ˛i xi  ˛i xi   D 2.  i D1

v

i DnC1

Hence, by (6.25) and (6.26) we get 1 ..1  /q/

1 q

 kxi k.v1 ,`1 /,q 

2 1

..1  /q/ q

and 1 ..1  /q/ ° Therefore

1 q

 X 1 X   n   ˛i xi  ˛i xi   i D1

1

..1/q/ q 2

xi

±1 i D1

i DnC1

.v1 ,`1 /,q



2 1

.

..1  /q/ q

satisfies the condition (e) in .v1 , `1 /,q with  (pa-

rameter in the condition (e), but not in .v1 , `1 /,q ) equal to 12 . By the implication (e))(a) of Theorem 6.11, the space .v1 , `1 /,q is nonreflexive. Our next goal is to get a more explicit formula for the norm of the space .v1 , `1 /,q . Now our purpose is to look for quantities which are equivalent to K t .x/ and are more convenient computationally. For x 2 `1 let k X Vk .x/ D sup jxnj  xnj 1 j, j D1

where the supremum runs over all increasing sequences n0  n1      nk of integers. As we shall see this functional can be used to get a good approximation of K t .x/ for t D k.

212

Chapter 6 Coarse embeddings of expanders

Let n0 D 0 n1 D inf¹n : n > n0 & jxn  xn0 j > Vk .x/=kº n2 D inf¹n : n > n1 & jxn  xn1 j > Vk .x/=kº ::: nk D inf¹n : n > nk1 & jxn  xnk1 j > Vk .x/=kº.

(6.27)

Then nk D 1. In fact, otherwise we get k X

jxnj  xnj 1 j > Vk .x/ D

j D1

sup

k X

n0 n1 nk j D1 nj 2N

jxnj  xnj 1 j,

a contradiction. We consider y 2 v1 (actually, an eventually constant sequence): 8 ˆ if i < n1 xn0 ˆ ˆ ˆ ˆ ˆ if n1  i < n2 0 and the decomposition x D xv C x` is arbitrary, we get the inequality Kk .x/  Vk .x/=2 from here. The inequality Kk .x/  jx0 j follows immediately from the definitions .k  1/. We get the left-hand side inequality in (6.29). Now we are going to present an equivalent formula for the norm in .v1 , `1 /,q which does not use integrals. In order to do this we need to recall the discrete version of the K-method of real interpolation. Denote by ,q the space of sequences ¹ j ºj1D1 satisfying 1 X .2j j j j/q < 1 j D1

with the norm k¹ j ºk ,q D

 X 1

1=q .2

j

q

j j j/

< 1.

j D1

(The space ,q is just a weighted `q -space.) N let ˛j D K.2j , x, A/. N The vector x belongs to AN,q if Lemma 6.26. For x 2 †.A/ and only if the sequence ¹˛j ºj1D1 is in ,q . Furthermore, the inequality 1

1

2 .ln 2/ q k¹˛j ºk ,q  kxk,q  2.ln 2/ q k¹˛j ºk ,q holds.

(6.31)

214

Chapter 6 Coarse embeddings of expanders

Proof. We have kxk,q D

 X 1 Z

2j C1

.t  K.t , x//q

j j D1 2

dt t

1=q .

Let t 2 Œ2j , 2j C1. The definition of the K-functional immediately implies K.2j , x/  K.t , x/  K.2j C1, x/  2K.2j , x/, and hence Since

R 2j C1 2j

2 2j ˛j  t  K.t , x/  2  2j ˛j . dt t

(6.32)

D ln 2, integrating and summing the inequality (6.32), we get (6.31).

By Lemma 6.26 we get that x 2 `1 is in the space .v1 , `1 /,q if and only if the sequence ¹K.2j , x/ºj1D1 is in the space ,q and  ° ±1   j   kxk.v1,`1 /,q  K.2 , x/ . (6.33)  j D1 ,q

We need the following observation on the case j  0. By the definition of the Kfunctional and kxk1  kxkv , we get that in this case K.2j , x/ D 2j kxk1 . Therefore  X 0

1=q .2

j

j

q

K.2 , x//

D

1=q

 X 0

j D1

.2

j j

q

2 kxk1 /

j D1

(6.34)

kxk1 D . .1  2q.1//1=q For j  1, by (6.29), we have ² X 1=q 1=q ³ X 1 1 1 max .2j V2j .x//q , jx0 j 2jq 2 j D1 j D1 X 1=q 1  .2j K.2j , x//q

2

j D1 X 1

1=q .2

j

q

V2j .x//

(6.35)

1=q X 1 jq C jx0 j 2 .

j D1

j D1

We can combine (6.34) and (6.35), and write c.k¹V2j .x/ºj1D1kC C kxk1 /  kxk.v1,`1 /,q  C.k¹V2j .x/ºj1D1 kC C kxk1/,

(6.36)

215

Section 6.6 Exercises

where k  kC is the “positive” part of k  k , that is, k¹ j ºj1D1kC

D

X 1

1=q .2

j

q

j j j/

.

j D1

The formula for the norm in (6.36) is reasonably simple, but it still resists the analysis for the expander embeddability problem.

6.6 Exercises Exercise 6.27. Let X be a Banach space and BX be its unit ball. Consider the following map P : X ! BX : ´ x if x 2 BX P .x/ D x if x … BX . kxk Show that Lip.P /  2. There is a hint to this exercise. Exercise 6.28. Verify the statement made in Observation (b) on page 198. Exercise 6.29. A Banach space X is called uniformly non-square if there is a number ı < 1 such that for any pair x, y in the unit ball we have either k21.x C y/k  ı or k21.x y/k  ı. Consider the operator W `22 ! `22 given by .x, y/ D .21 .x Cy/, 21 .x  y//. Evaluate the norm of this operator. Use the result to show that a fully curved Banach space (see Definition 6.5 on page 198) is uniformly non-square. Exercise 6.30. Show that a curved Banach space (see Definition 6.5 on page 198) has nontrivial type. There is a hint to this exercise. Exercise 6.31. Show that the spaces `p , 1  p < 1, are stable (see Definition 6.13 on page 204).

6.7 Notes and remarks The main result of Section 6.1 is due to Ozawa [363]. There are surprisingly many Banach spaces satisfying the condition of Theorem 6.1. Mazur [320] proved that unit balls of Lp , 1  p < 1 and `p , 1  p < 1 are uniformly homeomorphic to each other. A very important achievement in this direction is due to Odell and Schlumprecht [346], who proved that unit balls of spaces with unconditional bases and nontrivial cotype are uniformly homeomorphic to the unit ball of a separable Hilbert space (see

216

Chapter 6 Coarse embeddings of expanders

also [55, Chapter 9]). Chaatit [91] proved a generalization of this result for Banach lattices. Thus we conclude that Banach spaces with unconditional bases and nontrivial cotype, or, more generally, separable Banach lattices with nontrivial cotype do not admit uniformly coarse embeddings of families of expanders. The paper [91] contains also nonseparable generalizations of the Odell–Schlumprecht result. Section 6.2 is based on [382], where Lemma 6.3 is attributed to V. Lafforgue. Complex interpolation for families of Banach spaces, which is introduced in Definition 6.8, is due to Coifman, Cwikel, Rochberg, Sagher, and Weiss [102] (in an equivalent form). It is worth mentioning that Pisier [382, §9] gives a characterization of uniformly curved spaces in terms similar to the ones used in the real method of interpolation (see [56, Chapter 3] for real method of interpolation). This characterization is rather technical and we do not state it here. The results of Exercises 6.29 and 6.30 are taken from [382]. We refer to [382, Chapter 2] for more discussion on the notions of (uniformly/fully) curved Banach spaces and related open problems. Here we would like to present the proof (given in [382, p. 15]) of the fact that uniformly curved Banach spaces are isomorphic to uniformly convex Banach spaces. Fix n  1. Let .n/ D ¹ .n/i ,j ºni,j D1 be the Hilbert matrix defined by ´ .n  .i C j //1 if n  .i C j / 6D 0,

.n/i ,j D 0 otherwise. It is well known that the operator .n/ satisfies k .n/`n2 !`n2 k  C , k .n/`n1 !`n1 k  C log.n C 1/ and k .n/`n1 !`n1 k  C log.n C 1/ for some constant C (independent of n), see [427]. Thus, for any Banach space X, we have k .n/X k  C log.n C 1/ X ..log.n C 1//1 /. Therefore, if X is uniformly curved, then k .n/X k D o.log.n//. By an observation made in [377], this implies that X is isomorphic to a uniformly convex space. Section 6.3 presents a characterization of reflexivity which goes back to [386]. It was further developed by [412], [368], [212, see Theorem 8 (35)], and [331]. It was presented also in some texts, see [50, Theorem 6, p. 51]. Helly’s condition is from [201, p. 73]. The notion of a stable space is due to Krivine and Maurey [256], they also proved that Lp is stable for each p < 1. Useful expositions of results on stable Banach spaces can be found in [51], [159], and [187]. Results of Section 6.4.2 are due to Raynaud [390]. It is worth mentioning that up to 1969 it was an open problem, suggested by Yu. M. Smirnov: does there exist a separable metric space which is not uniformly homeomorphic to a subset of L2? Smirnov’s problem was solved in the positive by Enflo [134] who proved that the Banach space c0 cannot be embedded uniformly into a Hilbert space. Examples of nonreflexive Banach spaces which do not contain ¹`n1 º1 nD1 uniformly are difficult to construct, for some time (1964–1974) their existence was an open prob-

217

Section 6.8 Hints to exercises

lem and there was a series of publications devoted to it. The first example was constructed by James [215]. See [217], [216], and [384] for more constructions of such spaces. See [144] for an exposition of the James–Lindenstrauss construction and [383] for a detailed exposition of the Pisier–Xu construction. As we have already mentioned, Theorem 2.56 (in combination with Exercise 2.69) implies that a Banach space contains ¹`n1 º1 nD1 uniformly if and only if it has no nontrivial type. The construction of a nonreflexive space with nontrivial type presented in Section 6.5 is due to Pisier and Xu [384]. A detailed presentation of the construction and the proof of nontrivial type of .v1 , `1 /,q are given in [383]. The simplified formula for the equivalent norm is derived using the ideas which are present in [384], some of them go back to [57]. Papers [112], [113], [145] contain related studies on degrees of non-reflexivity. The standard reference on the real interpolation method is [56, Chapters 2 and 3], which in its presentation of the method borrows from [364]. [383] features an introduction to the real interpolation method while [86] is the most comprehensive advanced monograph on the real interpolation method.

6.8 Hints to exercises To Exercise 6.27. To estimate kP x1  P x2 k in terms of kx1  x2k first consider the trivial case kx1k D kx2 k. Then, assume that kx1k  kx2k and kx2k > 1 (other cases are either similar or trivial). Let ³ ² kx1k 1 x3 D max ,  x2 . kx2k kx2k Complete the estimate using the observations: 

P x3 D P x2 .



kP x1  P x3 k is easy to estimate in terms of kx3  x1k, because it is reduced to one of the trivial cases.



kx3  x1 k  2kx2  x1 k.

To Exercise 6.30. Since X is curved, there is a value of n 2 N satisfying X .2n=2 / < 1. Consider the operator

œ

Tn :D ˝ ˝    ˝ n times n `22

`22

`22 ,

on D ˝  ˝ where is the operator introduced in Exercise 6.29. Use the fact that k.Tn /X k  X .2n=2 / < 1 to show that `1 (as a Banach space over R) is not finitely representable in X. Use Theorem 2.56.

Chapter 7

Structure properties of spaces which are not coarsely embeddable into a Hilbert space 7.1 Expander-like structures implying coarse non-embeddability into L1 There are several known obstructions for coarse embeddability of one metric space into another. But only one such obstruction is known for embeddings of spaces with bounded geometry into a Hilbert space (or any other infinite-dimensional Banach space, see Theorem 2.7). We mean Theorem 4.9 and its corollary, that is: Families of expanders are not uniformly coarsely embeddable into L1 or into a Hilbert space. An easy consequence of this result and the fact that a composition of two coarse embeddings is a coarse embedding is: Theorem 7.1. If a metric space X admits uniformly coarse embeddings of a family of expanders into it, then X is not coarsely embeddable into a Hilbert space. As we have already mentioned, in the case where X is a space of bounded geometry no other obstructions for coarse embeddability into a Hilbert space are known (see Problem 11.9 and discussion after it in this connection). The main results of this chapter were proved in attempts to find expander-like structures in metric spaces which do not admit coarse embeddings into a Hilbert space. The following property of metric spaces with bounded geometry is the main candidate for being equivalent to coarse non-embeddability into `2 . Definition 7.2. Let X be a space with bounded geometry and ¹Yn º1 nD1 be a family of expanders. We say that X weakly contains ¹Ynº if there are maps fn : Yn ! X satisfying (with some abuse of notation we use Yn to denote the vertex set of Yn ) (a) Lipschitz constants Lip.fn / are uniformly bounded jfn1 .fn .y//j D 0. n!1 y2Yn jYn j The images of Yn in X are called weak expanders.

(b)

lim max

Observation 7.3. If a sequence ¹Yn º1 nD1 is uniformly coarsely embeddable into a metric space X with bounded geometry, then X weakly contains ¹Ynº. Proof. Let fn : Yn ! X be uniformly coarse embeddings. We are going to check that ¹fnº satisfy the conditions of Definition 7.2.

Section 7.1 Expander-like structures implying coarse non-embeddability into L1

219

It is an easy observation (see Remark 1.50 on page 21) that uniformly coarse embeddings of (unweighted) graphs with their graph distances are Lipschitz maps with uniformly bounded Lipschitz constants. To show that the second condition is satisfied we prove that the cardinalities jfn1 .fn .y//j are uniformly bounded. To this end we observe that otherwise for each r > 0 there are n 2 N and y 2 Yn such that jfn1 .fn .y//j > M.r / (the function M.r / is from the definition of bounded geometry, see Definition 1.66 on page 31). Therefore for each r > 0 there are n D n.r / and u D u.r , n/, v D v.r , n/ 2 Yn satisfying fn .u/ D fn .v/ and dYn .u, v/ > r . This contradicts the assumption that ¹fn º are uniformly coarse embeddings. Example 7.4. For an arbitrary family ¹Ynº of expanders there is a metric space X with bounded geometry and Lipschitz maps fn : Yn ! X which embed ¹Yn º weakly into X, but the embeddings ¹fn º are not uniformly coarse. Proof. Let ¹Ynº1 nD1 be an arbitrary family of expanders. Let un , vn 2 V .Yn / be such that dYn .un , vn / D diamYn . It is not difficult to see that limn!1 diamYn D 1. (In fact, since Yn are k-regular, for each u 2 V .Yn / and r 2 N the number of vertices at distance  r from u does not exceed 1 C k C .k  1/k C    C .k  1/r 1 k. As limn!1 jV .Yn /j D 1, the conclusion follows.) Let Zn be the graph obtained from Yn if we identify un and vn . We mean the standard modification of a graph consisting in replacing un and vn by a single vertex incident to all the edges which were incident in Yn to either un or vn . It is clear that ¹Zn ºn are graphs with uniformly bounded degrees and therefore it is easy to form a bounded geometry metric space X containing isometric copies of all of ¹Zn º (one of the ways of constructing such X is described after Corollary 4.10). It is clear that the natural embeddings fn : Yn ! X, formed by identifying un and vn first, and then applying an isometric embedding into X, weakly embed ¹Yn º into X. On the other hand, these embeddings are not uniformly coarse because dYn .un , vn / ! 1 but dX .f .un /, f .vn // D 0. Observation 7.5. If X is a metric space with bounded geometry weakly containing a family ¹Ynº of expanders, then X is not coarsely embeddable into L1. Proof. This proof is similar to the proof of Corollary 4.10. Let fn : Yn ! X be maps showing that X weakly contains ¹Yn º (see Definition 7.2). Assume that there is a coarse embedding f : X ! L1. Then there are functions 1 and 2 such that 8n 2 N 8u, v 2 Yn and lim t !1 1.t / D 1.

1.dX .fn .u/, fn .v///  kf .fn .u//  f .fn .v//k  2 .dX .fn .u/, fn .v///.

(7.1)

220

Chapter 7 Spaces not coarsely embeddable into a Hilbert space

We apply the inequality (4.6) to 'n .u/ D f .fn .u// and get X

au,v k'n .u/  'n .v/k 

u,v2V .Yn /

X u,v2V .Yn /

h k'n .u/  'n .v/k. jV .Yn /j

(7.2)

Since Yn are k-regular, we use (7.1) and get that the left-hand side is  kjV .Yn /j  2.supn Lip.fn //. The right-hand side can be estimated from below by X u,v2V .Yn /

h 1 .dX .fn .u/, fn .v///. jV .Yn /j

We estimate this sum from below using the following argument. We claim that for each 0 < C < 1 we can find n 2 N such that for each u 2 V .Yn / the number of vertices v 2 V .Yn/ satisfying dX .fn .u/, fn .v//  C is at least jV .Yn /j=2. In fact, the number of elements x 2 X satisfying dX .fn .u/, x/  C does not exceed MX .C / (where MX ./ is the function from the definition of bounded geometry, see Definition 1.66). Therefore the number of elements v 2 V .Yn / satisfying dX .fn .u/, fn .v//  C does not exceed MX .C /  maxx2fn .Yn / jfn1 .x/j. Condition (b) of Definition 7.2 (of weak containment of expanders) implies that MX .C /  maxx2fn.Yn / jfn1 .x/j D 0. n!1 jV .Yn /j lim

Therefore the desired n exists. For such C and n we get, by inequalities (7.1), (7.2), and the argument immediately after them, that: h jV .Yn /j2   1.C /  kjV .Yn /j2 .sup Lip.fn //. 2 jV .Yn /j n Thus

2k 2 .sup Lip.fn //. h n Since C can be chosen to be arbitrarily large, this leads to a contradiction. 1 .C / 

7.2 On the structure of locally finite spaces which do not admit coarse embeddings into a Hilbert space At the moment it is unknown whether each metric space with bounded geometry which does not admit a coarse embedding into a Hilbert space contains weak expanders. In an attempt to prove this the following result was obtained. Recall (see Observation 4.12) that coarse embeddability into a (separable infinitedimensional) Hilbert space is equivalent to coarse embeddability into L1.0, 1/.

Section 7.2 Condition for locally finite spaces

221

Theorem 7.6. Let .M , dM / be a locally finite metric space which is not coarsely embeddable into L1. Then there exists a constant D, depending on M only, such that for each n 2 N there exists a finite set Bn  M  M and a probability measure  on Bn such that 

dM .u, v/  n for each .u, v/ 2 Bn .



For each Lipschitz function f : M ! L1 we have Z kf .u/  f .v/kL1 d.u, v/  DLip.f /.

(7.3)

Bn

Lemma 7.7. Let .M , dM / be a locally finite metric space which is not coarsely embeddable into L1. Then there exists a constant C depending on M only such that for each Lipschitz function f : M ! L1 there exists a subset Bf  M  M such that sup.x,y/2Bf dM .x, y/ D 1, but sup.x,y/2Bf kf .x/  f .y/kL1  C Lip.f /. Proof. Assume the contrary. Then, for each n 2 N, the number n3 cannot serve as C . This means, that for each n 2 N there exists a Lipschitz map fn : M ! L1 such that for each subset U  M  M with sup dM .x, y/ D 1, .x,y/2U

we have sup kfn .x/  fn .y/k > n3 Lip.fn /. .x,y/2U

We choose a point in M and denote it by O. Without loss of generality we may assume that fn .O/ D 0. Consider the mapping f : M ! .˚1 L / given by kD1 1 1   1 fk .x/ ,  f .x/ D ˚1 kD1 Kk 2 Lip.fk / P 1 where K D 1 kD1 k 2 (`1 direct sums were defined on p. 82). It is clear that the series converges and Lip.f /  1. Considering  L1 .a,b/ for infinitely many disjoint subinL admits an isometric embedding into tervals of Œ0, 1 it is easy to show that ˚1 kD1 1 1 L1.0, 1/. Thus f may be regarded as a mapping into L1 .0, 1/. Let us show that f is a coarse embedding. We need an estimate from below only (the estimate from above is satisfied because f is Lipschitz). The assumption implies that for each n 2 N there is N 2 N such that dM .x, y/  N ) kfn .x/  fn .y/k > n3 Lip.fn /. On the other hand, kfn .x/  fn .y/k > n3 Lip.fn / implies kf .x/  f .y/k D

1 X kD1

1 n kfk .x/  fk .y/k kfn .x/  fn .y/k 1  > .   2 2 Kk Lip.fk / Kn Lip.fn / K

Hence f : M ! L1 is a coarse embedding and we get a contradiction.

222

Chapter 7 Spaces not coarsely embeddable into a Hilbert space

Lemma 7.8. Let C be the constant whose existence is proved in Lemma 7.7, and let " > 0 be arbitrary. Then for each n 2 N there is a finite subset Mn  M such that for each Lipschitz mapping f : M ! L1 there is a pair .uf ,n , vf ,n / 2 Mn  Mn such that 

dM .uf ,n , vf ,n /  n.



kf .uf ,n /  f .vf ,n /k  .C C "/Lip.f /.

Proof. The ball in the space M of radius R centered at O will be denoted by B.R/. It is clear that it suffices to prove the result for 1-Lipschitz mappings satisfying f .O/ D 0. Assume the contrary. Since M is locally finite, this implies that for each R 2 N there is a 1-Lipschitz mapping fR : M ! L1 such that fR .O/ D 0 and, for u, v 2 B.R/, the inequality dM .u, v/  n implies kfR .u/  fR .v/kL1 > C C ". ¹fR º1 RD1 , that is, a mapping f : M ! QWe form an ultraproduct of the mappings , where U is a free ultrafilter on N and .Q L1/U , given by f .m/ D ¹fR .m/º1 RD1 . L1/U is the corresponding ultraproduct. It is well known that an ultraproduct of L1 spaces is isometric to an L1 space on some measure space (see Notes and Remarks for references). It is known (see Fact 1.20) that separable subspaces of an L1 space on any measure space are isometric to subspaces of L1.0, 1/. Therefore we may consider f as a mapping into L1.0, 1/. It is easy to verify that Lip.f /  1 and that f satisfies the condition dM .u, v/  n ) kf .u/  f .v/kL1  .C C "/. We get a contradiction with the definition of C . Proof of the Theorem 7.6. Let D be a number satisfying D > C , and let B be a number satisfying C < B < D. By Lemma 7.8, there is a finite subset Mn  M such that for each 1-Lipschitz function f on M there is a pair .u, v/ in Mn such that dM .u, v/  n and kf .u/  f .v/k  B. We choose a point in Mn and denote it by O (it should be different from the point O chosen above if that point is not in Mn , otherwise we can choose the same point). Proving the theorem it is enough to consider 1-Lipschitz functions f : Mn ! L1 satisfying f .O/ D 0. As we know (Lemma 1.24) each subset of L1 of cardinality s is isometric to a subset of `k1 for k  sŠ. (Actually much better estimates for k are known, see Notes and Remarks to Chapter 1, but we do not need them here.) Therefore it suffices to prove the result for 1-Lipschitz functions Mn 7! `k1 satisfying f .O/ D 0. It is easy to see that this set of functions is compact with respect to the metric .f , g/ D max kf .m/  g.m/k. m2Mn

Clearly it suffices to prove the inequality Z kf .u/  f .v/kd.u, v/  B Bn

223

Section 7.3 Expansion properties

  -net in the set of all functions satisfying the conditions mentioned above, for a DB 2 endowed with the metric . By compactness there exists a finite net satisfying the condition. Let N be such a net. We are going to use the von Neumann minimax theorem. Let A be the matrix whose columns are labeled by functions belonging to N , whose rows are labeled by pairs .u, v/ of elements of Mn satisfying dM .u, v/  n, and whose entry on the intersection of the column corresponding to f , and the row corresponding to .u, v/ is kf .u/  f .v/k. P Then, for each column vector x D ¹xf ºf 2N with xf  0 and f 2N xf D 1, the entries of the product Ax are the differences kF .u/  F .v/k, where   F : M ! ˚f 2N `k1 1 is given by F .m/ D ˚f 2N xf f .m/. It is easy to show that the space .˚f 2N `k1 /1 kjN j is isometric to `1 and thus admits a linear isometric embedding into L1. Therefore the function F can be considered as a function into L1. It satisfies Lip.F /  1. Hence there is a pair .u, v/ in Mn satisfying dM .u, v/  n and kF .u/  F .v/k  B. Therefore we have max min Ax  B, x



where the minimum is taken over all vectors  D ¹.u, v/º, P indexed by u, v 2 Mn , dM .u, v/  n, and satisfying the conditions .u, v/  0 and .u, v/ D 1. By the von Neumann minimax theorem we have min max Ax  B, 

x

which is exactly the inequality we need to prove because  can be regarded as a probability measure on the set of pairs from Mn with distance  n.

7.3 Expansion properties of metric spaces not admitting a coarse embedding into a Hilbert space Our next purpose is to find some expansion properties of the sets Mn whose existence is proved in Theorem 7.6. Let s be a positive integer. We consider graphs G.n, s/ D .Mn , E.Mn , s//, where the edge set E.Mn , s/ is obtained by joining those pairs of vertices of Mn which are at distance  s. The graphs ¹G.n, s/º1 nD1 have uniformly bounded degrees if the metric space M has bounded geometry. We need to recall some more definitions of graph theory. A component of a graph G is a maximal subset D  V .G/ satisfying the condition: for each u, v in D there is a uv-walk in G. A vertex cut C in G is a set of vertices of G whose deletion from G

224

Chapter 7 Spaces not coarsely embeddable into a Hilbert space

(together with all edges incident with vertices in C ) increases the number of components. If two vertices are in the same component of G, but in different components of G  C (the graph obtained after the deletion of C ), we say that they are separated by the vertex cut C . Observation 7.9. If vertices x and y in G.n, s/ can be separated by a vertex cut, then dM .x, y/ > s. Informally: each vertex cut of G.n, s/ separates it into pieces with dM -distance between them at least s. It is clear that for fixed s the identical embeddings on the graphs G.n, s/ with their graph metrics into X have uniformly bounded Lipschitz constants and are bijective. For this reason if we could prove that for X having bounded geometry, the graphs G.n, s/ contain weak expanders, this would provide a characterization of spaces with bounded geometry which do not admit coarse embeddings into a Hilbert space in terms of weak containment of expanders. (Since G.n, s/ is a family of metric spaces rather than a metric space, we need to extend our definitions slightly: saying G.n, s/ contain weak expanders, we mean that there is a family of expanders ¹Ym º1 mD1 , a sequence of positive integers, and embeddings f : Y ! G.n.m/, s/ satisfying ¹n.m/º1 m m mD1 the conditions of Definition 7.2.) At the moment we are able to prove only the following weaker expansion property of the graphs G.n, s/. We denote the measure  whose existence was proved in Theorem 7.6 by n . We introduce the measure n on Mn by n .A/ D n .A  Mn /. Let F be a subgraph of G.n, s/ and A be a set of vertices in F . The set of all vertices of F which are adjacent to vertices of A but are not in A is called the vertex boundary of A in F and is denoted by ıF A. s Theorem 7.10. Let s and n be such that 2n > s > 8D. Let '.D, s/ D 4D  2. s Then G.n, s/ contains an induced subgraph F with dM -diameter  n  2 , such that each subset A  F of dM -diameter < n  s2 satisfies the condition: n .ıF A/ > '.D, s/n .A/.

Proof of Theorem 7.10. Suppose that for some n, s 2 N satisfying 2n > s > 8D there is no such subgraph in G.n, s/. Then for each induced subgraph F in G.n, s/ of dM -diameter  n  s2 we can find a subset A  F of dM -diameter < n  s2 such that n .ıF A/  '.D, s/n .A/. We start with F1 D G.n, s/ (the definitions of Mn and n imply that the dM -diameter of Mn is  n), find a subset A1  F1 of dM -diameter < n s2 such that n .ıF1 A1 /  '.D, s/n .A1 /, and delete A1 [ıF1 A1 from G.n, s/. If the obtained graph F2 still has dM -diameter  n s2 , we find a subset A2 in it such that n .ıF2 A2 /  '.D, s/n .A2 /. We delete the subset A2 [ ıF2 A2 from F2 . We continue in an obvious way until we get a set of dM -diameter < n  s2 (this should eventually happen since Mn is finite). We denote this set Ap , where p is the number of steps in the process.

225

Section 7.3 Expansion properties

Observe that each of the sets Ai has diameter < n  s2 , and that the dM -distance between any Ai and Aj .i ¤ j / is at least s (see the observation above). p We introduce a family of 1-Lipschitz functions f on M , where  D ¹i ºi D1 2 ‚ D ¹1, 1ºp by the formula: 8

0. The numbers ai ,j (also denoted aij ) are called transition probabilities and the nn matrix A :D ¹ai ,j ºni,j D1 is called the transition matrix. Definition 8.2. The initial distribution is the set of numbers P.X0 D s1 /, : : : , P.X0 D sk /, considered as a row vector .0/ . The distribution at time t is the set of numbers P.X t D s1 /, : : : , P.X t D sk /, considered as a row vector .t / . Observation 8.3. The following equality is a straightforward consequence of our definitions: (8.2) .t / D .0/ At . (In the right-hand side we mean a multiplication of a row vector by a matrix.) Unless we would like to consider a state space with some additional structure, we assume that the state space is ¹1, : : : , kº. Definition 8.4. A Markov chain ¹X t º1 t D0 on the state space ¹1, : : : , kº with transition probabilities aij :D P.X t C1 D j jX t D i/ is called symmetric if aij D aj i .

Section 8.1 Basic definitions and results on finite Markov chains

229

Definition 8.5. Let ¹X t º1 t D0 be a Markov chain on S D ¹1, : : : , kº with a transition matrix A D ¹aij º. A probability distribution ¹i ºkiD1 on S is called reversible for the chain ¹X t º if (8.3) i aij D j aj i for all i, j 2 ¹1, : : : , kº. A Markov chain is called reversible if there exists a reversible distribution for it. A probability distribution  D ¹i ºkiD1 on S is called stationary for a Markov chain ¹X t º, or for the corresponding transition matrix A, if A D . A Markov chain is called stationary if .t / does not depend on t . Example 8.6. A typical (and very useful) example of a symmetric Markov chain: consider a k-regular connected graph G with n vertices and let S D ¹s1 , : : : , sn º be the vertex set V .G/. We consider the following random variables: X0 is distributed according to any distribution and ai ,j are defined by ´ 1 if si and sj are adjacent ai ,j D k 0 otherwise. This is clearly a symmetric Markov chain. One can easily check that this Markov chain is not stationary unless X0 has uniform distribution. Proposition 8.7. A reversible distribution for a Markov chain is stationary. P Proof. In fact, the only thing which we need to check is: kiD1 i aij D j . But this P P is immediate from (8.3), in fact (8.3) implies kiD1 i aij D kiD1 j aj i D j since Pk i D1 aj i D 1, see Definition 8.1. Observation 8.8. Condition (8.3) is equivalent to the condition: the product 2 32 3 1 : : : 0 a11 : : : a1k 6 .. . . . 7 . 7 6 . .. 4 . . .. 5 . .. 5 4 .. 0 : : : n

(8.4)

an1 : : : ann

is a symmetric matrix (where the first matrix is diagonal with 1 , : : : , n on the diagonal). Example 8.9. A very useful general example of a reversible Markov chain: consider a (not necessarily regular) finite connected graph G with k vertices and let S D ¹1, : : : , kº be the vertex set V .G/. We consider the following random variables: X0 is distributed according to the distribution  with di i D Pn

i D1 di

,

230

Chapter 8 Applications of Markov chains to embeddability problems

where di is the degree of i in G, and ai ,j are defined by ´ 1 if i and j are adjacent ai ,j D di 0 otherwise. Straightforward verification shows that  is a reversible distribution for this Markov chain.

8.2 Markov type It turns out that Markov chains are very useful for problems of bilipschitz embeddability of metric spaces, because the behavior of Markov chains in different metric spaces is in a certain sense incompatible. One of the most useful tools for distinguishing the behavior of Markov chains in different metric spaces is introduced in the following definition. Definition 8.10. Given a metric space .X, d / and p 2 Œ1, 1/, we say that X has Markov type p if there exists a constant K > 0 such that for every stationary reversible Markov chain ¹Z t º1 t D0 on ¹1, : : : , nº, every mapping f : ¹1, : : : , nº ! X and every time t 2 N, (8.5) E d.f .Z t /, f .Z0 //p  K p t E d.f .Z1 /, f .Z0 //p . (Here and throughout, we omit some parentheses and write E x p for E.x p /, etc.) The least such K is called the Markov type p constant of X, and is denoted Mp .X/. Remark 8.11. Speaking of a stationary reversible Markov chain ¹Z t º1 t D0 we will always assume that the distribution P.Z t D 1/, : : : , P.Z t D n/ is reversible. (For some reversible Markov chains there exist stationary probability distributions which are not reversible, see Exercise 8.61. Some authors avoid such situations by adding irreducibility (see Exercise 8.59) to the definition of a reversible Markov chain.) Theorem 8.12. The Hilbert space has Markov type 2 with constant 1. Proof. It is clearly enough to prove the result for L2 .0, 1/. Furthermore, it suffices to prove the result for the real line R (it passes to L2.0, 1/ by integration). Let ¹Z t º1 t D0 be a stationary reversible Markov chain on ¹1, : : : , nº, with transition matrix A D .aij /, and reversible distribution . To prove that R has the Markov type 2 with constant 1 means to show that for every x1 , : : : , xn 2 R the inequality X X i .At /ij .xi  xj /2  t i aij .xi  xj /2 (8.6) i ,j

i ,j

231

Section 8.2 Markov type

holds, where At is the t -th power of the matrix A. In fact, the stronger inequality X X i .At /ij .xi  xj /2  ƒ.t / i aij .xi  xj /2 (8.7) i ,j

i ,j

holds, where ƒ.t / is the value of the polynomial 1 C x C    C x t 1 at the second largest eigenvalue of A in the case where such an eigenvalue is positive, and 1 if the second largest eigenvalue is negative. To make this statement meaningful we need to explain why the eigenvalues of A are real. This˝property follows from the fact that A ˛ is self-adjoint with respect to the inner product ,  introduced below. To derive (8.6) from (8.7) it suffices to show that each eigenvalue of A is in the interval Œ1, 1. This statement follows from the fact that all entries of A are nonnegative and row sums are equal to 1. Therefore the absolute values of all entries of Az, where z is a column vector with coordinates z1 , : : : , zn , do not exceed max1i n jzi j, the conclusion follows. It is also clear that 1 is an eigenvalue of A and the all-ones vector is an eigenvector corresponding to the eigenvalue 1. The assumption that the considered Markov chain is reversible and  is its reversible distribution implies i .At /ij D j .At /j i . This equality can be obtained using (8.4). In fact, denoting the diagonal matrix in (8.4) by D we have D A D .D A/ , where .D A/ denotes the transposed matrix (the entries of the matrices are real numbers, so this notation is not confusing). Therefore we get D At D .D A/ At 1 D A D At 1 D A A D At 2 D :::

(8.8)

D .A /t D

D .D At / . ˝ ˛ ˝ ˛ P We introduce the inner product x, y on Rn by x, y D niD1 i xi yi . By (8.8) matrices in both sides of (8.7) are symmetric, using this symmetry and expanding the squares, we get that (8.7) is equivalent to ˛ ˝ ˛ ˝ (8.9) .I  At /x, x  ƒ.t / .I  A/x, x . If x is an eigenvector, then (8.9) holds for the following reasons: in the case where the eigenvalue is 1, inequality (8.9) is satisfied because both sides are 0, for all other eigenvalues it holds because the value of ƒ.t / cannot be less than the value of 1 C x C    C x t 1 at any of the eigenvalues except 1. The general ˝ ˛ case follows by a spectral decomposition, since A is self-adjoint with respect to ,  . The following immediate corollary of Theorem 8.12 is very important.

232

Chapter 8 Applications of Markov chains to embeddability problems

Corollary 8.13. For every metric space .X, d /, the Euclidean distortion of X satisfies: c2 .X, d /  M2.X/. Proof. Let f : X ! `2 be an embedding for which the Euclidean distortion is almost attained, that is d.x, y/  kf .x/  f .y/k  d.x, y/.c2 .X, d / C "/ for every x, y 2 X. For every reversible stationary Markov chain ¹Z t º1 t D0 on X, the 1 Markov type 2 property of `2 applied to the Markov chain ¹f .Z t /º t D0 gives Ed 2 .Z t , Z0 /  E.kf .Z t /  f .Z0 /k2 /  t E.kf .Z1/  f .Z0 /k2 /  t .c2 .X, d / C "/2 Ed 2 .Z1 , Z0 /, therefore M2 .X/  c2 .X, d / C ". Since " > 0 is arbitrary the conclusion follows.

8.3 First application of Markov type to embeddability problems: Euclidean distortion of graphs with large girth Theorem 8.14. Let G be a k-regular graph with girth g. Then r k  2 jg k . c2 .G, dG /  k 2 Proof. By Corollary 8.13 it suffices to estimate M2 .G/ from below. To get such an estimate we consider the Markov chain introduced in Example 8.6, namely ¹Z t º1 t D0 is such that Z0 is uniformly distributed on V .G/ and P.Z t C1 D vjZ t D u/ equals 1=k if u and v are adjacent, and 0 otherwise. Note thatfor˘every vertex v 2 V .G/ the graph induced by the set of vertices ¹u : dG .u, v/ < g2 º is a tree (otherwise there would be a cycle of length < g). Also, it is easy to check that as long as t  bg=2c1, each step of the random walk ¹Z t º moves away from Z0 with probability at least k1 k (we say “at least” because if Z t D Z0 , then this probability is 1) and towards Z0 with probability at most k1 . In other words, as long as t  bg=2c  1, the random walk, in expectation, moves away from Z0 . To get an estimate of M2 .G/ from below, we record the corresponding numbers: for every 1  t  bg=2c  1 we have k1 1 .EdG .Z t , Z0 / C 1/ C .EdG .Z t , Z0 /  1/ k k k2 . D EdG .Z t , Z0 / C k

EdG .Z t C1, Z0 / 

Section 8.4 Renormings of superreflexive spaces, q-convexity and p-smoothness

Hence, for every t  bg=2c  1 we have 2 .Z t C1 , Z0 / EdG

 2

 .EdG .Z t C1 , Z0 // 

k2 k

233

2 .t C 1/2 .

On the other hand, by the definition of Markov type 2, 2 2 .Z t C1 , Z0 /  M2 .G/2 .t C 1/EdG .Z1 , Z0 / D M2 .G/2 .t C 1/. EdG p Therefore M2 .G/  k2 t C 1. The result follows by taking t D bg=2c  1. k

8.4 Banach space theory: renormings of superreflexive spaces, q-convexity and p-smoothness 8.4.1 Definitions and duality Now we develop some Banach space theory in order to prove generalizations of Theorem 8.14 for uniformly convex spaces. It is worth mentioning that the theory of uniformly convex and uniformly smooth spaces, and a related theory of superreflexivity are very developed and very extensive. We do not have the intention to present a detailed account of this theory. We prove only those results which we are going to use, we mention some other results to put our presentation in a suitable context. The Notes and Remarks section contains a more detailed survey of the theory and references to original sources and to monographs containing detailed presentations of other aspects of the theory. Let .X, k  k/ be a normed space. The modulus of uniform convexity of X is defined for " 2 Œ0, 2 as ³ ² kx C yk : x, y 2 X, kxk D kyk D 1, kx  yk D " .(8.10) ıX ."/ D inf 1  2 The normed space X is said to be uniformly convex if ıX ."/ > 0 for all " 2 .0, 2. Furthermore, X is said to have modulus of convexity of power type q if there exists a constant c such that ı."/  c "q for all " 2 Œ0, 2. Remark 8.15. It is easy to see that the modulus of convexity of a Banach space is equal to the infimum of moduli of convexity of its two-dimensional subspaces. An immediate consequence of this is: If a Banach space X is finitely representable in a Banach space Y , then ıX ."/  ıY ."/ for each " 2 Œ0, 1. Combining this with Dvoretzky theorem (stated on page 42) we get that ı`2 ."/  ıY ."/ for each infinite-dimensional Banach space Y . This statement is also known to be true for finite-dimensional spaces (see Notes and Remarks). Example 8.16. The modulus of convexity of a Hilbert space is easily computable. In fact, using the parallelogram identity we get, for x and y satisfying the conditions of

234

Chapter 8 Applications of Markov chains to embeddability problems

the definition of the modulus of convexity: kx C yk2 C "2 D 4. q p 2 2 Hence kx C yk D 4  " and ı`2 D 1  1  "4

"2 8.

Remark 8.17. Combining this example with Remark 8.15 we get that if a Banach space X has modulus of convexity of power type q, then q  2. Proposition 8.18. A normed space X has modulus of convexity of power type q if and only if there exists a constant K > 0 such that for every x, y 2 X 2 kykq  kx C ykq C kx  ykq . (8.11) Kq Here we proof only the “if” part, it is convenient to prove the “only if” part together with the dual statement which is presented below as Proposition 8.24. 2 kxkq C

Proof of the “if” part of Proposition 8.18. Replacing in (8.11) x by xy , we get the inequality 2  x C y q  2    x  y q  2  C q   kxkq C kykq . 2 K 2 Now, if kxk D kyk D 1 and kx  yk D ", we get from here that  x C y q " q   .   1  2 2K and so " q q1 1 " q ıX ."/  1  1   , 2K q 2K

xCy 2

and y by

(8.12)

1

where we use linear approximation and convexity of the function 1  .1  x/ q . Remark 8.19. It is clear that we can “almost reverse” this argument. However this does not complete the proof of Proposition 8.18 because we get (8.11) only in the case where kx C yk D kx  yk. Definition 8.20. A normed space X is called q-uniformly convex if there exists a constant K such that (8.11) holds. The least K for which (8.11) holds is called the qconvexity constant of X, and is denoted Kq .X/. Remark 8.21. Sometimes it is more convenient to use q-uniform convexity in the equivalent form which is slightly different from the one obtained in (8.12):  x C y q  x  y q kxkq C kykq     . (8.13)   C   2 2K 2 The following definition was introduced with the purpose to get a notion which is in a certain sense dual to the notion of uniform convexity.

Section 8.4 Renormings of superreflexive spaces, q-convexity and p-smoothness

235

Definition 8.22. The modulus of uniform smoothness of a normed space X is defined for > 0 as ± ° kx C yk C kx  yk  1 : x, y 2 X, kxk D kyk D 1 . (8.14) X . / D sup 2 A normed space X is called uniformly smooth if lim !0 X . / D 0. Furthermore, X is said to have modulus of smoothness of power type p if there exists a constant K such that X . /  K p for all > 0. Remark 8.23. Combining Remark 8.17 with Propositions 8.18 and 8.24 and with Theorem 8.27 we get that in this case necessarily p  2. Proposition 8.24. A normed space X has modulus of smoothness of power type p if and only if there exists a constant S > 0 such that for every x, y 2 X kx C ykp C kx  ykp  2 kxkp C 2 S p kykp .

(8.15)

Proof of the “if” part of Proposition 8.24. If (8.15) holds, then for all x, y 2 X with kxk D kyk D 1 and all > 0 we have kx C ykp C kx  ykp p1 kx C yk C kx  yk 1 1 2 2 1  .1 C S p k ykp / p  1 1  .S /p , p 1

where we use linear approximation and concavity of the function .1 C x/ p  1. Therefore X has modulus of smoothness of power type p. The “only if” part of Proposition 8.24 is proved later, see page 238. Definition 8.25. A normed space is called p-uniformly smooth if (8.15) holds for some S > 0. The least S for which (8.15) holds is called the p-smoothness constant of X, and is denoted Sp .X/. Before completing the proofs of Propositions 8.18 and 8.24 we establish some very important and useful duality results. We start with a classical result: Theorem 8.26 (D. Milman–Pettis). Every uniformly convex Banach space is reflexive. Proof. Let X be a uniformly convex Banach space. It suffices to show that for each h 2 X  , khk D 1 we have h 2 .X/, where  : X ! X  is the canonical embedding. By the Goldstine theorem there exists an ultrafilter U on SX such that w  limU x D h,

236

Chapter 8 Applications of Markov chains to embeddability problems

where w   lim denotes the limit in the weak topology. To complete the proof it suffices to show that the ultrafilter converges in the strong topology of X. Since X is a complete space, it is enough to show that U is a Cauchy ultrafilter in the sense that for each " > 0 there exists a subset A 2 U such that kx  yk < " for each x, y 2 A. Let ı D ıX ."/ > 0. Then x, y 2 X, kxk, kyk D 1, and kx C yk  2  ı imply that kxyk < ". Let f 2 SX  be such that jh.f /1j < ı=2. Let V D ¹g 2 X  : jg.f / 1j < ı=2º; V is a weak neighborhood of h. Hence V \ SX 2 U. Let x, y 2 V \ SX . Then jf .x/ C f .y/j > 2  ı and therefore kx C yk  2  ı. Hence kx  yk < ". Thus U is a Cauchy ultrafilter in the strong topology and its limit h is in .X/. Theorem 8.27. Let q1 C p1 D 1, p, q 2 .1, 1/. (a) A Banach space X is p-uniformly smooth if and only if its dual space X  is q-uniformly convex. Furthermore, the p-uniform smoothness constant of X is equal to the q-uniform convexity constant of X  . (b) A Banach space X is q-uniformly convex if and only if its dual space X  is p-uniformly smooth. Furthermore, the q-uniform convexity constant of X is equal to the p-uniform smoothness constant of X  . Proof. (a) Suppose that X  is q-uniformly convex with q-uniform convexity constant K. Let x, y 2 X. Let f 2 X  be such that kf k D 1 and f .x C y/ D kx C yk. Let g 2 X  be such that kgk D 1 and g.x  y/ D kx  yk. We are going to use the q-uniform convexity in the form (8.13) for suitably adjusted vectors f and g, namely for the vectors 1

' D N  q kx C ykp1 f ,

1

 D N  q kx  ykp1 g,

where

kx C ykp C kx  ykp . 2 The adjustment is such that k'kq C kkq D 2 and the first equality in the following formula holds N D



kx C ykp C kx  ykp 2

1

p

'.x C y/ C .x  y/ 2     '  'C .x/ C .y/ D 2 2   1    ' C  q  '   q q 1    .kxkp C kKykp / p    2  C  2K  1  1 k'kq C kkq q  .kxkp C kKykp / p 2 D

1

D .kxkp C kKykp / p .

Section 8.4 Renormings of superreflexive spaces, q-convexity and p-smoothness

237

The first inequality follows from the Hölder inequality. To get the second inequality we use the q-uniform convexity of X  in the form (8.13). Now suppose that X is p-uniformly smooth with p-uniform smoothness constant K. Let f , g 2 X  and ! > 0. Let x 2 X be such that kxk D 1 and kf C gk  .f C g/.x/ C !. Let y 2 X be such that kyk D 1 and kf  gk  .f  g/.y/ C !. We are going to use the p-uniform smoothness for suitably adjusted vectors x and y, namely for the vectors     1  f C g q1 1  f  g q1 y D Mp   ,  D Mp   x,  2 2K K where  f C g q  f  g q     M D  C  . 2 2K The adjustment is such that 2kkp C 2kK kp D 2 and the first equality in the following formula holds   1    f C g q  f  g q q   C   2   2K       f C g q1 f Cg  p1   .x/  M 2 2  q1    f  g y  f  g  p1   CM C R.!/ 2 K  2K      f Cg f g D ./ C . / C R.!/ 2 2     C  Df Cg C R.!/ 2 2 p   1       p p q q q1   C  C   .kf k C kgk /  C R.!/  2  2  .psmoothness/ 1 1 1  .kf kq C kgkq / q  .2kkp C 2kK kp/ p C R.!/ 2 1  q q q kf k C kgk C R.!/, D 2 where R.!/ > 0 is a function of ! satisfying lim!#0 R.!/. The second inequality follows from the Hölder inequality. To get the third inequality we use the p-uniform smoothness of X. Letting ! # 0 we get that X  is q-uniformly convex. (b) Can be proved by combining part (a) with Theorem 8.26. If X is q-uniformly convex, then, by Theorem 8.26, X .X/ D X  , thus X  is also q-uniformly convex. Applying part (a) we get that X  is p-uniformly smooth. On the other hand, if X  is p-uniformly smooth, then, by part (a), X  is q-uniformly convex and thus X  D X .X/.

238

Chapter 8 Applications of Markov chains to embeddability problems

There exists a very nice and simple formula for X  in terms of ıX . Theorem 8.28. For every Banach space X "

X  . / D sup  ıX ."/ , . > 0/. 0"2 2

(8.16)

Proof. In fact 2X  . / D

sup

.kf C gk C kf  gk  2/

f ,g2SX 

D

sup

sup ..f C g/.x/ C .f  g/.y/  2/

f ,g2SX  x,y2SX

D sup .kx C yk C kx  yk  2/ x,y2SX

D sup

sup

. "  .2  kx C yk//

0"2 x,y2SX ,kxykD"

D sup . "  2ıX ."// . 0"2

Remark 8.29. It seems that there is no simple reverse relationship, that is no simple formula expressing ıX ."/ in terms of X  . /. Proof of the “only if” part of Proposition 8.24. Suppose that X . /  .C /p for some constant C and 1 < p  2. Then, modifying slightly the definition of X , we get that for all kxk D 1 and kyk  1 we have kx C yk C kx  yk  1 C .C kyk/p . 2 We need a similar estimate for  1 kx C ykp C kx  ykp p . 2 In this connection we consider the difference:  1 kx C ykp C kx  ykp p kx C yk C kx  yk  . 2 2 This difference can be written in the form 1  kx C ykp C kx  ykp p kx C yk C kx  yk  2 2 1    .1 C ˇ/p C .1  ˇ/p p Db 1 , 2

(8.17)

Section 8.4 Renormings of superreflexive spaces, q-convexity and p-smoothness

239

where the numbers b and ˇ are given by b :D

kx C yk C kx  yk 2

The function

 w.ˇ/ :D

and

ˇ :D

kx C yk  kx  yk . kx C yk C kx  yk

.1 C ˇ/p C .1  ˇ/p 2

 p1 1

is such that w.0/ D 0 and w 0 .0/ D 0. Since this function is analytic at 0, we get that w.ˇ/  C.p/ˇ 2 for jˇj  1, where the constant C.p/ depends on p only. Since jˇj  kyk  kyk and 1  p  2, we get b 

kx C ykp C kx  ykp 2

1

p

1   1 C .C kyk/p C C.p/kyk2  1 C .Skyk/p p

for all x and y with kyk  kxk D 1, where the constant S depends only on C and p. Therefore kx C ykp C kx  ykp  kxkp C .Skyk/p (8.18) 2 for all x and y with kyk  kxk. Finally, since we may assume that S  1, the inequality (8.18) holds for all x and y. Proof of the “only if” part of Proposition 8.18. Suppose that ıX ."/  ."=C /q for some constant C . Then, by (8.16),    q   " " "  .C /p ,  ıX ."/  sup  X  . / D sup 2 2 C 0"2 0"1 p

q

where p1 C q1 D 1. Here we use the well-known inequality ˛ˇ  ˛p C ˇq  ˛ p Cˇ q . By Proposition 8.24 we get that X  is p-uniformly smooth. Applying Theorem 8.27 we get that X is q-uniformly convex. Example 8.30 (Without proof). Well-known and very important examples of q-uniformly convex and p-uniformly smooth Banach spaces are the classical Lp .0, 1/ spaces: (a) Lp .0, 1/ is p-uniformly smooth and 2-uniformly convex if 1 < p  2. (b) Lp .0, 1/ is 2-uniformly smooth and p-uniformly convex if 2  p < 1.

8.4.2 Pisier theorem on renormings of uniformly convex spaces The following class of Banach spaces plays a very important role in Banach space theory.

240

Chapter 8 Applications of Markov chains to embeddability problems

Definition 8.31. A Banach space X is called superreflexive if each Banach space Y which is finitely representable in X is reflexive. Example 8.32. Each uniformly convex and each uniformly smooth Banach space is superreflexive. In fact, both uniform convexity and uniform smoothness are determined by two-dimensional subspaces, and hence any Banach space which is finitely represented in a uniformly convex (uniformly smooth) Banach space is uniformly convex (uniformly smooth). It remains to use the fact that uniformly convex and uniformly smooth Banach spaces are reflexive (Theorem 8.26 and Exercise 8.63). The converse does not hold, it is not difficult to construct examples of superreflexive Banach spaces which are not uniformly convex or smooth. It becomes immediate if we observe that all finite-dimensional Banach spaces are superreflexive. Another way of showing that the converse does not hold is based on the following. Banach spaces X and Y are called isomorphic if there exists a bijective bounded operator T : X ! Y such that T 1 is also bounded (actually boundedness of T 1 is redundant, it follows from the open mapping theorem). Proposition 8.33. If X and Y are isomorphic and X is superreflexive, then Y is also superreflexive. Proof. Assume the contrary. Then there exists a nonreflexive space Z which is finitely representable in Y . By Proposition 2.33, there exists an ultrafilter U such that the ultrapower Y U contains a subspace isometric to Z. Then (as is easy to check) the ultrapower X U contains a subspace isomorphic to Z. It is well known (and follows, for example, from Theorem 6.11 (a),(d),(e)) that a Banach space isomorphic to a nonreflexive Banach space is nonreflexive. Therefore X U contains a nonreflexive subspace. By Proposition 2.31, X U , and hence Z is finitely representable in X. Thus X is not superreflexive. We get a contradiction. Remark 8.34. Sometimes Proposition 8.33 is stated as: superreflexivity is an isomorphic invariant. On the other hand, it is easy to see that a Banach space isomorphic to a uniformly convex space does not have to be uniformly convex. Example 8.35. Consider the space `2 with the norm 1 jjj¹xi º1 i D1 jjj D max¹k¹xi ºi D1 k`2 , jx1 j C jx2 jº.

Denote this space by Y . It is easy to verify that Y is not uniformly convex. On the other hand, the identical embedding T : `2 ! Y is bounded and has bounded inverse, so Y is isomorphic to `2 . Hence `2 with the norm jjj  jjj is superreflexive, but not uniformly convex.

241

Section 8.4 Renormings of superreflexive spaces, q-convexity and p-smoothness

A very important result on superreflexive spaces is the following theorem: Theorem 8.36 (Enflo). Each superreflexive Banach space is isomorphic to a uniformly convex Banach space and to a uniformly smooth Banach space. This theorem admits the following strengthening: Theorem 8.37 (Pisier). Each superreflexive Banach space is isomorphic to a q-uniformly convex Banach space for some 2  q < 1 and to a p-uniformly smooth Banach space for some 1 < p  2. We are not going to give complete proofs of these theorems in this book, we shall prove only the following important for us part of Theorem 8.37. Theorem 8.38. Each uniformly convex Banach space is q-uniformly convex for some 2  q < 1. Using the duality (Theorems 8.26, 8.27, 8.28, and hints to Exercise 8.63) we get: Corollary 8.39. Each uniformly smooth Banach space is p-uniformly smooth for some 1 < p  2. Proof of Theorem 8.38. First we introduce a notion which is very natural in the study of uniform convexity. Let C be a closed bounded subset of a real Banach space X, a 2 R, and f 2 X  . Then the set S D ¹x 2 C : f .x/ > aº is called a slice of C . We define the following set-valued derivatives on bounded subsets of a Banach space. Let " > 0 and C be a bounded subset in a Banach space X. We define C"0 D C and C"1 D C"0 D ¹x 2 C such that any slice of C containing x has diameter > "º. For a positive integer n  2 the dent-derivative of C of order n is defined by C"n D .C"n1 /1" . Remark 8.40. Such set-valued derivatives are usually defined not only for positive integers n, but also for arbitrary ordinal numbers n, however we do not need this generalization here. We define the dentability index of a bounded set in a Banach space by ´ inf¹n : C"n D ;º if such n 2 N exists Dent.C , "/ D 1 otherwise.

(8.19)

242

Chapter 8 Applications of Markov chains to embeddability problems

Lemma 8.41. If X is a uniformly convex Banach space, then Dent.C , "/ < 1 for each bounded set C  X and each " > 0. Proof. It is clear that if C  A, then C"n  An" for each n 2 N and each " > 0 and therefore Dent.C , "/  Dent.A, "/. In is also clear that .rBX /0" D ; if r  "2 (where rBX is a multiple of a unit ball). Therefore it suffices to show that for each r > "2 we have .rBX /0"  .1  !.", r , X//rBX (8.20) for some 0 < !.", r , X/ < 1 which depends only on ", r , and X and is non-increasing in r > 0 for fixed " and X. The desired property of balls almost coincides with the definition of the uniform convexity. In fact, first let us show that a point z 2 BX is contained in a slice of diameter  " if kzk > 1ıX ."/. In fact, let z  be such that kz  k D 1 and z  .z/ D kzk. Let S be the slice ¹x 2 X : z  .x/ > kzkº. Although z … S, the small perturbation argument implies that it suffices to show that diam.S/ < ". Suppose not, then S contains points u and w such that ku  wk  ". Using convexity of the norm we may assume that kuk D kwk D 1. In fact, we may get that one of the norms is equal to 1 just by extending the line segment joining u and w in the direction in which the functional z  is non-decreasing, so we assume that kuk D 1 and kwk  1. Now we consider a line w C t r , where t 2 R and r 2 ker z  . This line intersects the unit sphere in two points. By convexity of the norm, the distance between u an at least one of the intersection points is  ". Also it is easy to see that we may assume ku  wk D " (just consider a continuous curve on the set SX \ ¹x 2 X : z  .x/ > kzkº joining u and w, it should contain a point satisfying this condition). By the definition of the modulus of convexity we get   u C w    2   1  ıX ."/. 

On the other hand z

uCw 2

 > kzk > 1  ıX ."/.

We get a contradiction. Thus we have proved that .BX /0"  .1  ıX ."//BX . By homothety, this implies .rBX /0r "  .1  ıX ."//rBX . Applying this result to "0 D

" r

we get

"

.rBX /0"  1  ıX rBX r

Section 8.4 Renormings of superreflexive spaces, q-convexity and p-smoothness

243

provided r"  2. It remains to show that ıX .t /, 0  t  2 is a non-decreasing function. By Remark 8.15, it suffices to do so for a two-dimensional space X. Let 0 < t0   2, xCy   let x and y be such that kxk D kyk D 1, kx  yk D t0 and ıX .t0 / D 1  . It 2 is enough to show that there exists t1 < t0 such that for each s 2 Œt1 , t0 there exist x 0 and y 0 such that kx 0k D ky 0 k D 1, kx 0  y 0 k D s and     0  x C y0   x C y      2   2 . To achieve this goal we introduce the functional f 2 X  such kf k D 1 and     x C y  xCy  . D f 2 2  

If f .x/ D f .y/ D f

 xCy , 2

(8.21)

the claim is obvious: We just move the line containing x and y in the direction of increase of f keeping it parallel to the original position until its intersection with BX has length t 2 Œ0, t0 . Then the midpoint m of the obtained line segment satisfies:     x C y  xCy  D kmk  f .m/  f  2 . 2 If (8.21) is not satisfied, we assume without f .y/ f .x/.  Let  of generality that  xCy ı>xCy  loss . Our  > 0 and let v D g 2 X  be such that g.x/ D g.y/ D g xCy 2 2 2 choice of f implies that f .v/  f .y/. Therefore the line n1 D ¹u 2 X : f .u/ D f .y/º either has common points with the intersection of the interior of BX with the half-plane ¹u : g.u/ > g.x/º, or contains a line segment on the surface of BX (the reader is advised to sketch a picture). The latter case cannot occur because we assumed that X is uniformly convex. In the former case the line n2 D ¹u 2 X : f .u/ D f .x/º cannot intersect the intersection of the interior of BX with the half-plane ¹u : g.u/ > g.x/º and therefore the midpoints m.c/ of the line segments which are intersections of   . BX and lines of the form ¹u : g.u/ D cº with c > g.x/ satisfy f .m.c//  f xCy 2 This completes the proof of the lemma. Now we turn to the proof of Theorem 8.38. We let Nk D Dent.BX , 2k /  1 and introduce the following function on X: f .x/ D kxk C

Nk k 1 X X 2 d.x, B2nk /, Nk

kD1 nD1

where B D BX , the sets ¹B2nk ºn,k are its dent-derivatives, and d.x, B2nk / is the distance in the original norm of X from x to the dent-derivative.

244

Chapter 8 Applications of Markov chains to embeddability problems

Let jjj  jjj be the Minkowski functional of the convex symmetric set C D ¹x 2 X : f .x/  1º, that is jjjxjjj D inf¹ 2 .0, 1/ :

x 2 C º.

Observe that kxk > 1 implies f .x/ > 1 and that kxk  12 implies f .x/  1. Therefore kxk  jjjxjjj  2kxk for all x 2 X. As it is well known, the Minkowski functional of a bounded convex symmetric set is a norm, therefore .X, jjj  jjj/ is a normed space, and the identical embedding of the Banach space .X, kk/ into .X, jjjjjj/ is an isomorphism. It remains to show that the Banach space .X, jjj  jjj/ is q-uniformly convex for some q 2 Œ2, 1/. Lemma 8.42. If " > 0 and x, y 2 X are such that f .x/ D f .y/ D 1 and kx  yk  ", then   xCy "2 f  1 (8.22) 2 .   2 32 Dent BX , " 8

Proof. Let k 2 N be such that "8  2k  "4 and n 2 N be the largest number for which both x 2 B2nk and y 2 B2nk . Assume that x 2 B2nk nB2nC1 k . Observe that xCy nC1 kx  yk  " implies that n < Nk (because it implies 2 2 B2k ). Let D 4N" . k The main idea of the proof is to show that the gap between the average of distances and the distance from the midpoint is sufficiently nontrivial for at least one of the terms on the suitable scale. We mean the following claim. Claim 8.43. There exists 1  m  Nk  n such that    1 x C y nCm nCm nCm d.x, B2k / C d.y, B2k /  d , B2k  . 2 2

(8.23)

Proof. First we show that the contrary to (8.23), that is,  1 nCm d.x, B2nCm k / C d.y, B2k / 2   x C y nCm < d , B2k 2

(8.24)

 1 nCm d.x, B2nCm k / C d.y, B2k / < m . 2

(8.25)

8m 2 ¹1, : : : , Nk  nº

implies 8m 2 ¹1, : : : , Nk  nº

which are at The idea of the argument is that the midpoint of two points in B2nCm k , and this leads to the desired estimate. distance > 22k from each other is in B2nCmC1 k

Section 8.4 Renormings of superreflexive spaces, q-convexity and p-smoothness

245

In fact, for m D 1 we have x, y 2 B2nk and kx  yk  " > 2  2k . Therefore xCy 2 B2nC1 k and the assumption (8.24) for m D 1 implies the desired estimate (8.25) 2 for m D 1. To verify (8.25) for other values of m we use induction. So we assume that (8.25) holds for some value of m and verify the same statement for m C 1. Inequality (8.25) such that 12 .kx  xzk C ky  yk/ z < m . This implies that there exist xz, yz 2 B2nCm k inequality implies that xz and yz are sufficiently far from each other: kz x  yk z > "  2m  Therefore

x z Cy z 2

"  2  2k . 2

2 B2nCmC1 . Since k    x C y xz C yz     2  2  < m , 

we conclude that d

 x C y nCmC1 < m . , B2k 2

Together with the assumption that the inequality in (8.24) holds for m, this implies  1 d.x, B2nCmC1 / C d.y, B2nCmC1 / < .m C 1/ , k k 2 thus we have proved (8.25). Now we apply the argument used in the inductive step to m D Nk  n. We find k such that kz x  yk z > "  2.Nk  n/  "2 . Therefore xz, yz 2 B2Nk

N C1

k We get a contradiction because we assumed that B2k

x zCy z 2

k C1 2 B2Nk .

is empty.

Using Claim 8.43 it is easy to complete the proof of Lemma 8.42. In fact, let k be such that "8  2k  "4 and let m be such that (8.23) holds. Since all other terms having the same form as the left-hand side of (8.23) are nonnegative, and also 12 .kxkC     0, inequality (8.23) implies kyk/   xCy 2 1 .f .x/ C f .y//  f 2



xCy 2

 

2k . Nk

Recalling the choice of k and using the definition of and f .x/ D f .y/ D 1, we get   "2 xCy  1 f 2 .   2 32 Dent BX , "8 Using Lemma 8.42 it is easy to derive that the Banach space X with the norm jjj  jjj is uniformly convex and to find an estimate for its modulus of convexity. To this end,

246

Chapter 8 Applications of Markov chains to embeddability problems

let x and y be such that jjjxjjj D jjjyjjj D 1 and jjjx  yjjj D ". Then f .x/ D f .y/ D 1 and kx  yk  "2 (the double sum in the definition of f cannot exceed the first term). By Lemma 8.42, we get   "2 xCy  1 f  .   " 2 2 128 Dent BX , 16 The definition of f immediately implies that f , considered as a function on the space X endowed with the norm k  k, is 2-Lipschitz. Therefore the k  k-distance "2 and any point z satisfying f .z/ D 1 is  between xCy 2 . Since jjjjjj  2 " 256.Dent .BX , 16 // k  k, the same applies to the jjj  jjj-distance. Thus jjj

xCy "2 jjj  1  2 ,   " 2 256 Dent BX , 16

and the modulus of convexity of the Banach space X with the norm jjj  jjj is at least ıX,jjjjjj."/ 

"2 2 .   " 256 Dent BX , 16

(8.26)

Therefore to complete the proof of Theorem 8.38 it suffices to show that for each uniformly convex Banach space X there are constants 0 < C , r < 1 such that Dent.BX , "/ 

C . "r

(8.27)

Unfortunately, the proofs of (8.27) known at the moment are rather indirect. (More direct proofs would be very useful for our understanding of uniformly convex spaces.) What we present below is a roundabout approach to proving (8.27). One of the reasons for which (8.27) resists a direct proof is that it seems difficult if at all possible to show the dentability index Dent.C , "/ satisfies the following submultiplicativity relation: Dent.BX , "  z "/  Dent.BX , "/  Dent.BX , "z/.

(8.28)

Such a relation, if true, would imply (8.27) because it is well known that submultiplicativity implies power-type estimates. However there exists another index, called weak Szlenk index, which is similar to the dentability index, for which a similar submultiplicativity relation is quite easy to prove. On the other hand, there exists an inequality between the dentability index of X and the weak Szlenk index of the space Lp .X/ of vector valued functions. The proof of Theorem 8.38 is obtained by combining these facts and the result on uniform convexity of Lp .X/ for a uniformly convex X.

Section 8.4 Renormings of superreflexive spaces, q-convexity and p-smoothness

247

Let " > 0 and C be a bounded subset in a Banach space X. We define C"h0i D C and C"h1i D ¹x 2 C such that any weak-open neighborhood of C containing x has diameter > "º. For a positive integer n  2 the weak Szlenk derivative of C of order n is defined by C"hni D .C"hn1i /"h1i . We define the weak Szlenk index of a bounded set in a Banach space by ´ hni inf¹n : C" D ;º if such n 2 N exists wSz.C , "/ D 1 otherwise. h1i

(8.29)

hni

Observation 8.44. It is clear that C"  C"1. Therefore C"  C"n for each n 2 N. Hence wSz.C , "/  Dent.C , "/ and Lemma 8.41 implies wSz.C , "/ is finite for any bounded subset of a uniformly convex Banach space. For weak Szlenk index the submultiplicativity relation is easy to get: Lemma 8.45. "/  wSz.BX , "/  wSz.BX , z "/. wSz.BX , "  z

(8.30)

Proof. It suffices to prove 8n 2 N

hnwSz.X,z "/i

.BX /"z"

 .BX /hni " .

We prove this by induction. The statement is obviously true for n D 0. We assume that it holds for n and prove it for n C 1. hnC1i h.nC1/wSz.X,z "/i Let x … .BX /" . We need to show that x … .BX /"z" . Since the hni

derivatives are decreasing, we may assume that x 2 .BX /" . Therefore there is a   hni  weak-open neighborhood U of x such that diam U \ .BX /"  ". Therefore    x C "BX . U \ .BX /hni " On the other hand it is clear that hwSz.BX ,z "/i

."BX /"z" Therefore

D ;.

  hwSz.BX ,z"/i U \ .BX /hni D ;. " "z "

248

Chapter 8 Applications of Markov chains to embeddability problems

Since the intersection of two weak-open neighborhoods

is a weak-open neighborhood, hni this implies that all points of the set U \ .BX /" which are not in 

hwSz.BX ,z"/i  , U \ .BX /hni " "z "

are also not in



.BX /hni "

hwSz.BX ,z"/i "z "

.

In particular, using the induction hypothesis we get hwSz.BX ,z"/i  "/i .BX /h.nC1/wSz.X,z . x … .BX /hni " "z " "z " Next, we establish the connection between values of the dentability index of X and the weak Szlenk index of Lp .X/. Lemma 8.46. Let X be a uniformly convex Banach space and let 1 < p < 1, then Dent.BX , "/  wSz.Lp .X/, "2 /. Remark 8.47. We restrict our attention to uniformly convex X and to p 2 .1, 1/ because only these cases will be needed for our proof. Here Lp .X/ is the space of X-valued functions on Œ0, 1 endowed with its Lp norm. This space can be described as the completion of the linear space of simple functions with respect to the Lp -norm defined in the following way. define a simple X-valued function on Œ0, 1 as a function of the form h.t / D PWe n n i D1 1Ai .t /xi where measurable subsets ¹Ai ºi D1 of Œ0, 1 form a partition of Œ0, 1, 1Ai are the indicator functions, and xi are vectors in X. The Lp -norm of h is defined by ! p1 n X kxi kp m.Ai / , khk :D i D1

where m.Ai / is the Lebesgue measure of the set Ai . Proof of Lemma 8.46. It suffices to prove the claim that for each finite collection P hni , where B is the x1, : : : , xm 2 .BX /n" , the function f D m i xi is in B i D1 1. i1 "=2 m ,m unit ball of Lp .X/. The claim is clearly true for n D 0. Now we assume that it is true for n and prove . We need to prove that the function f D it for n C 1. So let x1 , : : : , xm 2 .BX /nC1 " Pm hnC1i hni i D1 1. i1 , i  xi is in B"=2 . The induction hypothesis implies that f 2 B"=2 . m

m

The fact that xi 2 .BX /nC1 implies that the set of points of .BX /n" with distance > "2 " to xi at cannot be separated from xi by a linear functional (the separating functional could be used to get a slice of diameter  " containing xi ). The well-known separation theorem for closed convex sets implies that xi is in the closure in the strong topology of the convex hull of the set of points in .BX /n" with distance > "2 to xi . Thus for

Section 8.4 Renormings of superreflexive spaces, q-convexity and p-smoothness

249

k.i /

each i 2 ¹1, : : : , mº and each > 0 there exist ¹xi ,j ºj D1  .BX /n" and positive real k.i /

numbers ¹˛i ,j ºj D1 such that   k.i / X   xi   ˛ x i ,j i ,j  < ,  j D1

Pk.i /

D 1, and kxi  xi ,j k > "2 for all j D 1, : : : , k.i/. We use these vectors xi ,j in order to construct a sequence ¹fk º  Lp .X/ such that Ps  Each fk is a simple function of the form pD1 1. p1 , p  yp,k , where s and the s s n vectors yp,k 2 .BX /" depend on k. j D1 ˛i ,j



The sequence ¹fk º converges weakly to a function g 2 Lp .X/ satisfying kf  gk < .



Each element of the sequence ¹fk º satisfies kf  fk k  "2 .

The existence of such a sequence leads to the desired conclusion. In fact, consider an arbitrary weak-open neighborhood V of f . Picking > 0 to be small enough we may assume that V contains the ball of radius centered at f . Since the sequence ¹fk º converges weakly to some element of this ball, the set V contains some members of hni the sequence ¹fk º. By the induction hypothesis we have fk 2 B"=2 . Thus diam.V \ hni

hnC1i

B"=2 /  "2 . Therefore f 2 B"=2 . So it remains to construct the sequence ¹fk º. First we do the construction in the case where m D 1 and x D x1 , and denote the obtained sequence by ¹fk.0,1,x º1 . kD1 Observe that approximating ˛i ,j by rational numbers and repeating some of the vectors xi ,j several times, we may assume that all of the coefficients ˛i ,j are equal to each other and that their common value is 1` , where ` 2 N. So we assume that the P inequalities kx1  x1,j k > "2 and kx1  `1 j`D1 x1,j k < hold. We introduce the function .t/ D

` X j D1

1. j 1 , j  .t /x1,j `

`

and extend it as a periodic function with period 1 to the whole real line. We define the .0,1,x function fk : .0, 1 ! X by .0,1,x

fk

.t / D .kt /.

All properties of this sequence except the statement about the weak limit are easy to check.

250

Chapter 8 Applications of Markov chains to embeddability problems .0,1,x

As for the weak limit, we claim that the weak limit of the sequence ¹fk º1 kD1 1 P` is the function which takes the value ` j D1 xi ,j at each point of the interval .0, 1. To verify this statement we need the description of the dual space of Lp .X/. The description is: Theorem 8.48. If the Banach space X is uniformly convex, the dual space to Lp .X/ is Lq .X  /, where p1 C q1 D 1 and X  is the dual space of X. If f 2 Lp .X/ and F 2 R1 Lq .X  /, the action F .f / is given by F .f / D 0 .F .t //.f .t //dt , where F .t / 2 X  , f .t / 2 X, and .F .t //.f .t // is the value of F .t / at f .t /. The standard proof of this theorem is based on the theory of vector measures and is not included here, see Notes and Remarks for references. Now let x  2 X  , d 2 N, and j 2 ¹1, : : : , d º. We introduce F 2 Lq .X  / by F D 1. j 1 , j  x  . A simple computation shows that we have d

d

lim

k!1

.0,1,x F .fk /

  ` 1 1X  D x .xi ,j / D F .g/, d ` j D1

P where g is the constant function on .0, 1 taking the value 1` j`D1 xi ,j . The description of the dual space presented in Theorem 8.48 implies that the linear span of the functionals of the form 1. j 1 , j  x  is dense in the space Lq .X  /. The d d conclusion follows. As for the general case, we observe that, as is easy to check, the functions fk D

m X

i . i1 m , m ,xi

fk

i D1 .a,b,x

.0,1,x

form the desired sequence, where fk is defined as a “compression” of fk the interval .a, b, that is, ´ .0,1,x  t a  if t 2 .a, b fk .a,b,x ba fk .t / D 0 otherwise.

to

The established relation between Dent.X, "/ and wSz.Lp .X/, "=2/ can be useful only if we can prove that the indices wSz.Lp .X/, "=2/ are finite. In this connection our next step is to show that Lp .X/ is uniformly convex. Theorem 8.49. If X is a uniformly convex space and 1 < p < 1, then the space Lp .X/ is also uniformly convex.

Section 8.4 Renormings of superreflexive spaces, q-convexity and p-smoothness

251

Proof. First we show that for each " > 0 there is ı."/ > 0 such that for y, z 2 X satisfying 1 D kyk  kzk and ky  zk  " we have   p p  y C z p    .1  ı."// kyk C kzk .  2  2 1 Assume the contrary, then for some " > 0 there are sequences ¹yn º1 nD1 , ¹znºnD1 in X such that 1 D kyn k  kzn k, kyn  zn k  ", and    yn Czn p  2  lim ky kp Ckz kp D 1. (8.31) n!1

n

2

n

Now we use the strict convexity of the function t p , t  0, 1 < p < 1 and get   1 C t p 1 C tp < , t  0, t ¤ 1. (8.32) 2 2 By (8.31) and the triangle inequality we get

p lim

n!1

1Ckzn k 2

1Ckzn kp 2

D 1.

By (8.32), this implies limn!1 kznk D 1. Let un D zn =kznk. Then limn!1 kun  zn k D 0, therefore lim inf kun  yn k D lim inf kzn  yn k  ". n!1

n!1

On the other hand, applying (8.31) we get    yn C un   D 1. lim   n!1  2 This contradicts the uniform convexity of X. If y, z 2 X are not both zero, we divide them by max¹kyk, kzkº, and immediately get       y C z p kykp C kzkp ky  zk   1ı  . (8.33)  2  max¹kyk, kzkº 2 Now let f , g 2 Lp .X/, " > 0, kf kp D kgkp D 1 and kf  gkp  ". Let D be the subset of Œ0, 1 (recall that Lp .X/ is the set of functions on Œ0, 1) for which kf .x/  g.x/kp 

"p "p .kf .x/kp C kg.x/kp /  max¹kf .x/kp , kg.x/kp º. 4 4 (8.34)

252

Chapter 8 Applications of Markov chains to embeddability problems

Applying (8.33), we get that for each x 2 D       f .x/ C g.x/ p kf .x/kp C kg.x/kp   1ı "  . 1   2 2 4p

(8.35)

Inequality (8.34) implies Z Œ0,1nD

Therefore

kf .x/  g.x/kp dx 

Z D

kf .x/  g.x/kp dx 

"p . 2

"p . 2

(8.36)

Let fQ, gQ 2 Lp .X/ be the functions which coincide with f and g, respectively, on D, and are equal to 0 on Œ0, 1nD. By (8.36) we have " kfz  gzkp  1 , 2p and hence max¹kfzkp , kgzkp º 

" 2

1 p C1

.

(8.37)

We have Z

  Z  f .x/ C g.x/ p kf .x/kp C kg.x/kp   dx dx    2 2 Œ0,1 Œ0,1   Z Z  fz.x/ C gz.x/ p kfz.x/kp C kgz.x/kp   dx    dx   2 2 Œ0,1 Œ0,1    z Z (8.35) " kf .x/kp C kgz.x/kp  ı dx 1 2 p Œ0,1 4   p (8.37) " "  ı . 1 pC2 4p 2

Therefore

Letting

     p 1 p f C g  "    1ı " . 1  2  pC2 p 4p 2

(8.38)

   p  p1 " " , ı1."/ D 1  1  ı 1 pC2 4p 2 we get a lower estimate for the modulus of convexity of Lp .X/. It is clear that ı1."/ > 0 if " > 0.

Section 8.5 Markov type of uniformly smooth Banach spaces

253

The argument used in the next lemma is well known, we already used a version of it in Lemma 2.58 (page 68). Lemma 8.50. For each uniformly convex space X there exist C , r 2 .0, 1/ such that wSz.BX , "/  "Cr for all " 2 .0, 2. Proof. It is clear that wSz.BX , 12 / D 2r for some r 2 .0, 1/. By the submultiplicativity Lemma 8.45, we get   n  1 1   1 n r . wSz BX , (8.39) 2 2 Now we use the obvious fact that wSz.BX , "/ is a non-increasing function. Letting C D max"2Œ 1 ,2.wSz.BX , "/  "r ) we immediately get that the desired inequality holds 2  blog2 .1="/c      R, where R 2 12 , 1 . for each " 2 Œ 12 , 2. Now let " 2 0, 12 . Then " D 12 By Lemma 8.45 and (8.39), wSz.BX , "/  "r    blog2 .1="/c   blog2 .1="/c r  1 1  wSz BX ,   .wSz.BX , R/  Rr / 2 2  1  C D C. This completes the proof of Lemma 8.50. Now we can complete the proof of Theorem 8.38. Since X is uniformly convex, the space Lp .X/ .1 < p < 1/ is also uniformly convex. Hence, by Lemmas 8.46 and 8.50, we get Dent.BX , "/  "Cr . Combining this estimate with the estimate for the modulus of convexity of .X, jjj  jjj/ obtained in (8.26) we get that X is q-uniformly convex for q D 2r C 2.

8.5 Markov type of uniformly smooth Banach spaces Everywhere in this section X is a p-uniformly smooth Banach space with p-smoothness constant Sp .X/. The main result of this section is: Theorem 8.51. Let 1 < p  2 and let X be a p-uniformly smooth Banach space with p-smoothness constant Sp .X/. Then the Markov type of X satisfies the following inequality 8 Sp .X/. Mp .X/  pC1 .2  4/1=p In the next results we use the notion of a Banach space valued random variable, martingale and a conditional expectation, the reader is referred to Section 1.3 for the definitions and basic facts.

254

Chapter 8 Applications of Markov chains to embeddability problems

Lemma 8.52. Fix 1 < p  2 and let Z 2 Lp ., †, P, X/. Then EkZkp  kEZkp C

Sp .X/p  EkZ  EZkp . 2p1  1

Proof. Let   0 be the largest constant such that for every Z 2 Lp .X/,    EkZkp  kEZkp  EkZ  EZkp .

(8.40)

Our goal is to show that   .2p1  1/Sp .X/p . To this end, fix " > 0 and Z 2 Lp .X/ such that   (8.41) . C "/ EkZkp  kEZkp > EkZ  EZkp . Applying Definition 8.25 of Sp .X/ to the vectors x D .Z C EZ/=2 and y D .Z  EZ/=2, we get the pointwise inequality: 1 1 p p 1 1     kZkp C kEZkp  2 Z C EZ  C 2 Sp .X/p  Z  EZ  . 2 2 2 2

(8.42)

We take expectations of both sides of (8.42) and get (8.41) 1 EkZ  EZkp < EkZkp  kEZkp  C"    p p  (8.42) 1 1 1   p 1  2 E Z C EZ  C Sp .X/ E Z  EZ   2kEZkp 2 2 2 2     p   1 1 1  p 1  Z C EZ  D 2 E Z C EZ   E 2 2 2 2 p 1 1   C 2 Sp .X/p E Z  EZ  2 2 p p 1 (8.40) 2  1 1 1    E Z  EZ  C 2 Sp .X/p E Z  EZ  .   2 2 2 2

It follows that ity.

2p C"



2 

C 2Sp .X/p . Letting " tend to zero we get the desired inequal-

We also need a conditional version of Lemma 8.52. For most of the applications it suffices to consider the case where the -subalgebras Fi are finite. Lemma 8.53. Fix 1 < p  2, let Z 2 Lp ., †, P, X/ and let F be a -subalgebra of †. Then for almost every ! 2  E.kZkp jF/.!/  kE.ZjF/.!/kp C

Sp .X/p  E.kZ  E.ZjF/kp jF/.!/. 2p1  1

Section 8.5 Markov type of uniformly smooth Banach spaces

255

Since in this book we are going to apply martingales only in the case where all subalgebras are finite, we skip the proof of this lemma. In fact, in the case where the -algebra F is finite, A  † is one of its atoms (with positive P.A/), and ! 2 A, the inequality reduces to EA kZkp  kEA Zkp C

Sp .X/p  EA kZ  EA Zkp , 2p1  1

where

E.W  1A / P.A/ for any random variable (scalar- or vector-valued). It is clear that the obtained inequality is the same as the inequality obtained in Lemma 8.52, but for a different probability space. EA W D

Theorem 8.54. Fix 1 < p  2 and let ¹Mk ºnkD0  Lp ., †, P, X/ be a martingale in X. Then EkMn  M0 kp 

n1 Sp .X/p X EkMkC1  Mk kp .  2p1  1 kD0

Proof. Assume that ¹Mk ºnkD0 is a martingale with respect to the filtration F0  F1      Fn1 ; that is, Mi D E .Mi C1jFi / for i D 0, 1, : : : , n  1. By Lemma 8.53 applied to Z D Mn  M0 and F D Fn1 , we get   E kMn  M0kp j Fn1 .!/  kE.Mn  M0 jFn1 /.!/kp Sp .X/p  E.kMn  M0  E.Mn  M0 jFn1 /kp jFn1 /.!/. C p1 2 1 Using linearity of conditional expectation, this inequality can be rewritten as a pointwise inequality between functions:   Sp .X/p  E.kMn  Mn1 kp jFn1 /. E kMn  M0kp j Fn1  kMn1  M0 kp C p1 2 1 Taking expectations of both sides we get EkMn  M0 kp  E kMn1  M0 kp C

Sp .X/p  EkMn  Mn1 kp , 2p1  1

and the required inequality follows by induction. Lemma 8.55. Let X be a normed space, ¹Z t º1 t D0 a stationary reversible Markov chain on ¹1, : : : , nº and f : ¹1, : : : , nº ! X. Then for every t 2 N there are two X-valued martingales ¹Ms ºtsD0 and ¹Ns ºtsD0 (with respect to two different filtrations) with the following properties:

256

Chapter 8 Applications of Markov chains to embeddability problems

(a) For every 1  s  t  1 the equality f .ZsC1 /  f .Zs1 / D .MsC1  Ms /  .N t sC1  N t s /

(8.43)

holds. (b) For every 0  s  t  1 and p  1, max ¹EkMsC1  Ms kp , EkNsC1  Ns kp º  2p Ekf .Z1 /  f .Z0 /kp . (8.44) Proof. Let A D .aij / be the transition matrix of the Markov chain ¹Z t º, and let i :D P.Z0 D i/. As we have mentioned above (see Remark 8.11) we assume that the distribution ¹i ºniD1 is reversible. Define the map .Lf / : ¹1, : : : , nº ! X by Lf .i/ D

n X

aij .f .j /  f .i// D

j D1

n X

aij f .j /  f .i/.

j D1

We use E.f .Zs /jZ0 , : : : , Zs1 / to denote the conditional expectation of the random variable f .Zs / with respect to the -algebra generated by random variables Z0 , : : : , Zs1 , that is, with respect to the -algebra generated by all sets of the form ¹! : Z0 .!/ D x0, : : : , Zs1 .!/ D xs1 º. We have E.f .Zs /jZ0 , : : : , Zs1 / D E.f .Zs /jZs1 / D Lf .Zs1 / C f .Zs1 /, where the first equality follows from the definition of a Markov chain, and the second equality is obtained by making the computation for each possible value of Zs1 . We claim that since ¹Zs º1 sD0 is stationary, and its stationary distribution is reversible, we also have that for every 0  s < t , E.f .Zs /jZsC1 , : : : , Z t / D Lf .ZsC1 / C f .ZsC1 /.

(8.45)

In fact, reversibility implies (by induction) that for each k  1 and each collection i0 , i1 , : : : , ik 2 ¹1, : : : , nº we have i0 ai0 ,i1 ai1,i2 , : : : , aik1,ik D ik aik ,ik1 aik1,ik2 , : : : , ai1 ,i0 .

(8.46)

If s  k, the equality (8.46) can be written as P.Zs D i0 , ZsC1 D i1 , : : : , ZsCk D ik / D P.Zsk D ik , ZskC1 D ik1 , : : : , Zs D i0 /. The equation (8.45) follows.

(8.47)

257

Section 8.5 Markov type of uniformly smooth Banach spaces

We define M0 D f .Z0 /. For each s  1 we define Ms by Ms D f .Zs / 

s1 X

Lf .Zr /.

r D0

We have E.Ms jZ0 , : : : , Zs1 / D Lf .Zs1 / C f .Zs1 / 

s1 X

Lf .Zr / D Ms1 ,

r D0

P where we used the observation that s1 r D0 Lf .Zr / is measurable with respect to the algebra generated by Z0 , : : : , Zs1 . Therefore, ¹Ms º1 sD0 is a martingale with respect 1 to the natural filtration ¹Fk ºkD1, where Fk induced by ¹Zs ºksD0 . Observe that each -algebra in this filtration is finite. We also define N0 :D f .Z t / and Ns :D f .Z t s / 

t X

Lf .Zr /.

r Dt sC1

for 1  s  t . As in the case of ¹Ms º, for s  1 we have E.Ns jZ t sC1 , : : : , Z t / D Ns1 . Therefore ¹Ns ºtsD0 is a martingale with respect to the natural filtration ¹Fzk ºtkD0, where Fzk is generated by Z t , Z t 1 , : : : , Z t k . The equalities MsC1  Ms D f .ZsC1 /  f .Zs /  Lf .Zs /, NsC1  Ns D f .Z t s1 /  f .Z t s /  Lf .Z t s /. imply (8.43). To prove (8.44) observe that for every s  0 and q  1, EkLf .Zs /kp D

n X

n X p   i  aij Œf .j /  f .i/

i D1



n X n X

j D1

i aij kf .j /  f .i/kp

i D1 j D1

D Ekf .Z1 /  f .Z0 /kp . Therefore, EkMsC1  Ms kp D Ekf .ZsC1 /  f .Zs /  Lf .Zs /kp  2p1 Ekf .ZsC1/f .Zs /kp C2p1 EkLf .Zs /kp  2p Ekf .Z1 /f .Z0 /kp , and similarly, EkNsC1  Ns kp  2p Ekf .Z1 /  f .Z0 /kp .

258

Chapter 8 Applications of Markov chains to embeddability problems

Proof of Theorem 8.51. Let ¹Zs ºtsD0, f , ¹Ms ºtsD0 and ¹Ns ºtsD0 be as in Lemma 8.55. Assume first that t is even, and write t D 2m. Summing the identity (8.43) over s D 1, 3, 5, : : : , 2m  1 we get m X

f .Z t /  f .Z0 / D

.M2k  M2k1 / 

kD1

m X

.N2k  N2k1 /.

kD1

Observe that the sequences ²X s

³m .M2k  M2k1 / sD1

kD1

and

²X s

³m .N2k  N2k1 / sD1

kD1

are martingales with respect to filtrations ¹F2k ºm and ¹Fz2k ºm , respectively. ApkD1 kD1 plying Theorem 8.54 to these martingales (and the convexity of the function X 7! kXkp in the first step) we get Ekf .Z t /  f .Z0 /kp m m X p X p      2p1 E  .M2k  M2k1 / C 2p1 E  .N2k  N2k1 / kD1



2p1 Sp .X/p 2p1  1

kD1 m X



E kM2k  M2k1 kp C E kN2k  N2k1 kp



kD1

2p1 Sp .X/p t pC1  2 E kf .Z1 /  f .Z0 /kp .  2p1  1 2 When t is odd, apply the above reasoning at time t  1, to get E kf .Z t /  f .Z0 /kp  2p1 E kf .Z t 1/  f .Z0 /kp C 2p1 E kf .Z t /  f .Z t 1 /kp   3p2 2 Sp .X/p p1 E kf .Z1 /  f .Z0 /kp  .t  1/ C 2 2p1  1 23p2 Sp .X/p t E kf .Z1 /  f .Z0 /kp .  2p1  1 This concludes the proof of the theorem.

Section 8.6 Distortions of embeddings into uniformly smooth spaces

259

8.6 Applications of Markov type to lower estimates of distortions of embeddings into uniformly smooth Banach spaces Our purpose in this section is to derive some consequences of the results proved in the previous section. Corollary 8.56. Let X be a p-uniformly smooth Banach space, 1 < p  2, and G be a finite k-regular, k  3, graph of girth g D g.G/. Then cX .G/  C.p, X, k/.bg.G/=2c/

p1 p

.

Proof. We consider the standard Markov chain ¹Z t º1 t D0 (see Example 8.6) on vertices of G. Assume that the initial distribution is uniform on the set of all vertices of G. As in the proof of Theorem 8.14, for every 1  t  bg=2c  1 we have EdG .Z t C1 , Z0 / 

k2 .t C 1/. k

Since p > 1 and E.dG .Z1 , Z0 //p D 1, this inequality implies E.dG .Z t C1 , Z0 //p  .EdG .Z t C1 , Z0 //p   k2 p  .t C 1/p E.dG .Z1 , Z0 //p . k

(8.48)

Let f : V .G/ ! X be such that dG .u, v/  kf .u/  f .v/k  dG .u, v/.cX .G/ C "/. Using Theorem 8.51 we get   k2 p .t C 1/p E.dG .Z1 , Z0 //p k (8.48)

 E.dG .Z t C1 , Z0 //p

(8.49)

 Ekf .Z t C1/  f .Z0 /kp

Theorem 8.51



.Mp .X//p .t C 1/Ekf .Z1 /  f .Z0 /kp

(8.49)

 .Mp .X//p .t C 1/.cX .G/ C "/p E.dG .Z1 , Z0 //p .

We let t D bg=2c  1 and " # 0. We get   p1 k2 .bg=2c/ p . cX .G/  kMp .X/

(8.49)

260

Chapter 8 Applications of Markov chains to embeddability problems

Corollary 8.57. Let X be a p-uniformly smooth Banach space, 1 < p  2, and G be a finite k-regular graph .k  3/ with n vertices and the absolute spectral gap of its normalized adjacency matrix equal to 1  ˛. Then cX .G/  C.p, X, k, ˛/.ln n/

p1 p

.

Proof. The proof follows the same lines as for Corollary 8.56. We consider the standard Markov chain ¹Z t º1 t D0 (see Example 8.6) on vertices of G. First we assume that the initial distribution u is concentrated in some vertex u. Then the distribution of Z t will be uAtG , where AG is the normalized adjacency matrix of G. Our first observation is that the distributions u AtG approach the uniform distribution  very fast. In fact, since the matrix A is symmetric we can write everything for transposes (so that it looks more familiar): p kAt  zu   z k1  n kAt  zu   z k2 p t D n kA . z u  z /k2 p t  n ˛ k z u  z k2 p t  2 n˛ , where transposes are denoted using z. Let t D dlog1=˛ .2n3=2 /e. Then z u  z k1  kAt 

1 . n

This means that the probability that Z t is in the set of n2 closest to u vertices of G is  12 C n1 . Now we recall the computation  on page  110 which implies that there are at most n2 vertices in G at distance  logk n2  1 to u. This immediately implies that  EdG .Z t , Z0 / 



n 1 1  logk  1  ct EdG .Z1 , Z0 /, 2 n 2

where the constant c > 0 depends only k and ˛ provided n > 4. Since p > 1, we get E.dG .Z t , Z0 //p  c p t p E.dG .Z1 , Z0 //p ,

(8.50)

where we use the fact that dG .Z1 , Z0 / D 1 almost everywhere. So the standard Markov chain on an expander moves fast from its original position for the chosen value of t . It is clear from the definition of the Markov chain that this estimate remains true if we assume that the initial distribution is uniform. We complete the proof as in Corollary 8.56. For the reader’s convenience we provide the details. Let f : V .G/ ! X be such that dG .u, v/  kf .u/  f .v/k  dG .u, v/.cX .G/ C "/.

(8.51)

261

Section 8.7 Exercises

Using Theorem 8.51 we get (8.50)

c p t p E.dG .Z1 , Z0 //p  E.dG .Z t , Z0 //p (8.51)

 Ekf .Z t /  f .Z0 /kp

Theorem 8.51



.Mp .X//p t Ekf .Z1 /  f .Z0 /kp

(8.51)

 .Mp .X//p t .cX .G/ C "/p E.dG .Z1 , Z0 //p .

Recalling that our choice of t is t D dlog1=˛ .2n3=2 /e and letting " # 0, we get   p1 p1 c cX .G/  .dlog1=˛ .2n3=2 /e/ p  C.p, X, k, ˛/.ln n/ p . Mp .X/ Remark 8.58. Observation (b) on page 198 shows that the result of Corollary 8.57 can be extended to all expander families.

8.7 Exercises Exercise 8.59. A Markov chain with transition matrix A D ¹ai ,j ºni,j D1 is called irreducible if for every i, j 2 ¹1, : : : , nº there exists m 2 N such that the .i, j /-entry of Am is strictly positive. Show that a reversible distribution of a reversible irreducible Markov chain is uniquely determined by the chain. Exercise 8.60. Consider a Markov chain with the transition matrix 3 2 0 13 0 23 6 2 0 1 07 3 7 6 3 4 0 2 0 1 5. 3 3 1 2 3 0 3 0 (1) Show that this Markov chain is not reversible. (2) Find a stationary distribution for it. Exercise 8.61. Give an example of a reversible Markov chain which has a stationary distribution which is not a reversible distribution. There is a hint to this exercise. Exercise 8.62. Let X be a uniformly convex Banach space and Z be a closed subspace of X. Show that the quotient X=Z is also uniformly convex. Exercise 8.63. Prove that a uniformly smooth Banach space is reflexive. One of the ways is outlined in the Hints to exercises section.

262

Chapter 8 Applications of Markov chains to embeddability problems

8.8 Notes and remarks Basic material on Markov chains can be found in Chapters 2–6 of [191]. A much more extensive introduction can be found in the book [281]. The idea of using the Markov chains for Lipschitz extension problems is due to Ball [36] who introduced the notion of Markov type and proved that Hilbert spaces have Markov type 2. Our proof of this fact follows [292] and [335]. Theorem 8.14 and its proof are due to Linial, Magen, and Naor [292]. The conjecture that graphs with large girth have large Euclidean distortion goes back to [290]. The authors of [292] present two proofs of this conjecture. In addition to the proof presented in Section 8.3, they give a proof based on the PSD-criterion (this criterion is described in Section 4.4 of this book). The proof uses the description of the adjacency matrix for k-distant vertices in a graph of girth 2k C 1 (similar to the description of Fr .A/ in Section 5.12 ) and analysis of properties of relevant orthogonal polynomials. There are some related and still open problems, see Section 11.3 for their discussion. The paper [42] contains some extensions of Theorem 8.14. The theory of uniformly convex spaces is a classical direction of Banach space theory. It was initiated by Clarkson [101], who proved that the spaces Lp , 1 < p < 1 are uniformly convex. The fact that ı`2 ."/  ıY ."/ for all " 2 Œ0, 2 and for all Banach spaces Y of dimension at least 2 is due to Nordlander [342]. The notion of a uniformly smooth space was introduced by Day [115], who used the term uniformly flattened space. In the mentioned paper Day discovered the duality between uniform convexity and smoothness. Theorem 8.26 is due to D. Milman [330] and Pettis [372]. The presented proof is a version of the proof in [119, p. 49], where it is attributed to J. Rainwater. To the best of my knowledge this John Rainwater is a non-existing person and the attribution means that the proof evolved in the coffee room of the math department at the University of Washington and too many took part for it to be published privately. The Goldstine theorem is from [171]. Its statement: .BX / is weak dense in BX  . It can be found (with a proof different from the original) in [130, Theorem V.4.5]. Estimates for the modulus of uniform convexity of spaces Lp implying the statement of Example 8.30 can be found in Hanner [193] and Kadets [228]. Some of these estimates were known already to Clarkson [101]. Propositions 8.18 and 8.24 are from [39, Proposition p 7], see also [148]. It was shown in [39] (see also [148]) that K2.Lp /  1= p  1 for 1 < p  2, and p S2.Lp /  p  1 for 2  p < 1. The paper [39] also contains estimates for quniform convexity and p-uniform smoothness constants of Schatten classes. There is another way of proving Remarks 8.17 and 8.23: In [149], [148] (see also [288], Theorem 1.e.16) it is shown that if a Banach space X has modulus of convexity of power type q then X also has cotype q. Similarly, if X has modulus of smoothness of power type p then X has type p. Combining this result with our knowl-

Section 8.8 Notes and remarks

263

edge of possible values of type-cotype (see Definition 2.37 and Remark 2.41) we get the results mentioned in the remarks. Observe that L1 has cotype 2 (see Theorem 2.53), but it is clearly not uniformly convex (it is non-reflexive). Results on duality between uniformly convex and uniformly smooth spaces were developed in [115], [283], [148], [39]. Theorem 8.28 was obtained in [283] (we use a slightly shorter presentation of [148, p. 122]), Theorem 8.27 was obtained in [39]. Remark 8.29 is discussed in [148, p. 122]. This remark is related to the fact that ıX ."/ could be a non-convex function, as was shown by Liokumovich [294]. Theorem 8.36 was obtained in [135]. Theorem 8.37 was obtained in [377]. These result were built upon the theory of superreflexive spaces created by James [213, 214]. Another, more elementary approach to the proof of Theorem 8.37 was suggested by Maurey and published in [50, Part 4, Chapter IV], see also [117, Chapter IV]. Our proof of Theorem 8.38 follows the approach of Lancien [265], we also use the presentation of Lancien’s proof in [166]. The weak Szlenk derivative is the analogue of the derivative introduced by Szlenk [420] for the weak topology. The uniform convexity of the space Lp .X/ for uniformly convex X and 1 < p < 1 was established in [114]. We follow the proof of McShane [322]. The dual space of Lp .X/ (Theorem 8.48) was determined by Bochner and Taylor [67]. The description holds under more general condition than the uniform convexity of X, see [121, Chapter IV, Section 1]. There are many other results on uniform convexity and smoothness, some of them are contained in the accounts of the theory of uniformly convex and uniformly smooth spaces which can be found in [50], [117], [119], [141], [288], and [383], to which we refer readers for further information. Theorem 8.51 is due to Naor, Peres, Schramm, and Sheffield [335, p Theorem 2.3]. The immediate consequence of this theorem is that M2 .Lp /  4 p  1 for every 2  p < 1. The proof of Lemma 8.52 is a slight modification of the proof of Lemma 3.1 in [36]. Theorem 8.54 was first proved in [377] (without the explicit constant), it is a simple corollary of Lemma 8.52 (see also Proposition 3.3 in [36]). Lemma 8.55 is motivated by the continuous martingale decompositions of stochastic integrals constructed in [305]. We would like to turn the attention of readers who are interested in embeddings into non-Banach metric spaces to the many classes of metric spaces that are known to have Markov type: trees, Gromov hyperbolic spaces, negatively curved Riemannian manifolds [335] (see this paper for precise statements of the results), planar graph metrics (that is, planar graphs with their graph distances) [123], Alexandrov spaces of nonnegative curvature [347].

264

Chapter 8 Applications of Markov chains to embeddability problems

8.9 Hints to exercises To Exercise 8.61. One can show that such a Markov chain cannot be irreducible. To Exercise 8.63. One can use the following plan: 

Prove the following version of Theorem 8.28: "

 ıX  ."/ , . > 0/. X . / D sup 0"2 2



Derive that (X is uniformly smooth))(X  is uniformly convex).



Use Theorem 8.26.



Show that (X  is reflexive))(X is reflexive).

Chapter 9

Metric characterizations of classes of Banach spaces 9.1 Introduction In connection with problems of embeddability of metric spaces into Banach spaces it would be interesting to find metric characterizations of well-known classes of Banach spaces. By a metric characterization in the most general sense we mean a characterization which refers only to the metric structure of a Banach space and does not involve the linear structure. Later we shall define more specific forms of metric characterization. The most important impetus to the development of the theory of metric characterizations was given by the following result. Theorem 9.1 (Ribe theorem). Let Z and Y be Banach spaces. If Z and Y are uniformly homeomorphic, then Z and Y are crudely finitely representable in each other. Recall (see Definition 2.28) that X is crudely finitely representable in Y if 9C < 1 8n 2 N, for an arbitrary n-dimensional subspace Xn  X there is an n-dimensional subspace Yn  Y and a linear map Tn : Xn ! Yn such that kTnk  kTn1 k  C . Saying X and Y are uniformly homeomorphic we mean that there exists a uniformly continuous bijection F : X ! Y such that the map F 1 is also uniformly continuous. At this point three different proofs of the Ribe theorem are available. We present one of them in Section 9.2. Many important properties P of Banach spaces are so-called hereditary local isomorphic invariants. This means that if a Banach space X has property P and Y is crudely finitely representable in X, then Y also has property P. Remark 9.2. The complicated terminology is explained as follows: hereditary means that the property is inherited by closed subspaces, local means that it is determined by a finite-dimensional structure, and isomorphic invariant means that it is inherited by isomorphic spaces. Some of the properties which we have studied in this book are hereditary local isomorphic invariants. We mean such properties as having type p and having cotype q. In this chapter we shall study one more important hereditary local isomorphic invariant. It is called superreflexivity and it can be defined as being isomorphic to a uniformly convex space. (Possibly it is not immediately clear why this property is inherited by crudely finitely representable spaces, we shall show this in Section 9.3.2.) There are

266

Chapter 9 Metric characterizations of classes of Banach spaces

several other important properties of Banach spaces which are hereditary local isomorphic invariants. An immediate consequence of the Ribe theorem is that each such property is preserved under uniform homeomorphisms. Therefore it is feasible to describe the property in terms of the metric structure of Banach spaces having it. If we compare this statement, for example, with the definitions of type and cotype, we see that the original definitions involve not only the metric structure, but also inequalities between norms of linear combinations of vectors in the space. That is, the original definitions of these important hereditary local isomorphic invariants cannot be regarded as given in terms of metric structure. Bourgain found an equivalent definition of superreflexivity in terms of metric structure and formulated the program of searching for equivalent definitions of other hereditary local isomorphic invariants in terms of metric structure with the next step consisting in studying these metrical concepts in general metric spaces in an attempt to develop an analogue of the linear theory. This program came to be known as Ribe program. We describe some results belonging to the Ribe program below and give more references in the Notes and Remarks section.

9.2 Proof of the Ribe theorem through Bourgain’s discretization theorem We are going to present the proof of the Ribe theorem based on Bourgain’s discretization theorem. Recall that the infimum of distortions of bilipschitz embeddings of a metric space X into a metric space Y is denoted cY .X/. Let X be a finite-dimensional Banach space and Y be an infinite-dimensional Banach space. Definition 9.3. For " 2 .0, 1/ let ıX,!Y ."/ be the supremum of those ı 2 .0, 1/ for .Nı / . The function ıX,!Y ."/ is which every ı-net Nı in BX satisfies cY .X/  cY1" called the discretization modulus for embeddings of X into Y . At the moment it is not even clear that the discretization modulus is defined for any " 2 .0, 1/. Theorem 9.4 (Bourgain’s discretization theorem). There exists C 2 .0, 1/ such that for every two Banach spaces X, Y with dim X D n < 1 and dim Y D 1, and every " 2 .0, 1/, we have Cn (9.1) ıX,!Y ."/  e .cY .X/="/ . Remark 9.5. Bourgain’s discretization theorem is often quoted with the conclusion that if ı is at most as large as the right-hand side of (9.1) and Nı is a ı-net of BX , then X admits a linear embedding into Y whose distortion is at most cY .Nı /=.1  "/. We

Section 9.2 Proof of the Ribe theorem through Bourgain’s discretization theorem

267

are going to prove the theorem in this linear form. It is worth mentioning that the wellknown differentiation argument shows that the bilipschitz version implies the linear version, see Notes and Remarks, item (3) on page 306. Observation 9.6. Bourgain’s discretization theorem implies the Ribe theorem. Lemma 9.7. A uniform homeomorphism f : Z ! Y between two Banach spaces is bilipschitz for large distances, that is, for every d 2 .0, 1/ there exist L1, L2 2 .0, 1/ such that L1kx  ykZ  kf .x/  f .y/kY  L2kx  ykZ whenever x, y 2 Z satisfy kx  ykZ  d . Proof. First we observe that the modulus of continuity of maps between Banach spaces are subadditive in the sense that !f .t C s/  !f .t / C !f .s/.

(9.2)

In fact, if u, v 2 Z are such that ku  vk  t C s, then there is w 2 Z such that ku  wk  t and kw  vk  s. Therefore kf .u/  f .v/k  kf .u/  f .w/k C kf .w/  f .v/k  !f .t / C !f .s/. Inequality (9.2) follows. Since f is uniformly continuous, the quantity !f .d / is finite for some d > 0. Using (9.2) we get that !f .d / is finite for all d 2 .0, 1/. Let L2 D 2!f .d /=d . Let x, y 2 Z be such that kx  yk  d , so let kx  yk D kd C r , where k 2 N and r 2 Œ0, d /. Since x and y are in a Banach space, there are vectors v0 D x, v1 , : : : , vk , vkC1 D y in Z such that kvi 1  vi k D d for i D 1, : : : , k, and kvk  vkC1k D r . We have kf .vi 1 /  f .vi /k  !f .d / for i D 1, : : : , k C 1, therefore kf .x/  f .y/k  .k C 1/!f .d /  kL2 d  L2kx  yk. Let L1 > 0 be any number satisfying !f 1 .L1 d /  d2 . Then it is clear that kx  yk  d implies kf .x/  f .y/k  L1d . Therefore kf .x/  f .y/k D k.L1d / C r for some k 2 N and r 2 Œ0, L1 d /. Let u0 D f .x/, u1 , : : : , uk , ukC1 D f .y/ be such that kui 1  ui k D L1d for i D 1, : : : , k, and kuk  ukC1k D r . Then  kf 1 .ui 1 /  f 1 .ui /k  d2 for i D 1, : : : , k C 1. Therefore kx  yk  .kC1/d 2 1 kf .x/  f .y/k. L1 Remark 9.8. The same argument works for all metric spaces having something like geodesics of a suitable kind. Proof of Observation 9.6. Actually we are going to show that the fact that ıX,!Y ."/ is positive for some " > 0 implies the Ribe theorem (Theorem 9.1). So let Z and Y be uniformly homeomorphic infinite-dimensional Banach spaces. By Lemma 9.7, if X  Z is any finite-dimensional subspace and d > 0, then d nets in rBX embed into Y with distortion bounded by some constant C which does

268

Chapter 9 Metric characterizations of classes of Banach spaces

not depend on the choice of a finite-dimensional subspace X  Z and r for every r > d . By rescaling, we get that the same assertion holds for ı-nets in BX for every ı 2 .0, 1/. By Bourgain’s discretization theorem (see also Remark 9.5), X admits a C . Of course the same linear embedding into Y with distortion bounded above by 1" argument applies also to finite-dimensional subspaces of Y .

9.2.1 Proving Bourgain’s discretization theorem. Preliminary step: it suffices to consider spaces with differentiable norm The linear map which we are going to construct in our proof will be obtained as a derivative of a non-linear map. In this connection, it will be convenient for us to assume that the norm of X is differentiable everywhere except 0. The purpose of this section is to show that we may assume this without loss of generality because any norm on Rn can be arbitrarily well approximated by an infinitely differentiable norm. Proposition 9.9. For each norm k  k on Rn and each ˛ > 0 there exists a norm jjj  jjj on Rn such that 8x 2 Rn

.1  ˛/kxk  jjjxjjj  .1 C ˛/kxk

and jjjxjjj is an infinitely differentiable function on Rn everywhere except 0. Proof. Denote by W the normed space .Rn , k  k/. Let ¹xi ºm i D1 be an ˛-net on ¹x 2    W be such that kx k D 1 and xi .xi / D 1. Then Rn : kxk D 1º. Let ¹xi ºm i D1 i def

kxk1 D max jxi .x/j 1i m

is a norm on Rn . It is easy to check that it satisfies 8x 2 Rn

.1  ˛/kxk  kxk1  kxk.

Let p 2 N be an even integer satisfying m1=p  .1 C ˛/. Then for each collection ¹ai ºm i D1 of real numbers the inequality max jai j 

1i m

m X

!1 p

ai

p

 .1 C ˛/ max jai j 1i m

i D1

holds. Therefore def

jjjxjjj D

m X .xi .x//p i D1

is a desired norm. All verifications are immediate.

! p1

Section 9.2 Proof of the Ribe theorem through Bourgain’s discretization theorem

269

9.2.2 First step: picking the system of coordinates From now on .X, k  kX / will be a fixed n-dimensional normed space (n > 1), with unit ball BX D ¹x 2 X : kxkX  1º and unit sphere SX D ¹x 2 X : kxkX D 1º. We need the following classical result. Theorem 9.10 p (John’s theorem). Let X be an n-dimensional normed space. Then d.X, `n2 /  n, where d denotes the Banach–Mazur distance. Proof. The proof goes as follows. Let E be an ellipsoid of maximal volume in BX . (The volume in X depends on the choice of a basis, so it is defined up to a positive multiplicative constant. Therefore the notion of an ellipsoid of maximal volume inside a compact set in a finite-dimensional space is well-defined.) We pick the system of coordinates in X for which E is the Euclidean ball. It is clear that to prove Theorem p 9.10 it suffices to show that the multiple n E contains BX . Assume the contrary. Let e1 be a p vector for which the Euclidean norm introduced above is equal to 1 and there is d > n such that de1 2 BX . We pick an orthonormal basis ¹ei ºniD1 for the described Euclidean structure in such a way that e1 is the first vector of this basis. Let ¹ti ºniD1 be the coordinates of a vector with respect to this basis. We are going to show that there exist numbers a > 1 and b < 1 such that the ellipsoid ´ n μ  2 X n  2 X t1 ti CD ti ei : C 1 (9.3) a b i D1

i D2

is contained in BX and has a larger volume than E. It is easy to verify that to achieve the second goal we need to have ab n1 > 1. Lemma 9.11. Let a > 1 and b 2 .0, 1/ be such that   1 a2 2 C b 1  2  1. d2 d

(9.4)

(9.5)

Then the ellipsoid C given by (9.3) is contained in BX . Proof. Observe that the convex hull of de1 and E is contained in BX . We are going to show that C is contained in the convex hull of E [ ¹de1 º. Clearly it suffices to consider the two-dimensional case (just the plane containing the vectors e1 and e2 ). We circumscribe about E a rhombus R with one of the diagonals being the line segment joining de1 and de1 . Easy computation shows that the other diagonal is of length q 2

2 dd2 1 . It is easy to check that since we are concerned only with the case where b < 1, it suffices to check that the ellipse C is contained in the rhombus.

270

Chapter 9 Metric characterizations of classes of Banach spaces

 2  2 Now we make one more linear transformation, mapping the ellipse ta1 C tb2  1 onto the unit disc in the plane. The rhombus considered q above is mapped onto the 2

rhombus with lengths of diagonals equal to 2 da and b2 dd2 1 . The condition for such a rhombus to contain the unit disc is obtained by computing its area in two different ways, and is s s  2  2 d 1 d2 d2 d 1 C  2   . a b d 1 a b d2  1 Squaring both sides and simplifying we get (9.5). By Lemma 9.11 it suffices to show that there are numbers a 2 .1, 1/ and b 2 .0, 1/ such that the conditions (9.4) and (9.5) are satisfied. For a  1 consider the function  2  n1 d  a2 2 f .a/ D a . d2  1 This function represents the quantity in the left-hand side of (9.4) for a family of ellipsoids which, by Lemma 9.11, are contained in BX . Observe that f .1/ D 1. To complete the proof it suffices to show that f 0 .1/ > 0. This is a straightforward computation:  n1 1  2 d  a2 2 d 2  na2 0 .  f .a/ D d2  1 d2  1 We have f 0 .1/ > 0 because d 2 > n. Now we return to the proof of Theorem 9.4. We identify X with Rn . By John’s theorem (Theorem 9.10) we assume without loss of generality that the standard Euclidean norm k  k2 on Rn satisfies 8 x 2 X,

1 p kxk2  kxkX  kxk2. n

(9.6)

In what follows, the volume of a Lebesgue measurable set A  Rn will be denoted vol.A/. Recall that the statement which we are proving is: the existence of a low-distortion embedding of a ı-net of BX into Y implies the existence of a linear embedding of X into Y with a suitable bound. It is clearly enough to prove the statement for sufficiently small ı and ". Fix ", ı 2 .0, 1=8/ and let Nı be a fixed ı-net in BX . We also fix D 2 .1, 1/, a Banach space .Y , k  kY /, and a mapping f : Nı ! Y satisfying 1 kx  ykX  kf .x/  f .y/kY  kx  ykX . (9.7) D By translating f , we assume without loss of generality that f .Nı /  2BY . Our goal Cn will be to show that provided ı is small enough, namely ı  e .D="/ , there exists 1 an injective linear operator T : X ! Y satisfying kT k  kT k  .1 C 12"/D. 8 x, y 2 Nı ,

Section 9.2 Proof of the Ribe theorem through Bourgain’s discretization theorem

271

9.2.3 Second step: construction of a Lipschitz almost-extension The next step is to construct a mapping F : Rn ! Y that is a Lipschitz almostextension of f , that is, F is Lipschitz and on Nı , it takes values that are close to the corresponding values of f . As a preliminary step we prove the following lemma. Lemma 9.12. Let K  Rn be a convex set and fix , , L 2 .0, 1/. Assume that we are given a mapping h : K C BX ! Y satisfying kh.x/  h.y/kY  L .kx  ykX C /

(9.8)

for all x, y 2 K C BX . Define H : K ! Y by Z 1 H.x/ D n h.x  y/dy. vol.BX / BX   Then kH.x/  H.y/kY  L 1 C n 2 kx  ykX for all x, y 2 K. Proof. Consider two points in K. Translating K if needed, we may assume that one of the points is the origin and denote the other one by x. Put B D ¹z 2 X : kzk  º and Bx D ¹z 2 X : kz  xk  º. Then n vol.BX / D volB and Z Z 1 1 h.y/dy  h.y/dy H.0/  H.x/ D volB B volB Bx Z  Z 1 D h.y/dy  h.y/dy . volB BnBx Bx nB Consider the family ¹L t D t C Rxº t 2X of straight lines parallel to x. Each L t intersect the sets BnBx and Bx nB in intervals of the same length (which does not exceed kxk, and is exactly kxk in the case where L t intersects B \ Bx ). Observe that here by the length of a line segment Œu, v we mean ku  vkX . Let c : .BnBx / ! .Bx nB/ be the transformation which translates each interval L t \ .BnBx / onto the interval L t \ .Bx nB/ along L t . It is clear that it is a measurepreserving transformation. Fix a hyperplane Z  X such that x … Z and let P be the projection from X to Z along x, and w.t / .t 2 X/ be the length of L t \ B. The Lebesgue measure on X is the product of any two appropriately normalized measures on Z and on the line R  x. We normalize the measure on R  x by setting the measure of Œ0, x to be equal kxk. This and the choice of the standard Lebesgue measure on Rn determine the Lebesgue measure, which we denote , on Z, and we have Z w.z/d.z/ D volB. P .B/

Consider B as the disjoint union of two sets (“wide” and “narrow”): W : D ¹t 2 B : L t \ B \ Bx ¤ ;º, N : D ¹t 2 B : L t \ B \ Bx D ;º.

272

Chapter 9 Metric characterizations of classes of Banach spaces

Clearly we have N  .BnBx /. It is easy to see that the length of the intersection L t \ .BnBx / is equal to kxk for t 2 W and is equal to w.t / for t 2 N . Also we have ´ w.t /, t 2 W \ .BnBx /, kt  c.t /k D kxk, t 2 N . We get  Z Z    h.y/dy  h.y/dy  volB kH.0/  H.x/k D   BnBx Bx nB  Z   .h.y/  h ı c.y//dy  D   BnBx Z kh.y/  h ı c.y/kdy  BnBx Z (9.8)  L .ky  c.y/k C / dy BnBx   Z ky  c.y/kdy , D L  vol.BnBx / C

(9.9)

BnBx

where the second equality holds because c is a measure preserving map of BnBx onto Bx nB. The integral in the last line will be estimated separately on N and on W \ .BnBx /: Z Z N

Z

ky  c.y/kdy D

W \.BnBx /

N

kxkdy D kxkvolN ,

Z

ky  c.y/kdy D

w.y/dy Z

D

W \.BnBx /

w.t /kxkd.t / P .W \.BnBx //

Z

D kxk

w.t /d.t / P .W \.BnBx //

D kxkvolW , where the second equality follows from the normalization of measures and from the fact that each intersection of W \ .BnBx / and L t is an interval of length kxk. Hence Z kt  c.t /kdy.t / D kxk.volW C volN / D kxkvolB. (9.10) BnBx

Now we estimate vol.BnBx / in terms of kxk. Here we use the easily verified inclusion:     x kxk / x kxk .BnBx / [ .Bx nB/  B , C B ,  . 2 2 2 2

Section 9.2 Proof of the Ribe theorem through Bourgain’s discretization theorem

273

This inclusion in combination with vol.BnBx / D vol.Bx nB/ implies      volB kxk n kxk n   C vol.BnBx /  2 n 2 2 nvolB kxk C o.kxk/ as kxk ! 0. D 2 We combine this estimate with (9.9) and (9.10), and get n

kH.0/  H.x/k  L 1 C kxk C o.kxk/. 2 Since K is convex, we can derive the desired estimate from this estimate using the same argument as in Lemma 9.7: consider a sequence of points v0 D x, v1 , : : : , vm1 , vm D y in K such that vi  vi 1 D yx for i D 1, : : : , m. Then m kH.y/  H.x/k  lim sup m!1

 lim sup m!1

m X

kH.vi /  H.vi 1 /k

i D1 m  X i D1

  n

1 kvi  vi 1 k C o L 1C 2 m

n

DL 1C ky  xk. 2

Now we turn to the construction of a Lipschitz almost-extension of f mentioned before Lemma 9.12. " , then there exists a map F : Rn ! Y which is differentiable Lemma 9.13. If ı < 4n everywhere on Rn and has the following properties.

(a) F is supported on 3BX . (b) kF .x/  F .y/kY  8kx  ykX for all x, y 2 Rn . (c) kF .x/  F .y/kY  .1 C "/ kx  ykX for all x, y 2 12 BX . (d) kF .x/  f .x/kY 

13nı "

for all x 2 Nı .

Proof. We deduce Lemma 9.13 from Lemma 9.12 using partition of unity argument. Let ¹p : Rn ! Œ0, 1ºp2Nı be a family of smooth functions satisfying   P ı n p2Nı p .x/ D 1 for all x 2 1 C 2 BX and p .x/ D 0 for all .p, x/ 2 Nı  R with kx  pkX  2ı. One of the standard constructions of such functions is obtained by taking a smooth function : Rn ! Œ0, 1 which equals 1 on 32 BX and vanishes

274

Chapter 9 Metric characterizations of classes of Banach spaces

outside 2BX , and defining p .x/ D ..x  p/=ı/ for .p, x/ 2 Nı  Rn . Then we order elements of Nı D ¹p1, p2 , : : : , pN º and define p1 D

p1 ,

pj D

pj

jY 1

.1 

pi /

for

j 2 ¹2, : : : , N º.

i D1

It is easy to verify (for example, by induction) that the identity Y X p D 1  .1  p /

p2Nı

p2Nı



holds. Since every x 2 1 C ı2 BX satisfies kx  pkX  32 ı for some p 2 Nı , this

P implies p2Nı p .x/ D 1 for each x 2 1 C ı2 BX .   We define g : 1 C ı2 BX ! Y by X p .x/f .p/. (9.11) g.x/ D p2Nı

Let ˇ : Œ0, 1/ ! Œ0, 1 be a differentiable 2-Lipschitz non-increasing function which satisfies ˇ.t / D 1 for t 2 Œ0, 1 and ˇ.t / D 0 for t 2 Œ2, 1/. Let r : X ! Œ1, 1/ be a differentiable function satisfying the conditions r .x/ D 1 for x 2 BX and kxkX    r .x/  kxkX 1 C ı2 for x 2 Rn nBX . Now we introduce the map h : Rn ! Y by



h.x/ D ˇ.kxkX /g

 x . r .x/

(9.12)

It is clear that h is everywhere differentiable (recall that we assume that kxkX is differentiable  everywhere except 0). Observe that our definitions imply that for x 2  1 C ı2 BX we have X X p .x/ D 1 and g.x/ D p .x/f .p/. p2Nı \.xC2ıBX /





p2Nı \.xC2ıBX /

Therefore for x, y 2 1 C ı2 BX we have X p .x/f .p/  g.x/  g.y/ D p2Nı \.xC2ıBX /

D

X

p2Nı \.xC2ıBX / q2Nı \.yC2ıBX /

X

q .y/f .q/

q2Nı \.yC2ıBX /

p .x/q .y/.f .p/  f .q//.

(9.13)

Section 9.2 Proof of the Ribe theorem through Bourgain’s discretization theorem

275

Since for each nonzero term in the last sum we have kp  qk  kx  yk C 4ı, combining (9.7) and (9.13) we get   ı 8x, y 2 1 C (9.14) BX , kg.x/  g.y/kY  kx  ykX C 4ı. 2 For x, y 2 BX we have h.x/ D g.x/ and h.y/ D g.y/, therefore 8x, y 2 BX ,

kh.x/  h.y/kY  kx  ykX C 4ı.

(9.15)

If x 2 BX and y 2 Rn n BX , we use f .Nı /  2BY and the fact that ˇ is 2-Lipschitz, and get          y   C .1  ˇ.kykX // g y  kh.x/  h.y/kY   g.x/  g    r .y/ Y r .y/ Y   (9.14)  y     x  r .y/  C 4ı C 2 .kykX  1/ sup kf .p/kY p2Nı X     y   kx  ykX C  y  r .y/  C 4ı C 4 .kykX  1/ X  kx  ykX C 5.kykX  1/ C 4ı. Since kykX  1  kx  ykX C kxkX  1  kx  ykX , it follows that 8x 2 BX , 8y 2 Rn n BX ,

kh.x/  h.y/kY  6kx  ykX C 4ı.

(9.16)

If x, y 2 Rn n BX then

      y  x  ˇ.kxkX /  g kh.x/  h.y/kY   g  r .x/ r .y/ Y     y   jˇ.kxkX /  ˇ.kykX /j  C g r .y/ Y   (9.14)  x y   C 4ı C 4 kx  yk     X r .x/ r .y/ X       x  y   C x  x      kxk  kykX X r .x/ kxkX X   X  y y   C   r .y/ kyk  C 4ı C 4 kx  ykX X X

ı ı  2kx  ykX C C C 4ı C 4 kx  ykX 2 2 D 6kx  ykX C 5ı. (See Exercise 6.27 in connection with the last inequality.)

(9.17)

276

Chapter 9 Metric characterizations of classes of Banach spaces

Set D 2nı=" 2 .0, 1=2/ and define for x 2 Rn , Z 1 h.x  y/dy. F .x/ D n vol.BX / BX

(9.18)

Since h is differentiable everywhere and its derivative may be assumed to be continuous, it follows from (9.18) that F is differentiable everywhere. Since h is supported on 2BX , the function F is supported on .2 C /BX  3BX , therefore the condition (a) of Lemma 9.13 is satisfied. Due to (9.15), (9.16), and (9.17), an   application of Lemma 9.12 with K D Rn , L D 6 and  D ı shows that F is 6 1 C "4 Lipschitz on Rn , proving the assertion (b) of Lemma 9.13. Due to (9.15), an application of Lemma 9.12 with K D .1  /BX shows that F is .1 C "/-Lipschitz on .1  /BX 12 BX . This establishes the assertion (c) of Lemma 9.13. To prove assertion (d) of Lemma 9.13, fix x 2 Nı . Then, Z 1 kF .x/  h.x/kY  n kh.x  y/  h.x/kY dy vol.BX / BX (9.19) (9.15) & (9.16)



6 C 4ı.

Also,

X

kh.x/  f .x/kY 

kf .p/  f .x/kY p .x/

p2Nı



max

p2Nı \.xC2ıBX /

kf .x/  f .p/kY

(9.20)

(9.7)

 2ı.

Recalling that D 2nı=", assertion (d) of Lemma 9.13 follows from (9.19) and (9.20), if n  6".

9.2.4 Third step: further smoothing of the map using Poisson kernels The brief outline of the rest of the proof is: we smoothen the map F further using the convolution with the Poisson kernel. Then we show that the obtained map has a point of differentiability such that the derivative at that point is the desired linear map. Now we provide a very short summary of information on the Poisson kernel, in Notes and Remarks the reader can find references to sources containing more details. The Poisson kernel arises in the solution of the following problem. Let f 2 L2.Rn / (where Rn is endowed with the Lebesgue measure). The problem is to find a function in RnC1 :D ¹.x, t / : x D ¹xi ºniD1 2 Rn , t > 0º satisfying the Laplace equation C @2 u X @2 u C D0 @t 2 @xi2 n

u :D

i D1

such that the functions ¹u.x, t /º t >0 converge to f .x/ in L2.Rn / as t # 0.

(9.21)

Section 9.2 Proof of the Ribe theorem through Bourgain’s discretization theorem

The solution of this problem can be written in the form Z u.x, t / D fy.z/e 2 i hz,xie 2 kzk2t dz, Rn

t > 0,

277

(9.22)

where fy is the Fourier transform of f . One can check that the integral in (9.22) converges absolutely. Straightforward differentiation shows that the function v.x, t , z/ :D P 2 @2 v e 2 i hz,xie 2 kzk2t satisfies the equation @@tv2 C niD1 @x 2 D 0 for each fixed z. Therei

fore the function u given by (9.22) satisfies (9.21). The L2 theory of Fourier transform implies that ¹u.x, t /º t >0 converge to f .x/ in L2 .Rn / as t # 0. It is easy to verify that the formula (9.22) can be rewritten as a convolution Z P t .s/f .x  s/ds, (9.23) u.x, t / D P t  f :D Rn

where the Poisson kernel P t .s/ is given by Z P t .s/ D e 2 i hz,sie 2 kzk2t dz, Rn

t > 0.

(9.24)

Recall that a convolution of two functions on Rn (the same definition is used for any locally compact Abelian group with operations denoted by C and ) is defined by Z f .x  y/g.y/dy. (9.25) f  g.x/ D Rn

Using substitution one can easily see that Z f  g.x/ D g  f .x/ D

Rn

f .y/g.x  y/dy.

(9.26)

Using quite nontrivial classical computation with integrals (see Notes and Remarks for references) one can find an explicit formula for the Poisson kernel:  

nC1 cn t 2 . (9.27) P t .x/ D  nC1  nC1 , where cn D  2 t 2 C kxk2 2 2

Some properties of Poisson kernels: (a) Formula (9.27) shows that P t .x/ > 0. (b) Applying the Fourier inversion formula to (9.24) we get that for each fixed t the Fourier transform Pyt .x/ is equal to e 2 kxk2t . As a special case of this formula we get Z P t .x/dx D Pyt .0/ D 1. (9.28) Rn

278

Chapter 9 Metric characterizations of classes of Banach spaces

(c) The semigroup property P t Ps D P t Cs for t , s > 0. This property can be derived from (b) and the well-known fact that Fourier transform maps convolutions onto products. " and fix a mapping F : Rn ! Y satisfying the Assume from now on that ı < 4n conclusion ofRLemma 9.13. We consider the functions P t  F : Rn ! Y given by P t F .x/ D Rn P t .xy/F .y/dy (it is a convolution of a scalar-valued and a vectorvalued function). Observe that the convolution P t  F is everywhere differentiable (we can differentiate under the integral sign and P t is an everywhere differentiable function). The derivative T D .P t0  F /0 .x/ is a linear map of X into Y . Our goal is to show that there exists t0 2 .0, 1/ and x 2 Rn such that the derivative T D .P t0  F /0 .x/ is injective and satisfies kT k  kT 1 k  .1 C 12"/D. Since the functions P t F converge to F in L2 .Rn , Y / (this property of Poisson kernels mentioned above remains true for vector-valued functions) and F itself is close to a bilipschitz map when restricted to the ı-net Nı , one might expect this to happen for every small enough t . However, so direct a proof of the existence of such T is not known so far. The known proof of existence of t0 is by contradiction, and it does not show how to get a point t0 for which .P t0  F /0 .x/ has the desired properties. The estimate of the norm of .P t0  F /0 .x/ from above is relatively easy, it is based on the following lemma.

Lemma 9.14. Assume that 0 < t
A.RC1/km  F .x/ Y dxd.a/ C m SX Rn By iterating (9.34) we get the estimate Z

Z

    @a PA.RC1/m1  F .x/ dxd.a/ Y SX Z Z     @a PA.RC1/  F .x/ dxd.a/ C 8.m C 1/vol.3BX / . (9.35) > Y m SX Rn Rn

At the same time, F is differentiable everywhere and is 8-Lipschitz. Therefore for every a 2 SX we have k@a F kY  8 everywhere. Now we use the commutativity of

280

Chapter 9 Metric characterizations of classes of Banach spaces

convolution (9.26) and get

 Z   PA.RC1/m1 .y/F .x  y/dy @a PA.RC1/m1  F .x/ D @a Rn Z D PA.RC1/m1 .y/@a F .x  y/dy Rn

D .PA.RC1/m1  @a F /.x/. Since F is supported on 3BX , it follows that Z     @a PA.RC1/m1  F .x/ dx Y Rn Z     PA.RC1/m1  @a F .x/ dx D Y n ZR Z  PA.RC1/m1 .x  y/k@a F .y/kY dxdy Rn Rn Z (9.28) D k@a F .y/kY dy  8vol.3BX /.

(9.36)

3BX

We integrate (9.36) with respect to . Since  is a probability measure we get a contradiction with (9.35) We also need the following well-known lemma on cardinalities of nets in balls of finite-dimensional normed spaces. Lemma 9.18. Let O be an arbitrary subset of the unit ball of an n-dimensional n  normed linear space X. Let ˛ > 0. Then there is an ˛-net for O with cardinality  1 C ˛2 . Proof. Let ¹xi ºM i D1 be a maximal subset in O satisfying kxi  xj k  ˛ for i ¤ M ¯ is an ˛-net. It is easy to see that the open balls Ui D ®j . It is clear that ¹xi º˛i D1 x 2 X : kx  xi k < 2 are disjoint and are contained in ° ˛± U D x 2 X : kxk < 1 C . 2 Therefore

M X

volUi  volU

i D1

or M

˛ n

2 The desired inequality follows.

˛ n volBX  1 C volBX . 2

Now we show how to conclude the proof of Theorem 9.4 using all lemmas stated above. (It will then remain to prove Lemmas 9.15 and 9.14, see Section 9.2.5.)

Section 9.2 Proof of the Ribe theorem through Bourgain’s discretization theorem

281

Proof of Theorem 9.4. Assume that ı 2 .0, 1/ satisfies ı

" .cD="/2n , cD

(9.37)

where c D 300 (this is an overestimate for the ensuing calculation). It is easy to see that since D can be chosen to be (9.37) implies (9.1). We fix an    2cY.X/, estimate n 3D n < . Let  be the uniform probability ."=D/-net F in SX with jFj  1 C 2D " " measure on F. Let $ %    " 5n cD nC1 cD 4n AD , RD  1, m D  1. (9.38) cD " " Applying Lemma 9.17 with these parameters, we get the existence of t 2 .0, 1/ satisfying " .cD="/2n " 5n t  , (9.39) cD cD such that

XZ

a2F

Rn

k@a .P t  F /.x/kY dx 

XZ a2F

Rn

k@a .P.RC1/t

8jFjvol.3BX / .  F /.x/kY dx C m

(9.40)

Straightforward verification shows that for ı satisfying (9.37), R as in (9.38), and any t satisfying (9.39), inequalities (9.30) and (9.31) are satisfied for n  2 (for n D 1 the statement of Bourgain’s discretization theorem is obvious). Thus the conclusion (9.29) of Lemma 9.15 holds true for all a 2 SX and x 2 18 BX . Note that by convexity of the norm and properties of the convolution (commutativity: f g D gf and associativity: f .gh/ D .f g/h) and of the Poisson kernel (positivity and semigroup property) we have for every a 2 SX and every x 2 Rn : k@a .P.RC1/t  F /.x/kY D k@a .PRt  .P t  F //.x/kY D k .PRt  .@a .P t  F /// .x/kY  .k@a .P t  F /kY  PRt / .x/. Thus k@a .P t  F /kY  PRt  k@a .P.RC1/t  F /kY  0 and we have ² ³ " 1 " vol x 2 BX : .k@a .P t  F /kY  PRt / .x/  k@a .P.RC1/t  F /.x/kY  D 8 D  Z   .k@a .P t  F /kY  PRt / .x/  k@a .P.RC1/t  F /.x/kY dx .  Rn

282

Chapter 9 Metric characterizations of classes of Banach spaces

Therefore for each a 2 SX ² 1 vol x 2 BX : 8

"± .k@a .P t  F /kY  PRt / .x/  k@a .P.RC1/t  F /.x/kY  D  Z   D  .k@a .P t  F /kY  PRt / .x/  k@a .P.RC1/t  F /.x/kY dx n " ZR  Z D D k@a .P t  F /.x/kY dx  k@a .P.RC1/t  F /.x/kY dx . " Rn Rn (9.41)

Hence, ² 1 vol x 2 BX : 8

"± 9a 2 F, .k@a .P t  F /kY  PRt / .x/  k@a .P.RC1/t  F /.x/kY  D   Z XZ (9.41) D X  k@a .P t  F /.x/kY dx  k@a .P.RC1/t  F /.x/kY dx " Rn Rn a2F

a2F

D 8jFjvol.3BX /   "  m    (9.38) 16D 3D n " nC1 1 BX  .24/n vol " " cD 8   n  6 16 1  BX D vol 300 25 8   1 < vol BX . 8 (9.40)

(In the inequality where we use (9.38), we also use our estimate for the cardinality of F and the observation that for " 2 .0, 1/, D 2 .1, 1/ and c D 300, the inequality 1 " nC1  2. cD / holds.) Consequently, there exists x 2 18 BX satisfying m 8a 2 F,

.k@a .P t  F /kY  PRt / .x/  k@a .P.RC1/t  F /.x/kY
0 and x 2 X, kxk D 1, there is ı.x, "/ > 0 such that for any vector kyk  1, the inequality kx C yk > 2  ı.x, "/ implies kx  yk < ".

292

Chapter 9 Metric characterizations of classes of Banach spaces

Observation 9.34. The definition of a locally uniformly convex norm can be restated as: For any x 2 SX (the unit sphere of X) and for any sequence ¹xnº satisfying kxn k  1 and limn!1 kx C xnk D 2, the condition limn!1 kx  xn k D 0 holds. Let C be a convex weakly compact set in a separable Banach space X. We are going to replace the original norm k  k of the space X by an equivalent norm jjj  jjj satisfying the following two conditions: 

The norm is locally uniformly convex in the sense of Definition 9.33 (see below).



The new norm jjj  jjj attains its maximum on the set C .

The existence of such a norm is sufficient for our purposes because of the following lemma. Lemma 9.35. The point w at which a locally uniformly convex norm jjj  jjj attains its maximum on C is a strongly exposed point of C . Proof. Let f 2 X  be a functional satisfying jjjf jjj D 1 and f .w/ D jjjwjjj. We are going to show that f satisfies the conditions of Definition 9.29. Assume the contrary. Then there is a sequence ¹wnº in C such that f .wn / ! f .w/, but jjjwn  wjjj ¹ 0. The condition f .wn / ! f .w/ implies lim supn!1 jjjwn C wjjj  2jjjwjjj. Since jjjwn jjj  jjjwjjj,

(9.60)

(9.61)

this implies jjjwn C wjjj ! 2jjjwjjj.

(9.62)

The combination of (9.60), (9.61), and (9.62) contradicts the local uniform convexity of the norm jjj  jjj. Now we turn to the construction of jjj  jjj. We start by constructing an equivalent strictly convex norm k  k1 on X. Let ¹xi º1 i D1 be a dense sequence in the unit sphere   X be functionals satisfying kxi k D 1 and xi .xi / D 1. We of X. Let ¹xi º1 i D1 introduce the norm k  k1 by 1  1 X 2 def 2i .xi .x//2 . kxk1 D kxk2 C i D1

p

It is clear that kxk  kxk1  2kxk. Since `2 is uniformly convex it cannot happen  . Thus k  k1 is a strictly convex norm. that kxk1 D kyk1 D  xCy 2 1

293

Section 9.3 Test-space characterizations

In the next step we construct an equivalent locally uniformly convex norm on X. Let ¹Xi º1 i D0 be an increasing sequence of finite-dimensional subspaces of X such that X0 D ¹0º and the union [1 i D0 Xi is dense in X. We introduce a new norm on X by kxk2 D

X 1

1 2i .d.x, Xi //2

2

,

i D0

where d.x, Xi / is the distance from a vector x to the subspace Xi in the distance induced by the norm kk1 . Let kxk2 D 1 and ¹xi º1 i D1 be such a sequence that kxi k2  1 and lim kx C xi k2 D 2. (9.63) i !1

We are going to show that ¹xi º1 i D1 contains a subsequence which converges to x. Since the same argument can be used for each subsequence of ¹xi º, this would imply that ¹xi º itself converges to x. By the uniform convexity of `2 , condition (9.63) implies that limi !1 d.xi , Xj / D d.x, Xj /. Now we pick " > 0. Since [1 i D0 Xi is dense in X, there is m 2 N such that d.x, Xm / < ". Therefore d.xi , Xm / < " for sufficiently large i 2 N. Also the sequence ¹xi º is bounded. Let zi 2 Xm be such that kzi  xi k1 < ", i 2 N. The sequence ¹zi º1 i D1 is bounded. Therefore it contains a convergence subsequence zin . Therefore the sequence ¹xin º satisfies kxin  xi t k  3" for sufficiently large in and i t . We repeat the procedure for smaller " for this subsequence, by the diagonal procedure we get a Cauchy sequence. Let y be its limit. By (9.63) we get kx C yk1 D 2, we also have kyk1  1. Since the norm k  k1 is strictly convex we get x D y. Therefore the norm k  k2 is locally uniformly convex. Now we modify the norm k  k2 in order to get an equivalent norm jjj  jjj on X which is locally uniformly convex and attains its maximum on C . We do this as follows. We let jjj  jjj0 D k  k2 and construct simultaneously sequences of: 

Norms ¹jjj  jjji º1 i D0 satisfying jjjxjjj0  jjjxjjj1      jjjxjjjn  : : : for all x, and such that jjjxjjjn  2jjjxjjj0 (9.64) for all x 2 X and n 2 N. Therefore jjjxjjj D lim jjjxjjji i !1

exists (and is finite) for every x 2 X. 

Vectors ¹xi º1 i D0  C .



 Functionals ¹fi º1 i D0  X .

294

Chapter 9 Metric characterizations of classes of Banach spaces

We let ¹Mi º1 i D0 be given by Mi D sup jjjxjjji . x2C

The purpose of our construction of sequences is to achieve the following: For each weak-limit point w of the sequence ¹xi º the condition jjjwjjj D lim jjjwjjji D lim Mi i !1

i !1

(9.65)

holds and therefore jjj  jjj attains its maximum on C at w. (We have w 2 C since C is weakly compact.) To achieve the goal (9.65) we do the following: 

1 We pick sequences ¹˛i º1 i D0 and ¹ˇi ºi D0 of positive numbers such that ˛i # 0 and ˇi # 0 as i ! 1. Later we shall describe some additional conditions on these sequences.



1 1 We pick sequences ¹xi º1 i D0 , ¹fi ºi D0 , and ¹jjj  jjji ºi D0 as follows: 

Let x0 2 C be such that jjjx0 jjj0  M0  ˛0.



Let f0 2 X  be such that jjjf0 jjj0 D 1 and f0 .x0 / D jjjx0 jjj0 .



Let jjjxjjj1 D jjjxjjj0 C ˇ0 jjjf0 .x/x0 jjj0



Let x1 2 C be such that jjjx1 jjj1  M1  ˛1.



Let f1 2 X  be such that jjjf1 jjj1 D 1 and f1 .x1 / D jjjx1 jjj1 .



Let jjjxjjj2 D jjjxjjj1 C ˇ1 jjjf1 .x/x1 jjj1 .



We proceed in an obvious way.

The main point of this construction is that we can choose the numbers ˛i and ˇi in such a way that jfi .xj /j  Mi  ıi ,

j  i,

(9.66)

where ¹ıi º1 i D0 is some sequence of positive numbers tending to 0. It is easy to show that (9.66) implies the fact that jjj  jjj attains its maximum on C at w. In fact (9.66) immediately implies jjjwjjji  Mi  ıi . Since obviously supx2C jjjxjjj D limi !1 Mi , the desired conclusion is immediate. So we pick an arbitrary sequence ¹ıi º1 i D0 satisfying ıi > 0, ıi  ıi C1 and ıi # 0 1 as i ! 1, and show that we can achieve (9.66) by picking ¹˛i º1 i D0 and ¹ˇi ºi D0 in a suitable way. We have X

j 1

jjjxjjjj D jjjxjjji C

kDi

ˇk jjjfk .x/xk jjjk .

(9.67)

295

Section 9.3 Test-space characterizations

First one can easily see that by picking ¹ˇi º1 i D0 we can achieve the inequality (9.64). Also, (9.67) implies (for the second inequality we drop all but one terms from the sum) Mj  jjjxi jjji C

j 1 X

ˇk jjjfk .xi /xk jjjk

kDi

 jjjxi jjji C ˇi jjjxi jjj2i  .Mi  ˛i / C ˇi .Mi  ˛i /2 . Hence jjjxj jjjj D jjjxj jjji C

j 1 X

ˇk jjjfk .xj /xk jjjk

kDi

 Mj  ˛j  .Mi  ˛i / C ˇi .Mi  ˛i /2  ˛j , from which, since jjjxj jjji  Mi and jjjxi jjji  Mi , we get X

j 1 2

ˇi jfi .xj /j  jjjxi jjji  ˛i C ˇi .Mi  ˛i /  ˛j 

ˇk jjjfk .xj /xk jjjk

kDi C1

and

j X .Mi  ˛i /2 ˛i C ˛j ˇk jjjfk .xj /xk jjjk   . jfi .xj /j  Mi ˇi Mi ˇi Mi kDi C1

1 Now we are ready to describe suitable choices of ¹˛i º1 i D0 and ¹ˇi ºi D0 . First we pick 1 ¹ˇi ºi D0 in such a way that, in addition to the condition (9.64) (which we have already mentioned), the conditions 1 X ˇk jjjfk .xj /xk jjjk ıi  ˇi Mi 3

kDi C1

are satisfied for all i D 0, 1, : : : . Next we pick ¹˛i º1 i D0 in such a way that the conditions: .Mi  ˛i /2 ıi  Mi  Mi 3 and

˛i C ˛j ıi  , j  i, ˇi Mi 3 are satisfied. It is clear that such choices are possible. It is also clear that these three ıi 3 -conditions imply the condition (9.66).

296

Chapter 9 Metric characterizations of classes of Banach spaces

Now we need to verify that the norm jjj  jjj is still locally uniformly convex. The reason for this is quite general: if we consider a sum of a locally uniformly convex norm and a seminorm which is estimated from above by a multiple of this locally uniformly convex norm, we get a locally uniformly convex norm. We do the estimates in the case which we consider. We have jjjxjjj D jjjxjjj0 C p.x/, where p.x/ is a seminorm satisfying p.x/  jjjxjjj0 , see (9.64). So we assume that x 2 X and ¹xnº1 nD1 are such that jjjxjjj D 1, jjjxn jjj  1 and limn!1 jjjx C xn jjj D 2. We need to show that limn!1 jjjx  xn jjj D 0. It suffices to find a subsequence of ¹xnº converging to x. Passing to subsequences if necessary we may assume that the limits lim jjjxn jjj0 D ˛

n!1

and

lim p.xn / D ˇ

n!1

exist. It is clear that ˛  12 and ˛ C ˇ D 1, and that we may assume jjjxn jjj0  ˛. x jjj C jjjy x jjj (this is We split x D xz C xy in such a way that jjjz x jjj0 D ˛ and jjjxjjj D jjjz possible if jjjxjjj0  ˛, we split xn otherwise). We have 2 D lim jjjx C xnjjj x jjj0 C p.z x / C p.y x / C p.xn //  lim.jjjz x C xn jjj0 C jjjy x jjj0 C p.z x / C p.y x / C p.xn //  lim.jjjz x jjj0 C jjjxn jjj0 C jjjy D ˛ C ˛ C jjjxjjj0  ˛ C p.x/ C .1  ˛/ D lim.jjjxjjj C jjjxn jjj/ D 2. Hence lim jjjz x C xn jjj0 D 2˛. Since the norm jjj  jjj0 is locally uniformly convex, we get lim jjjz x  xnjjj0 D 0. n!1

Therefore lim jjjz x  xn jjj D 0.

n!1

Therefore jjjz x jjj D 1 and xz D x. Thus jjj  jjj is locally uniformly convex. By Lemma 9.35, this completes the proof of (b) ) (a). (a))(c). Let Y be a nonreflexive Banach space which is finitely representable in X. By Theorem 6.11, (a),(c) for some  2 .0, 1/ there are sequences ¹yi º1 i D1  BY  such that and ¹fi º1  B Y i D1 ´  if n  k fn .yk / D 0 if n > k.

297

Section 9.3 Test-space characterizations

This condition implies that for each finitely nonzero sequence ¹ai º of scalars we have   1 ˇ 1 ˇ  X ˇX ˇ  ˇ ai yi  ai ˇˇ. (9.68) ˇ  i D1

i Dn

Therefore for each finite subinterval A  N we have  ˇ  1 ˇ  X ˇX ˇ ˇ ai yi  ai ˇˇ. 2 ˇ  i D1

(9.69)

i 2A

Since Y is finitely representable in X, for each m 2 N and each " > 0 there is a  sequence ¹xi ºm i D1 in X satisfying (9.58) with ˛ D 2.1C"/ . (c))(a). Let U be a free ultrafilter on N. It is easy to see that the condition (c) implies that the unit ball of X U contains an infinite sequence ¹ui º1 i D1 satisfying   ˇ ˇ X  ˇX ˇ   ˛ˇ a u ai ˇˇ (9.70) i i   ˇ i

i 2A

for each finite finitely-nonzero sequence ¹ai º1 i D1 A of N. Therefore the functionals fn given by fk

X i

 ai ui



of real numbers and each subinterval

X 1

 ai

i Dk

are contained in the unit ball. They also satisfy the condition ´ ˛ if k  n fk .xn / D 0 if k > n. Using the characterization of reflexivity Theorem 6.11 and equivalence (a),(c) we get that X U is nonreflexive. Since, as we know (Proposition 2.31), X U is finitely representable in X, we get that X is non-superreflexive.

9.3.2 Characterization of superreflexivity in terms of diamond graphs Several different test-space characterizations of the class of superreflexive spaces are known. We present a test-space characterization of superreflexivity which is in a certain sense the closest to the Banach space theoretical characterization of superreflexivity presented in the previous section. The diamond graph of level 0 is denoted D0 . It is defined as a graph having two vertices joined by an edge of weight (also called length) 1. Di is obtained from Di 1 as follows. Given an edge uv 2 E.Di 1 /, it is replaced by a quadrilateral u, a, v, b with edge lengths 2i . We endow Dn with their shortest path metrics. We consider the vertex set of Dn as a subset of the vertex set of DnC1, it is easy to check that this defines an isometric embedding of Dn into DnC1.

298

Chapter 9 Metric characterizations of classes of Banach spaces

Theorem 9.36. Let X be a Banach space. Then X is not superreflexive if and only if there is a constant C < 1 such that cX .Dn /  C for all n 2 N. Proof. Proof of the “if” part of Theorem 9.36 is an immediate consequence of the following lemma and the characterization of superreflexivity in terms of the finite tree property (Theorem 9.27). Lemma 9.37. If there exist uniformly bilipschitz embeddings of ¹Dnº1 nD1 into a Banach space X, then X has the finite tree property. Proof. Let fn : Dn ! X be uniformly bilipschitz embeddings. Without loss of generality we assume that ıdDn .x, y/  kfn .x/  fn .y/k  dDn .x, y/

(9.71)

for some ı > 0, all n 2 N and all x, y 2 Dn . Let us show that this implies that the unit ball of X contains a sequence ¹xi : i D 1, : : : , 2n 1º required in the definition of a finite tree property. To get such a sequence we use the map fn . Let x1 D fn .u0 /  fn .v0 /, where ¹u0, v0 º D V .D0/. Now we consider the quadrilateral u0 , a, v0 , b. Inequality (9.71) implies kfn .a/  fn .b/k  ı. Consider two pairs of vectors (corresponding to two different paths joining u0 and v0 in D1 ): Pair 1. fn .v0 /  fn .a/, fn .a/  fn .u0 /.

Pair 2. fn .v0 /  fn .b/, fn .b/  fn .u0 /.

The inequality kfn .a/  fn .b/k  ı implies that at least one of the following is true k.fn .v0 /  fn .a//  .fn .a/  fn .u0 //k  ı or k.fn .v0 /  fn .b//  .fn .b/  fn .u0 //k  ı. Suppose that the first inequality holds. We let x2 D 2.fn .v0 /  fn .a//

and x3 D 2.fn .a/  fn .u0 //.

1 2 .x2

C x3 / and kx2  x3k  2ı, hence the conditions of DefiniIt is clear that x1 D tion 9.26 are satisfied. Also, the condition (9.71) implies that kx2 k, kx3k  1. We continue our construction of a ı-tree in the unit ball of X in a similar manner. For example, to construct x4 and x5 we consider the corresponding quadrilateral a, a1 , v0 , b1 in D2 . The inequality kfn .a1 /  fn .b1 /k  ı=2 implies that at least one of the following is true k.fn .v0 /  fn .a1 //  .fn .a1 /  fn .a//k  ı=2 or k.fn .v0 /  fn .b1 //  .fn .b1 /  fn .a//k  ı=2.

299

Section 9.3 Test-space characterizations

Suppose that the second inequality holds. We let x4 D 4.fn .v0 /  fn .b1//

and x5 D 4.fn .b1 /  fn .a//.

It is clear that x2 D 12 .x4 C x5 / and kx4  x5k  2ı, hence the conditions of Definition 9.26 are satisfied. Also (9.71) implies that kx4k, kx5k  1. Proceeding in an obvious way we get the set ¹xi : i D 1, : : : , 2n  1º required in the definition of the finite tree property. Clearly, this construction can be done for each n 2 N. To prove the “only if” part of Theorem 9.36 it is convenient to assume that all edges of the diamond Dn have length one (that is, we multiply all distances in the definition of Dn given above by 2n ). We start by describing the inductive construction of a 2bilipschitz embedding of the diamond graph Dn into `1 , the image of Dn will be a n n subset of `21 contained in ¹0, 1º2 . The map which we construct is 1-Lipschitz and its inverse is 2-Lipschitz. We map vertices of D0 onto 0 and 1 considered as elements of `11 . It is clearly an isometric embedding. Now we assume that we have already constructed a suitable embedding of Dn1 n1 n into `21 . We construct an embedding of Dn into `21 in two steps. First we map those vertices of Dn which come from vertices of Dn1 in the following way. If the image n1 of a vertex v 2 V .Dn1/ is .v1 , v2 , v3 , : : : , v2n1 / 2 `21 , we map the corresponding n vertex in V .Dn/ onto .v1 , v1 , v2, v2 , v3 , v3 , : : : , v2n1 , v2n1 / 2 `21 . Recall that to get Dn an edge uv 2 E.Dn1 / was replaced by a quadrilateral u, a, v, b. It is clear that n the induction hypothesis implies that the images of u and v in `21 differ in two consecutive coordinates. We map a and b onto different sequences each of which differs from images of u and v in one coordinate only. It is clear that this is a well-defined map. Now we show that the constructed map is 2-bilipschitz. It is clear that the map Dn 7! 2n `1 is 1-Lipschitz (any two vertices which form an edge are mapped onto vectors at n distance 1 in `21 ). So it remains to show that the inverse image is 2-Lipschitz. The analysis of the diamond graphs needed to establish this will also be used for estimates of the Lipschitz constants for embeddings into general non-superreflexive spaces. k Let u, v be two adjacent vertices in Dk , u` , v ` be their images in `21 . It is clear that u` , v ` have only one different coordinate. We assume that the value of the coordinate is 0 for u` and 1 for v ` . Let un , vn be the corresponding vertices in Dn , n > k. Then n the images u`n , vn` of un , vn in `21 differ on a set F of 2nk consecutive coordinates and coincide on the set R of the remaining coordinates. We say that the set of vertices n in Dn, whose images in `21 coincide with u`n and vn` on this set R of the remaining coordinates, forms a subdiamond. The vertex un is called the bottom of the subdiamond and the vertex vn is called the top of the subdiamond. It is easy to check that for any two vertices w and z of Dn there is a well-defined smallest subdiamond of Dn containing both w and z. Let F D Fw,z  ¹1, : : : , 2n º be a subset on which the coordinates of the images of vertices of the smallest subdiamond containing w and z may differ.

300

Chapter 9 Metric characterizations of classes of Banach spaces

There are two possible cases: (a) At least one of the vertices w, z is either top or bottom of this subdiamond. (b) Neither w nor z is the top or bottom of the subdiamond. It is easy to see by induction that in case (a) the distance between w and z in Dn is n the same as the distance between their images in `21 . So it remains to consider case (b). The vertex of the subdiamond whose image has ones in the left half of F and zeros in the right half of F is called the leftmost vertex of the subdiamond. Similarly we define the rightmost vertex of the subdiamond. It is clear from the construction that each vertex in the subdiamond, except the top and the bottom, is either closer (with respect to the graph distance of Dn ) to the leftmost vertex or to the rightmost vertex. In the first case we say that the vertex is in the left side, in the second case we say that it is in the right side. We need to consider two subcases of (b): 

w and z are on the same side.



w and z are on different sides.

Consider the same-side case, assume that the considered side is the left side. We n claim that in this case the `21 -image of one of the vertices, w or z, should have all of the coordinates greater than that of the image leftmost vertex; and the image of the other should have all of the coordinates less than that of the image leftmost vertex. In fact, if this is not the case, the vertices are contained in the smaller subdiamond formed by the leftmost vertex and either the top or the bottom of the subdiamond which we consider, and we get a contradiction. Now, if the condition of the previous paragraph is satisfied, it is easy to see that n the Dn -distance between w and z is the same as the `21 -distance between their images. (The shortest path goes through the leftmost vertex, and its length is exactly the n amount of different `21 -coordinates.) Now we consider the different-sides case. In this case either (1)

One of the vertices is closer to the top of the subdiamond, and the other is closer to the bottom, or

(2)

Both vertices are at least as close to the top (bottom) as to the bottom (top).

In case (1) the image of the vertex which is closer to the top has either all of the coordinates in the left half of F equal to 1, or all of the coordinates in the right half of F equal to 1. We consider the left-half case only. In this case the image of the other vertex n has all the coordinates in the left half of F equal to 0. Therefore the `21 -distance ben tween the `21 -images is at least 12 jF j. Since 12 jF j is half of the diameter of the subdian mond, this implies that the map inverse to the embedding of Dn into `21 is 2-Lipschitz.

301

Section 9.3 Test-space characterizations

In case (2) we consider only the subcase where both vertices are at least as close to the top as to the bottom (the other case is similar). In this case the shortest path between the vertices w and z goes through the top t and the vectors f .t /  f .w/ and n f .t /f .z/ are disjoint, where f is our notation for the map f : V .Dn / ! `21 which we constructed, therefore the `1 -distance between f .w/ and f .z/ is the same as the graph distance between w and z. To show that diamond graphs admit uniformly bilipschitz embeddings into an arbitrary non-superreflexive space, we use the characterization proved in Theorem 9.27. Namely, we use the fact that for each non-superreflexive space X there is a sequence n ¹xi º2i D1 in BX satisfying (9.58). Now we map the vertex w of Dn corresponding to Pn Pn n n the sequence ¹wi º2i D1 2 `21 onto the vector 2i D1 wi xi . We denote 2i D1 wi xi by '.w/. It remains to show that for w, z 2 V .Dn / the norms of '.w/  '.z/ in X can n be estimated from below by norms of f .w/  f .z/ in `21 . In the case where either w or z is the top or the bottom of the smallest subdiamond containing the images of w and z, the differences '.w/  '.z/ and f .w/  f .z/ have all coordinates of the same sign and therefore, by (9.58), k'.w/  '.z/kX  ˛kf .w/  f .z/k`2n D ˛dDn .w, z/. 1

In the case where w and z are on the same side of the subdiamond, the estimate is the same as is the reason for its validity. In the different-sides case we may assume that the shortest path between w and z is through the bottom of the subdiamond, which we denote y, and we may assume that dDn .w, y/  dDn .y, z/.

(9.72)

We may also assume that w is on the left side. In this case we use (9.58) in the case Pn Pn where A is the left half of F . We write '.w/ D 2i D1 wi xi , '.z/ D 2i D1 zi xi , and Pn '.y/ D 2i D1 yi xi . Inequality (9.72) implies that z is closer to y than the rightmost vertex. For this reason all coordinates of z on A are the same as of y and, by (9.58), we get ˇ ˇ ˇX ˇ ˇ ˇ k'.w/  '.z/kX  ˛ ˇ .wi  zi /ˇ ˇ ˇ ˇ ˇi 2A ˇ ˇX ˇ ˇ D ˛ ˇ .wi  yi /ˇ ˇ ˇ i 2A

1  ˛dDn .w, y/ 2 ˛  dDn .w, z/. 4

302

Chapter 9 Metric characterizations of classes of Banach spaces

9.4 Exercises Exercise 9.38. Let 0 < ˛ < 1 and X be an n-dimensional normed linear space. Then the cardinality of an ˛-net for BX is at least ˛ n . Exercise 9.39. Prove the following estimate for the discretization modulus:

ı`n1 ,!L2 ."/  2= .1  "/2n . There is a hint to Exercise 9.39. Exercise 9.40. Construct an equivalent norm jjj  jjj on `2 such that jjj  jjj does not attain its maximum on the unit ball of `2 in its original norm. Exercise 9.41. We introduce the infinite diamond as a kind of inductive limit of diamond graphs. Recall that the diamond graph of level 0 is denoted D0 . It has two vertices joined by an edge of length 1. Di is obtained from Di 1 as follows. Given an edge uv 2 E.Di 1 /, it is replaced by a quadrilateral u, a, v, b with edge lengths 2i . We endow Dn with their shortest path metrics. We consider the vertex of Dn as a subset of the vertex set of DnC1, it is easy to check that this defines an isometric embedding. We introduce the infinite diamond D! as the union of the vertex sets of ¹Dnº1 nD0 . For u, v 2 D! we introduce dD! .u, v/ as dDn .u, v/ where n 2 N is any integer for which u, v 2 V .Dn/. Since the natural embeddings Dn ! DnC1 are isometric, it is easy to see that dDn .u, v/ does not depend on the choice of n for which u, v 2 V .Dn /. Show that a Banach space admitting a bilipschitz embedding of the infinite diamond is nonreflexive. There is a hint to Exercise 9.41. Exercise 9.42. The Laakso graph of level 0 is denoted L0. It consists of two vertices joined by an edge of length 1. The Laakso graph Li is obtained from Li 1 as follows. Each edge uv 2 E.Li 1 / of length 4i C1 is replaced by a graph with 6 vertices u, t1 , t2, o1 , o2 , v where o1 , t1 , o2 , t2 form a quadrilateral, and there are only two more edges ut1 and vt2, with all edge lengths 4i . We endow Ln with their shortest path metrics. Prove the following analog of the “only if” part of Theorem 9.36: Let X be a nonsuperreflexive Banach space. Then there is a constant C < 1 such that cX .Ln /  C for all n 2 N.

303

Section 9.5 Notes and remarks

9.5 Notes and remarks The Ribe theorem was obtained in [393]. Some versions of Ribe’s proof were presented in [136] and [55, pp. 222–224]. Another proof of the Ribe theorem was obtained by Heinrich and Mankiewicz [200], it is also presented in [55, p. 222]. The third proof of the Ribe theorem is due to Bourgain [77]. The paper [77] is very difficult reading, for this reason several authors worked on a simplification and clarification of Bourgain’s argument. One of the steps in Bourgain’s proof was simplified in [52]. The rest of Bourgain’s proof was simplified and explained by Giladi, Naor, and Schechtman [164]. In Section 9.2 we present Bourgain’s proof of the Ribe theorem following [164] and using [52]. Lemma 9.13 is a refinement of a result of Bourgain [77]. The proof of Bourgain’s almost extension result has been significantly simplified by Begun [52], and our proof of Lemma 9.13 follows Begun’s argument. Our proof of Lemma 9.12 is due to Begun [52]. Bourgain’s proof of the Ribe theorem is, in my opinion, more complicated than the other proofs. I have chosen this proof for presentation in this chapter because the quantitative estimates obtained in Bourgain’s proof (and further results along the same lines) are important for applications (see [164], [336]). Also, Bourgain’s proof is the only one which is not presented in the well-known monograph [55]. There is an alternative proof of Bourgain’s discretization theorem in the case where the target space is superreflexive, see [282]. There are interesting related results and open problems for specific target spaces in the paper [164]. Estimate (9.1) was proved in [164], Bourgain [77] proved a slightly weaker estimate Cn

ıX,!Y ."/  e .n="/

.

(9.73)

The estimate (9.1) implies (9.73) since due to the Dvoretzky theorem (Theorem 2.30) p cY .`n2 / D 1, and therefore cY .X/  n by John’s theorem (Theorem 9.10). Lemma 9.7 is due to Corson and Klee [103]. Proposition 9.9 is a part of a welldeveloped direction devoted to approximation of arbitrary convex bodies by smooth ones, see [406] and references therein. Theorem 9.10 is due to John [218]. We give a proof which uses only basic calculus and n-dimensional analytic geometry. There are many other proofs which are more elegant and provide additional information, but use more advanced tools, see [37, Lecture 3], [380, Theorem 1.12], [381, Chapter 3]. Lemma 9.18 is a classical lemma, most possibly it was rediscovered many times by different authors. See, for example, the proof of Schur’s theorem in [105, Theorem 36.14]. More details on the Poisson kernel can be found in [416, Chapter III, Section 2] and [428, Section X.3]. The computation of the surface area of the unit sphere can be found, for example, in [37, Lecture 1]. The Stirling formula for the gamma function is derived, for example, in [150, Section 540]. See [150, Section 531] for relations between the gamma function and factorial.

304

Chapter 9 Metric characterizations of classes of Banach spaces

The program which came to be known as “the Ribe program” was suggested by Bourgain in [76]. After suggesting it Bourgain wrote: “A detailed exposition of this program will appear in J. Lindenstrauss’s forthcoming survey paper [J. Lindenstrauss, Topics in the geometry of metric spaces, to appear].” Unfortunately, the mentioned paper of Lindenstrauss was never published and I have never seen even an unpublished version of it. Today the Ribe program is a very active research direction. See [38] for a short and [334] for an extensive survey on the Ribe program. The theory of superreflexive spaces was initiated and developed by James in a very interesting series of papers, see [213] and [214]. In particular he proved Theorem 9.27. Further important contributions are due to Enflo [135] (see Theorem 8.36) and Pisier [377] (see Theorem 8.37). Theorem 9.30 is due to Lindenstrauss [284]. The notion of a locally uniformly convex norm was introduced in [296]. Kadets [229] proved that each separable Banach space has an equivalent locally uniformly convex norm. We presented a simplified proof which is due to Davis and Johnson [111]. The nonseparable version of Theorem 9.30 is due to Troyanski [429]. There is an alternative proof of the fact that a space containing a ı-tree cannot be reflexive. It is based on the argument going back to Asplund and Namioka [29] and can be found in [50, pp. 232–235]. See [74] for further development of this method which also generalizes the above mentioned result of Troyanski. Our proof of the existence of the equivalent norm jjj  jjj which attains its maximum on a weakly compact set uses the idea of Lindenstrauss [284], which was used by him to prove that operators attaining maximum of the norm on a weakly compact separable set are dense in the space of all operators. There is an alternative approach to the final part of our proof of Theorem 9.30. It is based on the notion of a farthest point, see [28], [269], [270]. The test-space characterization of superreflexivity which we present in Section 9.3.2 is due to Johnson and Schechtman [226], our proof of the “if” part follows [356]. The original proof in [226] uses Theorem 8.36. The paper [226] uses some ideas of the papers [83] and [276], in which diamond graphs were used to prove negative results about dimension reduction in `1 . It seems that the first metric geometry paper dealing with diamond graphs is the paper of Gupta, Newman, Rabinovich, and Sinclair [190] (the conference version of this paper appeared in 1999). A closely related construction is due to Laakso [261]. Exercise 9.39 is from [164]. Exercise 9.41 is from [356]. Since spaces with the Radon–Nikodým property also do not contain bounded ı-trees (see e.g. [55]) we actually get a stronger result: if a Banach space admits a bilipschitz embedding of an infinite diamond, it does not have the Radon–Nikodým property. In [356] a similar result was proved for the Laakso space defined as an inductive limit of the graphs Ln (defined in Exercise 9.42): We consider the vertex of Ln as a subset of the vertex set of LnC1, it is easy to check that this defines an isometric embedding. We introduce the Laakso space L! as the union

Section 9.5 Notes and remarks

305

of the vertex sets of ¹Lnº1 nD0 . For u, v 2 L! we introduce dL! .u, v/ as dLn .u, v/ where n 2 N is any integer for which u, v 2 V .Ln /. Since the natural embeddings Ln ! LnC1 are isometric, it is easy to see that dLn .u, v/ does not depend on the choice of n for which u, v 2 V .Ln/. Laakso space was originally constructed in [261], our description of it is similar to the version presented in [267, p. 290]. The fact that Laakso space does not admit bilipschitz embeddings into a Banach space with the Radon–Nikodým property was known before, see [96]. Exercise 9.42 is from [226] who sketched a proof of the fact that a Banach space X is non-superreflexive if and only if there is a constant C < 1 such that cX .Ln /  C for all n 2 N. Their proof of the “if” part is not similar to our proof of the “if” part of Theorem 9.36. In an interesting related study Cheeger and Kleiner [97] proved a general result on embeddability into L1 of a wide class of Laakso-space-type metric spaces.

9.5.1 Another test-space characterization of superreflexivity: binary trees The following test-space characterization of superreflexivity predates the characterization which we presented in Section 9.3.2. It was proved by Bourgain in the same paper [76] in which he suggested the Ribe program. For n 2 N we define the binary tree of depth n as a graph Tn whose vertex set is the set of sequences of 0 and 1 containing at most n elements, including an empty sequence. Two sequences are joined by an edge if and only if one of them is obtained from the other by adding one element at the right end. Theorem 9.43. A Banach space is non-superreflexive if and only if it admits uniformly bilipschitz embeddings of the graphs ¹Tnº1 nD1 . A proof of Theorem 9.43 can be found in [383]. See [313] for a result in a closely related direction. Baudier [45] proved the following strengthening of the “only if” part of Theorem 9.43. We define the infinite binary tree T1 as an infinite graph whose vertex set is the set of all finite sequences of 0 and 1, including an empty sequence. Two sequences are joined by an edge if and only if one of them is obtained from the other by adding one element at the right end. Theorem 9.44. A Banach space X is non-superreflexive if and only if it admits a bilipschitz embedding of T1 . Theorem 9.44 can be obtained by combining Theorem 9.43 with Theorem 2.6. It should be mentioned that 9.44 was proved before Theorem 2.6.

306

Chapter 9 Metric characterizations of classes of Banach spaces

9.5.2 Further results on test-spaces Test-spaces are known for several important classes of Banach spaces. First we observe that the Maurey–Pisier theorem ([319], we stated it as Theorem 2.55), combined with (1)

The estimates of type and cotype of spaces Lp (Theorem 2.53);

(2)

The result of Bretagnolle–Dacuhna-Castelle–Krivine [80] implying that for 1  p  q  2 the space Lq .0, 1/ is isometric to a subspace of Lp .0, 1/;

(3)

Combination of the result on differentiability of Lipschitz maps proved by Heinrich and Mankiewicz [200] (see also [55, Theorem 7.9 and Corollary 7.10]) with the principle of local reflexivity of Banach spaces ([286], [225]) (we state and prove the principle of local reflexivity in Theorem 10.18) implies that a finite-dimensional Banach space F admits a bilipschitz embedding into a Banach space X with distortion C if and only if F admits an isomorphic embedding f : F ! X such that kf k  kf 1 jf .F / k D C ;

implies the following results: (a) The spaces ¹`n1 º1 nD1 form a collection of test-spaces for the class of spaces with nontrivial cotype. (b) The spaces ¹`n1 º1 nD1 form a collection of test-spaces for the class of Banach spaces with nontrivial type. (c) Item (b) is a part of a more general result: For any fixed p 2 Œ1, 2/ the spaces ¹`pn º1 nD1 form a collection of test-spaces for the class of Banach spaces which have type q for some q > p. The mentioned classes of Banach spaces and their test-spaces were studied in the spirit of Problem 9.23(a)–(d). We mention the results obtained on these lines: (a) Bourgain, Milman, and Wolfson [79] proved that the Hamming cubes ¹Fn2 º1 nD1 are a collection of test-spaces for the class of Banach spaces with nontrivial type. See [379] for a simpler proof. (b) Mendel and Naor [327] proved that lattice graphs Lm,n defined as graphs whose vertex sets are ¹0, 1, : : : , mºn, two vertices are joined by an edge if and only if their `1 -distance is equal to 1, form a collection of test-spaces for the class of Banach spaces without cotype. (c) Ostrovskii [357] proved that for each sequence ¹Xm º1 mD1 of finite-dimensional Banach spaces there exists a sequence ¹Hnº1 of finite connected unweighted nD1 graphs with maximum degree 3 such that the following conditions on a Banach space Y are equivalent:

Section 9.6 Hints to exercises 

Y admits uniformly isomorphic embeddings of ¹Xm º1 mD1 .



Y admits uniformly bilipschitz embeddings of ¹Hnº1 nD1 .

307

This result can be combined with the consequences of the Maurey–Pisier theorem mentioned above. (d) Baudier [45] and Ostrovskii [361] showed that the classes mentioned above can be characterized using one test-space.

9.5.3 Further results on the Ribe program We would like to mention classes of Banach spaces for which the Ribe program was completed (in some sense): (a) Banach spaces with type p .1 < p  2/ [326]; earlier the program was almost completed in [79], see [379] for a simplification of some proofs in [79]. Many interesting questions remain open, see [137], [163], [334]. (b) Banach spaces with cotype q .2  q < 1/ [327], some improvements in [162]. (c) q-uniformly convex Banach spaces [278] and [328]. The corresponding problem for p-uniformly smooth Banach spaces remains open, see [334].

9.5.4 Non-local properties The Ribe program concerns only hereditary local isomorphic invariants. But the problem of a metric characterization and a test-space characterization can be posed for an arbitrary class of Banach spaces satisfying the condition (H) defined on page 288. Johnson [426, Problem 1.1] suggested this problem for reflexivity and the Radon– Nikodým property. Some results in this direction were obtained in [356] and [362]. We have already mentioned some of the results of [356] in Exercise 9.41 and our comment to it.

9.6 Hints to exercises p To Exercise 9.39. Use the fact that the metric space .`n1 , kx  yk1/ embeds isometrically into L2 (see Corollary 1.43). Derive from here that any ı-net in B`n1 emp 2=ı. Derive from Theorem 4.23 the equality beds into Lp 2 with distortion at most cL2 .`n1 / D n. Combine these estimates. To Exercise 9.41. Use the fact that a Banach space containing a bounded ı-tree is nonreflexive (page 290) and the argument used to prove the “if” part of Theorem 9.36.

Chapter 10

Lipschitz free spaces

10.1 Introductory remarks A slight variation of the argument of Proposition 1.7 shows that each metric space X can be embedded into `1 .X/ and that this embedding is in a certain sense canonical, up to a choice of a base point. If we consider an n-element metric space, we can find its canonical isometric embedding into `n1 . However, for applications it is important to embed into spaces which do not contain large-dimensional subspaces which are close to `n1 . In this connection it is interesting to analyze the second known canonical embedding of a metric space X into a Banach space. It was invented and reinvented by many different authors. For this reason the construction has several different names: Arens– Eells space, Kantorovich–Rubinshtein space, Lipschitz free space. I have decided to use the term Lipschitz free space, because I did not come to a definite conclusion on the priority problem, and prefer to use the term “Lipschitz free space”, rather than to use different names for different versions and special cases. This chapter has a modest purpose: to introduce some basic facts about the Lipschitz free spaces and to mention (in the Notes and Remarks section) some other known facts about these spaces. In the presentation of the basic facts we treat separately the case where the metric space is a graph with its graph distance. In this case the Lipschitz free space admits a natural description of terms of the cut space and the cycle space.

10.2 Lipschitz free spaces: definition and properties Definition 10.1. Let X be a metric space. A moleculePof X is a function m : X ! R which is supported on a finite set and which satisfies p2X m.p/ D 0. For p, q 2 X define the molecule mpq by mpq D 1p  1q , where 1p and 1q are indicator functions of singleton sets ¹pº and ¹qº. We endow the space of molecules with the seminorm kmkLF D inf

²X n i D1

jai jdX .pi , qi / : m D

n X

³ ai mpi qi .

i D1

It is not difficult to see that this is actually a norm (this statement also follows from Theorem 10.2). The Lipschitz free space over X is defined as the completion of the space of all molecules with respect to the norm k  kLF . We denote the Lipschitz free space over X by LF.X/.

309

Section 10.2 Lipschitz free spaces: definition and properties

By a pointed metric space we mean a metric space with a distinguished point, denoted O. By Lip 0 .X/ we denote the space of all Lipschitz functions f : X ! R satisfying f .O/ D 0, where O is the distinguished point of a pointed metric space X. It is not difficult to check that Lip 0 .X/ is a Banach space with respect to the norm kf k D Lip.f /. Theorem 10.2. Let X be a pointed metric space. The dual space of the Lipschitz free space LF.X/ can be described as LF.X/ D Lip 0 .X/. The duality for a given function G 2 Lip 0 .X/ and a molecule m D given by n X ai .G.pi /  G.qi //. G.m/ D

Pn

i D1 ai mpi qi

is

(10.1)

i D1

Proof. Let g 2 LF.X/ , we introduce a function on X by G.p/ D g.mpO /. It is clear that G.O/ D 0. Now we show that Lip.G/  kgk, where kgk is the norm of g in LF.X/ . In fact, we have jG.p/  G.q/j D jg.mpO  mqO /j  kgkkmpO  mqO kLF  kgkdX .p, q/. On the other hand, for each G 2 Lip 0 .X/, we can define the functional g on molecules of the form mpO by g.mpO / D G.p/, and extend it by linearity to the space of all molecules, so (10.1) holds (straightforward verification shows that this extension is well defined). To show that this functional extends to a bounded linear functional on LF.X/ and to complete the proof it suffices toP show that kgk  Lip.G/. n Let " > 0, let m be a molecule, and let Pnm D i D1 ai mpi qi be its almost optimal representation, that is, kmkLF  .1  "/ i D1 jai jdX .pi , qi /. Since mpi qi D mpi O  mqi O , we have g.m/ D

n X

ai .G.pi /  G.qi // 

i D1

n X i D1

jai jLip.G/dX .pi , qi / 

Lip.G/ kmkLF . 1"

Letting " # 0 we get kgk  Lip.G/. Remark 10.3. The definition of k  kLF immediately implies that it is the largest seminorm on the space of molecules among all seminorms satisfying kmpq k  dX .p, q/. Now we can prove the property which makes the Lipschitz free spaces important for metric geometry.

310

Chapter 10 Lipschitz free spaces

Theorem 10.4. Let X be a pointed metric space. Then (a)

The map ı given by ı.p/ D mpO isometrically embeds X in LF.X/. (We denote this map by ı because ı.p/, considered as an element of the space .Lip 0 .X// , corresponds to the point evaluation at p.)

(b)

Let E be a Banach space and let T : X ! E be a Lipschitz map satisfying T .O/ D 0. Then there is a unique bounded linear map TQ : LF.X/ ! E such that TQ ı ı D T . This map satisfies kTQ k D Lip.T /.

Proof. (a) We need to prove kmpO  mqO kLF D d.p, q/. Since mpO  mqO D mpq , the inequality kmpO  mqO kLF  d.p, q/ follows immediately from the definition of k  kLF . As for the inverse inequality, we can get it from the duality established in Theorem 10.2. In more detail: there exists a 1-Lipschitz function G in Lip 0.X/ such that G.q/  G.p/ D d.p, q/. The function d.p, x/  d.p, O/ is one such function. Therefore kmpO  mqO kLF  jG.mpO  mqO /j D jG.p/  G.q/j P D d.p, q/. (b) Every molecule is uniquely expressible in the form m D i ai mpi O , where points ¹pi º are distinct and none of them coincides with O. We define TQ by X TQ .m/ D ai T .pi /. This definition immediately implies TQ ı ı D T . Since ı is an isometric embedding and Lip.TQ / D kTQ k, this equality implies kTQ k  Lip.T /. Pimmediately n Is it easy to see that for m D i D1 ai mpi qi we have X   n   Q Q kT .m/k D  ai T .mpi qi /  i D1

  n  X  ai .T .pi /  T .qi // D  i D1



n X

jai jLip.T /d.pi , qi /.

i D1

Taking infimum over all representations of m in the form kTQ .m/k  Lip.T /  kmkLF . Thus kTQ k  Lip.T /.

Pn

i D1 ai mpi qi ,

we get

Remark 10.5. Theorem 10.4 shows that LF.X/ is in a sense the “smallest” Banach space containing an isometric image of X. This “smallness” is however different from the natural Banach space theoretical “smallness”. It does not imply that any Banach space containing an isometric image of X contains an isometric copy of LF.X/. One of the important examples of this type is presented in Theorem 10.16, see Remark 10.17.

Section 10.2 Lipschitz free spaces: definition and properties

311

Proposition 10.6. Let X be a metric space and X0 be a subset of X. The subset X0 is regarded as a metric space with the metric inherited from X. The equality kmkLF.X0 / D kmkLF.X/ holds for each molecule m of X0 and therefore there is a natural isometric embedding of LF.X0 / into LF.X/. Remark 10.7. At this point it becomes important that molecules are defined as realvalued functions (or real-valued finitely-supported measures). If we consider a complex-valued analogue of the spaces LF.X/, the embedding of LF.X0 / into LF.X/ does not have to be isometric. Proof of Proposition 10.6. Let m be a molecule of X0. It is immediate from the definition that kmkLF.X0 /  kmkLF.X/ (the infimum corresponding to the latter norm is over a larger set of representations). To prove the inverse inequality we use the description of the dual space (Theorem 10.2) and the extension property of Lipschitz real-valued functions. It is convenient to pick a distinguished point O in the smaller space X0 . By Theorem 10.2 there exists f 2 Lip 0 .X0/ such that Lip.f / D 1 and f .m/ D kmkLF.X0/ . By the extension property of Lipschitz real-valued functions (see Theorem 10.8 below), there exists fz 2 Lip 0 .X/ with Lip.fz/ D 1 whose restriction to X0 coincides with f . Therefore we have kmkLF.X/  fz.m/ D f .m/ D kmkLF.X0/ . Theorem 10.8. Let X0 be a subset of a metric space .X, d /, X0 is also considered as a metric space with metric d . Let f be a real-valued Lipschitz function on X0. Then there exists a Lipschitz function fz : X ! R such that Lip.fz/ D Lip.f / and the restriction fzjX0 coincides with f . Furthermore, if f is a bounded function and a  f .x/  b for some a, b 2 R and all x 2 X0 , then the function fz can be chosen to satisfy a  fz.x/  b for all x 2 X. Proof. First we prove the first part of the theorem (without the boundedness condition). We can view this proof as follows. We assign values to fz.x/ for each x 2 XnX0, one at a time (formally the construction can actually be done like this only if XnX0 is at most countable, in the general case we use Zorn’s lemma to complete the proof). We need to show that in each step there is at least one real number available as a suitable value for fz.x/. Straightforward verification shows that the only restriction for the choice of fz.x/ is sup .f .z/  Lip.f /d.z, x//  fz.x/  inf .f .y/ C Lip.f /d.x, y//. z2X0

y2X0

Therefore, to show that the desired choice is possible we need to prove that the inequality f .z/  Lip.f /d.z, x/  f .y/ C Lip.f /d.x, y/

312

Chapter 10 Lipschitz free spaces

holds for each z, y 2 X0. This is obvious, since the inequality can be rewritten as f .z/  f .y/  Lip.f /.d.z, x/ C d.x, y//, and therefore it follows from the triangle inequality and f .z/  f .y/  Lip.f /d.z, y/, which is just the Lipschitz condition. This completes the first part of the proof in the case where XnX0 is finite or countable. In the general case we use Zorn’s lemma (Lemma 2.5, page 34) as follows. We consider the set E of all extensions of f to subsets of X containing X0 and having the same Lipschitz constant as f . We regard E as a partially ordered set with the partial order  defined by: g  h if and only if the domain of h contains the domain of g and the restriction of h to the domain of g coincides with g. It is clear that each totally ordered subset H of this ordered set has an upper bound u defined by: the domain of u is the union U of all domains of elements of H , the value of u at x 2 U is defined as the value at x of any of the elements of H (the definition of the partial order is such that these values are the same). By Zorn’s lemma, the set E has a maximal element fz. The definition of E implies that Lip.fz/ D Lip.f /. The domain of fz should coincide with X because otherwise, by the first part of the proof, we can extend the domain by one point. In the case where a  f .x/  b we construct fz as before and then observe that if we consider the function ® ¯ fy D min max¹fz.x/, aº, b , we get the desired function.

10.3 The case where dX is a graph distance In the case where X is a graph with its graph distance, the construction of a Lipschitz free space can be obtained somewhat differently. Let G D .V .G/, E.G// D .V , E/ be a graph. Denote by C 1 .G/ and C 0 .G/ the space of real-valued functions on the edge set and the vertex set, respectively. We consider some orientation of edges of G (see the corresponding definitions on page 136, recall that we denote by e C and e  the head and the tail of an edge e, respectively). The corresponding incidence matrix D is defined as a matrix whose rows are labeled using vertices of G, whose columns are labeled using edges of G and the ve-entry is given by 8 ˆ1, if v D e C, < dve D 1, if v D e , ˆ : 0, if v is not incident to e.

313

Section 10.3 The case where dX is a graph distance

Remark 10.9. Almost all of the important results of this section do not depend on the choice of orientation, although many of the notions which we use do (for example, the incidence matrix). Interpreting elements of C 1 .G/ and C 0 .G/ as column vectors, we may regard D as a matrix of a linear operator D : C 1 .G/ ! C 0 .G/. We also consider the transpose matrix D T and the corresponding operator D T : C 0.G/ ! C 1.G/. It is easy to describe ker D T . In fact, for f 2 C 0 .G/ the value of D T .f / 2 C 1.G/ at an edge e is f .e C/  f .e /, therefore f 2 ker D T if and only if it has the same value at the ends of each edge. It is clear that this happens if and only if f is constant on each of the connected components of G. Therefore the ranks of the operators D T and D are equal to jV j  c, where c is the number of connected components of G. We let Z D ker D. This subspace of C 1 .G/ is called the cycle space or cycle subspace. The name is chosen because, as we explain below, this subspace is spanned by the set of signed indicator functions of all cycles. First we observe that for f 2 C 1 .G/ the function Df 2 C 0.G/ is given by X X f .e/  f .e/. (10.2) .Df /.v/ D e, eC Dv

e, e Dv

It is convenient to view this formula in the following way. We regard f as a record of transportation of a certain commodity between storage places (vertices) and f .e/ is the amount of commodity transported directly from e  to e C. Then .Df /.v/ reflects the change in the amount of the commodity at the location labeled v. The function f is in ker D if and only if the amount of the commodity at each v remains unchanged. Now we consider a cycle C in G. We consider one of the two possible orientations of C satisfying the following condition: each vertex of C is a head of exactly one edge and a tail of exactly one edge. Now we introduce the signed indicator function C 2 C 1.G/ of the cycle C by 8 ˆ if e 2 C and its orientations in C and G are the same 0 there exists x 2 X such that kxk < .1 C ı/kx k and T x D y. Proof. This lemma can be derived from the Goldstine theorem which we have already mentioned (page 262). In fact, we may assume that kx  k  1, so y 2 T  .BX  /. By the Goldstine theorem BX  is the weak closure of BX . Since T  is weak continuous, we get that y is in the weak closure of T .BX /. Both y and T .BX / are in Y , and the restriction of weak topology to Y is just the weak topology of Y . Thus y is in the weak closure of T .BX /. On the other hand T .BX / is a convex set, hence its weak and strong closures coincide. Therefore y is in the strong closure of T .BX /. Since T has closed range, by the Baire category theorem, the strong closure of T .BX / has interior pointsPin TX. This easily implies that for any ˛ 2 .0, 1/ there exists a representation 1 i y D P such that yi D ˛ i T .zi /. Then i 2 BX be  i D0 yi such that Py1i 2 ˛i T .BX /. LetzP 1 1 1 i i   yDT i D0 ˛ zi , i D0 ˛ zi 2 X, and i D0 ˛ zi  1˛ . Picking ˛ satisfying 1 D 1 C ı, we get the desired result. 1˛ Proof of Theorem 10.18. Choose ı > 0 so that if we let ! D ı in Lemma 2.32, we get kT k  kT 1 k < 1 C ". Choose norm one elements a1 , a2 , : : : , am 2 X  containing a basis of F and such that kx  k < .1 C ı/ sup jx  .aj /j j 2¹1,:::,mº

for all x  2 E (this can be done using the argument employed in the proof of Proposition 9.9, see page 268). Choose b1 , b2, : : : , bn to be a ı-net in the unit sphere of E such that b1 , : : : , bk is a basis for X \ E and b1 , : : : , br , r  k, is a basis for E. Then, def for 1  p  q D n  r , we have the unique scalars ¹tp,i º, 1  i  r , such that X br Cp D tp,i bi . i 2¹1,:::,r º

320

Chapter 10 Lipschitz free spaces

Define for 1  p  q numbers 8 ˆ