Exploring Continued Fractions: From the Integers to Solar Eclipses [1 ed.] 1470461285, 9781470461287

There is a nineteen\-year recurrence in the apparent position of the sun and moon against the background of the stars, a

188 4 68MB

English Pages 480 [503] Year 2021

Table of contents :
Cover
Title page
Copyright
Contents
Introduction
Strand I: Patterns
Tips on problem-solving and spotting patterns
A look ahead at three patterns
Chapter I: Tally Bones to the Integers
Tally bones
A table of primes?
The solution to a puzzle?
A base twelve or base sixty system?
Base ten, base twenty, base eight, base two
A binary digit interlude
Solving the shepherd’s puzzle and beyond
Three parting puzzles
Exercises
Strand II: Leibniz and the Binary Revolution
A continued fraction connection
Chapter II: Mathematical Induction
Set notation and the well-ordering principle
The principle of mathematical induction
The fundamental theorem of arithmetic
Equivalence classes
Nim*
Case Study: Mancala*
Mancala nim*
Exercises
Strand III: Al-Maghribî meets Sudoku
Chapter III: GCDs and Diophantine Equations
The greatest common divisor
An ancient algorithm for the greatest common divisor
The Diophantine solution
A litmus test for Euclid’s solution
Clock arithmetic
Systems of Diophantine equations
The totient is multiplicative
A problem from Diophantus’s Arithmetica
Exercises
Strand IV: Fractions in the Pythagorean Scale
A note-naming interlude
How Pythagoras generated his scale
Chapter IV: A Tree of Fractions
Unitary fractions in ancient Egypt
A continued fraction tradition
Farey sequences
A mediant interlude*
The Stern-Brocot tree
A grand finale*
Exercises
Strand V: Bach and The Well-Tempered Clavier
A well-tempered innovation
A musical interlude
An equal-tempered revolution
A continued fraction connection
Chapter V: The Harmonic Series
Case Study: Jeeps in the Desert
A look behind and a look ahead
A generating function finale*
Exercises
Strand VI: A Clay Tablet
The Babylonian number system
The accepted transliteration of Plimpton 322
Reciprocal pairs generate normalized Pythagorean triples
Finding the realm of potential generators
How the scribe may have screened for generators
The purpose of the tablet
Chapter VI: Families of Numbers
Primitive Pythagorean triples
Binomial coefficients
Fibonacci numbers
The continued fraction recursion for 𝑒
The Catalan numbers*
Ben-Hur numbers*
Pogo-stick hikes along continued fractions
Exercises
Strand VII: Planetary Conjunctions
A few conjunction stories
A rough guess
A numerical approach
A continued fraction approach
Chapter VII: Simple and Strange Harmonic Motion
A heavenly approach to circular motion
An earthly approach to circular motion*
Strange harmonic motion
A where, what, and why interlude
The harmonic algorithm
A blue moon application
Exercises
Strand VIII: The Size and Shape of Utopia Island
Chapter VIII: Classic Elliptical Fractions
The prehistory of the ellipse
The trammel of Archimedes
An old elliptical puzzle
A model for the heavens
Newton’s case for a flattened Earth*
The French expeditions to Peru and Lapland
A final riddle
Exercises
Strand IX: The Cantor Set
A lotus-flower introduction
Ternary notation
A reality check*
Chapter IX: Continued Fractions
A local approach to continued fractions
A global approach to continued fractions
A plethora of continued fractions
Why the ugly duckling 𝐺 is really a swan
An interlude delineating Algorithm 𝑂*
Dominance domains
The harmonic algorithm is a chameleon
Applying continued fractions to factoring integers
The first infinite continued fraction
Black holes and the receding Moon
Exercises
Strand X: The Longevity of the 17-year Cicada
Chapter X: Transits of Venus
A historical interlude
A Venus-Earth-Sun model
Conditions for a transit to occur
Recognizing the pattern
A reality check
An easier way to determine when transits occur
A final thought
Exercises
Strand XI: Meton of Athens
Chapter XI: Lunar Rhythms
Predicting the time lapse between successive new moons
Checking the expected length of short and long spans
Expected value of the variation in spans of years*
Final thoughts
Exercises
Strand XII: Eclipse Lore and Legends
Chapter XII: Diophantine Eclipses
Adapting the Earth-Moon-Sun model
Eclipse duration
A sufficient condition for eclipses
Finding 𝐻 at any lunation
Using Condition 1 to find the lapse between successive eclipses
Continued fraction insight
Some Diophantine magic
Lunar eclipses
A reality check
A final note
Exercises
Appendix I: List of Symbols Used in the Text
Appendix II: An Introduction to Vectors and Matrices
Appendix III: Computer Algebra System Codes
Appendix IV: Comments on Selected Exercises
Bibliography
Index
Back Cover

Recommend Papers

$Continued fractions 9780486696300, 0486696308$

Continued fractions 9780486696300, 0486696308

Elementary-level text by noted Soviet mathematician offers superb introduction to positive-integral elements of theory o

348 54 420KB Read more

$Continued fractions 9789810210472, 9810210477$

Continued fractions 9789810210472, 9810210477

This book presents the arithmetic and metrical theory of regular continued fractions and is intended to be a modern vers

341 57 1MB Read more

CONTINUED FRACTIONS 9789812564771, 9789812774682, 9812564772

376 88 2MB Read more

$Continued fractions [1ST ed.]$

Continued fractions [1ST ed.]

408 58 1MB Read more

Continued Fractions 9780883856093, 9780883859261, 0883859262

Front Cover; Continued Fractions; Copyright Page; Contents; Preface; Chapter 1. Expansion of Rational Fractions; 1.1 Int

340 71 5MB Read more

Multidimensional Continued Fractions 9780198506867, 0198506864

395 36 14MB Read more

$Analytic theory of continued fractions 9780821821060, 0821821067$

Analytic theory of continued fractions 9780821821060, 0821821067

The theory of continued fractions has been defined by a small handful of books. This is one of them. The focus of Wall&#

390 15 3MB Read more

$Continued fractions and Pade approximants 0444881697, 9780444881694$

Continued fractions and Pade approximants 0444881697, 9780444881694

368 93 4MB Read more

Continued Fractions. [illustrated edition] 9789812564771, 9812564772

The Euclidean algorithm is one of the oldest in mathematics, while the study of continued fractions as tools of approxim

112 37 382KB Read more

$Continued fractions [illustrated edition] 9789812564771, 9812564772$

Continued fractions [illustrated edition] 9789812564771, 9812564772

The Euclidean algorithm is one of the oldest in mathematics, while the study of continued fractions as tools of approxim

371 22 982KB Read more

Exploring Continued Fractions: From the Integers to Solar Eclipses [1 ed.]
1470461285, 9781470461287

Author / Uploaded
Andrew J. Simoson

0 0 0
Like this paper and download? You can publish your own PDF file online for free in a few minutes! Sign Up

File loading please wait...

Citation preview

AMS / MAA

DOLCIANI MATHEMATICAL EXPOSITIONS

Exploring Continued Fractions From the Integers to Solar Eclipses

Andrew J. Simoson

VOL 53

Exploring Continued Fractions: from the Integers to Solar Eclipses

AMS/MAA

DOLCIANI MATHEMATICAL EXPOSITIONS

VOL 53

Exploring Continued Fractions: from the Integers to Solar Eclipses Andrew J. Simoson

Dolciani Mathematical Expositions Editorial Board Harriet S. Pollatsek, Editor Priscilla S. Bremser Alfred M. Dahma Elizabeth Denne Emily H. Moore Katharine Ott

Thomas A. Richmond C. Ray Rosentrater Ayse A. Sahin Dan E. Steffy

2010 Mathematics Subject Classification. Primary 11J70, 00A05, 70F15.

For additional information and updates on this book, visit www.ams.org/bookpages/dol-53

Library of Congress Cataloging-in-Publication Data Names: Simoson, Andrew J., author. Title: Exploring continued fractions : from the integers to solar eclipses / Andrew J. Simoson. Description: Providence, Rhode Island : MAA Press, [2019] | Series: Dolciani mathematical expositions ; volume 53 | Includes bibliographical references and index. Identifiers: LCCN 2018042445 | ISBN 9781470447953 (alk. paper) Subjects: LCSH: Continued fractions. Classification: LCC QA295 .S615 2019 | DDC 512.7/2–dc23 LC record available at https://lccn.loc.gov/2018042445

Copying and reprinting. Individual readers of this publication, and nonprofit libraries acting for them, are permitted to make fair use of the material, such as to copy select pages for use in teaching or research. Permission is granted to quote brief passages from this publication in reviews, provided the customary acknowledgment of the source is given. Republication, systematic copying, or multiple reproduction of any material in this publication is permitted only under license from the American Mathematical Society. Requests for permission to reuse portions of AMS publication content are handled by the Copyright Clearance Center. For more information, please visit www.ams.org/publications/pubpermissions. Send requests for translation rights and licensed reprints to [email protected]. © 2019 by the American Mathematical Society. All rights reserved. The American Mathematical Society retains all rights except those granted to the United States Government. Printed in the United States of America. ∞ The paper used in this book is acid-free and falls within the guidelines ⃝

established to ensure permanence and durability. Visit the AMS home page at https://www.ams.org/ 10 9 8 7 6 5 4 3 2 1

24 23 22 21 20 19

In memory of my mother

Marion Valencia Enquist Simoson (1925–1990) who modeled both beauty and whimsy in art and music through the vicissitudes of life.

Contents Introduction Strand I: Patterns Tips on problem-solving and spotting patterns A look ahead at three patterns

xiii 1 2 4

Chapter I: Tally Bones to the Integers Tally bones A table of primes? The solution to a puzzle? A base twelve or base sixty system? Base ten, base twenty, base eight, base two A binary digit interlude Solving the shepherd’s puzzle and beyond Three parting puzzles Exercises

9 9 10 12 13 15 16 19 21 23

Strand II: Leibniz and the Binary Revolution A continued fraction connection

29 34

Chapter II: Mathematical Induction Set notation and the well-ordering principle The principle of mathematical induction The fundamental theorem of arithmetic Equivalence classes Nim∗ Case Study: Mancala∗ Mancala nim∗ Exercises

37 37 41 43 45 47 57 60 63

Strand III: Al-Maghribî meets Sudoku

69

Chapter III: GCDs and Diophantine Equations

73 vii

viii

Contents The greatest common divisor An ancient algorithm for the greatest common divisor The Diophantine solution A litmus test for Euclid’s solution Clock arithmetic Systems of Diophantine equations The totient is multiplicative A problem from Diophantus’s Arithmetica Exercises

74 78 85 88 89 92 93 93 94

Strand IV: Fractions in the Pythagorean Scale A note-naming interlude How Pythagoras generated his scale

99 100 102

Chapter IV: A Tree of Fractions Unitary fractions in ancient Egypt A continued fraction tradition Farey sequences A mediant interlude∗ The Stern-Brocot tree A grand finale∗ Exercises

107 108 110 111 116 118 130 132

Strand V: Bach and The Well-Tempered Clavier A well-tempered innovation A musical interlude An equal-tempered revolution A continued fraction connection

139 141 142 144 145

Chapter V: The Harmonic Series Case Study: Jeeps in the Desert A look behind and a look ahead A generating function finale∗ Exercises

147 157 162 163 166

Strand VI: A Clay Tablet The Babylonian number system The accepted transliteration of Plimpton 322 Reciprocal pairs generate normalized Pythagorean triples Finding the realm of potential generators How the scribe may have screened for generators The purpose of the tablet

169 170 172 174 178 181 182

Contents

ix

Chapter VI: Families of Numbers Primitive Pythagorean triples Binomial coefficients Fibonacci numbers The continued fraction recursion for 𝑒 The Catalan numbers∗ Ben-Hur numbers∗ Pogo-stick hikes along continued fractions Exercises

185 185 186 190 195 197 203 209 211

Strand VII: Planetary Conjunctions A few conjunction stories A rough guess A numerical approach A continued fraction approach

221 221 222 223 224

Chapter VII: Simple and Strange Harmonic Motion A heavenly approach to circular motion An earthly approach to circular motion∗ Strange harmonic motion A where, what, and why interlude The harmonic algorithm A blue moon application Exercises

229 229 234 240 244 246 251 253

Strand VIII: The Size and Shape of Utopia Island

261

Chapter VIII: Classic Elliptical Fractions The prehistory of the ellipse The trammel of Archimedes An old elliptical puzzle A model for the heavens Newton’s case for a flattened Earth∗ The French expeditions to Peru and Lapland A final riddle Exercises

271 272 274 275 278 280 289 295 299

Strand IX: The Cantor Set A lotus-flower introduction Ternary notation A reality check∗

303 303 305 308

Chapter IX: Continued Fractions

311

x

Contents A local approach to continued fractions A global approach to continued fractions A plethora of continued fractions Why the ugly duckling 𝐺 is really a swan An interlude delineating Algorithm 𝑂∗ Dominance domains The harmonic algorithm is a chameleon Applying continued fractions to factoring integers The first infinite continued fraction Black holes and the receding Moon Exercises

311 318 322 328 330 331 332 335 336 340 345

Strand X: The Longevity of the 17-year Cicada

351

Chapter X: Transits of Venus A historical interlude A Venus-Earth-Sun model Conditions for a transit to occur Recognizing the pattern A reality check An easier way to determine when transits occur A final thought Exercises

357 358 362 364 368 373 375 375 376

Strand XI: Meton of Athens

379

Chapter XI: Lunar Rhythms Predicting the time lapse between successive new moons Checking the expected length of short and long spans Expected value of the variation in spans of years∗ Final thoughts Exercises

383 384 389 391 393 395

Strand XII: Eclipse Lore and Legends

399

Chapter XII: Diophantine Eclipses Adapting the Earth-Moon-Sun model Eclipse duration A sufficient condition for eclipses Finding 𝐻 at any lunation Using Condition 1 to find the lapse between successive eclipses Continued fraction insight

405 405 408 408 410 412 412

Contents Some Diophantine magic Lunar eclipses A reality check A final note Exercises

xi 415 418 419 420 421

Appendix I: List of Symbols Used in the Text

425

Appendix II: An Introduction to Vectors and Matrices

429

Appendix III: Computer Algebra System Codes

437

Appendix IV: Comments on Selected Exercises

453

Bibliography

465

Index

473

Introduction This book is a gentle introduction to continued fractions by way of pattern recognition and applications.

What is a continued fraction? Definition 1: Finite simple continued fractions. A finite simple continued fraction is a finite list of integers 𝑛𝑖 , called partial denominators, with 𝑛𝑖 > 0 for all 𝑖 > 0, denoted by [𝑛0 ; 𝑛1 , 𝑛2 , 𝑛3 , … , 𝑛𝑘 ], where integer 𝑖 ranges from 0 to some integer 𝑘. Associated with this continued fraction are 𝑘 + 1 fractions, 𝐶0 , 𝐶1 , through 𝐶𝑘 , referred to as convergents, whose values are 𝐶0 = 𝑛0 ,

𝐶1 = 𝑛0 +

1 , 𝑛1

1

𝐶2 = 𝑛0 +

𝑛1 +

1 𝑛2

,

and so on. To illustrate, convergent 1 for the simple continued fraction [1; 2, 3, 4, 5] is 1 3 𝐶1 = 1 + = , whereas convergent 4 is 2

2

1

𝐶4 = 1 +

=

1

2+

225 . 157

1

3+

4+

1 5

225

Since 𝐶4 is the last convergent, we write = [1; 2, 3, 4, 5]. 157 Continued fractions may be infinitely long. xiii

xiv

Introduction

Definition 2: Infinite simple continued fractions. An infinite simple continued fraction is a list of integer-valued partial denominators 𝑛𝑖 with 𝑛𝑖 > 0 for 𝑖 > 0, denoted by [𝑛0 ; 𝑛1 , 𝑛2 , 𝑛3 , …], where 𝑖 is a nonnegative integer. As in the finite case, the infinite continued fraction has convergents 𝐶𝑖 computed as given in Definition 1. This time there are infinitely many convergents. As the name suggests, the sequence of convergents often has a limit 𝜔, in which case we write 𝜔 = [𝑛0 ; 𝑛1 , 𝑛2 , 𝑛3 , …]. We will discuss questions of convergence in Chapter IX. Each positive rational number can be written as a finite simple continued fraction. Each infinite simple continued fraction evaluates to some irrational number, and every positive irrational number can be written as a simple continued fraction, as we illustrate at various times through the text. For example, the simple continued fraction representation for √2 is √2 = [1; 2, 2, 2, …] = [1; 2]. Its convergents (along with their decimal approximations) are 1,

7 17 41 99 3 = 1.5, = 1.4, ≈ 1.41667, ≈ 1.41379, ≈ 1.41429. 2 5 12 29 70

As the term simple continued fraction implies, not all continued fractions are simple. In Chapter IX, numerators of the partial denominators will be permitted to be 1 or −1. The notation for these continued fractions is [𝑛0 ; 𝜖𝑖 𝑛1 , 𝜖2 𝑛2 , 𝜖3 𝑛3 , …] where 𝜖𝑖 is either 1 or −1, with convergents 𝐶0 = 𝑛0 ,

𝐶1 = 𝑛0 +

𝜖1 , 𝑛1

𝐶2 = 𝑛0 +

𝜖1 𝑛1 +

𝜖2 , 𝑛2

and so on. For example, 𝜋 can be written as the continued fraction 𝜋 = [3; 7, 16, −294, …] → {𝐶0 = 3, 𝐶1 =

22 355 104348 ,𝐶 = ,𝐶 = , …}. 7 2 113 3 33215

(1)

Much more general continued fraction examples are given on page 35 of Strand II, in Exercise VI.10c, and in Example IX.43.

Introduction

xv

Of what use is a continued fraction? Suppose we wish to approximate the number 𝜋. The first six digits of 𝜋’s decimal expansion are 3.14159. Therefore, one seemingly good approximation for 𝜋 as a fraction in lowest terms should be 314159 355 . However, from (1), 𝜋’s second convergent, 𝐶2 = , is simpler than 100000 314159 100000

113

and is an order of magnitude closer to 𝜋:

||𝜋 − 314159 || ≈ 2.65 × 10−6 | 100000 |

versus

||𝜋 − 355 || ≈ 2.67 × 10−7 . | 113 |

Continued fractions are, in general, an elegant way to find good fractional approximations for irrational numbers. In general, when measuring phenomena using given units, the numbers we get often appear to have no repeating pattern in their decimal expansions. The diagonal of the unit square has length √2. The old Greek puzzle about doubling the 3 volume of a cube involves scaling its side length by √ 2. The ratio of a circle’s circumference to its diameter is 𝜋. Correct to five decimal places (using kilograms, meters, and seconds), the universal gravitational constant is 𝐺 ≈ 6.67408×10−11 . Wherever we look we find what appear to be irrational numbers. Whenever we approximate, we use rational numbers. Strange as it might at first sound, oftentimes the rational approximations given to us by continued fractions enable us to see patterns more clearly in our universe.

A brief outline of this book. This book is an exploration of continued fractions. It includes brief forays into ideas that are from outside elementary number theory, yet are part of the standard undergraduate mathematics curriculum. Besides number theory, the text uses elements of calculus (limits, integrals, and series), vector calculus, discrete mathematics, linear algebra, probability, mathematical statistics, combinatorics, graph theory, geometry, differential equations, and analysis, as well as allusions to abstract algebra. As a guide to the reader, those sections of the book that include such forays or include enrichment material on a particular idea are marked with an asterisk. Thus, for example, the reader will see the asterisk in the Chapter VIII section Newton’s case for a flattened Earth∗ . This asterisk is a cue that this section is optional for understanding the book and may be a section to skip on a first reading. This book contains twelve strands and twelve chapters. The strands are meant to be somewhat light-hearted introductions to the following chapter. They involve a single idea, puzzle, or personality related in some way to the material of the subsequent chapter. While successive chapters of the book are related to previous chapters and foreshadow later chapters, each chapter can be read more or

xvi

Introduction

less on its own. Although the book is structured to reach a climax in Chapter IX on continued fractions, each chapter is also an end in and of itself. Here is an outline of the chapters. • Chapter I starts with the set of positive integers. Rearranging heaps of 𝑛 pebbles into arrays of 𝑝 rows of 𝑞 pebbles, where 𝑝 and 𝑞 are integers, soon leads to the discovery of the fundamental theorem of arithmetic, and gives a natural way to think of taking parts of a whole. That is, splitting 𝑝𝑞 into 𝑝 equal 1 parts leads to the idea of the unitary fraction , a fundamental building block 𝑝

of continued fractions. • Chapter II presents the well-ordering principle and mathematical induction, which, among other things, gives a division algorithm. With this tool, we show how to find the simple continued fraction of any fraction. • Chapter III shows how a recursive application of the division algorithm leads to Euclid’s method for finding the greatest common divisor of two positive integers and for solving Diophantine equations. As we will see, Euclid’s method for the greatest common divisor of two positive integers is equivalent to finding the simple continued fraction representation for the ratio of the given two integers. 𝑝

• Chapter IV shows how each positive (non-integer) fraction , where 𝑝 and 𝑞 𝑞

𝑎

𝑐

are integers, is a combination of two unique simpler fractions and that 𝑏 𝑑 solve the Diophantine equation 𝑝𝑥 − 𝑞𝑦 = ±1. This structure defines a tree of fractions and allows us to identify each fraction between 0 and 1 with a tree address. In fact, finite continued fractions allow us to find the tree address of any fraction, and to know the fraction at any address. • In Chapter V, the idea of unitary fractions from earlier chapters extends naturally to the harmonic series whose terms 𝐻𝑛 are the sums of the first 𝑛 unitary fractions. Recall from calculus that Euler’s constant 𝛾 is the limit of the difference between 𝐻𝑛 and ln 𝑛. We illustrate the generation of a non-simple continued fraction using 𝛾, where the numerators of the partial denominators are allowed to be either 1 or −1 (rather than always being 1). • In Chapter VI, we generate families of numbers recursively and, in doing so, explore series. Recall from calculus that the real natural number 𝑒 is the sum ∞ 1 of an infinite number of unitary fractions: 𝑒 = ∑𝑖=0 . We show how to find 𝑖! 𝑒’s infinite simple continued fraction representation, 𝑒 = [𝑛0 ; 𝑛1 , 𝑛2 , …], and determine an explicit formula that yields the partial denominator 𝑛𝑖 for any desired positive integer 𝑖.

Introduction

xvii

• Chapter VII is about simple harmonic motion—the approximate motion of some planets and satellites about their suns or planets, respectively. We show how simple harmonic motion gives a geometrical algorithm for finding a continued fraction equal to a given number. We explore more traditional algorithms in Chapter IX. • Chapter VIII showcases a few classic ratios involving the ellipse from the seventeenth and eighteenth centuries, ratios which we then represent as continued fractions. • In Chapter IX, we present a variety of continued fraction algorithms, and show that each positive irrational number 𝜔 has many convergent continued fraction representations. When restricting integer partial denominators 𝑛𝑖 to be either only positive integers or to always have magnitude at least 2 (when 𝑖 ≥ 1), we show that every infinite continued fraction converges using the tree of fractions between successive integers presented in Chapter IV. • The final three chapters are applications of continued fractions to the motion of the Moon, Earth, and Venus.

How could this book be used in the undergraduate mathematics classroom? As a minimal requirement for reading this book, the reader should be aware of mathematical induction, a topic often introduced in Discrete Mathematics (or any bridge course to writing proofs) or Calculus I. A subset of the chapters could serve as at least one of the texts for a course on number theory (Chapters I through IV, IX, and XII and some of the strands such as Strand VI), for a course on the history of mathematics (any subset works), or for a capstone course. A brave soul might use the book as a text for Discrete Mathematics. Readers interested in following a minimal path to the applications of the later chapters should familiarize themselves with Diophantine equations in Chapter III; be able to compute the general mediant of two neighboring Farey fractions, and to find the mother and father fractions for any given fraction in the Stern-Brocot tree from Chapter IV; understand the continued fraction constructions of Example II.4, Example III.8, Puzzle V.6, and Example VI.31; and read the first half of Chapter IX. As aids to the reader, the appendices include the following items. • A list of symbols used throughout the text. • An introduction to vectors and matrices with respect to the matrix multiplication of Chapters IV and X. • Algorithmic code for a score of algorithms introduced in the text. In addition to a presentation of the Mathematica code for many of the algorithms used

xviii

Introduction

in this text, we provide access to them via an AMS website www.ams.org/ bookpages/dol-53 as both a pdf file and a Mathematica notebook. For most of these selections, the code is easily adaptable to any computer algebra system (CAS). • Comments on selected exercises. Snippets of this book have appeared in print over the years. Strand II is a version of A. Simoson, Life lessons from Leibniz, Math Horizons 22:4 (2015) 5–7, 29 © Mathematical Association of America, 2015, all rights reserved. Strand V is an adaptation of B. Linderman and A. Simoson, A Bach diesel canon, Math Horizons 25:4 (2018) 5–7 © Mathematical Association of America, 2018, all rights reserved. Strand VIII is an expanded version of A. Simoson, Minimizing Utopia, Math Horizons 23:3 (2016) 18–21 © Mathematical Association of America, 2016, all rights reserved, a version of which, The size and shape of Utopia, also appeared in the Proceedings of the Bridges Jyväskylä 2016 Conference [139]. A portion of R. Fillers, B. Linderman, and A. Simoson, Mancala as nim, Coll. Math. J. 45:5 (2014) 350– 356 © Mathematical Association of America, 2014, all rights reserved, appears as a case study in Chapter II. Strand VI is an expanded version of A. Simoson, Extrapolating Plimpton 322, Coll. Math. J., 50:3, © Mathematical Association of America, 2019, all rights reserved. A condensed version of J. Dodge and A. Simoson, Ben-Hur staircase climbs, Coll. Math. J. 43:4 (2012) 274–284 © Mathe- matical Association of America, 2012, all rights reserved, appears as an example in Chapter VI. Adaptations and combinations of A. Simoson, Newton’s radii, Maupertuis’ arc length, and Voltaire’s giant, Coll. Math. J. 42:3 (2011) 274–284 © Mathematical Association of America, 2011, all rights reserved, and A. Sim- oson, Newton’s 501 jeans, The Mathematical Scientist 43:1 (2018) 1–9 © Applied Probability Trust, 2018, appear as a case study in Chapter VIII. Chapter X is a version of A. Simoson, Periodicity domains and the transit of Venus, Amer. Math. Monthly 121:4 (2014) 283–298 © Mathematical Association of America, 2011, all rights reserved. Chapter XI is an expanded version of A. Simoson, Lunar rhythms and strange signatures, The Mathematical Scientist 41:1 (2016) 25–39 © Applied Probability Trust, 2016. Chapter XII is a version of A. Simoson, Diophantine eclipses, The Mathematical Scientist 42:2 (2017) 74–89 © Applied Probability Trust, 2017. This book contains some whimsy. Musings on the Ishango bone are pushed to the limit in Chapter I. Mancala of Chapter II analyzed as nim is probably intractable for most configurations. We translate the firing sequence of a twelvecylinder engine into a musical score, even though a typical diesel train engine makes 500 to 1500 rotations per minute. We consider the problem of dropping a small black hole at Earth’s surface, and we make conjectures about the longevity of the 17-year cicada.

Introduction

xix

Numerous illustrations appear in the text. Where noted in a figure’s caption, permission use has been granted. Figures appearing without acknowledgment are in the public domain. Some of the figures are my sketches. The flower figure on the cover is meant to be a visual characterization for the optimal continued fraction convergents to the natural number 𝑒, as explained fully in [143]. Finally, I wish to thank a number of people. • A colleague Bill Linderman who rendered the diesel canon for the diesel engine firing of Chapter I and Strand V as a musical score using the software Sibelius. • King University students Rhianna Fillers, Shuler Hopkins, and Sam Barker for joint work on nim of Chapter II and the synchronization example of the 17-year cicada in Strand X. • Keri-Lynn Paulson, a King University librarian, who helped secure various high-resolution images from a number of libraries. • Dan Kalman for directing my attention to strange Strang figures upon seeing my analysis of the signatures of strange harmonic motion as presented in Chapter VII. • King University astronomer Raymond Bloomer for fielding many questions about celestial mechanics. • The trustees of King University for a 2016 spring semester sabbatical to focus on this manuscript. • Senior Acquisitions Editor Stephen Kennedy for encouraging me to submit a book manuscript featuring continued fractions. • The Dolciani Committee chaired by Harriet Pollatsek for careful reading of multiple drafts of the manuscript. • Senior Production Editor Lauren Foster for rendering the manuscript into its published form, and allowing me to make last minute revisions. 25 March 2019

Strand I: Patterns On 20 July 1963, a solar eclipse occurred as seen from northern Minnesota. I was ten years old and had been using a pin-hole camera to view the action of the Moon encroaching upon the disk of the Sun. Like many people who have viewed or will view a solar eclipse,1 I wondered, How can one predict these events?

Figure 1. Viewing an eclipse with a pin-hole camera, author sketch.

As I was to learn much later in what are now the details of this book, the specific answer to my question involves the properties of the integers, the greatest common divisor of two positive integers, ratios of real numbers, and continued fraction approximations for real numbers. But more generally, my question was, How do we recognize patterns?

1 The solar eclipse of 21 August 2017 mesmerized most inhabitants of the continental USA. As of the printing of this book, the next solar eclipse to mesmerize us across the continental USA will occur on 8 April 2024, as discussed on p. 420.

1

2

Strand I: Patterns

Figure 2. Camille Flammarion, L’Atmosphere: Météorologie Populaire, Paris (1888), p. 163. Source: Wikimedia Commons.

Tips on problem-solving and spotting patterns Imagine yourself an ancient Babylonian astronomer/astrologer. You have been commissioned by the king to predict the next solar or lunar eclipse without fail, a task we refer to as problem 𝒫. You have records from the past several hundred years from remote parts of the empire and beyond, giving the dates of previous eclipse occurrences. In fact, you can arrange them in order in time as 𝑎0 , 𝑎1 , 𝑎2 , 𝑎3 , 𝑎4 , … ,

(1)

hundreds of data points. You search through the list looking for patterns. Your life itself may depend on what you find. A sense of urgency may help one look long and hard to see patterns towards solving 𝒫. Beyond that tip, what can we do? In the 1945 book How to Solve It, George Pólya (1887–1985) suggests a variety of problem-solving strategies [119]. We shall use all of the following Pólya tips as we try to solve 𝒫 and, more generally, as we explore continued fractions. • Introduce suitable notation to keep track of progress towards solving 𝒫. As of now, we have notation for finite simple continued fractions given in Definition 1 of the Introduction: [𝑛0 ; 𝑛1 , 𝑛2 , … , 𝑛𝑘 ]. Much more notation is coming.

Tips on problem-solving and spotting patterns

3

• To gain familiarity with 𝒫, sketch pictures and graphs modeling 𝒫. For example, when we consider a geometric version of continued fractions as introduced in Example 3 below and as pursued more carefully in Chapter VII, we use multiple graphs as aids to understanding, such as the one in Figure 3, p. 5. • Solve simpler or related versions of 𝒫. For example, consider the problem of two runners on a circular track, running at different constant speeds. When will they next meet again given that they started at the same place at time 0? • Consider special cases of problem 𝒫. For example, solving the problem of how often new moons occur is a step towards solving 𝒫. • Take advantage of symmetry within 𝒫. Observe that two types of eclipses exist: solar and lunar. Perhaps these two phenomena follow similar patterns. • Guess! Make sub-conjectures. For example, observe also that two kinds of solar eclipses occur. Sometimes the diagonal path of the Moon, from left to right (in the northern hemisphere), across the face of the Sun ascends, and sometimes the diagonal path descends. This observation might help in solving 𝒫. • Generalize. For example, in the Introduction we first established a finite simple continued fraction in Definition 1. We then generalized to an infinite simple continued fraction in Definition 2. • Sometimes, to simplify a problem, it helps to complicate it first. For example, to continue with the previous hint, in Chapter IX we generalize further and allow numerators of partial denominator terms for continued fractions to be either 1 or −1. Why should we wish to complicate an already complicated construct? As we demonstrate in Example V.6, sometimes a more complicated approach solves some problems more quickly than a simpler one. • Be creative! Here is a classic example of what this hint might mean. From Plato’s Phaedo, just before Socrates drank hemlock to fulfill an Athenian death sentence, Socrates said of our ability to understand physical phenomena: If anyone should come to the top of the air [atmosphere], he could lift his head above it and see, as fishes lift their heads out of the water and see the things in our world, so he would see things in that upper world, things even more superior to those in this world. The engraving of Figure 2 shows a veritable Socratic figure lifting his head above normal perception to glimpse the mechanism by which the heavens move. Sometimes a change of perspective—if only we could stumble across a good one—allows us to see a solution to problem 𝒫.

4

Strand I: Patterns

The above list is helpful, but not exhaustive because the process of discovering new mathematics is also almost magical. Here are how two legendary mathematicians perceived this discovery process. Responding to an interview question [2], What’s the best part of being a mathematician, Paul Halmos (1916–2006) said: I’m not a religious man, but it’s almost like being in touch with God when you’re thinking about mathematics. God is keeping secrets from us, and it’s fun to try to learn some of the secrets. The itinerant mathematician Paul Erdős (1913–1996) had similar thoughts. As his biographer describes it [71], with respect to Erdős’s perspective, God was the Supreme Fascist, the Number-One Guy Up There, God, who was always tormenting Erdős by hiding his glasses, stealing his Hungarian passport, or, worse yet, keeping to Himself the elegant solutions to all sorts of intriguing mathematical problems. Erdős often alluded to The Book, a volume kept by God containing the best mathematical proofs. It is high praise indeed to say of any particular mathematical argument, “Aha, this is from The Book.” We close this strand with three problems and patterns to be explored in this book.

A look ahead at three patterns Example 1: Meton and a nineteen-year calendar. Summer, fall, winter, spring—in temperate climes, we can literally feel the change in the seasons, and soon we recognize the phenomenon of year-ness. Looking at the Moon’s phases, we soon recognize the phenomenon of moon-ness, or month-ness. Meton2 was a fifth-century bc Athenian astronomer who championed a nineteen-year calendar of 235 months. Every nineteen years, as the Babylonians before him had noticed, the Sun and the Moon against the background starry skies return to the same orientation of what had been. Such a period is not readily discerned. It is hidden within the pattern of the yearly cycle. Yet if we take the ratio of the period of the Moon about Earth and the period of Earth about the Sun, one of the first 235 few simple continued fraction convergents for this ratio is the fraction . This 19 approximation allows us to see what had been camouflaged. We explore this phenomenon more deeply in Chapter XI. ♢ Example 2: Newton and the idea of gravity. The Babylonians were careful observers of the heavens. The Greeks borrowed this tradition of record-keeping, 2A

sketch of Meton’s life appears in Strand XI.

A look ahead at three patterns

5

and to it added a model of circular motion about Earth, culminating in the Ptolemaic system of planetary orbits of circles bedecked by epicycles. Figure 2 is a fanciful glimpse of this clockwork-like system. After years of examining careful planetary data compiled by Tycho Brahe (1546–1601), Johannes Kepler concluded that planetary orbits about the Sun are ellipses, and formulated three laws of planetary motion by 1619. In 1687, Isaac Newton published the Principia, deriving Kepler’s laws from first principles. Yet Newton was somewhat sheepish concerning his principle of universal gravitation, the idea that any two particles of matter in the universe attract each other. Newton had no explanation as to why or how such a force might exist. But if such a force did exist, voilà, we have an elegant explanation and understanding of celestial mechanics. Savants3 admired his work but, especially on the continent, shook their heads in disbelief. Fifty years later the notion of gravity morphed into a common idea. Yet the initial, fundamental idea of gravity, that matter attracts itself, arose from intuition. It was thinking outside the box. It was pattern recognition. My favorite portion of the Principia is this puzzle: If the ratio of Earth’s polar radius to its equatorial radius is 100 to 101, then what ratio of integers gives the ratio of gravity at the north pole to gravity at the equator? 501

Newton’s answer is . Somewhat whimsically, we use continued fractions in 500 Chapter VIII to test whether his answer is the best possible one with denominator near 500. ♢ 1 2

3

4

5

1 0

113 100

200

0

6

Figure 3. Seven strands for 𝜋, labeled 0 through 6. 3 Prior

to 1800, the word scientist was rarely used, if ever.

300

6

Strand I: Patterns

Example 3: A geometrical algorithm for continued fractions. As noted on 22 355 page xiv, the fractions 𝐶1 = and 𝐶2 = in Equation (1) are approximations 7 113 to 𝜋. Observe that if we could somehow have identified the integers 7 and 113 as the denominators of two fractions that give good approximations to 𝜋 (without already knowing those fractions), then we can recover their numerators: 22 = [7𝜋]

and

355 = [113𝜋],

4

where [𝑥], the nearest integer function, is the integer nearest to 𝑥. As presented in Chapter VII, one way to identify these two denominators (7 and 113) is to inspect the graph of the set of points {(𝑛, sin 2𝜋2 𝑛)| 𝑛 ∈ ℤ} as shown in Figure 3, where ℤ is the set of integers. This set of ordered pairs appears to be seven sine-like strands of beads labeled 0 through 6. The horizontal translate distance between these successive strands just happens to be 113 to the nearest integer as labeled in the figure (the approximate translate distance between strands 0 and 1).

a. 𝜔 =

1+√5 2

.

b. 𝜔 = 𝜋.

Figure 4. Seed arrangements, 𝒫𝜔 , in ideal sunflowers, 0 ≤ 𝑛 ≤ 200. For any irrational number 𝜔, will the set 𝒮𝜔 = {(𝑛, sin 2𝜋𝜔𝑛)| 𝑛 ∈ ℤ}, referred to as the signature of 𝜔, exhibit similar behavior? Can we find two integers 𝑞1 and 𝑞2 such that the set appears to be 𝑞1 sine-like strands of beads where the horizontal translate distance between successive strands to the nearest integer is 𝑞2 ? Furthermore, with 𝑝1 = [𝑞1 𝜔] and 𝑝2 = [𝑞2 𝜔], will the difference between 𝑝2 𝑝 and 𝜔 be less than the distance between 1 and 𝜔? 𝑞2

𝑞1

In Chapter IX, we show that this geometric approach of using signatures to find fractional approximations for any given irrational number is equivalent to various standard continued fraction algorithms. 4 When

𝑥=𝑛+

1 2

with 𝑛 an integer, we stipulate that [𝑥] = 𝑛.

A look ahead at three patterns

7

Disentangling the snake-like intertwining of 𝒮𝜔 ’s strands by using both the 𝑥-axis and the 𝑦-axis results in a pattern called phyllotaxis in plants, the arrangement of seeds, petals, or leaves [106], which can be modeled by the set 𝒫𝜔 = {𝑔(𝑛)(cos 2𝜋𝜔𝑛, sin 2𝜋𝜔𝑛)| 𝑛 ∈ ℤ, 𝑛 ≥ 0} where 𝑔(𝑛) is a function for the distance of the point 𝑔(𝑛)(cos 2𝜋𝜔𝑛, sin 2𝜋𝜔𝑛) from the origin. Figure 4a shows the phyllotaxis of seeds of a sunflower with 1+√5

𝑔(𝑛) = √𝑛 and 𝜔 = , the golden mean. Similarly, Figure 4b shows the 2 phyllotaxis for a flower with 𝜔 = 𝜋. Note that this latter flower has seven spiral strands—much like the seven sine-like strands of 𝒮𝜔 in Figure 3. ♢

Chapter I: Tally Bones to the Integers In this chapter we explore the origin and nature of the integers. We outline the development of our number system from tally marks to number systems with respect to various bases. We investigate the structure of the integers themselves: the fundamental theorem of arithmetic, the idea that any positive integer can be expressed as a product of prime integers. Such structure allows us to look at parts of the whole and, in particular, integer factors of any given integer, which in turn leads naturally to fractions.

Tally bones Sitting around campfires after dining on subdued beasts of long ago, our forebears told stories, mended clothing and tools, played games, and perhaps etched records upon the bones of the beasts. A petrified baboon bone of mathematical significance was unearthed by Jean de Heinzelin in 1950 at an archeological dig at Ishango near Lake Edward bounded by the Congo and Uganda, one of the headwaters of the Nile. Considered to be at least 11 000 years old [75], this ten-centimeter-long bone bears a peculiar arrangement of tally marks. Shown in Figure 5, side 1 of the bone has two rows 𝐴 and 𝐵 of notches, whereas side 2 has but one row 𝐶 of notches. These notches are grouped by spacing. The numbers of notches are as indicated in the figure, so that row 𝐴 consists of 11, 13, 17, 19 notches, row 𝐵 consists of 11, 21, 19, 9 notches, and row 𝐶 consists of 3, 6, 4, 8, 10, 5, 5, 7 notches.5 Heinzelin has suggested that these bone markings might be the score in some unknown game. Marschack [97, pp. 27–32] speculated that they represent a lunar calendar of sorts, primarily because the sum of each row of side 1 is 60, twice the lunar cycle of about 30 days. Even though Joseph [79, p. 34] cautions against 5 Due to ambiguity in the tally markings, the group of ten notches on row 𝐶 may possibly be a group of nine notches.

9

10

Chapter I: Tally Bones to the Integers

Figure 5. The three rows 𝐴, 𝐵, and 𝐶 of the Ishango bone, on display at The Royal Belgian Institute of Natural Sciences, Brussels, author sketch. over-interpreting these tally marks, saying that “a single bone may well collapse under the heavy weight of conjectures piled upon it,” the temptation to propose additional conjectures is irresistible. We give some of these.

A table of primes? Definition 4: Divisibility. Let 𝑎 and 𝑏 be integers, with 𝑎 ≠ 0. We say that 𝑎 divides 𝑏, denoted by 𝑎|𝑏, if 𝑏 = 𝑎𝑐 for some integer 𝑐. Definition 5: Primes and irreducibles. An integer 𝑝 larger than 1 is prime6 if whenever 𝑝|𝑚𝑛 where 𝑚 and 𝑛 are integers, either 𝑝|𝑚 or 𝑝|𝑛. By convention, we disallow 1 as being prime. An integer greater than 1 is composite if it is not prime. A nonzero integer 𝑝 is irreducible if whenever another integer 𝑎 divides 𝑝, either 𝑎 equals ±𝑝 or 𝑎 = ±1. With respect to the Ishango bone, a first irresistible conjecture is that row 𝐶 followed by row 𝐴 is a characterization of two patterns starting with 3. In particular, the first pattern starts 2 ⋅ 3 = 6, 2 ⋅ 4 = 8 and ends 10 = 2 ⋅ 5, while the second pattern continues with 5, 7 and—once the bone is flipped over—proceeds on to row 𝐴 with the integers 11, 13, 17, 19. A modern-day observer might say of the second pattern, Aha, the primes from three through nineteen!—even though no other evidence exists anywhere suggesting that people were aware of the notion of primeness before, say, 3000 years ago. 6 As we will see in Proposition II.8, an integer 𝑝 > 1 is prime if and only if 𝑝 is irreducible. For the remainder of this chapter we shall assume this result is true. However, various number systems other than the integers exist in which some irreducible numbers (other than −𝑝 where 𝑝 is prime) fail to be prime. See Exercise 10cd for an example.

A table of primes?

11

Can we characterize the notion of a prime without formally alluding to multiplication? Yes. Imagine yourself a shepherd overseeing a group of twenty sheep.7 The animals more or less stay in one place, grazing. Your job is to prevent sheep from straying and to prevent predators from terrorizing the sheep. Most of your time is spent waiting. Meanwhile you have a pouch of pebbles, twenty of them, one for each sheep. You may not have names for counting from one to twenty, but you do know whether or not a sheep is missing. At times you cast the pebbles onto the smooth ground and arrange them into rows of uniform length. You discover three different rectangular arrangements where the number of rows is no more than the number of pebbles in each row as shown in Figure 6: one row of twenty, two rows of ten, and four rows of five.

one row of twenty two rows of ten

four rows of five

Figure 6. Three rectangular arrangements of twenty pebbles. If the herd had twenty-three sheep, the shepherd might at first surmise that with a larger number of pebbles, more rectangular arrangements than three might exist. But of course, the shepherd would be disappointed, for there exists only one rectangular arrangement. That is, the shepherd has discovered the notion of primeness, and is well on the way to answering Puzzle 6. Puzzle 6: A shepherd’s puzzle. Let 𝑛 be a positive integer. Into how many rectangular arrays can 𝑛 pebbles be arranged? ♢ We answer Puzzle 6 in Example 14. Meanwhile, we return to the Ishango bone. One bone pundit, whom we call Sam, discounts the idea of the Ishango bone as being the earliest known tabulation of primes by answering the following question. 7 Sheep

were first domesticated somewhere between 11000 and 9000 bc.

12

Chapter I: Tally Bones to the Integers

Puzzle 7: A probability puzzle. What is the likelihood of selecting four primes in order from the integers 1 through 30? 1 The answer Sam gave was . How so? Of the first thirty integers, ten are prime, so that

1 3

81

of these integers are prime. Thus the likelihood that four num1

4

bers chosen at random from the integers one through thirty are all prime is ( ) = 3

1

. Sam goes on to argue that hundreds of Ishango-like bones must exist. Since 81 only one of them has been found, Sam concluded that the four primes 11, 13, 17, and 19 were simply a random selection. However, let us interpret Puzzle 7 more carefully. Our set of four integers must all be prime, must be distinct, and must be successive primes, either in increasing order or in decreasing order. For the moment, let us imagine them as being in increasing order. Once we select the smallest 𝑝 of these prime numbers, the rest are chosen. Since 𝑝 could be a prime from 2 through 17, there are 7 ways of selecting these four prime integers. By symmetry (to allow for a decreasing order of primes), there are 14 ways to select four successive primes among the 30 integers. Furthermore, the number of ways to pick four integers where duplicate integers are allowed and where order is important is 304 . Therefore the likelihood that four prime integers like the ones on the Ishango bone should appear on a tally bone that is arbitrarily unearthed is 14/304 , that is, less than one in fifty thousand.8 Thus, it may very well be that the tally-maker of the Ishango bone knew something about primes. ♢

The solution to a puzzle? Another possible explanation for the markings on the Ishango bone is that they were answers to a puzzle, somewhat in keeping with Heinzelin’s original hunch [69]. For example, a series of stories, traditions, and beliefs collected by Littmann [90, pp. 36–37] in Ethiopia before 1910 includes this brainteaser. Puzzle 8: Camel loads of cloth. A man had three wives, and he went to the market to buy cloth for them, taking nine camels on which to carry the cloth. After he had bought the cloth he loaded the camels: On the first camel he put one load of cloth, on the second two loads, and so on. Doing this he put on each camel as many loads as its number. If he wishes to divide the cloth equally among his wives without unpacking the camels, what should he do? 8 Here is an alternate way to model Puzzle 7: Given that four distinct integers have been chosen at random in increasing order from the first 30 integers, what is the likelihood that all four are primes? This time, we need a binomial coefficient as described in Proposition VI.16 to conclude that the likelihood is 1 in 3915.

A base twelve or base sixty system?

13

Figure 7. Author sketch of Willard Wigan’s Nine camels in the eye of a needle micro-sculpture. One answer is that camels 1, 6, and 8 go to wife 1; camels 2, 4, and 9 go to wife 2; and camels 3, 5, and 7 go to wife 3. Thus each wife has 15 loads of cloth. Another solution is 1, 5, and 9 to wife 1; 2, 6, and 7 to wife 2; and 3, 4, and 8 to wife 3. ♢ Camels were first domesticated about four thousand years ago. Yet we can imagine a brainteaser similar to Puzzle 8 dating to eleven thousand or more years ago, the time when the Ishango-bone baboon lost its life. This time we use African elephants. Puzzle 9: Elephant loads of bananas. A tribe has four elephants—raised from orphaned calves—and a total of sixty clusters of bananas. Rather than load the clusters equally onto each elephant, how may the clusters be loaded onto the four elephants so that no two loads are the same? Side 1 of the Ishango bone gives two solutions. That is, place 11, 13, 17, 19 clusters onto the elephants from the least to the greatest elephant, or pack 9, 11, 19, 21 on them. ♢

A base twelve or base sixty system? In his analysis, Heinzelin notes that the sum of the marks on side 2 of the bone is 48 (whereas the two rows of marks on side 1 both sum to 60). Since both 48 and 60 are multiples of twelve, he speculates that this bone and its markings could very well be evidence of the beginnings of a base twelve number system. He goes on to conclude [75]: It is possible that the modern world owes one of its greatest debts [the idea of multiples of a number and a base for a number system] to the people who lived at Ishango. It is remarkable that the oldest clue to the use of a number system by man dates to central Africa of the Mesolithic period. What a bone! Of course, using twelve as a base for a counting system is a natural idea because in one year the Moon completes its cycle of phases approximately twelve times.

period. period. What What a bone! a bone! OfOf course, course, using using twelve twelve as as a base a base forfor a counting a counting system system is is a natural a natural idea idea be-because cause in in one one year year thethe Moon Moon completes completes itsits cycle cycle of of phases phases approximately approximately twelve twelve times. times. 14 Chapter I: Tally Bones to the Integers

▽ ▽

a. The The 12finger 12 finger finger bones bones ofthe of thethe right right hand. hand. b. b. Two Two twelves twelves and five five ones is 29. is 29. a.a.The 12 bones of right b. Two twelves andand five ones isones 29. hand. Figure Figure 8. 8.Signing Signing anan integer, integer, author author sketch. sketch. Figure 8. Signing an integer, author sketch. Pletser Pletser and and Huylebrouck Huylebrouck [114] [114] describe describe a combination a combination of of a base a base twelve twelve and and base base sixty sixty number number system system as as is found is found in in West West Africa Africa among among thethe Yasgua Yasgua people people in in Nigeria. Nigeria. One One can can use use thethe thumb thumb to to count count from from one one to to twelve twelve onon thethe small small bones bones of of thethe four four fingers fingers of of thethe right right hand hand as as shown shown in in Figure Figure 8a.8a. For For each each twelve twelve that that is is counted, counted, wewe use use a digit a digit of of thethe leftleft hand, hand, which which means means that that thethe five five digits digits of of thethe leftleft hand hand count count altogether altogether as as 5 ×5 12 × 12 oror 60.60.Perhaps Perhaps this this scheme scheme is is thethe basic basic Pletser and Huylebrouck [115] describe a combination of a base twelve and tradition tradition that that gave gave rise rise to to thethe Babylonian Babylonian system system of of numeration numeration using using multiples multiples base sixty number system as is found in West Africa among the Yasgua people in of of sixty. sixty. Nigeria. One can use the thumb to count from one to twelve on the small bones The The Babylonians Babylonians used used vertically vertically aligned aligned wedge-shaped wedge-shaped cuneiform cuneiform tally tally marks marks of the four fingers of the right hand as shown in Figure 8a. For each twelve that is clay to to count count from from one through through nine.They They also also used used horizontally aligned ▽ in ▽ inclay counted, we use a digit ofone the left hand,nine. which means that thehorizontally five digits of aligned the left hand count altogether as 5 × 12 or 60. Perhaps this scheme is the basic tradition wedge-shaped wedge-shaped tally tally marks marks forfor tenten through through fifty. fifty. When When wewe write write 𝑛▽𝑛▽and and that gave rise to the Babylonian system of numeration using multiples of sixty. 𝑛 𝑛 , we , we mean mean 𝑛 successive 𝑛 successive ’s and 𝑛 successive 𝑛 successive ’s. ’s.For For example, example, thethe tally tally ▽’s▽and The Babylonians used vertically aligned wedge-shaped cuneiform tally marks marks 4 4 5 ▽5 in ▽ inFigure Figure 9a9a represent represent thethe integer integer forty-five forty-five because because it it is is four four in clay to count from one through nine. They also used horizontally aligned ▽marks tens tens and and five five ones. ones.ToTo represent represent integers integers at at least least as as large large as as 60,60, thethe BabyloniBabyloniwedge-shaped tally marks for ten through fifty. When we write 𝑛▽ and ▽’s.▽’s.ForForsuch and and such strings strings of of symbols, symbols, thethe symsymans ans alternated alternated groups groups of of ’s ’s 𝑛bols’ , we mean 𝑛 successive For example, the’stally ▽’s and 𝑛tosuccessive bols’ values values in in a grouping a grouping immediately immediately to thethe leftleft of of a given a’s.given grouping grouping of of and ’s and

▽ ▽

▽ ▽

▽ ▽

▽▽▽

▽ ▽

▽

▽

▽

marks 4 5 ▽ in Figure 9a represent the integer forty-five because it is four tens and five ones. To represent integers at least as large as 60, the Babyloni-

▽

ans alternated groups of ’s and ▽’s. For such strings of symbols, the symbols’ values in a grouping immediately to the left of a given grouping of ’s and

▽

▽

▽ ▽

▽

▽’s are 60 times their values in that given grouping. For example, the sequence 3▽ 4 5 ▽ as illustrated in Figure 9b can be grouped as (2 3 ▽ ) fol2 lowed by (4 5 ▽ ), which translates to twenty-three followed by forty-five; now

Base ten, base twenty, base eight, base two

15

multiply twenty-three by sixty and then add forty-five, resulting in 60 ⋅ 23 + 45 =

▽

▽

1425. Alternatively, 2 3▽ 4 5 ▽ is the integer 1425 because it is two sixhundreds plus three sixties plus four tens plus five ones. To avoid ambiguity in this system, space was inserted between pertinent symbols. For example, to rep-

▽▽ rather than 6

▽

resent the integer sixty-two, Babylonians wrote ▽

2▽ .

worth 10 worth 1 worth 600 worth 60

a. 45 = 4 ⋅ 10 + 5 ⋅ 1.

b. 1425 = 60(2 ⋅ 10 + 3 ⋅ 1) + (4 ⋅ 10 + 5 ⋅ 1).

Figure 9. Babylonian cuneiform tally marks.

Base ten, base twenty, base eight, base two The Babylonian place value system was simplified by the Arabs and Indians to a base ten system. As a few other traditions had done,9 they also implemented a marvelous digit to represent nothing at all, the zero. Instead of using tally marks for the integers one through nine, they used a single symbol or glyph for each grouping of 𝑛 tally marks, where 𝑛 is any of the digits 0, 1, 2, 3, 4, 5, 6, 7, 8, 9. These are set in place value positions to represent particular multiples of specific powers of ten. Thus the sequence 4083 is four thousands plus zero hundreds plus eight tens plus three ones. The reason we use base ten of course is because we have ten fingers. The Mayans of Central America, who may very well have shunned footwear for much of the year because of living in the tropics, used base twenty, making great use of all the digits on the hands and feet. Some cultures10 counted on the spaces between fingers, developing a base eight system. As notation to help keep track of the base being used when representing any given number, we make the following definition. 9 For example, the Babylonians introduced a place-holder symbol (a zero) around 350 bc. The Mayans developed a zero symbol around the year 350; see Exercise 6. Scholars have found an instance of a zero symbol in an old Indian document known as the Bakhshali manuscript. In 2017, carbon dating showed that this old birch-bark document, found in 1881 by a farmer in the Pakistani village of Bakhshali, dates to the third or fourth century. 10 The Yuki people of northern California and the Pame people of southern Mexico counted this way [7].

16

Chapter I: Tally Bones to the Integers

Definition 10: Base 𝑏 notation. Let 𝑛 and 𝑏 be nonnegative integers with 𝑏 > 1. In base 𝑏, the digits are the integers from 0 through 𝑏 − 1. We say that 𝑛 is represented in base 𝑏 by a string 𝑤 = 𝑎𝑘 𝑎𝑘−1 … 𝑎1 𝑎0 of 𝑘 + 1 digits if 𝑘

𝑛 = 𝑎0 + 𝑎1 𝑏 + 𝑎2 𝑏2 + ⋯ + 𝑎𝑘 𝑏𝑘 = ∑ 𝑎𝑖 𝑏𝑖 , 𝑖=0

where the 𝑎𝑖 ’s are digits in base 𝑏. To emphasize that a given string 𝑤 is the representation of an integer in base 𝑏, we write 𝑤𝑏 or (𝑤)𝑏 , where 𝑏 is represented in base ten (without any subscript). Furthermore, when 𝑏 is ten, 𝑤10 is usually written as 𝑤. Example 11: A base riddle: Why are Christmas and Halloween the same? To illustrate Definition 10, let 𝑛 be twenty-five. Then 𝑛 can be represented as (31)8 , an octal representation, because (31)8 = 3 ⋅ 8 + 1 = 25. In base ten, twenty-five is (25)10 = 25, a decimal representation. Using these representations in Oct(318 ) and Dec(25) and interpreting them as calendar days in the months of October and December gives the tongue-in-cheek observation that Halloween and Christmas are the same. ♢ When we write 𝑐 = 𝑑, such as (31)8 = 25, we mean that 𝑐 and 𝑑 refer to the same number. As another example of writing a number in a different base, the decimal integer 𝑚 with base five representation (3104)5 is 𝑚 = (3104)5 = 3 ⋅ 53 + 1 ⋅ 52 + 0 ⋅ 51 + 4 ⋅ 50 = 375 + 25 + 0 + 4 = 404. The simplest base of all is base two, or binary. It is easily adapted to hard-wiring a computer because computers are designed to be able to check any particular location in their memory as being either open or closed, on or off. With the digits in binary being 0 and 1, the string (1011001)2 6

5

4

represents 1 ⋅ 2 + 0 ⋅ 2 + 1 ⋅ 2 + 1 ⋅ 23 + 0 ⋅ 22 + 0 ⋅ 21 + 1 ⋅ 20 = 89.

A binary digit interlude As a practical application, which we revisit in Strand V, of binary digits—the use of two symbols to record information—we consider the firing sequence of an internal combustion diesel train engine of twelve cylinders. Figure 10 shows two banks of six cylinders, the right side and the left side, separated by an exhaust manifold. The exact order in which the twelve cylinders fire is somewhat arbitrary.11 For simplicity, we imagine the sequence is 𝐿1 𝑅1 𝐿2 𝑅2 … 𝐿6 𝑅6 . 11 In actual engines, successive cylinders in a firing sequence are on opposite sides of the manifold and successive cylinders on the same side are rarely adjacent. Thus a reasonable firing sequence might be, as in a Ferrari sports car, 𝐿1 𝑅6 𝐿5 𝑅2 𝐿3 𝑅4 𝐿6 𝑅1 𝐿2 𝑅5 𝐿4 𝑅3 .

A binary digit interlude

17

L6

L4 L3

Exhaust Manifold

L5

R6 R5 R4 R3

L2

R2

L1

R1

Figure 10. An internal combustion engine with twelve cylinders. When a cylinder is in a potential firing state we can engineer it either to fire or to skip. After each of its twelve cylinders has either fired or skipped a firing, we say that it has passed through one cycle. Suppose that, to save fuel when idling or coasting downhill, during each cycle we want only some of the cylinders to fire. In particular, how can we engineer a solution best satisfying the following ideal features? • Each cylinder fires in exactly fifty percent of the cycles. We want each cylinder to wear uniformly. • No cylinder skips more than twice in succession. Successive cylinder skips cause the cylinder to grow cold, and cold cylinders produce unwanted smoke when fired. • Exactly six cylinders fire during each cycle. We want to preserve power. • Exactly six cylinders change their state—from firing to missing or from missing to firing—from one cycle to the next. We want a smooth carryover between cycles in the engine so that it purrs rather than barks. We say that a bit-string is a finite sequence of zeroes and ones. Any bit-string of length twelve is a firing seqence. For example 𝑤1 = 101010101010 is the sequence fire-skip-fire-skip-fire-skip-fire-skip-fire-skip-fire-skip. A first approach to a solution is to alternate directly between 𝑤1 and 010101010101. Such a solution exhibits the first three features, but not the last because all twelve cylinders change their state from one cycle to the next. Finding a solution that

18

Chapter I: Tally Bones to the Integers

satisfies all four features is a challenge. Since there are 924 bit-strings of length twelve consisting of six 1’s and six 0’s,12 a brute-force approach to finding a solution of 𝑘 different cycles between 𝑤1 and 010101010101 involves searching through a space of 924𝑘 possibilities, an astronomically large number if 𝑘 is large, which is yet larger if we seek an approximate solution. However, after some experimentation, we find an approximate solution consisting of four cycles 𝑤1 , 𝑤2 , 𝑤3 , 𝑤4 shown in the first two columns of Table 1. Let 𝑤 = 𝑤1 𝑤2 𝑤3 𝑤4 . Table 1. A fifty percent firing solution. firing sequence 𝑤1 𝑤2 𝑤3 𝑤4

binary decimal 101010101010 2730 101100110110 2870 010101010101 1365 010011001001 1225

hexadecimal 0AAA 0B36 555 4C9

base thirty-six 23U 27Q 11X 0Y1

As the reader may verify, over the firing sequence 𝑤, each cylinder fires twice (out of four cycles). No cylinder skips more than twice in succession. After startup, each cylinder repeats the pattern fire-fire-skip-skip. However, the number of cylinders firing during each cycle repeats the pattern 6-7-6-5, for an average of six. Also, the number of cylinders changing their states from one cycle to the next repeats the pattern 5-7-5-7, for an average of six. See Exercise 5 for a solution satisfying all four properties. Up to this point, any firing sequence of zeroes and ones has no numeric significance. But suppose we wish to remember a particular firing sequence. Rather than memorizing a sequence of twelve digits, we can interpret the firing sequence as a binary number 𝑚 and then write 𝑚 using a base greater than two, hence reducing the number of digits needed to record the firing sequence information. For example, the third, fourth, and fifth columns of Table 1 give the firing sequence in decimal, hexadecimal, and base thirty-six. In hexadecimal, we need sixteen digits, six more than in decimal, so as to represent ten through fifteen. By custom, we use the letters A through F to do so. Furthermore, by custom, when a hexadecimal number starts with a letter, to emphasize that it is a number we preface it with the digit 0. Thus the integer 0ABC when rendered into decimal is (0ABC)16 = (A)16 ⋅ 162 + (B)16 ⋅ 161 + (C)16 ⋅ 160 = 10 ⋅ 256 + 11 ⋅ 16 + 12 = 2748. As can be seen, it is easier to remember, say, 0B36 in hexadecimal rather than 101100110110, yet both representations contain the same information. To represent an integer in base thirty-six, we use all the letters of the alphabet. Thus Z 12 The

number 924 is the binomial coefficient (12) as described in Proposition VI.16. 6

Solving the shepherd’s puzzle and beyond

19

has value thirty-five. Special care must be exercised to distinguish between the digit zero and the letter O and between the digit 1 and the letter I.

Solving the shepherd’s puzzle and beyond In this section we solve Puzzle 6 and a related problem about the number of positive integers less than a given positive integer 𝑛 that have no divisors in common with 𝑛. To answer these questions, we need to know how to factor any composite integer as a product of its prime divisors. We first note that the number of different primes we may encounter is unbounded. Euclid proved this result in Proposition 20 of Book IX of the Elements. He did so by assuming that only a finite number of primes exist and then reasoning to reach a contradiction. We paraphrase his argument in the following way. Proposition 12: An infinitude of primes.13 There exist an infinite number of primes. Proof. Suppose that the primes constitute a finite set 𝒫 of integers, 𝒫 = {𝑝1 , 𝑝2 , … , 𝑝𝑛 }. Let 𝑁 = 1 + 𝑝1 𝑝2 ⋯ 𝑝𝑛 . Since 𝑁 is larger than each of the primes 𝑝𝑖 , 𝑁 is not in 𝒫. So 𝑁 is not prime. Thus for some 𝑗, 1 ≤ 𝑗 ≤ 𝑛, prime 𝑝𝑗 divides 𝑁. But then 𝑝𝑗 must also divide 1 = 𝑁 − 𝑝1 𝑝2 ⋯ 𝑝𝑛 , a contradiction. Therefore, there are an infinite number of primes. Beginning with this result, we eventually obtain the following proposition, a proof of which appears in the next chapter. Proposition 13: The fundamental theorem of arithmetic. Every integer larger than 1 can be written as a product of powers of prime integers, and this representation is unique up to the order of the prime powers.14 Proposition 13 can be used to answer Puzzle 6. Example 14: A solution to the shepherd’s puzzle. By the fundamental theo𝑛 𝑛 𝑛 rem of arithmetic, let 𝑛 = 𝑝1 1 𝑝2 2 ⋯ 𝑝𝑘 𝑘 , where 𝑝𝑘 are distinct primes and 𝑛𝑖 are positive integers, 1 ≤ 𝑖 ≤ 𝑘 for some positive integer 𝑘. We wish to decompose n into the product 𝑛 = 𝐴 ⋅ 𝐵, where 𝐴 and 𝐵 are positive integers and 𝐴 ≤ 𝐵. For the moment we drop this last requirement. By the fundamental theorem, 𝐴 𝑚 𝑚 𝑚 must be written as 𝐴 = 𝑝1 1 𝑝2 2 ⋯ 𝑝𝑘 𝑘 for integers 𝑚𝑖 , where, to ensure that 𝐴 13 Technically, this proposition shows that there are an infinite number of positive irreducible integers. See footnote 6 on p. 10. 14 See Code 1 in Appendix III for how to use a computer algebra system to illustrate both this theorem and Proposition 17.

20

Chapter I: Tally Bones to the Integers

divides 𝑛, we have 0 ≤ 𝑚𝑖 ≤ 𝑛𝑖 . The number 𝑄 of distinct ways that the 𝑚𝑖 can be selected is 𝑄 = (𝑛1 + 1)(𝑛2 + 1) ⋯ (𝑛𝑘 + 1). For any such integer 𝐴, we have 𝐵 = 𝑛/𝐴. However, by symmetry there are as many decompositions with 𝐴 < 𝐵 as there are decompositions with 𝐴 > 𝐵. There is exactly one decomposition when 𝐴 = 𝐵, in which case 𝑛 is a perfect square, in which case 𝐴 = √𝑛 = 𝐵. So reinstating the requirement 𝐴 ≤ 𝐵 means that when 𝑛 is a perfect square, 𝑛 pebbles can be arranged into 𝐴 rows of 𝐵 pebbles (𝑄 + 1)/2 ways. When 𝑛 is not a perfect square, 𝑛 can be arranged into 𝐴 rows of 𝐵 pebbles 𝑄/2 ways. ♢ Example 15: The shepherd’s puzzle for 𝑛 = 360. By the fundamental theorem, 𝑛 = 23 32 51 = 360. So 𝑄 = 4 ⋅ 3 ⋅ 2 = 24, where 𝑄 is defined in Example 14. Thus, by our general solution to the shepherd’s puzzle, 360 pebbles can be arranged as 𝑄/2 = 12 different rectangular arrays. The list of possible arrangements of 360 pebbles into rectangular arrays is 1×360, 2×180, 3×120, 4×90, 5×72, 6×60, 8×45, 9×40, 10×36, 12×30, 15×24, 18×20. ♢

As another implication of the fundamental theorem, Leonhard Euler (1707– 1783) was able to answer a related puzzle. To take an example, how many positive integers less than or equal to twelve have no divisors greater than 1 in common with twelve? The answer is four, because 2, 3, 4, 6, 8, 9, 10, and 12 all have divisors greater than 1 in common with 12, whereas 1, 5, 7, and 11 do not. To pose this puzzle in general terms, we make the following definition. Definition 16: The Euler phi function. For any positive integer 𝑛, let 𝜙(𝑛) be the number of positive integers less than or equal to 𝑛 that have no divisors larger than 1 in common with 𝑛. The symbol 𝜙 is the Euler phi function, also known as the totient function. Here is Euler’s amazing discovery. Proposition 17: How to calculate 𝜙(𝑛). Note that 𝜙(1) = 1. Let 𝑛 be an integer 𝑛

𝑘

𝑛

𝑛

𝑛

with 𝑛 ≥ 2. With 𝑛 = 𝑝1 1 𝑝2 2 ⋯ 𝑝𝑘 𝑘 = Π 𝑝𝑖 𝑖, where 𝑝𝑖 are distinct primes and 𝑖=1

𝑛𝑖 are positive integers, 1 ≤ 𝑖 ≤ 𝑘 for some positive integer 𝑘, 𝑘

𝑘

𝑘

1 ). (2) 𝑝 𝑖 𝑖=1 𝑖=1 𝑖=1 Almost a proof. Observe that 𝜙(𝑝) = 𝑝 − 1 when 𝑝 is a prime integer. To find 𝜙(𝑝2 ), note that the only integers less than or equal to 𝑝2 that have divisors greater than 1 with 𝑝2 are the 𝑝 integers in the arithmetic sequence 𝑛

𝜙(𝑛) = 𝜙( Π 𝑝𝑖 𝑖 ) =

𝑛

Π 𝜙(𝑝𝑖 𝑖 ) =

𝑝, 2𝑝, 3𝑝 , ⋯ , 𝑝 ⋅ 𝑝, 1

which means that 𝜙(𝑝2 ) = 𝑝2 − 𝑝 = 𝑝2 (1 − ). 𝑝

𝑛

Π 𝑝𝑖 𝑖 (1 −

Three parting puzzles

21

To generalize, consider 𝜙(𝑝𝑛 ) with 𝑛 ≥ 1. The only integers less than or equal to 𝑝𝑛 that share divisors greater than 1 with 𝑝𝑛 are the 𝑝𝑛−1 integers in the sequence 𝑝, 2𝑝, … , 𝑝𝑛−1 𝑝, 1

which means that 𝜙(𝑝𝑛 ) = 𝑝𝑛 − 𝑝𝑛−1 = 𝑝𝑛 (1 − ). 𝑝

Now consider the integer 𝑛 = 𝑝𝑞 where 𝑝 and 𝑞 are different primes. Observe that the only integers having a divisor greater than 1 in common with 𝑛 are those having a factor of 𝑝 or 𝑞: 𝑝, 2𝑝, 3𝑝, … , 𝑞𝑝

and 𝑞, 2𝑞, 3𝑞, … , 𝑝𝑞.

These lists contain exactly 𝑝 + 𝑞 − 1 distinct integers because 𝑞𝑝 = 𝑝𝑞. Thus 1 1 𝜙(𝑝𝑞) = 𝑝𝑞 − (𝑝 + 𝑞 − 1) = (𝑝 − 1)(𝑞 − 1) = 𝑝𝑞(1 − )(1 − ). 𝑝 𝑞 To generalize, consider 𝜙(𝑝𝑛 𝑞𝑚 ), where 𝑛 ≥ 1 and 𝑚 ≥ 1. Every factor of 𝑝𝑛 𝑞𝑚 that is greater than 1 must have either the form 𝑘𝑝 or the form 𝑘𝑞 where 𝑘 is also a factor of 𝑝𝑛 𝑞𝑚 . As before, there are 𝑝𝑛−1 𝑞𝑚 relevant factors of the form 𝑘𝑝 and 𝑝𝑛 𝑞𝑚−1 factors of 𝑝𝑛 𝑞𝑚 of the form 𝑘𝑞. A factor is counted in both lists if and only if it has the form 𝑘𝑝𝑞. There are 𝑝𝑛−1 𝑞𝑚−1 such factors. Thus 𝜙(𝑝𝑛 𝑞𝑚 ) = 𝑝𝑛 𝑞𝑚 − 𝑝𝑛−1 𝑞𝑚 − 𝑝𝑛 𝑞𝑚−1 + 𝑝𝑛−1 𝑞𝑚−1 1 1 = 𝑝𝑛 (1 − )𝑞𝑚 (1 − ) = 𝜙(𝑝𝑛 )𝜙(𝑞𝑚 ). 𝑝 𝑞 In Chapter III, we will generalize to 𝑘 distinct prime factors to obtain Equation (2). ♢ Example 18: Calculating 𝜙(90). By the fundamental theorem of arithmetic, 90 = 2 ⋅ 32 ⋅ 5. Observe that 𝜙(2) = 1, 𝜙(32 ) = 6, and 𝜙(5) = 4. Thus Equation (2) yields 𝜙(90) = 24. To check this result, here are the 24 integers less than 90 having no divisors of 2, 3, or 5: {1, 7, 11, 13, 17, 19, 23, 29, 31, 37, 41, 43, 47, 49, 53, 59, 61, 67, 71, 73, 77, 79, 83, 89}. ♢

Three parting puzzles We close this chapter with three puzzles. The first one is a segue into Chapter II. The second is an algebra review and a look ahead to continued fractions. The third is a variation on discovering an Ishango bone. Puzzle 19: A geometric sequence? Consider this pattern: Start with a circle and mark 𝑛 points in general position around its circumference. Now connect

22

Chapter I: Tally Bones to the Integers

3

1

1

2

a. 1 point, 1 region. 5

1 3

7

6 2

12

d. 4 points, 8 regions.

1 2

b. 2 points, 2 regions. c. 3 points, 4 regions. 11 2 10 1 5 4 3 8 7 9 6

4 8

4

15

13

14 16

e. 5 points, 16 regions.

f. 6 points, ? regions.

Figure 11. How many regions for 𝑛 = 6 points?

each pair of points with a line segment. Into how many regions has the circle been partitioned? By general position we mean that the points have been chosen so that no three line segments coincide at a point. When 𝑛 ranges from 1 to 5, we see the pattern of 1, 2, 4, 8, 16 regions as explicitly enumerated in Figure 11. How many partitioned regions appear when 𝑛 = 6, as shown in Figure 11f? Is it 32? A formula giving the answer for 𝑛 points appears in Chapter II, Equation (II.12). ♢ Puzzle 20: A missing digit. In the following continued fraction expression, 𝑥 is a digit from the set {0, 1, 2, 3, 4, 5, 6, 7, 8, 9}. Thus the term 3𝑥8 means 308+10𝑥. Find the missing digit. 3𝑥8 1 ♢ =3+ 1 . 101 4+ 𝑥5

Puzzle 21: Another Ishango bone. Aliens have stumbled across the markings of Figure 12 on an Earth space probe far from our solar system. What information might they infer about it? (Hint: It has something to do with primeness and the complex numbers. Extra hint: See Exercise 10.) ♢

Exercises

23

Figure 12. A different kind of Ishango bone. Exercises 1. For the Ishango bone, the numbers in rows 𝐴 and 𝐵 are all odd integers. Find a probability similar to that of Puzzle 7: What is the probability that when randomly selecting eight integers (not necessarily distinct or in order) from the integers 1 to 30, all eight are odd? 2. Pletser and Huylebrouck [115] suggest that the notches on the Ishango bone are a number game. They ask: How may each of the numbers in rows 𝐴 and 𝐵 be obtained by taking a sum 𝑆 of at most three successive numbers in row 𝐶 subject to the following special rule? Special rule: If either one number 𝑎 from row 𝐶 or two successive numbers 𝑎 and 𝑏 from row 𝐶 are selected to form a sum 𝑆, then 𝑆 may be taken as any of 𝑎 + 1, 𝑎 + 2, 𝑎 + 𝑏, 𝑎 + 𝑏 + 1, or 𝑎 + 𝑏 + 2, provided 𝑆 is odd. Pletser and Huylebrouck also assume that the number 10 in row 𝐶 of Figure 4 is the number 9. To illustrate the play of the game, take the number 13 in row 𝐴. To win we must find up to three numbers 𝑎, 𝑏, and 𝑐 in row 𝐶 such that 𝑆 = 13. One solution is when 𝑎, 𝑏, and 𝑐 are 3, 4, and 6. Furthermore, the numbers 4, 8, and 9 in row 𝐶 yield 𝑆 = 21, a number appearing in row 𝐵. With 𝑎 and 𝑏 as 3 and 6, 𝑆 = 3 + 6 + 2 = 11, whereas for 6 and 4, 𝑆 = 6 + 4 + 1 = 11. With 𝑎 = 7, 𝑆 = 7 + 2 = 9.

24

Chapter I: Tally Bones to the Integers

worth 20(1) worth 20(5) worth 1 worth 5

a. 2 ⋅ 5 + 3 ⋅ 1 = 13.

b. 400 ⋅ 16 + 20 ⋅ 7 + 13 = 6553.

Figure 13. Mayan counting with ones, fives, and twenties. With respect to these rules, what natural sums of numbers taken from row 𝐶 fail to yield a number as given in rows 𝐴 and 𝐵? 3. Beyond the two solutions given to Puzzle 8, are there other ways to partition the camels so that each wife receives the same amount of cloth?

▽

▽

4. (a) What base ten number corresponds to the number 3 2▽ 5 4▽ ? (b) With the two wedge symbols of the Babylonians, how might a scribe unambiguously represent the number 601? (c) From the Babylonians we have inherited their tradition of 360∘ in one revolution. Offer an explanation as to why that tradition came to be. (d) Rewrite the binary number (1011011)2 in base three. (e) What decimal numbers correspond to the base thirty-six numbers 0R2D2 and 0C3PO, the names of two droids in the Star Wars films? (f) Interpret your first name as a base thirty-six number. Rewrite this number in base ten. 5. (a) To construct a solution cycle satisfying all four properties outlined on p. 17, we consider a block of four cylinders. To transition from 𝑎 = 1010 to 𝑏 = 0101 and back again, let 𝑥 = 1001 and 𝑦 = 0110, and consider the sequence 𝑎 → 𝑥 → 𝑏 → 𝑦 → 𝑎. Use this idea to find a solution to the original problem of twelve cylinders. (b) Find a different solution from the one outlined in part (a). 6. The Mayan system of enumeration was much like the Babylonian one. They used a circular dot for one and a horizontal line for five. Instead of interleaving their two shapes to the left to account for higher place values, the Mayans stacked them atop each other. Instead of using base sixty, the Mayans used base twenty, so that a dot or line at a level immediately above a given level has value 20 times its value at that given level. They also invented a symbol for •

zero:

. The Mayan integer twenty was written

, whereas their

Exercises

25

Figure 14. A page of the thirteenth-century Grolier Codex, author sketch. The codex is on display at the National Museum of Anthropology, Mexico City. ••

integer twenty-one was

• •.

The Mayan forty was

and one hundred was

. Using these clues and the examples15 given in Figure 13, try these exercises. (a) Write the numbers one through forty in Mayan. (b) Write the integer 2017 in Mayan. (c) On a Mayan stela, we see a number 𝑁 rendered as

••• • •••• What is the value of 𝑁? (d) Figure 14 is a page from an old Mayan book called a codex. What Mayan numbers can you recognize? 15 We have over-simplified the Mayan number system in the text. To fit the calendar better the Mayans used 18 ⋅ 20 = 360 rather than 400 in their third level of numeration. Thus the Mayan number in Figure 13 actually represents 360 ⋅ 16 + 20 ⋅ 7 + 13 = 5913 rather than 6553.

26

Chapter I: Tally Bones to the Integers

7. (a) To which group of people has history more or less awarded the honor of being the first to think of primes? (b) Implement the sieve of Eratosthenes to identify all the primes between 1 and 200. That is, start with a row of the integers 1 through 200, and cross off 1 because 1 is not prime by definition. Then circle 2, and cross off every second entry in the list, namely, 4, 6, 8, …. Circle the first number on the left which is neither circled nor crossed off, namely 3, and cross off every third entry in the list (which may very well include entries already crossed off). Repeat this process. 8. (a) Factor 4840 in accordance with the fundamental theorem of arithmetic. (b) Calculate 𝜙(4840) using Proposition 17. (c) Find an integer 𝑛 where 𝜙(𝑛) = 84. 9. In Puzzle 19, the sequence for the number of regions in the pattern starts 1, 2, 4, 8, 16. What are the next few terms in this pattern? 10. Let16 ℤ[√10] be the set of all numbers of the form 𝑎 + 𝑏√10 where 𝑎 and 𝑏 are integers. Let ℤ[𝑖], the Gaussian integers17 , be the set of all numbers of the form 𝑎 + 𝑏𝑖 where 𝑎 and 𝑏 are integers and 𝑖 = √−1. An element 𝑢 in a number system18 is called a unit if there is another element 𝑣 with 𝑢𝑣 = 1. In ℤ the only units are ±1. In the set of rational numbers, every nonzero number is a unit. More formally, a nonzero element 𝑝 in a number system is prime if 𝑝 is not a unit and whenever 𝑝 divides 𝑚𝑛, either 𝑝 divides 𝑚 or 𝑝 divides 𝑛. A nonzero element 𝑝 is irreducible if 𝑝 is not a unit and whenever 𝑝 = 𝑎𝑏, either 𝑎 = 𝑢𝑝 for some unit 𝑢 or 𝑎 is a unit. (a) In addition to ±1, show that 3 + √10 and 19 + 6√10 are also units in ℤ[√10]. (b) ∗ Characterize the entire set of units in ℤ[√10]. Hint: The answer in𝑝 volves all the continued fraction convergents of the regular continued 𝑞

fraction algorithm for √10, where regular is defined in Definition IX.21, p. 322. (c) Show that the only units in ℤ[𝑖] are ±1 and ±𝑖. 16 This problem is deceptively deep, with links to material in Chapter IX. The items marked with an asterisk may be more challenging to prove than others. Hints for some of these items are provided here and also in Appendix IV. As you work your way through this book, you may wish to return to this exercise from time to time. For a fairly complete analysis of ℤ[√10 ], see [145]. 17 The Gaussian integers are named after Carl Friedrich Gauss (1777–1855) and share many, but not all, properties of the ordinary integers ℤ. See Example II.25 for the characterization of the set of all Gaussian primes. 18 By a number system we mean a ring, a structure studied at length in Abstract Algebra, a course often taken by undergraduate mathematics majors in their third year.

Exercises

27

(d) ∗ Show that the number 2 is irreducible in ℤ[√10]. That is, show that 2 is neither a unit nor expressible as a product 2 = (𝑎 + 𝑏√10)(𝑐 + 𝑑√10) unless one factor is ±2 and the other factor is ±1, where 𝑎, 𝑏, 𝑐, and 𝑑 are integers. See Appendix IV for a few key ideas. (e) ∗ Similarly, show that 3 and 2 + √10 are irreducible in ℤ[√10]. However, show that although 31 is prime in ℤ it is not irreducible in ℤ[√10]. Show that 7 is the least positive prime in ℤ that is also prime in ℤ[√10 ]. (f) Observe that 2⋅3 = (2+ √10)(−2+ √10), so 2 divides the product on the right-hand side. Show that it is impossible for 2 to divide either 2 + √10 or −2 + √10 in ℤ[√10]. Therefore 2 is not prime in ℤ[√10]. (g) In ℤ[𝑖], 5 = (1 + 2𝑖)(1 − 2𝑖). Thus 5 is neither a Gaussian prime nor a Gaussian irreducible. Find a prime in ℤ that is also prime in ℤ[𝑖]. ∗ (h) Solve Puzzle 21 with the extra hint that it has something to do with Gaussian primes.

Strand II: Leibniz and the Binary Revolution Ours is said to be an age of the computer. The person who foreshadowed this future, the one who popularized the idea of a machine consisting of on/off switches within a milieu of a formal logical language, was Gottfried Leibniz (1646–1716). Besides promoting these ideas, Leibniz is also credited with being a joint discoverer of calculus with Isaac Newton. It is said that he was the last to know almost everything that was known about almost anything. Throughout his long life, he traveled extensively and maintained a vibrant, voluminous correspondence with savants, theologians, statesmen, and friends. In this strand, we sketch his life.

Figure 1. Leibniz as a young man, engraving by Johann Heinrich Lips [62]. 29

30

Strand II: Leibniz and the Binary Revolution

Leibniz’s father, a philosophy professor at the University of Leipzig, maintained a large private library from which he often read selections to a very young son. Unfortunately, his father died when Gottfried was six. But Gottfried was allowed continued access to this library, reading texts in Latin and Greek. By the time he was ready for the university, he had already read the masters in philosophy and letters, and could engage his professors as a veritable equal. He earned a degree in philosophy at age sixteen, a degree in law at nineteen, and a doctoral degree in law at twenty. So extensive was Leibniz’s approach to reading that he became an expert in numerous fields: Philosophy, mathematics, astronomy, physics, chemistry, geology, botany, psychology, medicine, natural history, jurisprudence, ethics, political science, history, antiquities, languages (German, European, Chinese), linguistics, etymology, philology, poetry, theology, church reunification, diplomacy, technology, structure of scientific societies, libraries, the book trade. [4, p. 2] His resultant writings, as republished and published-for-the-first-time by the Berlin Academy of Sciences, “will eventually extend to one hundred twenty large quarto volumes” [4, p. 2]. As he described himself: So many thoughts occur to me in the morning during an hour in which I am still in bed, that it takes me all morning, and sometimes all day and more, to write them down. [4, p. vi]

Figure 2. A multiplication machine. Image courtesy of the Gottfried Wilhelm Leibniz Bibliothek. What motivated Leibniz?

Strand II: Leibniz and the Binary Revolution

31

Gottfried was born in Leipzig in 1646, two years before the end of the Thirty Years’ War, a religious conflict that devastated central Europe for an entire generation. Leibniz had been raised as a Lutheran. In addition, he had freely read the gamut of extant theological ideas in his father’s library. He believed that if people could truly communicate on conflicting issues, genuine agreement could be found. Thus, he championed the idea of a natural language in which ideas could be formulated without misunderstanding. A small-scale setting for developing such a language is mathematics, and he pioneered formal logic—primarily in the hope that this language could grow to encompass diplomacy and theology, which in turn might lead to the abolition of war and, in particular, religious strife. He spent much of his life trying to harmonize what was truly substantive in Protestant and Catholic perspectives globally and, more locally, in Lutheran and Calvinist perspectives.

Figure 3. The first binary numbers. Image courtesy of the Gottfried Wilhelm Leibniz Bibliothek. As an example of Leibniz’s ideas on harmonizing Christendom, he advocated a balance of power strategy. For instance, Leibniz thought that Louis XIV needed to be kept in check lest another Thirty Years’ War break out afresh. He advanced the idea that if Louis really wanted war—and the commensurate potential material gain and dubious prestige of such activity—then instead of devastating Europe, why not conquer Egypt,1 which at that time had been controlled for several 1 One hundred and twenty some years after Leibniz suggested this idea, Napoleon did just that in 1798–1801. In addition to an army, Napoleon brought along a team of scientists and mathematicians

32

Strand II: Leibniz and the Binary Revolution

hundred years by the Mamelukes, a caste of Muslim warrior-slave-rulers continuously refurbished by child-trafficking in eastern Europe and western Asia. In part to seek audience with Louis’s advisors so as to promote this foreign policy, Leibniz traveled to Paris. Concurrently, he had become intrigued with Blaise Pascal’s addition machine. Leibniz improved Pascal’s machine to be also a multiplication machine, as illustrated in Figure 2. He demonstrated this device at the French Academy of Sciences and in London at the Royal Academy. The Leibniz Society has recreated a working model of his machine wherein the computations are performed in binary with steel bearings. The idea of the binary number system originated in reports sent to Leibniz from China by another traveler, Jesuit scholar-missionary Joachim Bouvet, concerning the ancient Chinese text, the I Ching. In this text the yin, the zero, was represented as a broken line ( ) and the yang, the one, as a solid line ( ). When these symbols are stacked, lower layers correspond to higher multiples of two.2 Thus, two is represented as , three as , four as , and so on, as indicated in Figure 3. When young Leibniz first arrived in Paris in 1672, he made it a point to meet Christiaan Huygens, the reigning mathematics authority on the continent, and promptly explained to him why 𝑛2 is the sum of the first 𝑛 odd positive integers. Perhaps he reasoned in the following way. Start by writing the sum 𝑆𝑛 of the first 𝑛 odd integers as 𝑆𝑛 = 1 + 3 + ⋯ + (2𝑛 − 1), where 2𝑛 − 1 is the 𝑛th odd integer. Now imagine that 𝑛 is even. (In this case, 𝑛 𝑛 the middle two odd integers are 2( ) − 1 and 2( + 1) − 1.) These 𝑛 integers can 2 2 be paired: first and last, second and penultimate, and so on. After doing so we are left with 𝑛/2 pairs, each of which sums to 2𝑛, 𝑆𝑛 =(1 + (2𝑛 − 1)) + (3 + (2𝑛 − 3)) 𝑛 𝑛 𝑛 + ⋯ + ((2( ) − 1) + (2( + 1) − 1)) = ( )(2𝑛) = 𝑛2 . 2 2 2 We leave the other case, finding 𝑆𝑛 when 𝑛 is odd, to the reader. In the ensuing chapter, we show another way to obtain this result. After listening to Leibniz’s explanation about 𝑆𝑛 , Huygens posed a related, more challenging problem. Define 𝑇𝑛 , the 𝑛th triangular number, to be the sum to study the feasibility of a Suez canal. In attempting to fulfill that commission, the team uncovered the wonders of ancient Egypt. Joseph Fourier, a leader of this scientific expedition, assembled an expeditionary report, the final volume of which was printed in 1829. Figures III.1, VII.1, VII.8, and IX.1 of this book are from that report. 2 Contrast this stacking with the Mayan system in which higher layers of symbol groupings correspond to higher values.

Strand II: Leibniz and the Binary Revolution

33

Figure 4. The first integral signs. Image courtesy of the Gottfried Wilhelm Leibniz Bibliothek. of the first 𝑛 positive integers.3 Huygens’s problem was to find the sum of the reciprocals of all triangular numbers. Within a week, Leibniz had derived the identity4 ∞

1 1 1 1 1 1 1 1 1 + ⋯ = 2((1 − ) + ( − ) + ( − ) + ⋯) = 2. (1) = 1+ + + 𝑇 3 6 10 2 2 3 3 4 𝑛=1 𝑛 ∑

Within three years, after more mentoring from Huygens, Leibniz discovered the calculus. Figure 4 shows perhaps the first instance of an integral sign from an old Leibniz manuscript. When Leibniz formally wrote the calculus details nine years later, he communicated some of these pertinent papers to the French Academy through Nicholas Malebranche, a scholar and priest whom he himself had mentored, and who in turn recruited the Bernoulli brothers (Jacob and Johann) to make greater sense of these early discoveries.5 Leibniz was ever active in bringing people together. He belonged to both the Royal Society and the French Academy of Sciences. He established the Berlin Academy and served as its president, and he advised Peter the Great in person, which in due course led to the creation of the Russian Academy. Figure 5 is a map showing Leibniz’s wanderings over the years to professional societies, libraries, and courts. For the last forty-one years of his life, Leibniz served as librarian to a succession of three dukes at Hanover. Each new duke had his own priorities. As Leibniz described it: It bothers me that I am not in a great city like Paris or London, where there are plenty of learned men from whom one can benefit and receive assistance. Here [in Hanover] one scarcely finds anyone to talk to; it is not regarded as appropriate for a courtier [like me] to speak of learned matters. [4, p. 196] 3 With

regard to its name, 𝑇𝑛 equals the number of dots in the triangular array of 𝑛 rows, where • row 1 contains one dot, row 2 contains two dots, and so on. Thus 𝑇1 counts •, 𝑇2 counts • • , and 𝑇3 • counts • •• •• . 4 In Chapter II, we will “reprove” Equation (1), this time using mathematical induction. 5 In Chapter VIII, we will consider one early application of the integral and show how Isaac Newton was able to predict that Earth was flattened at its poles by about 17 miles.

34

Strand II: Leibniz and the Binary Revolution

Figure 5. Leibniz’s travel itinerary. Duke 1’s primary focus for Leibniz was to establish a legal library. Duke 2’s focus was on improving silver mining operations, the primary revenue source for the duchy. And Duke 3’s focus was on documenting the lineage of the House of Hanover to its origins in the old Roman Empire. In due course, Duke 3 became an elector in the Holy Roman Empire, and ultimately was chosen as England’s sovereign George I, the successor to Queen Anne. For each of these transitions of power and vision in a new duke, Leibniz needed to reinvent himself and make himself useful. Imagine returning from researching a genealogical lead in some remote corner of the Empire to find that all the library’s books had been crated and stored in an attic. What would you do? Yet he remained in Hanover, creative and productive until the end.

A continued fraction connection Leibniz’s first great discovery was the series 𝜋 1 1 1 = 1 − + − + ⋯, 4 3 5 7 an honor he shares with two other independent discoverers [127]: James Gregory (1638–1675) and an Indian mathematician Kerala Gargya Nilakantha (circa

A continued fraction connection

35

1450–1550). In 1776, Leonhard Euler discovered that this series is equivalent to the strange-looking continued fraction [16, p. 82] 𝜋 1 = . 4 12 1+ 32 2+ 52 2+ 72 2+ 2+ ⋱ That is, 1 −

1 2 = = 3 3

1 12 1+ 2

,1−

13 1 1 + = = 3 5 15

1 1+

, and so on. We give

12 2+

32 2

a proof in Chapter IX. For a fun Leibniz continued fraction see Puzzle 5 of Chapter II.

Chapter II: Mathematical Induction The unit integer 1 is the least positive integer, and every positive integer 𝑛 has a unique and larger successor 𝑛 + 1. From any given positive integer, we can proceed to its successor, and so on forever. This archetype of inductive reasoning and construction leads to a fundamental idea and tool called mathematical induction. It is widely used for proving sequences of appropriately related mathematical statements, a practice going back to at least the days of Euclid in the third century bc. In this chapter, we review some set notation, pose the well-ordering principle for the set of positive integers, formalize mathematical induction, rework three examples from Chapter I and Strand II in this context, and introduce the notion of equivalence relations. In Example 4, we show how the division algorithm, a consequence of the well-ordering principle, allows us to generate a finite simple continued fraction representation for any given fraction. As an example that synthesizes the topics from the previous chapter and Strand II with the ideas of this chapter and introduces the Chapter IV graph-theoretical notion of a tree, we showcase the mathematical game of nim. Analyzing nim involves applications of mathematical induction, binary arithmetic, and equivalence relations. It also provides an introduction to an alternative way to define the set of fractions, an idea we sketch in Exercise IV.10. Lastly, we feature a nim version of mancala, perhaps the world’s oldest board game.

Set notation and the well-ordering principle Definition 1: Some set notation. A set is a collection of elements. We leave the terms collection and element undefined. The notation 𝑎 ∈ 𝐴 means that 𝑎 is an element of the set 𝐴. We let ℕ denote the set of natural numbers, ℕ = {0, 1, 2, 3, 4, …}.6 The set of integers {0, ±1, ±2, …} is denoted by ℤ. The set of 6 Some

texts define the set of natural numbers to exclude 0.

37

38

Chapter II: Mathematical Induction

all positive integers is denoted by ℤ+ . We say that the set 𝐴 is a subset of the set 𝐵, denoted by 𝐴 ⊂ 𝐵, if every element in 𝐴 is in 𝐵. A subset 𝐴 of 𝐵 is a proper subset of 𝐵 if 𝐴 ≠ 𝐵. In Definition 1, we stated that the terms collection and element are left undefined. These are examples of primitive terms that have no definition, yet nevertheless may strike the human mind as being vaguely sensible. In like fashion, we assume that our number system possesses a few primitive properties that seem intuitively reasonable. Such properties are taken as axioms, somewhat self-evident statements that are assumed to be true.7 One of these axioms is the well-ordering principle of the set of the positive integers. Axiom 2: The well-ordering principle of ℤ+ . If 𝐴 is a nonempty subset of ℤ+ , then 𝐴 has a least member. An intuitive rationale. Although axioms cannot be proved, we offer an intuitive argument which may suggest why this axiom is self-evident. To find the least member of any nonempty set 𝐴 of ℤ+ , we choose an integer 𝑎 ∈ 𝐴. Suppose 𝑎 is the least element in 𝐴. Well and good! If 𝑎 is not the least element, then from 𝐴 we discard all those integers larger than 𝑎, leaving a finite set of integers 𝐴1 . This new set must contain an integer 𝑎1 smaller than 𝑎. Discard 𝑎 from 𝐴1 leaving the set 𝐴2 . If 𝑎1 is the least member of 𝐴2 , again, well and good: It is the least member of 𝐴. Otherwise we repeat this process on 𝑎1 so as to find an integer 𝑎2 in 𝐴2 with 𝑎2 < 𝑎1 , and so on. But we can only repeat this back-pedaling process a finite number of times because 𝐴1 is a finite set. ♢ As examples of using this axiom, observe that 𝐹 = {7, 15, 31} is a nonempty subset of ℤ+ , and its least member is 7. The least element for the set of positive even integers is 2. However, the set of even integers has no least member. Axiom 2 gives us the familiar division algorithm. When 𝑎 and 𝑏 are integers with 𝑎 > 0, recall that the phrase 𝑎 divides 𝑏, denoted by 𝑎|𝑏, is defined in Definition I.4, p. 10. Proposition 3: The division algorithm. Given 𝑎, 𝑏 ∈ ℤ+ , we have 𝑏 = 𝑞𝑎 + 𝑟 for some unique integers 𝑞 and 𝑟 with 0 ≤ 𝑟 < 𝑎. Proof. See Exercise 1, a detailed outline of the proof asking the reader to provide the reasons for each step. 7 The first few class discussions in a course on real analysis often are devoted to stating these axioms explicitly. In this book we will typically be less formal and assume without comment the usual basic rules for the arithmetic and ordering of the integers.

Set notation and the well-ordering principle

39

Example 4: The division algorithm generates continued fractions. As a preview of continued fractions from Chapter IX, we apply the division algorithm 532 repeatedly to write the fraction as a finite simple continued fraction, defined 1193 in Definition 1 of the Introduction. At each stage we box a newly generated partial denominator. Since this positive fraction is less than 1, the initial partial denominator is 0 . A first application of the division algorithm gives 1193 = 2 ⋅ 532 + 129. Thus 1 532 1 = = . 1193 129 1193 2+ ( ) 532 532 A second application gives 532 = 4 ⋅ 129 + 16. Thus 532 = 1193

1 1 2+ 532 ( ) 129

1

=

1

2+ 4+

.

16 129

A third application gives 129 = 8 ⋅ 16 + 1. Because the remainder of this last division was 1, two partial denominators were boxed in this last step. We now have 532 1 1 , = = 1 1 1193 2+ 2+ 1 1 4+ 4+ 1 129 8+ ( ) 16 16 532

giving the finite simple continued fraction = [0; 2, 4, 8, 16]. As it turns 1193 out, the only other simple continued fraction representation for this fraction is [0; 2, 4, 8, 15, 1]. ♢ Puzzle 5: Find the error. Figure 6 is a page from Leibniz’s notebooks. He 4290000 is calculating the continued fraction for . As observed in a 2013 blog by 135424 Stephen Wolfram, Leibniz made a mistake somewhere in his calculations. Use the ideas of Example 4 to correct his mistake. (Note: Leibniz uses the symbol Π as an equal sign.8 ) ♢ Definition 6: Relatively prime. We say that two positive integers 𝑝 and 𝑞 are relatively prime if 1 is the only positive common divisor of 𝑝 and 𝑞. 8 The first person to use the symbol = to represent equality of two items was the Englishman Robert Recorde in 1557.

40

Chapter II: Mathematical Induction

Figure 6. A page from Leibniz’s notebook. Image courtesy of the Gottfried Wilhelm Leibniz Bibliothek. Proposition 7: A linear combination of two relatively prime integers. Let 𝑝 and 𝑞 be positive integers. Then 𝑝 and 𝑞 are relatively prime if and only if there exist integers 𝑥 and 𝑦 with 𝑝𝑥 + 𝑞𝑦 = 1. Proof. Let 𝑝 and 𝑞 be relatively prime and let 𝑆 = {𝑝𝑥+𝑞𝑦| 𝑝𝑥+𝑞𝑦 > 0, 𝑥, 𝑦 ∈ ℤ}. Observe that 𝑆 ≠ ∅ because 𝑝 ∈ 𝑆 (take 𝑥 = 1 and 𝑦 = 0). By the well-ordering axiom, let 𝑑 be the least member of 𝑆. So 𝑑 = 𝑝𝑥0 + 𝑞𝑦0 for some integers 𝑥0 and 𝑦0 . Suppose that 𝑑 > 1. By Proposition 3, there exist unique integers 𝑚 and 𝑟 with 0 ≤ 𝑟 < 𝑑 and 𝑝 = 𝑚𝑑 + 𝑟. So 𝑟 = 𝑝 − 𝑚𝑑 = 𝑝 − (𝑝𝑥0 + 𝑞𝑦0 )𝑚 = 𝑝(1 − 𝑥0 𝑚) + 𝑞(−𝑚𝑦0 ), which means that 𝑟 = 0 (otherwise 𝑟 would be a lesser positive element of 𝑆 than is 𝑑). So 𝑑 divides 𝑝. Similarly, 𝑑 divides 𝑞. Therefore 𝑑 = 1, and so 𝑝𝑥0 +𝑞𝑦0 = 1. To prove the converse, suppose 𝑝𝑥0 + 𝑞𝑦0 = 1 for some integers 𝑥0 and 𝑦0 . Let 𝑑 be a common integer divisor of 𝑝 and 𝑞. Then 𝑑 must divide 1, which means that 𝑑 = 1. Therefore 𝑝 and 𝑞 are relatively prime. Proposition 8: Equivalence of primes and irreducibles in ℤ. Let 𝑝 > 1 be an integer. Then 𝑝 is prime if and only if 𝑝 is irreducible. Proof. Recall the notions of prime and irreducible from p. 10. Let 𝑝 be irreducible and suppose that 𝑝|𝑚𝑛, where 𝑚 and 𝑛 are positive integers. We must show that 𝑝 divides either 𝑚 or 𝑛. Assume that 𝑝 fails to divide 𝑚. Then 𝑝 and 𝑚 are relatively prime. By Proposition 7 there exist integers 𝑥 and 𝑦 with 𝑝𝑥 + 𝑚𝑦 = 1. Therefore 𝑝𝑛𝑥 + 𝑚𝑛𝑦 = 𝑛. (2)

The principle of mathematical induction

41

Since 𝑝|𝑝𝑛𝑥 and 𝑝|𝑚𝑛𝑦, we must have 𝑝|(𝑝𝑛𝑥 + 𝑚𝑛𝑦). By Equation (2), then 𝑝|𝑛. Therefore 𝑝 is prime. Conversely, suppose 𝑚|𝑝 for some positive integer 𝑚 when 𝑝 is prime. So there is a positive integer 𝑛 with 𝑚𝑛 = 𝑝. Observe that 𝑚 ≤ 𝑝 and 𝑛 ≤ 𝑝. By definition, 𝑝 divides either 𝑚 or 𝑛. If 𝑝|𝑚, then 𝑚 ≥ 𝑝. So 𝑚 = 𝑝, which means that 𝑛 = 1. Otherwise 𝑝|𝑛, which means that 𝑝 = 𝑛, and so 𝑚 = 1. Thus 𝑝 is irreducible. As we progress through this book we will see that much mathematical structure is developed inductively—the custom of defining new values in terms of already established values. The following definition gives an archetypal example of what we mean. Definition 9: The factorial function. Define the factorial of zero, denoted by 0!, as 0! = 1. For each 𝑛 ∈ ℤ+ , the factorial of 𝑛, denoted by 𝑛!, is 𝑛! = 𝑛(𝑛 − 1)!. To illustrate, 5! = 5⋅4! = 5⋅4⋅3! = 5⋅4⋅3⋅2! = 5⋅4⋅3⋅2⋅1! = 5⋅4⋅3⋅2⋅1⋅0! = 120. A major tool used to prove families of inductively related statements is developed in the next section.

The principle of mathematical induction Example 10: Families of statements. When we write a sentence 𝒮 involving an as-yet-to-be-specified number of items such as 𝒮 ∶ My dog Fido has buried

bones in the back yard.

(3)

we could fill in the blank with any specific integer 𝑛. With 𝑛 = 5, we denote the sentence by 𝒮5 . Thus the open sentence (3) becomes 𝒮5 ∶ My dog Fido has buried 5 bones in the back yard. In general, we write 𝒮𝑛 ∶ My dog Fido has buried 𝑛 bones in the back yard. We thus have an infinite family of related statements {𝒮𝑛 }∞ 𝑛=1 . As a more interesting example, we revisit Leibniz’s identity from Strand II about the sum of a finite number of successive odd integers beginning with 1. We write his claim as a family of related statements with a fill-in-the-blank integer: 𝒮1 ∶ 1 = 12 . 𝒮2 ∶ 1 + 3 = 22 . 𝒮3 ∶ 1 + 3 + 5 = 32 . .. . 𝒮𝑛 ∶ 1 + 3 + 5 + ⋯ + (2𝑛 − 1) = 𝑛2 . .. .

♢

42

Chapter II: Mathematical Induction

When successive statements within a family of statements are sufficiently related, sometimes we can prove that all of the statements are true using the following variation of the well-ordering principle. Proposition 11: Mathematical induction. For each 𝑛 ∈ ℤ+ , let 𝒮𝑛 be a statement about 𝑛. If 𝒮1 is true, and if 𝒮𝑛+1 is true whenever 𝒮𝑛 is true, then 𝒮𝑛 is true for all 𝑛 ∈ ℤ+ . Proof. Let 𝐴 = {𝑛 ∈ ℤ+ | 𝒮𝑛 is false}. Observe that 1 ∉ 𝐴 because we are given that 𝒮1 is true. Suppose 𝐴 ≠ ∅. Let 𝑎 be the least member of 𝐴. Such an element must exist by the well-ordering principle. Observe that 𝑎 > 1, so 𝑎 − 1 ∈ ℤ+ and 𝑎 − 1 ∉ 𝐴. Thus 𝒮𝑎−1 is true, which by the given hypothesis means that 𝒮𝑎 is true, too, a contradiction. Therefore 𝐴 = ∅ and so 𝒮𝑛 is true for all 𝑛 ∈ ℤ+ . At times we use an alternate, yet equivalent, version of Proposition 11, called strong mathematical induction, whose proof is left as an exercise. Proposition 12: Strong mathematical induction. Let 𝒮𝑛 be a statement for each integer 𝑛 ∈ ℤ+ . If 𝒮1 is true, and if 𝒮𝑛 is true whenever 𝒮𝑘 is true for 1 ≤ 𝑘 < 𝑛, 𝑘 ∈ ℤ+ , then 𝒮𝑛 is true for all 𝑛. We apply mathematical induction to prove two identities presented in Strand II and a theorem from Chapter I. Proposition 13: An odd sum. For each 𝑛 ∈ ℤ+ , 𝑛

∑ (2𝑘 − 1) = 𝑛2 .

(4)

𝑘=1

Proof. Let 𝒮𝑛 be Equation (4). Observe that 𝒮1 is the statement 1 = 12 . Thus 𝒮1 is true. Assume that 𝒮𝑛 is true for some integer 𝑛 ≥ 1. Then 1+3+⋯+(2𝑛−1)+(2𝑛+1) = (1+3+⋯+(2𝑛−1))+(2𝑛+1) = 𝑛2 +(2𝑛+1) = (𝑛+1)2 , which means that 𝒮𝑛+1 is true. Therefore by induction the statement 𝒮𝑛 is true for all 𝑛 ∈ ℤ+ . Recall that the 𝑛th triangular number 𝑇𝑛 from Strand II is the sum of the first 𝑛 positive integers. Proposition 14: A triangular sum. For each 𝑛 ∈ ℤ+ , 𝑛

𝑇𝑛 = ∑ 𝑘 = 𝑘=1

𝑛(𝑛 + 1) . 2

(5)

The fundamental theorem of arithmetic

43

Proof. The proof of this proposition is Exercise 2a. ∞

1 = 2. The next proposition is a proof of an 𝑇 𝑛=1 𝑛

From Strand II, recall that ∑ equivalent statement.

Proposition 15: A telescoping series. ∞

∞

1 1 1 = ∑( − ) = 1. 2𝑇 𝑛 𝑛 + 1 𝑛 𝑛=1 𝑛=1 ∑

(6)

𝑛

1 1 = 1− . Observe that 𝒮1 is 𝑛 + 1 𝑘(𝑘 + 1) 𝑘=1

Proof. Let 𝒮𝑛 be the statement 𝒮𝑛 ∶ ∑

1 1 true because = 1 − . Now assume that 𝒮𝑛 is true for some integer 𝑛 ≥ 1. 1⋅2 2 Then 𝑛+1 𝑛 1 1 1 ∑ = (∑ )+ 𝑘(𝑘 + 1) 𝑘(𝑘 + 1) (𝑛 + 1)(𝑛 + 2) 𝑘=1 𝑘=1 1 1 1 1 = (1 − − , )+( )=1− 𝑛+1 𝑛+1 𝑛+2 𝑛+2 which means that 𝒮𝑛+1 is true. Therefore by induction the statement 𝒮𝑛 is true for all 𝑛 ∈ ℤ+ . Finally, since 1/(𝑛 + 1) approaches zero as 𝑛 increases, Equation (6) is true.

The fundamental theorem of arithmetic As promised in Chapter I, and having established the equivalence of primes9 and irreducibles in Proposition 8, we now prove the fundamental theorem of arithmetic by mathematical induction. The proof is broken into three parts over the next three propositions. Proposition 16: A product of primes. Every integer 𝑛 > 1 can be written as a product of primes. Proof. Let 𝒮𝑛 be the statement that 𝑛 can be written as a product of primes. Observe that 𝒮2 is true because 2 is prime. Suppose for some positive integer 𝑛, 𝒮𝑘 is true for all integers 𝑘 with 2 ≤ 𝑘 < 𝑛. If 𝑛 is prime then 𝒮𝑛 is true. Otherwise 𝑛 is composite. Thus there exists a prime 𝑝 with 𝑛 = 𝑝𝑚 for some positive integer 𝑚 with 2 ≤ 𝑚 < 𝑛. By the 9 In this chapter, unless we specify otherwise, we assume that a prime integer must be positive. The reader may wish to recall the general definition of a prime number in Exercise I.10 and to look ahead to the use of equivalence classes in Example 25.

44

Chapter II: Mathematical Induction

inductive hypothesis 𝑚 can be written as a product of primes. Therefore 𝑛 = 𝑝𝑚 can be written as product of primes. So 𝒮𝑛 is true. By the principle of strong mathematical induction, the proposition is true. 𝑘

Proposition 17: When a prime divides a product. If prime 𝑝 divides Π 𝑎𝑖 𝑖=1

where 𝑎𝑖 are positive integers, 1 ≤ 𝑖 ≤ 𝑘, then 𝑝|𝑎𝑖 for some 𝑖.

Proof. The proposition is true for 𝑘 = 1 by default. Suppose the proposition 𝑘+1

𝑘

is true for some 𝑘 ≥ 1 and that 𝑝| Π 𝑎𝑖 . By Definition I.5, then 𝑝| Π 𝑎𝑖 or 𝑖=1

𝑖=1

𝑝|𝑎𝑘+1 . If the latter case occurs the proposition is true for 𝑘 + 1. Otherwise, by the inductive hypothesis, 𝑝|𝑎𝑖 for some 𝑖, 1 ≤ 𝑖 ≤ 𝑘. Therefore the statement is true for 𝑘 + 1. Proposition 18: Unique factorization. Every positive integer 𝑛 greater than 1 can be written as a product of primes in exactly one way (up to the order of the primes). Proof. The proposition is true when 𝑛 is 2 or 3. Assume that the proposition is true for all integers less than 𝑛 for some integer 𝑛 > 3. By Proposition 16, 𝑛 can be written as a product of primes. Suppose 𝑛 can be factored in two different ways: 𝑛 = 𝑝1 𝑝2 ⋯ 𝑝𝑚 = 𝑞1 𝑞2 ⋯ 𝑞𝑘 ,

(7)

where 𝑝𝑖 and 𝑞𝑗 are primes, 1 ≤ 𝑖 ≤ 𝑚, 1 ≤ 𝑗 ≤ 𝑘. By Proposition 17, 𝑝𝑚 |𝑞𝑗 for some 𝑗. So 𝑝𝑚 = 𝑞𝑗 . Thus 𝑝𝑚 can be canceled from Equation (7), leaving ℓ = 𝑝1 𝑝2 ⋯ 𝑝𝑚−1 = 𝑞1 𝑞2 ⋯ 𝑞𝑗−1 𝑞𝑗+1 ⋯ 𝑞𝑘 .

(8)

If ℓ ≥ 𝑛 then 𝑛 = 𝑝𝑚 ⋅ℓ > 𝑛, a contradiction. Thus ℓ < 𝑛. So unique factorization holds for ℓ, which means that it does for 𝑛 as well. Therefore the two products in Equation (7) contain the same primes, perhaps in different orders. Corollary 19: The fundamental theorem of arithmetic. Every integer 𝑛 > 1 𝑘

𝛼

can be written uniquely as a product of distinct primes, 𝑛 = Π 𝑝𝑖 𝑖 , where 𝑝1 < 𝑝2 < ⋯ < 𝑝𝑘 and 𝛼𝑖 ∈ ℤ+ , 1 ≤ 𝑖 ≤ 𝑘.

𝑖=1

Not every number system shares with the integers the property of unique factorization into irreducibles. For example, consider the set ℤ[√10] of all numbers of the form 𝑎+𝑏√10 where 𝑎 and 𝑏 are integers, as presented in Exercise I.10. The integer 6 factors both as 2 ⋅ 3 and as (2 + √10)(−2 + √10), and Exercises I.10d and I.10e show that each of the factors 2, 3, and ±2 + √10 is irreducible in ℤ[√10].

Equivalence classes

45

The next proposition uses the fundamental theorem of arithmetic to prove a statement about an irrational number. Recall that a rational number is a quotient of integers whose denominator is not 0. Proposition 20: √2 is not a rational number. 𝑚

Proof. We use an indirect proof and assume that √2 = , where 𝑚, 𝑛 ∈ ℤ+ and 𝑛 𝑚 and 𝑛 have no common divisors in ℤ+ except 1. Since 1 < 2 < 4, we have 1 < √2 < 2, which means that √2 ∉ ℤ+ , which in turn means that 𝑛 > 1. Let 𝑝 be a prime divisor of 𝑛. Since 2𝑛2 = 𝑚2 , 𝑝|𝑚2 . By Proposition 17, the only prime divisors of 𝑚2 are prime divisors of 𝑚. So 𝑝|𝑚, a contradiction. Therefore √2 is not a rational number. For a related result, Exercise 5c asks the reader to show that the number 𝑒 is not rational.

Equivalence classes In Strand II, we used the terms even and odd integers somewhat naturally. These are archetypal examples of what are called equivalence classes. To make this notion precise, first recall that two items are said to be related to each other if they have a special property in common. Thus we might say that two people are related if they have the same biological mother. Alternatively, two people may be related in another sense because they have the same eye color. Is every item always related to itself? If we define a specific relation so that two objects are related if they are not the same, then I am related to the universe, but I am not related to me. A much more tame kind of relation that has proven to be especially helpful in sorting various collections for ease of classification, study, and conversation consists of those relations that satisfy three somewhat natural properties. Definition 21: Equivalence relation. We say that a relation 𝑅 on the set 𝑄 is an equivalence relation if for any 𝑝, 𝑞, 𝑟 ∈ 𝑄, 𝑅 has three properties: i. Reflexive: 𝑝 𝑅 𝑝, where 𝑝 𝑅 𝑝 is read p is related to p. ii. Symmetric: 𝑝 𝑅 𝑞 whenever 𝑞 𝑅 𝑝. iii. Transitive: 𝑝 𝑅 𝑟 whenever 𝑝 𝑅 𝑞 and 𝑞 𝑅 𝑟. For each 𝑝 ∈ 𝑄, the equivalence class containing 𝑝 is the set 𝑄𝑝 = {𝑞 ∈ 𝑄| 𝑞 𝑅 𝑝}.

46

Chapter II: Mathematical Induction

Example 22: Other non-equivalence relations. Let 𝑎, 𝑏 ∈ ℤ. Define 𝑎 and 𝑏 to be related to each other, denoted by 𝑎 ∼ 𝑏, if |𝑎 − 𝑏| = 1. Thus 1 ∼ 2, and 2 ∼ 3. But 1 is not related to 3 because |1 − 3| = 2 ≠ 1. Therefore relation ∼ is not an equivalence relation. As a second example, define the integers 𝑎 and 𝑏 to be related to each other if 𝑎|𝑏. Thus 3 is related to 6 because 3|6, but 6 fails to be related to 3. ♢ Proposition 23: Distinct equivalence classes on 𝑄 partition 𝑄. Let ∼ be an equivalence relation on a nonempty set 𝑄. Let 𝑝, 𝑞 ∈ 𝑄. Then 𝑄𝑝 ∩ 𝑄𝑞 = ∅ or 𝑄𝑝 = 𝑄𝑞 . Furthermore, 𝑄 is the union of all the equivalence classes. Proof. We leave the proof of this proposition to the reader. Proposition 24: The equivalence classes of even and odd integers. Let ≡ be the relation on ℤ where 𝑝 ≡ 𝑞 if and only if 2|(𝑝−𝑞). Then ≡ is an equivalence relation on ℤ and partitions ℤ into two equivalence classes, the odd integers and the even integers. Proof. Let 𝑝, 𝑞, 𝑟 ∈ ℤ. Observe that 𝑝 ≡ 𝑝 because 2 divides 0 = 𝑝 − 𝑝. Furthermore, if 2|(𝑝 − 𝑞), then 2|(𝑞 − 𝑝). Finally, if 2|(𝑝 − 𝑞) and 2|(𝑞 − 𝑟), there exist integers 𝑚 and 𝑛 with 𝑝 − 𝑞 = 2𝑚 and 𝑞 − 𝑟 = 2𝑛 so that 𝑝 − 𝑟 = 𝑝 − 𝑞 + 𝑞 − 𝑟 = 2𝑚 + 2𝑛 = 2(𝑚 + 𝑛). Thus 2|(𝑝 − 𝑟). Since ≡ possesses all three properties, ≡ is an equivalence relation. For every 𝑛, by Proposition 3, 𝑛 = 2𝑞 + 𝑟 with 0 ≤ 𝑟 ≤ 1. If 𝑟 = 0, then 𝑛 is even and is equivalent to 0. If 𝑟 = 1, then 𝑛 is odd and is equivalent to 1. As a more interesting example of equivalence relations we look again at the Gaussian integers10 ℤ[𝑖] = {𝑎 + 𝑏𝑖| 𝑎, 𝑏 ∈ ℤ}. Example 25: An equivalence relation on the Gaussian integers.∗ We say that two numbers 𝑎 and 𝑏 are associates if 𝑎 = 𝑢𝑏 for some unit 𝑢. For example, the irreducibles 1 + 𝑖 and −1 − 𝑖 are associates, as are 1 − 𝑖 and −1 + 𝑖. We say that two numbers 𝑎 and 𝑏 are equivalent, denoted by 𝑎 ∼ 𝑏, if 𝑎 and 𝑏 are associates. Gauss showed that, under this equivalence relation, ℤ[𝑖] possesses the unique factorization property and that, with 𝑎, 𝑏, and 𝑐 in ℤ, the prime Gaussian integers are those complex numbers of the form 𝑢𝑐 or 𝑎 + 𝑏𝑖 where, respectively, 𝑢 is a unit and |𝑐| is a prime integer that cannot be written as a sum 𝑚2 + 𝑛2 for some integers 𝑚 and 𝑛, or 𝑎2 + 𝑏2 is a prime integer. Note that 2 factors uniquely (with respect to the equivalence relation ∼) into the prime irreducibles (1 + 𝑖)(1 − 𝑖), whereas 3 and −3 are equivalent and are both Gaussian primes. ♢ 10 Gaussian integers and units were introduced in Exercise I.10.

±𝑖.

In ℤ[𝑖], the only units are ±1 and

Nim∗

47

Nim∗ As an example involving binary numbers, mathematical induction, equivalence classes, and taking parts of wholes, we consider the two-person mathematical game nim. We characterize a winning strategy for nim. In so doing, what the reader may find surprising is that one plus one in “nim addition” is no longer two. The term nim was coined in 1901 by C. L. Bouton [15], who may have borrowed the term from German since nimm means to take, and taking is what nim is all about. Nim refers to a wide variety of impartial two-person games, a great catalogue of which is [10]. By impartial we mean that from any configuration in the game, the same moves are available to each player. In the normal play convention for nim, the first person unable to make a move loses.

Figure 7. The Decision of the Flower. The simplest version of nim is equivalent to a popular solitaire French game, Effeuiller la Marguerite, dating back to at least 1820, played by successively plucking petals from a daisy and alternately saying She loves me and She loves me not, as is illustrated in Figure 7a.11 We can interpret the game as the dynamics between two ideal players Yes and No for Yes, she loves me and No, she loves me not. If the flower has an even number of petals, as in Figure 7b, the starting player loses. And if the flower has an odd number, the starting player wins. More interesting versions of this game arise if we vary the rules. Rather than speak of petals, we use the more generic term blocks. 11 Figure 7a depicts a maiden and a cavalier. The maiden is plucking the petals of the flower as the couple playfully sees whether the flower suggests that they love each other or not.

48

Chapter II: Mathematical Induction

Definition 26: Configurations. A collection of a finite number of blocks arranged into stacks of blocks is called a configuration. The size of a stack is the number of blocks in the stack. The term configuration n refers to a single stack of 𝑛 blocks, 𝑛 ∈ ℕ. The term configuration (𝑚, 𝑛) refers to two stacks of sizes 𝑚 and 𝑛, respectively, 𝑚, 𝑛 ∈ ℕ, and so on. A configuration 𝒟 is a subconfiguration of the configuration 𝒞 if the set of stacks of blocks in 𝒟 is a subset of the set of stacks of blocks in 𝒞. We say that a configuration 𝒟 is simpler than a configuration 𝒞 if the number of stacks in 𝒟 is no more than the number of stacks in 𝒞 and the number of blocks in 𝒟 is less than the number in 𝒞. For example, 𝒞 = (5, 2, 2, 4, 1) is a configuration of five stacks of sizes 5, 2, 2, 4, and 1 for a total of 14 blocks. The configuration (1, 2) is a subconfiguration of 𝒞. The configuration (1, 2, 3) is a simpler configuration than 𝒞 but not a subconfiguration. The configuration (10, 2) is simpler than 𝒞 because it has but two stacks and twelve blocks versus the five stacks and fourteen blocks of 𝒞. The version of nim we consider in this section is the two-person game 𝒩 involving a finite collection of blocks. Nim game 𝒩: Play begins with a configuration. At each turn, a player can choose any stack, and from that stack remove one or more blocks. The first person unable to make a move loses. By convention, person 𝒜 is the player who has the first move in the game of nim, and person ℬ is the second player. In our analysis, we assume that if a player has a winning strategy then the player follows that strategy.

Figure 8. Who wins—the first player or the second player? If this game is new to you, play it with a friend. Try to identify winning strategies. In one of your trials, play Puzzle 27 below. Your stacks of blocks can be piles of coins, heaps of pebbles, or stacks of books.

Nim∗

49

Puzzle 27: A first game. You are confronted with the configuration (5, 2, 4), as illustrated in Figure 8. What’s your first move in this game of nim? (The solution to this puzzle is found in Example 45.) ♢ As we show, the key to finding a good first move is to create a way to assign a value to each stack and then sum these values. Making this assignment precise is the goal of the next few pages. Definition 28: More configuration terminology. Let 𝒞1 and 𝒞2 be two configurations of 𝑚1 and 𝑚2 stacks totaling 𝑛1 and 𝑛2 blocks, respectively, where 𝑚1 , 𝑚2 , 𝑛1 , 𝑛2 ∈ ℕ. We say that the union of the two configurations, denoted by 𝒞1 ∪ 𝒞2 , is the union of the two configurations, consisting of the 𝑚1 + 𝑚2 stacks. Given a configuration 𝒞, we say that a configuration 𝒟 which is the result of a single nim move from 𝒞 is a child configuration of 𝒞. The child set of 𝒞 is the set of the children of 𝒞. When we say that a player reduces12 a configuration 𝒞 to a configuration 𝒟 we mean that 𝒟 is the resultant configuration from 𝒞. To illustrate, let 𝒞1 = 5 and 𝒞2 = (2, 4). Then the configuration 𝒞1 ∪ 𝒞2 = (5, 2, 4) consists of eleven blocks as shown in Figure 8. Incidentally, (5, 2, 4) is the same configuration as (2, 4, 5). Furthermore, (1, 0) and 1 are the same configurations, as are (1, 2) and 1 ∪ 2. When a player removes two blocks from the stack of two blocks in 𝒞2 = (2, 4), the resultant configuration is (0, 4). Equivalently, the player has reduced 𝒞2 to (0, 4). As another way to say it, the player has chosen the child (0, 4) of (2, 4). The child set of 𝒞2 = (2, 4) is the set {(1, 4), (0, 4), (2, 3), (2, 2), (2, 1), (2, 0)}. Observe that (0, 0), (0, 1), (0, 3), (1, 0), (1, 1), (1, 2), and (1, 3), although they are simpler configurations than 𝒞2 , are not children of 𝒞 because at each move we cannot remove blocks from more than one stack. A configuration is in a player’s favor if it is possible for that player to win by some sequence of moves beginning with that configuration regardless of what moves the opponent makes. As stated earlier, yet to emphasize it again, in analyzing this game we assume that, when possible, a player always chooses moves that will lead to a win. The next proposition shows that when a nim configuration consists of two copies of the same configuration of blocks, then 𝒜 always loses. Proposition 29: The tweedledee principle. Suppose that 𝒞1 and 𝒞2 are identical configurations of stacks of blocks, each consisting of 𝑚 blocks, 𝑚 ∈ ℕ. Faced 12 Making a nim move means exchanging a configuration for a child configuration. We will refer to this process in a variety of ways. As another variation besides those in Definition 28, when we say that a player chooses a child 𝒟 of 𝒞, we mean that the player has reduced 𝒞 to 𝒟.

50

Chapter II: Mathematical Induction

with the union of the two configurations, 𝒞1 ∪ 𝒞2 of 2𝑚 blocks, player 𝒜 loses this game of nim. Proof. We use a tweedledee13 principle to show that 𝒜 loses. If 𝑚 = 0, then 𝒜 loses. Let 𝑚 = 1. As play begins, 𝒜 makes a move in one of the two subconfigurations, 𝒞1 or 𝒞2 . Player ℬ counters by executing the exact same move in the subconfiguration not chosen by 𝒜. The resultant configuration now consists of 0 blocks, which means that 𝒜 loses. Now assume that whenever 𝑚 < 𝑛 for some positive integer 𝑛 > 1, 𝒜 loses. Suppose play starts with each configuration having 𝑛 blocks. Then ℬ mirrors 𝒜’s move, resulting in a configuration consisting of two identical subconfigurations, each with at most 𝑛 − 1 blocks. By the strong mathematical induction principle, 𝒜 loses. Corollary 30: Given two stacks of 𝑚 and 𝑛 blocks with 𝑚 > 𝑛 ≥ 0, 𝑚, 𝑛 ∈ ℕ, 𝒜 wins by removing 𝑚 − 𝑛 blocks from the stack of 𝑚. Proof. The proof of this corollary is left as an exercise for the reader. How can we extend the strategy of Proposition 29 to configurations of at least three stacks? Our approach is to assign a nim value to each configuration. We do so by giving the empty configuration the nim value 0 and working our way to more exotic configurations one block at a time. The key idea we use to assign nim values is the observation that the child set for 𝑛, the configuration of a single stack of 𝑛 blocks, is {0, 1, 2, … , 𝑛 − 1}, where 𝑛 ∈ ℤ+ . With this idea in mind, we make the following definition. Definition 31: The minimal excluded number [10, p. 56]. Given a proper subset 𝐴 of ℕ, the minimal excluded number of 𝐴, denoted by mex(𝐴), is the least member of 𝐴𝑐 = {𝑛 ∈ ℕ| 𝑛 ∉ 𝐴}. For example, let 𝐴 = {0, 1, 2, 3, 5, 7}. Since 𝐴𝑐 = {4, 6} ∪ {8, 9, 10, …}, mex(𝐴) = 4. Given a finite subset 𝐴 of ℕ, 𝐴𝑐 with respect to the universal set ℕ is nonempty. By Axiom 2, 𝐴𝑐 has a least member, and so mex(𝐴) exists. Definition 32: Nim value of configurations. Let 𝒞 be a configuration, and let 𝑉(𝒞) denote the nim value of 𝒞. If 𝒞 is the empty configuration, then 𝑉(𝒞) = 0. If 𝒞 is not empty, then 𝑉(𝒞) is defined recursively: 𝑉(𝒞) = mex{𝑉(𝒟)| 𝒟 is a child of 𝒞}. 13 The term tweedledee alludes to Lewis Carroll’s twin characters Tweedledee and Tweedledum from Alice’s further adventures in Through the Looking-Glass, published in 1871. Whatever one twin does the other also does.

Nim∗

51

If 𝑉(𝒞) = 0, then either 𝒞 is the empty configuration or all the child configurations of 𝒞 have positive nim values. On the other hand, if 𝑉(𝒞) = 𝑛 > 0, then 𝒞 has children with nim values 0 through 𝑛 − 1. Although no child of 𝒞 has nim value 𝑛, 𝒞 could have a child with nim value greater than 𝑛. Armed with this recursive definition, the next example shows how to calculate the nim value of the configuration 𝒞 = (1, 2). Example 33: The nim value of (1, 2). The nim values of the pertinent configurations simpler than (1, 2) are as given below. • 𝑉(0) = 𝑉(0, 0) = 0. Observe that as configurations, 0 = (0, 0) because both consist of zero blocks. • 𝑉(1) = 𝑉(1, 0) = mex{𝑉(0, 0)} = mex{0} = 1. By symmetry, 𝑉(0, 1) = 1. • 𝑉(1, 1) = mex{𝑉(1, 0), 𝑉(0, 1)} = mex{1, 1} = 0. • 𝑉(2) = 𝑉(0, 2) = mex{𝑉(0, 1), 𝑉(0, 0)} = mex{1, 0} = 2. Therefore 𝑉(1, 2) = mex{𝑉(0, 2), 𝑉(1, 1), 𝑉(1, 0)} = mex{2, 0, 1} = 3.

♢

By mathematical induction, we show that every configuration of 𝑛 blocks has nim value no more than 𝑛. Proposition 34: The existence of nim values. Let 𝑛 ∈ ℕ, and let 𝒞 be a configuration of 𝑛 blocks. Then 𝑉(𝒞) = 𝑘 for some integer 𝑘 with 0 ≤ 𝑘 ≤ 𝑛. Proof. By our previous work we know that 𝑉(0) = 0 and 𝑉(1) = 1. Since there is but one configuration of a stack of one block, the proposition is true when 𝑛 = 1, as well as when 𝑛 = 0. Assume that for some positive integer 𝑛 ≥ 2 the proposition is true for all integers 𝑗 with 0 ≤ 𝑗 < 𝑛. Let 𝒞 be a configuration of 𝑛 blocks. Each child of 𝒞 contains at most 𝑛−1 blocks. By the inductive hypothesis, the nim value for any child of 𝒞 is some integer 𝑗 with 0 ≤ 𝑗 < 𝑛. The minimal excluded number 𝑘 for such sets of nim values exists and is no greater than 𝑛. Lemma 35: Nim value of a single stack. Let 𝒞 be the configuration 𝑛, a stack of 𝑛 blocks, 𝑛 ∈ ℕ. Then 𝑉(𝑛) = 𝑛. Proof. The proof is left as an exercise. Knowing the nim value of a configuration 𝒞 tells us who should win that nim game, as the next proposition shows. While reading these nim proofs, remember that every configuration of positive nim value must have a child with value 0. Proposition 36: A loss for 𝒜. Let 𝒞 be a configuration, and let 𝒜 be the first player. Player 𝒜 loses the game if and only if 𝑉(𝒞) = 0.

52

Chapter II: Mathematical Induction

Proof. Let 𝑛 be the number of blocks in 𝒞. If 𝑛 = 0, then 𝑉(𝒞) = 0 and 𝒜 loses because there are no blocks in 𝒞 (which means that 𝒜 is unable to make a move). Assume that for some 𝑛 ≥ 1, 𝒜 loses whenever 𝑉(𝒞) = 0 and 𝒞 has at most 𝑛 − 1 blocks. Let 𝒞 be a configuration of 𝑛 blocks with 𝑉(𝒞) = 0. Once the game starts, 𝒜 chooses some child 𝒟 of 𝒞. Since each child of 𝒞 has nim value greater than 0, 𝑉(𝒟) > 0. Since 𝑉(𝒟) > 0, one of its children ℰ must have value 0. So ℬ chooses ℰ. Since ℰ has fewer than 𝑛 blocks and 𝑉(ℰ) = 0, 𝒜 loses by the inductive hypothesis. Let 𝒞 be a configuration for which 𝒜 loses. Suppose 𝑉(𝒞) > 0. 𝒜 chooses a child 𝒟 of 𝒞 for which 𝑉(𝒟) = 0. As we have already shown, the player moving in such a configuration loses. So ℬ loses and 𝒜 wins. Thus 𝑉(𝒞) cannot be positive, which means that 𝑉(𝒞) = 0. Now we can refine Proposition 29 in the context of nim values. Proposition 37: A necessary condition for nim equality. Let 𝒞1 and 𝒞2 be configurations with 𝑉(𝒞1 ) = 𝑛 = 𝑉(𝒞2 ), 𝑛 ∈ ℕ. Then 𝑉(𝒞1 ∪ 𝒞2 ) = 0.

(9)

Proof. Let 𝒜 make a move. Without loss of generality, assume that the move is to child 𝒟 of 𝒞1 with resultant configuration 𝒟 ∪ 𝒞2 . Observe that 𝑉(𝒟) = 𝑘 ≠ 𝑛 for some integer 𝑘 ∈ ℕ. If 𝑘 > 𝑛, some child ℰ of 𝒟 must have value 𝑛. So ℬ has a move that produces the same type of configuration as the original configuration yet with fewer blocks. If 𝑘 < 𝑛, some child ℱ of 𝒞2 must have value 𝑘. So ℬ has a move that produces the same type of configuration as the original configuration yet with fewer blocks. Therefore as long as 𝒜 has a move, ℬ has a move, which means that 𝒜 loses this game of nim. Thus, by Proposition 36, 𝑉(𝒞1 ∪𝒞2 ) = 0. Corollary 38: A sufficient condition for nim equality. If 𝒞1 and 𝒞2 are configurations with 𝑉(𝒞1 ∪ 𝒞2 ) = 0, then 𝑉(𝒞1 ) = 𝑉(𝒞2 ). Proof. Suppose that 𝑉(𝒞1 ) = 𝑛1 , 𝑉(𝒞2 ) = 𝑛2 , 𝑛1 ≠ 𝑛2 , and 𝑉(𝒞1 ∪ 𝒞2 ) = 0, 𝑛1 , 𝑛2 ∈ ℕ. Without loss of generality, assume that 𝑛1 > 𝑛2 . Player 𝒜 chooses a child 𝒟 of the subconfiguration 𝒞1 with 𝑉(𝒟) = 𝑛2 . By Proposition 37, the configuration now facing player ℬ, 𝒟 ∪ 𝒞2 , has nim value 0. By Proposition 36, ℬ loses this game of nim, a contradiction. Therefore, 𝑉(𝒞1 ) = 𝑉(𝒞2 ). Corollary 39: A nim equivalence. Let 𝒞 and 𝒟 be configurations with nim values 𝑐 and 𝑑, respectively, 𝑐, 𝑑 ∈ ℕ. Then 𝑉(𝒞 ∪ 𝒟) = 𝑉(𝑐, 𝑑), where (𝑐, 𝑑) is the configuration of two stacks of sizes 𝑐 and 𝑑. Proof. Using an argument similar to the proof of Proposition 37, it follows that 𝑉(𝒞 ∪ 𝒟 ∪ 𝑐 ∪ 𝑑) = 0,

Nim∗

53

where 𝑐 and 𝑑 in this context are configurations of a stack of 𝑐 blocks and a stack of 𝑑 blocks, respectively. By Corollary 38, 𝑉(𝒞 ∪ 𝒟) = 𝑉(𝑐 ∪ 𝑑) = 𝑉(𝑐, 𝑑). The utility of Corollary 39 is that since the configurations 𝒞 ∪𝒟 and (𝑐, 𝑑) are equivalent14 in the sense of having a common nim value, finding a good move in the latter configuration might suggest a good move in the former configuration. To help in finding nim values of configurations, we define nim addition. Definition 40: Nim addition. Let 𝑚, 𝑛 ∈ ℕ. The nim sum of 𝑚 and 𝑛, denoted ∗ ∗ by 𝑚 + 𝑛, is 𝑚 + 𝑛 = 𝑉(𝑚, 𝑛). Observe that 𝑉(𝑚, 0) = 𝑉(𝑚) = 𝑉(0, 𝑚) = 𝑚 for all 𝑚 ∈ ℕ since (𝑚, 0), (0, 𝑚), and 𝑚 are different ways of representing a single stack of size 𝑚. To calculate 𝑉(𝑚, 𝑛) where 𝑚, 𝑛 ∈ ℕ, we must find the minimal excluded number from the set {𝑉(𝑚 − 1, 𝑛), 𝑉(𝑚 − 2, 𝑛), … , 𝑉(0, 𝑛)} ∪ {𝑉(𝑚, 𝑛 − 1), 𝑉(𝑚, 𝑛 − 2), … , 𝑉(𝑚, 0)}. ∗

Thus, by Example 33 and Definition 40, 1 + 2 = 3. As the reader may verify, ∗ the nim sums 𝑚 + 𝑛 for all integers 0 ≤ 𝑚, 𝑛 ≤ 3 are given in Table 1. ∗

Table 1. The nim values 𝑚 + 𝑛, where 0 ≤ 𝑚 ≤ 3 and 0 ≤ 𝑛 ≤ 3. ∗

+

0 1 2 3

0 0 1 2 3

1 1 0 3 2

2 2 3 0 1

3 3 2 1 0

∗

Definition 41: We say that a subset 𝑆 of ℕ is closed under nim addition if 𝑎 + 𝑏 ∈ 𝑆 for all 𝑎, 𝑏 ∈ 𝑆. Table 1 shows that {0, 1, 2, 3} is closed under nim addition, as are the sets {0} ∗ and {0, 1}. However, the set {0, 1, 2} is not closed because 1 + 2 = 3. Before attempting to extend this table to include nim values 𝑛 with 𝑛 > 3, observe that since the order of the stacks in nim is unimportant, nim addition is both commutative and associative. Thus, whenever we find configurations ∗ ∗ ∗ ∗ where 𝑚 + 𝑘 = 𝑛 + 𝑘, we can conclude that 𝑚 = 𝑛 (because if 𝑚 + 𝑘 = 𝑛 + 𝑘 then 14 Exercise 7c asks the reader to show that nim equality is an equivalence relation on the set of all nim configurations.

54

Chapter II: Mathematical Induction ∗

∗

∗

∗

𝑚 = 𝑚 + 0 = 𝑚 + 𝑘 + 𝑘 = 𝑛 + 𝑘 + 𝑘 = 𝑛 + 0 = 𝑛). Therefore, in any table of nim sums, no nim value will occur more than once in any row or any column.15 A convenient way to extend Table 1 is to use mathematical induction, taking advantage of the structure of closed sets of nim values under nim addition. Proposition 42: Evaluating nim sums. Let 𝑆𝑛 = {0, 1, … , 2𝑛+1 − 1}, where 𝑛 is an integer with 𝑛 ≥ −1. Then 𝑆𝑛 is closed under nim addition for 𝑛 ≥ −1. In particular, when 𝑛 ≥ 0, ∗ 2𝑛 + 𝑚 = 2 𝑛 + 𝑚 (10) where 0 ≤ 𝑚 < 2𝑛 , 𝑚 ∈ ℕ. Proof. Observe that 𝑆−1 = {0}. The proposition is true when 𝑛 = −1, 𝑛 = 0, and 𝑛 = 1, as evidenced by Table 1. Let 𝑛 ∈ ℕ, 𝑛 ≥ 2, and assume that the proposition is true for all integers 𝑘 with 0 ≤ 𝑘 ≤ 𝑛−1. Let 𝒞 = 2𝑛 ∪𝑚∪(2𝑛 +𝑚), three stacks of size 2𝑛 , 𝑚, and 2𝑛 +𝑚. 𝒜 makes a move, leaving ℬ facing a child configuration of 𝒞. The child set of 𝒞 is the set of all configurations of the following forms: {2𝑛 ∪ (𝑚 − 𝑗) ∪ (2𝑛 + 𝑚),

2𝑛 ∪ 𝑚 ∪ 𝑘} (11) where 0 < 𝑗 ≤ 𝑚 and 0 ≤ 𝑘 < 2𝑛 . Player ℬ can reduce the first two configurations of (11) to 2𝑛 ∪ (𝑚 − 𝑗) ∪ (2𝑛 + (𝑚 − 𝑗)), which has the same form as 𝒞. If player 𝒜 persists in making these kinds of moves, player ℬ can always reciprocate in kind. However, since 𝑚 is finite, player 𝒜 must eventually make a move to one of the latter two configurations of (11). Faced with a configuration ∗ 𝑘 ∪ 𝑚 ∪ (2𝑛 + 𝑚), ℬ realizes by the induction hypothesis that 𝑘 + 𝑚 = 𝑗 for some integer 𝑗 where 0 ≤ 𝑗 < 2𝑛 . So ℬ moves in the subconfiguration of 2𝑛 + 𝑚 to its child with nim value 𝑗, leaving 𝒜 to face a configuration equivalent to the configuration 𝑗∪𝑗, whose nim value is 0. By Proposition 36, 𝒜 loses the game. A similar argument can be made when ℬ faces the configuration 2𝑛 ∪ 𝑚 ∪ 𝑘, an exercise we leave for the reader. Thus, in every possible scenario, 𝒜 must eventually move facing a configuration of value 0, and loses. By Proposition 36, 𝑉(𝒞) = 0. By ∗ Corollary 38, 2𝑛 + 𝑚 = 2𝑛 + 𝑚. Finally, we show that 𝑆𝑛 is closed. Let 𝑥, 𝑦 ∈ 𝑆𝑛 . We consider the case where 𝑥 ≥ 2𝑛 and 𝑦 ≥ 2𝑛 , and leave the other cases to the reader. So 𝑥 = 2𝑛 + 𝑚 and 𝑦 = 2𝑛 + 𝑘 where 𝑚 and 𝑘 are nonnegative integers less than 2𝑛 . Since ∗ 𝑚, 𝑘 ∈ 𝑆𝑛−1 , we have 𝑚 + 𝑘 = 𝑗 for some 𝑗 ∈ 𝑆𝑛−1 . Thus ∗

2𝑛 ∪ 𝑚 ∪ (2𝑛 + (𝑚 − 𝑗)),

∗

∗

∗

𝑥 + 𝑦 = (2𝑛 + 𝑚) + (2𝑛 + 𝑘) = 2𝑛 + 𝑚 + 2𝑛

∗

+

𝑘 ∪ 𝑚 ∪ (2𝑛 + 𝑚),

∗

∗

∗

∗

𝑘 = (2𝑛 + 2𝑛 ) + (𝑚 + 𝑘) = 0 + 𝑗 = 𝑗,

∗

which means that 𝑥 + 𝑦 ∈ 𝑆𝑛 . 15 For readers who have studied abstract algebra, the sets of nim values closed under nim addition are groups.

Nim∗

55

Corollary 43: Almost binary addition. Let 𝑛1 , 𝑛2 , … , 𝑛𝑘 be 𝑘 distinct positive integers. For 𝑛 ∈ ℕ, ∗

(2𝑛1 + 2𝑛2 + ⋯ + 2𝑛𝑘 ) + 𝑛 = 2𝑛1

∗ ∗ + 2 𝑛2 +

⋯

∗ ∗ + 2𝑛𝑘 +

𝑛.

Proof. Without loss of generality, let 𝑛1 > 𝑛2 > ⋯ > 𝑛𝑘 . Successive applications of Proposition 42 give ∗ ∗ + 2 𝑛2 +

2 𝑛1 + 2 𝑛2 + ⋯ + 2 𝑛𝑘 = 2 𝑛1

⋯

∗ + 2 𝑛𝑘 ,

which implies the corollary. Corollary 43 means that we can find nim sums using binary notation, as we make precise in the following algorithm. ∗

Algorithm 44: Using binary to find nim sums.16 To calculate 𝑚 + 𝑛 write each integer in binary, add without carrying to obtain a sum, and convert the sum ∗ to decimal notation. By adding without carrying we mean that (1)2 + (1)2 = (0)2 , ∗ ∗ (1)2 + (0)2 = (1)2 , and (0)2 + (0)2 = (0)2 . Proof. Let 𝑚, 𝑛 ∈ ℕ. Writing both 𝑚 and 𝑛 as sums of distinct powers of two gives 𝑘

𝑘

𝑚 = ∑ 𝑎𝑗 2𝑗

and

𝑛 = ∑ 𝑏𝑗 2𝑗

𝑗=0

𝑗=0

for some nonnegative integers 𝑘 and 𝑎𝑗 , 𝑏𝑗 ∈ {0, 1}. Thus 𝑘 ∗

𝑚 + 𝑛 = ∑ (𝑎𝑗

∗

+ 𝑏𝑗 )2𝑗

= ((𝑎𝑘

∗

+

∗

𝑏𝑘 )(𝑎𝑘−1 + 𝑏𝑘−1 ) ⋯ (𝑎0

∗

+

𝑏0 ))2 ,

𝑗=0 ∗

∗

∗

where the notation ((𝑎𝑘 + 𝑏𝑘 )(𝑎𝑘−1 + 𝑏𝑘−1 ) ⋯ (𝑎0 + 𝑏0 ))2 is to be interpreted as the juxtaposition of binary digits. See Table 2. ∗

To illustrate this algorithm, consider 7 + 2. We know that 7 = (111)2 and ∗ 2 = (10)2 . Adding without carrying gives (111)2 + (10)2 = (101)2 = 5. To see why the algorithm works in this special case, note that the binary representation of 7 follows from writing 7 as a sum of powers of 2: 7 = 4 + 2 + 1. Thus by Proposition 42, ∗

∗

∗

∗

∗

∗

∗

∗

7 + 2 = (4 + 2 + 1) + 2 = 4 + 2 + 1 + 2 = (4 + 1) + (2 + 2) = 5 + 0 = 5. In general, one way to evaluate nim sums is to write each addend in binary, then cancel any pair of the same power of two. The result is the nim sum of different powers of two, whose value is the ordinary sum of the powers of two. Now we can solve Puzzle 27. 16 Code 2 in Appendix III shows how to implement nim addition using a computer algebra system (CAS).

56

Chapter II: Mathematical Induction

Example 45: Puzzle 27 revisited. To see who wins in the nim configuration of Figure 8, the nim sum associated with three stacks of 5, 2, and 4 blocks is ∗

∗

∗

5 + 2 + 4 = (101)2 + (10)2

∗

+

(100)2 = (11)2 = 3.

Since 3 > 0, 𝒜 should win the game by Proposition 36. To find a winning move, 𝒜 must move in one of the three stacks so that the nim value of the resultant configuration has value 0. 𝒜’s only such play is to remove one block from the stack of two blocks so that the new nim sum is 0. By Proposition 36, ℬ will (eventually) lose the game. ∗ ∗ ∗ ∗ Another way to see this result is to observe that 5 + 2 + 4 = 2 + (5 + 4) = ∗ 2 + 1. That is, we can view the original configuration of (5, 2, 4) as equivalent to the configuration (2, 1). A winning move for 𝒜 in the latter configuration is to remove one block from the stack of two blocks, which is also a winning move for 𝒜 in the configuration (5, 2, 4). ♢ ∗

Table 2. The nim values 𝑚 + 𝑛, where 0 ≤ 𝑚 ≤ 7 and 0 ≤ 𝑛 ≤ 7. ∗

+

0 1 2 3 4 5 6 7

0 0 1 2 3 4 5 6 7

1 1 0 3 2 5 4 7 6

2 2 3 0 1 6 7 4 5

3 3 2 1 0 7 6 5 4

4 4 5 6 7 0 1 2 3

5 5 4 7 6 1 0 3 2

∗

6 6 7 4 5 2 3 0 1

∗

7 7 6 5 4 3 2 1 0

Example 46: Finding a good move in 30 + 40 + 50. We write 30 = (11110)2 , 40 = (101000)2 , and 50 = (110010)2 . The nim sum of these three stacks of blocks by the algorithm is 11110 101000 ∗ + 110010 100 Thus the nim sum of 30, 40, and 50 is 4. If 𝒜 removes four blocks from the stack of 30, the resultant configuration has nim value 0, as indicated below: 11010 101000 ∗ + 110010 0

Case Study: Mancala∗

57

Therefore, ℬ loses this game of nim. Another way to find a good first move in nim is to make use of the associative ∗ ∗ and commutative properties of nim addition. Write the game 30 + 40 + 50 as the ∗ ∗ ∗ game 30 + (40 + 50). Since 40 + 50 = 26, we now view the game as a configura∗ tion of two stacks rather than three whose nim value is 30 + 26. At this point, by Corollary 30, we know that to win this game, we must remove four blocks from the stack of 30. Fortunately, we see how to translate this move to the original configuration, namely, remove four blocks from the stack of 30. However, if we ∗ ∗ ∗ rewrite the game as (30 + 50) + 40 = 44 + 40, we know that somehow we must remove four blocks from the configuration (30, 50). But from which stack should we remove four? Thus, this regrouping of the configuration is unhelpful. ♢ Example 47: Another round of nim. Find a good first move in the game with these three stacks: (1011011)2 , (1100100)2 , and (1001010)2 . 1011011 1100100 Observe that the nim sum of these three numbers is ∗ + 1001010 1110101 Since the nim value of the game is (1110101)2 , there is a good move. In fact, there are two good moves, one by moving in the first stack and the other by moving in the third stack. We proceed by moving in the first stack. In order to have the resultant nim sum be 0, we replace (1011011)2 = 91 with (0101100)2 = 44. That is, we remove forty-seven blocks from the first stack. Finding the other good first move is left for the reader. ♢ As these examples show, without knowing Algorithm 44 or an equivalent rule, most players new to nim would seldom stumble upon a good move, let alone recognize it as such. A little mathematics can be powerful even in the humblest of settings.

Case Study: Mancala∗ Mancala is probably the oldest board game in human history. For mancala versions between two players having access to the same pieces, mancala is a nim game. Fortunately, the theory developed for analyzing the nim game 𝒩 of the previous section applies to this new game. Definition 48: Mancala terminology. Mancala’s board is often partitioned into one to four rows with 6 to 12 basins or pits in each row, although the concrete board of Figure 9 has many more. The word mancala means to move or to count. The moveable pieces or stones are beans, pebbles, shells, nuts, or chips, more

58

Chapter II: Mathematical Induction

Figure 9. An unusually large number of pits in a Zambian Mancala board, author sketch. or less indistinguishable, and are distributed according to some rule to start the game. Often a larger basin, a storehouse or roumba, may appear at the end of each board. Mancala is usually a game between two players, 𝒜 and ℬ. Play alternates between the two, with 𝒜 making the first move. A move consists in selecting a pit, removing all of its stones, and then seeding or depositing them a stone at a time into successive adjacent pits, moving around the board in a circular fashion until those stones are gone. In mancala versions where each player has their own stones, the usual object of the game is to clear one’s stones into the roumba before the opponent does. Multiple mancala variations exist, some of which involve elaborate moves.17 We focus on a version that incorporates much of the spirit and strategy inherent within this family of games, namely Tchoukaillon, a solitaire game devised in 1977 by Véronique Gautheron [35] to model and analyze end-game play in regular two-person mancala. Definition 49: Tchoukaillon terminology. The board consists of a single row of 𝑛 + 1 pits, denoted by 𝑝0 to 𝑝𝑛 where 𝑛 ∈ ℤ+ . The roumba is 𝑝0 . A finite number of stones are initially scattered among 𝑝1 through 𝑝𝑛 . In contrast to the nim game 𝒩 of the previous section, where a configuration is a collection of stacks of 17 Backgammon is a version of mancala, although the player’s moves are restricted by rolling dice. The children’s game Sorry! is also a version of mancala, where the player’s moves are restricted by drawing cards.

Case Study: Mancala∗

59

blocks, for mancala a configuration is an arrangement of stones in the pits on the board. A legal move consists in selecting a pit 𝑝𝑗 , for 𝑗 from 1 to 𝑛, removing its stones, and then seeding them to the left (depositing one stone in each pit 𝑖 with 1 ≤ 𝑖 < 𝑗), with the last stone deposited in the roumba. A player wins once all stones on the board are placed there. Whereas many initial stone arrangements are unwinnable, the configuration shown in Figure 10 is winnable.

roumba pits: 0

1

2

3

4

5

6

Figure 10. A Tchoukaillon board with six regular pits and a roumba.

Figure 11. A poor move in Tchoukaillon from the configuration given in Figure 10. To illustrate game dynamics: With respect to the board of Figure 10, we remove 𝑝6 ’s stones and seed them, yielding the configuration shown in Figure 11. This new configuration results in a loss, since 𝑝1 now contains two stones forever trapped there. That is, with 𝑖 being a positive integer, choosing to move by selecting 𝑝𝑖 when it contains more than 𝑖 stones is an invalid move, because after seeding the stones as far as is possible, more stones yet remain in the hand. Definition 50: More Tchoukaillon terminology. We say that 𝑝𝑖 is hot if it contains exactly 𝑖 stones, and that 𝑝𝑖 is cold if it contains more than 𝑖 stones. If 𝑖 < 𝑗 and both 𝑝𝑖 and 𝑝𝑗 are hot, 𝑝𝑗 is said to be hotter than 𝑝𝑖 and 𝑝𝑖 is said to be cooler than 𝑝𝑗 , where 𝑖, 𝑗 ∈ ℤ+ . Note that only hot pits correspond to legal moves. To represent stone configurations, we adopt the notation 1-0-0-2-4-6 and 2-11-3-5-0, respectively, to codify the configurations of Figures 10 and 11. To streamline this notation, sometimes we drop a configuration’s trailing right-hand zeroes. For example, the configuration 2-1-1-3-5-0 is the same as the configuration 2-11-3-5.

60

Chapter II: Mathematical Induction

Rather than moving haphazardly in a Tchoukaillon configuration, a better algorithm is to move in the coolest hot pit (any other move will transform some hot pit into a cold pit, whereupon its stones are forever trapped therein). Adopting this strategy means that the first move from Figure 10 should be 0-0-0-2-4-6, followed by 1-1-1-3-5, followed by 0-1-1-3-5, followed by 1-2-2-4, and so on.

Mancala nim∗ When played between two people, Tchoukaillon is a nim game which we call mancala nim. For this game, a move consists in selecting pit 𝑝𝑖 containing 𝑖 stones, for some 𝑖, and seeding them to the left, the last of which is placed in the roumba. Play then passes to the next person. The first person unable to move loses. Example 51: Playing a game of mancala nim. Who wins with the initial configuration of Figure 10—the first player 𝒜 or the second player ℬ? To see that 𝒜 should win the game, consider the following progression of moves. As convenient notation, any pit no longer corresponding to a valid move is marked with an X. • 𝒜 moves from 1-0-0-2-4-6 to 0-0-0-2-4-6. • ℬ must move to 1-1-1-3-5, because it is the only available move. • 𝒜 moves to X-2-2-4. • At this point, ℬ has two possible moves, one to X-X-3 and the other to X-0-2-4. • If ℬ moves to the former option, X-X-3, then 𝒜 counters with the winning move to X-X, and ℬ loses the game. On the other hand, what happens when ℬ moves to the latter option, X-0-2-4? • Then 𝒜 must move to X-1-3. • ℬ must move to X-2, because that is the only available move. • 𝒜 counters with the winning move to X. Thus, in every scenario of moves, ℬ loses the game. Therefore, 𝒜 wins the game. ♢ Puzzle 52: A mancala challenge. It is your turn to make a move in the configuration of Figure 12, consisting of three board configurations 𝒳, 𝒴, and 𝒵. At each turn, a player selects exactly one of the three configurations and makes a move therein. Which initial move should lead to a win? (The resolution to this puzzle may be found in Example 55.) ♢

Mancala nim∗

61

X

Y

Z

Figure 12. What is the best first move? To begin our search for a good first move for 𝒜 in this puzzle, we compute the nim value of each configuration 𝒳, 𝒴, and 𝒵. We note that if there is no move from a configuration 𝒞, then the nim value of 𝒞 is 0. However, calculating the nim value of any configuration means considering all of the configuration’s children, and grandchildren, and great grandchildren, and so on. A game tree will help us keep track of this information. Definition 53: A game tree. Briefly, a game tree is a map showing the relationship between all configurations that could possibly occur from the initial configuration 𝒞 of the game. We visualize each configuration as a point or a node of the tree. The initial configuration is called the root of the game tree. To indicate that the configuration ℰ is a child of 𝒟, the map includes an arrow from 𝒟 to ℰ. Any node that has no children is called a leaf node of the tree. Example 54: Finding the nim value 𝑉(𝒵).18 Consider the configuration 𝒵 =10-3-0-5-0 of Figure 12, the third configuration in Puzzle 52. The game tree for 𝒵 is shown in Figure 13. The nodes are displayed as ellipses with the configuration written within. For brevity, since the number of stones in each of the pits 𝑝1 through 𝑝6 can be represented by a single digit, we drop the hyphen notation. Thus the configuration 0-2-1-1 can also be written as 0211. The node labeled 𝐴 is the root node of the tree, namely configuration 𝒵. Since 𝒵 has three children, three arrows point from node 𝐴 to the nodes labeled 𝐵, 𝐶, and 𝐷. The leaves of this tree are labeled 𝐷, 𝐽, 𝐾, 𝑂, 𝑄, and 𝑇. At each node, within the ellipse we include a small circle in which we write the nim value of the configuration corresponding to that node. 18 Code

3 in Appendix III shows how to use a CAS to find the nim value of a mancala nim board.

62

Chapter II: Mathematical Induction

a node label A configuration B 00305 0 E H

01005 0 N P

1211 1

1011

X1005 0

D

X1X1 0 G X211

11X1 1 J

X211 1

K 01X1 0

X011 0

Q

1 X011

0

O

X011 0

0211 0 S

C

F

11005 2 I

root node nim value of configuration

10305 1

a leaf node 1

T 0011 0

Figure 13. A complete game tree for 𝒵 =1-0-3-0-5-0. To find the nim values for the configurations in this game tree we work from the leaves to the root, as outlined below. • By Definition 32, the nim value for each leaf configuration is 0 because a leaf has no children. Thus, 𝑉(𝐷) = 𝑉(𝐽) = 𝑉(𝐾) = 𝑉(𝑂) = 𝑉(𝑄) = 𝑉(𝑇) = 0. • Since 𝑆 has only one child, 𝑇, it follows that 𝑉(𝑆) = mex{𝑉(𝑇)} = mex{0} = 1. Similarly, 𝑉(𝑆) = 𝑉(𝐼) = 𝑉(𝐹) = 𝑉(𝐺) = 1. • Similarly, 𝑉(𝑃) = mex{𝑉(𝑆)} = mex{1} = 0 and 𝑉(𝐶) = 0. • Since 𝑁 has two children, 𝑉(𝑁) = mex{𝑉(𝑃), 𝑉(𝑄)} = mex{0, 0} = 1. • 𝑉(𝐻) = mex{𝑉(𝑁)} = mex{1} = 0. • 𝑉(𝐸) = mex{𝑉(𝐻), 𝑉(𝐼)} = mex{0, 1} = 2. • 𝑉(𝐵) = mex{𝑉(𝐸), 𝑉(𝐹)} = {2, 1} = 0. • Lastly, 𝑉(𝐴) = mex{𝑉(𝐵), 𝑉(𝐶), 𝑉(𝐷)} = {0, 0, 0} = 1. Therefore 𝑉(𝒵) = 𝑉(1-0-3-0-5) = 1. The game tree for configuration 𝒴 of Puzzle 52 is shown in Figure 15. The reader should use this game tree to verify

Exercises

63

that 𝑉(𝒴) = 2. Exercise 10b asks the reader to generate the game tree for 𝒳 and verify that 𝑉(𝒳) = 0. ♢ With this ability to find the nim values of mancala nim configurations, we can now solve Puzzle 52. Example 55: Revisiting Puzzle 52. Let 𝒞 be the configuration 𝒞 = 𝒳 ∪ 𝒴 ∪ 𝒵, ∗ ∗ (Figure 12). From Example 54 we know that 𝑉(𝒞) = 0 + 2 + 1 = 3. By Proposition 36, 𝒜 should win the game. 𝒜’s first move should be to a configuration 𝒟 that has nim value 0. But which move should we advise player 𝒜 to make? We may ignore configuration 𝒳 because its nim value is 0. We focus on the configuration 𝒴 ∪ 𝒵 where 𝑉(𝒴) = 2 and 𝑉(𝒵) = 1. Choosing a child of 𝒴 with nim ∗ value 1 means that the nim value of the resultant configuration is 1 + 1 = 0. 𝒴’s two children are 0-0-0-2-4-6

and

2-1-1-3-5-0,

the nim values for which are 0 and 1, respectively, as the reader may verify using 𝒴’s game tree shown in Figure 15. Thus, 𝒜 chooses the child configuration 21-1-3-5. Now ℬ faces a mancala nim configuration of nim value 0 and loses. Therefore, 𝒜 wins. ♢ Exercises 1. Prove the division algorithm for ℤ+ by supplying a reason for each Why? . The division algorithm: Let 𝑎, 𝑏 ∈ ℤ+ . There exist unique integers 𝑞 and 𝑟 where 𝑏 = 𝑎𝑞 + 𝑟, 0 ≤ 𝑟 < 𝑎. • Let 𝑆 = {𝑏 − 𝑎𝑛| 𝑏 − 𝑎𝑛 ≥ 0, 𝑛 ∈ ℕ}. 𝑆 ≠ ∅. Why? • Let 𝑟 = min(𝑆), where min(𝑆) is the minimum integer belonging to 𝑆. Observe that 𝑟 exists and is a nonnegative integer. Why? • Thus, 𝑟 = 𝑏 − 𝑎𝑛0 for some 𝑛0 ∈ ℕ. If 𝑟 ≥ 𝑎 then 𝑏 − 𝑎𝑛0 ≥ 𝑎. So 𝑏 − 𝑎(𝑛0 + 1) ≥ 0, a contradiction. Why? • Therefore 0 ≤ 𝑟 < 𝑎. Now we must show uniqueness of 𝑞 and 𝑟. To do this we let 𝑏 = 𝑎𝑞1 + 𝑟1 and 𝑏 = 𝑎𝑞2 + 𝑟2 with 0 ≤ 𝑟1 < 𝑎 and 0 ≤ 𝑟2 < 𝑎. We assume that 𝑞1 > 𝑞2 and reach a contradiction. Observe that 𝑞1 ≥ 𝑞2 + 1. Why? • Thus 𝑎𝑞2 + 𝑟2 = 𝑏 ≥ 𝑎(𝑞2 + 1) + 𝑟1 ⇒ 𝑟2 ≥ 𝑎 + 𝑟1 , a contradiction. Why? • Therefore, 𝑞1 = 𝑞2 . This result in turn means that 𝑟1 = 𝑟2 . Why? 2. (a) Prove that the 𝑛th triangular number 𝑇𝑛 is 𝑛(𝑛 + 1)/2, where 𝑇𝑛 is the sum of the first 𝑛 positive integers.

64

Chapter II: Mathematical Induction (b) Prove that 𝑛 < 2𝑛 for all 𝑛 ∈ ℕ. (c) Prove that 1(1! ) + 2(2! ) + ⋯ + 𝑛(𝑛! ) = (𝑛 + 1)! −1 for all 𝑛 ∈ ℕ. (d) Prove that Proposition 11 and Proposition 12 are equivalent. 𝑛

𝑛(𝑛 + 1)(2𝑛 + 1) . One way to 6 𝑘=1 prove this formula is to derive it from Exercise 2a. To do so, observe that 𝑘3 − (𝑘 − 1)3 = 3𝑘2 − 3𝑘 + 1. Summing these expressions as 𝑘 goes from 1 to 𝑛 gives

3. (a) The sum of the first 𝑛 squares is ∑ 𝑘2 =

𝑛

𝑛

𝑛

𝑛

𝑛3 = 3 ∑ 𝑘2 − 3 ∑ 𝑘 + ∑ 1 = 3 ∑ 𝑘2 − 𝑘=1

𝑘=1

𝑘=1

𝑘=1

3𝑛(𝑛 + 1) + 𝑛. 2

2

Solve this equation for ∑ 𝑘 to obtain the desired result. 𝑛

(b) Now replicate the idea used in part (a) to find a formula for ∑ 𝑘3 . Start 𝑘=1

with 𝑘4 − (𝑘 − 1)4 .

2 3

L

1

4

7

5 6

Figure 14. Revisiting Puzzle 𝑍1 of Chapter I. 4. (a) With respect to Puzzle 𝑍1 of Chapter I, let 𝑆(𝑛) be the number of regions into which a circle has been partitioned by line segments between all pairs of 𝑛 nodes (points along the circumference), where no three line

Exercises

65

segments contain the same point. Let 𝑛 + 1 be a new node on this circle containing 𝑛 nodes. Give an argument showing that if the line segment between node 𝑛 + 1 and node 𝑗 crosses 𝑘 line segments (where node 𝑗 is one of the given 𝑛 nodes), then the number of regions into which the circle has been partitioned is 𝑘 + 1 more than 𝑆(𝑛). (b) Consider Figure 14 showing that 𝑆(6) = 31, where 𝑆(𝑛) is defined in part (c). Note that 𝑆(0) = 1. For integer 𝑛 ≥ 1, let 𝑈(𝑛) = 𝑆(𝑛) − 𝑆(𝑛 − 1). To find, say, 𝑈(7), observe that six more line segments must be inserted into Figure 14 to complete the partition. In particular, consider the dashed line segment ℒ between node 7 and node 3. ℒ crosses six solid line segments; alternatively, this number is also obtained by multiplying the number of nodes on one side of ℒ by the number of nodes on the other side of ℒ, namely, 2 ⋅ 3. By part (c), inserting ℒ into the figure contributes 2 ⋅ 3 + 1 = 7 towards the value of 𝑈(7). Thus insertion of six new line segments between node 7 and the other six nodes gives 𝑈(7) = (0 ⋅ 5 + 1) + (1 ⋅ 4 + 1) + (2 ⋅ 3 + 1) + (3 ⋅ 2 + 1) + (4 ⋅ 1 + 1) + (5 ⋅ 0 + 1) 5

= ∑ (𝑘(5 − 𝑘) + 1) = 26. 𝑘=0

Show that 𝑈(𝑛) = (𝑛 − 1)(𝑛2 − 5𝑛 + 12)/6 for all 𝑛 ≥ 1. 𝑛

(c) Show that 𝑆(𝑛) = 𝑆(1) + ∑𝑘=1 𝑈(𝑘) for all 𝑛 ≥ 1. (d) Use Exercises 2a and 3 and the preceding parts of this exercise to show that 𝑛4 − 6𝑛3 + 23𝑛2 − 18𝑛 + 24 𝑆(𝑛) = (12) 24 for all 𝑛 ∈ ℤ+ . 5. (a) Let 𝑎 be a real number, 𝑎 ≠ 1, and 𝑆(𝑛) = 1 + 𝑎 + 𝑎2 + ⋯ + 𝑎𝑛 . Use mathematical induction to show that 𝒮𝑛 ∶ 𝑆(𝑛) =

1 − 𝑎𝑛+1 1−𝑎

is valid for all 𝑛 ∈ ℤ+ . (b) From part (a), obtain the geometric series formula ∞

1 = ∑ 𝑎𝑛 , 1 − 𝑎 𝑛=0

(13)

where |𝑎| < 1. ∞

1 , 𝑛! 𝑛=0 recreate Joseph Fourier’s 1815 proof that 𝑒 is irrational by following the outline below.

(c) Using Definition 9 and Equation (13), and knowing that 𝑒 = ∑

66

Chapter II: Mathematical Induction ∞

∞

1 1 < ∑ 𝑛 = 1.) (𝑛 + 1)! 𝑛=1 2 𝑛=1

• Show that 2 < 𝑒 < 3. (Use the relation ∑ • Assume that 𝑒 = 𝑛 ≥ 2. • Multiply 𝑒 =

𝑚 𝑛

𝑚 𝑛

where 𝑚 and 𝑛 have no factors in common and 𝑛

∞

𝑛! 𝑛! +∑ . 𝑘! (𝑛 + 𝑘)! 𝑘=0 𝑘=1

by 𝑛! and obtain 𝑚(𝑛 − 1)! = ∑

1 1 1 + + ⋯ must be • So + 𝑛+1 (𝑛 + 1)(𝑛 + 2) (𝑛 + 1)(𝑛 + 2)(𝑛 + 3) an integer, a contradiction, because this infinite sum is positive but 1 less than 1 by Equation (13) when 𝑎 = . 3

6. Find the nim values for the following configurations in the nim game 𝒩, and determine nim moves, if they exist, for which the resultant configurations have value 0. (a) 35 ∪ 45 ∪ 55. (b) 17 ∪ 21 ∪ 27 ∪ 12. (c) 100 ∪ 200 ∪ 300. 7. (a) Describe a winning strategy for this variation of the game of nim: A player is only allowed to move either one or two blocks at a time from a stack. (b) Describe a winning strategy for this variation of the game of nim: A player is only allowed to move either two or three blocks at a time from a stack. (c) Define a relation ∼ on the set of all nim configurations by 𝒞 ∼ 𝒟 if and only if 𝑉(𝒞 ∪ 𝒟) = 0. Show that ∼ is an equivalence relation. 8. (a) The first ten winning Tchoukaillon positions are the following configurations. What are the eleventh and twelfth configurations? 1∶ 2∶ 3∶ 4∶ 5∶

1-0-0-0-0-0 0-2-0-0-0-0 1-2-0-0-0-0 0-1-3-0-0-0 1-1-3-0-0-0

6∶ 7∶ 8∶ 9∶ 10 ∶

0-0-2-4-0-0 1-0-2-4-0-0 0-2-2-4-0-0 1-2-2-4-0-0 0-1-1-3-5-0

(b) Finish the sequence of moves begun on p. 60 to solve the Tchoukaillon configuration of Figure 10. 9. The value of a mancala nim position 𝒫 is the minimal excluded value of the nim values of 𝒫’s children. Recall that if 𝒫 has no children, its nim value, denoted by 𝑉(𝒫), is 0, and observe that 𝑉(1-0-0-0-0-0) = 1. (a) Show 𝑉(0-2-0-0-0-0) = 0, 𝑉(1-2-0-0-0-0) = 1, and 𝑉(1-1-1-4-0-0) = 2. (b) Find the nim values 𝑉(1-2-3-0-0-0), 𝑉(1-1-0-4-0-0), and 𝑉(0-0-0-0-0-6).

Exercises

67

Figure 15. A complete game tree for 𝒴 =1-0-0-2-4-6 of Puzzle 52. (c) Find a configuration whose nim value is 3. (d) Explain why, in generating the game tree for any single board of a mancala nim game, no configuration will appear as two distinct nodes. 10. (a) Consider the mancala nim configuration in Figure 10. Show that if 𝒜’s first move is to X-1-1-3-5-0, then ℬ should win the game. (b) The game tree for configuration 𝒴 of Puzzle 52 is given in Figure 15. Using the same process as outlined in Example 54, verify the nim value 𝑉(𝒴) = 2. (c) Generate a game tree for configuration 𝒳 of Puzzle 52. Conclude that 𝑉(𝒳) = 0.

Strand III: Al-Maghribî meets Sudoku This strand focuses on an old brainteaser about partitioning an inheritance into ninths. It was posed by Molla Mohammed on pilgrimage from India to Mecca around 1590. A landowner has 81 trees. Each year, the first tree produces one basket of fruit, the second tree produces two baskets of fruit, and so on, so that the eighty-first tree produces 81 baskets of fruit. How may he partition these trees among his nine sons so that each one receives nine trees and an equal number of baskets of fruit each year? [76, 77] This riddle is sometimes referred to as the Mecca problem. It first appeared in the appendix of a book on algorithms and Diophantine equations, Tuhfetu’lÂdâd lizevil Rüşd ve’s—Sedad written by the Algerian-born mathematician Ali bin Veli Ibn Hamza al-Cezâirî, also known as Al-Maghribî. In this strand we analyze the structure of this riddle and show that it is a slight variation of the popular Sudoku puzzles of today.

Figure 1. An orchard of 81 trees? An engraving circa 1820 [33, E. M., Vol. I, Plate 83]. 69

70

Strand III: Al-Maghribî meets Sudoku

Since Table 1 below contains the answer to the riddle, the reader may wish to attempt a solution before reading further. To do a warm-up exercise first, try a simpler version of the puzzle: A landowner has twenty-five trees and five daughters. Tree 𝑖 produces 𝑖 baskets of fruit each season, 1 ≤ 𝑖 ≤ 25. How may the trees be partitioned so that each daughter receives five trees and an equal number of baskets of fruit each season? To represent his solution, Al-Maghribî formed a 9 × 9 grid into which he placed the trees as represented by the integers 1 through 81, one tree per cell of the grid. Then each son’s tree allotment is the set of trees whose numbers appear in his corresponding column; that is, the first son’s trees are in column one, the second in column two, and so on, so that the ninth son’s trees are in column 9. Table 1. Al-Maghribî’s solution. 1 18 26 34 42 50 58 66 74

2 10 27 35 43 51 59 67 75

3 11 19 36 44 52 60 68 76

4 12 20 28 45 53 61 69 77

5 13 21 29 37 54 62 70 78

6 14 22 30 38 46 63 71 79

7 15 23 31 39 47 55 72 80

8 16 24 32 40 48 56 64 81

9 17 25 33 41 49 57 65 73

As can be seen, the first son receives trees {1, 18, 26, 34, 42, 50, 58, 66, 74}, the second son receives {2, 10, 27, 35, 43, 51, 59, 67, 75}, and so on. In total, the annual production of each son’s trees tallies to 369 baskets. Table 2. Subtracting an arithmetic sequence from Table 1. 1 9 8 7 6 5 4 3 2

2 1 9 8 7 6 5 4 3

3 2 1 9 8 7 6 5 4

4 3 2 1 9 8 7 6 5

5 4 3 2 1 9 8 7 6

6 5 4 3 2 1 9 8 7

7 6 5 4 3 2 1 9 8

8 7 6 5 4 3 2 1 9

9 8 7 6 5 4 3 2 1

Strand III: Al-Maghribî meets Sudoku

71

One way to see the logical structure in Al-Maghribî’s solution is to subtract successive multiples of 9 from each row, beginning with zero. That is, subtract 0 from each cell of the first row, subtract 9 from each cell of the second row, subtract 18 from the third row, and so on, and finish by subtracting 72 from the last row to obtain Table 2. Each row of Table 2 is a right circular translation of the preceding row. This pattern is one choice from a complete set of mutual derangements.1 Thus the columns have a common sum, which also means—without actually summing them—that the columns of Table 1 have a common sum. The reader may observe that Table 2 looks much like a completed Sudoku square.2 In fact, if we rearrange its rows, it is one. We rearrange the rows 𝑅𝑖 of Table 2, 1 ≤ 𝑖 ≤ 9, in the order 𝑅1 , 𝑅4 , 𝑅7 , 𝑅2 , 𝑅5 , 𝑅8 , 𝑅3 , 𝑅6 , 𝑅9 to obtain Table 3, to which we have added four extra internal lines so as to parTable 3. Rearranging the rows of Table 1. 1 7 4 9 6 3 8 5 2

2 8 5 1 7 4 9 6 3

3 9 6 2 8 5 1 7 4

4 1 7 3 9 6 2 8 5

5 2 8 4 1 7 3 9 6

6 3 9 5 2 8 4 1 7

7 4 1 6 3 9 5 2 8

8 5 2 7 4 1 6 3 9

9 6 3 8 5 2 7 4 1

tition the 9 × 9 grid into nine 3 × 3 grids. Each of these sub-grids contains all the integers from 1 through 9, and each row and column of the 9 × 9 grid contains all the integers from 1 through 9. Table 3 is a completed Sudoku puzzle. From this new array, we generate an alternate solution to the Mecca problem by adding the arithmetic sequence of successive multiples of nine to its rows, so obtaining Table 4. In fact, any completed Sudoku puzzle gives rise to a Mecca problem solution. Just as we did in going from Table 3 to Table 4, all that must be done is to add the successive multiples of nine to successive rows of the puzzle. 1 Two ordered lists of length 𝑛 containing all of the integers 1 through 𝑛 are derangements of each other if their 𝑖th members are never the same for all 𝑖 with 1 ≤ 𝑖 ≤ 𝑛. 2 A Sudoku square is a 9 × 9 grid in which every row, column, and 3 × 3 block contains the digits 1–9 exactly once [126]. Sudoku was originally called Number Place, appearing for the first time in 1979 in Dell Puzzle Magazine. It gained great popularity in Japan and much of Asia before earning its place on the puzzle pages of American newspapers and a variety of popular magazines.

72

Strand III: Al-Maghribî meets Sudoku Table 4. A different Mecca problem solution than Table 1. 1 16 22 36 42 48 62 68 74

2 17 23 28 43 49 63 69 75

3 18 24 29 44 50 55 70 76

4 10 25 30 45 51 56 71 77

5 11 26 31 37 52 57 72 78

6 12 27 32 38 53 58 64 79

7 13 19 33 39 54 59 65 80

8 14 20 34 40 46 60 66 81

9 15 21 35 41 47 61 67 73

The Mecca problem generalizes: A parent wishes to distribute 𝑛2 trees to 𝑛 children equitably. An equitable distribution is one in which each child has the same number of trees and each child’s total tree production is the same. As before, for 1 ≤ 𝑖 ≤ 𝑛, tree-𝑖 produces 𝑖 baskets of fruit annually. How may this partition be done? To solve this general problem, simply find 𝑛 derangements of distinct integers from the first 𝑛 integers. A very simple pattern that does so is when the first row is {1, 2, 3, 4, … , 𝑛}, the second row is {2, 3, 4, 5, … , 𝑛, 1}, the third row is {3, 4, 5, 6, … , 𝑛, 1, 2}, and so on, so that the last row is {𝑛, 1, 2, 3, … , (𝑛 − 1)}. Now add the arithmetic sequence 0, 𝑛, 2𝑛, 3𝑛, … , (𝑛 − 1)𝑛 to these rows in the same manner as is done in going from Table 3 to Table 4, and we have a solution. The Mecca problem asks a landowner to partition 81 trees of varying productivity into nine disjoint sets so that each set is equally productive. In this next chapter we present another partitioning problem: A landowner has two gardens of 𝑎 trees and 𝑏 trees; find the maximal number 𝑑 of children the landowner can have so that each child can inherit an equal part from each garden. For example, if 𝑑 = 5, then each child inherits 𝑎/5 trees in the first garden and 𝑏/5 trees in the second garden where 𝑎/5 and 𝑏/5 are integers.

Chapter III: GCDs and Diophantine Equations Diophantus of Alexandria (third century) gathered together problems of an algebraic nature into a work called Arithmetica. Of these problems about two hundred have survived to our day. Some involving higher-order equations are surprisingly tricky. As the Byzantine mathematician Maximum Planudes said: Thy soul, Diophantus, be with Satan because of the difficulties of your theorems. Fortunately, Diophantus supplies a strategy for solving each problem along with a solution. In fact, Diophantus addresses his collection to a friend, Dionysius, saying: Knowing that you are anxious to learn how to investigate problems in numbers, I have tried to set forth for you the nature and power subsisting in numbers. Although Diophantus was content to supply just one answer to his various indeterminate problems, many of his problems and methods of solution can be generalized. The simplest of these problems is a two-dimensional arithmetic procedure to find integers 𝑥 and 𝑦 for given integers 𝑎, 𝑏, and 𝑐 satisfying 𝑎𝑥 + 𝑏𝑦 = 𝑐.

(1)

Equations of this type are called linear Diophantine equations in his honor. In this chapter we define the greatest common divisor of two positive integers, show three different ways to calculate it, and show how it is related to Equation (1) and to finite simple continued fractions. Then we solve Equation (1) and apply the idea within the system of modular arithmetic. In Chapter XII, we use Diophantine equations to predict the occurrences of solar eclipses. 73

74

Chapter III: GCDs and Diophantine Equations

The greatest common divisor Definition 1: GCD. Let 𝑎, 𝑏 ∈ ℤ+ . The greatest common divisor (GCD) of 𝑎 and 𝑏, denoted by gcd(𝑎, 𝑏), is the greatest integer 𝑑 that divides both 𝑎 and 𝑏. Because gcd(𝑎, 𝑏) = 1 is an important special case, we revise Definition II.6 on relatively prime integers. Definition 2: Relative primeness. Let 𝑎, 𝑏 ∈ ℤ+ . We say that 𝑎 and 𝑏 are relatively prime if gcd(𝑎, 𝑏) = 1. Proposition II.7 showed that gcd(𝑎, 𝑏) = 1 if and only if there exist integers 𝑥0 and 𝑦0 with 𝑎𝑥0 + 𝑏𝑦0 = 1. But the proof revealed neither how to find integers 𝑥0 and 𝑦0 nor (in the absence of 𝑥0 and 𝑦0 ) how to determine when two positive integers are relatively prime. We remedy that shortcoming in this section. One way to determine the greatest common divisor of two given integers 𝑎 and 𝑏 is to use the fundamental theorem of arithmetic, writing each integer as a product of powers of primes, thereby revealing the greatest common divisor 𝑑. Example 3: The GCD via the fundamental theorem. Let 𝑎 = 1683 and 𝑏 = 1768. These integers factor as 𝑎 = 23 ⋅ 13 ⋅ 17 and 𝑏 = 32 ⋅ 11 ⋅ 17. Thus the greatest common divisor of these integers is 𝑑 = 17. ♢ One difficulty in applying the decomposition approach illustrated in Example 3 is that factoring an arbitrary integer is tedious and is impractical in real time if the integer is very large. However, another approach is always successful. It is sometimes called the chocolate bar algorithm. 6

4 10 cm

5 1

2 3

26 cm

Figure 2. A chocolate bar, 10 cm by 26 cm.

The greatest common divisor

75

Proposition 4: The chocolate bar algorithm. Let 𝑎, 𝑏 ∈ ℤ+ with 𝑎 < 𝑏. Then gcd(𝑎, 𝑏) = gcd(𝑎, 𝑏 − 𝑎).

(2)

To find 𝑑 = gcd(𝑎, 𝑏), apply Equation (2) repeatedly until obtaining 𝑑 = gcd(𝑛, 𝑛) = 𝑛 for some 𝑑 ∈ ℤ+ . Proof. Observe that the integer 𝑐 is a divisor of both 𝑎 and 𝑏 if and only if 𝑐 is a divisor of both 𝑎 and 𝑏 − 𝑎. Hence gcd(𝑎, 𝑏) = gcd(𝑎, 𝑏 − 𝑎). Since repeated use of this idea leads to a pair of positive integers whose sum is always less than the previous pair, the algorithm must always reach an integer pair of the form (𝑛, 𝑛) where 𝑛 ∈ ℤ+ . So gcd(𝑛, 𝑛) = 𝑛. The next example illustrates Proposition 4. Example 5: Eating a chocolate bar. Imagine that Ann loves chocolate. Her favorite chocolates are two-dimensional rectangular bars. Let 𝑎 and 𝑏 be the integer width and length of a chocolate bar, with 𝑎 ≤ 𝑏. By custom, Ann eats the bar by breaking off the largest possible square from one end, and eats that piece. Thus, if 𝑎 = 𝑏, Ann eats the entire bar. Otherwise she eats an 𝑎 × 𝑎 square from the end, leaving a bar with dimensions 𝑎 × (𝑏 − 𝑎). To illustrate these dynamics, let 𝑎 = 10 and 𝑏 = 26. See Figure 2. Ann eats the bar in six steps. At step 1, she breaks off a 10 × 10 square from the left-hand side, leaving a 10 × 16 rectangular bar. At step 2, she breaks off another 10 × 10 square, leaving a 10 × 6 rectangular bar. At step 3, she breaks off a 6 × 6 square, leaving a 4 × 6 bar. At step 4, she breaks off a 4 × 4 square, leaving a 2 × 4 bar. At step 5, she breaks off a 2 × 2 square, leaving a 2 × 2 square. At step 6, she eats the last square. Therefore, by Proposition 4, gcd(10, 26) = 2. ♢ This chocolate bar algorithm is equivalent to what is called Euclid’s greatest common divisor algorithm. Rather than successively breaking off squares from a chocolate bar, his algorithm repeatedly utilizes Proposition II.3, the division algorithm: 𝑏 = 𝑎𝑞 + 𝑟, where 𝑏 and 𝑞 are unique nonnegative integers and 0 ≤ 𝑟 < 𝑎. After using the division algorithm to decompose an idealized chocolate bar into ever smaller rectangular pieces, Euclid’s algorithm gives instructions for reassembling the pieces so that the greatest common divisor 𝑑 of 𝑎 and 𝑏 is written as a linear combination of 𝑎 and 𝑏, namely, 𝑎𝑥 + 𝑏𝑦 = 𝑑 (3) for some integers 𝑥 and 𝑦. Observe that Equation (3) is equivalent to Equation (1) when 𝑐 = 𝑑. Proposition 6: Euclid’s algorithm. Let 𝑎 and 𝑏 be positive integers such that gcd(𝑎, 𝑏) = 𝑑. To find a solution 𝑥0 and 𝑦0 of integers for Equation (3), use the following procedure.

76

Chapter III: GCDs and Diophantine Equations

Step i: If 0 < 𝑎 ≤ 𝑏, write 𝑏 = 𝑎𝑞 + 𝑟, where 𝑞 ≥ 1 and 0 ≤ 𝑟 < 𝑎. If 𝑟 = 0, jump to step iii. Otherwise proceed to step ii. Step ii: Repeat step i by redefining 𝑏 as 𝑎 and 𝑎 as 𝑟. Step iii: Let 𝑑 be the last 𝑎 that divides the last 𝑏. Step iv: For each equation 𝑏 = 𝑞𝑎 + 𝑟 with 𝑟 ≠ 0 in the cascade of equations generated in steps i and ii, solve for 𝑟 and back-substitute until producing Equation (3). Proof. In step i, we repeatedly use the division algorithm from Chapter II. Step iii is valid by Proposition 4. Observe that the initial remainder was a specific nonnegative integer and the successive remainders form a strictly decreasing sequence of integers. Thus we eventually reach step iii. By the same argument used in Proposition 4, at each step the gcd(𝑎, 𝑏) remains invariant for each new pair of integers (𝑎, 𝑏). The last step involves straightforward but tedious arithmetic. The next two examples illustrate Euclid’s algorithm.3 Example 7: Revisiting Example 5. We apply Euclid’s algorithm to find the greatest common divisor of 𝑏 = 26 and 𝑎 = 10. • By step i, 26 = 2 ⋅ 10 + 6. By step ii, let 𝑎 = 6 and 𝑏 = 10. • By step i, 10 = 1 ⋅ 6 + 4. By step ii, let 𝑎 = 4 and 𝑏 = 6. • By step i, 6 = 1 ⋅ 4 + 2. By step ii, let 𝑎 = 2 and 𝑏 = 4. • By step i, 4 = 2 ⋅ 2 + 0. By step iii, 𝑑 = 2. • By step iv, solve each of the above equations (except for the last equation) for its remainder 𝑟: 2 = 6 − 4,

4 = 10 − 6 ,

and

6 = 26 − 2 ⋅ 10.

Back-substituting and simplifying repeatedly gives 2 = 6 − 4 = 6 − (10 − 6) = 2 ⋅ 6 − 10 = 2 ⋅ (26 − 2 ⋅ 10) − 10 = 2 ⋅ 26 − 5 ⋅ 10. Therefore gcd(10, 26) = 2 and 2 = 2 ⋅ 26 − 5 ⋅ 10, so that 𝑥0 = 2 and 𝑦0 = −5 with respect to Proposition 6. ♢ Example 8: Revisiting Example 3. We apply Euclid’s algorithm to find the greatest common divisor of 𝑏 = 1768 and 𝑎 = 1683. This time we assemble the successive equations 𝑏 = 𝑞𝑎+𝑟 as generated by the algorithm in Table 5, in which each quotient 𝑞 is boxed. Back-substituting and simplifying the equations from 3 Code 4 in Appendix III illustrates the Mathematica command to obtain 𝑎𝑥 + 𝑏𝑦 = 𝑑 for any given positive integers 𝑎 and 𝑏, with 𝑑 being the greatest common divisor of the two integers.

The greatest common divisor

77

column three of the table give 17 = 85 − 68 = 85 − (1683 − 19 ⋅ 85) = 20 ⋅ 85 − 1683 = 20 ⋅ (1768 − 1683) − 1683 = 20 ⋅ 1768 − 21 ⋅ 1683. Therefore gcd(1683, 1768) = 17 and −21 ⋅ 1683 + 20 ⋅ 1768 = 17, so that 𝑥0 = −21 and 𝑦0 = 20 with respect to Proposition 6. Table 5. Bookkeeping for the remainders in Example 8. iteration 𝑏 = 𝑞𝑎 + 𝑟

solving for 𝑟

1

1768 = 1 ⋅ 1683 + 85

85 = 1768 − 1683

2

1683 = 19 ⋅ 85 + 68

68 = 1683 − 19 ⋅ 85

3

85 = 1 ⋅ 68 + 17

17 = 85 − 68

4

68 = 4 ⋅ 17 + 0

To find the finite simple continued fraction representation for the fraction we use the ideas of Example II.4 and assemble the successive quotients— 1683 which have been boxed in Table 5—as a list of partial denominators: the finite simple continued fraction [1; 19, 1, 4]. Observe that 104 1 , = (4) [1; 19, 1, 4] = 1 + 1 99 19 + 1 1+ 4 1768

which is

1768 1683

♢

in reduced form.

In general, every time we use Euclid’s algorithm to find the greatest common divisor of two positive integers 𝑎 and 𝑏, we are generating a finite simple 𝑎 continued fraction for . For this reason, these simple continued fractions, as 𝑏 in Equation (4), are referred to as Euclidean continued fractions. Observe that although the simple continued fraction [1; 10, 1, 3, 1] is an alternate expression for the continued fraction in Equation (4), it is not a Euclidean continued fraction. In general, whenever we use Euclid’s algorithm to generate a simple contin𝑝 ued fraction for a fraction , the last partial denominator will never be 1 (unless 𝑝 𝑞

𝑞

= 1), because otherwise the last remainder would be equal to the last divisor

in the Euclidean algorithm, a contradiction. Thus, unless we specify otherwise, 𝑝 when we refer to the simple continued fraction for the fraction we mean the 𝑞

Euclidean continued fraction.

78

Chapter III: GCDs and Diophantine Equations

An ancient algorithm for the greatest common divisor Yet a third way to calculate the greatest common divisor of two positive integers 𝑚 and 𝑛 is to start with an array of 𝑚 × 𝑛 dots arranged as 𝑛 rows and 𝑚 columns. Imagine this array is a series of finger pokes upon a smooth sandy region near a cooking fire, possibly after dark.4 Figure 3a gives one such array.

a.

b.

Figure 3. A 5 × 3 grid and its associated sona drawing. Around this fire, children gather, and a master story-teller narrates an Aesop’s fable-type legend while tracing a curve through the sand, usually using the right index finger. The curve starts somewhere on the grid (not at a dot) of the array, and always proceeds along the avenues between the dots, where the avenues are inclined at 45∘ to the rows and columns. No avenue is ever retraced. The only time the curve changes direction is outside the array of dots. If the storyteller pauses in indecision or if the story-teller’s finger returns in futility to the start point before each dot is sequestered by the curve from the other dots, the children laugh, knowing the master erred. Figure 3b shows the completed curve through an array of 5 × 3 dots. The master must know beforehand which arrays of dots yield single-loop solutions through the maze of avenues, a situation that occurs only when 𝑚 and 𝑛 are relatively prime. However, if 𝑚 and 𝑛 have greatest common divisor 𝑑 > 1, the completed curve through all the avenues will consist of 𝑑 loops, and the master’s index finger will be raised 𝑑 times to trace the curve. These dots and curves in the sand are called sona drawings. More formally, we make the following definition. Definition 9: Sona drawings. Let 𝒮(𝑚, 𝑛) be a rectangular array of 𝑚 × 𝑛 dots as 𝑛 rows of 𝑚 dots. Following Schlatter [129], we impose a coordinate scheme upon this array and encompass the array with a rectangle whose corners are at coordinates (0, 0), (2𝑚, 0), (2𝑚, 2𝑛), and (0, 2𝑛), so that each of the coordinates of the 𝑚𝑛 dots are odd as shown in Figure 4. Thus, for example, there is a dot 4 This algorithm has a long tradition among the Cockwe people of greater Angola [55], and may date far into the past.

An ancient algorithm for the greatest common divisor

M

L

K

J

79

I

N

H

O

G

P

F A

B

C

D

E

Figure 4. The sona drawing of Figure 3b as a beam of reflected light, 𝑚 = 5, 𝑛 = 3. at (1, 1) and at (2𝑚 − 1, 2𝑛 − 1). At (1, 0), position a laser angled at 45∘ to the horizontal axis so that the laser beam is directed up and to the right. Imagine that the bounding sides of the rectangle are mirrors, and the path taken by the master’s tracing finger is the path of the laser beam reflected by the mirrors. The laser beam as a path will bounce off the sides of the rectangle and eventually return to (1, 0).5 If for some 𝑗 with 2 ≤ 𝑗 ≤ 𝑛, the point (2𝑗 − 1, 0) is not on this path, position the laser (with angle 45∘ ) at that point and generate another laser path. Continue this process until every point of the form (2𝑗 − 1, 0) is on a laser path. The union of all these laser paths forms a sona drawing. Using Definition 9 means that the rounded turns of the sona drawing of Figure 3b will now be replaced with right-angle reflections in the mirrors as shown in Figure 4. These sona drawings can be idealized as loops in a graph. To make this language precise, we have the following definition. Definition 10: Graphs. A graph 𝒢 is a set of vertices 𝑉 and a set of edges 𝐸. Each of the edges is labeled by a pair of vertices from 𝑉. We may think of an edge 𝐴𝐵 as a curve, such as a line segment, whose endpoints are the two vertices 𝐴 and 𝐵. For example, the game trees introduced in Chapter II are graphs.6 5 Since the boundary of the rectangle contains only finitely many possible reflection points, the beam must eventually reflect at a point on the boundary it has already visited. Let 𝑍 be the first such revisited point. Let 𝑋 and 𝑌 be the two boundary points connected to 𝑍 by 45-degree lines. If 𝑍 ≠ (1, 0), the beam must have arrived and left 𝑍 and then returned to 𝑍, which means that either 𝑋 or 𝑌 had been already visited twice, a contradiction. So 𝑍 = (1, 0). 6 In particular, a game tree’s vertices are all the various nim configurations that could conceivably occur in a game, and each of its edges connect a configuration 𝒞 to a child configuration 𝒟 of 𝒞.

80

Chapter III: GCDs and Diophantine Equations

Definition 11: Sona graphs. The sona graph 𝒢(𝑚, 𝑛) is the graph associated with 𝒮(𝑚, 𝑛). The vertices of 𝒢(𝑚, 𝑛) are the 2(𝑚 + 𝑛) points along the boundary of a 2𝑚×2𝑛 rectangle bounding an array of 𝑚×𝑛 dots. Specifically, these vertices’ coordinates are of the form (2𝑖 − 1, 0), (2𝑚, 2𝑗 − 1), (2𝑖 − 1, 2𝑛), and (0, 2𝑗 − 1), where 1 ≤ 𝑖 ≤ 𝑚 and 1 ≤ 𝑗 ≤ 𝑛. The edges of the graph are the segments between successive points of reflection in the laser beam paths. For example, in Figure 4, segment 𝐷𝐺 is an edge between the vertices 𝐷 and 𝐺, but segment DE is not an edge of the graph. Since we used the terms path and cycle somewhat informally in Definition 9, we now give a more formal definition. Definition 12: Paths and cycles. A path in 𝒢 is a finite string of (not neccessarily distinct) vertices where each pair of adjacent vertices is an edge in 𝒢. A cycle in 𝒢 is a path of otherwise distinct vertices beginning and ending with the same vertex. Each loop in a sona drawing is a cycle in the corresponding sona graph. Every cycle is a path. Definition 13: Covers. Let 𝒢 be a graph with vertex set 𝑉. Let 𝒫 be a set of paths in 𝒢. We say that 𝒫 covers 𝒢 if the set of all vertices in 𝒫 is 𝑉. Example 14: A cycle in 𝒢(5, 3). In Figure 4, we have labeled the sixteen vertices of 𝒢(5, 3) by the letters 𝐴 through 𝑃. The path in the sona graph is the cycle 𝑤: 𝑤 = 𝐴𝐽𝐺𝐷𝑀𝑁𝐶𝐻𝐼𝐵𝑂𝐿𝐸𝐹𝐾𝑃𝐴. The set {𝑤} is a cover of 𝒢(5, 3).

♢

Before explaining how Figures 3b and 4 demonstrate that gcd(5, 3) = 1, consider an example of a pair of integers whose greatest common divisor is more than 1. Example 15: A sona graph containing three cycles. Consider the sona graph 𝒢(9, 6). This time we cannot complete the sona drawing with just one cycle. We must raise our drawing finger and start afresh, not just a second time but a third as well, as shown in Figure 5. Starting the sona curve along the 𝑥-axis at 𝑥 = 1, 11, or 13 generates the cycle of Figure 5a. Starting at 𝑥 = 3, 9, or 15 generates the cycle of Figure 5b. Starting at 𝑥 = 5, 7, or 17 generates the cycle of Figure 5c. It is no coincidence that the greatest common divisor of 6 and 9 is indeed 3. ♢

An ancient algorithm for the greatest common divisor

a. The cycle starting at (1, 0).

c. The cycle starting at (5, 0).

81

b. The cycle starting at (3, 0).

d. Union of the three cycles.

Figure 5. The 9 × 6 sona drawing. The patterns obtained in sona drawings when following Definition 11 are called plaited mats. Use of different colored cords or reeds for each cycle in these mats make rather attractive craft designs. Definition 16: The number of cycles in a sona graph. For 𝑚, 𝑛 ∈ ℤ+ , let 𝒬(𝑚, 𝑛) be the number of distinct cycles in the sona graph 𝒢(𝑚, 𝑛). For instance, 𝒬(5, 3) = 1 and 𝒬(9, 6) = 3 by Examples 14 and 15. The proofs for the next three propositions are left to the reader. Proposition 17: Distinct cycles are disjoint. Let 𝑤 and 𝑣 be distinct cycles in 𝒢(𝑚, 𝑛) where 𝑚, 𝑛 ∈ ℤ+ . The cycles 𝑤 and 𝑣 have no vertices in common. Proposition 18: Every vertex is part of a sona cycle. With 𝑚, 𝑛 ∈ ℤ+ , let 𝑊 be a vertex in the sona graph 𝒢(𝑚, 𝑛). Then 𝑊 is part of some cycle in the graph 𝒢(𝑚, 𝑛). Proposition 19: Edge crossings. Let 𝑚, 𝑛 ∈ ℤ+ . In the sona graph 𝒢(𝑚, 𝑛) the (laser-traced) edges of the graph cross each other or touch each other at (2𝑖−1, 2𝑗) and (2𝑖, 2𝑗 − 1) for all 𝑖, 𝑗 ∈ ℕ with 1 ≤ 𝑖 ≤ 𝑚 and 1 ≤ 𝑗 ≤ 𝑛.

82

Chapter III: GCDs and Diophantine Equations

(2n − 2j + 1, 2n)

(2n, 2n − 2j + 1)

(0, 2j − 1)

(2j − 1, 0) Figure 6. A sona cycle within a square. Proposition 20: Square sona arrays. Consider the sona graph 𝒢(𝑛, 𝑛) where 𝑛 ∈ ℤ+ . This sona graph contains 𝑛 distinct cycles. That is, 𝒬(𝑛, 𝑛) = 𝑛. Proof. Each cycle in 𝒢(𝑛, 𝑛) has exactly four vertices. In particular, if the cycle starts along the bottom edge of the rectangle at (2𝑗 − 1, 0) for some integer 𝑗 with 1 ≤ 𝑗 ≤ 𝑛, then the resultant cycle is given by the following string of vertices (where each vertex is labeled by its coordinates). We use a rightward arrow to indicate that adjacent vertices are connected by an edge in 𝒢(𝑛, 𝑛). (2𝑗 − 1, 0) → (2𝑛, 2𝑛 − 2𝑗 + 1) → (2𝑛 − 2𝑗 + 1, 2𝑛) → (0, 2𝑗 − 1) → (2𝑗 − 1, 0). Figure 6 illustrates this structure. Since 𝒢(𝑛, 𝑛) contains exactly 4𝑛 vertices, Proposition 17 implies that 𝒬(𝑛, 𝑛) = 𝑛. Example 21: A prelude to a proof of Proposition 22. Consider 𝒢(9, 6) in Figure 7. In Example 15 we showed that three cycles cover the graph. By Proposition 17 there is no other way to cover the graph using only cycles. We can also view this figure as the juxtaposition of two smaller graphs 𝒢(6, 6) and 𝒢(3, 6). We represent this juxtaposition using a gray vertical line separating these two smaller graphs. Figure 7 shows a 9 × 6 array partitioned by a gray vertical line into a 6 × 6 array on the left and a 3 × 6 array on the right. We claim that 𝒬(9, 6) = 𝒬(3, 6). To see why this is so, let 𝒞 = 𝐴𝐹𝐸𝐷𝐶𝐵𝐴 be the cycle in 𝒢(3, 6) shown in Figure 7. 𝒞 touches the gray line at 𝐶 and 𝐹. Into cycle 𝒞 we splice cycles from 𝒢(6, 6) (and ˆ within 𝒢(9, 6): remove vertices 𝐶 and 𝐹) to form the cycle 𝒞 ˆ = 𝐴𝑅𝑆𝑇𝐸𝐷𝑍𝑌 𝑋𝐵𝐴. 𝒞

An ancient algorithm for the greatest common divisor R

X

83

D

S C E B F

Y Z

T

A

Figure 7. Severing a square in a rectangular grid. Each of the three cycles in 𝒢(3, 6) will include two points on the gray line and so will have two cycles from 𝒢(6, 6) spliced into it. Thus the number of cycles or loops needed to complete the sona drawing of a 9 × 6 array is the same as for the 3 × 6 array. ♢ Proposition 22: Rectangular sona arrays. Consider the sona graph 𝒢(𝑚, 𝑛) where 𝑚, 𝑛 ∈ ℤ+ with 𝑛 < 𝑚. Then 𝒬(𝑚, 𝑛) = 𝒬(𝑚 − 𝑛, 𝑛). Proof. We claim that 𝒢(𝑚, 𝑛) can be viewed as the juxtaposition of two smaller graphs 𝒢(𝑛, 𝑛) and 𝒢(𝑚 − 𝑛, 𝑛) in the following sense. Recall that 𝒢(𝑚, 𝑛) is embedded in a rectangular frame (of mirrors) whose bottom-left and top-right coordinates are (0, 0) and (2𝑚, 2𝑛). Arrange the two smaller graphs so that 𝒢(𝑛, 𝑛)’s bottom-left and top-right coordinates are at (0, 0) and (2𝑛, 2𝑛) and so that 𝒢(𝑚 − 𝑛, 𝑛)’s bottom-left and top-right coordinates are at (2𝑛, 0) and (2𝑚, 2𝑛). Let ℒ be the vertical line through (2𝑛, 0), the common side between the two rectangular arrays. We can imagine ℒ to be a double-sided mirror. Line ℒ contains a total of 𝑛 vertices that do not belong to 𝒢(𝑚, 𝑛) but do belong to both of the smaller graphs. By Proposition 19, the collection of edges in 𝒢(𝑚, 𝑛) looks exactly the same as the collection of edges in the two smaller graphs. The only difference is that each edge 𝑒 = 𝐴𝑅 in 𝒢(𝑚, 𝑛) that passes through ℒ at vertex 𝐹 consists of two edges 𝑒1 = 𝐴𝐹 ∈ 𝒢(𝑛, 𝑛) and 𝑒2 = 𝐹𝑅 ∈ 𝒢(𝑚 − 𝑛, 𝑛). In our proof we shall splice cycles together from the two smaller graphs in an appropriately oriented manner. Let 𝒞 and 𝒟 be cycles in 𝒢(𝑚 − 𝑛, 𝑛) and 𝒢(𝑛, 𝑛), respectively, that have a common vertex 𝐹 on ℒ. We write 𝒞 = 𝐴𝐹𝑤𝐴 and 𝒟 = 𝐹𝑅𝑆𝑇𝐹, where 𝑤 is a path (of distinct vertices not including 𝐴 and 𝐹) in 𝒢(𝑚 − 𝑛, 𝑛). (In Figure 7, 𝑤 = 𝐸𝐷𝐶𝐵.) Depending on the relative orientations of 𝒞 and 𝒟, we know that either 𝐴𝑅 or 𝐴𝑇 is an edge of 𝒢(𝑚, 𝑛). If 𝐴𝑅 is an

84

Chapter III: GCDs and Diophantine Equations

edge, the result of replacing vertex 𝐹 in 𝒞 with 𝒟 (after dropping 𝐹 from 𝒟) is the sequence of vertices ℰ = 𝐴𝑅𝑆𝑇𝑤𝐴. Otherwise it is ℰ = 𝐴𝑇𝑆𝑅𝑤𝐴. Convert each cycle in 𝒢(𝑚 − 𝑛, 𝑛) into a cycle in 𝒢(𝑚, 𝑛) in the following manner. Let 𝒞 be a cycle in 𝒢(𝑚 − 𝑛, 𝑛). The cycle 𝒞 will contain at least one vertex from ℒ, and, by Proposition 20, to each vertex in ℒ there corresponds a cycle from 𝒢(𝑛, 𝑛) containing that vertex and no other vertex in ℒ. Replace each vertex in ℒ ∩ 𝒞 with its corresponding appropriately oriented cycle from 𝒢(𝑛, 𝑛), and eliminate the vertex in ℒ from both cycles. Since the resulting cycle does not include any vertices from ℒ, it is a cycle in 𝒢(𝑚, 𝑛). We claim that every cycle in 𝒢(𝑚, 𝑛) can be produced in this manner. Every vertex of 𝒢(𝑛, 𝑛) is included in a 𝒢(𝑛, 𝑛) cycle and every 𝒢(𝑛, 𝑛) cycle is connected to a 𝒢(𝑚 − 𝑛, 𝑛) cycle via a vertex in ℒ. Hence every vertex in 𝒢(𝑛, 𝑛) (as well as every vertex in 𝒢(𝑚 − 𝑛, 𝑛)) will be included in one of our newly constructed 𝒢(𝑚, 𝑛) cycles. Since the gluing procedure above will cover every vertex in 𝒢(𝑚, 𝑛), every 𝒢(𝑚, 𝑛) cycle is produced from a 𝒢(𝑚 − 𝑛, 𝑛) cycle. Therefore 𝒬(𝑚, 𝑛) = 𝒬(𝑚 − 𝑛, 𝑛).

Figure 8. A sona lioness.

Corollary 23: Sona drawings and GCDs. The number of disjoint cycles in the sona graph 𝒢(𝑚, 𝑛) is gcd(𝑚, 𝑛). Proof. By Proposition 22, the condition 𝒬(𝑚, 𝑛) = 𝒬(𝑚 − 𝑛, 𝑛) when 0 < 𝑛 < 𝑚 means that we can treat the 𝑚 × 𝑛 array of dots as the chocolate bars of Proposition 4. Thus 𝒢(𝑚, 𝑛) = gcd(𝑚, 𝑛). Finally, with respect to the story being told while sona cycles are being drawn on the ground, once the master story-teller finishes the tale and the sona drawing is complete, the master makes two more finger swishes in the sand. A tail appears on the left-hand side and a head on the right-hand side of the drawing. As in

The Diophantine solution

85

Figure 8, a stylized lioness appears, who perhaps was the main character in the tale being told.7

The Diophantine solution The Euclidean algorithm, Proposition 6, gives a way to solve the linear Diophantine equation (1). Definition 24: Standard form of a linear Diophantine equation. Let 𝑎, 𝑏 ∈ ℤ+ and 𝑐 ∈ ℤ with 𝑐 ≠ 0. Let 𝑑 = gcd(𝑎, 𝑏). A linear Diophantine equation, 𝑎𝑥 + 𝑏𝑦 = 𝑐, is in standard form if 𝑑 = 1. A general solution to a Diophantine equation is a representation that characterizes all possible solutions of the equation. Let 𝑎, 𝑏, 𝑑 ∈ ℤ+ and 𝑐 ∈ ℤ\{0} with gcd(𝑎, 𝑏) = 𝑑. If 𝑑 does not divide 𝑐, then the equation 𝑎𝑥 + 𝑏𝑦 = 𝑐 has no solution. If 𝑑|𝑐 and 𝑑 > 1, the equation 𝑎𝑥 + 𝑏𝑦 = 𝑐 can be simplified to the standard form 𝑏 𝑐 𝑎 ( )𝑥 + ( )𝑦 = . 𝑑 𝑑 𝑑 Thus we pose the following solution scheme for linear Diophantine equations in standard form. Proposition 25: A Diophantine theorem. Given nonzero relatively prime integers 𝑎 and 𝑏, a nonzero integer 𝑐, and an integer solution 𝑥 = 𝑥0 and 𝑦 = 𝑦0 to 𝑎𝑥 + 𝑏𝑦 = 𝑐, the general solution to 𝑎𝑥 + 𝑏𝑦 = 𝑐 is given by 𝑥 = 𝑥0 + 𝑏𝑡 and 𝑦 = 𝑦0 − 𝑎𝑡

(5)

for all integers 𝑡. Proof. Observe that any 𝑥 and 𝑦 given by Equation (5) is a solution to 𝑎𝑥 + 𝑏𝑦 = 𝑐 because 𝑎(𝑥0 + 𝑏𝑡) + 𝑏(𝑦0 − 𝑎𝑡) = 𝑎𝑥0 + 𝑏𝑦0 = 𝑐. Let 𝑥 = 𝑥1 and 𝑦 = 𝑦1 be another solution. Then 𝑎(𝑥1 −𝑥0 )+𝑏(𝑦1 −𝑦0 ) = 0. Since gcd(𝑎, 𝑏) = 1, we have 𝑏|(𝑥1 − 𝑥0 ) and 𝑎|(𝑦1 − 𝑦0 ) (proving this last statement is Exercise 5b). Thus 𝑥1 − 𝑥0 = 𝑏𝑡 and 𝑦1 − 𝑦0 = 𝑎𝑠 for some integers 𝑠 and 𝑡. So 7 G. H. Hardy (1877–1947) once remarked about his work as a number theorist, “I have never done anything useful. No discovery of mine has made, or is likely to make, directly or indirectly, for good or ill, the least difference to the amenity of the world.” He was mistaken, as his work has many real-world applications. In much the same way, the greatest common divisor algorithm as sona drawings was never intended to be useful other than as a story-telling technique. Thus, for eons this useful number-theoretic algorithm awaited the day when it would be used in cryptology, solving Diophantine equations, finding modular inverses, and generating continued fractions, to name just a few of its uses.

86

Chapter III: GCDs and Diophantine Equations

𝑥1 = 𝑥0 + 𝑏𝑡 and 𝑦1 = 𝑦0 + 𝑎𝑠, which almost matches Equation (5). Since 𝑥 = 𝑥1 and 𝑦 = 𝑦1 is a solution to the Diophantine equation, 𝑎(𝑥0 + 𝑏𝑡) + 𝑏(𝑦0 + 𝑎𝑠) = 𝑐, which means that 𝑎𝑏𝑡 + 𝑎𝑏𝑠 = 0. Since 𝑎𝑏 ≠ 0, 𝑠 = −𝑡. So 𝑥1 = 𝑥0 + 𝑏𝑡 and 𝑦1 = 𝑦0 − 𝑎𝑡, which matches Equation (5) exactly. Therefore every solution to the Diophantine equation is given by Equation (5). Table 6. Sales of $5 and $7 tickets from Example 26. 𝑠

104

(𝑥, 𝑦)

(46, 4)

𝑥+𝑦

50

105

106

(39, 9) (32, 14) 48

107

108

109

110

(25, 19)

(18, 24)

(11, 29)

(4, 34)

44

42

40

38

46

Example 26: A Diophantine ticket sales problem. Tickets for a small concert sell at $7 for a balcony seat and $5 to stand on the main floor. Ticket sales amount to $258. How many tickets were sold? Solution. Let 𝑥 and 𝑦 be the number of $5 tickets sold and the number of $7 tickets sold, respectively. Our Diophantine equation to solve is 5𝑥 + 7𝑦 = 258.

(6)

By Euclid’s algorithm, 5 ⋅ 3 + 7 ⋅ (−2) = 1. Therefore one solution to Equation (6) is 𝑥0 = 3 ⋅ 258 = 774 and 𝑦0 = −2 ⋅ 258 = −516. By Equation (5) (with 𝑠 = −𝑡), solutions to Equation (6) are given by 𝑥 = 774 − 7𝑠 ≥ 0

and

𝑦 = −516 + 5𝑠 ≥ 0,

since we cannot sell a negative number of tickets of either kind. These two inequalities can be written as 774 516 𝑠≤ ≈ 110.57 and 𝑠≥ ≈ 103.2. 7 5 Thus we have solutions to Equation (6) whenever 104 ≤ 𝑠 ≤ 110. Table 6 gives the ticket sales as 𝑠 ranges from 104 to 110. Observe that the total number of tickets could be any even integer from 38 through 50. ♢ The next example is more challenging. Example 27: A classic Diophantine coconut problem. Five sailors are marooned on a tropical island. They pile coconuts in a heap during the day and agree to divide them equally at sunrise. During the night, sailor 1 wakes, divides the nuts into five equal piles with one left over, which he gives to a friendly monkey; he hides one pile, combines the rest into a single heap, and retires. Subsequently, sailor 2 wakes and does the same thing, as do the third, fourth, and fifth sailors.

The Diophantine solution

87

At sunrise, the remaining pile of coconuts (less one) is divisible by 5. Find the least possible number 𝑁 of coconuts in the original pile. Solution. Let 𝑎, 𝑏, 𝑐, 𝑑, and 𝑒 be the number of coconuts that the respective sailors hid. Then we have six equations: 𝑁 = 5𝑎+1, 4𝑎 = 5𝑏+1, 4𝑏 = 5𝑐+1, 4𝑐 = 5𝑑+1, 4𝑑 = 5𝑒+1, 4𝑒 = 5𝑓+1. Observe that we can eliminate the variables 𝑎 through 𝑒 by replacing them with an equivalent expression in terms of 𝑓. That is, 42 𝑑 = 5(4𝑒) + 4 = 5(5𝑓 + 1) + 4 = 52 𝑓 + 9, and so on. Thus the six equations can be replaced with a single Diophantine equation: 45 𝑁 − 56 𝑓 = 11529. (7) 5 By Euclid’s algorithm, 4 𝑥 − 56 𝑦 = 1 gives 𝑥 = −4776 and 𝑦 = −313. Thus 𝑁0 = −4776 ⋅ 11529 and 𝑓0 = −313 ⋅ 11529 is a solution to Equation (7), which means that any solution must be of the form 𝑁 = 𝑁0 + 56 𝑡 and

𝑓 = 𝑓0 + 45 𝑡,

where 𝑡 is an integer. The least value of 𝑡 for which both 𝑁 and 𝑓 are nonnegative is 𝑡 = 3525. So the least value of 𝑁 is 𝑁 = 𝑁0 + 56 ⋅ 3525 = 15621. Rather than using Euclid’s algorithm to find a particular solution to Equation (7), a clever way to do so is to let 𝑁0 = −4 coconuts. What? A negative number of coconuts? Yes, because sailor 1 rises, gives 1 coconut to the monkey (from the pile of −4 coconuts), so there are now −5 coconuts in the pile which he splits into fifths. He hides 𝑎 = −1 coconut and puts the remaining −4 coconuts in a pile. Aha, a fixed point! The other sailors do likewise, giving 𝑏 = −1 = 𝑐 = 𝑑 = 𝑒 = 𝑓. Thus, 𝑁 = −4 + 56 𝑡, and the least positive value of 𝑁 is at 𝑡 = 1, namely, 𝑁 = −4 + 56 = 15621. ♢ Example 28: Solving a Diophantine equation via continued fractions. As a prelude to Chapter IX, we solve the Diophantine equation 532𝑥 + 1193𝑦 = 2 via simple continued fractions. 532 Let 𝐹 = . By Definition 1 of the Introduction, and from Example II.4, 1193 the simple continued fraction for 𝐹 is 532 1 𝐹 = [0; 2, 4, 8, 16] = . = 1 1193 2+ 1 4+ 1 8+ 16 Now consider the convergents 𝐶1 , 𝐶2 , and 𝐶3 : 4 33 1 1 1 = = , 𝐶1 = , 𝐶2 = 𝐶3 = . 1 1 2 9 74 2+ 2+ 1 4 4+ 8

88

Chapter III: GCDs and Diophantine Equations 33

The penultimate convergent 𝐶3 = gives us a way to find a particular solution to 74 the Diophantine equation. (In general, this result holds for other linear Diophantine equations because successive convergents are Farey neighbors, as will be seen in Proposition IV.15 and Lemma IX.17.) Observe that 532⋅(−74)+1193⋅33 = 1. Thus we know that 𝑥0 = 2(−74) = −148 and 𝑦0 = 2 ⋅ 33 = 66 is a solution to our equation. By Proposition 25, any solution to the equation is given by 𝑥 = −148 + 1193𝑡

and

where 𝑡 ∈ ℤ.

𝑦 = 66 − 532𝑡, ♢

A litmus test for Euclid’s solution As we have seen, applying Euclid’s algorithm to relatively prime integers 𝑎 and 𝑏 with 0 < 𝑎 < 𝑏 yields a specific pair of integers 𝑥 = 𝑥0 and 𝑦 = 𝑦0 such that 𝑎𝑥 + 𝑏𝑦 = 1, even though there are an infinite number of solutions to that Diophantine equation. For example, with 𝑎 = 5 and 𝑏 = 11, −2 ⋅ 5 + 1 ⋅ 11 = 1 = 9 ⋅ 5 − 4 ⋅ 11. Since there are an infinite number of solutions to 𝑎𝑥 + 𝑏𝑦 = 1, how can we recognize which solution Euclid’s algorithm will produce? The following litmus test is due to Rankin [121]. Definition 29: Euclidean pair. We say that {𝑥0 , 𝑦0 } is a Euclidean pair of integers for the relatively prime integers 𝑎 and 𝑏, 0 < 𝑎 < 𝑏, if Euclid’s algorithm generates the linear combination 𝑎𝑥0 + 𝑏𝑦0 = 1. Proposition 30: Rankin’s litmus test. Let 𝑎 and 𝑏 be relatively prime with 0 < 𝑎 < 𝑏. Then {𝑥, 𝑦} is the Euclidean pair for 𝑎 and 𝑏 if and only if 𝑎𝑥 + 𝑏𝑦 = 1, |𝑦| ≤ 𝑎/2, |𝑥| ≤ 𝑏/2, and at least one of the preceding two inequalities is a strict inequality. Proof. To prove this proposition we apply induction on the number 𝑛 of times the division algorithm is used in Euclid’s algorithm before generating a zero remainder. Let {𝑥, 𝑦} be a Euclidean pair for 𝑎 and 𝑏. For 𝑛 = 1, the only time a zero remainder arises on the first division is when 𝑎 = 1; by definition, {1, 0} is the Euclidean pair for 𝑎 = 1 and 𝑏 ≥ 2; observe that 1 ≤ 𝑏/2 and 0 < 𝑎/2. Assume that the litmus test is true for some integer 𝑛 ≥ 1 when applying Euclid’s algorithm to any two relatively prime positive integers 0 < 𝑎 < 𝑏 for which 𝑛 divisions occur before attaining a zero remainder. Suppose that Euclid’s algorithm requires 𝑛 + 1 divisions to attain a zero remainder for the relatively prime integers 0 < 𝑎 < 𝑏. The requirement of 𝑛 + 1 divisions forces 𝑎 to be at least 2. By the division algorithm we have 𝑏 = 𝑎𝑞+𝑟, where 𝑞 is a positive integer and 0 ≤ 𝑟 < 𝑎. But 𝑟 ≥ 1, because Euclid’s algorithm needs 𝑛 + 1 divisions to attain a zero remainder. Let {𝑥, 𝑦} be the Euclidean pair for the relatively prime

Clock arithmetic

89

integers 𝑟 and 𝑎. By the inductive hypothesis, 𝑟𝑥+𝑎𝑦 = 1, |𝑦| ≤ 𝑟/2, and |𝑥| ≤ 𝑎/2. Substituting 𝑟 = 𝑏 − 𝑎𝑞, we have 1 = 𝑎𝑦 + 𝑥(𝑏 − 𝑎𝑞) = 𝑎(𝑦 − 𝑞𝑥) + 𝑏𝑥. Furthermore, |𝑥| ≤

𝑎𝑞 + 𝑟 𝑎 𝑟 𝑎 𝑏 and |𝑦 − 𝑞𝑥| ≤ |𝑦| + 𝑞|𝑥| ≤ + 𝑞( ) = = . 2 2 2 2 2

Since 𝑎 and 𝑏 cannot both be even, at least one of these latter two inequalities is a strict inequality. Finally, suppose that for the relatively prime integers 𝑎 and 𝑏, 0 < 𝑎 < 𝑏, there exist integers 𝑥1 and 𝑦1 for which 𝑎𝑥1 + 𝑏𝑦1 = 1, |𝑦1 | ≤ 𝑎/2, |𝑥1 | ≤ 𝑏/2, and at least one of the latter two inequalities is strict. Euclid’s algorithm produces two integers 𝑥0 and 𝑦0 with 𝑎𝑥0 + 𝑏𝑦0 = 1, |𝑦0 | ≤ 𝑎/2, and |𝑥0 | ≤ 𝑏/2, where at least one of the two inequalities is strict. If 𝑥0 = 𝑥1 and 𝑦0 = 𝑦1 , we are done. Otherwise the Diophantine algorithm says that any solution to 𝑎𝑥 + 𝑏𝑦 = 1 is given by 𝑥 = 𝑥0 + 𝑏𝑡 and 𝑦 = 𝑦0 − 𝑎𝑡 for some integer 𝑡. Thus, 𝑥1 = 𝑥0 + 𝑏𝑡0 and 𝑦1 = 𝑦0 − 𝑎𝑡0 for some integer 𝑡0 ≠ 0, which means that 𝑏 | | |𝑥1 | = |𝑥0 + 𝑏𝑡0 | ≥ ||𝑥0 | − 𝑏|𝑡0 || = 𝑏|𝑡0 | − |𝑥0 | ≥ 𝑏 − |𝑥0 | ≥ . | | 2 𝑎

Similarly, |𝑦1 | ≥ , a contradiction. 2

Clock arithmetic If in our study of Diophantine equations 𝑎𝑥 + 𝑏𝑦 = 𝑐 we restrict 𝑏 to have a fixed value, we are then studying what is called modular arithmetic. For example, given a bungee cord of length 𝑎 and a wheel or clock of circumference 𝑏 with tick marks at successive integer arc lengths around the wheel labeled clockwise 0 through 𝑏 − 1, imagine stretching the cord by a factor of 𝑥. Now wrap this cord clockwise around the wheel with one end fixed at tick mark 0. For what 𝑥 value will the other end of the cord of length 𝑎𝑥 be at 𝑐? That is, given a starting point 𝑎 and knowing that 𝑐 is some tick mark on the wheel, how many steps of size 𝑎 around the wheel must we take before landing precisely on 𝑐? Definition 31: Modular arithmetic. Let 𝑏 ∈ ℤ+ , and let 𝑎, 𝑐 ∈ ℤ. We say that 𝑎 and 𝑐 are equivalent modulo 𝑏, denoted by 𝑎 ≡ 𝑐 mod 𝑏, if 𝑏 divides 𝑎 − 𝑐. We say that integer 𝑖 is a primitive residue modulo 𝑏 if 0 ≤ 𝑖 < 𝑏. When we write 𝑎 mod 𝑏 we mean the primitive residue equivalent to 𝑎 modulo 𝑏.

90

Chapter III: GCDs and Diophantine Equations

The reader should check that relation ≡ is an equivalence relation on ℤ. In general, any primitive residue 𝑖 is the unique smallest nonnegative integer in its equivalence class modulo 𝑏. The reader may further check that adding and multiplying modulo 𝑏 are well-defined operations on equivalence classes. For example, modulo 11, an integer equivalent to 5 multiplied by an integer equivalent to 7 is an integer equivalent to 2 because 35 = 3 ⋅ 11 + 2, and so on. To show the utility of this idea, we construct a simple cryptographic scheme for encoding secret messages. Example 32: A clock cryptosystem. To encode messages using the English alphabet of 26 letters, let 𝑏 = 26. In this context, we identify each alphabet letter with a primitive residue 0 through 25 as indicated by the first two columns of Table 7. The third column of the table contains the primitive residue modulo 26 for the product of 7 and the entry in the second column. If 𝑛𝑖 is entry 𝑖 in the third column, then letter 𝑖 in the fourth column is letter 𝑛𝑖 of the alphabet. For example, the letter 𝐸 corresponds to integer 5, which when multiplied by 7 is 35, which is equivalent to 9 modulo 26. Thus we encode letter 𝐸 by the letter 𝐼. Therefore the word HELLO is encoded as DIFFA. ♢ To decode a message with respect to the cryptosystem of Example 32, we must undo a multiplication by 7. One way to accomplish this task is to read Table 7 backwards. For example, to decode the letter 𝐴, we find its location in the fourth column of the table and read across to the first column to get the letter O. But how can we decode without using the table? As in ℚ where the multiplica1 tive inverse of 7 is , we seek the multiplicative inverse of 7 modulo 26. If such a 7 number exists, it will be some primitive residue from 0 through 25. Which one is it? Definition 33: Modular multiplicative inverses.8 Let 𝑏 ∈ ℤ+ and 𝑎, 𝑐 ∈ ℤ. We say that 𝑐 is the multiplicative inverse of 𝑎 modulo 𝑏, denoted by 𝑐 = 𝑎−1 , if 𝑎𝑐 ≡ 1 mod 𝑏. For a given 𝑏 ∈ ℤ+ , some integers have no modular multiplicative inverses. Proposition 34: Inverse existence. Let 𝑎, 𝑏 ∈ ℤ with 𝑎 ≠ 0 and 𝑏 ≥ 2. Then 𝑎−1 exists modulo 𝑏 if and only if 𝑎 and 𝑏 are relatively prime. Proof. Observe that 𝑎 and 𝑏 are relatively prime if and only if 𝑎𝑥 + 𝑏𝑦 = 1 has a solution 𝑥0 and 𝑦0 , if and only if 𝑎𝑥0 ≡ 1 mod 𝑏 for some integer 𝑥0 . 8 Code 4 in Appendix III shows how a CAS can be used to solve equations of the form 𝑎𝑥 ≡ 𝑏 mod 𝑐.

Clock arithmetic

91

Table 7. Enciphering the alphabet via multiplication modulo 26. plain letter A B C D E F G H I J K L M N O P Q R S T U V W X Y Z

letter value 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 0

code value 7 14 21 2 9 16 23 4 11 18 25 6 13 20 1 8 15 22 3 10 17 24 5 12 19 0

code letter G N U B I P W D K R Y F M T A H O V C J Q X E L S Z

Example 35: A continuation of Example 32. Applying Proposition 34 to 𝑏 = 26, observe that the only primitive residues having multiplicative inverses are 1, 3, 5, 7, 9, 11, 15, 17, 19, 21, 23, 25. As the reader may check, the multiplicative inverses of these numbers are as given in Table 8. In particular, we review how to find the multiplicative inverse Table 8. Inverses modulo 26. 𝑎 𝑎 mod 26 −1

1 1

3 9

5 21

7 9 11 15 15 3 19 7

17 19 23 11

21 5

23 25 17 25

of 7. By Euclid’s algorithm, 1 = 7 ⋅ (−11) + 26 ⋅ (3) ≡ 7 ⋅ (−11) mod 26 ≡ 7 ⋅ 15 mod 26.

92

Chapter III: GCDs and Diophantine Equations

So the multiplicative inverse of 7 is 15. To decode a message such as WAABNSI, we multiply 𝑊 by 15 modulo 26 and get 23 ⋅ 15 ≡ 7 mod 26. Thus 𝑊 decodes as 𝐺. Continuing this process yields the plain text message GOODBYE. ♢

Systems of Diophantine equations In honor of an old tradition exemplified by early Chinese brainteasers dating back to the first century involving systems of linear Diophantine equations, the following proposition used to solve them is known as the Chinese remainder theorem. Proposition 36: The Chinese remainder theorem. Given a set of 𝑘 pairwise relatively prime positive integers 𝑛𝑖 , 1 ≤ 𝑖 ≤ 𝑘, the system of 𝑘 equations 𝑥 ≡ 𝑘

𝑎𝑖 mod 𝑛𝑖 with 𝑎𝑖 ∈ ℤ has a unique solution 𝑥 mod 𝑁 where 𝑁 = Π 𝑛𝑖 . 𝑖=1

Proof. Let 𝑁𝑖 = 𝑁/𝑛𝑖 , 1 ≤ 𝑖 ≤ 𝑘. By hypothesis, gcd(𝑛𝑖 , 𝑁𝑖 ) = 1. Consider the equations 𝑁𝑖𝑧 ≡ 1 mod 𝑛𝑖 . (8) Multiplying Equation (8) through by the multiplicative inverse of 𝑁𝑖 modulo 𝑛𝑖 gives the unique solution 𝑥𝑖 to Equation (8) with 0 ≤ 𝑥𝑖 < 𝑛𝑖 . Let 𝑘

𝑥 = ∑ 𝑎𝑗 𝑁𝑗 𝑥𝑗 .

(9)

𝑗=1

Since 𝑁𝑖 ≡ 0 mod 𝑛𝑗 whenever 𝑖 ≠ 𝑗, we have 𝑥 ≡ 𝑎𝑗 𝑁𝑗 𝑥𝑗 mod 𝑛𝑗 for all 𝑗 with 1 ≤ 𝑗 ≤ 𝑘. Furthermore, since 𝑁𝑖 𝑥𝑖 ≡ 1 mod 𝑛𝑖 , 𝑥 ≡ 𝑎𝑖 mod 𝑛𝑖 . So 𝑥 is a solution to the system of 𝑘 equations. To show uniqueness, let 𝑥̃ be another solution. Then 𝑛𝑖 |(𝑥 − 𝑥), ̃ which means that 𝑁|(𝑥 − 𝑥), ̃ which means that 𝑥 ≡ 𝑥̃ mod 𝑁. Example 37: A pirate puzzle from ancient China. Seventeen pirates steal a chest of gold coins. Dividing the treasure into equal portions leaves a remainder of three coins. Fighting over who should get these coins leaves a dead pirate. Equal division again leaves a remainder of ten coins. Fighting again leaves another dead pirate. This time equal division has no remainder. Find the least number of possible booty coins. Solution. With respect to the Chinese remainder theorem, the puzzle involves three equations, 𝑥 ≡ 3 mod 17,

𝑥 ≡ 10 mod 16,

and 𝑥 ≡ 0 mod 15

so that 𝑛1 = 17, 𝑛2 = 16, 𝑛3 = 15, 𝑎1 = 3, 𝑎2 = 3, and 𝑎3 = 0. Solving the system 16 ⋅ 15𝑥1 ≡ 1 mod 17,

17 ⋅ 15𝑥2 ≡ 1 mod 16,

and 𝑥3 ≡ 17 ⋅ 16 ≡ 1 mod 15

A problem from Diophantus’s Arithmetica

93

gives 𝑥1 = 9, 𝑥2 = 15, and 𝑥3 = 8. Therefore 𝑥 ≡ 3(16 ⋅ 15)9 + 10(17 ⋅ 15)15 + 0(17 ⋅ 16)8 ≡ 44730 mod 4080 ≡ 3930. Thus the minimum number of gold coins is 3930.

♢

The totient is multiplicative Let 𝜙 be the totient function of Definition I.16. As promised in Chapter I, armed with the notion of greatest common divisor, we can show that 𝜙 is multiplicative, namely, that 𝜙(𝑚𝑛) = 𝜙(𝑚)𝜙(𝑛) whenever 𝑚 and 𝑛 are relatively prime. But first we state a cancellation property of modular arithmetic, whose proof we leave as an exercise. Proposition 38: A modular cancellation property. Let 𝑎𝑚 + 𝑟 ≡ 𝑐𝑚 + 𝑟 mod 𝑏, where 𝑏 ∈ ℤ+ and 𝑎, 𝑐, 𝑚, 𝑟 ∈ ℤ with gcd(𝑚, 𝑏) = 1. Then 𝑎 ≡ 𝑐 mod 𝑏. Proposition 39: Euler’s phi function. Let 𝑚 and 𝑛 be relatively prime positive integers. Then 𝜙(𝑚𝑛) = 𝜙(𝑚)𝜙(𝑛). Proof. Observe that an integer 𝑘 is relatively prime to 𝑚𝑛 if and only if 𝑘 is relatively prime to both 𝑛 and 𝑚. Let Φ(𝑋) be the number of elements in the set 𝑋 that are relatively prime to 𝑚𝑛 so that 𝜙(𝑚𝑛) = Φ({1, 2, … , 𝑚𝑛}). By intersecting {1, 2, … , 𝑚𝑛} with the equivalence classes of integers modulo 𝑚, we decompose the set {1, 2, … , 𝑚𝑛} into the disjoint union of the sets 𝑆𝑟 = {𝑚𝑗 + 𝑟| 0 ≤ 𝑗 < 𝑛}, 0 ≤ 𝑟 < 𝑚. By our previous observation, if gcd(𝑚, 𝑟) ≠ 1 then gcd(𝑚𝑛, 𝑚𝑗 + 𝑟) ≠ 1, so Φ(𝑆𝑟 ) = 0. Suppose that gcd(𝑚, 𝑟) = 1. By our earlier observation again, Φ(𝑆𝑟 ) is the number of elements of 𝑆𝑟 that are relatively prime to 𝑛. Since gcd(𝑛, 𝑚𝑗 + 𝑟) = gcd(𝑛, (𝑚𝑟 + 𝑟) mod 𝑛) and since, by Exercise 5d, {(𝑚𝑗 + 𝑟) mod 𝑛| 0 ≤ 𝑗 < 𝑛} = {0, 1, 2, … , 𝑛 − 1}, we conclude that Φ(𝑆𝑟 ) = Φ({0, 1, 2, … , 𝑛 − 1}) = 𝜙(𝑛). Since there are 𝜙(𝑚) values of 𝑟 for which Φ(𝑆𝑟 ) ≠ 0 and for each such 𝑟, Φ(𝑆𝑟 ) = 𝜙(𝑛), we conclude that 𝜙(𝑚𝑛) = Φ({1, 2, … , 𝑚𝑛}) = 𝜙(𝑚)𝜙(𝑛).

A problem from Diophantus’s Arithmetica To close this chapter, we showcase a typical problem from the Arithmetica. Example 40: Diophantus Problem 24 from Book I.∗ Find three numbers such that if each receives a given fraction of the sum of the other two, the results are all equal. Let it be required that the first number receives a third of the sum of the two remaining numbers, the second receives a fourth of the sum of the two

94

Chapter III: GCDs and Diophantine Equations

remaining numbers, the third receives a fifth of the sum of the two remaining numbers, and the resulting numbers are equal. Solution. Let 𝑁 be the common value of all three sums, and let 𝑥, 𝑦, and 𝑧 be the three numbers. In today’s algebra we write the three sums as 1 1 1 𝑁 = 𝑥 + (𝑦 + 𝑧), 𝑁 = 𝑦 + (𝑥 + 𝑧), and 𝑁 = 𝑧 + (𝑥 + 𝑦). 3 4 5 This linear system of three equations with four unknowns can be replaced with three equations each of which involves 𝑁 and one of 𝑥, 𝑦, and 𝑧. Doing so gives 13𝑁 = 25𝑥,

17𝑁 = 25𝑦,

and 19𝑁 = 25𝑧.

Diophantus gives but one solution, namely, 𝑥 = 13, 𝑦 = 17, 𝑧 = 19, and 𝑁 = 25. But any rational multiple of this solution set gives a valid rational solution set to the problem. ♢ Exercises 1. (a) A mother has five daughters and 25 trees. Tree 𝑖 produces 𝑖 baskets of figs each year, 1 ≤ 𝑖 ≤ 25. How may she partition the trees so that each daughter has four trees and each daughter’s total fig production is the same? Table 9. Two Sudoku puzzles. 8 6 3 2 9 4 5 7 1 a.

1 2 6 4 3 5 7 9 4 7 9 2 5 1 8 3 9 5 1 8 7 2 6 4 7 6 3 1 4 8 9 5 8 3 5 7 2 6 4 1 5 1 8 6 9 3 2 7 3 4 2 9 6 7 1 8 2 8 4 5 1 9 3 6 6 9 7 3 8 4 5 2 A completed Sudoku puzzle.

5

6 4

9

7

6 9 8

5 3 8 7 6 2 1 2 3 4 8 4 2 3 8 9 1 4 5 3 9 5 4 b. A Sudoku puzzle.

(b) Use the completed Sudoku puzzle in Table 9a to generate another solution to the Mecca problem. (c) Complete the Sudoku puzzle in Table 9b to generate another solution to the Mecca problem. (d) The landowner of the Mecca problem labeled his trees 1 to 81 according to their fruitfulness. On a sandy region of his orchard, he drew a 9×9 grid. After much trial and error he succeeded in entering all 81 integers into the grid so that the column sums were all the same. Thus son 1 receives

Exercises

95

the trees labeled in the first column, son 2 receives the trees labeled in the second column, and so on. As he was admiring his solution, an infrequent rain shower rendered some numbers illegible. He recovered the first row easily enough. But what about the empty cells in the grid of Table 10? Can you help him recover his solution? Table 10. A Mecca problem puzzle. 1

2

3 15

4

28 43

30

5 6 16 17

26 45 47 60 75

7

8

9

22 38 51 49 57

48 56 71 68

31

69 80

41 58 64

73

78

2. Suppose Ali and Mweni have 16 children. Using hexadecimal notation, label them 1, 2, 3, 4, 5, 6, 7, 8, 9, 𝐴, 𝐵, 𝐶, 𝐷, 𝐸, and 𝐹. They also have 162 trees, and tree 𝑖 produces 𝑖 baskets of figs annually. How may Ali and Mweni partition the trees equitably? Write your answer in hexadecimal form, remembering for example that the decimal integer 256 is the hexadecimal string 100. 3. Mo and Hanna have 25 children. To pose a 25 × 25 Mecca problem they use a base 26 numbering system. In this system, 𝐴 is worth 1, 𝐵 is worth 2, 𝐶 is worth 3, and so on, so that letter 𝑌 is worth 25. The letter 𝑍 represents zero. Any integer up to 625 can be written using at most two of these symbols. For example, the integer symbolized by NM is 377 (being 26 ⋅ 14 + 13). Again as in Exercise 1d, rain has eradicated some of the symbols in the 25 × 25 array of Table 11. Complete the table. 4. Besides the plaited mat pattern as exemplified in Figure 3, the Cockwe people have many more such patterns. Some of these involve placing extra mirrors in an array of dots. In particular, we place a horizontal (double-sided) mirror between each dot in even numbered columns. For example, Figure 9a shows a 5 × 3 grid with four mirrors placed in the second and fourth columns. The rules for sand drawing are the same as before, but now we have more reflective surfaces. The result is called a lion’s stomach. Figure 9b shows the completed sona drawing. (a) Draw a 7 × 5 and a 9 × 5 lion’s stomach.

96

Chapter III: GCDs and Diophantine Equations Table 11. A 25 × 25 Mecca problem challenge.

A

B

C

D

BN CD DF

BG CC

BP CV

BO CJ DJ

E AS BU

DU EO FJ

EU GN HO HU IT

MD MX NN PB PK

LP ND OE

HL IE IU

LO

KD KX LX

RQ

UW WZ

WQ

IQ KB LL

ID JH

H AD

CM DQ ER FZ FX HZ IJ

CS

I

J

K AI

L

BQ BX DN

DI

DP

DL EC

EB FD GB HP

SE

PX QT SB

TM VZ WD

UR VH WU

O

BW CT

BZ CB

CF

EF FI GZ HH

LS NJ NR PC

NG NP

NA

RU SU UP

UL

VP XA

WF

WR

NE

QC QS RV

SW UD

UZ UO

VN

T

CE

EW GP HJ

JP KU LW NI

OF

RE

S

U AH

V AC

CP

CN

W AA

EZ

PH RG

R

JI

KM LA

MZ MW

Q AE BS

FN GG

HS JF JX

P

X

Y

BD

BH

EJ

EG FK

DD

GE

QE RB RP TG VB VD

N

GU

KC KQ

RF RI SM

M

FP

NL OS QH

SS TX UK

GL HI

G

NC

QY SL

F AP

PR QI RS TW VA

VK

KP ML MV OC OU

LZ

FQ

ST TI UJ

FR GK GY

HV IY

IG JD

HW JC

KA

LD MH MO OJ OP PJ

OA OY PN

QJ SC

RO

SA

TO

TX

MM OH OQ PV

VY

EX GD GT

KN

RC SK TQ

FC GF HB IM JG

MG

TE UY VL

MI MN

OG QG QR RJ SR

QU

MT NM

PI QB QQ RN TZ UC

SJ TF

UV

WV

XZ

WI

VM WE

WJ

(b) How many loops are needed to complete an 𝑚 × 𝑛 lion’s stomach? Experiment with various values of 𝑚 and 𝑛.

a.

b.

Figure 9. A sona lion stomach. (c) Generate the sona drawing associated with the dot-and-mirror configuration of Figure 10, which Gerdes [55] refers to as a chased chicken design. 5. (a) Let 𝑚, 𝑛 ∈ ℤ+ . Show that gcd(𝑚, 𝑛) = gcd(𝑚 mod 𝑛, 𝑛). (b) If 𝑎𝑥 = 𝑏𝑦 and gcd(𝑎, 𝑏) = 1, show that 𝑎|𝑦 and 𝑏|𝑥. (c) Show that gcd(𝑚𝑛, 𝑎) = 1 if and only if gcd(𝑚, 𝑎) = 1 = gcd(𝑛, 𝑎), where 𝑚, 𝑛, and 𝑎 are positive integers. (d) Let gcd(𝑚, 𝑛) = 1 and let 𝑟 be an integer with 0 ≤ 𝑟 < 𝑚. For each integer 𝑘 with 0 ≤ 𝑘 < 𝑛, show that there is an integer 𝑗 that solves the equation 𝑘 ≡ 𝑗𝑚 + 𝑟 mod 𝑛 .

Exercises

97

Figure 10. A chased chicken. | | (e) Show that |𝑥 + 𝑦| ≥ ||𝑥| − |𝑦|| for all 𝑥, 𝑦 ∈ ℝ. | | (f) Show that the geometric mean of two different positive real numbers 𝑎 and 𝑏 is less than the arithmetic mean of the numbers. That is, show that √𝑎𝑏 < 𝑎+𝑏 . 2

6. (a) Using the encoding scheme of Example 4, encode the phrase, “For the Cokwe, these drawings are their writings.” (b) Using this same scheme, decode IGUD BGS JDGJ HGCCIC CA JAA BKCGHHIGV JDI UAFFIUJKXI HGCJ. (c) Decode the message below, which was encoded using multiplication by 5 mod 26. QERTILEBNQ NEFY JYYR NERTYT TWKR VNLWAIN E DYK MYR WFYL MERU IYRYLEVSWRQ 7. Solve this problem of Diophantus, Problem 29 from Book I: Find two numbers such that their sum and the difference of their squares are given numbers. Let it be required that the sum of the numbers is 20 and the difference of their squares is 80. 8. Solve this puzzle from Brahmagupta (circa 650 ad) [112]: An old woman goes to market and a horse steps on her basket and crushes the eggs. The rider offers to pay for the damages and asks her how many eggs she had brought. She does not remember the exact number, but when she had taken them out two at a time, there was one egg left. The same happened when she picked them out three, four, five, and six at a time, but when she took them seven at a time they came out even. What is the smallest number of eggs she could have had?

98

Chapter III: GCDs and Diophantine Equations

9. Solve this Trattato d’Arithmetica puzzle attributed to Paolo dell’Abbaco, circa 1370 [146]: A man sent one of his children to a garden to fetch seven apples, saying: “You will find three gatekeepers, each of whom will say: ‘I want half of all your apples and two more from those which remain after the division.’ I want to know how many have to be taken at the beginning, so that seven will remain at the end.” 10. Solve this puzzle from the 850 ad Indian collection of puzzles, Ganita-sārasaṅgraha of Mahāvīra [146]: When a certain man brought mango fruits home, his elder son took one fruit first and then half of what remained. After the elder son did this, the younger son did similarly with what was left there. He further took half of what was thereafter left; and the other son took the other half. Find the number of fruits brought by the father.

Strand IV: Fractions in the Pythagorean Scale As an application of fractions, the subject of this next chapter, we look at how the musical scale developed. Among the discoveries attributed to Pythagoras is the Pythagorean scale. According to Iamblichus, a fourth-century philosopher who wrote at length about this legendary mathematician, one day Pythagoras walked by a blacksmith shop where he heard hammers beating on iron. At the forge, he realized that some combinations of hammers sounding simultaneously produced pleasant harmony, whereas other combinations of hammers produced merely noise.

Figure 1. Pythagoras at the lyre, an old woodcut. Iamblichus continues the legend, saying that Pythagoras then experimented with various amounts of water in vessels, various lengths of pipes, and various lengths of strings on an instrument with a moveable bridge allowing for careful measurement of musical ratios. In Figure 1, Pythagoras is plucking strings of the same lengths where the strings are stretched at one end by distinct weights. He 99

100

Strand IV: Fractions in the Pythagorean Scale

concluded that sequences of tones based on low-integer relationships produced pleasing harmony. In particular, any musician will notice that changing the length of a vibrating string will change its tone. For example, consider a violin whose strings have unit length, with endpoints 0 and 1. Let 𝑡 be a real number between 0 and 1. We say that a string is clamped at 𝑡 if the string has been depressed (onto the violin’s fingerboard) at point 𝑡.1 Hence an unclamped string is a string that has not been 1 clamped anywhere between 0 and 1. When a vibrating string is clamped at 𝑡 = , 2 the sound produced has twice the frequency (an octave above) of the unclamped string. In general, when the string is depressed at 𝑡, the sound produced by the 1 vibrating string of length 𝑡 is times the frequency of the unclamped string. 𝑡 Let 𝑋0 be the unit tone, tone 1 (frequency 1), produced by an unclamped string of length 1. (Throughout this discussion, the string is held at constant tension.) Let 𝑋1 be tone 2 (frequency 2), produced by the unclamped string of 1 length . Pythagoras wished to design a finite sequence of tones 𝑋0 = 𝑌1 < 𝑌2 < 2 ⋯ < 𝑌𝑛 = 𝑋1 so that, like successive rungs in a ladder, the notes from 𝑌1 through 𝑌𝑛 sound as if they progress in equal increments from the unit tone to an octave above the unit tone. Pythagoras devised a method to approximate this idea where 𝑛 = 13 so that the tones progress in 12 increments. Before we show how he did this, we first pause to discuss how we might name these thirteen notes.

A note-naming interlude Consider the tone progression 𝑤 = 𝑌1 -𝑌2 -𝑌3 -𝑌4 -𝑌5 -𝑌6 -𝑌7 -𝑌8 -𝑌9 -𝑌10 -𝑌11- 𝑌12 -𝑌13 . How could Pythagoras have given them more appealing names? Since the first and the last notes of 𝑤 should be named the same because they are an octave apart, Pythagoras has twelve notes to name. He could have used the first twelve letters of the Greek alphabet. So his twelve notes followed by 𝛼 would be 𝛼, 𝛽, 𝛾, 𝛿, 𝜖, 𝜁, 𝜂, 𝜃, 𝜄, 𝜅, 𝜆, 𝜇, 𝛼. This progression of notes forms what is called a chromatic scale (where the last 𝛼 is played an octave above the initial 𝛼). Because the last note is twice the frequency of the first note, such a scale sounds like a complete musical phrase to the human ear. However, other progressions that skip some notes also form complete musical phrases to the ear. In Pythagoras’s day, the customary progression choice was what we call a minor scale. The sequence of notes in the minor scale starting with 𝛼 are 𝛼, 𝛾, 𝛿, 𝜁, 𝜃, 𝜄, 𝜆, 𝛼, which we rename as the sequence 𝐴, 𝐵, 𝐶, 𝐷, 𝐸, 𝐹, 𝐺, 𝐴. See Table 1. By 1 To produce a somewhat ethereal effect, a violinist sometimes lightly touches a vibrating string at various places rather than clamping the string.

A note-naming interlude

101

Figure 2. A fingerboard to infinity, pencil sketch by author. custom, the notes in the chromatic scale not included in this 𝐴 minor scale are called 𝛽 ≡ 𝐴♯ , 𝜖 = 𝐶 ♯ , 𝜂 ≡ 𝐷♯ , 𝜅 ≡ 𝐹 ♯ , and 𝜇 ≡ 𝐺 ♯ . Thus a minor scale has eight notes, and the first and last notes are an octave apart.2 Since the chromatic scale has thirteen notes, there are twelve intervals between them. By custom, each of these intervals is called a half-step or a semitone. Thus, a scale spans six steps. When some people hear an 𝐴 minor scale, they may experience a melancholy, reflective, somber, or spooky sensation. As musical tastes changed over the years from Pythagoras’s day, musicians tended to favor scales that induced a 2 When a musician composes a melody in a certain key, the notes in the melody are restricted to the eight notes in that key or scale. To interject tension or surprise into the melody, the composer sometimes inserts one or more of the other five notes. By custom, these notes are called accidentals, a word vaguely suggesting that they have been included in the piece by serendipitous accident.

102

Strand IV: Fractions in the Pythagorean Scale Table 1. Naming the notes in the Pythagorean chromatic scale.

Pythagorean notes 𝛼 𝛽 Notes in 𝑎 minor 𝐴 Notes not in 𝑎 minor 𝐴♯

𝛾 𝐵

𝛿 𝐶

𝜖

𝜁 𝐷

𝐶♯

𝜂

𝜃 𝐸

𝜄 𝐹

𝐷♯

𝜅 𝐹♯

𝜆 𝐺

𝜇

𝛼 𝐴

𝐺♯

bright, sweet, festive sensation. With respect to the 𝐴 minor scale, they changed a few notes. The result was the 𝐴 major scale: 𝐴𝐵𝐶 ♯ 𝐷𝐸𝐹 ♯ 𝐺 ♯ 𝐴. The one major scale that uses no sharps is the 𝐶 major scale: 𝐶𝐷𝐸𝐹𝐺𝐴𝐵𝐶. Its progression of semitones and whole tones from 𝐶 to 𝐶 is exactly the same as the progression of semitones and whole tones from 𝐴 to 𝐴 in the 𝐴 major scale. It is for this reason (the absence of sharps) that we identify tone 1 (𝑋0 ) with 𝐶0 , or simply 𝐶, and tone 2 (𝑋1 ) with 𝐶1 (equivalently, frequency 1 with 𝐶0 and frequency 2 with 𝐶1 ) for the rest of this strand and for Strand V.

How Pythagoras generated his scale To add a note to his scale between 𝐶0 and 𝐶1 , Pythagoras experimented with the 2 tone produced when the string of length vibrates, so generating a tone with 3

3

frequency . He discovered that this tone played together with 𝐶0 produced a 2

3

pleasant-sounding chord. What note is this one? Since tone is half-way between 2 tone 1 and tone 2, perhaps it should be the note in the middle of the thirteennote chromatic scale. In the key of 𝐶 major, the middle note is 𝐹 ♯ (because 𝐹 ♯ is the middle note of the chromatic 𝐶 scale, 𝐶𝐶 ♯ 𝐷𝐷♯ 𝐸𝐹𝐹 ♯ 𝐺𝐺 ♯ 𝐴𝐴♯ 𝐵𝐶). But no. 3 The note with frequency belongs in the 𝐶 major scale because it sounds so 2

3

well when played with 𝐶. So custom has dictated that this note of frequency 2 is 𝐺. The sound of 𝐶 and 𝐺 played together, denoted as chord 𝐶𝐺, is called a fifth because there are five notes of the 𝐶 major scale between 𝐶 and 𝐺. From this fifth, Pythagoras generated other fifths, creating a circle of fifths as shown in Figure 3, progressing counterclockwise to higher and higher tones about the circle. Alternatively, proceeding clockwise to higher and higher tones about this same circle gives a succession of fourths such as 𝐶𝐹. One way to generate the fractions corresponding to the notes of the Pythagorean scale is to use a geometric sequence and equivalence classes. We say that tones 𝑋 and 𝑌 are equivalent to one another if the ratio of their frequencies is some integer power of two. For example, the tones equivalent to 𝐶 are all those tones obtained by successively doubling or halving the frequency of 𝐶. We say

How Pythagoras generated his scale

103

C

G

F

ction of dire th f f o o u le

A

directi

E B

A#

ircle of fifth s ec s rth

of circ on

D

D# G#

C#

F#

Figure 3. The circle of fourths and fifths. that 𝑋 is the canonical member of its equivalence class if 1 ≤ 𝑋 < 2. Thus 𝐶 is canonical whereas 𝐶1 is not canonical. How can we generate other tones or notes in a scale starting from 𝐶? Since multiplying 𝐶 by any power of two will merely give us a note equivalent to 𝐶, we could try multiplying 𝐶 by powers of some prime 𝑝 other than two. The Pythagoreans chose 𝑝 = 3. Starting with 𝐶 = 1, we let 𝑌 = 3𝑛 𝐶 = 3𝑛 where 𝑛 is any integer, and then find the canonical tone equivalent to 𝑌 . Thus the other tones in the Pythagorean scale, for −4 ≤ 𝑛 ≤ 7, were taken as 1 1 1 1 (1) , , , , 1, 3, 9, 27, 81, 243, 729, 2187, 81 27 9 3 whose canonical tones are 128 32 16 4 3 9 27 81 243 729 2187 , , , , 1, , , , , , , . (2) 81 27 9 3 2 8 16 64 128 512 2048 The reason we terminate the progression in (1) is because the canonical tones given by (2) provide a more or less uniformly spaced progression of sounds from tone 1 to tone 2. To recap and to arrange (2) in increasing order, the uniformly spaced progression of twelve sounds from the chromatic Pythagorean scale are given in Table 2. Table 2. The Pythagorean scale. Tone Ratio 𝑋 Cents 𝑔(𝑋)

𝐶

𝐶♯

𝐷

𝐷♯

𝐸

𝐹

𝐹♯

𝐺

𝐺♯

𝐴

𝐴♯

𝐵

𝐶1

1

2187

9

32

81

4

729

3

128

27

16

243

2

1

2048

8

27

64

3

512

2

81

16

9

128

1

0

113.7

203.9

294.1

407.8

498

611.7

702

792.2

905.9

996.1

1109.8

1200

104

Strand IV: Fractions in the Pythagorean Scale

Now suppose we wished a musical scale to be equally tempered. That is, we want the ratio 𝐶 ♯ /𝐶 to be the same as the ratio 𝐷/𝐶 ♯ , and so on. Then 𝐷 = (𝐶 ♯ /𝐶)2 𝐶. Similarly, 𝐷♯ = (𝐶 ♯ /𝐶)3 𝐶, and so on—an exponential relationship. Since there are twelve half-steps in the chromatic scale from 𝐶 to 𝐶1 , the musical world has opted to define this exponential relationship as 𝑓 ∶ [0, 1200] → [1, 2] with 𝑓(𝑥) = 2𝑥/1200 where 𝑥 is measured in cents. Thus the cent value associated with 𝐶 is 0, and the cent value associated with an octave above 𝐶 is 1200. To create an even-tempered scale, we would choose 𝑓(0) = 𝐶, 𝑓(100) = 𝐶 ♯ , 𝑓(200) = 𝐷, and so on, so that 𝑓(1200) = 𝐶1 . Let 𝑋 = 𝑓(𝑥), the tone at parameter 𝑥. Inverting this exponential gives the cent value of the tone 𝑋 as 𝑔(𝑋) = 1200 log2 𝑋. 1

Thus, for example, 𝑔(𝐷) = 1200 log2 (𝐷) = 1200 log2 (𝑓(200)) = 1200 log2 (2 6 ) = 200, and so on. However, the Pythagorean scale is not equally tempered, as illustrated in Table 2 and Figure 4. Most of the notes of the scale, indicated as dots in the figure, are not quite on the logarithmic curve 𝑦 = 1200 log2 (𝑋), and the cent values for Pythagorean notes fail to be at integer multiples of 100 except at 𝐶 and 𝐶1 .

12 hundreds of cents, x

B

C1

A G

6 E 2

F

D C

1.2

1.6

2

X

Figure 4. The Pythagorean scale arranged along a logarithmic curve 𝑔(𝑋) = 1200 log2 𝑋. A perfect fifth should span 𝑔(1.5) ≈ 702.0 cents. Thus 𝐶𝐺 is a perfect fifth. But not all Pythagorean fifths are perfect. For example, from Table 2, chord 𝐶 ♯ 𝐺 ♯ spans 792.2−113.7 = 678.5 cents. Table 3 lists the spans of all Pythagorean fifths. An elegant feature of the Pythagorean scale is that all complementary fourths and fifths are nearly ideal. That is, the ideal span of a fourth and its corresponding

How Pythagoras generated his scale

105

Table 3. Spans of Pythagorean fifths. Chord 𝐶𝐺 𝐶 ♯ 𝐺 ♯ Cents 702 678.5

𝐷𝐴 𝐷♯ 𝐴♯ 702 702

𝐸𝐵 𝐹𝐶 𝐹 ♯ 𝐶 ♯ 702 702 702

𝐺𝐷 701.9

𝐺 ♯ 𝐷♯ 𝐴𝐸 701.9 701.9

𝐴♯ 𝐹 𝐵𝐹 ♯ 701.9 701.9

fifth together should be near 1200 cents. For example, the fifth CG has span 702 and the fourth G𝐶1 has span 498, giving a total of 1200. Unfortunately, as we explore in Strand V, the Pythagorean scale fails to produce good-sounding thirds. So other scales eventually surpassed it in popularity. One particular chord is especially dissonant. Sometimes called the devil’s chord, it is between a fourth and fifth. This tri-tone spans six half-steps, and is often used in jazz music to create tension. A perfect devil’s chord such as 𝐶𝐹 ♯ has ratio √2, and its span is 𝑔(√2) = 600 cents. Legends say that the early Pythagoreans believed all numbers to be rational. And it is said that a certain Pythagorean, perhaps Hippasus of Metapontum, who lived about a century after Pythagoras, demonstrated that √2 cannot be rational3 and was shortly thereafter drowned for the impropriety. Perhaps Hippasus hummed snatches of tri-tone progressions while awaiting his judges’ deliberations, further sealing his doom. As a final curiosity regarding the Pythagorean scale, Pliny the Elder in Book II of his Natural History (first century ad) says that Pythagoras used the semitones of his scale to describe the distances between heavenly bodies, designating the distance between Earth and the Moon as a whole tone, that between the Moon and Mercury a semitone, between Mercury and Venus the same, between her and the Sun a tone and a half [a minor triad], between the Sun and Mars a tone, between Mars and Jupiter half a tone, between Jupiter and Saturn half a tone, between Saturn and the Zodiac a tone and a half: the seven tones thus producing the so-called diapason, a universal harmony, the music of the spheres.4

3 Here is his argument: suppose that √2 = 𝑚/𝑛 where 𝑚 and 𝑛 are relatively prime integers; then 2𝑛2 = 𝑚2 which means that 2|𝑚. Thus 2𝑛2 = 4𝑘2 for some integer 𝑘, which means that 2|𝑛, a contradiction. 4 In the Ptolemaic system of the universe, the planets, Sun, and stars circled Earth. Earth itself was fixed in space. As the heavenly bodies circled, they sang, at least according to the poets. Alternatively, with respect to the woodcut of Figure I.1, the rotating wheels and gears of this abstract clockwork universe, as they rubbed against each other, made universal harmony.

Chapter IV: A Tree of Fractions In previous chapters we have seen that mathematical structure is often defined in terms of simpler mathematical structures. The integer 1 is the successor of 0, 2 is the successor of 1, and so on, thus creating the natural numbers ℕ. Each successor is never the same as any of its predecessors. In like fashion, the principle of mathematical induction allows us to prove an infinitude of sufficiently related statements. For example, in Chapter III, we established Euclid’s algorithm to compute the greatest common divisor of any two positive integers via induction. In this chapter, we construct the set of rational numbers, ℚ, where ℚ={

𝑚 𝑚, 𝑛 ∈ ℤ, and 𝑛 ≠ 0}. 𝑛

|

To do so, we focus on the rational numbers 𝑚

𝑚 𝑛

(3)

between 0 and 1. Any other rational

number 𝑞 is of the form 𝑞 = 𝑘 + , where 𝑘 ∈ ℤ and 0 ≤ 𝑚 < 𝑛. 𝑛 Historically, we will see that the Egyptians developed their fractions directly from the positive integers. For every 𝑛 ∈ ℤ+ , they defined the unitary fraction 1 . We explore how they then could write any fraction as a non-repeating sum of 𝑛 these natural fractions. To generate the rational numbers, we could be content 1 with Equation (3). But this method creates many duplicates. For example, 2

2

3

occurs infinitely often as , , and so on. We seek a more lean method, one that 4 6 will generate fractions only in reduced form. To do so, we start with 0 and 1, and 1 generate the first true fraction , a fraction that does not belong to ℕ. Observe that 2

0 and 1 are the two numbers simpler than

1 2

1

that are nearest to . For this reason, 1

2

we might say that 0 and 1 are the parents of . In like fashion, each fraction 2 between 0 and 1 has two parents. How can we do this? To ask the question in reverse, given a rational number between 0 and 1, how can we find its parents? Establishing a way to accomplish these genealogical feats is the purpose of this 107

108

Chapter IV: A Tree of Fractions

chapter. As we will see in Chapter IX, this genealogical structure of the rational numbers is the key idea in the construction of continued fractions.

Unitary fractions in ancient Egypt A far older list of mathematical problems than those of Diophantus comes from the Rhind Papyrus,5 dating to about 1650 bc. The document begins: This book was copied in regnal year 33, month four of Akhet, under the majesty of the King of Upper and Lower Egypt, Awserre, given life from an ancient copy made in the time of the King of Upper and Lower Egypt, Nimaatre. The scribe Ahmose writes this copy. So this original list of problems is about 4000 years old, perhaps older. In keeping with the Egyptian tradition of writing parts of a whole as the sum of distinct 1 unitary fractions—fractions of the form where 𝑛 ∈ ℤ+ —the Rhind Papyrus 𝑛

2

opens with a table, which we reproduce as Table 4, on how to decompose 2𝑛+1 as a non-repeating unitary sum, for 1 ≤ 𝑛 ≤ 50. The remainder of the papyrus is a list of 84 exercises, mostly involving the manipulation of fractions. 2

Table 4. Rewriting as a sum of unitary fractions, where 𝑛 is 𝑛 an odd integer with 3 ≤ 𝑛 ≤ 101. 𝑛: parts 3: 2-6 5: 3-15 7: 4-28 9: 6-18 11: 6-66 13: 8-52-104 15: 10-30 17: 12-51-68 19: 12-76-114 21: 14-42

𝑛: parts 23: 12-276 25: 15-75 27: 18-54 29: 24-58-174-232 31: 20-124-155 33: 22-66 35: 30-42 37: 24-111-296 39: 26-78 41: 24-246-328

𝑛: parts 43: 42-86-129-301 45: 30-90 47: 30-141-470 49: 28-196 51: 34-102 53: 30-318-795 55: 30-330 57: 38-114 59: 36-236-531 61: 40-244-488-610

𝑛: parts 63: 42-126 65: 39-195 67: 40-335-536 69: 46-138 71: 40-568-710 73: 60-219-292-365 75: 50-150 77: 44-308 79: 60-237-316-790 81: 54-162 2

1

𝑛: parts 83: 60-332-415-498 85: 51-255 87: 58-174 89: 60-356-534-890 91: 70-130 93: 62-186 95: 60-380-570 97: 56-679-776 99: 66-198 101: 101-202-303-606 1

To decipher Table 4, the first entry, 3: 2-6, means = + , and so on. Some 3 2 6 of these decompositions can be obtained from the formula 2 2 𝑚+1 2 2 = ⋅ = + , (4) 𝑚𝑛 𝑚+1 𝑚𝑛 𝑛(𝑚 + 1) 𝑚𝑛(𝑚 + 1) where both 𝑚 and 𝑛 are odd integers. Since 𝑚 is an odd integer, both of the last two fractions in Equation (4) reduce to unitary fractions. 5 Alexander

Henry Rhind purchased this old Egyptian manuscript in Luxor in 1858.

Unitary fractions in ancient Egypt

109

Example 1: Decomposing fractions into non-equal unitary fractions. To 2 decompose this way, let 𝑚 = 5 and 𝑛 = 13 in Equation (4). Then 65

2 2 2 2 1 1 = = + = + . 65 5 ⋅ 13 6 ⋅ 13 6 ⋅ 5 ⋅ 13 39 195 2

However, decomposing via Equation (4) with 𝑚 = 41 and 𝑛 = 1 gives a differ41 ent solution than Table 4: 2 1 1 = + . 41 21 861 As a final decomposition problem, how might the scribe Ahmose have decom23 23 posed ? Since 51 = 3 ⋅ 17, Ahmose might rewrite as 51

51

23 6 + 17 6 17 1 2 = = + = + . 51 3 ⋅ 17 3 ⋅ 17 3 ⋅ 17 3 17 Ahmose then consults Table 4 for the entry 17: 12-51-68, and so 1 1 1 1 23 = + + + . 51 3 12 51 68

♢

Figure 5. Problem 80 of the Rhind Papyrus, author sketch. To illustrate the notation used by the Egyptians, Figure 5 is a sketch of Problem 80 from the Rhind Papyrus. The first line of the text translates as: “With respect to the vessels used by the clerks in the granary, here is how to take a succession of halves”—which is the eye of Horus algorithm using the geometric 1 1 sequence 1, , , … [27]. The scribe illustrates this algorithm starting with 1 hekat 2 4 of grain, which is equal to ten henu of grain, half of which is five, and so on, as annotated in the figure. The hieroglyphic symbol for one is |, the symbol for ten 1 is ∩, and the symbol for is . The symbol to denote a unitary fraction is ; 2

thus, to denote one-tenth, scribes wrote ∩ , and so on.

110

Chapter IV: A Tree of Fractions

Example 2: Dividing 700 loaves of bread. Problem 63 of the Rhind Papyrus requires that we divide 700 loaves of bread among four people in the sequential 2 1 1 1 proportions6 ∶ ∶ ∶ . Asking questions in this manner is a clever way to ask for

𝑚 𝑛

3

2

3

4

of something without explicitly asking. That is, let the four people’s

proportions be 𝑤, 𝑥, 𝑦, and 𝑧. Then 4 3

, which means that 𝑤 = 𝑥=

4𝑥 3

,𝑥=

𝑤

3𝑦 2

𝑥

2 1

4

𝑥

3 2

3 4𝑧

𝑦

= / = ,

, and 𝑦 =

3𝑦 3 4𝑧 = ⋅ = 2𝑧 2 2 3

3

1 1

3

𝑦

2 3

2

𝑧

= / = , and

1 1

= / = 3 4

. Back-substituting gives

and

𝑤=

4 8𝑧 ⋅ 2𝑧 = . 3 3

Therefore

8𝑧 4𝑧 + 2𝑧 + + 𝑧 = 7𝑧, 3 3 2 1 which means 𝑧 = 100, 𝑦 = 133 , 𝑥 = 200, and 𝑤 = 266 loaves. 700 = 𝑤 + 𝑥 + 𝑦 + 𝑧 = 3

♢

3

A continued fraction tradition The Egyptian tradition of unitary fractions has been kept alive through the ages in continued fractions—because the unitary fraction is the basic building block of simple continued fractions. An early example of a finite simple continued fraction comes from Archimedes. Example 3: An Archimedean continued fraction. In On the Measurement of 1351 a Circle (circa third century bc), Archimedes represents the fraction as 780

⎛ 1351 1⎜ 1 = ⎜5 + 1 780 3⎜ 5+ ⎜ 1 10 + ⎝ 10

⎞ ⎟ ⎟. ⎟ ⎟ ⎠ 1351 1 = [5; 5, 10, 10]. In terms of Definition 1 of the Introduction on p. xiii, 780 3

♢

Puzzle 4: A golden continued fraction. Let 𝐶𝑛 = [1; 1, 1, 1, 1, … , 1], a list containing 𝑛 partial denominators of 1 after the semicolon. Let 𝐶 = lim 𝐶𝑛 , an 𝑛→∞

infinite simple continued fraction. Can you guess the value of 𝐶? As a hint, the first few convergents of 𝐶 simplify to the following fractions: 1 3 1 5 1 8 1 13 1+ = , 1+ = , 1+ = . 1 1 1 = 3, 1 + 1+1 2 5 8 1+ 1+ 1+ 1 1 1+

1+1

6 The

2

Egyptians allowed the use of the fraction . 3

1+1

1+

1+

1 1+1

Farey sequences

111 ♢

A solution to this puzzle appears in Example VI.30.

Farey sequences In this section, our goal is to see how we can generate all fractions between 0 and 1 from more elementary fractions, starting with 0 and 1. Towards this end we need some terminology. Definition 5: Fraction terminology. We say that any fraction that is not an 0 1 integer is a true fraction.7 When we write 0 and 1 as fractions, we use and , 1 1 respectively. For the remainder of this chapter, unless we specify otherwise, when 𝑎 we refer to any true fraction we mean that gcd(𝑎, 𝑏) = 1 and 𝑎 < 𝑏. Thus every fraction

𝑝 𝑞

𝑏

we encounter in this chapter, unless specified otherwise, will belong to

the unit interval [0, 1] and be in reduced form. We say that than 𝑎

𝑎 𝑏 𝑐

if 𝑑 < 𝑏; when 𝑏 = 𝑑,

𝑐 𝑑

𝑎

is simpler than

𝑏

𝑐 𝑑

is a simpler fraction

if 0 < 𝑐 < 𝑎. When we write

= we mean that 𝑎 = 𝑐 and 𝑏 = 𝑑 as well as acknowledging that the two 𝑑 fractions have the same value. 𝑏

When we encounter a possible non-reduced fraction relabel it as 𝑎 𝑏

𝑎 𝑏

where

, we mean that

𝑎 𝑏

𝑎 𝑏

𝑝

𝑝 𝑞

in this chapter, we

is the reduced form of . When we refer to the fraction 𝑞

is in reduced form from the equivalence class of all fractions 𝑎

whose value is the same as the value of . 𝑏

2

Example 6: Reduced form. To illustrate Definition 5, the fraction is reduced, 4

37

3

5

whereas the fraction is non-reduced. The fraction is simpler than because 6 42 91 both are reduced fractions and 42 < 91. ♢ Definition 7: The mediant. We say that the mediant of two fractions denoted by

𝑎 𝑏

𝑐

𝑎+𝑐

𝑑

𝑏+𝑑

⊕ , is

𝑎 𝑏

𝑐

and , 𝑑

.

Example 8: An application of the mediant. The mediant operator is often used when awarding partial credit for an answer to a multi-part question on an exam. For example, suppose a multi-part exercise has two parts, 𝐴 and 𝐵. According to a pre-devised rubric, part 𝐴 is worth five points and part 𝐵 is worth four points. If a student earns three points on part 𝐴 and two points on part 𝐵, 3 2 then that student has earned the mediant score ⊕ , or five of nine points. ♢ 5

7 We

4

could refer to 0 and 1 as untrue fractions or pseudo fractions, but never false fractions.

112

Chapter IV: A Tree of Fractions

The next definition honors John Farey (1766–1826), a geologist who used the mediant while studying sound propagation through matter. Definition 9: Farey sequences. The Farey sequence of order 𝑛, denoted by ℱ𝑛 , is the set, in ascending order, of all (reduced) fractions in [0, 1] whose denomi𝑐 𝑎 nators are at most 𝑛. Furthermore, we say that and are adjacent fractions 𝑏 𝑑 or neighbors if they are adjacent fractions in some Farey sequence ℱ𝑛 , where 𝑎 𝑐 𝑎 𝑐 𝑎 𝑎, 𝑏, 𝑐, 𝑑, ∈ ℕ and 𝑛 > 0. If and are neighbors and < , then is the left-hand neighbor of

𝑐 𝑑

and

𝑐 𝑑

𝑏

𝑑

𝑏

𝑎

𝑑

𝑏

is the right-hand neighbor of . 𝑏

The first few Farey sequences are 0 1 0 1 1 ℱ1 = { , } , ℱ2 = { , , } , 1 1 1 2 1 0 1 1 1 2 1 3 2 3 ℱ5 = { , , , , , , , , , 1 5 4 3 5 2 5 3 4

0 1 1 2 1 ℱ3 = { , , , , } , 1 3 2 3 1 4 1 0 1 1 , } , ℱ6 = { , , , 5 1 1 6 5

0 1 1 1 2 3 1 ℱ4 = { , , , , , , } , 1 4 3 2 3 4 1 1 1 2 1 3 2 3 4 5 1 , , , , , , , , , }. 4 3 5 2 5 3 4 5 6 1 𝑎

𝑐

To generate ℱ𝑛+1 from ℱ𝑛 , for each pair of adjacent fractions and in ℱ𝑛 𝑏 𝑑 where 𝑏 + 𝑑 = 𝑛 + 1, insert their mediant between them. As we show in Proposition 14, the mediant is already in reduced form and lies between the two fractions. To establish this result, we use a little algebra. Definition 10: Cardinality. Let 𝑆 be a set consisting of a finite number 𝑛 of elements, where 𝑛 is a nonnegative integer. We say that the cardinality of 𝑆, denoted by |𝑆|, is 𝑛. Recall from Definition I.16 that 𝜙(𝑛) is the number of positive integers less than or equal to 𝑛 that are relatively prime to 𝑛. 𝑛

Proposition 11: Farey sequence size.8 For any 𝑛 ∈ ℤ+ , |ℱ𝑛 | = 1 + ∑ 𝜙(𝑘). 𝑘=1

Proof. Observe that the proposition is true when 𝑛 = 1 because |ℱ1 | = |{0, 1}| = 2 = 1+𝜙(1). Assume that the proposition is true for a given positive integer 𝑛, and show that this implies the proposition is true for 𝑛 + 1. If gcd(𝑘, 𝑛 + 1) > 1 then 𝑘 is not in reduced form. It can be reduced to a true fraction with denominator 𝑛+1 less than 𝑛 + 1. This reduced fraction is already in ℱ𝑛 . If gcd(𝑘, 𝑛 + 1) = 1 and 𝑘 𝑘 1 ≤ 𝑘 ≤ 𝑛, then is a new fraction; that is, ∉ ℱ𝑛 . Thus 𝑛+1

𝑛+1

| | 𝑘 gcd(𝑘, 𝑛 + 1) = 1, 1 ≤ 𝑘 ≤ 𝑛}|| = |ℱ𝑛 | + 𝜙(𝑛 + 1). |ℱ𝑛+1 | = |ℱ𝑛 | + ||{ 𝑛+1

|

8 Propositions

11 through 15 can also be found in [103, pp. 255–268].

Farey sequences

113

𝑛+1

So |ℱ𝑛+1 | = 1 + ∑ 𝜙(𝑘), making the proposition true by induction for all 𝑛. 𝑘=1

Proposition 12: Farey denominators. No two adjacent Farey fractions have 0 1 the same denominator except and . 1

𝑝

Proof. Let

𝑛

Definition 5,

and 𝑝 𝑛

𝑞 𝑛

and

1

belong to ℱ𝑛 , where 𝑛, 𝑝, 𝑞 ∈ ℕ and 𝑛 ≥ 2. With respect to 𝑞 𝑛

are in reduced form. Without loss of generality, we assume 𝑎

that 𝑝 < 𝑞 < 𝑛, which means that 𝑛 − 𝑝 ≥ 2. Let 𝑎

Since 𝑏 ≤ 𝑛 − 1, we have 𝑝 𝑛

𝑏

𝑞

∈ ℱ𝑛 . Observe that

and , being separated by 𝑛

𝑎 𝑏

𝑝 𝑛

𝑏

0.

> 0.

Proposition 14: Uniqueness of the in-between fraction. If 𝑎𝑑 − 𝑏𝑐 = ±1, 𝑎 𝑐 then ⊕ is the unique fraction (already in reduced form) from ℱ𝑏+𝑑 between 𝑏

𝑎

𝑑

𝑐

and and thus is the simplest fraction in that interval (not including the end𝑏 𝑑 points). 𝑎

𝑐

Proof. Observe first that ⊕ is already in reduced form because (𝑎 + 𝑐)𝑑 − (𝑏 + 𝑏 𝑑 𝑑)𝑐 = 𝑎𝑑 − 𝑏𝑑 = ±1, which means that 𝑎 + 𝑐 and 𝑏 + 𝑑 are relatively prime. 𝑝 𝑎 𝑐 𝑐 𝑎 Let be a fraction between and . Without loss of generality, let < . 𝑞

Consider the case where 𝑎+𝑐 𝑏+𝑑

−

𝑐 𝑑

>

𝑝 𝑞

𝑐

𝑐 𝑑

0.81 ≈ 1 + − ln 2 = 𝑠2 . 2 Observe that 1 𝑛. An integer is said to be a regular number if its only prime divisors are 2, 3, or 5. If the numerator and denominator of 𝑥 (in reduced form) are both regular, then 𝑥 is said to be a super-regular number. Let 𝑥 be super1 1 regular; we say that the ordered pair (𝑥, ) is a reciprocal pair2 if 𝑥 > . 𝑥 𝑥 Any three positive real numbers 𝑎, 𝑏, and 𝑐 satisfying the equation 𝑎2 + 𝑏2 = 𝑐2 2 Scribes had access to standard tables giving the sexagesimal representations of the reciprocals of regular integers up to sixty-four, along with the outlier eighty-one, in, for example, tablet MLC 1670 [124, Figure 7].

The Babylonian number system

171

c

a

b

Figure 2. The diagonal rule, also called the Pythagorean theorem. form a Pythagorean triple,3 denoted by (𝑎, 𝑏, 𝑐). A Pythagorean triple (𝑎, 𝑏, 𝑐), where 𝑎, 𝑏, and 𝑐 are positive integers, is a primitive Pythagorean triple if the greatest common divisor of 𝑎, 𝑏, and 𝑐 is 1. 73

In this notation, is rendered ⟨1; 13⟩. For convenience, and to save space 60 in a table, we sometimes write the number ⟨𝑎0 ; 𝑎1 , 𝑎2 , 𝑎3 , … , 𝑎𝑘 ⟩ as a list of 𝑎𝑖 ’s separated by spaces: 𝑎0 𝑎1 𝑎2 … 𝑎𝑘 , allowing two decimal digits for each 𝑎𝑖 , 1 ≤ 𝑖 ≤ 𝑘. For example, the number on line thirteen of column IV of Plimpton 322 is 7 ▽ followed by a gap followed by 3 ▽ 4

5 ▽ , namely, ⟨ 0; 27 00 03 45 ⟩ or

▽

▽

2

the list 0 27 00 03 45. Its value is

27 60

+

0 602

+

3 603

45

+

=(

604

2

161 240

) . Note that

1 2

is a

1

fraction of order one, while , as shown below, has order infinity. 7

Example 2: A nonterminating sexagesimal fraction. To find the sexagesi1 mal representation of , we follow this outline: 7

1

1

7 1

7

• To find the number 𝑥 of sixtieths in , solve 𝑥 ≈ 8.57. Thus the number of 8

• After subtracting the number 𝑦 of 𝑦 ≈ 34.29. Thus,

60 1 602 1 7

from ’s in

1 7 1

105

1 60

’s in

• Continuing gives

1 7

60

for 𝑥. Doing so gives

is ⌊𝑥⌋ = 8.

we obtain a remainder: , solve the equation

1 105

1

−

7

=

8 60 𝑦

3600

=

1 105

. To find

. Doing so gives

in Babylonian starts out as ⟨0; 08 34⟩.

• To continue, the remainder now is gives 𝑧 ≈ 17.14. So

7

𝑥

=

1 7

1 105

−

34 3600

=

1 12600

. Solving

1 12600

=

𝑧 603

≈ ⟨0; 08 34 17⟩.

= ⟨0; 08 34 17 08 34 17 …⟩ = ⟨0; 08 34 17⟩.

3 Given a rectangle of side lengths 𝑎 and 𝑏 with diagonal 𝑑, as in Figure 2, scribes knew the diagonal rule: 𝑎2 + 𝑏2 = 𝑑2 . Mansfield [95, p. 7], citing a number of sources, says, “Evidence of the Diagonal rule can be found in quite a few tablets.” Thus, he concludes, we “now call [it] Pythagoras’ theorem,” or the Pythagorean theorem.

172

Strand VI: A Clay Tablet

Interestingly, Sachs [128, p. 152] deciphers a tablet cataloged as M10 where the 1 scribe showed that lies between ⟨0; 8, 34, 16, 59⟩ and ⟨0; 8, 34, 18⟩. ♢ 7

The following lemma gives a way to determine when a rational number has order 𝑛. Lemma 3: Fractions of order 𝑛. Let 𝑥 be a positive rational number. If 𝑥⋅60𝑛 = ⌊𝑥 ⋅ 60𝑛 ⌋, then 𝑥 is at most of order 𝑛, where ⌊𝑥⌋ is the floor function. Proof. Let 𝑥 = ⟨𝑎0 ; 𝑎1 𝑎2 …⟩. The fractional part 𝑟 of 𝑥 ⋅ 60𝑛 is 𝑟 = ⟨0; 𝑎𝑛+1 𝑎𝑛+2 …⟩. If the order of 𝑥 is more than 𝑛, then 𝑟 ≠ 0. However, if 𝑥 is at most of order 𝑛, then 𝑟 = 0, in which case 𝑥 ⋅ 60𝑛 = ⌊𝑥 ⋅ 60𝑛 ⌋.

The accepted transliteration of Plimpton 322 The data in Plimpton 322 has been interpreted as a table where each row involves two positive numbers 𝑎 and 𝑐 and where 𝑎 < 1 and 𝑎2 + 1 = 𝑐2 .

(1)

So the Pythagorean triple (𝑎, 1, 𝑐) can be viewed as the side lengths of a normalized triangle, or a normalized rectangle where 𝑎 is the short side, 1 is the long side, and 𝑐 is the hypotenuse of the triangle or the diagonal of the rectangle. Shown in Figure 1, parts of the tablet are damaged and unreadable. Nevertheless, this missing information can be recovered by understanding the undamaged part of the tablet. For convenience, we use the term scribe to refer to the tablet’s author. The heading of the tablet describes the contents of the table: the diagonal [hypotenuse]4 from which 1 is torn, so that the short side is found [124]. Beneath the heading are fifteen lines of information, broken into four columns. Since many, if not all, researchers who have examined the tablet concluded that the original tablet contained additional columns on the left-hand side, we use the convention of labeling the columns from right to left.5 Table 1 is a transliteration6 of Plimpton 322. This table contains two tables separated by a double vertical line. The left-hand part is a transition from Babylonian to the sexadecimal notation of Definition 1, and the right-hand part is a transition into decimal notation. For example, the first entry in column IV on the left is 59 00 15, which we write as 2

59 15 212415 14161 0 119 + = = + =( ) , 60 602 603 14400 120 603 4 Robson

uses the term “takiltum-square of the diagonal.” usual custom in Plimpton 322 articles is a left to right labeling of the columns. 6 This table incorporates the six well-recognized corrections of mostly copy or careless scribal errors. An analysis of each of these errors appears in Britton [17, pp. 524–526]. 5 The

The accepted transliteration of Plimpton 322

173

Table 1. A transliteration of Figure 1. IV

II

I

IV∗

2 49

1

119 ) ( 120

1 20 25

2

3367 ) ( 3456

III

59 00 15

1 59

56 56 58 14 50 06 15

56 07

2

2

2

55 07 41 15 33 45

1 16 41

1 50 49

3

4601 ( 4800 )

53 10 29 32 52 16

3 31 49

5 09 01

4

12709 ( 13500 )

48 54 01 40

1 05

1 37

5

( 65 ) 72

47 06 41 40

5 19

08 01

6

319 ) ( 360

43 11 56 28 26 40

38 11

59 01

7

2291 ( 2700 )

41 33 45 14 03 45

13 19

20 49

8

799 ( 960 )

38 33 36 36

8 01

12 49

9

481 ( 600 )

35 10 02 28 27 24 26 40

1 22 41

2 16 01

10

4961 ) ( 6480

1 15

11

( 43 )

2

2

2

2

2

2

2

33 45

45

29 21 54 02 15

27 59

48 49

12

1679 ( 2400 )

2 41

04 49

13

161 ( 240 )

25 48 51 35 06 40

29 31

53 49

14

1771 ( 2700 )

23 13 46 40

28

53

15

28 ) ( 45

27 00 03 45

2

2

2

2

2

III∗

𝑀

II∗

119

120

169

3367

3456

4825

4601

4800

6649

12709

13500

18541

65

72

97

319

360

481

2291

2700

3541

799

960

1249

481

600

769

4961

6480

8161

3

4

5

1679

2400

2929

161

240

289

1771

2700

3229

28

45

53

a result we place in row 1 and column IV∗ in the right-hand portion of the table.7 To obtain the last equality in the above equation, the scribe would need to re212415 duce8 and then take the square root of both numerator and denominator9 3 60 119

to obtain 7 Some

120

. We record the numerator 119 in column III∗ of the right-hand part

researchers, including Robson [124], suggest that each of the column IV numbers originally included a leading 1. Under this assumption, the entry in column IV would be 1 59 00 15, the 119 2 entry in column IV∗ would be 1 + ( ) by Equation (1), and so on. 120 8 Scribes had algorithms to scale two numbers by the same quantity [95, p. 7]. 9 Scribes knew how to take the square roots of large perfect squares [95, pp. 8–11].

174

Strand VI: A Clay Tablet

of the table, and, since 1⋅60+59 = 119, we write 1 59 in column III. Similarly, the 59 15 28561 169 sexadecimal number 1 59 00 15 is 1 + + 3 = = ( )2 . The Babylonian 60 60 14400 120 representation for 169 is 2 49, the entry in column II, whose value we also record in column II∗ as the decimal number 169. In an additional column 𝑀 between II∗ and III∗ we record the value 120.

Reciprocal pairs generate normalized Pythagorean triples The Babylonians had various geometrical algorithms, often of a cut-and-paste type, to calculate areas. To illustrate this custom, consider the next example. Example 4: Three squares in arithmetic progression. Exercise 3 from tablet MS 5112 is a puzzle concerning three squares: 3▽ 2

▽

▽

The sum of the areas of three squares is 2

and the sum of their

sides is ▽ . What are the sides? From a lifetime spent deciphering Babylonian mathematical phraseology, Friberg [50, pp. 318–319] clarifies: The square sides form an arithmetical progression. Furthermore the sum of the sides being ▽ means that the sum of their sides is 60. Let 𝑃, 𝑄, and 𝑅 be squares with respective side lengths 𝑝, 𝑞, and 𝑟 with 0 < 𝑟 < 𝑞 < 𝑝. Define 𝑑 so that 𝑝 − 𝑞 = 𝑑 = 𝑞 − 𝑟. So 60 = 𝑟 + 𝑞 + 𝑝 = 3𝑞. To recap, we know that 𝑟2 + 𝑞2 + 𝑝2 = 1380 + 20 = 1400 and 3𝑞 = 60; our goal is to find 𝑝, 𝑞, and 𝑟. Attacking this problem, a scribe might partition 𝑃 so that a copy of 𝑄 is cut symmetrically from its center as shown in Figure 3. Inside 𝑃 and outside this copy of 𝑄, cut out a small square 𝐷 at each of 𝑃’s vertices so that two opposing vertices of each copy of 𝐷 coincide with a vertex of 𝑃 and a vertex of 𝑄. Furthermore, adjacent to each of the four copies of 𝐷, cut out a copy of 𝐷 inside 𝑃 and outside 𝑄, both above and below 𝑄. Label the four remaining unnamed rectangular regions as two copies of 𝐸 and two copies of 𝐹; next, paste together these last four rectangles to encompass 𝑅, hence forming a square congruent to 𝑄. Since the area of four copies of 𝐷 is 𝑑2 , the sum of the areas of 𝑃, 𝑄, and 𝑅 is 3𝑞2 + 2𝑑2 = 1400. Because three copies of the side length 𝑞 make 60, we have 𝑞 = 20. Therefore two squares of side length 𝑑 together have area 200, which means that 𝑑 = 10. So 𝑟 = 10, 𝑞 = 20, and 𝑝 = 30. ♢ One way scribes found solutions to (1) was via a geometrical cut-and-paste algorithm involving reciprocal pairs.10 10 This solution scheme appears on clay tablet YBC 6967 as cited by both Neugebauer and Sachs [109, Plate. 17] and Robson [124, Figure 10].

Reciprocal pairs generate normalized Pythagorean triples

p

175

q r Square Q

Square P

D D

E

Square R

D D

F

F

D D

D D

E

DD

D D

E F

F

E

D D

D D

Figure 3. A Babylonian area problem from tablet MS 5112. Algorithm 5: Generating Pythagorean triples using reciprocal pairs (and 1 a single parameter). Let (𝑥, ) be a reciprocal pair. Babylonian scribes discov𝑥 ered a relationship equivalent to the equation 2

(𝑥 +

2

1 1 ) − (𝑥 − ) = 4, 𝑥 𝑥

1

1

1

1

2

𝑥

2

𝑥

(2)

which means that ( (𝑥 − ), 1, (𝑥 + )) is a Pythagorean triple. Proof. Scribes may have discovered Equation (2) by the cut-and-paste proofwithout-words of Figure 4. To put words to this proof: From a square of side

176

Strand VI: A Clay Tablet

x+1/x

−

1 x

x −1/x

x+1/x

x −1/x

1 x 1

x 1 x 1 x

x

1 x

1 1

1 x

1

Figure 4. An old Babylonian algorithm to generate Pythagorean triples. length 𝑥 +

1 𝑥

remove a square of side length 𝑥 −

1 𝑥

, leaving an L-shaped re-

gion.11 Partition this L-shaped region into five regions: three rectangles each 1 1 1 of area 𝑥 × = 1, one rectangle of area (𝑥 − ) × , and one rectangle of area 1 𝑥

1

𝑥

𝑥

𝑥

× . Paste together the two smaller rectangles to form a fourth rectangle of 𝑥

1

1

area 1. Thus the difference of (1 + )2 and (1 − )2 is 4, whereupon the desired 𝑥 𝑥 conclusion follows. Example 6: Generating a Pythagorean triple via Algorithm 5. Let 𝑥 = 2. 1 1 5 3 3 5 Let 𝑐 = (𝑥 + )/2, 𝑎 = (𝑥 − )/2, and 𝑏 = 1. Then 𝑐 = and 𝑎 = . So ( , 1, ) 𝑥 𝑥 4 4 4 4 is a three-tuple satisfying (1), and is similar to the Pythagorean triple (3, 4, 5) corresponding to the triple on line 11 of Plimpton 322, (40, 60, 75). ♢ 11 A suggestion for this L-shaped cut-and-paste construction appears in Britton [17, figure 10]; besides that reference, this proof-without-words figure may be new in the mathematical literature.

Reciprocal pairs generate normalized Pythagorean triples

177

1

Each entry in column IV of Figure 1 is ((𝑥 − )/2)2 for some rational number 𝑥 𝑥. To illustrate, let 𝑞 = ⟨0; 27 00 03 45⟩, the thirteenth number in column IV of Table 1. Solving 2

𝑞 = ((𝑥 − for 𝑥 > 0 gives 𝑥 =

15 8 1

1 )/2) 𝑥

1

= 2 − . In our number system, 𝑞 = 8 289 ( )2 . 240

5832225 604

= (

161 2 240

) .

Furthermore, ((𝑥 + )/2)2 = Accordingly, the entries on line thirteen, 𝑥 columns III and II, are 𝑎 = 161 and 𝑐 = 289. The first fifteen rows of Table 3 on p. 180 show that the generators listed in Table 2 do indeed generate the data of Plimpton 322. Table 2. The generator 𝑥 for each line of Plimpton 322. line

𝑥

1

2+

2

2+

3

2+

4

2+

5

2+

line 2 5 10 27 11 32 17 54 1 4

𝑥

6

2+

7

2+

8

2+

9

2+

10

2+

2 9 4 25 2 15 1 12 1 40

line

𝑥

11

2+0

12

2−

13

2−

14

2−

15

2−

2 25 1 8 1 27 1 5

An alternate way to generate Pythagorean triples appeared in about 300 bc as a lemma preceding Proposition 29 in Book X of Euclid’s Elements. Lemma 7: Euclid’s lemma for Pythagorean triples (using two parameters). Let 𝑢 and 𝑣 be relatively prime positive integers of opposite parity (one is odd and the other is even) with 𝑢 > 𝑣. Then the ordered triple (𝑎, 𝑏, 𝑐) is a primitive Pythagorean triple where 𝑎 = 𝑢2 − 𝑣 2 ,

𝑏 = 2𝑢𝑣,

𝑐 = 𝑢2 + 𝑣 2 .

(3)

Proof. Adapted from Kozlov [85], we offer a proof-without-words figure for Equation (3) in Figure 16, which is also outlined in Exercise 1e.12 12 To establish primitiveness (in which a scribe may be uninterested), here is a modern-day number theory approach: Since 𝑢 and 𝑣 are of opposite parity, both 𝑎 and 𝑐 are odd integers. Let 𝑑 be the greatest common divisor of 𝑎, 𝑏, and 𝑐. Since 𝑎 and 𝑐 are odd, 𝑑 is also odd. Since 𝑑|(𝑐 + 𝑎) and 𝑑|(𝑐 − 𝑎), we have 𝑑|(2𝑢2 ) and 𝑑|(2𝑣2 ). So 𝑑|𝑢2 and 𝑑|𝑣2 . Thus 𝑑 = 1, making (𝑎, 𝑏, 𝑐) a primitive Pythagorean triple.

178

Strand VI: A Clay Tablet

Although it is questionable whether the Babylonians knew Lemma 7, a few calculations using Algorithm 5 will soon lead to discovering Euclid’s method, as we observe in the next lemma. Lemma 8: Equivalence of the reciprocal pair algorithm and Euclid’s ᵆ lemma. Let 𝑥 = > 1 where 𝑢 and 𝑣 are relatively prime positive integers. 1

1

2 2

𝑥

𝑣

1

1

2 2

𝑥

Let 𝛼 = (𝑥 − ) and 𝛾 = (𝑥 + ). Then (2𝑢𝑣𝛼, 2𝑢𝑣, 2𝑢𝑣𝛾) is the Pythagorean triple (𝑢 − 𝑣2 , 2𝑢𝑣, 𝑢2 + 𝑣 ). 1

1

1

1

2

𝑥

2

𝑥

Proof. We know that ( (𝑥 − ), 1, (𝑥 + )) is a Pythagorean triple. Observe that 1 1 1 𝑢 𝑣 𝑢2 − 𝑣 2 1 1 1 𝑢 𝑣 𝑢2 + 𝑣 2 𝛼 = (𝑥 − ) = ( − ) = and 𝛾 = (𝑥 + ) = ( + ) = , 2 𝑥 2 𝑣 𝑢 2𝑢𝑣 2 𝑥 2 𝑣 𝑢 2𝑢𝑣 giving the desired result. 17

125

= , the generator for line 4 of To illustrate the lemma, let 𝑥 = 2 + 54 54 Plimpton 322 from Table 2. So 𝑢 = 125 and 𝑣 = 54. Notice that 𝑢 and 𝑣 are rela125 tively prime and that 2𝑢𝑣 = 13500. The generator 𝑥 = gives the Pythagorean 12709

18541

54

triple ( , 1, ), which when multiplied by 2𝑢𝑣 = 13500 gives the primi13500 13500 tive Pythagorean triple (12709, 13500, 18541).

Finding the realm of potential generators What rule did the scribe use to assemble the numbers in column IV in the order given? To investigate, we ask the scribe to be patient with us while we use the 1 1 1 1 computer to find potential generators 𝑥 for triples ( (𝑥 − ), 1, (𝑥 + )). After 2 𝑥 2 𝑥 doing so, we may be able to see from a global perspective what the scribe probably discovered by a trial-and-error local perspective. Recall the terms order of a number and regular number from Definition 1. Observe that the fraction of greatest order in column IV of Table 1 offers a clue about our search space. The twelfth number in this column is of order eight. Since it is a perfect square, its square root is of order four. Let 𝑥 be a generator. 1 1 Since (𝑥− ) is the short side of a rectangle whose long side is 1, we have 0 < 𝑥− 1 𝑥

2

𝑥

< 2, which means that 1 < 𝑥 < 1+ √2 ≈ 2.41. Now we could hunt for all super-

regular numbers of at most order four between 1 and 1 + √2 whose reciprocals are also at most of order four, but the corresponding search space contains almost nineteen million (about 604 √2) fractions. To shorten our search for super-regular numbers among this multitude of fractions, we consider all fractions between 1 and 1 + √2 of at most order three whose reciprocals are at most order four. Thus, our search space is now less than a third of a million fractions.

Finding the realm of potential generators

179

If we consider all 𝑥 values with 1 < 𝑥 < 1 + √2, 𝑥 ≠ 2, and specify that 𝑥 must be written as a fraction of the form 𝑥 = 2 ± 𝑝/603 while 1/𝑥 must be written as a fraction of the form 𝑞/604 where 𝑝 and 𝑞 are positive integers, 0 < 𝑝 < 603 , and 0 < 𝑞 < 604 , then by use of Lemma 3 the possible values in decreasing order ˆ are negative) for ±𝑝/603 are the elements of magnitude (since the elements of 𝐵 ˆ∪𝐵 ˆ where of 𝐴 ˆ = { 2 , 10 , 11 , 17 , 38 , 1 , 2 , 49 , 4 , 2 , 7 , 1 , 6 , 1 } 𝐴 5 27 32 54 125 4 9 288 25 15 64 12 125 40 and ˆ = −{ 3 , 2 , 14 , 1 , 4 , 1 , 2 , 19 , 34 , 22 , 5 , 1 , 19 , 2 , 7 , 58 , 𝐵 64 25 135 8 27 5 9 72 125 75 16 3 50 5 16 125 1 14 239 14 26 19 11 13 2 67 88 18 47 3 4 , , , , , , , , , , , , , , , 2 27 432 25 45 32 18 20 3 96 125 25 64 4 5 22 53 91 106 194 7 8 527 23 14 23 122 79 , , , , , , , , , , , , }. 27 64 108 125 225 8 9 576 25 15 24 125 80 Since the Babylonian list of generators between 1 and 1 + √2 from Table 2 have denominators of at most 54, our scribe (by trial and error, as we discuss in the next section) may have found all super-regular numbers between 1 and 1 + √2 with denominator at most 60. Imposing this rule, we let 𝐴 be the list of elements ˆ whose denominator is less than 60, along with 0 as its last member. Let from 𝐴 ˆ whose denominator is less than 60. Then 𝐵 be the list of elements from 𝐵 2 10 11 17 1 2 4 2 1 1 𝐴={ , , , , , , , , , , 0} 5 27 32 54 4 9 25 15 12 40 and 𝐵 = −{

2 1 4 1 2 5 1 19 2 7 1 14 14 26 19 11 , , , } ∪ −{ , , , , , , , , , , , , 25 8 27 5 9 16 3 50 5 16 2 27 25 45 32 18 13 2 18 3 4 22 7 8 23 14 23 , , , , , , , , , , }. 20 3 25 4 5 27 8 9 25 15 24

With respect to Table 2, observe that the eleven terms values 2 + 𝑘

𝑘 𝑚

𝑘 𝑚

in 𝐴 correspond to the

that generate the first eleven terms in column IV, and the first four

terms − in 𝐵 correspond to the values 2 − 𝑚 fifteenth terms in column IV.

𝑘 𝑚

that generate the twelfth through

Example 9: Extending Plimpton 322. As we have seen, if we take the first fifteen generators of 𝐴 ∪ 𝐵, we generate the data in Plimpton 322. What about

180

Strand VI: A Clay Tablet Table 3. Extended Plimpton 322 table via reciprocal pairs, 1 1 𝑎 where (𝑥 − ) = . 2

𝑛 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38

𝑘/𝑚 2/5 10/27 11/32 17/54 1/4 2/9 4/25 2/15 1/12 1/40 0 −2/25 −1/8 −4/27 −1/5 −2/9 −5/16 −1/3 −19/50 −2/5 −7/16 −1/2 −14/27 −14/25 −26/45 −19/32 −11/18 −13/20 −2/3 −18/25 −3/4 −4/5 −22/27 −7/8 −8/9 −23/25 −14/15 −23/24

𝑥

𝑏

𝑥 = 2 + 𝑘/𝑚 12/5 64/27 75/32 125/54 9/4 20/9 54/25 32/15 25/12 81/40 2 48/25 15/8 50/27 9/5 16/9 27/16 5/3 81/50 8/5 25/16 3/2 40/27 36/25 64/45 45/32 25/18 27/20 4/3 32/25 5/4 6/5 32/27 9/8 10/9 27/25 16/15 25/24

(𝑎/𝑏)2 (119/120)2 ≈ 0.9917 (3367/3456)2 ≈ 0.9742 (4601/4800)2 ≈ 0.9585 (12709/13500)2 ≈ 0.9414 (65/72)2 ≈ 0.9028 (319/360)2 ≈ 0.8861 (2291/2700)2 ≈ 0.8485 (799/960)2 ≈ 0.8323 (481/600)2 ≈ 0.8017 (4961/6480)2 ≈ 0.7656 (3/4)2 = 0.7500 (1679/2400)2 ≈ 0.6996 (161/240)2 ≈ 0.6708 (1771/2700)2 ≈ 0.6559 (28/45)2 ≈ 0.6222 (175/288)2 ≈ 0.6076 (473/864)2 ≈ 0.5475 (8/15)2 ≈ 0.5333 (4061/8100)2 ≈ 0.5014 (39/80)2 ≈ 0.4875 (369/800)2 ≈ 0.4613 (5/12)2 ≈ 0.4167 (871/2160)2 ≈ 0.4032 (671/1800)2 ≈ 0.3728 (2071/5760)2 ≈ 0.3595 (1001/2880)2 ≈ 0.3476 (301/900)2 ≈ 0.3344 (329/1080)2 ≈ 0.3046 (7/24)2 ≈ 0.2917 (399/1600)2 ≈ 0.2494 (9/40)2 ≈ 0.2250 (11/60)2 ≈ 0.1833 (295/1728)2 ≈ 0.1707 (17/144)2 ≈ 0.1181 (19/180)2 ≈ 0.1056 (52/675)2 ≈ 0.07704 (31/480)2 ≈ 0.06458 (49/1200)2 ≈ 0.04083

2

𝑎 119 3367 4601 12709 65 319 2291 799 481 4961 3 1679 161 1771 28 175 473 8 4061 39 369 5 871 671 2071 1001 301 329 7 399 9 11 295 17 19 52 31 49

1

𝑏 120 3456 4800 13500 72 360 2700 960 600 6480 4 2400 240 2700 45 288 864 15 8100 80 800 12 2160 1800 5760 2880 900 1080 24 1600 40 60 1728 144 180 675 480 1200

175

𝑐 169 4825 6649 18541 97 481 3541 1249 769 8161 5 2929 289 3229 53 337 985 17 9061 89 881 13 2329 1921 6121 3049 949 1129 25 1649 41 61 1753 145 181 677 481 1201

the sixteenth generator, 𝑥 = 2 − ? Observe that (𝑥 − )/2 = , which gener9 𝑥 288 ates the Pythagorean triple (175, 288, 337). Thus, had the scribe chosen to extend the table, then the sixteenth line would probably have contained the cuneiform

How the scribe may have screened for generators

181

175

versions of ( )2 , 175, and 337. Generated by the method of reciprocal pairs, Ta288 ble 3 extends Plimpton 322 to thirty-eight lines, giving the same table as appears in, for example, [1], [17], [30, pp. 173–176], and [95], each of which used either Lemma 5, Lemma 7, or some combination of the two methods to generate the table. ♢

How the scribe may have screened for generators The regular integers greater than one and less than sixty form the set

{2, 3, 4, 5, 6, 8, 9, 10, 12, 15, 16, 18, 20, 24, 25, 27, 30, 32, 36, 40, 45, 48, 50, 54}. (4)

How would the scribe have found super-regular generators 𝑥 from (4)? The scribe probably began with 𝑥 = 2, the simplest non-trivial super-regular number. As we have seen, 𝑥 = 2 generates line 11 of Plimpton 322. Then the scribe may have searched for other super-regular numbers near 2. In particular, 𝑗 the scribe may have tried numbers of the form 𝑥 = 2 ± , where 𝑛 is a regular 𝑛 number from (4) and 𝑗 is an integer with 1 ≤ 𝑗 < 𝑛. Thus, finding super-regular generators is now a matter of trial and error. To begin this trial-and-error process, we consider the simplest case first, 𝑥 = 1 5 3 5 2 ± , yielding 𝑥 = and 𝑥 = . The former possibility, 𝑥 = , generates the 2

Pythagorean triple (

2 21

20

29

, 1,

20

2

); but then 3

21 20

2

is the long side of this triple rather

than 1. The latter possibility, 𝑥 = , generates line 22 of Table 3. 2

Now we try the hardest case, 𝑛 = 54, so that 𝑥 = 2 ± reciprocal of 𝑥 is

1 𝑥

=

54 108±𝑗

𝑗 54

for 1 ≤ 𝑗 < 54. The

. For which of these fractions is 108±𝑗 regular? We try

𝑗 = 53, giving denominators of 55 and 161, neither of which is regular. Next we try 𝑗 = 52, giving 56 and 160. The latter number 160 is regular. The scribe notes 54 27 that can be scaled to . This fraction is super-regular. But—the sexagesimal 160

1

80

fails to appear in the standard look-up table of reciprocals. (Recall form for 80 that the standard look-up tables give reciprocals for the numbers in (4) along with 64 and 81.) However, the scribe does not panic. He or she simply takes half the reciprocal of the regular number 40, and then scales by 27 to obtain the 40 sexagesimal form for the reciprocal of the super-regular number 𝑥 = . This 𝑥 27 value generates line 23 in Table 3.

182

Strand VI: A Clay Tablet

In like fashion, all of the other cases can be similarly handled.13 In the end, should the scribe have continued this process, a Plimpton-like table of thirtyeight lines would be generated when the generators are arranged in decreasing order. The reader may wish to contrast this approach (using reciprocal pairs expanded about 2) with the eleven-step approach (using a combination of the two generating methods) found in Mansfield and Wildberger [95, pp. 10–11].

The purpose of the tablet Robson [124] has proposed that Plimpton 322 was a teaching aid, a table of drill problems. Friberg [50, p. 448] concurs, explaining that students had hand tablets (like yesteryear’s hand slates or hand whiteboards in some of today’s classes) on which they wrote class lecture notes: and at the end of the school day the students took the hand tablets home, where they . . . filled in the details of the problems and . . . the solutions. Plimpton 322 . . . could obviously serve as a rich source of data for [these] hand tablets. These drill problems would include finding super-regular numbers 𝑥, inverting 1 1 𝑚 when 𝑛 is them, evaluating (𝑥 ± ), squaring them, simplifying a fraction 2 𝑥 𝑛 regular, scaling pairs of numbers, and finding the square roots of 𝑚 and 𝑛 given that both of them are perfect squares. About such a rationale, Mansfield and Wildberger [95] observe that some of Plimpton 322’s column IV numbers “are just too big to allow students to reasonably obtain the square roots of the quantities required.”14 Armed with this supposition, along with the understanding that the curvature of the fragment [of Plimpton 322] suggests that a third, or more, of the table has been lost. A reasonable estimate is that the missing third, or so, can have contained about four narrow columns [50, p. 424] it is natural and fitting to make conjectures. One of these is that Plimpton 322 was constructed to serve as a (angle-less) trigonometry table. Perhaps other extant, untranslated-as-yet tablets exist to support this claim. Meanwhile, the suggestion is enticing, and we leave the reader with a trigonometric application of Plimpton 322. 13 Regarding

1

the feasibility of a scribe writing any fraction in sexagesimal when 𝑛 is a regular 𝑛 number, Sachs [128, p. 151] reassures us, saying, “The reciprocals of all regular numbers, without exception, could be found [by a scribe].” 14 Many extant clay tablets contain surprises for mathematicians. For example, Friberg [50, pp. 456–459] deciphers tablet BM 34601 as calculating the sexagesimal expansion of 392 . What student would be expected to do this? Similarly, it could be that Plimpton 322, written in the midst of such a tradition, contained challenge exercises.

The purpose of the tablet

183

Example 10: Using Plimpton 322 as a trigonometry table. Suppose we are given two sides 𝛼 and 𝛽 of a rectangle (or a triangle) with 𝛼 < 𝛽. To use Table 1 to 𝛼 approximate the diagonal (or hypotenuse) 𝛾, first compute or approximate ( )2 . 𝑎

𝛽 𝛼

𝑏

𝛽

Find row 𝑗 of the table for which the entry ( )2 in column IV∗ is nearest to ( )2 . Let 𝑏𝑗 and 𝑐𝑗 be the entries in columns 𝑀 and II∗ , respectively, of row 𝑗. Then 𝛾≈

𝑐𝑗 𝛽 𝑏𝑗

. 𝛼

For example, with 𝛼 = 27 and 𝛽 = 31, ( ) ≈ 0.7575. Row 11 provides the 𝛽

nearest match. With 𝑏11 = 4 and 𝑐11 = 5, 𝛾 ≈ 𝛽𝑐11 /𝑏11 = 31 ⋅ 5/4 = 38.75. The actual value is 𝛾 = √𝛼2 + 𝛽 2 = √1690 ≈ 41.11 for a relative error of about 6%. Using a suggestion from Knuth [84], we let the reader and scribe interpolate15 to a better approximation. ♢

15 Mansfield and Wildberger [95] have found tablets suggesting that a Babylonian would be knowledgeable about interpolation.

Chapter VI: Families of Numbers In this chapter, we look for patterns within families of numbers. Given a data sequence 𝑎0 , 𝑎1 , 𝑎2 , …, can we find a formulation for 𝑎𝑛 ? The terms may be ordered in time, such as a sequence of eclipse dates, or ordered in increasing value, or apparently ordered higgledy-piggledy. Given a phenomenon such as primitive Pythagorean triples of Strand VI or how far a jeep can proceed into the desert on 𝑛 tanks of gas from Chapter V, can we generate the corresponding family and find recursive relationships such as 𝑎𝑛 = 𝑎𝑛−1 + 𝑎𝑛−2 among the neighbors in the family? In particular, we consider five families of numbers: primitive Pythagorean triples (again), binomial coefficients, Fibonacci numbers, Catalan numbers, and Ben-Hur numbers. Thousands of similar families exist.16 We have selected these particular families because they are somewhat representative of the others and because they are interesting in and of themselves. As a look ahead to Chapter IX, for a given irrational number 𝜔, we will see that continued fraction algorithms generate entire families {𝐶𝑛 }𝑛≥0 of good fractional approximations which converge to 𝜔 while satisfying the recursive formula 𝐶𝑛 = 𝑎𝑛 𝐶𝑛−1 ⊕ 𝐶𝑛−2 using the notation developed in Chapter IV, where integer 𝑎𝑛 is chosen so that 𝐶𝑛 is near 𝜔.

Primitive Pythagorean triples Puzzle 11: How many triples? As shown in the preceding strand, any primitive Pythagorean triple can be generated by two relatively prime positive integers 𝑢 and 𝑣, one of which is even. Let 𝑃𝑛 be the number of primitive Pythagorean triples where 𝑢 + 𝑣 = 2𝑛 + 1. 𝑃1 = 1 because the only solution to 𝑢 + 𝑣 = 3 with 𝑢 > 𝑣 > 0 is (𝑢, 𝑣) = (2, 1), which generates the triple (3, 4, 5). 𝑃2 = 2 because the only solutions to 𝑢 + 𝑣 = 5 with 𝑢 > 𝑣 > 0 are (𝑢, 𝑣) = (4, 1) and 16 The On-Line Encyclopedia of Integer Sequences has cataloged over a quarter million sequences of interest.

185

186

Chapter VI: Families of Numbers

29

37

12 35

(u, v) = (6, 1)

20

25

21 (u, v) = (5, 2)

24

7

(u, v) = (4, 3)

Figure 5. The three triples counted by 𝑃3 . (𝑢, 𝑣) = (3, 2), giving (15, 8, 17) and (5, 12, 13). Thus the family of numbers 𝑃𝑛 starts with 𝑃1 = 1 and 𝑃2 = 2. The three triples counted by 𝑃3 appear in Figure 5. Find 𝑃4 , 𝑃5 , and 𝑃6 . Can you guess a formula for 𝑃𝑛 ? Since the answer is the next proposition, the reader may wish to try this puzzle before reading further. ♢ The key to Puzzle 11 is the Euler phi function 𝜙 of Chapters I and III. Proposition 12: An Euler phi function application. The number 𝑃𝑛 of primitive Pythagorean triples where 𝑢 + 𝑣 = 2𝑛 + 1, 𝑢 > 𝑣 > 0, and gcd(𝑢, 𝑣) = 1 is 𝜙(2𝑛 + 1)/2. Proof. Observe that if 𝑢 and 2𝑛 + 1 are not relatively prime, then 𝑢 and 𝑣 are not relatively prime. The number of positive integers less than 2𝑛 + 1 that are relatively prime to 2𝑛 + 1 is 𝜙(2𝑛 + 1). Because 2𝑛 + 1 is odd and by Propositions I.17 and III.39, 𝜙(2𝑛+1) is even. Therefore, there exist precisely half this number of distinct pairs (𝑢, 𝑣). Example 13: Pythagorean triples where 𝑢 + 𝑣 = 15. To illustrate Proposition 12, we consider 𝑃7 . Since 2 ⋅ 7 + 1 = 15 = 3 ⋅ 5, and since 𝜙(15) = 2 ⋅ 4 = 8, there should be four pairs of primitive-Pythagorean-triple generators. They are (14, 1), (13, 2), (11, 4), and (8, 7), giving the respective triples (195, 28, 197), (165, 52, 173), (105, 88, 137), and (15, 112, 113).

♢

Binomial coefﬁcients The binomial coefficients are the coefficients of 𝑥𝑘 , denoted by (𝑛), in the expan𝑘 sion of (1 + 𝑥)𝑛 , where 𝑛 and 𝑘 are integers with 0 ≤ 𝑘 ≤ 𝑛. Thus (1 + 𝑥)𝑛 is the generating function for the family of binomial coefficients. Example 14: Expanding a power of 1 + 𝑥. To illustrate binomial coefficients we expand (1 + 𝑥)4 : (1 + 𝑥)4 = 1 + 4𝑥 + 6𝑥2 + 4𝑥3 + 𝑥4 .

Binomial coefﬁcients Binomial coefﬁcients Binomial coefﬁcients

187 187 187

b. Decimal notation. a. A triangle from 1303. Source: Wikimedia Commons. b. Decimal notation. a. A triangle from 1303. a. A triangle from Source: Wikimedia Commons. b. Decimal notation. Source: Wikimedia1303. Commons. Figure 6. Pascal’s triangle of binomial coefficients. Figure 6. 6. Pascal’s Pascal’s triangle triangle of of binomial binomial coefficients. coefficients. Figure 4 4 4 4 4 Thus ( ) = 1 = ( ), ( ) = 4 = ( ), and ( ) = 6. ♢ 0 4 1 3 2 Thus (4) = 1 = (4), (4) = 4 = (4), and (4) = 6. ♢ 0 4 1 3 2 Thus 1 = (4), (4relationship and (4) =the 6. binomial coefficients by ex♢ (4) = ) = 4 = (4), between We obtain a recursive 0 4 1 3 2 amining the function relationship (1 + 𝑥)𝑛 for the binomial coefficients. Wegenerating obtain a recursive between the binomial coefficients by ex𝑛 We obtain a recursive relationship binomial coefficients by examining the generating function (1 + 𝑥)between for thethe binomial coefficients. amining15: theAgenerating (1 +For 𝑥)𝑛integers for the binomial coefficients. Proposition binomialfunction recursion. 𝑛 and 𝑘 with 1 ≤ 𝑘 ≤ 𝑛, Proposition 15: A binomial For 𝑛 + 1 recursion. 𝑛 𝑛 integers 𝑛 and 𝑘 with 1 ≤ 𝑘 ≤ 𝑛, ( ) =recursion. ( ) + (For).integers 𝑛 and 𝑘 with 1 ≤(5) Proposition 15: A binomial 𝑘 ≤ 𝑛, 𝑛 𝑘 𝑛 + 1𝑘 − 1 𝑛 𝑘 (5) ( )=( ) + ( ). 𝑘1 𝑘 𝑛− 1 𝑛+ 𝑛𝑘 + 𝑛(. The ) 𝑛=+( 𝑥(1 +) 𝑥) ). coefficient of 𝑥 𝑘 in (5) Proof. Observe that (1 + 𝑥)𝑛+1 =((1 𝑘+ 𝑥) 𝑘−1 𝑘 𝑛+1 𝑛+1 𝑘−1 𝑛 𝑛 𝑛 𝑛+1 that (1 + 𝑥) of 𝑥= (1 in + (1 𝑥)++ . which The coefficient of 𝑥 𝑘 in coefficient 𝑥)𝑥(1 is + means that (1 + 𝑥)Proof. is (Observe ). The ( 𝑛𝑥)), 𝑘−1 𝑘 𝑛 𝑛+1 𝑘−1 𝑛+1 𝑘 𝑛 in (1 + 𝑥)𝑛 𝑛 𝑛 𝑛+1 𝑛= (1 𝑛 of 𝑥𝑘 means (1 + 𝑥)Observe ). The (The Proof. (1 𝑥) +𝑥 𝑥)the +coefficient 𝑥(1 + 𝑥)is.of in the coefficient of is 𝑥 ( inthat 𝑥(1 ++ 𝑥)coefficient is ( ).ofAnd 𝑥 𝑘),coefficient inwhich (1 + 𝑥) is that 𝑘

𝑘−1

𝑘−1

𝑛 𝑛 (1 𝑥)𝑛+1 is (𝑛+1 coefficient in (1the + 𝑥)coefficient is ( 𝑛 ), of which the+coefficient of ).𝑥 𝑘The 𝑥(1 + 𝑥)𝑛 isof 𝑥 𝑘 inmeans (1 + 𝑥)that is ( 𝑥𝑛𝑘−1 ). And the proposition isintrue. (𝑛). Thus 𝑘 𝑘−1 𝑘−1 𝑘 𝑛 𝑛 𝑘 𝑛 𝑘 𝑛 Thus the proposition ( ).coefficient the of 𝑥 in 𝑥(1is+true. 𝑥) is ( ). And the coefficient of 𝑥 in (1 + 𝑥) is 𝑘 𝑘−1 The 𝑛 binomial coefficients are used to count items within a very broad context. the proposition is true. ( ). Thus The binomial coefficients are used to count items within a very broad context. 𝑘 Proposition 16: Binomial coefficients as a counting tool. Let 𝑘 and 𝑛 be The0binomial are used to count within a very broad context. integers with ≤ 𝑘 16: ≤ 𝑛.coefficients Without regard to the order incounting which items areLet chosen, Proposition Binomial coefficients as aitems tool. 𝑘 and 𝑛 be 𝑛 the number of with ways 0to≤choose items from 𝑛 items integers 𝑘 ≤ 𝑛.𝑘Without regard to theisorder ( ). in which items are chosen, 𝑘 the number of ways to choose 𝑘 items from 𝑛 items is (𝑛). 𝑘

188

Chapter VI: Families of Numbers

Proposition 16: Binomial coefficients as a counting tool. Let 𝑘 and 𝑛 be integers with 0 ≤ 𝑘 ≤ 𝑛. Without regard to the order in which items are chosen, the number of ways to choose 𝑘 items from 𝑛 items is (𝑛). 𝑘

Proof. Observe that the number of ways to choose 0 items from 0 items is 1, because there is one way to do nothing at all. Thus the proposition is true (by default) for 𝑛 = 0. The proposition is true when 𝑛 = 1. Assume it is true when 𝑛 ≥ 1. Let 𝐴 be a set of 𝑛 + 1 distinct items. Remove an item 𝑎 from 𝐴, forming a set 𝐵 of 𝑛 distinct items. When 1 ≤ 𝑘 ≤ 𝑛, the number of ways to choose 𝑘 − 1 items from 𝐵 is ( 𝑛 ). Let 𝑊 be one of these ways. Place 𝑎 into 𝑊, forming a set 𝑘−1 𝑉 of 𝑘 elements. The size of the set of all sets 𝑉 formed in this way is the number of ways ( 𝑛 ) to choose 𝑘 items from 𝐴 so that 𝑎 is one of the items. Furthermore, 𝑘−1

the number of ways to choose 𝑘 items from 𝐵 is (𝑛). This is the same number 𝑘 of ways to choose 𝑘 items from 𝐴 where 𝑎 is not chosen. By Proposition 15, the number of ways to choose 𝑘 items from 𝐴 is (𝑛+1). When 𝑘 = 𝑛 + 1 or 𝑘 = 0, 𝑘 there is exactly 1 way to choose all 𝑛 + 1 items or 0 items, respectively, from a set of 𝑛 + 1 items. Therefore the proposition is true for 𝑛 + 1. Example 17: Choosing two items from four. Suppose we wish to choose two suits from the set of four suits {♣, ♢, ♡, ♠}. Itemizing the number of ways to do this gives {♣, ♢}, {♣, ♡}, {♣, ♠}, {♢, ♡}, {♢, ♠}, {♡, ♠}, a total of six ways, which is the same as (4) by Example 14.

♢

2

The recursive relationship identified by Proposition 15 gives, by way of mathematical induction, a formula to compute any particular binomial coefficient. Proposition 18: A binomial formula. For integers 𝑛 and 𝑘 with 0 ≤ 𝑘 ≤ 𝑛, 𝑛 𝑛! . ( )= 𝑘 𝑘! (𝑛 − 𝑘)!

(6)

Proof. As observed earlier in the proof of Proposition 16, (𝑛) = 1. Since 𝑛

0

𝑛

0! 0!(0−0)!

=

1, the proposition is true when 𝑛 = 0. By definition, ( ) = 1 and ( ) = 1 for all 𝑛 ∈ ℤ+ . Since

𝑛! 0!(𝑛−0)!

=1=

𝑛! 𝑛!(𝑛−𝑛)!

0

𝑛

, the proposition is true when 𝑘 = 0 and

𝑘 = 𝑛, for all positive integers 𝑛. The first non-trivial instance of 𝑛 and 𝑘 with respect to this proposition is 𝑛 = 2 and 𝑘 = 1. By Proposition 15, 2 1 1 2! . ( )=( )+( )=1+1=2= 1 0 1 1! (2 − 1)!

Binomial coefﬁcients

189

Assume that the proposition is true for a given 𝑛, and consider (𝑛+1) where 1 ≤ 𝑘 𝑘 ≤ 𝑛. By Proposition 15 and the inductive hypothesis, (

𝑛+1 𝑛 𝑛 𝑛! 𝑛! (𝑛 + 1)! + = . )=( )+( )= 𝑘 𝑘−1 𝑘 (𝑘 − 1)! (𝑛 − 𝑘 + 1)! 𝑘! (𝑛 − 𝑘)! 𝑘! (𝑛 + 1 − 𝑘)!

By mathematical induction, the proposition is true. The earliest use of binomial coefficients dates (as far as I know) to Pingala (circa second century bc) in the Sanskrit manuscript Chandasastra. When the binomial coefficients are assembled into a triangular tree according to the relationship of Proposition 15, they are called the Pascal triangle after Blaise Pascal (1623–1662), who used them in a treatise on probability. Figure 6 shows two versions of the triangle, the first one in the Chinese script of Zhu Shijie from 1303, and the other in decimal notation. The node at the top of this triangle corresponds to (0) in row zero. The numbers in row 𝑛 are (𝑛), 0 ≤ 𝑘 ≤ 𝑛. 0 𝑘 As a small connection between the Pythagorean family of numbers and the binomial coefficients we offer the following puzzle. Puzzle 19: Consecutive binomial coefficients and Pythagorean triples. Find17 positive integers 𝑛 and 𝑘 such that 𝑛 𝑛 𝑛 (( ), ( ), ( )) 𝑘 𝑘+1 𝑘+2 ♢

forms a Pythagorean triple.

While the black plague was raging in London in 1665, Isaac Newton (1643– 1727) stayed in his country home for the next eighteen months. During this time, among other accomplishments, he discovered the general binomial theorem. Proposition 20: Newton’s binomial series. Let 𝛼 ∈ ℝ; then for all 𝑥 with |𝑥| < 1, (1 + 𝑥)𝛼 = 1 + 𝛼𝑥 +

𝛼(𝛼 − 1)𝑥2 𝛼(𝛼 − 1)(𝛼 − 2)𝑥3 + + ⋯. 2! 3!

(7)

Proof. The proof is a straightforward application of Taylor’s theorem. As a special case, we have Equation (8). Corollary 21: A special square root expansion. For all 𝑥 ∈ ℝ with |𝑥| < 1, 1 √1 − 𝑥

∞

= ∑ 𝑛=0

(2𝑛) 𝑛

22𝑛

𝑥𝑛 .

(8)

17 As a hint, try the range of values 60 through 70 for 𝑛. As it turns out, this solution is unique when 2𝑘 < 𝑛. For a further hint, see Luca [92].

190

Chapter VI: Families of Numbers 1

Proof. Apply Proposition 20 with 𝛼 = − , and in place of 𝑥 write −𝑥 or use 2 Taylor’s theorem directly. Then follow the outline given in Exercise 6.

Fibonacci numbers In 1202, Leonardo Pisano18 (circa 1170–1250) wrote Liber Abaci, a discourse on the Arabic numbers explaining why computing with them is easier than computing with Roman numerals. The most famous problem from that work concerns the propagation of rabbits: A certain man put a pair of rabbits in a place surrounded on all sides by a wall. How many pairs of rabbits can be produced from that pair in a year if it is supposed that every month each pair begets a new pair which from the second month on becomes productive? The answer to Fibonacci’s riddle is the twelfth term in the sequence 1, 1, 2, 3, 5, 8, 13, 21, … , a family of integers known as the Fibonacci numbers. We recast Fibonacci’s rabbit riddle as a riddle about climbing stairs on a pogostick. We invite the reader to show that the number of rabbits at month 𝑛 is the same as the number of ways to climb 𝑛 stairs on a pogo-stick.19 Puzzle 22: Pogo-stick climbs. On a pogo-stick, how many ways can person 𝒜 climb a staircase of 𝑛 stairs taking one or two stairs at a time? In particular, how many ways can 𝒜 climb the twelve stairs in Figure 7? (We assume that 𝒜 goes up the staircase at each step.) In our counting convention, we say that there is one way to climb a staircase of zero steps. That is, there is one way to do nothing at all. Figure 8 shows the various ways to pogo-stick climb staircases of sizes 1 through 4. In particular, we use a unit square to indicate the one way to climb a staircase of one stair. We use two unit squares and a single 1 × 2 rectangle to show the two ways to climb a staircase of two stairs. For 𝑛 = 3 and 𝑛 = 4, we use gray and black rectangles to highlight the recursive structure of these Fibonacci numbers. To climb three stairs, 𝒜 can start with a pogo-stick jump of one step and finish in the two ways shown in case 𝑛 = 2, or 𝒜 can start with a pogo-stick jump of two steps and finish 18 Somehow during repeated hand-copying and translating Liber Abaci, Leonardo’s name was rendered Fibonacci. Leonardo was the son of Bonaccio Pisano. In the Latin script, this relationship was written filius Bonaccio and was perhaps translated as Fibonacci. 19 One way to start this problem is to let 𝑟 be the total number of pairs of rabbits in month 𝑛. We 𝑛 assume that at month 1 we have 𝑟1 = 1 pair of baby rabbits. In the next month, this pair of rabbits has matured, so at month 2 we have 0 pair of baby rabbits and 1 pair of adult rabbits, which means that 𝑟2 = 1. In the next month, that single pair of adults will produce 1 pair of baby rabbits, for a total of 𝑟3 = 2. Analyze what happens over the next few months.

Fibonacci numbers

191

Figure 7. Replica of the grand ballroom staircase aboard the Titanic.

n=1

n=2

n=4

n=3

Figure 8. Pogo-stick staircase climbs taking one or two stairs at a time. in the one way of case 𝑛 = 1. If we let 𝑏𝑛 represent the 𝑛th Fibonacci number, this recursion can be written as 𝑏𝑛 = 𝑏𝑛−1 + 𝑏𝑛−2

where 𝑏0 = 1 = 𝑏1 ,

(9) ♢

for 𝑛 ≥ 2. Table 4 shows the values of 𝑏𝑛 as 𝑛 ranges from 0 to 12. Table 4. The first few terms for Puzzle 22. 𝑛 𝑏𝑛

0 1

1 2 3 4 5 6 1 2 3 5 8 13

7 21

8 9 34 55

10 11 89 144

12 233

192

Chapter VI: Families of Numbers

From the definition of generating function from Chapter V and by Table 4, the generating function for the Fibonacci numbers is 𝑓(𝑥) = 1 + 𝑥 + 2𝑥2 + 3𝑥3 + 5𝑥4 + ⋯ . The next proposition shows how to represent this open form of 𝑓 as a closed form. Proposition 23: A closed Fibonacci form. The closed form for the Fibonacci generating function is 1 𝑓(𝑥) = . (10) 1 − 𝑥 − 𝑥2 Proof. Multiply Equation (9) by 𝑥𝑛 : 𝑏𝑛 𝑥𝑛 = 𝑏𝑛−1 𝑥𝑛 + 𝑏𝑛−2 𝑥𝑛 . Sum Equation (11) over all 𝑛 ≥ 2: ∞

∞

(11)

∞

∑ 𝑏𝑛 𝑥𝑛 = ∑ 𝑏𝑛−1 𝑥𝑛 + ∑ 𝑏𝑛−2 𝑥𝑛 . 𝑛=2

𝑛=2

(12)

𝑛=2

Rewrite the first, second, and third summations of Equation (12), respectively, as ∞

∞

∑ 𝑏𝑛 𝑥𝑛 = ∑ 𝑏𝑛 𝑥𝑛 − 1 − 𝑥 = 𝑓(𝑥) − 1 − 𝑥, 𝑛=2 ∞

𝑛=0

∞

∞

∞

∑ 𝑏𝑛−1 𝑥𝑛 = ∑ 𝑏𝑛 𝑥𝑛+1 = 𝑥 ∑ 𝑏𝑛 𝑥𝑛 = 𝑥( ∑ 𝑏𝑛 𝑥𝑛 − 𝑏0 ) = 𝑥(𝑓(𝑥) − 1), 𝑛=2

and

𝑛=1 ∞

𝑛=1

𝑛=0

∞

∞

∑ 𝑏𝑛−2 𝑥𝑛 = ∑ 𝑏𝑛 𝑥𝑛+2 = 𝑥2 ∑ 𝑏𝑛 𝑥𝑛 = 𝑥2 𝑓(𝑥). 𝑛=2

𝑛=0

𝑛=0

Therefore, Equation (12) becomes 𝑓(𝑥) − 1 − 𝑥 = 𝑥(𝑓(𝑥) − 1) + 𝑥2 𝑓(𝑥).

(13)

Solving Equation (13) for 𝑓 gives the desired result. Proposition 24: A formula for the 𝑛th Fibonacci number. Let 𝜎 = 0.618 and 𝜇 = −

√5+1 2

≈ −1.618. Then 𝑏𝑛 =

5 + √5 2 + . 10𝜎𝑛 (5 + √5)𝜇𝑛

Proof. The roots of 1 − 𝑥 − 𝑥2 are 𝜎 =

√5−1 2

and 𝜇 = −

1 𝐴 𝐵 = 𝑥 + 1 − 𝑥 − 𝑥2 1− 1− 𝜍

𝑥 𝜇

√5+1 2

. Solving

√5−1 2

≈

Fibonacci numbers gives 𝐴 =

5+√5 10

and 𝐵 =

193 2 5+√5 ∞

. By the geometric series, ∞

𝑛

∞

𝑛

1 𝑥 𝑥 𝐴 𝐵 = 𝐴 ∑ ( ) + 𝐵 ∑ ( ) = ∑ ( 𝑛 + 𝑛 )𝑥𝑛 𝜎 𝜇 𝜎 𝜇 1 − 𝑥 − 𝑥2 𝑛=0 𝑛=0 𝑛=0 when |𝑥| < 𝜎. The coefficient of 𝑥𝑛 is the desired result. Corollary 25: A concise Fibonacci formula. The 𝑛th Fibonacci number is also given by 5 + √5 𝑏𝑛 = [ ], 10𝜎𝑛 where [𝑥] denotes the integer nearest to the real number 𝑥. 2 | | | is a decreasing sequence and has value 0.28 at 𝑛 = 0, the Proof. Since | | (5+√5)𝜇𝑛 |

𝑛th Fibonacci number is the integer nearest to

5+√5 10𝜍𝑛

.

Rather than Proposition 24 or Corollary 25, an alternative approach to finding a non-recursive formula for 𝑏𝑛 is to use binomial coefficients. Proposition 26: A binomial approach to the Fibonacci numbers. The 𝑛th Fibonacci number is 𝑛 2

⌊ ⌋

𝑏𝑛 = ∑ ( 𝑘=0

𝑛−𝑘 ). 𝑘

Proof. One way to compute the number 𝑏𝑛 is to consider a specific case for 𝑛 and then generalize. To climb 𝑛 = 7 stairs using jumps or strides of length 1 or 2, we must do one of the following: • Seven steps, each of stride length 1. There is one way to do this, namely, 1 = (7−0). 0

• Six steps, 1 of stride 2 and 5 of stride 1. There are 6 ways to do this, 6 = (7−1). 1

• Five steps, 2 of stride 2 and 3 of stride 1. There are 10 ways to do this, 10 = (7−2). 2

• Four steps, 3 of stride 2 and 1 of stride 1. There are 3 ways to do this, 3 = (7−3). 3

7 ⌊ ⌋ 2

7−𝑘 So 𝑏7 = ∑ ( ) = 21. Generalizing this example, as the reader may do, gives 𝑘 𝑘=0 the proposition.

194

Chapter VI: Families of Numbers

The next two results show how to use Fibonacci numbers to generate Pythagorean triples. Proposition 27: Another Fibonacci recursion. Let 𝑏𝑛 be the 𝑛th Fibonacci 2 number. For each integer 𝑛 ≥ 1, 𝑏𝑛−1 + 𝑏𝑛2 = 𝑏2𝑛 . Proof. The number of ways for 𝒜 to pogo-stick climb 2𝑛 stairs is 𝑏2𝑛 . Alternatively, the number of ways to climb 𝑛 stairs is 𝑏𝑛 . So the number of ways for 𝒜 to climb 2𝑛 stairs so that 𝒜 actually lands on step 𝑛 is 𝑏𝑛2 . For 𝒜 to climb 2𝑛 stairs so as not to land on step 𝑛, 𝒜 must land on step 𝑛 − 1, then take a jump of two 2 steps to land on step 𝑛 + 1. The number of ways for 𝒜 to do this is 𝑏𝑛−1 . Corollary 28: Fibonacci-Pythagorean triples. For each integer 𝑛 > 2, (𝑏𝑛−2 𝑏𝑛+1 , 2𝑏𝑛−1 𝑏𝑛 , 𝑏2𝑛 ) is a Pythagorean triple. 2 2 Proof. By algebra, we know that (𝑏𝑛2 −𝑏𝑛−1 , 2𝑏𝑛−1 𝑏𝑛 , 𝑏𝑛−1 +𝑏𝑛2 ) is a Pythagorean 2 2 triple. By Proposition 27, 𝑏𝑛−1 + 𝑏𝑛 = 𝑏2𝑛 . Furthermore, by Equation (9), 2 𝑏𝑛2 − 𝑏𝑛−1 = (𝑏𝑛 − 𝑏𝑛−1 )(𝑏𝑛 + 𝑏𝑛−1 ) = 𝑏𝑛−2 𝑏𝑛+1 .

And so we have the corollary. Definition 29: The golden mean. The number 𝜙 = golden mean, where 𝜎 =

√5−1 2

1

=

𝜍

1+√5 2

is called the

, as defined in Proposition 24.

Example 30: A look ahead—the continued fraction for the golden mean. As a prelude example for Chapter IX, we find the continued fraction for the golden mean. As will be seen, we repeatedly use the identity 𝜙 = 1 + 𝜎. Using the same idea as introduced in Example II.4, where we generated the 532 , we write 𝜙 in the following form: continued fraction for the fraction 1193

𝜙=1+𝜎=1+

1 1

( ) 𝜍

=1+

1 1 1 =1+ =1+ 𝜙 1+𝜎 1+

1

1 ( ) 𝜍

Since we can continually expand 𝜙 as 1 +

1 𝜙

=1+

1 1+

1

.

𝜙

, the simple continued fraction of

𝜙 is [1; 1, 1, 1, …] with respect to the notation introduced in Definition 2 of the Introduction. Furthermore, the first few convergents of this infinite simple continued fraction are 1 3 5 2 1 1 1+ = , = , = , 1+ 1+ 1 1 1 1 2 3 1+ 1 1+ 1

1+

1

The continued fraction recursion for 𝑒

195

and so on. That is, the convergents of 𝜙 are the ratios of successive Fibonacci numbers. Exercise 2 explores how well these ratios approximate 𝜙. ♢

The continued fraction recursion for 𝑒 Surprisingly, the continued fraction for 𝑒 has a Fibonacci-like recursive structure. Euler discovered this result in 1744. Example 31: Successive continued fraction partial denominators for 𝑒. Using the same ideas as in Example II.4 and Puzzle V.6 (and using the standard division algorithm), we find the first few remainders obtained from generating the continued fraction for 𝑒. In doing so we uncover a repeating pattern within 𝑒. Since this pattern is more clear in the number 𝑒 − 1, we find the partial denominator representation for 𝑒 − 1: 𝑒 − 1 = [𝑎0 ; 𝑎1 , 𝑎2 , 𝑎3 , …]. To highlight these partial denominators 𝑎𝑖 as we generate them, we box them. Step 0: By the division algorithm, 𝑒 − 1 = 1 + (𝑒 − 2). Thus 𝑎0 = 1 and the first remainder is 𝑟1 = 𝑒 − 2 ≈ 0.718. 1

Step 1: By the division algorithm,

𝑟1

=

1 𝑒−2

1 𝑟2

=

𝑒−2 3−𝑒

convergent is 𝐶2 = 1 + Step 3: Similarly,

1 𝑟3

= 2 + 1

Step 4: Similarly,

1 𝑟4

= 1 +

11

1 𝑟5

= 4+

8−3𝑒 𝑒−3

3

3

. So 𝑎3 = 1, 𝑟4 =

11−4𝑒 3𝑒−8

, 𝐶3 = 1 +

7

1 1+

= ,

1

4

1 2+ 1

19−7𝑒 4𝑒−11 19 7

. So 𝑎4 = 1, 𝑟5 =

19−7𝑒 4𝑒−11

, 𝐶3 = 1+

1 1+

=

1 2+

1

7

,

.

87−32𝑒 7𝑒−19

, and 𝑒’s fifth convergent is

12

1 1+ 1

. So 𝑎5 = 4, 𝑟6 =

87−32𝑒 7𝑒−19

, 𝐶4 = 1+

1 1+

=

1 1

2+

1 1+

32

,

, and the second

1+

55

𝑒−2

.

4

and 𝑒’s fourth convergent is Step 5: Similarly,

. Thus 𝑎2 = 2, 𝑟3 =

8

3𝑒−8

and 𝑒’s third convergent is

3−𝑒

= 2. So 𝑒’s first convergent

5

11−4𝑒

= 1 +

𝑒−3

1

. Thus 𝑎1 = 1, 𝑟2 =

= . So 𝑒’s second convergent is .

1 2

1+

8−3𝑒

𝑒−2

1

and the first convergent 𝐶1 for 𝑒 − 1 is 𝐶1 = 1 + is 3. Step 2: Similarly,

3−𝑒

= 1 +

87 32

.

1 4

196

Chapter VI: Families of Numbers

Continuing, we generate the list of partial denominators for 𝑒 − 1: 𝑒 − 1 = [1; 1, 2, 1, 1, 4, 1, 1, 6, 1, 1, 8, 1, 1, 10, …]. ♢

So the continued fraction for 𝑒 is 𝑒 = [2; 1, 2, 1, 1, 4, 1, 1, 6, 1, 1, 8, …].

Example 32: Successive terms in (𝑒 − 1)’s remainders of Example 31. The numerator and denominator expressions in the remainders of Example 31 are all of the form 𝑚𝑒+𝑛 where 𝑚 and 𝑛 are integers. Better yet, we write the denominators as 𝑚𝑒 − 𝑛 where 𝑚, 𝑛 ∈ ℕ. We partition these expressions into three families of numbers, denoted by 𝛼𝑛 , 𝛽𝑛 , and 𝛾𝑛 . The first few terms have the values given in Table 5. (The terms 𝛼0 = 𝑒 − 1 and 𝛽0 = −1 were chosen as initial values so as to agree with a recursion we will characterize.) Table 5. Successive expressions in the continued fraction remainders of 𝑒 − 1. 𝛼0 𝑒−1

𝛽0 𝛾0 −1 𝑒 − 2

𝛼1 𝑒−3

𝛽1 3𝑒 − 8

𝛾1 4𝑒 − 11

𝛼2 7𝑒 − 19

𝛽2 32𝑒 − 87

𝛾2 39𝑒 − 106

Observe that 𝛼0 + 𝛽0 = 𝛾0 , 𝛽0 + 𝛾0 = 𝛼1 , 𝛾0 + 2𝛼1 = 𝛽1 , and 𝛼1 + 𝛽1 = 𝛾1 . In general, it appears that the pattern is given by 𝛼𝑛 = 𝛽𝑛−1 + 𝛾𝑛−1 ,

𝛽𝑛 = 2𝑛𝛼𝑛 + 𝛾𝑛−1 ,

and

𝛾𝑛 = 𝛼𝑛 + 𝛽𝑛

(14) ♢

for all 𝑛 ∈ ℤ+ . Proposition 33: The pattern of 𝑒.∗ For each 𝑛 ∈ ℕ, let 1 𝑛

𝑥 (𝑥−1)𝑛 𝑒𝑥 𝐴𝑛 =∫ 𝑑𝑥, 𝑛! 0

1 𝑛+1

𝐵𝑛 = −∫ 0

1 𝑛

𝐶𝑛 = −∫ 0

𝑥

𝑥 (𝑥−1) 𝑛!

(𝑥−1)𝑛 𝑒𝑥 𝑑𝑥, 𝑛!

𝑛+1 𝑥

𝑒

𝑑𝑥.

Then 𝛼𝑛 = 𝐴𝑛 , 𝛽𝑛 = 𝐵𝑛 , and 𝐶𝑛 = 𝛾𝑛 . Furthermore, 𝛼𝑛 , 𝛽𝑛 , and 𝛾𝑛 converge to 0 as 𝑛 increases. Proof. See Exercise 4 for a proof and also a verification of the pattern in Equation (14), which is an adaptation from [28]. To illustrate this proposition, consider 𝛾2 = 39𝑒 − 106 from Table 5. We know 106 106 that 39𝑒 − 106 ≈ 0. Thus 𝑒 ≈ and 𝑒 − ≈ 0.00033. 39

39

The Catalan numbers∗

197

d 0

n

Figure 9. A graphical representation of 𝑆0 .

The Catalan numbers∗ In this section, we solve a classic puzzle, first solved by George Pólya (1887–1985) in 1921. Puzzle 34: A random walk. Suppose that 𝒜 is a robot that moves back and forth along a number line, advancing one step (a unit length) to the right or one step to the left at each second with equal likelihood. If 𝒜 starts at the origin (home), what is the likelihood that 𝒜 will eventually return home? (We assume 𝒜 never stops.) ♢ Definition 35: Step terminology. A step sequence 𝑆 = {𝑠1 , 𝑠2 , 𝑠3 , …} is a sequence whose terms 𝑠𝑖 are either 1 or −1. A rightward step is recorded as 1 and a leftward step as −1. Furthermore, 𝒜’s signed distance from 0 at time 𝑛 with respect to a step sequence 𝑆 is 𝑛

𝑑(𝑛) = ∑ 𝑠𝑖 . 𝑖=1

When we use the phrase step sequence of length n we mean the first 𝑛 terms of the step sequence. For example, the sequence 𝑆0 = {−1, −1, 1, 1, −1, 1, 1} is a random walk of length seven. Figure 9 is a graphical representation of 𝑆0 given by the points (𝑛, 𝑑(𝑛)) joined by line segments. Thus the graph of 𝑆0 is formed by connecting the points (0, 0), (1, −1), (2, −2), (3, −1), (4, 0), (5, 0), and so on. Figure 10 shows two random walks each of 100 steps. On the first walk 𝒜 returns home (whenever 𝑑(𝑛) = 0) several times on the hike, whereas in the second 𝒜 fails to do so. Observe that if 𝑑(𝑛) = 0 for the random walk 𝑆, then the number of 1’s and the number of −1’s from 𝑠1 to 𝑠𝑛 must be the same. Thus if 𝑑(𝑛) = 0, then 𝑛 must be even. With this idea in mind we make the following definition.

198

Chapter VI: Families of Numbers S

S

100 0

0

a. 𝑑(𝑛) = 0 for several 𝑛 ≤ 100.

100

b. 𝑑(𝑛) > 0 for 0 < 𝑛 ≤ 100.

Figure 10. Two random walks, each of 100 paces. Definition 36: Primitive hikes. We say that a step sequence 𝑆 of length 2𝑛 is a mountain hike if 𝑑(0) = 𝑑(2𝑛) = 0 and 𝑑(𝑖) ≥ 0 for all integers 𝑖 with 0 ≤ 𝑖 ≤ 𝑛. Let 𝐶𝑛 be the number of mountain hikes of length 2𝑛 for all 𝑛 ∈ ℕ. We say that a mountain hike 𝑆 of length 2𝑛 is a primitive mountain hike if 𝑑(𝑖) > 0 for all integers 𝑖 with 0 < 𝑖 < 2𝑛. Figure 11 shows the number of mountain hikes of length two, four, and six. For each hike, the light gray horizontal line is a base line. Of these eight hikes, four are primitive mountain hikes. If we shift the hikes up by one unit, prepending 1 and appending −1 to each step sequence, then the mountain hikes in the figure become primitive mountain hikes of length four, six, and eight, respectively. Figure 12 shows that 𝐶4 , the number of mountain hikes of length eight, is also the number of primitive mountain hikes of length ten. Figure 18 shows that 𝐶5 , the number of mountain hikes of length ten, is the number of all primitive mountain hikes of length twelve. Generalizing these observations, we have the following proposition. Proposition 37: Primitive versus standard hikes. The number of mountain hikes of length 2𝑛 is the number of primitive mountain hikes of length 2𝑛 + 2. Furthermore, the step sequence corresponding to any primitive mountain hike longer than two must end with two left steps. Proof. The first statement follows from the definitions of primitive mountain hike and mountain hike. The simplest primitive mountain hike is {1, −1}. Since clipping the first and last steps from any primitive mountain hike of length at least four gives a mountain hike of length at least two, and since every mountain hike must end in −1, every primitive mountain hike ends with two left steps. The family of numbers 𝐶𝑛 is called the Catalan numbers; they have a long history, summarized in Exercise 8. To find a way to determine the value of 𝐶𝑛 in general, we try to view the Catalan numbers recursively so that 𝐶𝑛 is defined in terms of 𝐶𝑖 for values of 𝑖 smaller than 𝑛. One way to do this is to realize that each mountain hike must begin with a primitive mountain hike. Furthermore, since all mountain hikes

The Catalan numbers∗

a. 𝐶1 = 1.

199

b. 𝐶2 = 2.

c. 𝐶3 = 5.

Figure 11. Mountain hikes for 𝑛 = 1, 2, 3.

of length 2 are primitive, 𝐶0 = 𝐶1 = 1, while otherwise, 𝐶𝑛 is the number of primitive hikes of length 2𝑛 + 2. For example, Figure 12 sorts the fourteen mountain hikes counted by 𝐶4 into four categories. Each hike itself is partitioned into two sub-hikes by a dashed vertical line. The sub-hikes to the left of the vertical line are primitive mountain hikes. The first of these categories, pictured in the top tier of the figure, consists of five mountain hikes, each of which is a primitive mountain hike of length two followed by a mountain hike of length six. The number of hikes in this category is 5 = 𝐶0 𝐶3 . The second of these categories consists of two hikes, each of which is a primitive hike of length four followed by a mountain hike of length four. The number of hikes is 2 = 𝐶1 𝐶2 . The third of these categories consists of two hikes, each of which is a primitive hike of length six followed by a mountain hike of length two. The number of hikes is 2 = 𝐶2 𝐶1 . Finally, the fourth of these categories consists of five hikes, each of which is a primitive hike of length eight followed by a mountain hike of length zero. The number of hikes is 5 = 𝐶3 𝐶0 . Thus the total number 𝐶4 of mountain hikes of length eight is 𝐶4 = 𝐶0 𝐶3 + 𝐶1 𝐶2 + 𝐶2 𝐶1 + 𝐶3 𝐶0 . In Exercise 5a, we ask for a similar partitioning of all 42 hikes, as counted by 𝐶5 . The general convolution formula for 𝐶𝑛 is thus 𝑛

𝐶𝑛 = ∑ 𝐶𝑘−1 𝐶𝑛−𝑘

with

𝐶0 = 1.

(15)

𝑘=1

Definition 38: The Catalan generating function. The generating function ∞ for the Catalan numbers is 𝑓(𝑥) = ∑𝑛=0 𝐶𝑛 𝑥𝑛 .

200

Chapter VI: Families of Numbers

Primitive mountain hikes of length 2, followed by a mountain hike of length 6.

Primitive mountain hikes of length 4, followed by a mountain hike of length 4.

Primitive mountain hikes of length 6, followed by a mountain hike of length 2.

Primitive mountain hikes of length 8, followed by a mountain hike of length 0. Figure 12. Partitioning the hikes as counted by 𝐶4 . Proposition 39: A Catalan relation. The Catalan function 𝑓(𝑥) satisfies the equation 𝑓(𝑥) − 1 = 𝑥𝑓(𝑥)2 . (16) Proof. By Definition 38 and Equation (15), ∞

∞

∞

𝑛

∞

(𝑥 ⋅ 𝑓(𝑥))𝑓(𝑥) = ( ∑ 𝐶𝑘−1 𝑥𝑘 )( ∑ 𝐶𝑘 𝑥𝑘 ) = ∑ ∑ 𝐶𝑘−1 𝐶𝑛−𝑘 𝑥𝑛 = ∑ 𝐶𝑛 𝑥𝑛 . 𝑘=1

𝑘=1

𝑛=1 𝑘=1

𝑛=1

∞

Since ∑ 𝐶𝑛 𝑥𝑛 = 𝑓(𝑥) − 1, we have 𝑥𝑓(𝑥)2 = 𝑓(𝑥) − 1. Solving for 𝑓(𝑥) gives its 𝑛=1

closed form. Proposition 40: A closed Catalan form. The closed form of the Catalan generating function is 1 − √1 − 4𝑥 𝑓(𝑥) = . (17) 2𝑥

The Catalan numbers∗

201

Proof. Solving Equation (16) for 𝑓(𝑥) via the quadratic formula gives 𝑓(𝑥) = 1±√1−4𝑥 2𝑥

lim𝑥→0

. Because 𝐶0 = 1, we know that lim𝑥→0 𝑓(𝑥) must be 1. Observe that

1+√1−4𝑥 2𝑥

fails to exist and, by L’Hôpital’s rule, lim𝑥→0

1−√1−4𝑥 2𝑥

= 1.

From Corollary 21, we can find a formula for 𝐶𝑛 . Proposition 41: A Catalan formula. For each 𝑛 ∈ ℕ, 𝐶𝑛 =

1 2𝑛 ( ). 𝑛 𝑛+1

(18)

Proof. By Corollary 21, Newton’s binomial series gives ∞

1 √1 − 𝑥

= ∑ 𝑛=0

(2𝑛) 𝑛

22𝑛

𝑥𝑛

(19) 1

for all 𝑥 with |𝑥| < 1. Substituting 4𝑥 for 𝑥 in (19) means that for all 𝑥 with |𝑥| < , 4

1

∞

= ∑(

2𝑛 ) 𝑥𝑛 . 𝑛

√1 − 4𝑥 𝑛=0 Integrating Equation (20) from 0 to 𝑋 (and then replacing 𝑋 with 𝑥) gives ∞

∞

1 1 1 1 2𝑛 2𝑛 − √1 − 4𝑥 = ∑ ( ) 𝑥𝑛+1 = 𝑥 ∑ ( ) 𝑥𝑛 𝑛 𝑛 2 2 𝑛 + 1 𝑛 + 1 𝑛=0 𝑛=0

(21)

1

1−√1−4𝑥

4

2𝑥

for all 𝑥 with |𝑥| < . Dividing Equation (21) by 𝑥 shows that 𝑓(𝑥) = ∞ ∑𝑛=0

(20)

𝐶𝑛 𝑥𝑛 , proving the desired result.

=

We can now answer our puzzle. Proposition 42: Solution of the random walk puzzle. With respect to Puzzle 34, 𝒜 returns to 0 with probability 100 percent. Proof. If 𝒜’s first step is to the right and 𝒜 eventually returns to 0, 𝒜 will do so for the first time along a primitive mountain path. The probability that 𝒜 1 returns to 0 along a primitive mountain path of length two is 𝐶0 ( )2 = 𝐶0 /4. The 2 probability that that 𝒜 returns to 0 along a primitive mountain path of length four 1 is 𝐶1 ( )4 = 𝐶1 /42 . In general, the probability that 𝒜 returns to 0 along a primitive 2

mountain path of length 2𝑛 + 2 is 𝐶𝑛 /22𝑛+2 = 𝐶𝑛 /4𝑛+1 . Thus the probability that ∞ 𝒜 returns to 0 along a primitive mountain path is ∑𝑛=0 𝐶𝑛 /4𝑛+1 . By symmetry, since 𝒜’s first step may be to the left, the probability that 𝒜 returns to 0 at least ∞ 1 1 once is 2 ∑𝑛=0 𝐶𝑛 /4𝑛+1 = 𝑓( ) = 1 by Proposition 40. 2

4

202

Chapter VI: Families of Numbers

Pólya went on to show that if 𝒜 is allowed to take unit steps north and south as well as east and west with equal likelihood, the probability that 𝒜 returns home is again 1. But if 𝒜 is allowed to take unit steps up and down as well as in the four natural planar directions with equal likelihood, then 𝒜 has about probability 34% of returning home in an arbitrary random walk. Puzzle 43: A Chichen Itza random walk. Robot 𝒜 is positioned half-way up a staircase of ninety-one steps on the Chichen Itza pyramid at step forty-six, a step we call home. As in Puzzle 34, at each second, 𝒜 goes either up one stair or down one stair. After forty-five seconds, how far from home on the average is 𝒜? (The signed distance 𝑑(𝑛) on the average of course is zero.) ♢ To solve this puzzle we first calculate the average squared distance from home after 𝑛 steps. Proposition 44: Average squared distance from home. In a random walk, after 𝑛 steps, 𝒜’s average squared distance from home is 𝑛. Proof. By Definition 35, 2

𝑛

𝑛

𝑑(𝑛) = ( ∑ 𝑠𝑖 ) = ∑ 𝑠2𝑖 + 2 ∑ 𝑠𝑖 𝑠𝑗 = 𝑛 + 2 ∑ 𝑠𝑖 𝑠𝑗 , 2

𝑖=1

𝑖=1

𝑖 𝑅.

b. A 𝜃 cross-section.

Figure 21. Orientation 𝐵, a unit mass 𝑃 on the equator. To facilitate the resulting integration (since the 𝜃 cross-sections of Earth are no longer congruent as they were in orientation 𝐴), we take 𝑑𝜏 = 𝑟 𝑑𝑟 𝑑𝑧 𝑑𝜃 (rather than 𝑟 𝑑𝜃 𝑑𝑟 𝑑𝑧). As before, the downward attraction on 𝑃 induced by point mass 𝑆 is proportional to 𝑟(𝜌 − 𝑧) 𝑑𝑟 𝑑𝑧 𝑑𝜃 . 3 (𝑟2 + (𝜌 − 𝑧)2 ) 2 By symmetry, we partition Earth into four congruent sections, the regions deter𝜋 𝜋 3𝜋 3𝜋 mined by 0 ≤ 𝜃 ≤ , ≤ 𝜃 ≤ 𝜋, 𝜋 ≤ 𝜃 ≤ , and ≤ 𝜃 ≤ 2𝜋. See Figure 20. 2 2 2 2 The sum over all downward attractions of point masses in each of the sections is exactly the same. Thus, when 𝜌 > 𝑅, the sum of all of these attractions in Earth is 𝜋/2

𝐵(𝜌, 𝑅) = 4 𝛼(𝜌, 𝑅) ∫ 0

where 𝑄 =

𝜌2 cos2 𝜃 𝑅2

𝜌

∫ ∫ −𝜌

√

𝜌2 −𝑧2 𝑄

𝑟(𝜌 − 𝑧) 3

𝑑𝑟 𝑑𝑧 𝑑𝜃,

(14)

(𝑟2 + (𝜌 − 𝑧)2 ) 2

0

+ sin2 𝜃. The outermost limits of integration are obtained 𝜋

by observing that 𝜃 ranges from 0 to , the section of one-quarter of Earth. In 2 each 𝜃 cross-section of this quarter of Earth, 𝑧 ranges from −𝜌 to 𝜌, from one side of the equator to its antipodes, as can be seen in Figure 21b. Thus the limits of integration for the middle integral are −𝜌 and 𝜌. Finally, the set of all points for

286

Chapter VIII: Classic Elliptical Fractions

fixed 𝜃 and fixed 𝑧 values is a line segment, shown as a dotted line in Figure 21b. The left-hand endpoint of this segment is 𝑟 = 0, and the right-hand endpoint is obtained by solving Equation (12) for 𝑟. Thus the limits of integration for the innermost integral are 0 and

𝜌2 −𝑧2

. Again, by straightforward techniques (see √ 𝑄 Exercise 4b), Equation (14) simplifies to Equation (13). Now we check the result of these last two propositions with Newton’s claim.

Proposition 20: Newton’s ratio. With 𝐴(𝜌, 𝑅) and 𝐵(𝜌, 𝑅) as in Propositions 17 and 19, 𝐴(101, 100) 501 . ≈ 500 𝐵(101, 100) Proof. By a CAS,6 𝐴(101, 100)/𝐵(101, 100) ≈ 1.001985254, which is almost the same as 501/500 = 1002/1000 = 1.002. Example 21: The continued fraction for Newton’s ratio.∗ As a whimsical 501 question we ask: Is Newton’s estimate of the best fraction (with denominator near 500) for the real number 𝜔 =

500 𝐴(101, 100) 𝐵(101, 100)

≈ 1.001985254?

From Chapter VIII, the first few harmonic convergents for 𝜔 are 1 →

505 1514 → . 504 1511

Checking the distances of the fractions || 501 − 𝜔|| ≈ 0.000015 | 500 |

501 500

and

and

505 504

from 𝜔 gives

|| 505 − 𝜔|| ≈ 0.0000028. | 504 |

505

501

is about an order of magnitude better than . But of That is, the convergent 504 500 course, Newton’s fraction is certainly close enough to 𝜔 for all practical purposes. Nevertheless it is fun to wonder what Newton may have done with a computer.7 ♢ To account for the force of gravity on a unit mass at the equator of a rotating Earth, we need a little physics. Proposition 22: Centripetal force. The outward acceleration 𝑎 on a mass whose position at time 𝑡 is 𝑝(𝑡) = 𝜌(cos 𝜔𝑡, sin 𝜔𝑡, 0) is 𝑎 = 𝜌𝜔2 . The force of this acceleration on a unit mass 𝑃 at the equator is also 𝜌𝜔2 . 6 See

Code 13 in Appendix III for a way to implement this calculation.

7 Paul Nahin, a prolific writer of popular mathematics, raises this question in Number Crunching

[108, pp. 305–323].

Newton’s case for a ﬂattened Earth∗

287

y y = 230/229

1.0044

1.0040 3953.5 1.0036 3950

thousands of miles

R 3975

Figure 22. The graph of 𝑓(𝑅) = 𝐴(𝜌0 , 𝑅)/𝐶(𝜌0 , 𝑅) versus

230 229

.

Proof. The second derivative of 𝑝(𝑡) is 𝑝″ (𝑡) = −𝜌𝜔2 (cos 𝜔𝑡, sin 𝜔𝑡, 0). This is the centripetal acceleration towards the center of this circle which must be applied to keep the mass on the path 𝑝(𝑡). Thus the outward acceleration (away from the center) has magnitude 𝜌𝜔2 . Definition 23: The length of a sidereal day. A sidereal day is the average length of time 𝑇 it takes for Earth to complete one rotation about its axis with respect to the background of the fixed stars. 𝑇 is about 23 hours, 56 minutes, and 4 seconds. Equivalently, 𝑇 ≈ 86 164 seconds. Proposition 24: Gravity at the equator of a rotating Earth. Let 𝜌 > 𝑅. The gravity, denoted by 𝐶(𝜌, 𝑅), acting on a unit mass 𝑃 at the equator of a rotating Earth with period 𝑇 is 𝐶(𝜌, 𝑅) = 𝐵(𝜌, 𝑅) − 𝜌 (

2𝜋 2 ) . 𝑇

(15)

Proof. Let mass 𝑃’s position at time 𝑡 be 𝑝(𝑡) = 𝜌(cos 𝜔𝑡, sin 𝜔𝑡, 0) where 𝜔 = By Propositions 19 and 22, 𝐶(𝜌, 𝑅) is given by Equation (15).

2𝜋 𝑇

.

For the final steps in calculating Δ𝑟 (the amount by which Earth is flattened at the poles), Newton needed an approximation for the ratio of gravity at the pole and at the equator and an approximate value of Earth’s equatorial radius 𝜌. He 𝑎 assumed that at Paris, a ball falls = 2174 Paris lines in one second within a vac2 uum, where 𝑎 is the acceleration due to gravity at Paris.8 With this information, 8 There are 12 Paris lines to a Paris inch. Each Paris foot is composed of twelve Paris inches. And a Paris foot is about 12.79 English inches. As to why Newton used Paris units rather than English or London units, Newton respected the observational/experimental skill of Giovanni Cassini and felt free to use his data.

288

Chapter VIII: Classic Elliptical Fractions

Figure 23. Zeroing in on Δ𝑟. Newton estimated that the ratio of gravity at the north pole and at the equator is as 230 is to 229. Finally, Newton used Cassini’s estimate for 𝜌, namely 𝜌 = 3971 miles. Proposition 25: Newton’s guess. If the ratio of polar gravity to equatorial gravity is 230/229 and if Earth is uniformly dense and 𝜌 = 𝜌0 = 3971 miles ≈ 6389.3 km, then Δ𝑟 ≈ 17.1 miles ≈ 27.5 km. Proof. Let 𝑓(𝑅) = 𝐴(𝜌0 , 𝑅)/𝐶(𝜌0 , 𝑅). Figure 22 shows the graph of 𝑓 as 𝑅 ranges 230 from 3950 to 3975.9 As can be seen, 𝑓(3953.5) ≈ . The difference between 229 3971 and 3953.5 is 17.5 miles, very near Newton’s guess of 17.1 miles. With better data, we might imagine that Newton’s guess for Δ𝑟 would improve. Yet reaching the north pole and taking gravity measurements there was science fiction in Newton’s day.10 Puzzle 26: A better ratio. Calculate the value for Δ𝑟 in Newton’s model of Earth given that the acceleration due to gravity at the north pole is 𝑔𝑝 ≈ 9.832 m/sec2 and at the equator is 𝑔𝑒 ≈ 9.781 m/sec2 [91, p. 40]. To solve this puzzle, we use the 9 See

Code 13 in Appendix III to render this graph. for first reaching the north pole (or coming very close to it) is usually given to Robert Peary and his team of 1909. 10 Credit

The French expeditions to Peru and Lapland

289

method of minimizing the square root of the sum of the squares of the errors.11 Let ℎ(𝜌, 𝑅) = √(𝐴(𝜌, 𝑅) − 𝑔𝑝 )2 + (𝐶(𝜌, 𝑅) − 𝑔𝑒 )2 . To find 𝜌 and 𝑅 values for which ℎ(𝜌, 𝑅) is close to 0, we plot ℎ’s contours. That is, we say that contour 𝑐 is the set of all points (𝜌, 𝑅) for which ℎ has value 𝑐. With a computer algebra system,12 we plot the contours of ℎ(𝜌, 𝑅) where ℎ is 0.01 and 0.005 m/sec2 , obtaining Figure 23. This information suggests that a close approximation for Δ𝑟 occurs at (𝜌, 𝑅) ≈ (3972, 3938) for a difference of 44 miles, over twice the approximation given in Proposition 25. Ironically, with better data,13 Newton would have gotten worse results, whereas with his slightly flawed data values, Newton came very close to the actual value of Δ𝑟. ♢

The French expeditions to Peru and Lapland As noted in the previous sections of this chapter, both Cassini and Newton assumed that Earth is an ellipsoid. Its profile is given parametrically by (𝑥, 𝑦) = (𝜌 cos 𝜙, 𝑅 sin 𝜙),

(16)

where 𝜌 is Earth’s equatorial radius, 𝑅 is Earth’s polar radius, and 𝜙 is a parameter with 0 ≤ 𝜙 ≤ 2𝜋, a representation discussed in Proposition 9. They disagreed on which was larger, 𝜌 or 𝑅. In the early 1730s, as an effort to resolve this by then forty-plus-year stalemate, Pierre-Louis Moreau de Maupertuis (1698–1759) successfully lobbied the French Academy of Sciences to launch expeditions both to the far north and to the equator to measure a degree of arc length along lines of longitude on Earth’s surface. Meanwhile, Maupertuis’s mathematics mentor Johann Bernoulli had written to him about the foolhardiness of these expeditions: Tell me, do the observers [of the proposed expeditions] have a predilection for one or the other of the two sentiments? Because if they favor the flattened Earth, they will find it flattened; if on the contrary, they are imbued with the idea of the elongated Earth, their observations will not fail to confirm its elongation; the difference between the compressed spheroid and the elongated is so slight, that it is easy to be mistaken if one wants to be mistaken in favor of one or the other opinion. [155, pp. 94–95] 11 By virtue of letting 𝑃 be a unit mass on Earth’s surface in Definitions 16 and 18, we may interpret the units of 𝐴(𝜌, 𝑅) and 𝐶(𝜌, 𝑅) as either newtons (a force) or m/sec2 (an acceleration). 12 See Code 13 in Appendix III to generate the contour graph. 13 One of the reasons why this new estimate of Δ𝑟 is over 30 miles too much is that Earth fails to be uniformly dense, a key Newtonian assumption.

290

Chapter VIII: Classic Elliptical Fractions

Figure 24. A commemorative pyramid marking the northernmost node of the arctic expedition network, Kittisvaara, Finland; photo by author, 2012. The chosen site at the equator was near Quito in South America, a territory controlled by Spain, and the chosen site for the far north was in Lapland, controlled by Sweden. Diplomatic agreements were made between heads of state to allow a French expedition to take measurements on non-French soil. To facilitate these concessions and for both Spain and Sweden to maintain watch on a team of potential spies and contraband artists, Spain appointed two young navy lieutenants fresh from the military academy to work together as equals with the French in the Viceroyalty of Peru, while the Swede Anders Celsius, of the Celsius thermometer,14 had early on attached himself to the French team going to the arctic circle. To make their measurements, the geodesic teams laid out a network of triangles, most of the vertices of which were atop mountains. Some of these vertices are now national landmarks in Ecuador and Finland. For example, Figure 24 shows the marker at latitude 66∘ N in Finland. The angles at each of the network’s vertices were measured using the best of English optics. One edge, the baseline, from each network was measured using rods laid end to end. In Peru, the baseline is now the site of Quito’s airport. In Finland, the baseline lay mostly along the frozen Tornio River. Figure 25 shows the French geodesic team taking measurements under the light of the aurora borealis during wintertime and perpetual darkness. By the summer of 1737, Maupertuis’s team had returned from Lapland with the arc length along one degree of longitude near the arctic circle measured as 69.52 miles (57 395 toises), and, in 1744, the equatorial team 14 Unlike in the modern Celsius scale, Celsius originally set the freezing point of water at 100∘ and the boiling point at 0∘ .

The French expeditions to Peru and Lapland

291

Figure 25. Measuring the baseline along the Tornio River at the arctic circle; sketch by J. Ansseau, 1882 [46]. arrived at the measurement of 68.76 miles (56 768 toises) for one degree of arc at the equator [70, p. 227]. How can we use this information to determine Δ𝑟? The answer involves a little calculus.

How Δ𝑠 gives Δ𝑟 Definition 27: Latitude of an ellipse. By the latitude 𝜃 at a point 𝑃 on the ellipse as given by Equation (16), we mean the angle 𝜃 between a normal to the ellipse at 𝑃 and a line through 𝑃 parallel to the 𝑥-axis. The polar angle 𝜓 at a point 𝑃 on the ellipse is the angle at the origin 𝑂 measured between the positive 𝑥-axis and ray 𝑂𝑃. How does the parameter 𝜙 of the ellipse (𝑥, 𝑦) = (𝜌 cos 𝜙, 𝑅 sin 𝜙) relate to the latitude 𝜃 of the ellipse? Figure 26 shows the profile of a planet where 𝜌 = 2 and 𝑅 = 1. Proposition 28: Parameter, latitude, and polar relations. Consider the parametrized ellipse 𝑃(𝜙) = (𝜌 cos 𝜙, 𝑅 sin 𝜙). The latitude 𝜃 corresponding to parameter 𝜙 is 𝜌 𝜃(𝜙) = tan−1 ( tan 𝜙) 𝑅 and the polar angle 𝜓 corresponding to parameter 𝜙 is 𝑦 𝑅 𝜓(𝜙) = tan−1 ( ) = tan−1 ( tan 𝜙). 𝑥 𝜌 Proof. The slope of the tangent line to the parametrized curve at 𝜙 is given by 𝑑𝑦 𝑑𝑦/𝑑𝜙 𝑅 cos 𝜙 𝑅 = = = − cot 𝜙, 𝑑𝑥 𝑑𝑥/𝑑𝜙 −𝜌 sin 𝜙 𝜌

292

Chapter VIII: Classic Elliptical Fractions

y θ = 66.5° ψ ≈ 29.9°

x

corresponding to φ ≈ 0.885019

Figure 26. An extreme model of Earth’s profile. which means that the latitude 𝜃 corresponding to parameter 𝜙 is 𝜃(𝜙) = tan−1 (

𝜌 tan 𝜙) 𝑅

or, equivalently, that the parameter 𝜙 corresponding to latitude 𝜃 is 𝜙(𝜃) = tan−1 (

𝑅 tan 𝜃). 𝜌

Meanwhile, the polar angle 𝜓 corresponding to parameter 𝜙 is 𝑦 𝑅 𝜓(𝜙) = tan−1 ( ) = tan−1 ( tan 𝜙). 𝑥 𝜌 Example 29: Arc length at the arctic circle and at the equator. For the ellipse of Figure 26, if 𝜃 = 66.5∘ N, the latitude of the arctic circle, then 𝜃’s radian measure is 1.16064 radians, which corresponds to the parameter value 𝜙(1.16064) ≈ 0.855019, so that 𝜓(0.855019) ≈ 0.521805 radians, or about 29.9∘ . ♢ Before we show the results of the French expeditions, we pause to solve Puzzle 4. Example 30: A solution to Puzzle 4. By Proposition 28, let 𝜙1 and 𝜙2 be 𝜙 𝑅

parameters corresponding to 10∘ and 80∘ , namely, 𝜙1 = tan−1 ( tan(10𝜋/180)) 𝜌

and 𝜙2 = tan

−1

𝑅

( tan(80𝜋/180)). The arc length from the equator to 10∘ on 𝒪’s 𝜌

surface is 𝜙1

𝑄𝑡 (𝜌, 𝑅) = ∫ 0

2 2 2 2 √𝜙 sin 𝜙 + 𝑅 cos 𝜙 𝑑𝜙,

293

nt

10 −5

R 7.0

co

co nto

R

ou r

0.1 ur

7.5

(9.132711, 7.213024)

10

−4

The French expeditions to Peru and Lapland

7.21300 8.5

ρ

9.13275 9.13270 ρ

9.5

a. A first zoom.

10 −4

contour 0.01

co nt ou r

6.5

co nt o

ur

7.21305

b. A closer zoom.

Figure 27. Zooming in on the solution to Puzzle 4. where 𝑡 represents ten degrees. The arc length from 80∘ to the north pole on 𝒪’s surface is 𝜋 2

𝑄𝑛 (𝜌, 𝑅) = ∫ √𝜙2 sin2 𝜙 + 𝑅2 cos2 𝜙 𝑑𝜙, 𝜙2

where n represents the north pole. Then the square root of the sum of the squares of the differences between 𝑄𝑡 and 1 and between 𝑄𝑛 and 2 is 𝒟 = √(𝑄𝑡 − 1)2 + (𝑄𝑛 − 2)2 , a function in terms of 𝜌 and 𝑅. We wish to find values 𝜌 = 𝜌0 and 𝑅 = 𝑅0 for which 𝒟 is zero. As in Puzzle 26, we plot the contours of 𝒟. To obtain a reasonable first-guess range of points over which to plot various contours of 𝒟, we know that the arc length from the equator should be somewhere between 9 and 18 units, so the perimeter of the ellipse is between 36 and 72. Circles of these circumferences should have radii between about 6 and 12 units. Figure 27a shows various contours as 𝜌 ranges from 8 through 10 units and as 𝑅 ranges from 6 to 8 units. Figure 27b shows a much smaller window. The dot near the center point of these contours is the point at which 𝒟 would be 0. We approximate the coordinates of this point as (𝜌0 , 𝑅0 ) ≈ (9.132711, 7.213024). Checking our work gives 𝑄𝑡 (𝜌0 , 𝑅0 ) ≈ 0.99999994 and 𝑄𝑛 (𝜌0 , 𝑅0 ) ≈ 1.9999905. Let 𝜔 = 𝑅0 /𝜌0 ≈ 0.78979987. The first few convergents for 𝜔 as given by the harmonic algorithm of Chapter VII are 1 →

4 15 124 511 1657 → → → → . 5 19 157 647 2098

Chapter VIII: Classic Elliptical Fractions

Polar radius R in miles

294

3957

contour 0.005 3956

contour 0.01 3975 3974 Equatorial radius ρ in miles

Figure 28. The polar and the equatorial expeditions’ 𝜌 and 𝑅.

For ellipsoid 𝒪, the ratio of its polar axis to its equatorial axis is about | 15 − 𝜔| ≈ 0.0003). | 19 |

15 19

(since ♢

Proposition 31: Vindicating Newton’s guess. The French Academy’s mideighteenth century conclusion was that Δ𝑟 ≈ 17.3 miles. Proof. Recall that the expedition’s results for the lengths of one degree along meridians at the arctic circle and equator were 69.52 and 68.76 miles, respectively. To use these two measurements to determine Earth’s shape, we take 𝜃1 , 𝜃2 , and 𝜃3 as the respective radian measures of 66∘ , 67∘ , and 0.5∘ N. Let 𝜙1 = 𝜙(𝜃1 ), 𝜙2 = 𝜙(𝜃2 ), and 𝜙3 = 𝜙(𝜃3 ). The arc length of one degree along the meridian at the arctic circle is 𝜙2 2 2 2 2 √𝜌 sin 𝜙 + 𝑅 cos 𝜙 𝑑𝜙,

𝑄𝑎 (𝜌, 𝑅) = ∫

(17)

𝜙1

where 𝑎 represents the arctic circle, and the arc length of one degree along the meridian at the equator is 𝜙3

𝑄𝑒 (𝜌, 𝑅) = 2 ∫

2 2 2 2 √𝜌 sin 𝜙 + 𝑅 cos 𝜙 𝑑𝜙,

(18)

0

where 𝑒 represents the equator. To find the 𝜌 and 𝑅 values that best reflect the two given arc length values, we use least squares and minimize ℳ = √(𝑄𝑎 − 69.52)2 + (𝑄𝑒 − 68.76)2 ,

(19)

A ﬁnal riddle

295

A O

D

B = (b, 0)

2πωt

Figure 29. Planets 𝒜 and ℬ about the Sun. where 𝑄𝑎 and 𝑄𝑒 are from Equations (17) and (18). Figure 28 is a contour plot15 of ℳ. The region within the larger oval represents all values (𝜌, 𝑅) for which ℳ is no larger than 0.01 miles. Hence the two geodesic expeditions launched by Louis XV give 𝜌 ≈ 3974.2 and 𝑅 ≈ 3956.9 miles, for a difference of Δ𝑟 ≈ 17.3 miles, uncannily close to Newton’s original estimate of 17.1 miles. The result of Proposition 31 is uncanny in that in later years it was determined that Maupertuis’s arctic team overestimated by about 0.25 miles. With this new arc length value of 69.27 miles and the equatorial team’s old value of 68.76 miles, we get 𝜌 ≈ 3962.8 miles and 𝑅 ≈ 3951.3 miles, so that Δ𝑟 ≈ 11.5 miles, not too far afield from satellite measurements of 𝜌 ≈ 3964.1 miles, 𝑅 ≈ 3950.8 miles, and Δ𝑟 ≈ 13.3 miles.

A ﬁnal riddle Puzzle 32: A planetary riddle. Which two planets are nearest Earth? This question was asked on a popular televised quiz show, as noted in [123]. From Table 1, the semi-major axial distances in astronomical units (AU) from the Sun of the first five planets—Mercury, Venus, Earth, Mars, and Jupiter—are 0.387, 0.723, 1, 1.524, and 5.203, respectively. The differences of these distances from unity are 0.613, 0.277, 0, 0.524, and 4.203. Thus the reader might be tempted to answer Venus and Mars. But rare is the time when all the planets are aligned as in a bus queue. Instead, if we project each planet onto the orbital plane of Earth, at any particular time these planetary projection points are scattered higgledy-piggledy about that plane with respect to Earth’s position. Thus a better criterion to use when answering the question might be average distance from Earth. For a first attempt at 15 See

Code 14 in Appendix III for how to generate this graphic image.

296

Chapter VIII: Classic Elliptical Fractions

solving the puzzle, we assume that Mercury, Venus, Earth, and Mars have circular orbits about the Sun and proceed in their orbits with simple harmonic motion. We also assume that the orbital planes of all of these planets are the same. As we did in the last section of Chapter VII, let 𝒜 and ℬ be two athletes or planets running around a circular track with center 𝑂 = (0, 0). However, suppose that 𝒜 runs with period 𝐴 in an inner lane of radial distance 𝑎 units from 𝑂, and ℬ runs with period 𝐵 in an outer lane of radial distance 𝑏 from 𝑂. The ratio of their periods (ℬ to 𝒜) about the track is 𝜔0 = 𝐵/𝐴. Assuming that 𝒜’s period is shorter, as she has less distance to run to complete a circuit of the track, gives 𝜔0 > 1. For simplicity, we imagine ℬ is stationary, running in place. Thus, with respect to ℬ’s position fixed at (𝑏, 0), 𝒜’s position is 𝑎(cos 2𝜋𝜔𝑡, sin 2𝜋𝜔𝑡) where 𝑡 is time and 𝜔 = 𝜔0 − 1 > 0, which we refer to as the relative angular velocity of 𝒜 to ℬ. At 𝑡 = 0, 𝒜 is at (𝑎, 0). At 𝑡 = 𝑛 units, where 𝑛 is an integer, ℬ has run in place a distance equivalent to completing 𝑛 circuits of the track. As illustrated in Figure 29, the distance 𝐷(𝑡) between 𝒜 and ℬ is 𝐷(𝑡) = √(𝑎 cos 2𝜋𝜔𝑡 − 𝑏)2 + 𝑎2 sin2 2𝜋𝜔𝑡 = √𝑎2 + 𝑏2 − 2𝑎𝑏 cos 2𝜋𝜔𝑡.

(20)

Let 𝒜 be Mercury and ℬ be Earth. Let 𝑎 = 0.387 astronomical units (AU), Mercury’s distance from the Sun. Let 𝑏 = 1 AU, Earth’s distance from the Sun. And let 𝜔 ≈ 365.26/87.96 − 1 ≈ 3.153. From the harmonic algorithm of Chap22 41 ter VII, two good approximations for 𝜔 are ≈ 3.143 and ≈ 3.154. Choosing 7 13 the latter option means that in thirteen years, Mercury has lapped Earth fortyone times, and at thirteen years, the Sun, Mercury, and Earth, in that order, are almost collinear. To say it another way, in the time that Earth completes thirteen circuits, Mercury has completed fifty-four circuits. Let 𝐷1 (𝑡) be the distance between 𝒜 and ℬ, Equation (20). The average distance between Mercury and Earth over thirteen years is about 13

13

1 1 ∫ 𝐷1 (𝑡) 𝑑𝑡 = ∫ √𝑎2 + 𝑏2 − 2𝑎𝑏 cos 2𝜋𝜎𝑡 𝑑𝑡 ≈ 1.038, 13 0 13 0

(21)

where 𝑎 = 0.387, 𝑏 = 1, and 𝜔 = 3.153. Because the integrand of Equation (21) is periodic, its average value should also be 1 𝜔

𝜔 ∫ √𝑎2 + 𝑏2 − 2𝑎𝑏 cos 2𝜋𝜔𝑡 𝑑𝑡 ≈ 1.038,

(22)

0

matching the result of Equation (21). With respect to Earth and Venus, let 𝒜 be Venus and ℬ be Earth. This time, let 𝑎 = 0.723 AU, Venus’s distance from the Sun. Let 𝜔 = 365.26/224.70 − 1 ≈ 5 5 0.6255 ≈ , where 𝜔 is the relative angular velocity, and where is a good ap8 8 proximation to 𝜔 given by the harmonic algorithm of Chapter VII. Let 𝐷2 (𝑡) be

A ﬁnal riddle

297

the distance between 𝒜 and ℬ, Equation 20. Thus the average distance between Venus and Earth is 8

8

1 1 ∫ 𝐷2 (𝑡) 𝑑𝑡 = ∫ √𝑎2 + 𝑏2 − 2𝑎𝑏 cos 2𝜋𝜔𝑡 𝑑𝑡 ≈ 1.135, 8 0 8 0 since Venus laps Earth five times in eight years. With respect to Earth and Mars, let 𝒜 be Earth and ℬ be Mars. This time, 37 let 𝑎 = 1 AU, 𝑏 = 1.524 AU, 𝜔 = 686.98/365.26 − 1 ≈ 0.8808 ≈ , and 𝐷3 (𝑡) be 42 the distances of Earth and Mars from the Sun, respectively, their relative angular 37 velocity, and the distance between them as given by Equation (20), where is a 42 good approximation to 𝜔 by the harmonic algorithm. Then the average distance between Earth and Mars is 42

42

1 1 ∫ 𝐷3 (𝑡) 𝑑𝑡 = ∫ √𝑎2 + 𝑏2 − 2𝑎𝑏 cos 2𝜋𝜔𝑡 𝑑𝑡 ≈ 1.693, 42 0 42 0 since Earth laps Mars thirty-seven times in forty-two Martian years. Therefore, the two planets nearest to Earth in general, in order, are Mercury and Venus. ♢ Puzzle 33: The final riddle revisited.∗ Compared with simple harmonic motion, how does the average distance between planets change when assuming an inverse square law for gravity? Since the orbits of Mercury, Earth, Venus, and Mars are nearly circular— having respective approximate eccentricities of 0.21, 0.0068, 0.017, and 0.0934— the answers should be nearly the same as those already calculated. Since Mercury’s eccentricity is greatest in this list, we focus on the average distance between Earth and Mercury. In solving this puzzle for Mercury and Earth, we use some equations not derived in this book. As before, we assume that the orbital planes of Mercury and Earth are the same. Furthermore, we assume that at time 𝑡 = 0, the Sun, Mercury, and Earth are collinear, and that Mercury and Earth are at their perihelia, the points in their orbits nearest the Sun. We assume Kepler’s laws of motion. As Newton showed, the radial distance 𝑟 of a planet from the Sun is (1 − 𝑒2 )𝑎 ℎ2 /𝑘 = , (23) 1 + 𝑒 cos 𝜃 1 + 𝑒 cos 𝜃 where 𝑒 is the planet’s orbital eccentricity, 𝑎 is the semi-major axial length of the elliptical orbit, 𝜃 is the polar angle from the Sun between the positive 𝑥-axis and the planet’s present position, 𝑘 is a gravitational constant, and ℎ is a constant of angular momentum, namely, 𝑑𝜃 ℎ = 𝑟2 , (24) 𝑑𝑡 where 𝑡 is time. 𝑟=

298

Chapter VIII: Classic Elliptical Fractions

4

Sun 0.2

2 40

20

80

0.1

t, in days

a. Angular displacement versus time.

b. Mercury’s orbit, 𝑒 ≈ 0.21.

Figure 30. Mercury’s orbit over time. Integrating Equation (24) using Equation (23) gives16 ℎ3

𝑡(𝜃) =

𝑘2 (1 − 𝑒2 )

3 2

(𝜃 − 2 tan−1 (

𝑒 sin 𝜃 1 + √1 − 𝑒2 + 𝑒 cos 𝜃)

)−

𝑒√1 − 𝑒2 sin 𝜃 ), 1 + 𝑒 cos 𝜃

where 𝑡(𝜃) gives the time at which the planet’s polar angle is 𝜃. To avoid dealing with values of ℎ and 𝑘, let 𝑒 sin 𝜃 𝑇 𝑡(𝜃) 𝑇 𝑒√1 − 𝑒2 sin 𝜃 = (𝜃 − 2 tan−1 ( ), )− 2𝜋 1 + 𝑒 cos 𝜃 𝑡(2𝜋) 1 + √1 − 𝑒2 + 𝑒 cos 𝜃) (25) where 𝑇 is a planet’s period, 𝑒 is the planet’s eccentricity, and 𝜏𝑒, 𝑇 (𝜃) gives the time at which the planet’s polar angle is 𝜃. From Equation (25) we can find 𝜃 in terms of time 𝑡, 𝜃 = 𝜏𝑒,−1𝑝 (𝑡), which we denote by 𝜃𝑄 (𝑡), where 𝑄 refers to a planet such as Earth, 𝐸, or Mercury, 𝑀. Figure 30a shows Mercury’s orbital angle displacement 𝜃 versus time 𝑡 in days. Figure 30b shows Mercury’s orbit as a series of dots at noon on each of eighty-eight consecutive days; when Mercury is near the Sun the dots are relatively far apart (showing that Mercury is moving relatively quickly), whereas when Mercury is far from the Sun the dots are relatively close together (showing that Mercury is moving relatively slowly). As already noted, the Sun, Mercury, and Earth realign after thirteen years. From a table17 of angular displacements for both Mercury and Earth over each day for the next thirteen years, Mercury and Earth on day 𝑛 are 𝐷(𝑛) astronomical units apart: 𝜏𝑒, 𝑇 (𝜃) =

‖ ‖ 𝐷(𝑛) = ‖𝑟𝐸 (𝜃𝐸 (𝑛))(cos 𝜃𝐸 (𝑛), sin 𝜃𝐸 (𝑛)) − 𝑟𝑀 (𝜃𝑀 (𝑛))(cos 𝜃𝑀 (𝑛), sin 𝜃𝑀 (𝑛))‖ ‖ ‖ (26) 16 The 17 See

details of this derivation appear in [131, pp. 65, 124, 310]. Code 15 of Appendix III for an example of how to generate such a table with a CAS.

Exercises

299

where 𝑟𝐸 and 𝑟𝑀 are instances of Equation (23) for Earth 𝐸 and Mercury 𝑀, and ‖𝑍‖ = √𝑍 ⋅ 𝑍 represents18 the magnitude of vector 𝑍. To approximate the average value of 𝐷(𝑡) over the next thirteen years, we reason numerically; we sum Equation (26) over the days in this time period and then divide by thirteen years, yielding 13⋅365

1 ∑ 𝐷(𝑛) ≈ 1.039 AU, 13 ⋅ 365 𝑛=1 which is almost the same result as when we approximated planetary motion using simple harmonic motion, Equation (22). Repeating a similar set of calculations with respect to Earth and Venus yields almost the same result as before, namely, an average distance of 1.136 AU. ♢ As can be seen, when planetary orbits are nearly circular, simple harmonic motion—even though it only approximates reality—often gives extremely good approximations in predicting what will happen, a rule of thumb we exploit in the final three chapters of this book. Exercises 1. In Figure 8, each of the fifty-four city-states are represented by a castle bearing a binary coded number. The binary number on castle is the decimal number 43. What is the decimal number of the castle at the capital of Utopia? 2. (a) Derive Equation (6) starting with √(𝑥 + 𝑐)2 + 𝑦2 + √(𝑥 − 𝑐)2 + 𝑦2 = 2𝑎. (b) Derive the formula for the area of an ellipse. (c) Derive the formula for the volume of an ellipsoid of equatorial radius 𝜌 and polar radius 𝑅. (An outline appears in Appendix IV.) (d) Show that the parametric representations for 𝑥 and 𝑦 from Equation (16) satisfy

𝑥2 𝜌2

+

𝑦2 𝑅2

= 1. (An outline appears in Appendix IV.)

3. (a) Recall that at Paris, the acceleration due to gravity is 𝑎 = −2(2174) Paris lines per second per second. Newton estimated that the ratio of gravity at Paris and at the equator is 2 295 667 to 2 290 000. Recall also that there are about 2.54 centimeters in one English inch. Use this information to show that Newton’s estimate for Earth’s equatorial gravity was 𝑔𝑒 ≈ 9.785 m/sec2 . (b) Newton assumed that 𝑔𝑝 /𝑔𝑒 is approximately 230/229, where 𝑔𝑝 is Earth’s gravity at the north pole. Approximate Newton’s value for 𝑔𝑝 in the metric system. 18 The

dot product of two vectors 𝑋 = (𝑥1 , 𝑥2 ) and 𝑌 = (𝑦1 , 𝑦2 ) is 𝑋 ⋅ 𝑌 = 𝑥1 𝑦1 + 𝑥2 𝑦2 .

300

Chapter VIII: Classic Elliptical Fractions

P Peg 3 Peg 1 Peg 2 Figure 31. A curve with three foci. (c) Replicate the calculations appearing in Puzzle 26 using Newton’s values for 𝑔𝑒 and 𝑔𝑝 . 4. (a) Derive the orientation 𝐴 gravity formula, Equation (10). (b) Derive the orientation 𝐵 gravity formula, Equation (13). 5. (a) Estimate the length between the two poles in Weule’s hut as described on p. 274. (b) Draw the curve with three foci as shown in Figure 31. That is, imagine a length of rope—the dotted line in the figure—made into a loop of a given length. We want the curve of all points 𝑃 for which a hoe at 𝑃 holds the rope taut with all three pegs either on the path of the rope or within the loop of the rope. 6. (a) Show that the sum of constructible numbers is constructible. 𝑚𝑝 (b) Given any length 𝑝, show that is constructible where 𝑚 and 𝑛 are 𝑛 positive integers. (c) Given any length 𝑝, show that √𝑝 is constructible. (d) List a dozen constructible angles. (e) Find the semi-major and semi-minor axial lengths 𝑎 and 𝑏 for each ellipse given in Equation (7). What are the centers of these two ellipses? 7. (a) For an ellipse with semi-major and semi-minor axes of 𝑎 = 5 feet and 𝑏 = 3 feet, respectively, what are its arc lengths for one degree of arc centered at 0∘ and at 45∘ ? (b) Find the lengths of the semi-major and semi-minor axes of an ellipse if the length of one degree of arc centered at zero degrees is 1 meter and the length of one degree of arc centered at 45∘ is 1.5 meters. 8. In the Principia, Newton observed that one degree centered half-way between London and York at latitude 0.9203 radians is given as 69.41 miles (57 300 toises) and another degree centered half-way between Corbeil (just south of Paris) and Amiens at latitude 0.8593 radians is 69.12 miles (57 060 toises); see

Exercises

301

[110, p. 822] and [147, p. 67–77]. For the corresponding analog of ℳ in Equation (19), find the (𝜌, 𝑅) values that minimize the square root of the sum of the squares of the error and the corresponding Δ𝑟 value. 9. From 1792 to 1799, the French Academy sponsored an expedition19 to measure ten degrees of arc along a meridian through Paris from latitude 40 degrees to 50 degrees. Their idea was to define a meter to be one ten-millionth of the way from the equator to the north pole. However, an oblate earth (where the polar radius is less than the equatorial radius) fails to attain its average meridianal arc length value at 45∘ . How much of an error 𝐸 is incurred in making such an assumption? Assume that the Earth’s equatorial radius is 𝜌 = 6400 km, and take Newton’s estimate that Earth’s polar radius 𝑅 is shorter by 27.7 km (17.1 miles). With Earth’s profile as the parametric curve (𝜌 cos 𝜙, 𝑅 sin 𝜙), and with the parameter 𝜙 associated with latitude 𝜃 given by 𝜙 = tan−1 (𝑅 tan(𝜃)/𝜌), show that 𝐸 minus nine times the arc length from 40∘ to 50∘ is about 174 meters, which means that Newton’s assumption yields an error in determining the length of the meter of about 0.02 mm. 10. All other variables remaining the same, suppose that Earth’s eccentricity is 0.4 and Mercury’s eccentricity is 0.6, and calculate afresh the average distance between them over time.

19 The collection and analysis of the massive amount of attendant data for this scientific expedition marked the transition from the era of savants to the era of scientists.

Strand IX: The Cantor Set Strange as it might seem, once we adopt the real number system ℝ, when we measure the characteristics of objects or phenomena, such as mass or volume or intensity, with respect to standard units, such as the kilogram or the meter or the decibel, we almost always obtain a number that appears to be irrational. For example, measure the dimensions of your favorite breakfast cereal box to as many decimal places as possible, perhaps surpassing electron-microscope precision. Almost always, the decimal progression generated displays no apparent cyclical repetition in its digits. Because we must truncate our measurement at some degree of accuracy—making the resultant measured number by default a 𝑝 rational number , perhaps one with a very large integer denominator 𝑞—we 𝑞

more or less treat the number as having a degree of uncertainty. Oftentimes, we 𝑎 𝑝 𝑎 desire a rational number where approximates well, with integer 𝑏 much 𝑏

𝑏

𝑞

smaller than 𝑞. The next chapter is an introduction to the art of finding such approximations. In this strand we use a classic set of real numbers, the Cantor set1 , to illustrate the phenomenon discussed in the last paragraph, that rational numbers are rare numbers in the set of real numbers.

A lotus-ﬂower introduction To introduce the Cantor set, we consider a stylized image of the lotus flower from ancient Egypt. The lotus is the flower of the lily pad, as shown in Figure 1. At sunrise, the lotus rises above the water and opens. At sunset, the lotus closes and submerges for the night. The Egyptians used this flower to symbolize the cycle of life, of birth and death, of eternal renewal. It appeared extensively in their art and adorned many columns in their temples.

1 The Cantor set was initially constructed in 1874 by Henry John Stephen Smith and reintroduced by Georg Cantor in 1883.

303

304

Strand IX: The Cantor Set

a. A lotus flower.

b. A stylized lotus flower.

Figure 1. The lotus flower. Sometimes the lotus symbol is highly stylized, as in Figure 1b and in Figure 2. This latter figure displays the lotus at multiple scalings.2 In Figure 2a, the top of the column is adorned with four large lotus petals 𝑃0 . Smaller lotus petals, 𝑃1 , about half the height of 𝑃0 , rise from the base of those petals. In turn, another generation of petals, 𝑃2 , about half the height of 𝑃1 , rise from the base of the petals 𝑃1 , and so on, at least two more times. As shown in Figure 2b [33, Vol. I, Plate 26, Figure 5], an intriguing variation of the lotus flower was carved on a temple column on Philae Island in the Nile. The lotus flower has been stylized as a line segment. Beneath the upper tier of the largest line segments is a tier of line segments scaled by a factor of about one-third. In fact, it seems as if the second tier is the first tier with the middle third removed from each segment. Similarly, the third tier is the second tier with the middle third removed from each segment. And the fourth tier is the third 2 The artists and scientists who accompanied Napoleon on his 1799 Egyptian campaign reproduced these crest designs in their report, Description de l’Egypte.

a. Somewhat stylized.

b. Highly stylized.

Figure 2. Lotus flower ornamentation atop ancient Egyptian columns.

Ternary notation

305

Figure 3. The Cantor set. tier with the middle third removed from each segment. Figure 3 reproduces this pattern to the sixth tier.

Ternary notation The Cantor set is an idealization of this recursive pattern. To make this idea precise, we use base three. Observe that when we extend the base notation for integers in Definition I.10 to all real numbers, some numbers fail to have unique representations. For example, in decimal notation the integer one can be written 1 as both 1 and 0.999 …. Similarly, has two representations in base three. It can be 1

1

3

written as = (0.1)3 and = (0.0222 …)3 , where the former representation ter3 3 minates and the latter representation does not. With this idea in mind we make the following definition. Definition 1: Ternary notation. Let 𝑥 ∈ ℝ with 0 ≤ 𝑥 ≤ 1. We say that 𝑥 is in standard ternary form, or simply ternary form, if it is written in base three and uses the nonterminating representation when 𝑥 can be represented in two different ways. Following the convention of Chapter I, we let (𝑥)3 be the ternary form of 𝑥. When the context is clear, we may simplify this notation to 𝑥3 . The ternary expansion of 𝑥 is denoted by (𝑥)3 = (0.𝑎1 𝑎2 𝑎3 …)3 where 𝑎𝑗 is either 0, 1, or 2, for all positive integers 𝑗. Thus ∞

𝑥 = (0.𝑎1 𝑎2 𝑎3 …)3 =

𝑎 𝑎1 𝑎2 𝑎3 + 2 + 3 + ⋯ = ∑ 𝑛𝑛 . 3 3 3 3 𝑛=1

Example 2: One-fourth in ternary. To find the ternary representation of onefourth, write four as (11)3 and divide (11)3 into (1)3 , knowing that the quotient at each step of the algorithm can only be 0, 1, or 2. As illustrated in Figure 4, ∞

1 2 2 2 = (0.0202 …)3 = 2 + 4 + ⋯ = ∑ 2𝑛 . 4 3 3 3 𝑛=1

♢

306

Strand IX: The Cantor Set 11

0.0202 1.0000 22 100 22 1

Figure 4. Long division in base three. Definition 3: Scaling and translating sets. Let 𝐴 be a nonempty subset of ℝ. Let 𝛼, 𝛽 ∈ ℝ. We say that the set {𝛼𝑎| 𝑎 ∈ 𝐴}, denoted by 𝛼𝐴, is a scaling of 𝐴 and that {𝑎 + 𝛽| 𝑎 ∈ 𝐴}, denoted by 𝐴 + 𝛽, is a translation of 𝐴 by 𝛽. Definition 4: The Cantor set. Let 𝛼, 𝛽 ∈ ℝ with 𝛼 ≤ 𝛽. We say that the set of all real numbers 𝑥 such that 𝛼 ≤ 𝑥 ≤ 𝛽 is the closed interval from 𝛼 to 𝛽, denoted by 1 2 [𝛼, 𝛽]. Let 𝐶0 = [0, 1]. Let 𝐶1 = [0, ] ∪ [ , 1], the union of two closed intervals, 3

1

3

each of length . Observe that 𝐶1 is obtained from 𝐶0 by removing its middle 3 third. Let 2 1 2 7 8 1 𝐶2 = [0, ] ∪ [ , ] ∪ [ , ] ∪ [ , 1]. 9 9 3 3 9 9 Observe that 𝐶2 is obtained from 𝐶1 by removing the middle third interval from each of the two intervals comprising 𝐶1 . Another way to view the construction of 𝐶1 from 𝐶0 and of 𝐶2 from 𝐶1 is to scale and translate and form unions. That 1 1 2 1 1 2 is, 𝐶1 = 𝐶0 ∪ ( 𝐶0 + ), and 𝐶2 = 𝐶1 ∪ ( 𝐶1 + ). For each 𝑛 ∈ ℕ, let 1

3

1

3

2

3

3

3

3

𝐶𝑛+1 = 𝐶𝑛 ∪ ( 𝐶𝑛 + ). The Cantor set, 𝐶, is the intersection of all 𝐶𝑛 , namely, 3

3

3

𝐶=

∩

𝑛≥0

𝐶𝑛 .

As a useful tool in analyzing the Cantor set, we define the digit operator on 𝑥, denoted by dig(𝑥, 𝑛), as the integer dig(𝑥, 𝑛) = 𝑎𝑛 , the 𝑛th ternary digit in 𝑥 for positive integers 𝑛. Proposition 5: The Cantor set is nonempty. Let 𝐶 be the Cantor set. Let 𝑥 = (0.𝑎1 𝑎2 𝑎3 …)3 be a real number between 0 and 1. Then 𝑥 ∈ 𝐶 if and only if 𝑎𝑛 is either 0 or 2 for every 𝑛 ∈ ℤ+ . Proof. Observe that 𝐶0 = [0, 1]. By definition of 𝐶1 , 𝑥 ∈ 𝐶1 if and only if 𝑎1 = 0 or 𝑎1 = 2. Let 𝑛 ∈ ℕ and assume that 𝑥 ∈ 𝐶𝑛 if and only if none of the first 𝑦 𝑛 ternary digits of 𝑥 are 1. Let 𝑥 ∈ 𝐶𝑛+1 . There exists 𝑦 ∈ 𝐶𝑛 with 𝑥 = or 𝑦

2

3

𝑥 = + . In either case, dig(𝑥, 𝑘 + 1) = dig(𝑦, 𝑘) ∈ {0, 2} for all integers 𝑘 with 3 3 1 ≤ 𝑘 ≤ 𝑛. In the former case, dig(𝑥, 1) = 0, and in the latter case, dig(𝑥, 1) = 2. Conversely, let 𝑥 ∈ [0, 1] with dig(𝑥, 𝑖) = 𝑎𝑖 ∈ {0, 2} for integers 𝑖, 1 ≤ 𝑖 ≤ 𝑛 + 1.

Ternary notation

307

If 𝑎1 = 0, then 3𝑥 ∈ 𝐶𝑛 and 𝑥 = 1

2

1 3

(3𝑥). If 𝑎1 = 2, then 3𝑥 − 2 ∈ 𝐶𝑛 and

𝑥 = (3𝑥 − 2) + . So 𝑥 ∈ 𝐶𝑛+1 if and only if 𝑥’s first 𝑛 + 1 ternary digits are 0 or 3 3 2. By mathematical induction we conclude that for each 𝑛 ∈ ℕ, 𝑥 ∈ 𝐶𝑛 if and only if 𝑎𝑘 is never 1 for all positive integers 𝑘 with 𝑘 ≤ 𝑛. Let 𝑥 ∈ 𝐶. Then 𝑥 ∈ 𝐶𝑛 for all 𝑛 ∈ ℕ so that none of the ternary digits of 𝑥 are 1. The next proposition demonstrates an especially surprising property of 𝐶. Exercise 1 outlines a few more Cantor set properties. Observe that 𝐶0 consists of one closed interval of length 1. 𝐶1 consists of two 1 2 disjoint closed intervals each of length for a combined length of . In general, 3

1

3

2

𝐶𝑛 consists of 2𝑛 pairwise-disjoint intervals each of length 𝑛 . Since ( )𝑛 → 0 as 3 3 𝑛 increases, it would seem as if the Cantor set should be sparsely populated with real numbers. Proposition 5 shows that the Cantor set has an infinite number of elements. However, the next proposition shows that 𝐶 has as many elements as the closed interval [0, 1]. Proposition 6: The cardinality of the Cantor set. Let 𝑓 ∶ 𝐶 → [0, 1] be defined by 𝑓((0.𝑎1 𝑎2 𝑎3 …)3 ) = (0.𝑏1 𝑏2 𝑏3 …)2 , where 𝑥 = (0.𝑎1 𝑎2 𝑎3 …)3 and 𝑏𝑘 = 𝑎𝑘 /2 for all integers 𝑘 ≥ 1, for all 𝑥 ∈ 𝐶. Thus the cardinality of 𝐶 is the same as the cardinality of the set of real numbers between 0 and 1. Furthermore, this cardinality exceeds that of the set of rational numbers in the unit interval. Proof. Let 𝑥 ∈ 𝐶. Then, by Proposition 5, every digit in (𝑥)3 is divisible by two. The only difference between 0.𝑎1 𝑎2 𝑎3 … and 0.𝑏1 𝑏2 𝑏3 … as strings of digits is that every occurrence of the digit 2 in the former is the digit 1 in the latter, and vice versa. Since this function is an onto mapping from 𝐶 into the unit interval 𝐼, 𝐶’s cardinality is at least as large as the set [0, 1]. Since 𝐶 ⊂ [0, 1], these two cardinalities are the same. Showing the last statement of the proposition is Exercise 3a, using a famous Cantor diagonalization argument. The recursion present in the Cantor set gives rise to beautiful patterns when rendered in more than one dimension. Any Cantor-like recursive structure—in which we continually split an item into two items—describes what is called a binary tree, a graph discussed in Chapter IV. We illustrate this tree growth in Figure 5, where an upright stem gives rise to two branches of shorter length inclined away from a continuation of the stem by a given angle. Each of these sub-branches gives rise to two new branches,

308

Strand IX: The Cantor Set

a. Level 1.

b. Level 2.

c. Level 3.

d. Level 4.

Figure 5. Budding levels of a binary tree.

Figure 6. A level-ten binary tree. and so on. When we repeat this budding sequence ten times, the tree of Figure 6 appears.3

A reality check∗ Consider the mind experiment of throwing a dart at a unit interval 𝐼 dart board. When throwing any dart, suppose it lands at some random number in 𝐼. In particular, for any subinterval 𝐽 of 𝐼, suppose that the probability of the dart landing in 𝐽 is the length of 𝐽. By Definition 4, intuition suggests that the dart should land in the Cantor set with probability zero. By Proposition 6 and Exercise 3a, the cardinality of the Cantor set exceeds that of the set of rational numbers ℚ[0, 1] in 𝐼. Furthermore, Exercise 3b shows that ℚ[0, 1] is a set of outer measure zero; that is, for any small positive number 𝜖 it is a subset of a countable4 collection of open intervals where the sum of the lengths of the intervals is less than 𝜖. That is, more formally, it appears that the dart should land in ℚ[0, 1] with probability zero. 3 See

Code 16 of Appendix III for an example of how to generate these fractal trees with a CAS. set is countably infinite if it has the same number of elements as ℤ+ . A set is countable if it either has a finite number of elements or is countably infinite. 4A

A reality check∗

309

To echo the introduction to this strand, when reporting numerical information such as age, height, or weight, we tend to use integers or simple fractions. This habit might persuade the typical person in the street that simple fractions make up a goodly percentage of the set of all real numbers. But as the Cantor set shows us, simple fractions are rare. Yet the beauty of simple fractions is their simplicity. To that end, given an irrational number 𝜔 and any small positive number 𝜖, we would like to find the simplest fraction for which the distance between 𝜔 and that fraction is less than 𝜖. This next chapter explores several algorithms that solve this very natural problem.

Chapter IX: Continued Fractions The lore and literature of continued fractions is long and storied.5 “Lifetimes could be devoted to their study” [40, p. 98]. This book is but an introduction. Given a positive irrational number 𝜔, we outline two different ways to obtain rational number approximations, called convergents, to 𝜔 using continued fractions. One way is to focus on a sequence of remainder terms 𝑟𝑖 (defined below) without explicitly involving the convergents, which we call a local approach. A second way is to focus on the convergents without explicitly involving a remainder, which we call a global approach. Although the former method is the approach to continued fractions taken in many introductory number theory texts, we pursue the latter approach at length to highlight applications of both the Euclidean algorithm for the greatest common divisor of Chapter III and the SternBrocot tree of Chapter IV. In particular we consider several specific convergentfocused algorithms, contrast them, demonstrate how the harmonic algorithm of Chapter VII relates to them, and then give some applications of continued fractions.

A local approach to continued fractions As we define a continued fraction, we first of all restrict our attention to continued fractions of irrational numbers. We do so for the sake of simplicity, even though continued fraction results for irrationals largely hold for rationals. As one author put it, “It requires disproportionate circumstantiality to formulate the results [for rationals] since ambiguities arise there” [13]. Definition 7: Ordinary continued fractions. We say that the list of partial denominators [𝑛0 ; 𝜖1 𝑛1 , 𝜖2 𝑛2 , 𝜖3 𝑛3 , …] is an ordinary continued fraction where 𝑛0 is an integer and, for each positive integer 𝑖, 𝑛𝑖 is a positive integer and 𝜖𝑖 is either 1 or −1. Each list of the first 𝑖 + 1 partial denominators, sometimes called 5 The term continued fraction was first coined by John Wallis in his 1653 Arithmetica Infinitorum.

311

312

Chapter IX: Continued Fractions

a section of the continued fraction, evaluates6 to convergent 𝑖, denoted by 𝐶𝑖 , for all 𝑖 ≥ 0: 𝜖 𝜖1 𝜖1 , …. 𝐶0 = 𝑛0 , 𝐶1 = 𝑛0 + 1 , 𝐶2 = 𝑛0 + 𝜖2 𝜖2 , 𝐶3 = 𝑛0 + 𝑛1 𝑛1 + 𝑛1 + 𝜖3 𝑛2 +

𝑛2

𝑛3

When the sequence 𝐶𝑖 converges to a number 𝜔, we write 𝜔 = [𝑛0 ; 𝜖1 𝑛1 , 𝜖2 𝑛2 , 𝜖3 𝑛3 , . …]. At each stage in this sequence of convergents there is a remainder term 𝑟𝑖 where 𝑟𝑖 is a real number satisfying the equations 𝜖1 𝜖1 , 𝜔 = 𝑛0 + 𝜔 = 𝑛0 + 𝑟1 , 𝜔 = 𝑛0 + , …. 𝜖 𝑛1 + 𝑟2 𝑛1 + 2 𝑛2 +𝑟3

Sometimes we refer to the remainder 𝑟𝑖 as an error term. Finally, we say that the representation of a continued fraction is customary if at each stage, 𝑛𝑖 is chosen so that 𝑟𝑖+1 has magnitude less than 1. A convergent is customary if it is the convergent of a customary continued fraction representation. The next example explores a divergent continued fraction. Example 8: A wild continued fraction. Let 𝐴 be the ordinary continued fraction 𝐴 = [0; 2, −1, −2, 1, 2, −1, −2, 1, …] = [0; 2, −1, −2, 1 ]. This innocent-looking continued fraction’s first three convergents are 𝐶0 = 0, 1 1 𝐶1 = , and 𝐶2 = −1 = 1. But 2

2+

1

𝐶3 = where we identify

1 0

1 2+

−1 1+

−1 2

=

1 2+

−1 1 2

=

1 1 = = △, 2−2 0

with the symbol △. The list of 𝐴’s convergents is a progres-

sion of half-steps to infinity interrupted every fourth time with a hiccup of △ eradicating the (missing) terms 3 + 4𝑖: 1 5 9 13 0, , 1, △, 2, , 3, △, 4, , 5, △, 6, , 7, … , 2 2 2 2 where 𝑖 ∈ ℕ. ♢ After considering Example 8, one might think that using positive and negative partial denominators is a poor idea. How can we guarantee convergence of a continued fraction with such ingredients? Yet we can salvage the idea by sometimes disallowing 1 and −1 as partial denominators. Towards that end we make the following definition. 6 Note

that some convergents may be undefined due to division by 0.

A local approach to continued fractions

313

Definition 9: Tame continued fractions. We say that an ordinary continued fraction [𝑛0 ; 𝜖1 𝑛1 , 𝜖2 𝑛2 , 𝜖3 𝑛3 , …] is a tame continued fraction if either 𝑛𝑖 ≠ 1 for all integers 𝑖 ≥ 1 or 𝜖𝑖 = 1 for all 𝑖 ≥ 1. Unless specified otherwise, all continued fractions for the rest of this book are tame. Tame continued fractions have two great properties: All of their convergents are fractions, as shown in the next proposition, and each tame continued fraction converges to some real number, as shown in Proposition 20. Furthermore, in Definition 21, we characterize six continued fraction algorithms that generate (tame) continued fraction representations for any given irrational number. Proposition 10: Existence of convergents for tame continued fractions. Let 𝐶𝑘 be convergent 𝑘 for some tame continued fraction, 𝑘 ∈ ℕ. Then 𝐶𝑘 ∈ ℚ for all 𝑘. Proof. Note that 𝐶0 and 𝐶1 are in ℚ for any ordinary continued fraction. For any tame continued fraction [𝑛0 ; 𝜖1 𝑛1 , 𝜖2 𝑛2 , 𝜖3 𝑛3 , …], for each integer 𝑚 > 1 let 𝜖2 . 𝐷𝑚 = 𝑛1 + 𝜖3 𝑛2 + 𝑛3 + 𝜖𝑚 ⋱𝑛 ) 𝑚−1 + ( 𝑛𝑚 We will show that 𝐷𝑚 > 1 for all 𝑚 > 1, which will imply that 𝐶𝑚 = 𝑛0 + 𝜖1 /𝐷𝑚 is in ℚ. Observe that 𝐷𝑚 > 1 for all tame continued fractions whose partial denominators are all positive, for all 𝑚 > 1. Next, consider any tame continued fraction whose partial denominators 𝜖𝑖 𝑛𝑖 1 1 1 𝜖 have magnitude at least 2, 𝑖 ≥ 1. Since ≤ , we have 𝑛1 + 2 ≥ 𝑛1 − ≥ 2−

𝑛2

1

2

𝑛2

2

> 1. Thus 𝐷2 > 1 for all tame continued fractions. 2 Suppose that 𝐷𝑚 ≤ 1 for some 𝑚 > 2 for some tame continued fraction. Then among the tame continued fractions there is a continued fraction, call it [𝑛0 ; 𝜖1 𝑛1 , 𝜖2 𝑛2 , 𝜖3 𝑛3 , …], for which 𝐷𝑚 ≤ 1 where 𝑚 is as small as possible; call it 𝑚0 . We know that 𝑚0 > 2 and that 𝐷𝑗 > 1 for all tame continued fractions with 1 ≤ 𝑗 < 𝑚0 . By our inductive hypothesis, 𝜖3 > 1. 𝑛2 + 𝑛3 + 𝜖𝑚0 ⋱𝑛 ) 𝑚0 −1 + ( 𝑛𝑚0 So 𝑛1 +

𝜖2 𝑛2 +

𝑛3 +

𝜖3

𝜖𝑚0 ⋱𝑛 ) 𝑚0 −1 + ( 𝑛𝑚0

> 𝑛1 − 1 ≥ 1,

314

Chapter IX: Continued Fractions

a contradiction. Thus 𝐷𝑚 > 1 for all 𝑚. Puzzle 11: A repeating list. What number is represented by the continued fraction 𝜔 = [3; −5, 3, −5, 3, −5, …]? The first few convergents 𝐶𝑖 and their decimal equivalents are

𝐶0 = 3, 𝐶1 = 3 −

𝐶4 = 3 −

1 1 ≈ 2.8, 𝐶2 = 3 − 5 5+ 1

5+

≈ 2.81328,

1 3−

1

≈ 2.8125, 𝐶3 = 3 −

3

𝐶5 = 3 −

1 5+

1 3

1 5+

1 5+

1 5+

1 1 5

3−

If 𝜔 is a real number, then 𝜔 must satisfy the identity 3 −

1 5+

1 𝜔

this equation gives 5𝜔2 − 13𝜔 − 3 = 0, whose solution is 𝜔 = must be positive, 𝜔 =

13+√229 10

≈ 2.8133,

1 3− 5

≈ 2.81327.

1 3−

1

= 𝜔. Simplifying 13±√229 10

. Since 𝜔 ♢

≈ 2.81327.

In Lemma 12 we discover that the successive remainder terms 𝑟𝑖 introduced in Definition 7 are recursively related. Further, sgn(𝑟𝑖 ) = 𝜖𝑖 for all 𝑖 ∈ ℤ+ . Lemma 12: A remainder algorithm. Let 𝜔 be a positive irrational number with partial denominator list 𝜔 = [𝑛0 ; 𝜖1 𝑛1 , 𝜖2 𝑛2 , …], where 𝑛0 is either the floor or the ceiling of 𝜔 and 𝑟1 = 𝜔 − 𝑛0 . For each 𝑖 ≥ 1, the remainder terms 𝑟𝑖 satisfy the recursive relation 𝜖 𝑟𝑖+1 = 𝑖 − 𝑛𝑖 . (1) 𝑟𝑖 Furthermore, if the list of partial denominators is customary, then 𝑛𝑖 is either the 𝜖 floor or the ceiling of 𝑖 and 𝜖𝑖 = sgn(𝑟𝑖 ). 𝑟𝑖

Proof. By Definition 7, 𝑟𝑖 =

𝜖𝑖 𝑛𝑖 +𝑟𝑖+1

. Solving this equation for 𝑟𝑖+1 yields Equation

(1). If the list of partial denominators is customary, then 𝑟𝑖+1 has magnitude less 𝜖 than 1, and so 𝑛𝑖 must be either the floor or the ceiling of 𝑖 . Furthermore, for 𝑟𝑖

customary continued fractions, since 𝑛𝑖 + 𝑟𝑖+1 must be positive (and since 𝜖𝑖 is either 1 or −1), 𝜖𝑖 = sgn(𝑟𝑖 ). Example 13: Puzzle 11 revisited. We show that the list of partial denominators for 𝜔 =

13+√229 10

= [3; −5, 3, −5, …] is customary.

A local approach to continued fractions

315 R

L

n=4

n=3 3

4

22 7 εn = 15

εn=16

333 106

εn = −1

εn = 8

εn = 7

25 8 εn= −1 εn = −2

355 113

47 15

22 7

εn = −2

3

εn= −1 εn=7

εn=6

25 8

22 7

7 2

εn = −2 10 3

3

Figure 7. A tree of the first few possible customary convergents for 𝜋, Example 14.

From the list of partial denominators, we know that 𝜖1 = −1 = 𝜖3 and 𝜖2 = 1. By Lemma 12, 𝑟1 = 𝜔 − 3 = 𝑟2 =

√229−17 10

≈ −0.186. Thus |𝑟1 | < 1. By the lemma again,

√229 − 13 𝜖1 17 + √229 −5= −5= ≈ 0.355. 𝑟1 6 6

So |𝑟2 | < 1. By the lemma again, 𝑟3 =

𝜖2 𝑟2

−3=

13+√229 10

−3=

√229−17

the remainders 𝑟𝑖 alternate, and all have magnitude less than 1.

10

= 𝑟1 . Thus ♢

Example 14: Customary convergents to 𝜋. Within the realm of ordinary continued fractions, what are all the possible first few customary convergents for 𝜋? In working through this example, at each stage we have two choices for the 𝜖 next partial denominator 𝑛𝑖 corresponding to taking the floor or the ceiling of 𝑖 as described in Lemma 12.

𝑟𝑖

Step 0: By the lemma, we have two choices for 𝑛0 . Call them 𝑛(𝐿) = 3 and 𝑛(𝑅) = 4 (𝐿 for left and 𝑅 for right). These are also the 𝐶0 convergents; call them 𝐶(𝐿) = 3 and 𝐶(𝑅) = 4. The first remainders—the differences 𝑟𝑖 between 𝜋 and 𝑛0 —are 𝑟(𝐿) = 𝜋 − 3 ≈ 0.14 and 𝑟(𝑅) = 𝜋 − 4 ≈ −0.86. Call the sign of these 𝑟1 remainders 𝜖(𝐿) = 1 and 𝜖(𝑅) = −1.

316

Chapter IX: Continued Fractions

Step 1: The magnitudes of the reciprocals of the two 𝑟1 remainders are 𝜖(𝐿) 1 1 = ≈ ≈ 7.063 𝜋 − 3 0.14 𝑟(𝐿)

𝜖(𝑅) −1 1 = ≈ = 1.165. 𝜋 − 4 0.86 𝑟(𝑅)

and

We have four choices for 𝑛1 . Call them 𝑛(𝐿𝐿) = 7, 𝑛(𝐿𝑅) = 8, 𝑛(𝑅𝐿) = 1 and 𝑛(𝑅𝑅) = 2. Thus our four different first convergents 𝐶1 for 𝜋 are 𝜖(𝐿)

• 𝐶(𝐿𝐿) = 𝑛(𝐿) +

𝑛(𝐿𝐿)

• 𝐶(𝐿𝑅) = 𝑛(𝐿) + • 𝐶(𝑅𝐿) = 𝑛(𝑅) + • 𝐶(𝑅𝑅) = 𝑛(𝑅) +

𝜖(𝐿) 𝑛(𝐿𝑅) 𝜖(𝑅) 𝑛(𝑅𝐿) 𝜖(𝑅) 𝑛(𝑅𝑅)

1

=3+ =3+ =4+

=4−

22

=

7 1

7

=

8

−1 1

8

= 3.125,

= 3,

1

=

2

25

≈ 3.14286,

7 2

= 3.5.

Let 𝑢 be a string of 𝐿’s and 𝑅’s, and let 𝑆 be either 𝐿 or 𝑅. In general, by Lemma 12, the remainder 𝑟(𝑢𝑆) is 𝑟(𝑢𝑆) =

𝜖(𝑢) − 𝑛(𝑢𝑆). 𝑟(𝑢)

(2)

The next 𝑟2 remainders are 𝑟(𝐿𝐿) ≈ 0.0625, 𝑟(𝐿𝑅) ≈ −0.937, 𝑟(𝑅𝐿) ≈ 0.165, and 𝑟(𝑅𝑅) ≈ −0.835. So 𝜖(𝐿𝐿) = 1, 𝜖(𝐿𝑅) = −1, 𝜖(𝑅𝐿) = 1, and 𝜖(𝑅𝑅) = −1. Step 2: As in Step 1, we need the magnitudes of the reciprocals of the remainders. By Equation (2), we have 𝜖(𝐿𝐿) ≈ 15.997, 𝑟(𝐿𝐿)

𝜖(𝐿𝑅) ≈ 1.067, 𝑟(𝐿𝑅)

𝜖(𝑅𝐿) ≈ 6.0625, 𝑟(𝑅𝐿)

𝜖(𝑅𝑅) ≈ 1.198. 𝑟(𝑅𝑅)

Taking the floor or ceiling of each of these four remainders gives eight possible 𝑛2 partial denominators: 𝑛(𝐿𝐿𝐿) = 15, 𝑛(𝑅𝐿𝐿) = 6,

𝑛(𝐿𝐿𝑅) = 16, 𝑛(𝐿𝑅𝐿) = 1, 𝑛(𝑅𝐿𝑅) = 7, 𝑛(𝑅𝑅𝐿) = 1,

𝑛(𝐿𝑅𝑅) = 2, 𝑛(𝑅𝑅𝑅) = 2.

The first of these eight integers, 𝑛(𝐿𝐿𝐿), generates the 𝐶2 convergent 𝜖(𝐿)

𝐶(𝐿𝐿𝐿) = 𝑛(𝐿) +

𝜖(𝐿𝐿)

𝑛(𝐿𝐿) +

=3+

𝑛(𝐿𝐿𝐿)

1 7+

1

=

333 . 106

15

Similarly, the eight 𝐶2 approximations are, respectively, 𝐶(𝐿𝐿𝐿) = 𝐶(𝑅𝐿𝐿) =

333 106 22 7

, 𝐶(𝐿𝐿𝑅) =

,

𝐶(𝑅𝐿𝑅) =

355 113 25 8

, 𝐶(𝐿𝑅𝐿) =

,

22

,

𝐶(𝐿𝑅𝑅) =

𝐶(𝑅𝑅𝐿) = 3,

𝐶(𝑅𝑅𝑅) =

7

47 15 10 3

, .

Finally, these first few convergents for 𝜋 are displayed in the tree of Figure 7.

♢

A local approach to continued fractions

317

How do we decide which way to proceed down the binary tree of convergents for any given positive irrational number 𝜔? The oldest, and perhaps simplest, method, which we refer to as the regular continued fraction,7 is to take the floor 𝜖 of 𝑖 . We illustrate this algorithm in the next example (the same algorithm as was 𝑟𝑖

used in Example II.4, p. 38, and Example VI.31, p. 195). The reader may contrast its results with those of Puzzle V.6, p. 152. Example 15: A local approach to a regular continued fraction.8 We illustrate the dynamics of generating partial denominators for Euler’s constant, 𝛾 ≈ 0.5772156649, a number introduced in Proposition V.4. At each stage in 𝜖

generating the next partial denominator, we let 𝑛𝑖 = ⌊ 𝑖 ⌋ as in Lemma 12. Ap𝑟𝑖

plying this decision rule gives 𝜖𝑖 = 1 for all positive integers 𝑖. At each step, we box the partial denominator. Step 0: In the context of the lemma, 𝜔 = 𝛾. Thus 𝑛0 = ⌊𝛾⌋ = 0 . Then 𝑟1 = 𝛾 − 𝑛0 = 𝛾. So 𝐶0 = 0. 1

Step 1: Intuition might suggest that 𝑛1 is 2 because is the unitary fraction 2 nearest 𝛾. Instead, we follow the rule of applying the floor function and let 1

1

𝑟1

𝑟1 1

𝑛1 = ⌊ ⌋ = ⌊1.732 …⌋ = 1 . Let 𝑟2 = convergent 𝐶1 for 𝛾 is therefore 𝐶1 = 0 +

1

− 𝑛1 ≈ 0.732. The first regular = 1.

1

1

𝑟2 1

𝑟2

Step 2: Let 𝑛2 = ⌊ ⌋ = ⌊1.365 …⌋ = 1 , so that 𝑟3 = 𝐶2 = 0 +

1 1+

1 1

=

2

− 𝑛2 ≈ 0.365. So

. At this point the user of this algorithm might be some-

what disappointed—because our intuition had already suggested this result, probably at step 0. But, patience! Also realize that speedier continued fraction algorithms exist (such as the algorithm introduced in Puzzle V.6). 1

1

𝑟3

𝑟3

Step 3: Let 𝑛3 = ⌊ ⌋ = ⌊2.73⌋ = 2 , so that 𝑟4 = 0+

3

1 1+

= .

1 1+

− 𝑛3 ≈ 0.73. Thus 𝐶3 =

1 2

5

Step 4+ : Continuing this process, we generate 𝛾 = [0; 1, 1, 2, 1, 2, 1, 4, 3, 13, 5, 1, …]𝑅 7 Richard Guy (b. 1916), in a talk at the Joint Mathematical Meetings on 9 January 2013 in San Diego, coined the terms reg and neg for the regular continued fraction and the negative continued fraction algorithms. 8 We treat 𝛾 as an irrational number.

318

Chapter IX: Continued Fractions

where the subscript 𝑅 denotes the regular continued fraction which we recognize as a simple continued fraction. The first few regular convergents for 𝛾 are 1 3 4 11 15 71 228 (3) 0, 1, , , , , , , , …. 2 5 7 18 26 123 395 ♢

A global approach to continued fractions In order to harness the power of the Stern-Brocot tree structure9 developed in Chapter IV and to apply our skill in using recursion as practiced in Chapter VI, we change our focus from local to global. In particular, to generate a positive irrational number 𝜔’s successive convergents, we will use a second-order recursion with respect to its convergents. As useful notation to begin this recursive process, the expressions 𝐶−1 and 𝐶−2 are referred to as the pre-initial convergents for 𝜔. To understand the notation of this next proposition, review the Chapter IV material on general mediants. Proposition 16: From a local to a global approach. Let 𝜔 be a positive irrational number whose customary and tame partial denominators are given by 𝜔 = [𝑛0 ; 𝜖1 𝑛1 , 𝜖2 𝑛2 , 𝜖3 𝑛3 , …], and let 𝑝−2 = 0, 𝑞−2 = 1, 𝑝−1 = 1, 𝑞−1 = 0, 𝑝𝑘 = 𝑛𝑘 𝑝𝑘−1 + 𝜖𝑘 𝑝𝑘−2 , 𝑞𝑘 = 𝑛𝑘 𝑞𝑘−1 + 𝜖𝑘 𝑞𝑘−2 ,

(4) when 𝑘 ≥ 0.

Then for all integers 𝑘, 𝑘 ≥ −2, the convergents for 𝜔 are 𝐶𝑘 =

𝑝𝑘 𝑞𝑘

.

Proof. Observe that the preconvergent 𝐶−1 is our place-holder △. Since the initial partial denominator is 𝑛0 , 𝐶0 = 𝑛0 𝐶−1 ⊕ 𝐶−2 = 𝑛0 . By Definition 7, 𝑛 𝑝 + 𝜖1 ⋅ 1 𝑝 𝑛 𝑝 + 𝜖1 𝑝−1 𝑛 𝑛 + 𝜖1 𝜖 = 1 0 = 1. 𝐶1 = 𝑛0 + 1 = 1 0 = 1 0 𝑛1 𝑛1 𝑛1 ⋅ 1 + 𝜖1 ⋅ 0 𝑛1 𝑞0 + 𝜖1 𝑞−1 𝑞1 Thus the proposition is true for 𝑘 = 1. In terms of the general mediant notation of Chapter IV, this equation is 𝐶1 = 𝑛1 𝐶0 ⊕ 𝜖1 𝐶−1 = 𝜖1 𝑛1 𝐶0 ⊕ 𝐶−1 because

𝑝1 𝑞1

=

−𝑝1 −𝑞1

.

Next, suppose that 𝐶𝑘 = 𝑛𝑘 𝐶𝑘−1 ⊕ 𝜖𝑘 𝐶𝑘−2 is true for 𝑘 ≥ 1. We must show that 𝐶𝑘+1 = 𝑛𝑘+1 𝐶𝑘 ⊕ 𝜖𝑘+1 𝐶𝑘−1 . Observe that 9 Although we constructed the Stern-Brocot tree with respect to fractions between 0 and 1, a similar construction occurs for each interval [𝑛, 𝑛 + 1] for all 𝑛 ∈ ℤ. Alternatively, given a positive irrational number 𝜔, we can find continued fraction representations for 𝜔 − ⌊𝜔⌋ and then add ⌊𝜔⌋ to those representations.

A global approach to continued fractions

319

𝐶𝑘+1 = [𝑛0 ; 𝜖1 𝑛1 , 𝜖2 𝑛2 , … , 𝜖𝑘+1 𝑛𝑘+1 ] 𝜖1 = 𝑛0 + 𝜖2 𝑛1 + 𝑛2 + ⋱𝑛 𝑘−1 + (

𝜖𝑘 ) 𝜖 𝑛𝑘 + 𝑘+1 𝑛𝑘+1 𝜖 = [𝑛0 ; 𝜖1 𝑛1 , 𝜖2 𝑛2 , … , 𝜖𝑘−1 𝑛𝑘−1 , 𝜖𝑘 (𝑛𝑘 + 𝑘+1 )] 𝑛𝑘+1 𝜖 = (𝑛𝑘 + 𝑘+1 )𝐶𝑘−1 ⊕ 𝜖𝑘 𝐶𝑘−2 𝑛𝑘+1 𝜖 (𝑛𝑘 + 𝑘+1 ) 𝑝𝑘−1 + 𝜖𝑘 𝑝𝑘−2 𝑛 (𝑛 𝑝 + 𝜖𝑘 𝑝𝑘−2 ) + 𝜖𝑘+1 𝑝𝑘−1 𝑛𝑘+1 = 𝑘+1 𝑘 𝑘−1 = 𝜖𝑘+1 𝑛 (𝑛 𝑞 𝑘+1 𝑘 𝑘−1 + 𝜖𝑘 𝑞𝑘−2 ) + 𝜖𝑘+1 𝑞𝑘−1 ) 𝑞𝑘−1 + 𝜖𝑘 𝑞𝑘−2 (𝑛𝑘 + 𝑛𝑘+1

=

𝑛𝑘+1 𝑝𝑘 + 𝜖𝑘+1 𝑝𝑘−1 𝑝 = 𝑘+1 . 𝑛𝑘+1 𝑞𝑘 + 𝜖𝑘+1 𝑞𝑘−1 𝑞𝑘+1

We refine Proposition 16 into an algorithm for generating successive convergents using the notation of the general mediant from Chapter IV. As 𝜖𝑘 𝑛𝑘 𝐶𝑘−1 ⊕ 𝐶𝑘−2 = 𝑛𝑘 𝐶𝑘−1 ⊕ 𝜖𝑘 𝐶𝑘−2 , we adopt the former representation for the next convergent 𝐶𝑘 rather than the latter because it seems to be a simpler expression (although we must remember that a double negative is a positive). Next, we observe that Proposition 16 implies that successive convergents of tame continued fractions are adjacent Farey fractions. Lemma 17: A Euclidean relation between successive convergents. Let 𝜔 be a positive irrational number with a tame continued fraction representation. Let 𝐶𝑘 be the convergents of 𝜔 for all 𝑘 ≥ −2. 𝐶0 is either ⌊𝜔⌋ or ⌈𝜔⌉. For all 𝑎 𝑐 𝑘 ≥ −1, with 𝐶𝑘 = and 𝐶𝑘−1 = , then 𝑏

𝑑

𝑎𝑑 − 𝑏𝑐 = ±1. That is, 𝐶𝑘−1 and 𝐶𝑘 are adjacent fractions in some Farey sequence for all 𝑘 ≥ 1. Proof. Solving 𝜔 = 𝑠𝐶−1 ⊕ 𝐶−2 = 𝑠 means that 𝐶0 is either either ⌊𝜔⌋ or ⌈𝜔⌉. Observe that 1 ⋅ 1 − 0 ⋅ 0 = 1 and 𝑛0 ⋅ 0 − 1 ⋅ 1 = −1. So the lemma is true when 𝑘 is −1 and 0. 𝑝 Assume that the lemma is true for some integer 𝑘 ≥ 0. Let 𝐶𝑘+1 = . We 𝑞

must show that 𝑝𝑏 − 𝑞𝑎 = ±1. We know that there exists a nonzero integer 𝑚 with 𝐶𝑘+1 = 𝑚𝐶𝑘 ⊕ 𝐶𝑘−1 . Thus, 𝑝 = 𝑎𝑚 + 𝑐 and 𝑞 = 𝑏𝑚 + 𝑑. Observe that 𝑝𝑏 − 𝑞𝑎 = (𝑎𝑚 + 𝑐)𝑏 − (𝑏𝑚 + 𝑑)𝑎 = 𝑎𝑏𝑚 + 𝑏𝑐 − 𝑎𝑏𝑚 − 𝑎𝑑 = 𝑏𝑐 − 𝑎𝑑 = ±1. By Proposition IV.15, successive convergents are adjacent Farey fractions.

320

Chapter IX: Continued Fractions

For a given irrational number 𝜔, the next lemma recursively produces partial denominators and convergents rather than starting with a list of partial denominators and then finding the convergents. Lemma 18: The global algorithm. Let 𝜔 be a positive irrational number. Let 𝑘 > 1 and assume that partial denominators 𝜖𝑖 𝑛𝑖 and convergents 𝐶𝑖 for 𝜔 are 𝑐 𝑎 known for −1 ≤ 𝑖 < 𝑘. Let 𝐶𝑘−1 = and 𝐶𝑘−2 = . The next convergent 𝐶𝑘 is 𝑏

𝑑

𝐶𝑘 = 𝜖𝑘 𝑛𝑘 𝐶𝑘−1 ⊕ 𝐶𝑘−2 , where 𝑛𝑘 is either the floor or the ceiling of 𝜖𝑘 𝑠𝑘 and 𝜖𝑘 = sgn(𝑠𝑘 ) with 𝑠𝑘 =

𝑐 − 𝜔𝑑 . 𝜔𝑏 − 𝑎

(5)

Proof. Let 𝑟𝑘 be the remainder terms as in Lemma 12. Let 𝑠𝑘 be the reciprocal of 𝑟𝑘 . Once we extend the definition of partial denominators in Definition 7 to include real numbers, by Proposition 16 we have 𝜔 = [𝑛0 ; 𝜖1 𝑛1 , 𝜖2 𝑛2 , … , 𝜖𝑘−1 𝑛𝑘−1 ,

𝑠 𝑎+𝑐 1 1 , ] = 𝐶𝑘−1 ⊕ 𝐶𝑘−2 = 𝑠𝑘 𝐶𝑘−1 ⊕ 𝐶𝑘−2 = 𝑘 𝑟𝑘 𝑟𝑘 𝑠𝑘 𝑏 + 𝑑

where 𝑠𝑘 = 𝑠 is the solution to 𝜔= which is 𝑠 =

𝑐−𝜔𝑑 𝜔𝑏−𝑎

𝑠𝑎 + 𝑐 , 𝑠𝑏 + 𝑑

. By Lemmas 12 and 17, this lemma is true.

Now we rework Example 15 in terms of this lemma. Example 19: Revisiting Example 15—a global approach to a regular convergent. This time, to approximate 𝛾 in accordance with the regular continued fraction algorithm, our partial denominators 𝑛𝑘 will be ⌊𝑠𝑘 ⌋, where 𝑠𝑘 is given by 1 0 Equation (5). By Proposition 16, 𝐶−1 = and 𝐶0 = ⌊𝛾⌋ = 0 = . Let 𝑎 = 0, 𝑏 = 1, 0 1 𝑐 = 1, 𝑑 = 0, and 𝑘 = 1, in the notation of the proposition. As in Example 15, 𝜖𝑘 = 1 for all positive integers 𝑘. At each step we box the partial denominator. Step 1: By Lemma 18, 𝑠1 = 𝑛1 = 1 and 𝐶1 = Step 2: With 𝐶1 = and 𝐶2 =

𝑝2 𝑞2

=

𝑎

𝑝1 𝑞1

=

= 1

0+1⋅1

=

1−𝛾⋅0 𝛾⋅1−0

=

1 𝛾

≈ 1.732. The floor of 𝑠1 is 1, so

1

= . So 𝑝1 = 1 and 𝑞1 = 1. 1

and 𝐶0 =

𝑏 1 0+1⋅1 1 1+1⋅1

𝑐−𝛾𝑑

𝛾𝑏−𝑎 1+1⋅0

𝑐 𝑑

0

−𝛾

1

𝛾−1

= , we have 𝑠2 =

= . So 𝑝2 = 1 and 𝑞2 = 2. 2

≈ 1.365, 𝑛2 = 1 ,

A global approach to continued fractions

321

B = C ⊕ (−A) C= B ⊕ (−A) B is a parent of A B= c C= vu d A=B⊕C B is the father of A ⊕ B C is a parent of A A= a b A⊕B A is the mother of A ⊕ B, A⊕2B

R

A⊕3B A⊕4B

A ⊕ B = C ⊕ (−2A)

3A⊕5B 3A⊕4B

C ⊕ (−3A) C ⊕ (−4A)

2B⊕3A

2A⊕3B

2A⊕5B

B⊕2A

3B⊕4A

3B⊕5A

B⊕3A 2B⊕5A

C ⊕ (−5A) B⊕4A

Figure 8. General mediants on a subtree of the Stern-Brocot tree, same as Figure IV.9. Step 3: With 𝐶2 = and 𝐶3 =

𝑝3 𝑞3

=

𝑎

=

1

and 𝐶1 =

𝑏 2 1+2⋅1 3

1+2⋅2

𝑐 𝑑

1

1−𝛾⋅1

1

𝛾⋅2−1

= , we have 𝑠3 =

≈ 2.73, 𝑛3 = 2 ,

= . So 𝑝3 = 3 and 𝑞3 = 5. 5

+

Step 4 : Continuing in this fashion gives the same list of convergents as in Example 15. ♢ Proposition 20: Convergence of tame continued fractions.∗ The tame continued fraction [𝑛0 ; 𝜖1 𝑛1 , 𝜖2 𝑛2 , 𝜖3 𝑛3 , …] converges to some real number. Proof. By Proposition 10 and Lemmas 17 and 18, the convergents, 𝐶0 , 𝐶1 , 𝐶2 , and so on, for this continued fraction exist, and successive convergents are neighbors in some Farey sequence. For each 𝑘 ≥ 1, let 𝐷𝑘 = 𝐶𝑘+1 ⊕ (−𝐶𝑘 ). Because the continued fraction is tame, the denominators of its successive convergents strictly increase, so by Proposition IV.38 we know that 𝐶𝑘 and 𝐷𝑘 are the parents of 𝐶𝑘+1 . Let 𝛼𝑘 be the closed interval between 𝐶𝑘 and 𝐷𝑘 for each 𝑘. Because 𝐶𝑘 and 𝐷𝑘 are the parents of 𝐶𝑘+1 , 𝐶𝑘+1 ∈ 𝛼𝑘 . See Figure 8. Identify 𝐶𝑘+1 with node 𝐴, 𝐶𝑘 with node 𝐵, and 𝐷𝑘 with node 𝐶 in the figure. Observe that 𝐶𝑘+2 = 𝜖𝑘+2 𝑛𝑘+2 𝐶𝑘+1 ⊕ 𝐶𝑘 ∈ 𝛼𝑘 . For the same reason, 𝐶𝑚 ∈ 𝛼𝑘 for all 𝑚 ≥ 𝑘. That is, the convergents of any tame continued fraction go ever down the SternBrocot tree; they never double back on themselves. Note that 𝛼𝑘+1 ⊂ 𝛼𝑘 for all 𝑘 ≥ 1. By Proposition IV.15, the length of 𝛼𝑘 collapses to zero as 𝑘 gets large. Thus a standard analysis theorem guarantees that the intersection of all of these nested closed intervals is a set consisting of one real number, call it 𝜔. Therefore the convergents for this continued fraction converge to 𝜔. So now it makes sense to write 𝜔 = [𝑛0 ; 𝜖1 𝑛1 , 𝜖2 𝑛2 , 𝜖3 𝑛3 , …].

322

Chapter IX: Continued Fractions

A plethora of continued fractions Besides the regular continued fraction algorithm illustrated in Examples 15 and 19, other continued fraction algorithms are available. In Chapter VII, we developed the harmonic algorithm to approximate irrational numbers with rational numbers. We claimed that it is equivalent to a continued fraction algorithm. Which one? The first few harmonic convergents for 𝛾 are 1 4 15 71 , , , . 2 7 26 123 Since this list does not exactly match the first few regular convergents for 𝛾 as given by Equation (3), the harmonic and regular algorithms are not equivalent. To explore other options, we consider the following list. As we saw in Example 14 and Lemma 18, to define a continued fraction algorithm we simply adopt a decision rule on whether the magnitude of the next partial denominator is the floor or ceiling of |𝑠𝑘 | at each stage in our tree of all possible customary convergents. The following list10 is a showcase of various continued fraction algorithms within this general context as defined by their specific decision rules, each referenced by a capital letter. For any given positive irrational number 𝜔, all algorithms except Algorithm 𝑀 generate tame continued fraction representations for 𝜔. 1,

Definition 21: Common decision rules.11 Given a positive irrational number 𝑎 𝑐 𝑐−𝜔𝑑 𝜔, let 𝐶𝑘 be convergent 𝑘 of 𝜔. With 𝐶𝑘−1 = and 𝐶𝑘−2 = , let 𝑠𝑘 = . The 𝑏 𝑑 𝜔𝑏−𝑎 following list, each algorithm labeled with a capital letter, gives various ways of defining 𝜖𝑘 𝑛𝑘 from 𝑠𝑘 . R: As we have seen, the regular continued fraction’s decision rule is 𝑛𝑘 = ⌊|𝑠𝑘 |⌋. As will be shown in Corollary 23, 𝑠𝑘 > 0 for all 𝑘. Thus 𝜖𝑘 = 1 and 𝑛𝑘 = ⌊𝑠𝑘 ⌋. We also refer to this continued fraction as Algorithm 𝑅. N: The negative continued fraction’s decision rule is 𝑛𝑘 = ⌈|𝑠𝑘 |⌉. As will be shown in Corollary 23, 𝑠𝑘 < 0 for all integers 𝑘 ≥ 1. Thus 𝜖𝑘 = −1. We refer to this continued fraction as Algorithm 𝑁. 10 The seven items in this list give an idea of the variety of ways in which the next partial fraction may be chosen. For subsequent chapters of this book, we primarily use only rules 𝑅, 𝑍, and 𝐺. 11 Computer algebra system codes are given in Appendix III that will generate the continued fraction partial denominators and convergents for Algorithms 𝑅, 𝑍, and 𝐺, in Codes 17, 18, and 19, respectively.

A plethora of continued fractions

323

Z: The nearest integer continued fraction’s decision rule is 𝑛𝑘 = [|𝑠𝑘 |] 1

or, equivalently, 𝑛𝑘 = ⌈|𝑠𝑘 | − ⌉ sometimes referred to as the arithmetic mean 2 rule.12 This nearest integer continued fraction, or NICF, was first studied by Adolf Hurwitz (1859–1919) in the late nineteenth century [74]. We also refer to NICF as Algorithm 𝑍 because ℤ is used to denote the set of integers. S: Let 𝜎 = is

3−√5 2

≈ 0.382, sometimes called a silver mean. The silver mean rule 𝑛𝑘 = ⌈|𝑠𝑘 | − 𝜎⌉.

So if 𝑠𝑘 ≈ 4.31, then 𝑛𝑘 = 4. But if 𝑠𝑘 ≈ 4.39, then 𝑛𝑘 = 5. This singular continued fraction, or SCF, was also introduced by Hurwitz. He reasoned that, all other things being equal, if we must choose between two candidates for office—such as the center of a basketball team—we often tend to favor the taller one. Thus the silver mean rule favors choosing ⌈|𝑠𝑘 |⌉ over ⌊|𝑠𝑘 |⌋. Furthermore, as will be shown in Proposition 22, choosing ⌈|𝑠𝑘 |⌉ always results in 𝐶𝑘+1 being closer to 𝜔 than is 𝐶𝑘 . G: Let 𝑃 = 𝜖𝑘 ⌊|𝑠𝑘 |⌋𝐶𝑘−1 ⊕ 𝐶𝑘−2 and 𝑄 = 𝜖𝑘 ⌈|𝑠𝑘 |⌉𝐶𝑘−1 ⊕ 𝐶𝑘−2 . A greedy decision rule is to choose ⌊|𝑠𝑘 |⌋ if |𝑃 − 𝜔| < |𝑄 − 𝜔|, 𝑛𝑘 = { ⌈|𝑠𝑘 |⌉ otherwise. The algorithm is called greedy because we are choosing the option yielding the 𝐶𝑘 convergent nearer to 𝜔. An elegant way to apply this greedy rule is given in Proposition 26. O: Let 𝑃 and 𝑄 be defined as in Algorithm 𝐺. As introduced in Definition 10, 1 𝑝 𝑝 a good rational approximation13 for 𝜔 is one where |||𝜔 − ||| < 2 , where 𝑝 𝑞

and 𝑞 are positive integers. The fraction ||𝜔 − 𝑝 || < | 𝑞|

1 2𝑞

𝑝 𝑞

𝑞

𝑞

is a really-good approximation if

. With this in mind, an optimal decision rule is to choose 2

𝑛𝑘 = {

⌊|𝑠𝑘 |⌋ ⌈|𝑠𝑘 |⌉

if 𝑃 is a really good approximation to 𝜔, otherwise.

This optimal continued fraction rule was introduced by Wieb Bosma in 1987 [14]. By slightly modifying the proof of Corollary 23, as the reader may show, 12 An alternate, yet equivalent, rule is 𝑛 𝑘 1

|𝑠𝑘 | = 𝑚 +

1

= ⌈|𝑠𝑘 |− |⌉. The beauty of this alternate rule is that when 2

1

for some integer 𝑚, the value of 𝑛𝑘 is 𝑚, whereas the value of [𝑚 + ] is ambiguous. 2 1 13 See Exercise VI.2 for fractions 𝑝 approximating the golden mean 𝜙 where ||𝜙 − 𝑝 || < . | √5𝑞2 𝑞 𝑞| 2

324

Chapter IX: Continued Fractions

y c

(0, d )

s= −2

(−1, c−a ) d−b

s= −3 s = −4

y=ω y=

s=1 s= 2

a b

(−1, c−a ) d−b

s-axis

a. ω between ab and cd.

diants of

𝑏

c d

(0, )

s=1

a

y= b y=ω s= 2 s-axis

b. ab and cd on the same side of ω.

Figure 9. The hyperbola 𝑦 = 𝑎

s = −2

y

𝑐

𝑐+𝑎𝑠

adorned with general me-

𝑑+𝑏𝑠

and . 𝑑

at each stage 𝑘 in the continued fraction algorithm, at least one of 𝑃 and 𝑄 is a really-good approximation. Similarly to Proposition 26, Proposition 28 gives an elegant way to apply 𝑂’s decision rule.

M: This next rule is a variation on the regular 𝑅 continued fraction rule. It is called the Minkowski diagonal continued fraction and was introduced by Hermann Minkowski in 1901. Using a trick as given in Equation (8) below and 𝑝 as we demonstrate in Example 24, Algorithm 𝑀 discards any convergent 𝑞

1

generated by Algorithm 𝑅 if 𝑞|𝑞𝜔 − 𝑝| > . 2

For the tameness of each of these continued fraction algorithms, note that 𝑅 generates tame continued fractions. By Proposition 25, an outline in Exercise 4d, and [13], so do 𝐺, 𝑍, 𝑁, 𝑆, and 𝑂. The next proposition outlines an option we can exercise when constructing a continued fraction for a given irrational number. Proposition 22: A closer convergent. Let 𝜔 be a positive irrational number. Suppose that convergents 𝐶𝑖 for 𝜔 have been chosen for all nonnegative integers 𝑖 with 𝑖 < 𝑘, where 𝑘 is a positive integer. Then 𝐶𝑘 can be chosen so that it is closer to 𝜔 than is 𝐶𝑘−1 .

A plethora of continued fractions

325

Proof. With respect to the terminology of Lemma 18, let 𝐶𝑘−1 = 𝑠𝑘 =

𝑐−𝜔𝑑 𝜔𝑏−𝑎 14

, and 𝜖𝑘 = sgn(𝑠𝑘 ). Let 𝑓(𝑠) =

𝑎𝑠+𝑐 𝑏𝑠+𝑑

𝑎 𝑏

, 𝐶𝑘−2 =

𝑐 𝑑

,

. The graph of 𝑦 = 𝑓(𝑠) is a hy-

perbola, as exemplified in Figure 9. If 𝑘 = 1, then the 𝑦-axis is an asymptote 𝑐 1 of the hyperbola, and we identify = with △. Otherwise, the 𝑦 intercept of the hyperbola is at (0, 𝑎

𝑐 𝑑

𝑑

0

). As 𝑠 → ±∞ the graph of the hyperbola approaches 𝑎

𝑐

the asymptote 𝑦 = . If 𝜔 is between and , as illustrated in Figure 9a, then 𝑏 𝑏 𝑑 the hyperbola and the line 𝑦 = 𝜔 intersect at 𝑠 = 𝑠𝑘 where 𝑠𝑘 > 0. So 𝜖𝑘 = 1. 𝑐 𝑎 Then the set { , 𝑓(⌊𝑠𝑘 ⌋), 𝜔, 𝑓(⌈𝑠𝑘 ⌉), } is in either increasing or decreasing order. 𝑑

𝑎

𝑏

Thus, 𝑓(⌈𝑠𝑘 ⌉) is closer to 𝜔 than . So 𝐶𝑘 (which must be either 𝑓(⌊𝑠𝑘 ⌋) or 𝑓(⌈𝑠𝑘 ⌉)) 𝑏 can be chosen to be closer to 𝜔 than 𝐶𝑘−1 . 𝑐 𝑎 However, if and are on the same side of 𝜔, as illustrated in Figure 9b, the 𝑏 𝑑 left-hand branch of the hyperbola intersects the line 𝑦 = 𝜔 at 𝑠 = 𝑠𝑘 where 𝑠𝑘 < 0. 𝑎 So 𝜖𝑘 = −1. As before, the set {𝑓(𝜖𝑘 ⌊|𝑠𝑘 |⌋), 𝜔, 𝑓(𝜖𝑘 ⌈|𝑠𝑘 |⌉), } is in either increasing 𝑏 𝑎

or decreasing order. Thus 𝑓(𝜖𝑘 ⌈|𝑠𝑘 |⌉) is closer to 𝜔 than . So 𝐶𝑘 (which is either 𝑏 𝑓(𝜖𝑘 ⌊|𝑠𝑘 |⌋) or 𝑓(𝜖𝑘 ⌈|𝑠𝑘 |⌉)) can be chosen to be closer to 𝜔 than 𝐶𝑘−1 . Hidden in the proof of Proposition 22 is an explanation of why 𝜖𝑘 = 1 for the regular continued fraction and 𝜖𝑘 = −1 for the negative continued fraction for all positive integers 𝑘. By way of review and clarification, note that algorithms 𝑅, 𝑁, 𝑍, 𝐺, and 𝑂 are customary by Definition 21; thus, the magnitude of all partial denominators 𝜖𝑖 𝑛𝑖 generated by these algorithms are all at least 1 for integers 𝑖 > 0. Corollary 23: Regular and negative convergents, adapted from [83, Theorem 18]. Let 𝜔 be a positive irrational number. The convergents of Algorithm 𝑅 oscillate about 𝜔, whereas the convergents of Algorithm 𝑁 approach 𝜔 from 𝑎 𝑐 above. Let 𝐶𝑘 = and 𝐶𝑘−1 = be regular convergents of 𝜔 for all 𝑘 > 0. Then 𝑏 𝑑 either 1 1 or |𝜔 − 𝐶𝑘 | < 2 . |𝜔 − 𝐶𝑘−1 | < 2 2𝑑 2𝑏 Proof. Consider Algorithm 𝑅. By Definition 21, 𝐶0 = ⌊𝜔⌋ < 𝜔. So 𝑠1 = and 𝑛1 = ⌊

1 𝑤−⌊𝑤⌋

1 𝑤−⌊𝑤⌋

>1

⌋. Thus

𝐶1 =

𝑛1 ⌊𝑤⌋ + 1 1 = ⌊𝜔⌋ + > ⌊𝜔⌋ + 𝜔 − ⌊𝜔⌋ = 𝜔. 𝑛1 𝑛1

14 We have portrayed the case where the hyperbola is always decreasing (except at the point of discontinuity). The proof of the proposition is valid for the other case—when the hyperbola is always increasing.

326

Chapter IX: Continued Fractions

Since 𝜔 is between 𝐶0 and 𝐶1 , 𝑠2 > 0 as in Proposition 22. Since 𝑛2 = ⌊𝑠2 ⌋, 𝐶2 is on the same side of 𝜔 as 𝐶0 . Thus 𝐶2 < 𝜔, and so on. 1 Consider Algorithm 𝑁. 𝐶0 = ⌈𝜔⌉ > 𝜔 and 𝑠1 = < −1. So 𝜖1 = −1, 𝜔−⌈𝜔⌉

𝑛1 = ⌈

1 ⌈𝜔⌉−𝜔

⌉, and

𝑛1 ⌈𝜔⌉ − 1 1 = ⌈𝜔⌉ − > ⌈𝜔⌉ − (⌈𝜔⌉ − 𝜔) = 𝜔. 𝑛1 𝑛1 Since both 𝐶0 and 𝐶1 are on the same side of 𝜔, 𝑠2 < 0 as in the proposition. Since 𝑛2 = ⌈|𝑠2 |⌉, 𝐶2 will be on the same side of 𝜔 as both 𝐶0 and 𝐶1 , and so on. To prove the last statement of the corollary, observe that, as already shown, 𝜔 lies between 𝐶𝑘−1 and 𝐶𝑘 . Thus 1 1 1 < 2 + 2. |𝜔 − 𝐶𝑘 | + |𝜔 − 𝐶𝑘−1 | = |𝐶𝑘 − 𝐶𝑘+1 | = 𝑏𝑑 2𝑏 2𝑑 The second equality in the above expression is true by Lemma 17, and the inequality is true because the geometric mean of two different positive numbers 1 1 and 2 is less than their arithmetic mean; see Exercise III.5f. Since a con2 𝐶1 =

𝑏

𝑑

tradiction would result if |𝜔 − 𝐶𝑘−1 | ≥ true.

1 2𝑑 2

and |𝜔 − 𝐶𝑘 | ≥

1 2𝑏2

, the corollary is

What is especially nice about regular convergents—as presented in Exercise 𝑝 1 𝑝 5—is that whenever |||𝜔 − ||| < 2 , is a regular convergent, where 𝜔 is a positive 𝑞

2𝑞

𝑞

irrational number and 𝑝 and 𝑞 are positive integers. Example 24: Continued fractions for 𝑒. In this example we illustrate the various decision rules of Definition 21. To indicate which algorithm is being used in the text, we subscript each list of partial denominators with one of the letters 𝐺, 𝑀, 𝑁, 𝑂, 𝑅, 𝑆, and 𝑍. R: The regular continued fraction for 𝑒 begins with 𝑛0 = ⌊𝑒⌋ = 2. With 𝐶−1 = 2

and 𝐶1 =

1

, we have 𝑠1 =

1 𝑒−2 15

Continuing in this fashion gives

≈

1 0.718

≈ 1.39. So 𝜖1 = 1 and 𝑛1 = 1.

𝑒 = [2; 1, 2, 1, 1, 4, 1, 1, 6, 1, 1, 8, …]𝑅 → {2, 3,

8 11 19 87 106 , , , , , …} . (6) 3 4 7 32 39

N: The negative continued fraction for 𝑒 begins with 𝑛0 = ⌈𝑒⌉ = 3. With 𝐶−1 = 3

1

1

𝑒−3

and 𝐶1 = , 𝑠1 =

1 0

1 0

≈ −3.55. So 𝜖1 = −1, 𝑛1 = 4, and so on, giving

𝑒 = [3; −4, −3, −2, −2, −2, −3, −8, −3, −2, …]𝑁 → {3,

11 30 49 68 , , , , …} . 4 11 18 25

15 For the moment, ignore the underline beneath various partial denominators in the list of Equation 6. Its significance is explained in the paragraph on M below.

A plethora of continued fractions

327

G, S, Z: For 𝑒, the greedy, singular, and nearest integer algorithms all yield the same result. We illustrate Algorithm 𝑍. Thus, 𝑛0 = [𝑒] = 3. As with Algorithm 1 𝑁 above, 𝑠1 = ≈ −3.55. So 𝜖1 = −1, 𝑛1 = [3.55] = 4, and so on, giving 𝑒−3

𝑒 = [3; −4, −2, 5, −2, 7, −2, 9, −2, 11, …]𝑍 → {3,

11 19 106 193 , , , , …} . 4 7 39 71

(For an example where Algorithms 𝐺 and 𝑍 yield different results, see Example 27.) M, O: For 𝑒, the optimal algorithm and Minkowski’s diagonal algorithm both yield the same result. We illustrate Algorithm 𝑀. From Chapter VII, a rational 𝑝 𝑝 1 approximation for 𝜔 is said to be good if |||𝜔 − ||| < 2 . This inequality is the 𝑞

𝑞

𝑞

same as 𝑞|𝑞𝜔−𝑝| < 1. The value 𝜃 = 𝑞|𝑞𝜔−𝑝| is said to be the normalized error or simply the error 𝜃𝑘 in step 𝑘 of a continued fraction algorithm for 𝜔. The list of errors 𝜃𝑘 corresponding to the list of regular convergents in Equation (6) is {0.72, 0.28, 0.46, 0.51, 0.20, 0.48, 0.51, 0.14, 0.49, 0.503, 0.11, …}.

(7)

For example, we recalculate the third error term 𝜃2 ≈ 0.46 in Equation 7. Since 8 𝑅’s third convergent is 𝐶2 = , 𝜃2 = 3|3𝑒 − 8| ≈ 0.46. Observe that in Equa3 tion (6) the underlined partial denominators 𝑛𝑘+1 (all of which are equal to 1) correspond to those errors 𝜃𝑘 less than 0.5 from Equation (7). Algorithm 𝑀 will discard these partial denominators using the pruning tool of the following identity: 1 1 𝛼+ (8) =𝛼+1− 1 𝛽+1+𝜓 1+ 𝛽+𝜓

where 𝛼, 𝛽, and 𝜓 are algebraic expressions. Thus in any sequence of partial denominators we can replace [… , 𝜖𝑘−1 𝑛𝑘−1 , 1, 𝑛𝑘+1 , …]

with

[… , 𝜖𝑘−1 (𝑛𝑘−1 + 1), −(𝑛𝑘+1 + 1), …],

thereby shortening any finite expansion of partial denominators. In particular, 106 11 , and Algorithm 𝑀 will discard the convergents 𝐶0 = 2, 𝐶3 = , 𝐶6 = 1457

4

39

𝐶9 = . To do so, apply Equation (8) at each underlined 1 in Equation (6), 536 resulting in 𝑒 = [3; −3, 2, −5, 2, −7, 2, −9, …]𝑀 → {3,

8 19 87 193 1264 , , , , , …} , 3 7 32 71 465 1

the convergents of which all have (normalized) error less than . (See Exam2 ple 29 for an application of algorithm 𝑂.) ♢

328

Chapter IX: Continued Fractions

Why the ugly duckling 𝐺 is really a swan Compared to Algorithms 𝑅, 𝑁, 𝑍, and 𝑆, Algorithm 𝐺 may seem to be a poor one because, computationally, it is the most complicated. Faced with the choice of choosing ⌊|𝑠𝑘 |⌋ or ⌈|𝑠𝑘 |⌉ as its next partial denominator in approximating the irrational number 𝜔, the regular algorithm chooses the former and the negative algorithm chooses the latter. The nearest integer algorithm selects the one nearer to |𝑠𝑛 |. The singular algorithm skews its decision according to a silver mean rather than the arithmetic mean. What does the greedy algorithm do? At step 𝑘, let its 𝑘 − 1 and 𝑘 − 2 convergents be 𝐶𝑘−1 and 𝐶𝑘−2 . From the two mediants 𝜖𝑘 ⌊|𝑠𝑘 |⌋𝐶𝑘−1 ⊕ 𝐶𝑘−2 and 𝜖𝑘 ⌈|𝑠𝑘 |⌉𝐶𝑘−1 ⊕ 𝐶𝑘−2 , 𝐺 selects the one closer to 𝜔, so deciding between ⌊|𝑠𝑘 |⌋ and ⌈|𝑠𝑘 |⌉. However, as we show, this cumbersome rule can be replaced with a dynamic decision rule based upon the denominators of 𝐶𝑘−1 and 𝐶𝑘−2 . First we show that Algorithm 𝐺 produces tame continued fractions. Proposition 25: The greedy algorithm is tame. Let 𝜔 be a positive irrational number. Let 𝜖𝑘 𝑛𝑘 and 𝐶𝑘 be Algorithm 𝐺’s partial denominators and convergents for 𝜔, for all positive integers 𝑘. Then 𝑛𝑘 ≥ 2 and 𝐶𝑘 is closer to 𝜔 than any other simpler fraction. Proof. Without loss of generality we assume that 0 < 𝜔 < 1 (otherwise we would 1 1 analyze the irrational number 𝜔−⌊𝜔⌋). If 𝜔 < , then 𝑛0 = 0 = 𝐶0 . If 𝜔 > , then 2 2 𝑛0 = 1 = 𝐶0 . In either case, no fraction simpler than 𝐶0 is closer to 𝜔. Suppose 1 𝑛 0 1 1 𝑛0 = 0. Since 𝐶−1 = and 𝐶1 = 0 = , Equation (5) yields 𝑠 = > 1 = 2. So 0

1

1

𝜔

( ) 2

𝑛1 ≥ 2 and 𝜖1 = 1. Observe that 𝜔 is between the Farey fraction neighbors 1 ⌈𝑠⌉

1 ⌊𝑠⌋

and

, and no fraction simpler than these is closer to 𝜔. Next, suppose that 𝑛0 = 1.

This time, 𝐶−1 = 1

1 0

and 𝐶1 =

𝑛0 1

=

1 1

𝜔 > , 𝑠 < −2. Thus 𝑛1 ≥ 2 and 𝐶1 = 2

neighboring Farey fractions 1 +

1 ⌊−𝑠⌋

, and Equation (5) yields 𝑠 = −𝑛1 +1 −𝑛1

and 1 +

= 1− 1

⌈−𝑠⌉

1 𝑛1

1 𝜔−1

. Since

. Since 𝜔 is between the

(one of which is 𝐶1 ), no fraction

simpler than 𝐶1 is closer to 𝜔. Suppose that for some integer 𝑘 > 1, we know that 𝑛𝑗 ≥ 2, 𝐶𝑗 is closer to 𝜔 than any simpler fraction, and 𝐶𝑗 = 𝜖𝑗 𝑛𝑗 𝐶𝑗−1 ⊕ 𝐶𝑗−2 for all integers 1 ≤ 𝑗 ≤ 𝑘. Let 𝐶𝑘+1 = 𝜖𝑘+1 𝑛𝑘+1 𝐶𝑘 ⊕ 𝐶𝑘−1 . Since 𝜔 is between the Farey neighbors 𝜖𝑘+1 ⌊|𝑠𝑘+1 |⌋𝐶𝑘 ⊕𝐶𝑘−1 and 𝜖𝑘+1 ⌈|𝑠𝑘+1 |⌉𝐶𝑘 ⊕𝐶𝑘−1 , no fraction simpler than 𝐶𝑘+1 is closer to 𝜔. If 𝑛𝑘+1 = 0, then 𝐶𝑘+1 = 𝐶𝑘−1 , a contradiction since this would mean that 𝐶𝑘−1 , a fraction simpler than 𝐶𝑘 , is closer to 𝜔 than 𝐶𝑘 . If 𝜖𝑘+1 𝑛𝑘+1 = −1, then 𝐶𝑘+1 = (−𝐶𝑘 )⊕𝐶𝑘−1 = 𝐶𝑘 ⊕(−𝐶𝑘−1 ), a fraction simpler than 𝐶𝑘 and closer to 𝜔 than 𝐶𝑘 , a contradiction. If 𝜖𝑘+1 𝑛𝑘+1 = 1, then 𝐶𝑘+1 = 𝐶𝑘 ⊕ 𝐶𝑘−1 . But 𝐶𝑘+1 = 𝐶𝑘 ⊕ 𝐶𝑘−1 = (𝜖𝑘 𝑛𝑘 𝐶𝑘−1 ⊕ 𝐶𝑘−2 ) ⊕ 𝐶𝑘−1 = (𝜖𝑘 𝑛𝑘 + 1)𝐶𝑘−1 ⊕ 𝐶𝑘−2 ,

Why the ugly duckling 𝐺 is really a swan

329

a contradiction to the choice of 𝐶𝑘 being convergent 𝑘, because 𝐶𝑘+1 would have been preferred as convergent 𝑘 rather than 𝐶𝑘 . Proposition 26: A mean-value rule for 𝐺. Let 𝜔 be a positive irrational num𝑎 𝑐 ber. At step 𝑘 ≥ 1 with respect to Algorithm 𝐺, let 𝐶𝑘−1 = and 𝐶𝑘−2 = . Let 𝑠𝑘 =

𝑐−𝜔𝑑 𝜔𝑏−𝑎

𝑏

𝑑

and 𝜖𝑘 = sgn(𝑠𝑘 ). If |𝑠𝑘 | − ⌊|𝑠𝑘 |⌋ < 𝛿𝑘 where 𝛿𝑘 =

𝑑 + 𝜖𝑘 ⌊|𝑠𝑘 |⌋ 𝑏 , 2𝑑 + 𝜖𝑘 (2⌊|𝑠𝑘 |⌋ + 1)𝑏

(9)

then 𝑛𝑘 = ⌊|𝑠𝑘 |⌋; otherwise 𝑛𝑘 = ⌈|𝑠𝑘 |⌉. Equivalently, 𝑛𝑘 = ⌈|𝑠𝑘 | − 𝛿𝑘 ⌉. 𝑎𝑚+𝑐

Proof. Let 𝑓(𝑚) = . For the moment, think of 𝑚 as a positive integer. Since 𝑏𝑚+𝑑 𝑓 is continuous and monotonic for all 𝑚 > 0, there exists a number 𝛿 > 0 with 𝑓(𝑚) + 𝑓(𝑚 + 1) 𝑓(𝑚 + 𝛿) = . (10) 2 Although we can solve this equation with pencil and paper, it may take some time. Use of a computer algebra system16 almost immediately gives 𝑑 + 𝑚𝑏 𝛿= , 2𝑑 + (2𝑚 + 1)𝑏 which is equivalent to Equation (9) when 𝑠𝑘 ≥ 1, 𝜖𝑘 = 1, 𝑚 = ⌊𝑠𝑘 ⌋, and 𝛿 = 𝛿𝑘 . Now think of 𝑚 as a negative integer. The analog to Equation (10) is 𝑓(𝑚 − 𝛿) =

𝑓(𝑚) + 𝑓(𝑚 − 1) , 2

and its solution via a CAS is 𝑑 + 𝑚𝑏 , 2𝑑 + (2𝑚 − 1)𝑏 which is equivalent to Equation (9) when 𝑠𝑘 ≤ −1, 𝜖𝑘 = −1, 𝑚 = 𝜖𝑘 ⌊|𝑠𝑘 |⌋, and 𝛿 = 𝛿𝑘 . Thus, if |𝑠𝑘 | − ⌊|𝑠𝑘 |⌋ < 𝛿𝑘 , then 𝑛𝑘 = ⌊|𝑠𝑘 |⌋ because 𝑓(⌊|𝑠𝑘 |⌋) is closer to 𝜔 than is 𝑓(⌈|𝑠𝑘 |⌉). Otherwise, 𝑛𝑘 = ⌈|𝑠𝑘 |⌉. 𝛿=

To illustrate the use of this mean-value decision rule, we contrast Algorithms 𝐺 and 𝑍 with respect to the irrational number 𝜋. Example 27: Algorithms 𝐺 and 𝑍 differ. Although the nearest integer and the greedy rules often agree, they sometimes differ. With respect to the irrational number 𝜋, we demonstrate that 𝐺’s ninth convergent is not the same as 𝑍’s ninth convergent. The seventh and eighth 𝐺 and 𝑍 convergents for 𝜋 are 𝑐 80143857 𝑎 245850922 𝐶7 = = and 𝐶8 = = . 𝑑 25510582 𝑏 78256779 16 See

Code 20 in Appendix III.

330

Chapter IX: Continued Fractions 𝑐−𝜋𝑑

≈ −2.41, the nearest integer algorithm chooses its 𝜖9 𝑛9 term as Since 𝑠9 = 𝑏𝜋−𝑎 −2. Thus, 𝑍’s ninth convergent, denoted by 𝑍9 , is 411557987 𝑍9 = −2𝐶8 ⊕ 𝐶7 = . 131002976 What about Algorithm 𝐺? We use Proposition 26 and calculate 𝛿9 : 𝛿9 =

𝑑 + 𝜖𝑘 ⌊|𝑠𝑘 |⌋ 𝑏 25510582 − 2 ⋅ 78256779 = ≈ 0.385. 2𝑑 + 𝜖𝑘 (2⌊|𝑠𝑘 |⌋ + 1)𝑏 2 ⋅ 25510582 − (2 ⋅ 2 + 1)78256779

Since |𝑠9 | − ⌊|𝑠9 |⌋ ≈ 2.41 − 2 = 0.41 > 0.385 ≈ 𝛿9 , the greedy algorithm chooses its 𝜖9 𝑛9 term as −3. Thus, 𝐺’s ninth convergent, denoted by 𝐺9 , is 657408909 . 𝐺9 = −3𝐶8 ⊕ 𝐶7 = 209259755 Although 𝑍9 ’s (normalized) error term 𝜃9 is about 0.33 and 𝐺9 ’s (normalized) error term 𝜃9 is about 0.75, the reader may check that 𝐺9 is indeed closer to 𝜋 than is 𝑍9 . ♢

An interlude delineating Algorithm 𝑂∗ The optimal continued fraction’s decision rule for choosing the next convergent is remarkably similar the greedy continued fraction’s rule. Proposition 28: A mean-value rule for 𝑂. Let 𝜔 be a positive irrational num𝑎 𝑐 ber. At step 𝑘 ≥ 0 with respect to Algorithm 𝑂, let 𝐶𝑘−1 = and 𝐶𝑘−2 = . Let 𝑠𝑘 =

𝑐−𝜔𝑑 𝜔𝑏−𝑎

𝑏

𝑑

and 𝜖𝑘 = sgn(𝑠𝑘 ). If |𝑠𝑘 | − ⌊|𝑠𝑘 |⌋ > 𝛿𝑘 where 𝛿𝑘 =

𝑑 + 𝜖𝑘 ⌈|𝑠𝑘 |⌉ 𝑏 2𝑑 + 𝜖𝑘 (2𝑏⌈|𝑠𝑘 |⌉ + 1)

(11)

then 𝑛𝑘 = ⌈|𝑠𝑘 |⌉, otherwise 𝑛𝑘 = ⌊|𝑠𝑘 |⌋. Equivalently, 𝑛𝑘 = ⌈|𝑠𝑘 | − 𝛿𝑘 ⌉. Proof: By Exercise IV.6c, at least one of 𝜖𝑘 ⌊|𝑠𝑘 |⌋𝐶𝑘−1 ⊕ 𝐶𝑘−2 or 𝜖𝑘 ⌈|𝑠𝑘 |⌉𝐶𝑘−1 ⊕ 𝐶𝑘−2 is a really-good approximation to 𝜔. If both are really-good, then we choose the latter because otherwise the next partial denominator will be either ±1, which may give a non-tame continued fraction expansion. Therefore we choose the latter if 𝑎𝜖 ⌈|𝑠 |⌉ + 𝑐 | 1 | |𝜔 − 𝑘 𝑘 |< , (12) | 𝑏𝜖𝑘 ⌈|𝑠𝑘 |⌉ + 𝑑 | 2(𝑏𝜖𝑘 ⌈|𝑠𝑘 |⌉ + 𝑑)2 otherwise we choose the former. As an algebra exercise we leave for the reader, Equation (12) becomes |𝑠𝑘 | − ⌊|𝑠𝑘 |⌋ > 𝛿𝑘 , where 𝛿𝑘 is given in (11). Example 29: Euler’s constant via 𝑂. Let 𝜔 = 𝛾 ≈ 0.577216. By Proposition 28, 1

Step 0: 𝑠0 = 𝛾, 𝛿0 = . Since 𝛾 − ⌊𝛾⌋ ≈ 0.5777 > 𝛿0 , then 𝑛0 = 1. 2

Dominance domains

331

Step 1: 𝑠1 ≈ −2.365, 𝛿 = 𝜖1 𝑛1 = −2. Step 2: 𝑠2 ≈ 2.738, 𝛿2 = 𝜖2 𝑛2 = 3. Step 3: 𝑠3 ≈ −3.81, 𝛿3 = 𝜖3 𝑛3 = −4.

3 7 7 16 26 59 149

Step 5: 𝑠4 ≈ −5.35, 𝛿4 = 𝜖4 𝑛4 = −5.

324

≈ 0.428, 𝜖1 = −1, |𝑠1 | − ⌊|𝑠1 |⌋ ≈ 0.365 < 𝛿1 . So ≈ 0.437, 𝜖2 = 1, |𝑠2 | − ⌊|𝑠2 |⌋ ≈ 0.738 > 𝛿2 . So ≈ 0.441, 𝜖3 = −1, |𝑠3 | − ⌊|𝑠3 |⌋ ≈ 0.81 > 𝛿3 . So ≈ 0.460, 𝜖4 = −1, |𝑠4 | − ⌊|𝑠4 |⌋ ≈ 0.325 < 𝛿3 . So

Thus, 𝛾 = [1; −2, 3, −4, −5, 3, 13, …]𝑂 which is the same expansion as given by the nearest integer continued fraction algorithm. However the reader may 15403 and 𝑍’s seventh convergent is check that 𝑂’s seventh convergent is 𝑂7 = 26695

18438

3035

𝑍7 = , while their common sixth convergent is . The reader may check 31943 5258 that 𝑂7 is a really-good approximation for 𝛾 whereas 𝑍7 fails to be one. ♢

Dominance domains We now answer in part a question that arose when exploring signatures of real numbers in Chapter VII. How does the graph of 𝒮𝜔 change as 𝜔 changes, where 𝜔 is a real number? More simply, on what interval 𝐽 about the rational number 𝑎 𝑎 does well-approximate every point in 𝐽? The next proposition is our partial 𝑏 𝑏 answer. To understand the notation in this proposition, review the Chapter IV material on the father and mother fractions of any fraction between 0 and 1. 𝑐ˆ be respectively 𝑏 𝑑 𝑑ˆ ˆ 𝑐 +𝑎𝑠 𝑐+𝑎𝑠 𝑎 and 𝑔(𝑠) = ˆ . For 𝑛 ≥ 2, let the father and mother of . Let 𝑓(𝑠) = 𝑏 𝑑+𝑏𝑠 𝑑 +𝑏𝑠 𝑎 𝑎 the order-𝑛 dominance domain for , denoted by ℬ𝑛 ( ), be the interval whose 𝑏 𝑏 endpoints are 1 𝑎 1 𝑎 (13) (𝑓(𝑛 − 1) + ) and (𝑔(𝑛 − 1) + ). 2 𝑏 2 𝑏 𝑎 𝑎 𝑝 Then is nearer any point in ℬ𝑛 ( ) than any other fraction where 𝑞 < 𝑑 + 𝑛𝑏. 𝑎

Proposition 30: A dominance interval for . Let

𝑏

𝑏

Proof. Observe that 𝑓(𝑛) is the mediant of than 𝑓(𝑛) exists between 1

𝑎

𝑎 𝑏

𝑐

and

𝑞

𝑎 𝑏

and 𝑓(𝑛−1), and no fraction simpler

and 𝑓(𝑛 − 1). Furthermore, any point 𝑥 between 𝑎

𝑎 𝑏

and (𝑓(𝑛 − 1) + ) is closer to than to 𝑓(𝑛 − 1), giving the desired result. The 2 𝑏 𝑏 ˆ on the other side of 𝑎 is similar. argument involving 𝑔 and 𝑑 𝑏

332

Chapter IX: Continued Fractions 1 1 19 1 15 17

1 13

1 4

3 4 11 15 2 7

5 18

5 17

1 7

1 9

6 19 4 13

6 17 1 3

3 5 10 16

3 17

1 6

1 8

1 1 1 1 16 1 12 10 18 14

5 19

3 2 19 13

1 2 2 2 11 19 17 15

5 13

5 14

2 5

7 8 17 19

7 18

3 8

3 4 13 17 2 9

1 5

3 16

7 19 4 11

4 19

2 11

5 12

1 4

3 14

8 5 6 11 13 7 17 9 15 19 4 9

3 7

1 2

7 16

𝑎

Figure 10. Dominance domains ℬ3 ( ) for all fractions 𝑏 with 𝑏 ≤ 19.

𝑎 𝑏

≤

1 2

𝑎

To depict dominance domains ℬ𝑛 ( ) as figures, we render them as ellipses 𝑏

whose left and right endpoints correspond to Equation (13). Figure 10 displays17 about threescore dominance domains of order 3 for all fractions no larger than 1 7 1 1 1 and no simpler than (except that for the fractions and we display ℬ4 ( ) 2

1

19

3

2

1

3 5

and ℬ6 ( ) to avoid an image with overlapping ovals, although ℬ3 ( ) and ℬ3 ( ) 2 4 19 still overlap slightly).

The harmonic algorithm is a chameleon As currently defined, the harmonic algorithm 𝐻 of Chapter VII and the nearest integer continued fraction algorithm 𝑍 turn out to be the same algorithm with respect to generating the same convergents for any given positive irrational number 𝜔. Slight modifications to the harmonic algorithm transform 𝐻 into any of the other continued fraction algorithms we have discussed except 𝑀. 17 A

figure somewhat like Figure 10 appears in [103, p. 266].

The harmonic algorithm is a chameleon

333

Definition 31: A modified harmonic rule. Recall that the decision rule of Algorithm 𝐻 from Chapter VII is the greatest integer function. We say that the harmonic algorithm 𝐻 of Chapter VII is a modified version with respect to Algorithm ℱ (where ℱ is any of the algorithms 𝐺, 𝑁, 𝑂, 𝑅, 𝑆, or 𝑍) if we replace the greatest integer decision rule with the decision rule for ℱ. Proposition 32: The harmonic chameleon 𝐻. Let 𝜔 be a positive irrational number. At step 𝑘 with respect to the continued fraction ℱ (where ℱ is any of 𝐺, 𝑎 𝑐 𝑁, 𝑂, 𝑅, 𝑆, or 𝑍), let its 𝑘−1 and 𝑘−2 convergents be 𝐶𝑘−1 = and 𝐶𝑘−2 = . Let 𝑏 𝑑 𝐶𝑘−1 be a harmonic convergent for 𝐻. Let 𝐻𝑘 be the next harmonic convergent (starting from 𝐶𝑘−1 ) using 𝐻’s modified decision rule with respect to ℱ. Then 𝐶𝑘 = 𝐻𝑘 . 𝑎𝑠+𝑐

= Proof. Recall from Definition 21 that to find 𝐶𝑘 , we let 𝑠0 be the solution to 𝑏𝑠+𝑑 𝜔 and apply ℱ’s decision rule to 𝑠0 to obtain 𝜖𝑘 𝑛𝑘 , so that 𝐶𝑘 = 𝜖𝑘 𝑛𝑘 𝐶𝑘−1 ⊕ 𝐶𝑘−2 . Recall from Proposition VII.21 that to find 𝐻𝑘 , let 𝜖 = sgn(𝑏𝜔 − 𝑎). Let 𝑟 be the integer solution to 𝑎𝑟 ≡ −𝜖 mod 𝑏, for 0 < 𝑟 < 𝑏. Let ˆ𝑠0 be the solution to 𝜖 , 𝑏𝜔 − 𝑎

𝑏𝑠 + 𝑟 = 1

(14)

𝜖

namely, ˆ𝑠0 = ( − 𝑟). Then apply 𝐻’s modified decision rule to ˆ𝑠0 , so pro𝑏 𝑏𝜔−𝑎 ducing integer 𝑛 where 𝑛 is either ⌊ˆ𝑠0 ⌋ or ⌈ˆ𝑠0 ⌉. Thus, the denominator of 𝐻𝑘 is 𝑏𝑛 + 𝑟, and its numerator is the integer [𝜔(𝑏𝑛 + 𝑟)]. 𝑎𝑠+𝑐 = 𝜔 is The proof of the proposition follows upon showing that solving 𝑏𝑠+𝑑 essentially the same as solving Equation (14). We do so by analyzing four cases, corresponding to the number of ways that 𝐶𝑘−1 , 𝐶𝑘−2 , and 𝜔 can be ordered from least to greatest. Since the reasoning for each case is similar, we belabor Case 1 and leave the details of the other cases to the reader. Case 1: Suppose

𝑎 𝑏

< 𝜔
2. (iv) Therefore 𝐶𝑘+1 = 𝑚𝐶𝑘 ⊕ 𝐶𝑘−1 for some integer 𝑚 with |𝑚| ≥ 2. Adapt the argument of part (d) to show that Algorithm 𝑁 generates tame continued fractions. Adapt the argument of part (d) to show that Algorithm 𝑆 generates tame continued fractions. Provided 𝑥 = [𝑎0 ; 𝜖1 𝑎1 , 𝜖2 𝑎2 , 𝜖3 𝑎3 , …] repeats and exists, use mathematical induction to prove that 𝑥 = 𝑝 + 𝑞√𝑟, where 𝑝 and 𝑞 are rational numbers and 𝑟 is a nonnegative integer. Recall Proposition VII.1: The simplest fraction. Let 𝐴 and 𝐵 be two irrational numbers whose simple continued fractions differ for the first time at partial fraction 𝑘. Let 𝑎𝑘 and 𝑏𝑘 be partial denominators 𝑘 for 𝐴 and 𝐵, respectively. Then 𝐶 = [𝑐0 ; 𝑐1 , 𝑐2 , … , 𝑐𝑘−1 , 𝑐𝑘 ] is the simplest fraction between 𝐴 and 𝐵, where 𝑐𝑖 is the common partial denominator 𝑖 of 𝐴 and 𝐵 for 0 ≤ 𝑖 ≤ 𝑘 − 1, and 𝑐𝑘 = min{𝑎𝑘 , 𝑏𝑘 } + 1. Complete the outline below to prove this result. (i) Without loss of generality, assume that 𝑎𝑘 < 𝑏𝑘 . If 𝑘 = 0, then the simplest fraction between 𝐴 and 𝐵 is 𝑎0 + 1. (ii) Let 𝐶𝑘−2 = [𝑐𝑜 ; 𝑐1 , … , 𝑐𝑘−2 ] and 𝐶𝑘−1 = [𝑐𝑜 ; 𝑐1 , … , 𝑐𝑘−1 ]. Let 𝑓(𝑡) = 𝑡𝐶𝑘−1 ⊕ 𝐶𝑘−2 . By Proposition 22, 𝑓(𝑡) is continuous and monotonic on the interval (0, ∞). Observe that 𝑓(𝑎𝑘 ) fails to lie between 𝐴 and 𝐵, but 𝑓(𝑚) lies between 𝐴 and 𝐵 for all integers 𝑚 with 𝑎𝑘 < 𝑚 ≤ 𝑏𝑘 . (iii) Observe that the denominators of 𝑓(𝑚) increase monotonically.

Exercises

349

5. Explain why each of the following statements are true in the proof of the following result: A sufficient regular condition. Let 𝜔 be a positive irrational num𝑟 𝑟 1 𝑟 ber and be a reduced fraction with ||𝜔 − || < 2 . Then is a reg𝑠 𝑠 2𝑠 𝑠 ular convergent. (a) Suppose that vergents

𝑝𝑘

𝑟 𝑠

is not a regular convergent. Then there exist regular con-

and

𝑞𝑘 𝑝𝑘 |

(b) So 𝑞𝑘 |||𝜔 −

(c) Then |||𝜔 −

𝑞𝑘+1

such that 𝑞𝑘 ≤ 𝑠 < 𝑞𝑘+1 .

| ≤ 𝑠 |||𝜔 −

𝑞𝑘 | 𝑝𝑘 |

|
1. (b) The signatures of some numbers such as √2, √3, the golden mean, and 𝑒 display no obvious suggestions for a specific number of branches, un√229−15

like, for example, 𝑥 = [0; 15, 15, 15, …] = and 𝜋, which have 2 respective obvious branch numbers 15 and 7. Can you guess why? (Hint: Consider a continued fraction list of partial denominators whose first few terms are relatively small integers, and contrast the associated signature with that of a list whose first few partial denominators are relatively large.) 10. Suppose surface gravity on a uniformly dense Earth is 12 m/sec2 . With respect to Table 1, how would Marie’s answers change?

Strand X: The Longevity of the 17-year Cicada The longevity of the cicada insect is a curious entomologic-mathematical mystery.1 Cicadas live underground in their nymph stage sipping sap from tree roots. Seventeen years later they emerge from the ground all at once as a brief-lived, bewinged, singing, mating horde.

Figure 1. Adult cicada, author sketch.

Various predator-prey dynamic schemes have been used to model why the cicada lives so long. One math-modeling group concluded that “no plausible ecological mechanisms [exist] that select for periods being” as large as seventeen years. Instead the explanation for such insect longevity probably “resides in physiological or genetic mechanisms” [88]. More recently, Markus [60] presented a predator-prey “evolutionary game” where predator and prey jump to new life cycles according to a fitness heuristic in which the prey ultimately finds refuge in a

1 Unlike the cicada, most insect species live less than several years. Some termite queens and various beetles have life spans up to sixty years, but their longevity, like that of humans, is serendipitous.

351

352

Strand X: The Longevity of the 17-year Cicada

prime number.2 One reviewer called Markus’s game interesting, “however [the game] doesn’t really explain why periodical cicadas have 13-year or 17-year cycles” [114]. That is, currently there is no explanation. Although speculative, we offer a continued fraction suggestion. Here is our outline of conjectures and calculations. Conjecture 1: Cretaceous cicadas. Cicada species have been in existence on Earth for at least 100 million years. Furthermore, their similarly structured ancestors may have been in existence for at least 200 million years. Rationale. Archaeologists have found amber crystals into which cicadas had become entombed 100 million years ago—and a study “reveals a remarkable ♢ degree of morphological conservatism over 100 million years” [118], [148]. Conjecture 2: Instinctive awareness of time of day and season of the year. The cicada can instinctively sense the seasons of the year and the time of day even though it lives underground. Rationale. The cicada feeds on the root ends of trees. Trees have a daily photosynthesis cycle. This cycle is a veritable clock for the cicada. Trees are induced into various stages of foliage, sap flow, and root development by changes in duration of daylight. This cycle is a yearly calendar for the cicada. ♢ Conjecture 3: Instinctive awareness of the Moon’s phases. The cicada may be able to sense the current phase of the Moon. Rationale. This assumption may seem audacious—and is the science fiction in this discussion. However, various studies have concluded that honeybee hormone intensity varies according to the phases of the Moon [102]. Being diurnal, the honeybee is in its hive during the night at the only time when the Moon may appear as more than a cloud in the sky. Perhaps the bee can sense variation in gravity due to the relative positions of the Moon and Sun. In completely different studies, marine biologists have concluded that the adult coral, even though it has neither eyes nor brain, possesses a gene that “allows the coral to sense blue light and to work out what phase the Moon is in” [66]. Once a year at spring during a full moon, presumably even when the night sky is overcast, over 400 different species of coral all spawn simultaneously. How does coral distinguish between a quarter moon and a full moon? How does coral know when spring occurs? Finally, if bees and coral have a knack for knowing the Moon’s phases, other creatures may as well. 2 Briefly, in this game, Markus allows the life cycle of cohorts of a creature’s progeny to mutate by a year or two in each generation. However, in this game, Markus severely restricts predator cohorts never to mutate to a life cycle more than half the life cycle of its prey. Thus in due course, when the prey stumbles upon a prime as its life cycle, the predator is eventually thwarted in finding a life cycle for which its future generations could often feast on the prey.

Strand X: The Longevity of the 17-year Cicada

9 7 17 8 13 6 11 15 10 19

1 2

9 16

11 19

Figure 2. From 12 +

5 9

353

3 5

4 7

2 3

7 10 11 12 17 18 8 5 13 8 2 3

to 12 +

10 19

9 14 12 7 19 11

11 17

over the eons.

The acceleration due to Earth’s mass at Earth’s surface is 𝑔 ≈ −9.8 m/sec2 . The Moon’s mass changes this value by about 0.000033 m/sec2 while the Sun’s mass changes this value by about 0.0057 m/sec2 . Since the former variation can be both positive and negative (positive when the point 𝑃 on Earth’s surface is facing the Moon and negative when 𝑃 is facing away from the Moon), this variation due to the Moon alone is about 0.00006 m/sec2 . The similar variation with respect to the Sun is about 0.011 m/sec2 . Perhaps these variations are enough for a bee or cicada to sense instinctively. Although these variations in gravity seem incredibly small, they are the quantities governing the tides. ♢ Conjecture 4: Leapfrogging timespans. The cicada leapfrogged over the eons from lesser lifespans to greater lifespans. From the model presented in Example IX.45, about two hundred million years ago, the Moon’s relative angular ve2 locity was in the dominance domain of 12 + moons/year. So about 200 million 3 years ago, perhaps the creature that became the cicada had a lifespan of three years. As the eons passed, the Moon’s relative angular velocity transitioned into 3 the dominance domain of 12 + . Likewise, perhaps the cicada transitioned to a 5 life span of five years, and so on, to an ever increasing life span, as the number of 2 9 moons per year decreased from about 12 + to 12 + approximately 75 million 3 17 years ago, following the progression of dominance domains indicated in Figure 2.

354

Strand X: The Longevity of the 17-year Cicada

Rationale. In particular, imagine that the Sun and Moon orientation 𝑌 is recorded in genes of the cicada 𝒜 as a time stamp, perhaps at the moment when laid as an egg. For simplicity, imagine that this time stamp was set in late spring at noon on 15 June in year 𝑋. To further simplify, suppose that 𝑌 is a full moon. Suppose that the next occurrence 𝑡 years later of a full moon near noon on some day near 15 June serves as a trigger for 𝒜 to emerge from the ground. Suppose that this nearness with respect to noon is a matter of minutes and that the nearness with respect to 15 June is no more than two days. Table 1. A sequence of decreasing rational numbers corre𝑎 sponding to 12 + . 𝑏

𝑏

3

5

7

9

11

13

𝑎

2

3

4

5

6

7

𝑏

3

5

7

9

11

13

15

17

19

8

9

10

15

17

19

𝑎

Let the number of moons per year be 12 + , where 𝑎 and 𝑏 are relatively 𝑏 prime positive integers with 𝑎 < 𝑏. Let 𝑡 be the integer number of years after year 𝑋, where 1 ≤ 𝑡 ≤ 𝑏. In year 𝑡, let 𝑤(𝑏, 𝑡) be the time in days from noon on 15 June to the time of the full moon nearest 15 June. We call 𝑤(𝑏, 𝑡) the emergence window radius for the cicada. Table 2 shows the window radii 𝑤(𝑏, 𝑡) for all values of 𝑏 and 𝑡 where 1 ≤ 𝑡 ≤ 𝑏 and 𝑏 ranges over the odd integers from three through nineteen.3 For example, with 𝑏 = 3, corresponding to the lunar phase period of 2 12 + moons/year, the window radius at 𝑡 = 1 and 𝑡 = 2 years is about 𝑤(3, 1) ≈ 3 10 days ≈ 𝑤(3, 2), which means that no full moon occurs during 6 through 24 June in year 𝑋 + 1 and 𝑋 + 2. For the thousands of years when the Moon’s phases 2 were clearly within the dominance domain of 12 + , the cicada’s genetic trigger 3 could fire without confusion after three years of underground living. However, as time went on and the Moon’s phases wandered outside the dominance domain 2 of , perhaps the cicada’s genetic trigger for emerging morphed into some kind 3 of countdown. 3 But as the Moon’s phases wandered into the dominance domain of 12 + , 5 the old genetic trigger mechanism may have quickened. This time, during years 𝑡 = 1 through 5 the window radii were 12, 6, 6, 12, and 0 days, respectively. Again, the window radii for 𝑡 = 1 through 𝑡 = 4 were perhaps sufficiently large for the cicada’s genetic trigger to fire without confusion on year 𝑡 = 5. 3 In

𝑎

particular, these window radii of Table 2 were calculated using the fractions from Table 1. 𝑏 For any window radii exceeding two days, we rounded the radius to the nearest integer. See Code 22 in Appendix III for an outline of how to generate such tables.

Strand X: The Longevity of the 17-year Cicada

355

Perhaps similar phenomena occurred when the Moon’s phases wandered 4 5 6 7 into the dominance domains of 12 + , 12 + , 12 + , and 12 + . 7 9 11 13 When the Moon’s phases eventually wandered into the dominance domain 8 of 12 + , the cicada may have had difficulty weathering year 𝑡 = 13 without 15 its genetic trigger firing, because in that year the window radius was under two days. Again, perhaps when the Moon’s phases were in the dominance domains 10 9 of 12 + and 12 + , the cicada’s genetic trigger may have fired in years fifteen 17 19 and seventeen, respectively. Table 2. Window radii about 15 June in days with respect to period 𝑏 years and year 𝑡. ⎧ ⎪ ⎪ 3 ⎪ 5 ⎪ ⎪ 7 9 𝑏 ⎨ ⎪ 11 ⎪ 13 ⎪ 15 ⎪ ⎪ 17 ⎩ 19

1 10 12 13 13 13 13 14 14 14

2 10 6 4 3 3 2 1.9 1.7 1.5

3 0 6 8 10 11 11 12 12 12

4

5

6

7

8

12 8 6 5 5 4 3 3

0 4 13 6 10 8 8 9 7 10 6 10 5 11 5

0 3 5 7 8 9 10

13 11 9 8 7 6

𝑡 in years 9 10 11

0 3 5 6 7 8

13 11 10 9 8

0 2 4 5 6

12

13

14

15

13 12 10 9

0 1.9 3 4

14 0 12 1.7 11 3

16

17

18

19

14 0 12 1.5 14

0

Suppose some species 𝒮 of the cicada has a lesser sensitivity to the orientation of the Moon and Sun than the 17-year cicada. Perhaps species 𝒮 might instinctively accept a full moon within, say, a window of four days about 15 June in some year beyond, say 𝑋 + 5, as being sufficiently near orientation 𝑌 that their genetic trigger to emerge from the ground might fire. Thus, as can be seen from the last row of Table 2, the 𝒮 cicadas would emerge as a horde in year thirteen. In fact, some cicada species have lifespans of 13 years. 10 After passing through the domain of dominance of 12 + about one hun19 dred million years ago, what might the cicada have done? We imagine that the cicada may have developed a countdown algorithm to seventeen years, and has maintained that algorithm down through the eons. Furthermore, once the countdown reaches zero, the cicada horde waits until the ground eight inches deep is at least 64∘ F (18∘ C) and until a warm rain moistens the ground [26]. Finally, how can a species increase (or decrease) its lifespan by a significant leap? Once the cicada nymph matures, it is more or less dormant. It naps as a veritable Rip van Winkle4 —and awaits a signal to awake and fly. ♢ 4 In 1819 Washington Irving wrote a story about a man named Rip van Winkle who drank moonshine atop a Catskill mountain in New York, fell asleep, and awakened twenty years later.

356

Strand X: The Longevity of the 17-year Cicada

To emphasize again, the above conjectures are speculation. Yet the sequence of dominance domains through which the Moon has transitioned from about 200 million through 75 million years ago, 3 4 5 6 7 8 9 2 → 12+ → 12+ → 12+ , 12+ → 12+ → 12+ → 12+ → 12+ 3 5 7 9 11 13 15 17 may very well be a continued fraction key to solving the mystery of why the cicada lives so long, a rarity among insects. For at the same time that the Moon was transitioning from natural cycles of new moons with periods 3, 5, 7, 9, 11, 13, 15, 17 years the cicada may also have been transitioning to those same periods. Pure coincidence? Maybe not. In this next chapter, we use continued fractions to help predict the occurrence of a very rare astronomical event in our solar system

Chapter X: Transits of Venus At least once each year, Venus passes between Earth and the Sun. Because the orbital planes of Earth and Venus intersect one another at an angle, only rarely does Venus come directly between Earth and the Sun. On these occasions, the profile of Venus—a transit of Venus across the Sun—can be viewed from Earth. The last transit was in June 2012, as depicted in Figure 3. The next transit will occur in December 2117. Ascertaining the periodicity of the transits is a delicate problem and is a good application of continued fractions, the focus of this chapter. In particular, the periods of Earth and Venus are 𝜏𝑒 ≈ 365.26 days and 𝜏𝑣 ≈ 224.70 days, respectively. By Kepler’s third law of planetary motion, with time 𝑡 in years and distance in astronomical units (AU), 𝑎3 = 𝜏2 , where 𝑎 is the semimajor axis of a planet’s elliptical orbit and 𝜏 is its period. Thus, Venus is 𝜆 ≈ 0.723 AU from the Sun 𝑆. Relative to Earth’s angular frequency of one rotation per year, Venus makes 𝜔0 ≈ 1.62555 rotations per year. From this value how can we deduce the 105-year transit lapse between, say, 2012 and 2117? In general, as we allow the angular velocity 𝜔 to vary, how does the time lapse between transits change? The answer is surprisingly chaotic.

Figure 3. A Venus transit as might have been viewed (with heavy sunglasses) against a finial atop the Taj Mahal’s dome, June 2012, author sketch. 357

358

Chapter X: Transits of Venus

Figure 4. William Crabtree observing a transit; mural at the Town Hall in Manchester, England, by Ford Madox Brown (1821–1893). Source: Wikimedia Commons. In 1629, Johannes Kepler predicted a 1631 transit of Venus and estimated the period between transits as 120 years. The first recorded transit observation was in 1639 by Jeremiah Horox and William Crabtree, as shown in the mural of Figure 4. The standard reference for transit dates is Jean Meeus’s tables spanning 6000 years [99]. Espenak [43], who compiled NASA’s website on transits, call Meeus’s work “an indispensable reference for anyone wishing to do transit calculations.” Danloux-Dumesnils [32] calls Meeus’s original tables [100] “une belle étude.” Much of Meeus’s number crunching is based on “the modern planetary theory VSOP87 of the Bureau des Longitudes of Paris” [99, p. 1]. Against this standard, we contrast our results as obtained by using simple harmonic motion. But first we review why the world’s first international scientific project involved two Venus transits during the eighteenth century.

A historical interlude We begin with a puzzle that was finally resolved, more or less, due to approximately seven hours of intense observation each on 6 June 1761 and 3 June 1769, two dates when transits of Venus were predicted to occur. Puzzle 1: A natural question. How far is the Sun from Earth? When and how did we discover the result correct to within a reasonable margin of error? ♢ Definition 2: Astronomical unit. Let 𝐸, 𝑉, and 𝑆 represent Earth, Venus, and the Sun. We say that one astronomical unit, denoted by 1 AU, is the distance 𝑎 between 𝐸 and 𝑆.

A historical interlude

359

R

θ a

E

S

Figure 5. Viewing 𝑆 from 𝐸.

S

screen of the Sun

d

V

D 0.723 AU

T1 E

θ1

C

T2 R θ2

1 AU

a. Lines of sight from 𝐸 through 𝑉.

b. Two transits of 𝑉 across 𝑆.

Figure 6. Projection of Venus onto the Sun with respect to Earth.

Perhaps the first recorded guess (when converted into kilometers) for the value of 𝑎 = 1 AU is due to Aristarchus of the third century BC: about 8 million km. This value was also championed much later by Ptolemy and Copernicus. Some of the reasoning behind obtaining this value involves solar eclipses and the geometry depicted in Figure 5, where 𝑅 = 𝑎 sin 𝜃, 𝑅 is the radius of 𝑆, and 2𝜃 is the angle subtended by 𝑆 as viewed from 𝐸. Johannes Kepler—after studying the geocentric parallax of Mars (half of the angle subtended by Mars as viewed from Earth)—bumped the value of 𝑎 up to at least 24 million km. With the advent of the telescope, the guesses improved. In 1716, after reflecting upon observations he had taken forty years earlier of Mercury transiting the Sun, Edmund Halley realized that similar observations of Venus transiting the Sun could be used to quantify 𝑅, and thus 𝑎, an idea which James Gregory had advanced earlier in 1663. Halley then predicted that 𝑎 was “14 000 semi-diameters of the Earth” or about 90 million km, and outlined a plan to test the theory [65]. To simplify his argument, imagine that the orbits of both 𝑉 and 𝐸 and the equator of 𝑆 lie on a plane 𝑃 through the centers of 𝐸, 𝑆, and 𝑉. Imagine further that 𝐸’s rotation period about its axis is one year, so that 𝐸 always shows the same face toward 𝑆, just as the Moon always shows the same face to 𝐸. Assume that

360

Chapter X: Transits of Venus

the disk of the Sun 𝑆 is a circle.5 As illustrated in Figure 6a, allow two observers on 𝐸, 𝑑 units apart with respect to 𝑃 (one close to the north pole and the other close to the south pole), to observe 𝑉 as it passes between 𝑆 and 𝐸. Recall from the introduction to this chapter that distance 𝑆𝑉 is approximately 0.723 AU. By similar triangles, 𝑑 𝐷 = , 1 − 0.723 0.723 where 𝐷 is the distance between the two lines of sight onto the screen of the Sun, illustrated in Figure 6a. So 𝐷 ≈ 2.61𝑑. As shown in Figure 6b, the transits of 𝑉 as seen by the two observers will appear as two parallel chords 𝑇1 and 𝑇2 that are 𝐷 units apart and are positioned at respective latitudes 𝜃1 and 𝜃2 , where 𝐶 is the center of the Sun. The distance between two chords at latitudes 𝜃1 and 𝜃2 on the unit circle is sin 𝜃1 − sin 𝜃2 . By similar triangles we know that sin 𝜃1 − sin 𝜃2 𝐷 = . 1 𝑅 Thus 𝑅 = 𝐷/(sin 𝜃1 − sin 𝜃2 ), which in turn—since 𝑅 = 𝑎 sin 𝜃 from p. 359— yields the value for 𝑎 in miles. Furthermore, to accommodate the phenomenon of Earth’s rotation, as Halley calculated, if the observers measure the time lapse for 𝑇1 and 𝑇2 to be traced across the face of 𝑆 to the nearest second, then 𝑎 “can be determined to within one part of 500” [65]. But for Halley, the next transit of Venus was forty-five years in the future. Therefore he charged astronomers of two generations hence to do what he could not. “ ‘Even on his death-bed’ whilst holding a glass of wine in his hand, Halley said, ‘I wish that many observations of this phenomenon might be taken by different persons at separate places’ ” [163]. Helen of Troy, Christopher Marlowe (1564–1593) wrote, was a face that launched a thousand ships. For Halley, too, his idea launched quite a few—for the best places to view the transit were the far north and the southern seas. The astronomers who answered Halley’s challenge had two windows of opportunity to observe a transit of Venus, eight years apart. Hundreds of eyes focused on the Sun. Of course, some of the observers encountered cloudy skies or faulty equipment. The first window was 1761, in the midst of the Seven Years’ War (the French and Indian War in the American Colonies). One scientific expedition aboard the HMS Seahorse returned shortly after setting sail, having experienced serious damage—eleven dead and thirty-seven wounded—in a dogfight with a French frigate. Nevertheless, after appropriate repairs, the ship set sail again. Two of its

5 The

difference between the Sun’s polar and equatorial radii is about 10 kilometers.

A historical interlude

361

passengers, Jeremiah Dixon (1733–1779) and Charles Mason (1728–1786), eventually obtained excellent data from Cape Town, South Africa.6 The French astronomer Jean-Baptiste Chappe d’Auteroche (1722–1769) obtained very good data during both windows in 1761 and 1769, the first in Siberia working together with the Russian Academy and the second time along the Gulf of California with Spanish colleagues. Unfortunately, Chappe and most of his team lost their lives shortly after their California observations due to a typhoid epidemic in the community. Another astronomer, Jesuit Maximilian Hell (1720– 1792), was commissioned by the Danes to observe at an island off the northern Norwegian coast; most of his transit day was overcast—except for two brief moments of clarity: both the ingress and the egress of Venus within the Sun. The unlucky yet resourceful Guillaume Le Gentil (1725–1792) of France experienced overcast skies throughout both days in the regions of the Indian Ocean. Perhaps the most famous of the observers was Captain James Cook (1728– 1779) and the crew of the HMS Endeavour. Although their mission was originally commissioned in anticipation of the 1769 Venus transit, they discovered much about life down under.7 As reviewed recently in detail by Teets [154], James Short (1710–1768) analyzed transit data from sites as far afield as South Africa and northern Finland, and published his conclusions in the December 1761 issue of the Philosophical Transactions of the Royal Society that 𝑎 was 93 726 000 miles. Once the data from all of these eighteenth-century observations were consolidated in London and Paris, the scientific world concluded that 𝑎 ≈ 153 ± 1 million km, almost achieving Halley’s hoped-for precision. Years later, after the Venus transits of the nineteenth century, Simon Newcomb (1835–1909) refined the result to 𝑎 ≈ 149.59±0.31 million km. Today, by way of radar echolocation and spacecraft Doppler-telemetry techniques, this value has been refined down to within—as a website of the Ohio State University astronomy department says—the length of a football or soccer field.

6 Because of their fame with the Venus transit, these two astronomers later were engaged to determine the disputed boundary between the Pennsylvania and Maryland Colonies and so established the eponymous Mason-Dixon line, the line that separates the “North” and “South” in the United States. 7 While sailing to the South Pacific via the Horn, they stopped in Rio de Janeiro for supplies. Due to Portuguese/British tensions, no one but the captain was allowed ashore. Greatly vexed, the British botanist (and future president of the Royal Society) Joseph Banks—who could see new and exciting flora on either side of the ship—wrote that he felt “like a Frenchman laying swaddled in linen between two of his Mistresses, both naked and using every possible means to excite desire.” Months later on Tahiti where the team observed the transit, Banks cataloged many new plants and “sampled the pleasures of free love” [163].

362

Chapter X: Transits of Venus

A Venus-Earth-Sun model We assume8 that the orbits of Earth 𝐸 and Venus 𝑉 are circles, where 𝐸’s orbit is the unit circle (with radius 1 AU) and 𝑉’s orbit is a circle with radius 𝜆 ≈ 0.723 AU. Furthermore, we assume that 𝐸’s orbit lies in the 𝑥𝑦 plane with 𝑆 at the origin 𝑂 and that 𝑉’s orbit lies in a plane containing 𝑂 inclined at angle 𝜉 ≈ 3.39∘ to the 𝑥𝑦 plane. We call the line where these orbital planes intersect the nexus line or, according to Meeus [99], the line of nodes. The nexus line in Figure 7 is labeled 𝐵𝐶. A nexus point or node for Venus—𝐹 and 𝐺 in the figure—or for Earth— 𝐵 and 𝐶 in the figure—is where the orbit of 𝑉 or 𝐸 intersects the orbital plane of 𝐸 or 𝑉, respectively. Transits only occur when 𝐸 and 𝑉 are both near 𝐵 and 𝐹, respectively, or both near 𝐶 and 𝐺. The former transit is called a fall transit because in modern times 𝐸 is at 𝐵 in early December; it is also called, according to Meeus, an ascending transit, because as 𝑉’s profile moves across 𝑆 from left to right its trajectory rises. The latter transit is called a spring transit because 𝐸 is at 𝐶 in early June; it is also called a descending transit, because the corresponding trajectory decreases. 𝐸’s and 𝑉’s positions at any time are given respectively by: cos(2𝜋𝑡) 1 0 0 cos(2𝜋𝜔𝑡) cos 𝜉 sin 𝜉 ] [ sin(2𝜋𝜔𝑡) ] , (1) 𝐸(𝑡) = [ sin(2𝜋𝑡) ] and 𝑉(𝑡) = 𝜆 [ 0 0 0 − sin 𝜉 cos 𝜉 0 where 𝜔 is the relative angular velocity of 𝑉 with respect to 𝐸. For simplicity, we initially position 𝑉 and 𝐸 at their spring nexus points. Thus, in our model, spring transits can only occur near integer years, 𝑛, and, by symmetry, fall transits 1 can only occur near half-years, 𝑛 + . The value of 𝜔 for the actual 𝑉 and 𝐸 is 2 𝜔0 = 𝜏𝑒 /𝜏𝑣 ≈ 1.62555. The 3×3 matrix in Equation (1) corresponds to a clockwise rotation by 𝜉 about the 𝑥-axis, so as to be consistent with a descending (spring) transit occurring near nodes (nexus points) 𝐶 and 𝐺, where 𝐶 = (1, 0, 0). A line parametrized by 𝑢 from 𝐸 through 𝑉 at time 𝑡 is 𝑃(𝑢, 𝑡) = (𝑉(𝑡) − 𝐸(𝑡))𝑢 + 𝐸(𝑡).

(2)

To find 𝑉’s projection onto 𝑆 as viewed from 𝐸(𝑡)—an ideal geocentric point in space at 𝐸’s center—we imagine that 𝑆 resides within a rotating plane or screen 𝑆(𝑡) ever perpendicular to 𝐸(𝑡). Figure 7 shows the two orbital planes and 𝑉’s projection onto the screen as viewed from 𝐸.9 The plane 𝑆(𝑡) of 𝑆 can be written as 𝑋 ⋅ 𝐸(𝑡) = 0 (3) 8 A reader unfamiliar with vectors and matrices may wish to read through Appendix II before reading these next two sections. 9 In this chapter and in Chapter XII, we will identify points in ℝ3 with their corresponding position vectors. That is, we will treat them as identical.

A Venus-Earth-Sun model

363 een scr

E’s orbit

C G

V’s

j pro

ect

ion

V’s orbit V(t)

Sun O F

nexus point for V nexus point for E

E(t)

B axis between the orbital planes

Figure 7. The screen of the Sun through the Sun’s center.

where 𝑋 is a general point (𝑥, 𝑦, 𝑧) on the screen. When 𝐸 and 𝑉 are on opposite sides of the screen at time 𝑡—which happens if and only if 𝐸(𝑡) ⋅ 𝑉(𝑡) < 0—we take the projection point of 𝑉 onto the screen as that screen point between the planets. We combine Equations (2) and (3) to find the point 𝑋(𝑡) where the line intersects the plane. That is, equation 𝑃(𝑢, 𝑡) = 𝑋 and Equation (3) yield the following system of four equations with four unknowns 𝑥, 𝑦, 𝑧, and 𝑢, as well as the time variable 𝑡: 𝑥 = (𝜆 cos(2𝜋𝜔𝑡) − cos(2𝜋𝑡))𝑢 + cos(2𝜋𝑡), ⎧ ⎪ 𝑦 = (𝜆 cos 𝜉 sin(2𝜋𝜔𝑡) − sin(2𝜋𝑡))𝑢 + sin(2𝜋𝑡), ⎨ 𝑧 = −𝜆 sin 𝜉 sin(2𝜋𝜔𝑡)𝑢, ⎪ ⎩ 0 = 𝑥 cos(2𝜋𝑡) + 𝑦 sin(2𝜋𝑡).

(4)

ˆ (𝑡) = 𝐸 ˆ (𝑡) where Writing Equation (4) as a matrix equation gives 𝐴𝑋 1 0 ⎡ 0 1 ⎢ 𝐴=⎢ 0 0 ⎢ ⎣ cos(2𝜋𝑡) sin(2𝜋𝑡)

0 cos(2𝜋𝑡) − 𝜆 cos(2𝜋𝜔𝑡) 0 sin(2𝜋𝑡) − 𝜆 cos 𝜉 sin(2𝜋𝜔𝑡) 1 𝜆 sin 𝜉 sin(2𝜋𝜔𝑡) 0 0

⎤ ⎥ ⎥ ⎥ ⎦

(5)

ˆ (𝑡) and 𝐸 ˆ (𝑡) are the vectors (𝑥, 𝑦, 𝑧, 𝑢) and (cos(2𝜋𝑡), sin(2𝜋𝑡), 0, 0), reand 𝑋 spectively. Expanding the determinant along the last row of the matrix and using a definition of the determinant and the identities cos(𝛼 ± 𝛽) = cos 𝛼 cos 𝛽 ∓

364

Chapter X: Transits of Venus

T113.5 −1

1 distances in AU a. A wide screen.

T117.5

T121.5

b. Zooming in near the Sun.

Figure 8. Trajectories of 𝑉’s shadow on the screen of 𝑆. sin 𝛼 sin 𝛽 gives det(𝐴) = −1 + 𝜆(cos(2𝜋𝜔𝑡) cos(2𝜋𝑡) + cos 𝜉 sin(2𝜋𝜔𝑡) sin(2𝜋𝑡)) 𝜆

= −1 + ((1 + cos 𝜉) cos(2𝜋(𝜔 − 1)𝑡) + (1 − cos 𝜉) cos(2𝜋(𝜔 + 1)𝑡)) 2

𝜆

≤ −1 + (|1 + cos 𝜉| + |1 − cos 𝜉|) = −1 + 𝜆 < 0. 2

ˆ (𝑡) = 𝐴−1 𝐸 ˆ (𝑡). Since it would be Because the determinant of 𝐴 is never zero, 𝑋 convenient to see these points of intersection on a stationary screen rather than ˆ (𝑡) clockwise the dynamic plane 𝑆(𝑡), we rotate the first two components of 𝑋 about the 𝑧-axis by 2𝜋𝑡 radians. The result of such a transformation is a set of points whose first three components trace 𝑉’s projection onto the screen of 𝑆. Finally, since the first component of such points will always be 0 and we are uninterested in 𝑢, we project this set of points to obtain their second and third components as ordered pairs, which we index as 𝑊(𝑡) = (𝑊1 (𝑡), 𝑊2 (𝑡)): 0 𝑊(𝑡) = [ 0

1 0 0 1

cos(2𝜋𝑡) sin(2𝜋𝑡) ⎡ 0 ⎢ − sin(2𝜋𝑡) cos(2𝜋𝑡) ] 0 ⎢ 0 0 ⎢ 0 0 ⎣

0 0 ⎤ 0 0 ⎥ −1 ˆ 𝐴 𝐸(𝑡). 1 0 ⎥ ⎥ 0 1 ⎦

(6)

Figure 8a shows the path of 𝑉’s projection onto the screen over 1.5 years. Figure 8b is a close-up of the screen near 𝑆 over a period of about ten years, displaying three arcs of 𝑉’s projection. The arc labeled 𝒯113.5 corresponds to a fall transit near 𝑡 = 113.5 years. The arc 𝒯117.5 corresponds to 𝑉 and 𝐸 being on opposite sides of 𝑆 near 𝑡 = 117.5; as such, we display the disk of 𝑆 in front of this arc. The arc 𝒯121.5 misses the disk of 𝑆.

Conditions for a transit to occur The next proposition gives a necessary condition for a transit of Venus to occur.

Conditions for a transit to occur

365

base of truncated cone

C V

D

S

disk of the Sun

B

E’s orbit

Figure 9. Maximum separation from the nexus for a transit: a cone of possible shadows. Proposition 3: A litmus test.10 Let 𝑊(𝑡) be the projection point of 𝑉 onto the screen of 𝑆 with respect to 𝐸 at time 𝑡, as given by Equation (6). In order for a 1 transit of Venus to occur at time 𝑡 = 𝑛 or 𝑡 = 𝑛 + , 𝐸 and 𝑉 must be on the same 2 side of 𝑆 and 1 ‖𝑊(𝑛)‖ < 0.05 or ‖‖𝑊(𝑛 + )‖‖ < 0.05. (7) 2 Derivation/Proof. In order to find how far from its nexus 𝑉 may wander and still be part of a transit across 𝑆, we project the disk of 𝑆 through 𝑉 out to 𝐸’s orbit, forming a cone as illustrated in Figure 9, which displays the situation where the base of the truncated cone is tangent to 𝐸’s orbit. Let 𝜌 be the radius of this base with center point 𝐷. To approximate where this extreme position for 𝑉 occurs, we linearize the orbits of 𝑉 and 𝐸, and imagine that they proceed along lines perpendicular to the nexus line 𝐵𝐶, as illustrated in Figure 10. That is, we imagine that 𝑉 and 𝐸 proceed along tangent lines to their orbits once they pass the nexus point; such tangent lines are approximately perpendicular to the nexus line. In this figure, we take the distance 𝑆𝐵 as 1 AU. The distances 𝑆𝑉 and 𝑆𝐷 are 𝑘𝜆 and 𝑘, where 𝑘 is a marginally larger-than-1 deformation factor due to linearization. With 𝑠 ≈ 0.00465 AU as the radius of 𝑆, from similar triangles we see that 𝜌 𝑠 , = (8) 𝑘𝜆 𝑘(1 − 𝜆) 10 Code

23 of Appendix III shows how to implement this litmus test with a CAS.

366

Chapter X: Transits of Venus

plan

C

rbit ’s o V f eo

V

kλ S

it or b V’s k(1−λ) h

γ

D ρ

ξ

λ 1−λ

E’s orbit

B

Figure 10. Maximum separation from the nexus for a transit: a linear approximation of orbits. which gives 𝜌 ≈ 0.0178 AU. Furthermore, sin 𝜉 =

𝜌 ℎ

and

tan 𝛾 = ℎ,

(9)

where 𝜉 is the angle between the two orbital planes, 𝛾 is the angle between the nexus line and the line between 𝑆 and 𝑉, and ℎ is distance 𝐵𝐷. By Equations (8) and (9), 𝑠(1 − 𝜆 𝑠(1 − 𝜆) 𝛾 = tan−1 ( ≈ 0.0301, (10) )≈ 𝜆 sin 𝜉 𝜆𝜉 since the arguments of the inverse tangent and sine are so small. Thus, in order to be part of a transit, 𝑉 may wander no further than about 𝜆𝛾 ≈ 0.0218 AU from the nexus. By Equation (10), the lapse of time 𝐿𝑣 for 𝑉 to travel this far from its nexus is 𝑠(1 − 𝜆) (11) 𝐿𝑣 ≈ ≈ 1.08 days. 2𝜋𝜆𝜔0 𝜉 The corresponding maximal time 𝐿𝑒 that 𝐸 may stray from its nexus points and still take part in a transit is 𝐿𝑒 =

𝛾 ≈ 42 hours < 2 days. 2𝜋

(12)

Since the speed at which a transit is traced across 𝑆 is bounded by 10.34 AU/year as indicated by the graph of ‖𝑊 ′ (𝑡)‖ in Figure 11, ‖𝑊 ′ (𝑡)‖ < 10.34 AU/year ≈ 0.0284 AU/day

(13)

for all 𝑡. Let 𝑡0 be a medial transit time, a time of a spring transit near integer time 1 𝑛 or of a fall transit near half-year time 𝑛 + where 𝑊1 (𝑡0 ) = 0. Since the time 1

2

between 𝑡0 and either 𝑛 or 𝑛+ must be at most about 42 hours by Equation (12), 2

Conditions for a transit to occur

367

speed in AU/yr transit occurs here V and E on opposite sides of S 4 2 0.0

1.0 time t, in years

2.0

Figure 11. Speed, ‖𝑊 ′ (𝑡)‖, of 𝑉’s shadow across the screen of 𝑆. Table 3. Years and half-years at which the spring and fall transits occur; twin transits are paired with the dominant twin underlined. {0, 113.5, 227, (340.5, 348.5), (454, 462), 575.5, 689. 802.5, 916, 1029.5, (1143, 1151), (1256.5, 1264.5), 1378, 1491.5, 1605, 1718.5, 1832, (1945.5, 1953.5) } 1

the most that ‖𝑊(𝑛)‖ or ‖𝑊(𝑛 + )‖ can differ from ‖𝑊(𝑡0 )‖ is approximately 2

(0.0280 AU/day)(42 hours) ≈ 0.0496 AU by Equation (13). Since |𝑊2 (𝑡0 )| < 𝑠, 0.05 AU is about the most that ‖𝑊(𝑛)‖ or 1 ‖𝑊(𝑛 + )‖ can be. Therefore, our litmus test to determine if integer year 𝑛 or 2

1

half-year 𝑛 + is a promising one for a transit is for 𝑉 and 𝐸 to be on the same 2 side of 𝑆 and for 1 ‖ ‖ ‖𝑊(𝑛)‖ < 0.05 or ‖𝑊(𝑛 + )‖ < 0.05. ♢ 2 Applying Equation (7) to the integers 0 to 2000 with 𝜔 = 𝜔0 , we find the promising years of Table 3.11 We may consider the half-year dates to be fall transits and the whole-year dates to be spring transits. Double-checking the dates in Table 3 by graphing the arc 𝑊(𝑡) against the disk of 𝑆 verifies that each of the years or half-years corresponds to a spring or fall transit, respectively, and are the only transits during this 2000-year period in our 11 In

Code 23 of Appendix III, we show how to implement the litmus test of Proposition 3.

368

Chapter X: Transits of Venus

dominant tw

in T462

T454 Figure 12. A twin pair of descending spring transits.

model. As can be seen, the familiar differences 8, 105.5, and 113.5 between successive transit times appear—good news for our model. The entries in the table eight years apart have been grouped as ordered pairs; their associated transits are called twins or doubles. For example, spring transits occur in our model in both year 454 and year 462. For a twin transit, we say that the transit member whose path across 𝑆 comes closer to 𝑆’s center is the dominant transit of the two. In Tables 3, 4, and 5 dominant twins are underlined. As can be seen in Figure 12, and as indicated in Tables 3 and 4, 𝒯462 is the dominant member of the twin transit (𝒯454 , 𝒯462 ). If a transit has no twin, it is a singleton transit. 𝒯227 is a singleton. In due course, we show how to modify our model to simulate actual transit dates.

Recognizing the pattern To find a natural transit period, we focus on spring transits for a season. From Table 1 we drop the fall transit dates and are left with Table 2. When we refer to the spring transit year 𝑛𝑗 from the table, where 𝑗 ≥ 0, we mean term 𝑗 in row 2 or the dominant transit year if the term is a twin. For example, 𝑛2 = 462 as evidenced by Figure 12. Observe that the first eight spring transits constitute a complete residue set modulo 8. Furthermore, 𝑛𝑗 mod 8 just happens to be 3𝑗 mod 8, which suggests that the relative motion of the planets induces a shuffling of the transit year residues modulo eight. We thus refer to 3 as a shuffling factor. Table 4. Spring transits. 𝑗 transit year 𝑛𝑗 𝑛𝑗 mod 8 3𝑗 mod 8

0 1 2 0 227 (454, 462) 0 3 6 0 3 6

3 4 5 689 916 (1143, 1151) 1 4 7 1 4 7

6 7 1378 1605 2 5 2 5

8 1832 0 0

Recognizing the pattern

369

Sun −16

−8

0

8

16

Figure 13. 𝑉’s projection given by 𝑊(𝑡) near 𝑡 = −16, −8, 0, 8, 16. To help understand this eight-fold dynamic, observe that every eight years 𝐸 and 𝑉 pass each other not far from where they had passed each other eight years before, with 𝑉 a bit further ahead of 𝐸 each time. We say that the arc given by 𝑊(𝑛 years ± 1 week) is rung 𝑛 in a ladder of arcs. As the years go by, these rungs step monotonically upwards (or downwards) to a climax before reversing their progression, with rung 8𝑛 being slightly above or below rung 8(𝑛 + 1) for all integers 𝑛. Near the spring transit years, neighboring rungs are separated by a distance somewhat more than the radius of 𝑆, as illustrated in Figures 8b, 12, and 13; the dots in Figure 13 represent 𝑉’s projection at 𝑡 = −16, −8, 0, 8, 16 years. With 𝑞 = 8, the approximate distance 𝑑(𝑞) between neighboring rungs near transit years is the distance between 𝑊(𝑞) and its projection onto 𝑊(0+ ), where we take 0+ as one hour: 𝑊(𝑞) ⋅ 𝑊(0+ ) ‖ ‖ 𝑊(0+ )‖ ≈ 0.00672 AU. 𝑑(𝑞) = ‖𝑊(𝑞) − ‖ ‖ 𝑊(0+ ) ⋅ 𝑊(0+ )

(14)

Since 𝑠 < 𝑑(𝑞) < 2𝑠, a sequence of at most two successive rungs may cross the face of 𝑆, whereas if a rung crosses near the center of 𝑆, then only one rung in that succession of rungs can correspond to a transit. When we extend the data given in Table 2 to all integers, the data points seem to sort themselves. Plotting {(𝑛, 𝑊1 (𝑛))}𝑛≥0 corresponding to the times when 𝐸 is at its spring nexus point shows a hodge-podge of dots across one hundred years in Figure 14a. Yet, when we look at a longer period of time, the trend is clear. Figure 14b displays the data across 2000 years. It appears as if 𝑉’s projection when sampled at 𝐸’s spring nexus point lies on one of eight branches through the data, which all appear to be uniformly spaced translates of one another. By Equations (5) and (6), finding the periodicity within {(𝑛, 𝑊1 (𝑛))}𝑛≥0 is equivalent to finding the periodicity present within 𝒮𝜔 = {(𝑛, sin(2𝜋𝜔𝑛))}𝑛≥0 — which is our familiar signature of 𝜔 from Chapter VII—as 𝑛 ranges over integer values. Figure 15 shows that when restricted to the years 8𝑛 where 𝑛 is an integer—and when adjacent points are connected by line segments—the two curves display the same periodicity for 𝜔 = 𝜔0 . The curves appear to have a root near 𝑡 ≈ 917, but no spring transit occurs at either 912 = 8(114) or 920 = 8(115) years because in our model 𝑉 and 𝐸 are on opposite sides of 𝑆 at both times. However, near the next root 𝑡 ≈ 1834, a transit occurs at 𝑛 = 1832 = 8(229) years but not at 1840 years, because 𝑉’s projection falls just outside 𝑆’s disk in that year.

370

Chapter X: Transits of Venus

1.0

AU

AU 1.0

20

years

years

60

−1.0

−1.0

a. A hodge-podge of dots.

1000 1500

500

b. A better perspective.

Figure 14. Horizontal component of 𝑉’s projection at 𝐸’s spring nexus over time.

AU 1.0

(8n, W1(8n)) (8n, sin(2πσ(8n))

500

years

1500

−1.0 Figure 15. Paths through 𝑊1 (𝑡) and sin(2𝜋𝜔𝑡) when 𝑡 = 8𝑛 years and 𝜔 = 𝜔0 . The nearest integer continued fraction algorithm 𝑍 for 𝜔 ≈ 1.62555 gives 𝜔 ≈ 1.62555 ≈ [2; −3, −3, 28, 29]𝑍 ⟶ {2, From Chapter VII, using the second convergent of 𝑊1 is approximately 𝑇=

2𝜋 𝜔−

13

13 8

5 13 369 , , }. 3 8 227

(15)

, we know that the period

≈ 1834.29 years

8

and that branch 1 is a 𝛽 = 𝑇/8 ≈ 229.29 year translate of branch 0. To verify the fourth row of Table 4, that 3 is the shuffling factor, observe by Proposition VII.21 that the points on branch 1 correspond to integers of the form 8𝑛 + 𝑟 where 0 < 𝑟 < 8. Since 13/8 < 𝜔, to find 𝑟 we solve Equation (VII.6),

Recognizing the pattern

371

T6655

T6647 Figure 16. Spring transit near 3𝑇 + 5𝛽. namely 13𝑟 = −1 mod 8, giving 𝑟 = 3. Thus the points on branch 2 correspond to integers of the form 8𝑛 + 2𝑟, or 8𝑛 + 6, and so on, replicating the fourth row. As 𝜔 varies within 13 1169 1119 , ℬ6 ( ) = ( (16) ) ≈ (1.62264, 1.62645), 8 720 688 an order-six dominance domain as defined in Chapter IX, the shuffling factor 𝑟 13 13 for 𝒮𝜔 is 𝑟 = 3 when 𝜔 > and is 𝑟 = −3 mod 8 = 5 when 𝜔 < . Outside this 8 8 domain, 𝒮𝜔 may not display an eight-fold branch structure. 𝑝 For a given 𝒮𝜔 , knowing the branch number 𝑞, the fraction approximating 𝑞

𝜔, the period 𝑇, the translate distance 𝛽 of its first branch, and the shuffling factor 𝑟, we can predict when transits occur. Before reading the proof of this next proposition, the reader may wish to review the material from Chapter VII. Proposition 4: The transit rule. Let 𝑘, 𝑛, and 𝑗 be integers, 0 ≤ 𝑗 < 𝑞. If time 𝑡 = 0 is a spring transit, then another spring transit occurs at integer year 𝑚 near time 𝑘|𝑇| + |𝛽|𝑗 if and only if 𝑚 = 𝑞𝑛 + (𝑗𝑟 mod 𝑞) and 𝑚 is no further from 𝑘|𝑇| + |𝛽|𝑗 than from either 𝑚 − 𝑞 or 𝑚 + 𝑞. If either 𝑚 − 𝑞 or 𝑚 + 𝑞 is a transit year as well, then 𝑚 is the dominant member of the twin. To ascertain whether 𝑚 ± 𝑞 is also a spring transit, utilize the decision rule of Equation (7). Proof. From Proposition VII.14, an integer 𝑚 in ℕ is on branch 1 of 𝑞 branches if and only if 𝑚 = 𝑞𝑘 + 𝑟 where 𝑘, 𝑟 ∈ ℕ, 𝑟 satisfies the equation 𝑝𝑟 ≡ −𝜖 mod 𝑞, 𝑝 is the fraction approximating 𝜔, 𝜖 = sgn(𝑞𝜔 − 𝑝), and 0 < 𝑟 < 𝑞. Thus 𝑚 is on 𝑞

branch 𝑗 if and only if 𝑚 = 𝑞𝑘 + (𝑗𝑟 mod 𝑞). Thus 𝑘|𝑇| + |𝛽|𝑗 lies either between 𝑚 and 𝑚 + 𝑞 or between 𝑚 and 𝑚 − 𝑞. In either case 𝑚 is at least as close to 𝑘|𝑇| + |𝛽|𝑗 as is either of 𝑚 ± 𝑞. Example 5: An application of the transit rule. We illustrate the transit rule for 𝜔 ≈ 1.62555. Let 𝑘 = 3 and 𝑗 = 5. Since 3𝑗 mod 8 = 7, we want to find

372

Chapter X: Transits of Venus

1.0

−1.0

T7028 T7019 T7010 T7001 T6992 T6983 T6974

8000

2000

a. Sω, ω = 11√2 .

b. A transit family of septuplets, ω = 11√2 .

10

10

Figure 17. A transit with 𝜔 other than 𝜔0 . the transit year 𝑚 = 8𝑛 + 7 which is closest to 𝑘|𝑇| + 𝑗|𝛽| ≈ 6649.3. Then 𝑚 = 8(830) + 7 = 6647 while 𝑚 + 8 = 8(831) + 7 = 6655. So year 6647 is a singleton transit, while year 6655 is a near-miss as shown in Figure 16. ♢ As for fall transits, a similar rule applies, except that the eight branches through 1 the data corresponding to time 𝑛 + are 2

1 𝑦𝑗 = sin(𝛼(𝑡 − 𝛽(𝑗 + ))). 2 11√2

Example 6: Another application of the transit rule. Let 𝜔 = ≈ 1.55563. 10 The graph of the signature 𝒮𝜔 of 𝜔 shows that it has 𝑞 = 9 branches; see Figure 17a. The nearest integer continued fraction algorithm 𝑍 gives 𝜔=

11√2 3 14 2181 ≈ [2; −2, 4, −156]𝑍 → {2, , , }. 10 2 9 1402

The second convergent

14 9

of 𝜔 shows that 𝑝 = 14, and the third convergent

shows that 𝛽 should be near 1402. The period of 𝒮𝜔 is 𝑇 =

1 𝜔−

14 9

2181 1402

≈ 12600.3

years, so 𝛽 = 𝑇/9 ≈ 1400.03. And 𝜖 = sgn(𝑇) = 1. Solving 𝑝𝑟 = −𝜖 mod 𝑞, namely 14𝑟 = −1 mod 9, gives 𝑟 = 7, the shuffling factor. Now let 𝑘 = 0 and 𝑗 = 5, which means we are looking for a transit year with residue 𝑗𝑟 mod 9 ≡ 8 near time 5𝛽 = 5𝑇/9 ≈ 7000.17. Thus, 𝑚 = (777)(9) + 8 = 7001 is a transit year. With this new value of 𝜔, 𝑉 has receded from 𝑆, so the distance 𝑑(9) between the rungs has changed to 𝑑(9) ≈ 0.0014 by Equation (14), which means that we have more than twin transits; in fact we have septuplets, as shown in Figure 17b. ♢

A reality check

373

Y linear model Z approximation of the June 2012 transit actual June 2012 transit path

Figure 18. Hunting for a phase angle 𝛿.

A reality check How does our model compare to reality? A phenomenon omitted thus far from our transit model is the tendency for objects to rotate—including the orbital planes of 𝑉 and 𝐸, a feature called precession. The values 𝜏𝑒 and 𝜏𝑣 used to define 𝜔0 are the periods of the two planets with respect to the background of the fixed stars. To adapt our model appropriately, we must incorporate slightly different periods, namely, the time it takes for a planet to return to its aphelion, a point in the planet’s orbit furthest from the Sun. Since 𝐸 precesses faster than 𝑉, as time goes on the nexus line rotates and hence spring and fall transits occur later in the year. Because precession rates are tiny compared to 𝜔0 , we arbitrarily take 𝜔0 ≈ 1.625550000 as computed on page 362. Meeus [99, p. 13] predicts that “an almost exactly central transit will take place on 11 July 5900”—a transit through 𝑆’s center. Thus from 2012 to 5900, the spring transit has now become a summer transit, having slipped forward by about 35 days over a lapse of 3888 years, which implies that the change in the relative orbital speeds of 𝑉 and 𝐸 with respect to the nexus line 35𝜔0 is Δ𝜔 ≈ ≈ 0.0000397559; this means that we might try the new angular 3888𝜏𝑒

velocity 𝜔1 = 𝜔0 − Δ𝜔 ≈ 1.625510244. Next, we need a phase shift 𝛿 to start our model. From [99, p. 48], the transit of 6 June 2012 crossed 𝑆’s boundary at 𝑌 ≈ 39.45∘ and at 𝑍 ≈ 291.4∘ measured counterclockwise from the top of 𝑆, shown as a dashed line in Figure 18. Adjusting Equations (1) and (5) so that the trigonometric arguments 2𝜋𝜔𝑡 are replaced by 2𝜋𝜔(𝑡+𝛿), where 𝛿 is an indeterminate phase shift, and using a search method to find 𝛿 by dynamically plotting 𝑊(𝑡 − 2012) near 𝑡 = 2012 yields the solid-line transit in Figure 18, suggesting that 𝛿 ≈ 0.00102 is a good match. The reason that the two transit lines are non-parallel is because 𝐸’s and 𝑉’s actual orbits have

374

Chapter X: Transits of Venus

positive eccentricity. When we apply Equation (7) in this adjusted model for the years from 700 to 3000 ad, we find the promising spring transit Gregorian year possibilities shown in Table 5. The underlined years indicate a match between our results and Meeus’s. Not bad for a linear model. But can we do better? Table 5. The linear model versus Meeus’s model. Linear model

{

(781, 789) (2004, 2012)

Meeus’s model

{

(789, 797) (2004, 2012)

(1024, 1032) 2255) (1032, 1040) (2247, 2255)

1275 2498 (1275, 1283) (2490, 2498)

1518 2741

(1761, 1769) (2984, 2992)

(1518, 1526) (2733, 2741)

(1761, 1769) (2976, 2984)

To do so, we work backwards through the transit rule and find a magic an13 gular velocity. Since 𝜔1 is within the dominance domain of and is less than 8

13

as evidenced by (16), the corresponding shuffling factor is 𝑟 = 3. We make 8 use of a second unusual spring transit year: 183 bc, whose corresponding transit Meeus describes as “almost central.” The difference between 5900 ad and 183 bc is 6083 years. Identify 𝑡 = 0 with year 5900. Thus, year 183 bc is referenced by 𝑡 = −6083 = 8(−761) + 5, which means that 5 ≡ 3𝑗 mod 8, whose solution 1 is 𝑗 = 7. Using the angular velocity 𝜔1 gives the associated period 𝑇1 = 13 ≈ 𝜔1 −

1959.85. We then solve 𝑘𝑇1 + 7

7𝑇1 8

𝑘 = −4 and solve (𝑘 + )𝑇2 = −6083, obtaining 𝑇2 = 8

angular velocity 𝜔2 satisfies 𝑇2 =

8

= −6083, getting 𝑘 ≈ −3.98. Next, reset 𝑘 as 1 𝜔2 −

13 8

48664 25

. Since our adjusted

,

13 25 13 9888 1 + = + = ≈ 1.6255137267795495644. 𝑇2 8 48664 8 6083 When we generate transits by the transit rule using angular velocity 𝜔2 across the years 2000 bc to 4000 ad, we get an exact match with actual spring transits from Meeus’s results, as shown in Table 6. As can be seen, the difference between successive entries in Table 6 is 243 years except when passing from 2733 to 2984, the year marked with an asterisk. The match between the results of these two approaches with respect to the recessive partner in twin transits is less spectacular. 𝜔2 =

Table 6. Spring transit years, generated by the transit rule. 1884 bc 546 2984∗

1641 bc 789 3227

1398 bc 1032 3470

1155 bc 1275 3713

912 bc 1518 3956

669 bc 1761 4199

426 bc 2004 4442

183 bc 2247 4685

60 2490 4928

303 2733 5171

A ﬁnal thought

375

An easier way to determine when transits occur The denominator of 𝑍’s third convergent for 𝜔0 , Venus’s relative angular velocity, is 227, as given by Equation (15). The data of Table 6 clearly shows that 227 years is not the time lapse between successive Venus spring transits. To account for the precession of planetary orbits so as to adjust appropriately the guess of 227 years, we must nudge 𝜔0 slightly. Rather than interpolate using ideal central transit data as we did above, an alternate way to find the time lapse between Venus spring transits is to take advantage of the recursive nature of continued 5 𝑎 13 𝑐 fractions. The first and second convergents for 𝜔0 are = and = . As we 𝑑 3 𝑏 8 saw in Chapter IX, in general the third convergent must be of the form 5 + 13𝑛 , 𝑓(𝑛) = 3 + 8𝑛 where 𝑛 is some integer. Observe that 𝑓(28) = 227. To account for the modest impact of precession on 𝜔0 , a better third convergent for a more accurate angular velocity should be 𝑓(𝑚) for some integer 𝑚 near 28. Table 7. Possible time lapses between spring transits. 𝑛 𝑓(𝑛) lapse

27

28

29

30

356

369

382

395

219

227

235

243

219 227

235 243

From Table 7, we see that the time lapse should be one of 219, 235, or 243 years. Checking NASA’s tables to confirm if any of these are correct, we find the denominator of 𝑓(30) to be 𝑞 = 243 years. And indeed, the next June transit of Venus is 243 years from 2004, namely the year 2247. Furthermore, 𝑍’s third 395 convergent for 𝜔2 is . By symmetry, the time lapse between successive transits 243 of Venus, spring to fall, or fall to spring, should be half of this period, namely 121.5 years. Of course, because the relative angular velocity of Venus lies within the 13 periodicity domain of and twin transits often occur, we sometimes must adjust 8 121.5 years by eight years more or less. In particular, the last Venus transit was a twin transit in June of 2004 and 2012; the next fall transit will occur in December of 2117. The time lapse between June 2004 and December 2117 is 113.5 years, which is eight years less than 121.5 years.

A ﬁnal thought What we have shown is that the cycle of transits is the way it is because 𝑉’s angu13 lar velocity 𝜔0 is enmeshed within the dominance domain of . This dominance 8

376

Chapter X: Transits of Venus

in turn induces a modulo-eight shuffling of successive transit years by a factor of three, a phenomenon reflected in the 6000-year standard tables of transits generated by Meeus [99]—provided we partition transits into two families, spring transits and fall transits, and discard one of the years from each twin transit. With respect to permanence, in the life cycle of 𝑆, 𝑆 slowly loses mass and swells to giant status and so the orbits of the planets recede from 𝑆, which means that the transit cycle for 𝑉 may change dramatically. The rational numbers with 13 small integer denominators near in increasing order are 8

3 11 8 29 21 13 31 18 23 28 33 5 7 , , , , , , , , , , , }, { , 2 7 5 18 13 8 19 11 14 17 20 3 4 a portion of the Farey series ℱ20 from Chapter IV. Eons from now, the natural periodicity of the Venus transit may change from 8 to 13 or 19. Hopefully people will still be here to see. Exercises 1. Assume that the masses of Earth, the Moon, and the Sun will remain the same indefinitely, and that the Moon recedes from Earth at 3.5 cm/year, while Earth recedes from the Sun at 15 cm/year. Estimate the time, if it exists, when twelve moons is exactly one year. 7

7

2. Generate a table much like Table 2 for 12 + and 12 + moons per year. De17 19 scribe any qualitative differences in the distribution of window radii between the two tables. 3. (a) Imagine that two observers view a transit of Venus at Earth’s two poles. Assume they are 𝑑 = 8000 miles apart. How far apart, 𝐷 miles, do the transits across the Sun appear to be? (b) Estimate the number of widths of length 𝐷 needed to form a radius of the Sun. (The Sun’s approximate radius is 432 000 miles.) (c) Now estimate the length of an astronomical unit. Assume that the Sun subtends an angle of 0.5∘ as viewed from Earth. 4. (a) Use the transit rule to find the date of the transit when 𝑘 = 3 and 𝑗 = 6 for 𝜔0 . (b) Repeat the calculation for 𝜔2 , where 𝜔2 is defined on p. 374. (c) Repeat the calculation for 𝜔 = 1.5277121212 … . 5. (a) Find a number 𝜔 for which 𝒮𝜔 has five branches and for which the shuffling factor is three. (b) Find a number 𝜔 such that 𝒮𝜔 has thirteen branches each of which has period 𝑇 with ⌊|𝑇|⌋ ≈ 3000.

Exercises

377 𝑝

6. (a) Experiment with dominance domains ℬ𝑛 ( ), and determine, if possible, 𝑞

a rule of thumb to use to find the least integer 𝑛 for a given fraction

𝑝 𝑞

such that 𝒮𝜔 appears to have 𝑞 branches for every 𝜔 in the domain. For 𝑝 13 example, as in (16), for = , the answer appears to be 𝑛 = 6. 𝑞

8

(b) In order to recognize the number of branches 𝑞 for the signature 𝒮𝜔 for a given irrational number 𝜔, one period 𝑇 of any of its branches must contain a sufficient number of data points. Estimate in general how many data points on a branch are needed to recognize the connected sequence of these data points as a sine curve. For example, four points is too few, and one hundred points is more than enough. 7. (a) In Equation (5) for the matrix 𝐴, replace each occurrence of 2𝜋𝜔𝑡 with 2𝜋𝜔(𝑡 + 𝛿) where 𝛿 = 0.1 years. When will the next spring transit of Venus occur? (b) Generalize part (a) for any such phase shift 𝛿. 8. (a) Using the ideas of this chapter, generate a table analogous to Table 6 for Venus’s fall transits. (b) Determine the periodicity of Mercury transits. 9. (a) How often would a Martian colony experience an Earth transit? A Martian year is about 668.98 Earth days, and Mars’s orbital inclination with respect to Earth’s orbital plane is 1.850∘ . (b) On the screen of the Sun, how far apart do Earth’s successive transit rungs appear to be? (c) Determine the analog of Equation (7) for Earth transits with respect to Mars. 10. With respect to an Earth transit as viewed from Mars, the relative angular velocity of Earth is 𝜔 ≈ 668.98/365.26 ≈ 1.83152. (a) Generate the first few convergents for 𝜔. (b) How many branches does 𝒮𝜔 appear to have? (c) Estimate the period of these branches. (d) As with Earth and Venus, the relative angular velocity of Earth and Mars is affected by precession. Produce a table analogous to Table 7 listing some possible time lapses between successive Earth transits of the same family as viewed from Mars. By family, we mean a set of transits occurring in approximately the same part of the year; for example Venus has two families of transits: spring transits and fall transits.

378

Chapter X: Transits of Venus

(e) Mars has two Moons, Phobos and Deimos, whose respective distances from Mars and orbital periods about Mars are 9377 km, 23436 km, 0.31891 Martian days, and 1.26244 Martian days. The polar radius of Mars is 3376.2 km. How often does Phobos lap Deimos? How often will an observer on Deimos see Phobos move across the face of Mars? With respect to the orbital plane of Mars, Phobos’s orbital plane has an inclination of 1.08∘ and Deimos’s orbital plane has an inclination of 1.79∘ .

Strand XI: Meton of Athens Meton, an Athenian astronomer, championed the adoption of a nineteen-year calendar in about 432 bc. He is considered to be the first “scientific Greek astronomer” because he was the first Greek to use instruments when observing. Apparently he was the first to set up a sundial in Athens. He also was skilled in devising water-supply to various city-states. Meton’s observatory was located atop the steps adjacent to the old Athenian Assembly meeting place called the Pnyx, as shown in Figure 1.

Figure 1. A long eastern ridgeline from Meton’s observatory, above the old steps on the lower right, with the Acropolis in the upper left, author sketch. From his observatory, Meton tracked the position of each sunrise against a long ridgeline. After several years of gathering data, he determined that the aver5 age length of a year was about 365 days, although this value is about 𝛿 ≈ 30.2 19 minutes longer than a tropical year—the length of time between successive summer solstices.1 Even though such an error grows to 19𝛿 ≈ 9.56 hours in nineteen 1 The tropical year is about 20.45 minutes shorter than a sidereal year—the length of time for Earth to return to where it had been before with respect to the fixed stars.

379

380

Strand XI: Meton of Athens

years, Meton loved this result because the Moon’s phases cycle with approximate period 29.5 days, a length known as a moon, a lunar month, or a lunation, and nineteen years of lunar months is almost exactly 235 moons. In nineteen years, the lunar and solar methods of tracking time nearly coincide. Why not adopt such a calendar? he proposed. In his day, each city-state had their own peculiar way of tracking time, such as so many moons or years after the last olympic games, or so many moons or years after a ruler gained power. To be sure, the Greek world somewhat acquiesced to Meton’s idea, but apparently no city-state ever officially adopted this calendar. As the years slipped by, about one hundred years after Meton, Callippus (circa 370–300 bc) proposed a modified calendar of four Metonic cycles one of which is lessened by a day, making a calendar of 76 years. Astronomers continued using this cycle at least until 46 bc when Julius Caesar approved the Julian Calendar, a modification of the Egyptian calendar, for the Roman Empire. What else is known about Meton? The Roman historian Plutarch mentions Meton in passing in one of his Parallel Lives contrasting Greek and Roman heroes. In his Life of Nicias, an Athenian leader, Plutarch takes us to the year 412 bc. Should the Athenian fleet attack Sicily? Gods and oracles are consulted. Omens are read. For several days, ravens peck at a statue of Pallas-Athena atop a bronze palm tree, so felling its fruits of gold to the ground—a bad sign. Then it was either because he feared such signs as these, or because, from mere human calculation, he was alarmed about the expedition, that the astrologer Meton, who had been given a certain station of command, pretended to be mad and set his house on fire. Some, however, tell the story in this way: Meton made no pretence of madness, but burned his house down in the night, and then came forward publicly in great dejection and begged his fellow citizens, in view of the great calamity which had befallen him, to release from the expedition his son, who was about to sail for Sicily in command of a trireme. Meton also appears in a more light-hearted role in Aristophanes’ comedy The Birds, which debuted two years earlier, in 414 bc. Briefly, the birds decide to build themselves a city, midway in the air between the land below and the gods above, called Cloud Cuckoo Land. A succession of men volunteer to help, yet all of them are soundly thrashed and driven from the domain. The first is a poet hoping to be hired to dispense fine-sounding words. The second is a prophet with omens for sale. The third is Meton, a civil engineer, who offers to design their city. The fourth is a government official with lists of regulations. The fifth is a lawyer with a bunch of laws. Then the gods volunteer to help, but they too are outwitted and banished. And so the birds establish a utopia by and for themselves along with anyone who chooses to become a bird-brain.

Strand XI: Meton of Athens

381

Figure 2. Meton on his way to Cloud Cuckoo Land, sketch by author. The mast and spar also serve as a large wooden compass and straight-edge. When the play first opened Meton was probably in the audience. He is the only Athenian singled out for laughter by name. Perhaps the following translated excerpt from The Birds is but a simple roast of a celebrity in their midst. This passage is the entirety of Meton’s dialogue with Pithetaerus, an eminent Cloud Cuckoo Land official whose name means Trusted Friend. Meton: I have come in person … Pithetaerus: (to himself ) Here’s another pest. Meton: I propose to subject the atmosphere to geometrical measure, and divide it in acre lots. Pithetaerus: In God’s name, who are you? Meton: Who am I? Sir, I am Meton, known throughout Greece. Pithetaerus: Tell me, though—pointing to Meton’s instruments—what are those things? Meton: Aerial measuring rods. To demonstrate—since the atmosphere in shape is like a domed pot cover, I place this curvilinear ruler above, insert a compass ... You follow? Pithetaerus: Not a bit!

382

Strand XI: Meton of Athens

Meton: Applying a straight-edge ruler, I take successive measurements until your circle becomes a square whose center is a market-place, and towards that center—just as a star which, although circular, radiates rays of light— run roads from all directions. Meton hands Pithetaerus the large wooden compass. Pithetaerus: What a Thales2 you are! … pause … Meton! Meton: Yes? Pithetaerus: Let me give you some advice. Step off the road a little. Meton: Why, what’s wrong? Pithetaerus: Among us rages an epidemic of xenophobia, a harmony of hearts for exiling all humbugs. Pithetaerus proceeds to beat Meton with the compass. Meton: Oh, mercy! Help! Pithetaerus: Ah, well, what did I say? Now, please, geometrize yourself away.

2 Thales (circa 624–546 bc) was one of the Seven Sages of Greece. Aristotle regarded him as the first true Greek philosopher. His mathematical discoveries included the use of geometrical reasoning to estimate how far ships were from shore. He is reputed to be the first to try to explain physical phenomena without reference to the gods.

Chapter XI: Lunar Rhythms Hooray! It’s Hanukkah!3 It’s Easter!4 It’s Ramadan!5 It’s Diwali!6 It’s Durin’s Day!7 It’s Tét!8 When next will such lunar holidays arrive on the same date with respect to the seasons and, specifically, the Gregorian calendar? From Puzzle VII.27, we saw that the phases of the Moon cycle with period nineteen years. Is this the best period? As an application of continued fractions, this chapter attempts to give an answer. One difficulty with characterizing the Moon’s motion is that it involves the three-body problem. After Isaac Newton derived Kepler’s laws from first principles assuming an inverse square law of gravitation, he focused on Earth, the Moon, and the Sun, so as to determine where the Moon would be at any time and ultimately gave up, saying to Edmund Halley that the three-body problem had “made his head ache, and kept him awake so often, that he would think of it no more.” [153, p. 160] Although the mean time it takes the Moon to complete one circuit of Earth with respect to the Sun is about 29.53 days (the synodic period), the exact time varies up to about 7 hours from this mean. Long ago in 1178, Moses Maimonides, a renowned medieval Jewish scholar who among many other things had been a physician to Saladin, said, “The Sun knows its way, the Moon does not.” Indeed, in 1887, Henri Poincaré showed the futility of searching for an analytic lunar cycle formula, that the very pattern is one of chaos. Of course, we can extend our predictions of Earth’s and the Moon’s positions to a reasonable degree of accuracy arbitrarily far into the future and past using dynamical simulations. In [44], Fred 3 A Jewish holiday starting on the 25th day of the lunar month Kislev, where the beginning of each month is a new moon. 4 A Christian holiday on the first Sunday after the full moon following the spring equinox. 5 A month-long Muslim fasting holiday starting with the new moon that initiates the lunar month of Ramadan. 6 A Hindu holiday whose zenith is the new moon between mid-October and mid-November. 7 A holiday of the fictional realm of Middle Earth starting on the first day of the last moon of autumn [156, p. 96]. 8 Also known as the Chinese New Year, which is usually the second new moon after the winter solstice.

383

384

Chapter XI: Lunar Rhythms

Figure 3. Phases of the Moon, courtesy of NASA. Espenak, a NASA astronomer who specialized in predicting eclipse dates, logged the dates of the four quarters of the Moon over a 6000-year period. Keeping in mind Poincaré’s observation, we proceed with caution and use the simple harmonic motion Earth-Moon-Sun model of Equation (VII.14). In this model, the Moon at time 𝑡, with respect to a fixed Earth at the origin and a fixed Sun along the positive 𝑥-axis, is at (cos 2𝜋𝜔𝑡, sin 2𝜋𝜔𝑡) where 𝜔 is the relative angular velocity, 𝜔 ≈ 12.368747 cycles per year, and 𝑡 is in years. Let 𝑤(𝑡) = sin 2𝜋𝜔𝑡, the second component of the Moon’s position in our system, a measure of how far the Moon is from being new or full at time 𝑡.

Predicting the time lapse between successive new moons A plot of 𝜔’s signature, 𝒮𝜔 , displays nineteen branches, corresponding to 𝜔’s har235 . From Chapter VII, the shift between successive branches monic convergent 19 should be the denominator of 𝜔’s next harmonic convergent. Figure 4 is a graph of 𝒮𝜔 along with a zoomed-in window near the root of branch 1 (of nineteen branches). As can be seen, this shift distance is 160. Applying the harmonic algorithm 𝐻 or the nearest integer continued fraction algorithm 𝑍 to 𝜔 generates the convergents

Predicting the time lapse between successive new moons

385

1

500

2000

t

inset 158

160

162

−1

Figure 4. A connected branch and inset graph of the Moon’s signature, 𝒮𝜔 .

136 235 1979 37 → → → . 3 11 19 160 This information suggests that the phases of the Moon should approximately repeat themselves according to the Earth-Moon-Sun model every nineteen years, just as Meton observed, and also every 160 years. Which one is better? As a clue, the roots of sin 2𝜋𝜔𝑡 nearest 19 years and 160 years are, respectively, 4.38 hours less than 19 years and 0.34 hours more than 160 years. On this basis, 160 appears to be a better time lapse than 19 when minimizing the average time lapse between new moons modulo the nearest integer in years of the lapse’s length. But is it really? Before we consult actual data, we adapt our simple model appropriately to predict actual averages for these two time lapses. One glaring weakness of our simple model is that it ignores the precession of Earth, a phenomenon noted by Hipparchus9 in about 130 bc. Earth’s solstices and equinoxes rotate against the Zodiac, the background of stars along Earth’s equatorial plane, with a period of 25 800 years. As one of the significant blockbuster punchlines in Isaac Newton’s Principia of 1687, Newton predicted that Earth must be flattened at its poles (outlined in Chapter VIII). He showed that the combined gravitational force exerted by the Moon and the Sun on the mass in Earth’s bulging equatorial region was enough to explain Earth’s precession rate. Although its orbital eccentricity is highly exaggerated, Figure 5 shows Earth’s orbit precessing with time, where 𝐹 represents a fixed star, 𝑆 is the Sun, 𝐸 is Earth, and 𝑃 is the perihelion10 of Earth’s orbit. Figure 5a displays Earth at time 12 →

9 Hipparchus is credited with making the first star charts and trigonometric tables, gathering together all that was known up to his day in astronomy. 10 The perihelion of a planet is the point along its orbit nearest the Sun.

386 E

Chapter XI: Lunar Rhythms S

F P

a. At time 𝑡 = 0 years.

E

F

S

F

S

E

P

P

b. Once around, not quite one year.

c. At 𝑡 = 1 year.

Figure 5. Earth’s orbit precessing with time. 0 when 𝐸, 𝑆, 𝑃, and 𝐹 are aligned. Figure 5b displays Earth after it completes 1 one complete orbit, at time 𝑡 ≈ (1 − ) years. Note that Earth’s orbit has 25800 precessed clockwise by a small amount. Because of precession, the line through 𝐸 and 𝑃 fails to pass through 𝐹. At time 𝑡 = 1 (sidereal) year, 𝐸, 𝑆, and 𝐹 are once more aligned, but 𝑃 fails to be on this line. In our simple Earth-Moon-Sun model, in one year Earth completes one lap in its orbit. Yet in one year, Earth actually travels slightly more than one lap in its orbit, which means that each year the 1 simple model loses year with respect to how time is actually measured. In 25800

(365.25)⋅(19)

nineteen years this precession loss in our simple model is 25800 days ≈ 6.46 hours, and, similarly, in 160 years this loss is 2.265 days.

≈ 0.2690

Definition 1: Spans of years. We say that a short span of years is an ordered pair of new moon dates approximately 19 years apart and a long span of years is an ordered pair of new moon dates approximately 160 years apart, with the first components being less than the second components and where the new moon dates lie between 8 October and 7 November, an arbitrary range of 31 days chosen because they more or less encompass the time of autumn.11 Should the dates fail to be unique for a particular year, the October dates are preferred over the November dates. As a specific instance of a standard short span, here is 𝑆1996 : 𝑆1996 = (12 Oct 1996 @ 23:07, 13 Oct 2015 @ 00:06).

(1)

Definition 2: Deficient centuries and lapse spans. In the Gregorian calendar every year divisible by 4 is a leap year except for century years nondivisible by 400. Such century years as 1900 are said to be deficient. Because of the artificial nature of this leap year assignment, three kinds of short spans exist: those that contain 3, 4, or 5 leap years. We say that a short span 𝑆𝑛 is standard if no year from 𝑛 + 1 to 𝑛 + 19 is a deficient century year. The lapse of a short span 𝑆𝑛 = (𝛼, 𝛽) is (𝛽 − 𝛼) − 19 years. As with the short spans, we have three cases of long spans. 11 In my part of the world, the days before early October still feel like summer, and the days following early November feel like winter.

Predicting the time lapse between successive new moons

387

Table 1. A sample of new moon dates 19 years apart. 𝛼 2 Nov 1480 @ 8:48 22 Oct 1500 @ 23:23 11 Oct 1520 @ 15:54 29 Oct 1540 @ 20:50 19 Oct 1560 @ 6:49 5 Nov 1600* @ 22:53 25 Oct 1620 @ 13:55 15 Oct 1640 @ 4:10 3 Nov 1660 @ 1:06 22 Oct 1680 @ 11:55 12 Oct 1700 @ 10:16 31 Oct 1720 @ 11:42 20 Oct 1740 @ 16:35 9 Oct 1760 @ 1:36 27 Oct 1780 @ 17:10 18 Oct 1800 @ 8:58 6 Nov 1820 @ 0:08 25 Oct 1840 @ 8:59 14 Oct 1860 @ 14:38 2 Nov 1880 @ 15:55 23 Oct 1900 @ 13:27 12 Oct 1920 @ 0:50 30 Oct 1940 @ 22:03 20 Oct 1960 @ 12:02 9 Oct 1980 @ 2:50 27 Oct 2000 @ 7:58 16 Oct 2020 @ 19:31 4 Nov 2040 @ 18:56 24 Oct 2060 @ 9:25 13 Oct 2080 @ 2:44 average lapse:

hour lapse 24.633 15.683 8.1167 18.667 24.000 8.1500 21.983 22.183 11.300 11.317 24.017 21.883 8.1500 −8.6000 24.167 18.717 8.0667 15.567 24.517 18.533 7.2000 19.667 24.633 14.350 8.7333 19.667 23.633 14.250 8.9000 22.800 16.16

𝛼 + 19 years 3 Nov 1499 @ 9:26 23 Oct 1519 @ 15:04 12 Oct 1539 @ 0:01 30 Oct 1559 @ 15:30 20 Oct 1579 @ 6:49 6 Nov 1619 @ 7:02 26 Oct 1639 @ 11:54 16 Oct 1659 @ 2:21 3 Nov 1679 @ 12:24 22 Oct 1699 @ 23:14 13 Oct 1719 @ 10:17 1 Nov 1739 @ 9:35 21 Oct 1759 @ 0:44 8 Oct 1779 @ 17:00 28 Oct 1799 @ 17:20 19 Oct 1819 @ 3:41 6 Nov 1839 @ 8:12 26 Oct 1859 @ 0:33 15 Oct 1879 @ 15:09 3 Nov 1899 @ 10:27 23 Oct 1919 @ 20:39 12 Oct 1939 @ 20:30 31 Oct 1959 @ 22:41 21 Oct 1979 @ 2:23 9 Oct 1999 @ 11:34 28 Oct 2019 @ 3:38 17 Oct 2039 @ 19:09 5 Nov 2059 @ 9:11 24 Oct 2079 @ 18:19 14 Oct 2099 @ 1:32 st.dev: 7.79

We say that a long span 𝐿𝑛 contains the year 𝑚 if 𝑛 < 𝑚 ≤ 𝑛 + 160. Long spans, such as 𝐿2015 , containing one deficient century are said to be standard. Similarly, the lapse of a long span 𝐿𝑛 = (𝛼, 𝛽) is (𝛽 − 𝛼) − 160 years. The units for 𝑆𝑛 and 𝐿𝑛 are in (fractions of) hours. For example, 𝑆1899 is nonstandard, whereas 𝑆1900 is standard. From Equation (1), the lapse of the short span 𝑆1996 is 0.96 hours. With respect to Definitions 1 and 2, we can find the theoretical average length of both short spans and long spans.

388

Chapter XI: Lunar Rhythms

Proposition 3: The short-span rule. The expected time lapses between new moons 19 years apart are about −7.34 hours, 16.66 hours, and 40.66 hours, respectively, for the corresponding short spans containing 5, 4, and 3 leap years. Proof. Among the standard short spans, if 𝑛 ≡ 0 mod 4, 𝑆𝑛 contains the leap years 𝑛 + 4, 𝑛 + 8, 𝑛 + 12, and 𝑛 + 16, that is, 4 leap years; observe that 𝑆𝑛 fails to include 29 February in year 𝑛 because 𝑆𝑛 starts at some date between 8 October and 7 November. If 𝑛 ≡ 2 mod 4, 𝑆𝑛 contains the leap years 𝑛 + 2, 𝑛 + 6, 𝑛 + 10, 𝑛 + 14, and 𝑛 + 18, that is, 5 leap years. Similarly, if 𝑛 ≡ ±1 mod 4, then 𝑆𝑛 contains 5 leap years. Thus, among the nonstandard short spans, if 𝑛 ≡ 0 mod 4, then 𝑆𝑛 contains 3 leap years; otherwise 𝑆𝑛 contains 4 leap years. Denote by 𝛿 the average lapse of short spans in hours that contain 5 leap years. Then the average lapse of short spans containing exactly 4 or 3 leap years is respectively 𝛿 + 24 or 𝛿 + 48 because the spans of years respectively lack 1 or 2 occurences of 29 February compared to the short spans containing 5 leap years. In order to deduce 𝛿, recall that the root of sin(2𝜋𝜔𝑡) nearest 𝑡 = 19 years is 4.38 hours shy of 19 years, and in 19 years Earth precesses by 6.46 hours, which means that the average lapse of short spans should be −4.38 + 6.46 ≈ 2.08 hours. Since the pattern of leap days cycles in 400 years, let us consider the years 1800–2199. The years 1800–1880, 1900–2080, and 2100–2180 yield standard short spans, for a total of 343. Of these, 88 start with a multiple of 4. Of the 57 nonstandard short spans corresponding to 1881–1899, 2081–2099, and 2181–2199, 45 contain 4 leap years while the remaining 12 contain 3 leap years. Since the average lapse of a short span should be 2.08 hours, 255 (88 + 45) 12 𝛿+ (𝛿 + 24) + (𝛿 + 48) ≈ 2.08, 400 400 400 which means that 𝛿 ≈ −7.34 hours. Proposition 4: The long-span rule. The expected time lapses between new moons 160 years apart are about 2 hours more than 2 days, 3 days, and 1 day, respectively, for the corresponding long spans containing 1, 2, and 0 deficient centuries. Proof. As 𝑛 ranges over the 400 years 1800 to 2199, the standard long spans are 𝐿1800 through 𝐿1899 , 𝐿1940 through 𝐿2039 , and 𝐿2100 through 𝐿2139 , giving 240 standard spans. Long spans containing no deficient centuries range from 𝐿1900 through 𝐿1939 , giving 40 spans; and long spans containing two deficient centuries are 𝐿2040 through 𝐿2099 and 𝐿2140 through 𝐿2199 , giving 120 spans. Thus standard long spans occur 60% of the time, long spans with no deficiencies occur 10% of the time, and long spans with two deficiencies occur 30% of the time. Let Δ be the average day lapse between new moons 160 years apart for standard long spans;

Checking the expected length of short and long spans

389

for 30% of the remaining long spans this lapse will be Δ + 1, and for the last 10% of long spans this lapse will be Δ − 1. Solving 𝑤(𝑡) = 0 near 𝑡 = 160 years gives a root that is 0.014 days beyond 160 years. Altogether, the average lapse for new moons 160 years apart is 2.265 + 0.014 = 2.279 since in 160 years Earth precesses by about 2.265 days. So 0.60Δ + 0.30(Δ + 1) + 0.10(Δ − 1) ≈ 2.279, which means that Δ ≈ 2.08 days ≈ 49.9 hours.

Checking the expected length of short and long spans Table 1 is a listing of thirty non-overlapping short spans ranging from 1480 to 2099 wherein each span contains precisely four leap years (four occurrences of 29 February). The asterisk marking the year 1600 in the table’s first column serves to alert the reader of the 1582 calendar change from the Julian Calendar—in which every fourth year is a leap year—to the Gregorian Calendar—in which every fourth year is a leap year except at century years nondivisible by 400. The central column of this table gives the time difference in hours, modulo 19 years, between short span dates: second component minus first component. Thus, for example, in the first row of the table, the difference between 3 November 1499 and 2 November 1480 is 24.63 hours. The mean and standard deviation of these thirty short span differences are 𝑥1 ≈ 16.16 and 𝑠1 ≈ 7.79 hours. With 𝛿 ≈ −7.34 hours (the average short span lapse containing five leap years), observe that the difference between the expected mean 𝛿 + 24 ≈ 16.34 and the observed mean 𝑥1 ≈ 16.16 is |16.34 − 16.16| = 0.18 hours ≈ 11 minutes—not bad for our simple model. Table 2 is a listing of thirty non-overlapping long spans ranging from year 2000 bc through year 2881 ad. Most of these spans contain no deficient centuries, but not the years in the first column marked by asterisks, such as 1601* and 1761**. The notation 𝑛* means that the long span (𝑛, 𝑛 + 160) contains exactly one deficient century. For example, between 1601 and 1761, the year 1700 is deficient. The notation 𝑛** means that the corresponding long span contains exactly two deficient centuries. For example, between 1761 and 1921, both 1800 and 1900 are deficient. Thus, the time differential within a single-asterisk long span must be reduced by 24 hours, and within a double-asterisk long span by 48 hours. The central column of this table reflects this adjustment. Note also that no long span in the table contains the year 1582, the year the Gregorian Calendar supplanted the Julian Calendar in our time reckoning and a year in which our calendar lost eleven days. The mean and standard deviation of these long span differences are 𝑥2 ≈ 25.62 hours and 𝑠2 ≈ 3.31 hours. Observe that the difference between the expected mean (Δ ≈ 25.9 hours) and the observed mean is about 0.3 hours ≈ 18

390

Chapter XI: Lunar Rhythms

minutes. The ratio of 𝑠1 to 𝑠2 is about 2.35; equivalently, the ratio of their variations is over 5.5. In various trials of thirty short and long spans, this ratio waxed higher at times. Does our simple model anticipate a variation ratio this high, or is this ratio inflated due to chaos?

Table 2. A sample of new moon dates 160 years apart. 𝛼 5 Nov 2000 bc @17:15 8 Oct 1840 bc @6:34 9 Oct 1680 bc @12:17 10 Oct 1520 bc @18:21 11 Oct 1360 bc @23:41 13 Oct 1200 bc @3:42 14 Oct 1040 bc @6:00 15 Oct 880 bc @5:54 16 Oct 720 bc @3:11 16 Oct 560 bc @22:58 17 Oct 400 bc @19:07 18 Oct 240 bc @17:21 19 Oct 80 bc @18:12 20 Oct 81 @ 21:07 22 Oct 241 @ 1:24 23 Oct 401 @ 6:38 24 Oct 561 @ 12:18 25 Oct 721 @ 17:09 26 Oct 881 @ 19:52 27 Oct 1041 @ 20:02 28 Oct 1201 @ 17:51 29 Oct 1361 @ 14:12 25 Oct 1601* @ 23:11 27 Oct 1761** @ 22:04 30 Oct 1921 @ 23:38 1 Nov 2081** @ 3:04 4 Nov 2241* @ 7:23 8 Oct 2401* @ 3:10 10 Oct 2561** @ 6:52 13 Oct 2721 @ 9:09 average lapse:

hourly lapse 26.117 29.717 30.067 29.333 28.017 26.300 23.900 21.283 19.783 20.150 22.233 24.850 26.917 28.283 29.233 29.667 28.850 26.717 24.167 21.817 20.350 19.917 22.883 25.567 27.433 28.317 28.917 27.700 26.283 23.967 25.62

𝛼 + 160 years 6 Nov 1840 bc @19:22 9 Oct 1680 bc @12:17 10 Oct 1520 bc @18:21 11 Oct 1360 bc @23:41 13 Oct 1200 bc @3:42 14 Oct 1040 bc @6:00 15 Oct 880 bc @5:54 16 Oct 720 bc @3:11 16 Oct 560 bc @22:58 17 Oct 400 bc @19:07 18 Oct 240 bc @17:21 19 Oct 80 bc @18:12 20 Oct 81 @ 21:07 22 Oct 241 @ 1:24 23 Oct 401 @ 6:38 24 Oct 561 @ 12:18 25 Oct 721 @ 17:09 26 Oct 881 @ 19:52 27 Oct 1041 @ 20:02 28 Oct 1201 @ 17:51 29 Oct 1361 @ 14:12 30 Oct 1521 @ 10:07 27 Oct 1761 @ 22:04 30 Oct 1921 @ 23:38 1 Nov 2081 @ 3:04 4 Nov 2241 @ 7:23 6 Nov 2401 @ 12:18 10 Oct 2561 @ 6:52 13 Oct 2721 @ 9:09 14 Oct 2881 @ 9:07 st.dev. 3.31 hours

Expected value of the variation in spans of years∗

391

Expected value of the variation in spans of years∗ Recall that 𝑤(𝑡) = sin(2𝜋𝜔𝑡). If a new moon should occur every 𝑞 years, then the difference 𝑤(𝑡 + 𝑞) − 𝑤(𝑡) should be small and the variation of this difference should be small as well. Definition 5: Variance of a function. The average value, denoted by 𝜇𝑔 , of a continuous function 𝑔(𝑥) over the interval [0, 𝑇] for some positive number 𝑇 is 𝑇

𝜇𝑔 =

1 ∫ 𝑔(𝑥) 𝑑𝑥. 𝑇 0

The variance, denoted by var(𝑔) = 𝜎𝑔2 , of 𝑔(𝑥) over the interval [0, 𝑇] is 𝑇

var(𝑔) = 𝜎𝑔2 =

1 ∫ (𝑔(𝑥) − 𝜇𝑔 )2 𝑑𝑥. 𝑇 0

The standard deviation, denoted by 𝜎𝑔 , of 𝑔(𝑥) is the square root of the variance of 𝑔(𝑥) over the interval. Lemma 6: Moon variance with respect to spans. Let 𝑞 be a fixed positive 1 integer. The variance of sin(2𝜋𝜔(𝑡 + 𝑞)) − sin(2𝜋𝜔𝑡) over [0, ] is 𝜔

𝑠(𝑞) = √1 − cos 2𝜋𝜔𝑞. Proof. Let 𝑓(𝑡, 𝑞) = 𝑤(𝑡 + 𝑞) − 𝑤(𝑡) = sin(2𝜋𝜔(𝑡 + 𝑞)) − sin(2𝜋𝜔𝑡). Since the 1 1 period of 𝑤(𝑡) is 𝑇 = , the average value of 𝑓(𝑡, 𝑞) over the interval [0, ] with 𝜔 𝜔 respect to 𝑡 is 0. By definition of the variance of 𝑤(𝑡 + 𝑞) − 𝑤(𝑡), the variation 𝑣(𝑞) is 1 𝜔

2

𝑣(𝑞) = 𝜔 ∫ (sin(2𝜋𝜔(𝑡 + 𝑞)) − sin(2𝜋𝜔𝑡)) 𝑑𝑡 = 1 − cos(2𝜋𝜔𝑞).

(2)

0

The square root of Equation (2) is the desired standard deviation, 𝑠(𝑞). Figure 6 is a graph of the standard deviation 𝑠(𝑞) of lapses of 𝑞 years between new moons, as 𝑞 ranges from 1 to 500. In particular, the standard deviation is lowest at 𝑞 = 160, and is not quite so low at twice and thrice this value. Indeed, since 𝑠(19) 𝑠(19) ≈ 0.0275 and 𝑠(160) ≈ 0.0021, ≈ 12.90. If we make the reasonable 𝑠(160)

assumption of a positive correlation between Moon displacement as determined by our model and as determined by NASA data, then our simple Earth-MoonSun model has indeed anticipated the standard deviation at 𝑝 = 160 years being significantly less than at 𝑞 = 19 years in NASA’s data. The fact that it does so is somewhat remarkable when considering that the Moon’s position in time is chaotic. That is, our humble Earth-Moon-Sun model is fairly powerful.

392

Chapter XI: Lunar Rhythms

standard deviation

0.10

q = 19 0.02

q = 160, low point 300

100

q years

Figure 6. Standard deviation, 𝑠(𝑞) = √1 − cos(2𝜋𝜔𝑞). Yet it is conceivable that the actual pattern of the phases of the Moon has a 𝑞 value, call it ˆ 𝑞 , other than 19 and 160 for which the corresponding standard deviation is even lower. One reason for thinking so comes from Chapter X on the transit of Venus. The relative angular velocity 𝜓 of Venus with respect to Earth is 𝜓 ≈ 1.62555, and the nearest integer continued fraction algorithm yields the following convergents for 𝜓: 3 8 13 369 10714 (3) 2→ → → → → . 2 5 8 227 6591 As we saw in Figure X.14, 𝒮𝜓 clearly displays 𝑞 = 8 branches. In fact, Venus 369

transits often occur as a twin transit, eight years apart. From the convergent 227 in Equation (3), one might conjecture that the time lapse between Venus transits should be 𝑞 = 227 years. But as we saw, capitalizing on two nearly central transits of Venus separated by over 6000 years led us to reduce 𝜓 by 0.00004, which 13 395 changes the convergent following in Equation (3) to . Indeed, the correct 8 243 time lapse between June transits of Venus is 243 years. The next one should occur in year 2247. Table 3. Some possible values for ˆ 𝑞. 𝑛 ˆ 𝑞 = 19 ⋅ 𝑛 + 8

6 7 8 9 10 122 141 160 179 198

Since we cannot duplicate this approach to predicting the time lapse ˆ 𝑞 between new moons, we capitalize on knowing that the time lapse ˆ 𝑞 must correspond to a first component of a point on branch 1 of the nineteen branches of 235 𝒮𝜔 ˆ is very near 𝜔 and where is a convergent of 𝜔 ˆ . As follows from ˆ where 𝜔 19

Final thoughts

393

Equation (VII.21)12 , these first components are of the form 19𝑛 + 8 and must be near 19 ⋅ 8 + 8. So some reasonable candidates for ˆ 𝑞 are those in Table 3. Consulting Espenak’s data for spans of length 122, 141, 179, and 198 from Table 3, we find that 141 years is the more agreeable result. Table 4 gives thirty spans of 141 years. Each of these spans contains either 34 or 35 leap days. In the central column, an asterisk means that the corresponding span contains 34 leap days, and the span length was therefore decreased by 24 hours. For example, the span 1601–1742 contains 34 leap days for a span length of 50.82 hours, but we reduce it by 24 hours to 35.82. For a cycle length of 141 years, the table gives the mean as about 33.72 hours with a standard deviation of 𝑠3 ≈ 3.13 hours. Since 𝑠3 < 𝑠2 , 141 bests a cycle of 160 years with respect to standard deviation.

Final thoughts The observed standard deviations for the time lapse between new moon dates 19, 141, and 160 years apart are counterintuitive. Since the Earth-Moon-Sun system is chaotic, one might think a priori that the chaos should be cumulative—that the standard deviations of time lapses between new moons should increase as the time lapses increase. Our simple Earth-Moon-Sun model suggests that the contrary is true for specific lapses, and the data support and emphasize this result. In Tables 1, 2, and 4, we purposely selected a regular sequence of non-overlapping spans to allow, if possible, the data to be that of independent events. However, even when we selected overlapping spans, the resultant standard deviations gave comparable values. Such results are in keeping with a rule of Hofstadter [72, p. 299] about chaos: An eerie type of chaos can lurk just behind a façade of order—and yet, deep inside the chaos lurks an even eerier type of order. Within the façade of regularity of the successive phases of the Moon from month to month is a troublesome irregularity of up to about seven hours—yet a closer examination, as we have seen, reveals an eerie regularity at specific lapses between the phases. Finally, with respect to this chapter’s introductory question, how do the various lunar holidays fare in terms of spans of nineteen years? Table 5 shows that lapses (in days) between Hanukkah dates nineteen years apart are close to zero. However, Table 6 shows that lapses between Easter dates nineteen years apart have a greater variation. But Easter must fall on a Sunday. So instead of expecting an error of zero days, we expect an error of about half a week, or three or 12 With 𝑝 𝑞

=

235 19

, we have 𝜖 = sgn(𝑞𝜔 − 𝑝) = 1. Solving 𝑝𝑟 ≡ −𝜖𝑞 for 𝑟 gives 𝑟 = 8.

394

Chapter XI: Lunar Rhythms Table 4. A sample of new moon dates 141 years apart. 𝛼 28 Oct 598 bc @ 11:35 2 Nov 452 bc @ 9:10 11 Oct 298 bc @ 19:50 16 Oct 152 bc @ 23:38 4 Nov 1 @ 1:39 2 Nov 145 @ 13:37 18 Oct 301 @ 12:41 16 Oct 445 @ 21:28 31 Oct 601 @ 18:51 29 Oct 745 @ 16:02 15 Oct 901 @ 9:09 13 Oct 1045 @ 1:24 28 Oct 1201 @ 17:51 26 Oct 1345 @ 2:29 25 Oct 1601 @ 23:11 25 Oct 1745 @ 1:54 12 Oct 1901 @ 13:11 10 Oct 2045 @ 10:37 27 Oct 2201 @ 22:25 26 Oct 2345 @ 8:13 13 Oct 2501 @ 12:55 11 Oct 2645 @ 20:25 28 Oct 2801 @ 15:44 1 Nov 3045 @ 4:55 15 Oct 3101 @ 2:15 12 Oct 3245 @ 18:37 29 Oct 3401 @ 21:07 29 Oct 3545 @ 0:45 15 Oct 3701 @ 4:59 14 Oct 3845 @ 13:05 average lapse:

hour lapse 28.82 37.02 28.62 36.77 32.87 28.57 34.40 28.60 36.58 30.50 36.68 31.72 35.22 34.68 35.82* 29.37* 36.87 30.12* 35.97* 33.45 34.48* 35.17* 31.18* 31.02* 29.97 36.12* 28.20* 33.20 27.90* 31.72* 32.72

𝛼 + 141 years 29 Oct 457 bc @ 16:24 3 Nov 311 bc @ 22:11 13 Oct 157 bc @ 0:27 18 Oct 11 bc @ 12:24 5 Nov 142 @ 10:31 3 Nov 286 @ 18:11 19 Oct 442 @ 23:05 18 Oct 586 @ 2:04 2 Nov 742 @ 7:26 30 Oct 886 @ 22:32 16 Oct 1042 @ 21:50 14 Oct 1186 @ 9:07 30 Oct 1342 @ 5:04 27 Oct 1486 @ 13:10 28 Oct 1742 @ 11:00 27 Oct 1886 @ 7:16 14 Oct 2042 @ 2:03 12 Oct 2186 @ 16:44 30 Oct 2342 @ 10:23 27 Oct 2486 @ 17:40 15 Oct 2642 @ 23:24 14 Oct 2786 @ 7:35 30 Oct 2942 @ 22:55 3 Nov 3186 @ 11:56 16 Oct 3242 @ 8:13 15 Oct 3386 @ 6:44 1 Nov 3542 @ 1:19 30 Oct 3686 @ 9:57 17 Oct 3842 @ 8:53 16 Oct 3986 @ 20:48 st.dev: 3.13 hours

four days, which agrees more or less with the central row of the table. Subject to a few assumptions, Stephen Woodcock at the University of Technology, Sydney, Australia, speculates that this somewhat chaotic integer sequence in the middle row of Table 6 repeats about every 5.7 million years [94]. Ramadan is a true lunar holiday in that its start date is twelve moons after the last start. Since a year minus twelve moons is about eleven days, we see in Table 7 that the next Ramadan starts twelve moons (one year lessened by eleven days) after the start of the last Ramadan. Because a month is about a moon, we choose to let the lapses in the central row be the differences between the days

Exercises

395 Table 5. Hanukkah dates nineteen years apart.

year day lapse day year

2001 10 Dec 1 11 Dec 2020

2002 2003 30 Nov 20 Dec −1 −1 Nov 29 19 Dec 2021 2022

2004 2005 2006 8 Dec 26 Dec 16 Dec 0 0 −1 8 Dec 26 Dec 15 Dec 2023 2024 2025

2007 5 Dec 0 5 Dec 2026

2008 22 Dec 3 25 Dec 2027

2009 12 Dec 1 13 Dec 2028

2010 2 Dec 0 2 Dec 2029

2009 12 Apr 4 16 Apr 2028

2010 4 Apr 1 1 Apr 2029

Table 6. Easter dates nineteen years apart. year day lapse day year

2001 15 Apr −3 12 Apr 2020

2002 2003 2004 31 Mar 20 Apr 11 Apr 4 −3 −2 4 Apr 17 Apr 9 Apr 2021 2022 2023

2005 2006 27 Mar 16 Apr 4 −4 31 Mar 20 Apr 2024 2025

2007 8 Apr −3 5 Apr 2026

2008 23 Mar 5 28 Mar 2027

of the months of Ramadan starts that are nineteen years apart. For example, the lapse between 24 April 2020 and 16 November 2001 we take as (24−16) = 8 days. When day 𝑌 in year 𝑋 + 19 is less than day 𝑍 in year 𝑋, we add 𝑌 to the number of days in 𝑍’s month and from it subtract 𝑍. For example, the lapse between 3 April 2022 and 27 October 2003 is (31 + 3) − 27 = 7 days. Since a month is about three to four days longer than a moon, we might expect this lapse to be about that long. But instead it is about eleven days minus about three to four days. Table 7. Ramadan dates nineteen years apart. year day lapse day year

2001 2002 2003 2004 16 Nov 5 Nov 27 Oct 15 Oct 8 8 7 8 24 Apr 13 Apr 3 Apr 23 Mar 2020 2021 2022 2023

2005 4 Oct 7 11 Mar 2024

2006 2007 2008 2009 2010 23 Sep 12 Sep 1 Sep 21 Aug 10 Aug 8 6 7 7 6 1 Mar 18 Feb 8 Feb 28 Jan 16 Jan 2025 2026 2027 2028 2029

We leave as exercises for the reader the generation of similar tables for Diwali and Tét. Exercises 1. If the period of the Moon had been incidental to Meton’s argument for adopting a nineteen-year calendar, what somewhat small integer 𝑛 > 19 would have been a better choice for a calendar of 𝑛 sidereal years? (Hint: Recall that one year is about 365.256 days.)

396

Chapter XI: Lunar Rhythms

Io Jupiter

Ganymede Europa

Callisto

Figure 7. The four moons of Jupiter. 2. Estimate how long ago the nineteen-year cycle of the Moon began. See Assumptions 2 and 3 of the final section of Chapter VII. Estimate how long a 160-year cycle should last. 3. Devise a medium-span rule applicable to the lapse between new moons 141 years apart similar to the short-span rule for 19 years and the long-span rule for 160 years. 4. Table 1 was compiled using disjoint short spans. Instead, generate a similar table using overlapping short spans. In particular, as left-hand endpoints of the thirty short spans, use NASA’s data for the leap years 1904, 1908, … , 2200. Contrast your results with Table 1. 5. Generate a table much like Table 1 using a short span of twenty years rather than nineteen years. 6. What time lapse—other than 19, 141, or 160—gives a lesser variation in the difference between new moons? Intuitively, one might conjecture that the inherent chaos of the Earth-Moon-Sun system might destroy any hope of finding a lapse longer than 141 which has less variation. Yet the fact that 141 bests 19 may give a little hope. One difficulty in continuing with analysis similar to what we have done is that we have lost a precise value of 𝜔. That is, to arrive at 1 ˆ 𝑞 = 141, we increased 𝜔 by 0.0000387597 ≈ , giving a modified value of 𝜔 5800 which we will call 𝜔 ˆ ≈ 12.368786, which means that 𝑝 ˆ = round(ˆ 𝑞𝜔 ˆ ) = 1744. So ˆ𝑟 = 122, which means that the next convergent for the value into which 𝜔 has morphed should have denominator 𝑚 = 865 by the harmonic algorithm. Does 865 yield a lesser variation? (Since 865 is large, rather than using disjoint spans of years, try using spans of years shifted in time by, say, ten years.) 7. The largest moon of Jupiter is Io. With respect to Io, design a Metonic-yearlike calendar for Jovians (people who live on Jupiter) as so many Jovian years. 8. The four most-visible moons of Jupiter are Io, Europa, Ganymede, and Callisto. Suppose the centers of Jupiter, Io, Europa, Ganymede, and Callisto are collinear at time 𝑡 = 0. How many years will it be before they achieve this state again? See Figure 7.

Exercises

397

9. The mean length of a tropical year13 𝐿 is 365 days, 5 hours, 48 minutes, and 45.14 seconds. The nearest integer continued fraction algorithm 𝑍 applied to 𝐿 gives 𝐿 ≈ [365; 4, 8, −4]𝑍 ⟶ {365,

1461 12053 46751 , , }. 4 33 128

The Gregorian calendar is based upon the first convergent 12053

1461 4

. Devise an-

other calendar based upon the second convergent . Will we need leap 33 years or drop years? Describe the analog of the Gregorian calendar deficientcentury rule (see p. 386). 10. Produce tables for the holidays Tét and Diwali similar to Tables 5 through 7.

13 A

tropical year is the time from summer solstice to summer solstice.

Strand XII: Eclipse Lore and Legends A solar eclipse occurs when the Moon passes between Earth and the Sun, obscuring some part of the Sun as viewed from somewhere on Earth, and a lunar eclipse occurs when the Moon passes into the shadow of Earth. Observing an eclipse on Earth within the main shadow cast by the Moon is dramatic. For example, here is a fragment from the Greek poet Archilochus describing a solar eclipse that may have occurred on 6 April 648 bc [150]: Nothing can be surprising any more or impossible or miraculous, now that Zeus, father of the Olympians has made night out of noonday, hiding the light of the gleaming sun, and...fear has come upon mankind. After this, men can believe anything, expect anything. Don’t any of you be surprised in the future if land beasts change places with dolphins and go to live in their salty pastures, and get to like the sounding waves of the sea more than the land, while the dolphins prefer the mountains. Figure 1 is a satellite photo capturing the Moon’s main shadow over Mexico during a solar eclipse. Figure 2 is an image of the Moon entering Earth’s shadow during a lunar eclipse. The reason that some of the left-hand side of the Moon remains visible is because the light refracted through Earth’s atmosphere affords some illumination. Ancient Chinese mythology explains a solar eclipse as an invisible celestial dragon swallowing the Sun, an ill omen for all, including the leaders of the state. As such, astronomers or wise men were engaged by many courts to study the skies, track the planets, the Moon, and the Sun against the background of the fixed stars, and interpret their portent for the populace, especially the royal family. Failure to anticipate momentous events such as eclipses or to glimpse the coming of a comet often resulted in compounded fear and confusion. For example, as legend has it, when given sufficient advance warning of a solar eclipse, the Chinese emperor would decree that teams of drummers and 399

400

Strand XII: Eclipse Lore and Legends

Figure 1. A complete solar eclipse near Mexico City, courtesy of NASA.

Figure 2. A lunar eclipse, courtesy of NASA. archers be ready to both raise a rhythmic cacophony and launch cascades of arrows to frighten away the dragon. In Babylon, so goes the legend, the king’s advisors would appoint a substitute king for the duration of an eclipse, whereafter the poor man would be summarily executed, so fulfilling the expected inescapable calamity accompanying such astronomical events. The first recorded solar eclipse was probably the one that was visible from China on 22 October 2134 bc. Unfortunately, the royal astronomer Hi Xi failed to anticipate the event because of, says a legend, having had too much to drink.

Strand XII: Eclipse Lore and Legends

401

Figure 3. The Antikithera device, on display at the National Archeological Museum in Athens, author sketch. For such a crime Hi Xi was hanged, and thus arose the adage, “No astronomer is ever drunk during an eclipse.”1 Against such traditions of doom, ancient astronomers were under enormous pressure to get it right. The Babylonians kept years of careful records, with some of their lunar eclipse data dates gathered from places far from Babylon. Once they had amassed a sufficient base of recorded phenomena, they spotted a pattern. Lunar eclipses tend to recur every 223 moons—where a moon is about 29.5 days, the mean length of time between new moons. Thus, knowing history meant knowing the future, at least in as far as eclipses were concerned. To help bring such data patterns alive, for both students and kings, astronomers devised mechanical gadgets to simulate the heavens. As some evidence of this technology, Cicero (106–43 bc), a Roman orator and politician who studied with the astronomer Posidonius on the island of Rhodes, described an instrument recently constructed by our friend Posidonius, which at each revolution reproduces the same motions of the Sun, the Moon and the five planets that take place in the heavens each day and night [96, p. 274]. In 1900, sponge divers off the coast of the Greek island Antikithera discovered a wreck sixty meters deep dating back to at least the first century bc. One of the artifacts retrieved from the wreck was a strange blob of melded copper, as shown in Figure 3. Careful inspection of this object revealed it to be an assemblage of over thirty interlocking gear wheels, one of which has nineteen teeth and 1 The extant source for this story is the Shangshu, the Book of Documents, whose compilation, according to one tradition, was overseen by Confucius (551–479 bc). Other versions of the story, perhaps mistranslations, say that two astronomers, rather than one, were decapitated.

402

Strand XII: Eclipse Lore and Legends

Figure 4. A gear with 19 teeth within a gear of 223 teeth in the Antikithera device, not to scale. another of which has 223 teeth, the very numbers making up a Metonic cycle and making up the cycle governing eclipse recurrence. To appreciate how skilled these craftsmen of old were, Figure 4 shows two gears, one with nineteen teeth and the other with 223 teeth. Making the former one seems a reasonable task; making the latter one seems incredible. After over a century of reverse engineering in studying this artifact, researchers and artisans have built working models. In accordance with their discoveries, some of the gears are mounted off center in a peg-and-slot arrangement much like the trammel of Archimedes presented in Chapter VIII. Turning a manual crank to rotate the gears results in toy planets moving in non-simple-harmonic-motion orbits. As a closing episode in the lore of eclipses, we feature the lunar eclipse of 1 March 1504, as commemorated by the minted coin of Figure 5. A few months before that event, in his fourth and final voyage to the Caribbean, Christopher Columbus beached his last two caravels on the north coast of Jamaica for repairs. For months the local people had kindly provisioned the crew, but then grew weary of the task. Alarmed, and knowing of an impending lunar eclipse, Columbus threatened to have his god extinguish the Moon and rain wrath upon them. Luckily for the admiral, the eclipse occurred on schedule. The locals pleaded for mercy. Columbus staged a pardon towards the end of the forty-eight-minute event. And local provisions continued until the crew was rescued by a ship out of Haiti [105, pp. 653–655].

Strand XII: Eclipse Lore and Legends

Figure 5. Columbus delivering a lunar eclipse, 1950 silver coin, British Virgin Islands.

403

Chapter XII: Diophantine Eclipses In this chapter we develop two algorithms to generate the dates of successive solar eclipses. Both algorithms start by assuming simple harmonic motion of the Moon about Earth. The first method is an algebraic vector approach, and the second method is a continued fraction and Diophantine equation approach. Though quite different, the two algorithms yield the same conclusions.

Adapting the Earth-Moon-Sun model To determine when eclipses recur, we proceed similarly to how we analyzed the lapses between transits of Venus in Chapter X. Again, we fix Earth and the Sun

Figure 6. Final stages of an annular solar eclipse on 20 May 2012 viewed from Sundown, Texas; courtesy of Jimmy Westlake, Colorado Mountain College. 405

406

Chapter XII: Diophantine Eclipses Moon orbit Moon ξ

ascending node

above (north)

as

t

2πωt

e

Earth

we st

descending node eclipti c plan e

projection of Moon orbit onto the ecliptic plane

Sun

below (south)

locus of node positions in the ecliptic plane

Figure 7. The Moon along its draconic orbit intersecting the ecliptic plane. in the ecliptic plane, the plane of Earth’s orbit. The Moon’s orbital plane and the ecliptic plane meet at a mean angle of 𝜉 ≈ 5.145∘ . The Moon’s orbit intersects the ecliptic plane at two points, called the ascending node and the descending node—where the orbit goes from below the ecliptic plane to above and from above to below, respectively, as illustrated in Figure 7, where the Moon is nearing the descending node. From observation, these nodes rotate clockwise with period 18.6 years in the ecliptic plane, displayed as a dashed circle in the figure. The mean time lapse for the Moon to return to an ascending node, the draconic month 𝑝𝑑 , is 𝑝𝑑 ≈ 27.212221 days. Exercises 1 and 2 show how to deduce this draconic month when one knows the sidereal month 𝑝𝑙 , the mean time lapse for the Moon to return to the same point with respect to the fixed background of the stars. Meanwhile, with respect to the Sun, the Moon’s periodicity is 𝑝𝑚 ≈ 29.530589 days, the mean synodic period of the Moon about Earth, which is the mean length of time between new moons. A time lapse of 𝑛 synodic months is 𝑛 lunations. When the Moon is near a node, an eclipse may occur: a solar eclipse at new moon, or a lunar eclipse at full moon. 𝑝 Let 𝜔 be the ratio of the synodic and draconic periods, 𝜔 = 𝑚 ≈ 1.085196. 𝑝𝑑

So, in one lunation, the Moon has completed about 1.085 draconic circuits. Definition 1: Mosun and gesun. As terminology to help analyze this model, we define the mosun as the line through the centers of the Moon and Sun. Similarly, we define the gesun as the line through the centers of Earth and the Sun. We say that a solar eclipse is central if at some point during the eclipse the Sun, the Moon, and Earth are collinear so that the mosun and gesun are the same. For simplicity, we say that a solar eclipse is total if at some moment during the

Adapting the Earth-Moon-Sun model

(0, H)

407

secondary shadow primary shadow Moon center at m(cos ψ, sin ψ)

(0, L) x-axis

mosun

ψ Earth center at (0, 0) y-axis

gesun

Sun center at (s, 0)

Figure 8. The Moon’s shadow along the 𝑦-axis at new moon. eclipse the mosun intersects Earth, and a we say that a solar eclipse is partial if the mosun never intersects Earth but some part of Earth is within the Moon’s shadow. Let the radii of Earth, the Moon, and the Sun be 𝐸 = 6400 km, 𝑀 = 1738 km, and 𝑆 ≈ 6.96 × 105 km, respectively, and let the mean distances of the Moon and Sun from Earth be 𝑚 ≈ 384000 km and 𝑠 ≈ 149.6 × 106 km, respectively. We assume that the Moon’s orbit about Earth is simple harmonic motion. In particular, let (𝑢, 𝑣) = 𝑚(cos 2𝜋𝜔𝑡, sin 2𝜋𝜔𝑡) (1) be the position of the Moon in its orbital plane, where 2𝜋𝜔𝑡 is the counterclockwise angle from the Moon’s ascending node and 𝑡 is time, as illustrated in Figures 7 and 10. At 𝑡 = 𝑛 lunations, let 𝜓 be the central angle at Earth between the Moon and Sun. Since 𝜉 is the angle between the orbital planes of Earth and the Moon, the most that 𝜓 can be is 𝜉. The shadow of the Moon cast by the Sun projected onto the 𝑦-axis, as illustrated in Figure 8, consists of a primary shadow and a secondary shadow. Since 𝜓 is so small, the boundary of the primary shadow is approximately obtained by following a ray from the top of the disk of the Sun through the top of the disk of the Moon onto the 𝑦-axis, and by following a ray from the bottom of the disk of the Sun through the bottom of the disk of the Moon onto the 𝑦-axis. The boundary of the secondary shadow is approximately obtained by following a ray from the top of the disk of the Sun through the bottom of the disk of the Moon, and by following a ray from the bottom of the disk of the Sun through the top of the disk of the Moon. An astronaut positioned along the 𝑦-axis inside the the primary shadow will be mostly in the dark, and inside the secondary shadow will see some portion of the Sun obscured by the Moon.

408

Chapter XII: Diophantine Eclipses

up

dary of per boun

shadow

S

M

r E

primary

m

x-axis

s

Figure 9. Finding the maximum radius 𝑟 of the primary shadow.

Eclipse duration How long does a solar eclipse last? Lemma 2: Solar eclipse length. The maximum length of a solar eclipse observed globally from Earth is about three hours and fifty minutes. Derivation. The maximum duration ℳ of a solar eclipse occurs when the eclipse is central, which means that the path of the mosun across the disk of Earth should trace an Earth diameter. Since the speed of the Moon about Earth is 2𝜋𝑚/𝑝𝑚 , we have 2𝜋𝑚ℳ/𝑝𝑚 = 2𝐸. Therefore 2𝑝 𝐸 days ℳ≈ 𝑚 ≈ 3.84 hours ≈ 3 hours 50 minutes. ♢ 2𝜋𝑚 For example, the eclipse of 15 January 2010 was first visible near eastern Cameroon at 5:14 London time and ended in western South Korea at 8:59 London time, for a total duration of 3 hours 45 minutes. However, the maximum duration of an eclipse with respect to a specific location is much shorter. For example, the maximum duration of this particular eclipse with respect to a specific geographic location was 11 minutes 8 seconds near the Maldive Islands in the Indian Ocean. Subject to actual distance fluctuations of the Moon from Earth, Exercise 4b asks the reader to explore these global and local maximal solar eclipse durations.

A sufﬁcient condition for eclipses In this section, we find the approximate radius of the primary shadow when the mosun and gesun coincide, and find how far the mosun can be from Earth during a solar eclipse. Lemma 3: Radius of primary shadow. The radius of the largest primary lunar shadow during a solar eclipse is about fifty kilometers. Derivation. Sometimes when viewing a total eclipse of the Sun near where the mosun strikes Earth’s surface, the eclipse is annular—the apparent disk of the Moon is a little smaller than the apparent disk of the Sun—which means there is no primary shadow. So to find the largest possible primary shadow radius, we

A sufﬁcient condition for eclipses

409

consider the case where the Moon is at perigee and Earth is at perihelion. That is, assume that the Moon is 𝑚 = 363000 km from Earth and Earth is 𝑠 ≈ 147.1 × 106 km from the Sun. Consider Figure 9 depicting the radii of the primary shadow 𝑟, the Moon, and the Sun, along with their distances from Earth. The slope of the upper boundary of the primary shadow computed two different ways is 𝑆−𝑟 𝑀−𝑟 = , 𝑚−𝐸 𝑠−𝐸 whose solution is 𝑟 ≈ 48.9 km, giving our model’s best guess as to the maximum radius of the primary shadow during an eclipse. ♢ Proposition 4: Maximum Mosun distance from Earth’s center during a solar eclipse. Let (0, 𝐻) be the point where the mosun crosses the vertical axis, as shown in Figure 8, during a solar eclipse. The most that 𝐻 can be is about 9900 kilometers. Derivation. With the Moon at 𝑚(cos 𝜓, sin 𝜓), 0 ≤ 𝜓 ≤ 𝜉, the slope of the mosun can be written in two ways as 𝑚 sin 𝜓 𝐻 = , 𝑠 − 𝑚 cos 𝜓 𝑠 which means that 𝐻=

𝑠 𝑚 sin 𝜓 . 𝑠 − 𝑚 cos 𝜓

(2)

Thus, by Equation (2) with 𝐻 = 𝐸, the angle 𝜓 for which the mosun merely grazes Earth is about 𝜓 ≈ 0.95∘ . Let (0, 𝐿) be the lower boundary point of the Moon’s secondary shadow on the 𝑦-axis. The slope of the lower bounding line for the secondary shadow can be written in two ways as 𝑆 − (𝑚 sin 𝜓 − 𝑀) 𝑆−𝐿 = , 𝑠 𝑠 − 𝑚 cos 𝜓 which gives 𝐿=𝑆−

𝑠(𝑆 + 𝑀 − 𝑚 sin 𝜓) . 𝑠 − 𝑚 cos 𝜓

(3)

When 𝐿 = 𝐸, Equation (3) gives 𝜓 ≈ 0.0258 ≈ 1.48∘ . Then by Equation (2), 𝐻 ≈ 9932 km, which we round down to 9900 km for simplicity. A solar eclipse will occur during a new moon if the mosun is at most about 9900 km above or below Earth’s center, a rule we call Condition 1:2 Condition 1 for a new moon to be an eclipse: |𝐻| ≤ 9900 km. 2 Condition

2 appears in Exercise 5c.

(4) ♢

410

Chapter XII: Diophantine Eclipses orbit Moon

ξ 2πωn

descending node

E (0, 0, 0) u =m cos2 πω n

m n ψ n2πω si ξ m v=

M

new moon at (X, 0, Z) S

(X, 0, 0) ascending node

(s, 0, 0)

ecliptic plane

Figure 10. Characterizing ecliptic coordinates, (𝑋, 0, 𝑍), for a new moon at 𝑛 lunations.

Finding 𝐻 at any lunation For this section the reader may wish to review lines and planes in ℝ3 as summarized in Appendix II. Proposition 5: Mosun distance during a solar eclipse. Let (0, 𝐻) be the point where the mosun crosses the vertical axis at lunation 𝑛. Then 𝐻=

𝑠 𝑚 sin 𝜉 sin(2𝜋𝜔𝑛) 𝑠 − 𝑚√1 − (sin 𝜉 sin(2𝜋𝜔𝑛))2

.

Derivation. To determine where the Moon is after 𝑛 lunations with respect to the ecliptic plane, recall that at each successive lunation the Moon is new, and at lunation 𝑛 the Moon is at (𝑢, 𝑣) = 𝑚(cos 2𝜋𝜔𝑛, sin 2𝜋𝜔𝑛) with respect to the ascending node at (𝑚, 0) in the Moon’s orbital reference scheme in Equation (1). Meanwhile, with respect to the ecliptic plane, the Moon is at (𝑋, 0, 𝑍) where 𝑋 is close to 𝑚. In fact 𝑋 = 𝑚 cos 𝜓, and 𝑍’s magnitude is bounded by 𝑚 sin 𝜉. See Figure 10. Therefore 𝑍 = 𝑚 sin 𝜉 sin(2𝜋𝜔𝑛), which along with the Pythagorean theorem yields 𝑋 = √𝑚2 − 𝑍 2 = 𝑚√1 − (sin 𝜉 sin(2𝜋𝜔𝑛)2 . Let 𝒬 be the plane through Earth’s center normal to the gesun; we call 𝒬 the screen of Earth, illustrated in Figure 10. The mosun line parametrized by 𝛾 is (𝑠, 0, 0) + 𝛾(𝑋 − 𝑠, 0, 𝑍).

(5)

The projection point (0, 0, 𝐻) of the new moon at (𝑋, 0, 𝑍) with respect to the Sun onto the screen of Earth 𝒬 is obtained by solving (𝑠, 0, 0) + 𝛾(𝑋 − 𝑠, 0, 𝑍) = (0, 0, 𝐻)

Finding 𝐻 at any lunation

411

(0, 0, H)

ascending node (X, 0, Z) mosun

(0, 0, 0) Moon descending node

gesun

Earth

(s, 0, 0)

Sun

Q , the screen of Earth, normal to the gesun

Figure 11. Projecting the Moon onto the screen of Earth at new moon. n = 24

n = 47 n=0 n=6

Figure 12. A few tracks of the mosun sweeping across Earth. for 𝛾, yielding 𝛾=

−𝑠 , 𝑋 −𝑠

which means that 𝐻 = 𝛾𝑍 =

𝑠 𝑚 sin 𝜉 sin(2𝜋𝜔𝑛) 𝑠 . 𝑍= 𝑠−𝑋 𝑠 − 𝑚√1 − (sin 𝜉 sin(2𝜋𝜔𝑛))2

(6) ♢

Figure 12 displays a few of the tracks or transits of the mosun sweeping across 𝒬 at lunations near an ascending node. For each track, the dot along the vertical midline of the disk represents the point at which an observer would experience maximum eclipse duration. Observe that for 𝑛 = 24 lunations, the corresponding eclipse is partial because the mosun is above the north pole but less than 9900 km above Earth’s center. When the Moon is approaching a descending node, the paths angle downward rather than upward as the mosun sweeps from west to east across the face of Earth.

412

Chapter XII: Diophantine Eclipses

Using Condition 1 to ﬁnd the lapse between successive eclipses With Equation (6), selecting those lunations 𝑛 for which |𝐻| ≤ 9900 km and 0 ≤ 𝑛 ≤ 250 gives the list3 0, 6, 12, 18, 23, 24, 29, 35, 41, 47, 53, 59, 65, 70, 76, 82, 88, 94, 100, 106, 111, 112, 117, 123, 129, 135, 141, 147, 153, 158, 159, 164, 170, 176, 182, 188, 194, 199, 200, 205, 211, 217, 223, 229, 235, 241, 246, 247. (7) The successive time lapses between the terms of (7) are 6, 6, 6, 5, 1, 5, 6, 6, 6, 6, and so on. Thus, it appears as if the only possible time lapses between successive solar eclipses are 1, 5, or 6 synodic months apart. Furthermore, Table 1 catalogs each of these solar eclipses as being either total (T), when |𝐻| ≤ 6400 km, or partial (P), when 6400 < |𝐻| ≤ 9900. Table 1. Predicting solar eclipses, partial (P) and total (T), starting at an exact eclipse. 0: T 65 : P 129 : T 194 : T

6: T 70 : P 135 : T 199 : P

12 : T 76 : T 141 : T 200 : P

18 : P 82 : T 147 : T 205 : P

23 : P 88 : T 153 : P 211 : T

24 : P 94 : T 158 : P 217 : T

29 : T 100 : T 159 : P 223 : T

35 : T 106 : P 164 : T 229 : T

41 : T 111 : P 170 : T 235 : T

47 : T 112 : P 176 : T 241 : P

53 : T 117 : P 182 : T 246 : P

59 : T 123 : T 188 : T 247 : P

When we find the first thousand eclipses starting from an exact eclipse at 𝑛 = 0, the last eclipse occurs at 𝑛 = 5347 lunations. Among these one thousand eclipses, lapses between successive eclipses of 1, 5, and 6 days occur with proportions 8.9%, 20.8%, and 70.3%, respectively, which agree fairly well with the actual values from NASA’s website: 11.4%, 23.1%, and 65.5%–not bad for our simple model.

Continued fraction insight Rather than using the vector calculus of Proposition 5 as we did in generating Table 1, we can obtain the same results using continued fractions. Applying the nearest integer continued fraction algorithm 𝑍 to 𝜔 ≈ 1.085196, the ratio of the synodic and draconic periods of the Moon, yields 1 13 51 242 777 3350 , , , , 𝜔 ≈ [1; 12, −4, −5, 3, 4]𝑍 → { , }. 1 12 47 223 716 3087 The denominators of this sequence of convergents, 1, 12, 47, 223, 716, 3 See

Code 24 of Appendix III for how to generate this list with a CAS.

(8)

Continued fraction insight

413

suggest that the lapses between solar eclipses could be 1, 12, 47, 223, and 716 lunations apart because, for example, the ratio 242/223 means that 223 lunations is almost the same time lapse as 242 draconic cycles. Furthermore, if 𝑛 = 0 lunations corresponds with a central eclipse at an ascending node, then at 𝑛 = 12 lunations and 𝑛 = 716 lunations, an eclipse will occur near an ascending node. By symmetry, at half these lunations, namely, at 𝑛 = 6 lunations and 𝑛 = 358 lunations, an eclipse will also occur at a descending node. When consulting NASA’s tables of solar eclipses, we can find many instances of pairs of eclipse dates whose time differences are indeed about 1, 12, 47, 223, and 358 lunations apart. For example, with all dates given in London time, on 1 June 2011 at 21:17:18 and on 1 July 2011 at 8:39:30 partial solar eclipses were visible near the north pole and south pole, respectively, about 29 days, 9 hours, and 22 minutes apart—almost one synodic month. The partial eclipse of 25 November 2011 and the total eclipse of 13 November 2012 are about 12 lunations apart. Both 21 June 2001 at 11:58 and 8 April 2005 at 20:32 are total eclipse dates and are about 47 lunations apart, with the former being maximally visible around noon on St. Helena and the latter being maximally visible at 10:30 am on Tahiti. To produce a table from Equation (8) similar to Table 1, we focus on 223 and 358 lunations. Exercise 9b asks the reader to explore what results can be gleaned by focusing on 6 and 47 lunations. Definition 6: Saros and inex cycles. A sequence of successive eclipses 223 lunations apart is referred to as a saros cycle, a term coined by Edmund Halley, who adapted the Babylonian term sar. The eclipses in any particular saros cycle all occur near an ascending node or all occur near a descending node. A succession of solar eclipses 358 moons apart is called an inex cycle, which is an abbreviation of going into a cycle and exiting a cycle. The successive eclipses in any particular saros cycle alternate in occurring at ascending and descending nodes. Proposition 7: A Continued fraction solar eclipse rule. Let 𝑝 and 𝑞 be integers with |𝑝| ≤ 33 and |𝑞| ≤ 21. If lunation 0 is a central eclipse, then lunation 223𝑝 + 358𝑞 might be a solar eclipse. Derivation. If we have a central eclipse at lunation 0, then at 223𝑝 lunations we should have another eclipse, provided the integer 𝑝 is not too large. From observation, suppose we determine that the mosun transits of these successive eclipses in this saros sequence are separated by 280 km.4 Dividing 9900 km—our bound on 𝐻 in Condition 1—by 280 gives about 35.4, which we round down to 𝜂0 = 35. That is, beyond 𝜂0 lapses of 223 lunations, we exceed the bound of Condition 1 and no longer experience an eclipse. Since this factor can range from −𝜂0 to 𝜂0 , a saros cycle contains about 2𝜂0 + 1 = 71 eclipses. 4 Checking this result with Equation (6) gives 𝐻 ≈ −280.1 km at 223 lunations and about twice that distance at twice 223 lunations, and so on.

414

Chapter XII: Diophantine Eclipses

north

vertical axis L

k = −7(223) k=0 k = 1(223) k = 7(223)

k = 21(223) south Figure 13. A cascade of moson tracks on the disk of Earth with respect to the ascending node. Figure 13 shows mosun transits across the disk of Earth at various multiples of 223 lunations. In general, for eclipses in a saros cycle associated with an ascending node, as the cycle begins, the Moon’s shadow barely touches the north pole in a partial eclipse. As the months go by, the Moon’s shadow sweeps ever southward to produce partial eclipses, until waxing into total eclipses, and then waning again to partial eclipses. As the cycle ends, the Moon’s shadow barely touches the south pole. For eclipses with respect to a descending node, the cycle starts with the Moon’s shadow south of Earth and ends with it north of Earth. In particular, if 𝑘 = 0 lunations corresponds to a central eclipse occurring at an ascending node, then we use this approximate rule of thumb: At 223𝑘 lunations the mosun is −280𝑘 km along ℒ,

(9)

where ℒ is a vertical axis through Earth’s center, with 0 km corresponding to Earth’s center and |𝑘| ≤ 𝜂0 . Similar to the saros cycle, for an inex cycle, at 358𝑞 lunations we should have another eclipse, provided 𝑞 is not too large. Let 𝜇1 be an upper bound on 𝑛’s magnitude, where 𝑞 is not too large. Our rule of thumb to determine 𝜇1 is to balance it against the bounding value 𝜂0 = 35 of the Saros cycle. That is, we want 358𝜇1 = 223𝜂0 , which means that 𝜇1 ≈ 21.8. Rounding down, let 𝜇1 = 21. However, we balance this result again to refine our guess for 𝜂0 , calling it 𝜂2 . Solving 358𝜇1 = 223𝜂2 gives 𝜂2 ≈ 33.7, which we round down to 𝜂2 = 33. This balancing criterion between the saros and inex cycles means that a practical range for eclipses along a typical saros cycle is ±𝜂2 for a total of 67 eclipses, and the

Some Diophantine magic

415

range along a typical inex cycle is ±𝜇1 for a total of 43 eclipses. Again, from observation, suppose we determine that the distance between mosun transits at 358𝑝 and 358(𝑝 + 2) is 72 km.5 As with the saros cycle, we use this approximate rule of thumb: At 358𝑘 lunations the mosun is (−1)𝑘 36𝑘 km along ℒ,

(10)

where |𝑘| ≤ 𝜇1 . Because 280𝜇1 = 9240 ≤ 9900 km = 𝐻 and 223𝜂2 = 7359 ≈ 7539 ≈ 358𝜇1 , we stop our guesswork for finding appropriate bounds for integer values of |𝑝| and |𝑞|. Again, suppose we have a central eclipse at an ascending node at lunation 0. Then at integral linear combinations of 223 and 358, 223𝑝 + 358𝑞, we might also have eclipses, provided |𝑝| ≤ 33 and |𝑞| ≤ 21, an idea used by the Dutch astronomer G. van den Bergh, who painstakingly cataloged the complete panorama ♢ of solar and lunar eclipses, [158] [159]. With Proposition 7, we try generating a table of eclipse dates between 𝑘 = 0 lunations and 𝑘 = 265 lunations, where 265 was chosen after some experimentation so that our table contains a total of fifty eclipses. For example, we could go backwards in time thirteen lapses of 358 lunations and then forwards in time twenty-one lapses of 223 lunations to arrive at an eclipse date of twenty-nine lunations.

Some Diophantine magic An easy way to sift through all of these linear combinations is to solve a family of Diophantine equations. Lemma 8: A Diophantine solar eclipse algorithm. Let 𝜂 = 33 and 𝜇 = 21, and let 𝑘 be an integer between 1 and 265. Suppose that lunation 0 corresponds to a central solar eclipse. Then lunation 𝑘 could be the date of a solar eclipse whenever there exists a solution to the Diophantine equation 223𝑝 + 358𝑞 = 𝑘 where |𝑝| ≤ 𝜂, |𝑞| ≤ 𝜇, 𝑝 = −61𝑘 + 358𝑡, 𝑎 = 38𝑘 − 223𝑡, and 𝑡 is an integer satisfying the inequalities {

⌈ ⌈

−𝜂+61𝑘 358 −𝜇+38𝑘 223

⌉≤𝑡≤⌊ ⌉≤𝑡≤⌊

𝜂+61𝑘 358 𝜇+38𝑘 223

⌋,

(11)

⌋.

5 Using Equation (6) the reader may check that at 358 lunations, 𝐻 ≈ −36.44 km, and at 716 lunations, 𝐻 ≈ 72.89 km.

416

Chapter XII: Diophantine Eclipses Table 2. Eclipses at lunation 𝑘 where 𝑘 = 223𝑝 + 358𝑞, 1 ≤ 𝑘 ≤ 265. month 𝑝 𝑞 month 𝑝 𝑞 month 𝑝 𝑞 month 𝑝 𝑞 month 𝑝 𝑞

6 12 18 23 24 29 35 −8 −16 −24 29 −32 21 13 5 10 15 −18 20 −13 −8 59 65 70 76 82 88 94 −19 −27 26 18 10 2 −6 12 17 −16 −11 −6 −1 4 112 117 123 129 135 141 147 −30 23 15 7 −1 −9 −17 19 −14 −9 −4 1 6 11 164 170 176 182 188 194 199 20 12 4 −4 −12 −20 33 −12 −7 −2 3 8 13 −20 217 223 229 235 241 246 247 9 1 −7 −15 −23 30 −31 −5 0 5 10 15 −18 20

41 47 5 −3 −3 2 100 106 −14 −22 9 14 153 158 −25 28 16 −17 200 205 −28 25 18 −15 252 258 22 14 −13 −8

53 −11 7 111 31 −19 159 −33 21 211 17 −10 264 6 −3

Derivation. Since 358 and 223 are relatively prime, Euclid’s greatest common divisor algorithm gives −61 ⋅ 223 + 38 ⋅ 358 = 1. By Proposition III.25, the solutions for these Diophantine equations (as 𝑘 ranges from 1 through 265) occur when 𝑛 = −61𝑘 + 358𝑡

and

𝑚 = 38𝑘 − 223𝑡

for all integers 𝑡. Since |𝑛| ≤ 𝜂 and |𝑚| ≤ 𝜇, {

−𝜂 ≤ −61𝑘 + 358𝑡 ≤ 𝜂, −𝜇 ≤ 38𝑘 − 223𝑡 ≤ 𝜇,

which is equivalent to Equation (11).

♢

Observe that the integer 𝑡 in Equation (11) is governed by a tight bound. For most values of 𝑘, 𝑡’s lower bound exceeds its upper bound. For example, with 𝑘 = 3, Equation (11) collapses to the vacuous string of inequalities 1 ≤ 𝑡 ≤ 0. In other words, three lunations after 𝑘 = 0 fails to give a solar eclipse. When we test6 the integers 1 ≤ 𝑘 ≤ 265 for which the two left-hand and the two right-hand bounds for 𝑡 in Equation (11) are all the same, we generate Table 2. 6 See

Code 25 of Appendix III for how to do this in a CAS.

Some Diophantine magic

417

Table 3. NASA dates following a central eclipse. eclipse 16 July 2186 9 January 2187 6 July 2187 29 December 2187 26 May 2188 24 June 2188

lapse with preceding eclipse date — 6 lunations 6 lunations 6 lunations 5 lunations 1 lunation

To interpret Table 2, consider six lunations. In order to generate the value 𝑘 = 6 lunations as a linear combination of 223 and 358, think of going backwards in time eight steps of 223 lunations and then going forward five inex steps of 358 lunations. So 6 = −8 ⋅ 223 + 5 ⋅ 358. Observe that the data in Table 1 and Table 2 are consistent. Also observe that the (𝑝, 𝑞) values of Table 2 suggest whether the associated lunations correspond to total or partial eclipses. By (9) and (10), our rule of thumb to determine where the mosun strikes ℒ at 223𝑛 + 358𝑚 lunations is −280𝑝 + (−1)𝑝 (36𝑝) km.

(12)

For example, with 𝑘 = 47 lunations, (𝑝, 𝑞) = (−3, 2). For these values of 𝑝 and 𝑞, applying Equation (12) yields 912 km, whereas Equation (6) yields 913.6 km. Thus we conclude that at 47 lunations, a total eclipse occurs. For 𝑘 = 159 lunations, applying Equation (12) to (𝑝, 𝑞) = (−33, 21) yields −9996 km, whereas Equation (6) yields −9874. Since these values are less than −6400 km, we conclude that at 159 lunations, a partial eclipse occurs. To check these results against NASA’s data, we need an exact solar eclipse. NASA predicts that on 16 July 2186, the eclipse at maximum local duration will last 7 minutes 29 seconds, almost reaching NASA’s theoretical maximum duration of 7 minutes 31 seconds. When we look for the next few successive solar eclipses following this special eclipse we find that they match Table 2, as shown in Table 3. The next two examples show how we can extend the results beyond 264 lunations from a central solar eclipse. Example 9: A longer-range forecast. As we saw from the list in (7), at least one solar eclipse should occur in any span of six lunations. With this idea in mind, we find a solar eclipse that occurs somewhere between 1000 and 1006 lunations after the central eclipse on 16 July 2186.

418

Chapter XII: Diophantine Eclipses

Solution. Applying the formula of Equation (6) to lunations 1000 to 1006 gives the kilometer distances 32556, 33863, 25696, 10340, −7908, −23945, −33282. The only one with magnitude less than the Equation (4) critical value of 9900 km corresponds to lunation 1004. Now 1004 lunations is about 81 years, two months, and 1.2 days. And in 81 years, precession accounts for a loss of 1.1 days. So 81 years, two months and 2.3 days from 16 July 2186 is approximately 18–19 September 2267. NASA’s data bank indicates that a total7 eclipse of the Sun will occur at 5:23 am on 19 September 2267 (and no other solar eclipse occurs during lunations 1000 to 1006). Furthermore, solving the Diophantine equation 233𝑝 + 358𝑞 = 1004 gives 𝑝 = −61 ⋅ 1004 + 358𝑡 and 𝑞 = 38 ⋅ 1004 − 223𝑡, where 𝑡 is any integer. Observe that 𝑝 and 𝑞 are nearest 0 when 𝑡 = 171. In particular, at 𝑡 = 171, we have 𝑝 = −26 and 𝑞 = 19 and 1004 = −26 ⋅ 223 + 19 ⋅ 358. So the solar eclipse on 19 September 2267 is a recurrence of the solar eclipse that occurred 26 ⋅ 223 lunations before the central solar eclipse of 16 July 2186. ♢ Example 10: A more distant eclipse projection. Recall that 𝜔 is the ratio of the synodic and draconic periods. In Example 9 we used the third and fourth 𝑍 777 242 and for 𝜔 ≈ 1.085196 to find the occurrence of solar eclipses convergents 223 716 between 1000 and 1006 lunations after the central eclipse of 16 July 2186. In this 3350 example we use the fifth convergent to forecast a solar eclipse 3087 lunations 3087 after 16 July 2186. Solution. The span of 3087 lunations is about one day short of 249 years and seven months. In this span of time, about 3.5 days are lost to precession. Thus 3087 lunations from 16 July 2186 is about 18–19 February 2436. Since 2436 is a leap year, a better estimate might be 17–18 February 2436. Checking NASA’s tables confirms that an annular solar eclipse is slated to occur at 9:48 on 17 February 2436. ♢

Lunar eclipses The ideas governing lunar eclipses are the same as for solar eclipses. From a global perspective, lunar eclipses last about as long as solar eclipses. Lemma 11: Length of a lunar eclipse. The maximum length of a lunar eclipse is a little less than four hours. 7 Even though Equation (6) applied to lunation 1004 gives −7908 (which suggests that lunation 1004 should be a partial eclipse), Equation (12) applied to 1004 = −26 ⋅ 233 + 19 ⋅ 358 gives about −6600 km, very close to our bound of 6400 km for being a total eclipse.

A reality check

419

(0, b) (0, 0)

(s+m, S)

(m, E) gesun

Moon orbit

Earth

Sun

Figure 14. Earth’s shadow at the Moon’s orbit. Derivation. Consider Figure 14. The origin is where the gesun intersects the Moon’s orbit left of the Sun and Earth. The line through the north pole of Earth at approximate point (𝑚, 𝐸) and the north pole of the Sun at approximate point (𝑠 + 𝑚, 𝑆) has slope 𝑆−𝐸 𝑠 and vertical intercept 𝑏 (𝑠 − 𝑚)𝐸 + 𝑚𝑆 ≈ 4630 km, 𝑠 which means that the cone of Earth’s shadow at the Moon’s orbit has diameter 9260 km. Adding 2𝑀 to this diameter to account for the radius of the Moon, and dividing this augmented diameter by the speed at which the Moon traverses its circular orbit namely 2𝜋𝑚/𝑝𝑚 where 𝑝𝑚 is the synodic period, gives about 3 hours and 44 minutes, the approximate maximal length of time the Moon lies partially or totally within Earth’s shadow. ♢ 𝑏=

To compare Lemma 11 with actual NASA data, the lunar eclipse of 16 July 2000 lasted 3 hours 56 minutes, during which time the Moon was completely invisible for 107 minutes. We leave the compilation of lunar eclipse tables as an exercise.

A reality check All the results in this chapter were derived by assuming simple harmonic motion of the Moon about Earth. In so doing, we have ignored a host of variables governing the behavior of this system. For example, the Moon fails to rotate about Earth’s center. Instead, Earth and the Moon rotate about their barycenter, a point about 4671 km along the line segment from Earth’s center to the Moon’s center. All the planets, especially Jupiter, influence the motion of Earth and the Moon. The eccentricities of Earth and the Moon are greater than zero. Moreover, at present Earth’s eccentricity is cyclical with a period of about 100 000 years [9]. The Moon is currently receding from Earth by about 3–4 cm/year, whereas Earth may be receding from the Sun by as much as 15 cm/year [87].

420

Chapter XII: Diophantine Eclipses

Figure 15. A solar eclipse predicted for 8 April 2024, courtesy of NASA. Accommodating these and other factors within a solar system model is complex, the analysis of which is far beyond the scope of this text. Nevertheless, our simple model is resilient enough to generate reliable predictions of solar eclipses, albeit with up to about six hours in error. For the time and venue of the next eclipse visit NASA’s impressive data bank of both solar and lunar eclipses over six millennia. For example, Figure 15 shows some of the detail of a solar eclipse predicted to occur on 8 April 2024. The path of the mosun across Earth—displayed as a narrow band—crosses the Rio Grande near the moment of greatest local eclipse duration at approximately noon local time. The arcs sweeping across this band indicate the region on Earth from which a viewer will see only a portion of the Sun obscured by the Moon. Since the time lapses between successive solar eclipses are either one, five, or six lunations, at least two solar eclipses occur each year—perhaps in a venue near you.

A ﬁnal note As we have seen, the vector calculus approach of testing whether Equation (6) at lunation 𝑘 is bounded by 9900 km is an efficient way to find eclipse dates. Solving the Diophantine equation 233𝑝 + 358𝑞 = 𝑘 using the criterion in Equation (11) exposes some insightful structure that is omitted by the vector approach. For

Exercises

421

example, if 𝑝 is positive and 𝑞 is negative for a solution of 𝑘 lunations, then we know that the eclipse at 𝑘 lunations is a recurrence 223𝑝 moons into the future of an eclipse 358|𝑞| moons ago (with respect to time 0 of lunation 0). In summary, the solution to linear Diophantine equations and the algorithms of continued fractions are clever applications of the Euclidean algorithm for finding the greatest common divisor of two positive, relatively prime integers. That the motion of heavenly bodies—and in particular the age-old mystery of finding the pattern of eclipse phenomena—can be characterized using these simple ideas is almost magical. Exercises 1. The mean anomalistic period 𝑝𝑎 of the Moon is 𝑝 ≈ 27.554550 days, the time lapse of the Moon at successive perigees in its orbit with respect to the background of fixed stars. Assume that the major axis of the Moon’s orbit rotates 𝜓 ≈ 40.7∘ counterclockwise (same direction as the Moon’s orbit) each year. From this information, estimate the mean sidereal period 𝑝𝑙 of the Moon, the lapse between the Moon returning to the same point with respect to the fixed background of the stars. 2. Just as Earth’s sidereal orbit precesses with period about 25 800 years, so too does the Moon’s sidereal orbit precess with period about 18.6 years. With the sidereal period from Exercise 1, use this information to approximate the draconic period 𝑝𝑑 of the Moon—the time lapse between the Moon returning to the same point with respect to the ecliptic plane. 3. In tandem with an atlas, use Figure 16 to estimate the radius of the Moon’s primary shadow in this 1715 sketch by Edmund Halley. 4. (a) The value of the Moon’s perigee—the mean nearest distance of the Moon to Earth—and the Moon’s apogee—the mean furthest distance of the Moon to Earth—are, respectively, 363 396 km and 405 504 km. Use this information to approximate the extreme numbers of solar eclipses in a Saros cycle. (b) Using the information of Exercise 4a, estimate the maximal durations of solar eclipses both globally and locally. From NASA’s website on eclipses, find eclipses that closely realize these extremes. (c) With the information from Exercise 4a, estimate the longest time for the Moon to be totally within Earth’s shadow. 5. (a) Explain why the succession of paths of solar eclipses across the disk of Earth proceed from north to south for a saros cycle associated with an ascending node, and why the opposite is true for the cycle associated with a descending node.

422

Chapter XII: Diophantine Eclipses

Figure 16. Halley’s sketch of a solar eclipse over London, 1715. (b) What is the maximum number of solar eclipses that can occur in a oneyear interval? (c) An alternate algebraic condition to generate solar eclipses. Let 𝜃(𝑡) = min{2𝜋𝜔𝑡 mod 𝜋, 𝜋 − (2𝜋𝜔𝑡 mod 𝜋)}, where 2𝜋𝜔𝑡 is the central angle at Earth’s center measured counterclockwise from the ascending node to the Moon at time 𝑡 in lunations. Use Equation (6) to show that Condition 1 (Equation (4)) is approximately equivalent to Condition 2: Condition 2 for lunation 𝑛 to be an eclipse: 𝜃(𝑛) ≤ 16.7∘ . Generate Table 1 afresh by using Condition 2. 6. (a) With its mass held constant, what would Earth’s radius need to be in order for the only time lapses between solar eclipses to be a multiple of 6 lunations? (b) Suppose Earth had radius 10 000 km (and the same mass). Generate a list of possible solar eclipses analogous to Equation (7). 7. Let lunation 0 be the occurrence of a central eclipse. Let 𝑘 be an integer between 0 and 265. Show that the only values of 𝑘 for which the Diophantine equation 223𝑝 + 358𝑞 − 3087 = 𝑘,

Exercises

423

where |𝑝| ≤ 30 and |𝑞| ≤ 20, holds are those months occurring in Table 2. That is, the denominator of the fifth convergent yields no more information about solar eclipses than is already given by the third and fourth convergents. 8. Select an arbitary non-exact total solar eclipse ℰ from NASA’s data banks. Estimate the 𝐻 value corresponding to ℰ. Use this information to estimate the date of the central solar eclipse nearest to ℰ. 9. (a) Use the bounds 𝜂 = 35 and 𝜇 = 21 to generate a table much like Table 2. What entries occur in this table that fail to occur in Table 2? (b) Use the cycles of six and forty-seven (rather than 223 and 358) to generate a table much like Table 2. Contrast your results with Table 2. 10. With respect to lunar eclipses, derive a formula analogous to Equation (6). Generate a list of lunar eclipse months analogous to the list (7). Compare your results with NASA’s data banks for lunar eclipses.

Appendix I: List of Symbols Used in the Text The symbols in the list below are arranged roughly in the order in which they first appear in the text. [𝑛0 ; 𝑛1 , 𝑛2 , 𝑛3 , … , 𝑛𝑘 ] A finite simple continued fraction (p. xiii). [𝑛0 ; 𝑛1 , 𝑛2 , 𝑛3 , …] An infinite simple continued fraction (p. xiv). 𝐶𝑖 From context, either convergent 𝑖 (p. xiii) or a set indexed by 𝑖. |𝑥| From context, the absolute value of the number 𝑥 (p. xv), or the cardinality of the set 𝑥 (p. 112). 𝐺 From context, the constant of universal gravitation (p. xv). CAS An acronym for Computer Algebra System (p. xviii). ♢ The end of an example, derivation, puzzle (p. 4). ℤ The set of integers (p. 6). [𝑥] The nearest integer function (p. 6). 𝒮𝜔 The signature of 𝜔 (pp. 6, 243). 𝒫𝜔 The phyllotaxis of the irrational number 𝜔 (p. 7). 𝑎|𝑏 The integer 𝑎 divides the integer 𝑏 (p. 10).

▽ ▽ (a string of digits)𝑚

The cuneiform symbol for the integer one (p. 14). The cuneiform symbol for ten (p. 14). The representation of a number in base 𝑚 (p. 16).

𝑘

∑ 𝑎𝑖

The sum 𝑎1 + 𝑎2 + ⋯ + 𝑎𝑘 (p. 16).

𝑖=1

𝜙(𝑛)

The end of a proof (p. 19). The Euler phi function of integer 𝑛 (p. 20); in context 𝜙 may be the golden mean (p. 194).

𝑘

Π 𝑎𝑖

𝑖=1

The product (𝑎1 )(𝑎2 ) ⋯ (𝑎𝑘 ) (p. 20). 425

426

Appendix I: List of Symbols Used in the Text The Mayan symbol for zero (p. 24). 𝑖 ∞ ℕ ℤ+ 𝑎∈𝐴 ±𝑛 𝐴⊂𝐵 {𝑥| 𝑥 has property 𝑋} ∅ 𝑛! 𝑏∉𝐵 𝑝𝑅𝑞 𝑎≡𝑏 𝐴∼𝐵 𝐴∩𝐵 𝐴∪𝐵 mex(𝐴) 𝐴𝑐 ∗ 𝑎+𝑏 𝑒 gcd(𝑎, 𝑏) 𝑎 ≡ 𝑏 mod 𝑐 ℝ ℚ

From context, the imaginary number √−1 (p. 26). Infinity (p. 33). The set of natural numbers, {0, 1, 2, …} (p. 37). The set of positive integers (p. 37). The element 𝑎 belongs to set 𝐴 (p. 37). Plus or minus the integer 𝑛 (p. 37). Set 𝐴 is a subset of set 𝐵 (p. 37). The set of all 𝑥 satisfying property 𝑋 (p. 37). The empty set, also known as the null set (p. 40). The factorial of the nonnegative integer 𝑛 (p. 41). Element 𝑏 does not belong to set 𝐵 (p. 42). Element 𝑝 is related to element 𝑞 by rule 𝑅 (p. 45). Element 𝑎 is equivalent to element 𝑏 (p. 45). Object 𝐴 is similar to object 𝐵 (p. 46). The intersection of sets 𝐴 and 𝐵 (p. 46); ∩ is also the hieroglyphic ten (p. 109). The union of sets 𝐴 and 𝐵 (p. 49). The minimal excluded value from the set 𝐴 (p. 50). The complement of the set 𝐴 (p. 50). The nim sum of the integers 𝑎 and 𝑏 (p. 53). From context, 𝑒 is the number 𝑒 ≈ 2.718 (p. 65), or the eccentricity of an ellipse (p. 279). The greatest common divisor of 𝑎 and 𝑏 (p. 74). 𝑐|(𝑎 − 𝑏), where mod is short for modulo (p. 89). The set of real numbers (p. 97). The set of rational numbers (p. 107). The hieroglyphic symbol for one-half (p. 109). A hieroglyphic symbol denoting a fraction; 1

ℱ𝑛 𝑎 𝑐 ⊕ 𝑏 𝑑 ⌊𝑥⌋ ⌈𝑥⌉ [𝑥] max(𝐴) ℂ 𝑛⊙𝐴 ⊕ 𝐵

thus, ∩ is (p. 109). 10 A Farey series of fractions (p. 112). 𝑎 𝑐 The mediant of the fractions and (p. 111). 𝑏 𝑑 The floor of the real number 𝑥 (p. 114). The ceiling of the real number 𝑥 (p. 114); the nearest integer function [𝑥] is defined on p. 6. The maximum element in set 𝐴 (p. 114). The set of complex numbers (p. 117). 𝑎 𝑐 The general mediant of 𝐴 = and 𝐵 = , namely,

𝑎𝑛+𝑐 𝑏𝑛+𝑑

𝑏

(p. 124).

𝑑

Appendix I: List of Symbols Used in the Text 𝑛𝐴 ⊕ 𝐵 {𝐴||𝐵}

427

Simpler notation for the general mediant (p. 124). A representation of the simplest quantity between quantity sets 𝐴 and 𝐵 (p. 137).

⟨𝑛0 ; 𝑛1 , 𝑛2 , …⟩

Quarter note, eighth note, and dotted quarter note (p. 143). The sign of the number 𝑠, ±1 (p. 152). From context, Euler’s constant, 𝛾 ≈ 0.5772 (p. 150). Format representing a Babylonian fraction (p. 170).

(𝑛)

The binomial coefficient, 𝑛 choose 𝑘 (p. 186).

∑ 𝑥𝑖 𝑥𝑗

The sum of all terms 𝑥𝑖 𝑥𝑗 where 𝑖 < 𝑗 (p. 202).

•|

•

•|•

sgn(𝑠) 𝛾

𝑘

𝑖 1. We say that the (𝑖, 𝑗) minor of 𝑀, denoted by minor(𝑖, 𝑗), is the (𝑛 − 1) × (𝑛 − 1) matrix obtained from 𝑀 by deleting row 𝑖 and column 𝑗 from 𝑀, where 𝑖 and 𝑗 are integers with 1 ≤ 𝑖 ≤ 𝑛 and 1 ≤ 𝑗 ≤ 𝑛. We say that the determinant of a 1 × 1 matrix—which has just one entry—is equal to that lone entry. The determinant of 𝑀, denoted by det(𝑀), is recursively computed by expanding across any row of 𝑀. In particular, when expanding across row 𝑖, 𝑛

det(𝑀) = ∑ (−1)𝑖+𝑗 𝑎𝑖𝑗 det(minor(𝑖, 𝑗)). 𝑗=1

For example, let 𝑀 = [

𝑎 𝑏 ] . The determinant of 𝑀, when expanding across 𝑐 𝑑

row 1, is det(𝑀) = (−1)1+1 𝑎11 det(minor(𝑎1, 1 )) + (−1)1+2 𝑎12 det(minor(𝑎1, 2 )) = 𝑎𝑑 − 𝑏𝑐. 𝑝 𝑞 Let 𝑁 = [ 𝑠 𝑡 𝑣 𝑤

𝑟 𝑢 ] . Expanding across row 2 of 𝑁 gives 𝑥

det(𝑁) = −𝑠 det [ 𝑎 ⎡ 11 𝑎21 ⎢ Similarly, det ⎢ ⎢ 𝑎31 ⎣ 𝑎41

𝑞 𝑤

𝑟 𝑝 𝑟 𝑝 𝑞 ] + 𝑡 det [ ] − 𝑢 det [ ]. 𝑥 𝑣 𝑥 𝑣 𝑤

𝑎12 𝑎22 𝑎32 𝑎42

𝑎13 𝑎23 𝑎33 𝑎43

𝑎14 𝑎24 𝑎34 𝑎44

𝑎12 −𝑎41 det [ 𝑎12 𝑎32

𝑎13 𝑎13 𝑎33

𝑎11 𝑎14 𝑎14 ] + 𝑎42 det [ 𝑎21 𝑎31 𝑎34

𝑎11 −𝑎43 det [ 𝑎21 𝑎31

⎤ ⎥, when expanding across the last row, is ⎥ ⎥ ⎦

𝑎12 𝑎22 𝑎32

𝑎13 𝑎23 𝑎33

𝑎14 𝑎24 ] 𝑎34

𝑎14 𝑎11 𝑎24 ] + 𝑎44 det [ 𝑎21 𝑎34 𝑎31

𝑎12 𝑎22 𝑎32

𝑎13 𝑎23 ] . 𝑎33

Definition 21: The inverse of a square matrix. Let 𝐴 be an 𝑛 × 𝑛 matrix. We say that 𝐴−1 is the inverse matrix for 𝐴 if 𝐴𝐴−1 = 𝐼 = 𝐴−1 𝐴 where 𝐼 is the identity matrix, an 𝑛 × 𝑛 matrix whose diagonal terms are 1 and whose other entries are 0. Proposition 22: Inverse existence. Let 𝐴 be an 𝑛 × 𝑛 matrix with nonzero determinant. Then 𝐴−1 exists.

Appendix II: An Introduction to Vectors and Matrices

435

Proof. For a proof in general, see any linear algebra text. We show that the propo𝑎 𝑏 sition is true when 𝑛 = 2. Let 𝐴 = [ ]. Given that 𝑎𝑑 − 𝑏𝑐 ≠ 0, the inverse 𝑐 𝑑 matrix is 1 𝑑 −𝑏 𝐴−1 = [ ]. 𝑎𝑑 − 𝑏𝑐 −𝑐 𝑎 The reader may show that 𝐴𝐴−1 = 𝐼 = 𝐴−1 𝐴 where 𝐼 = [

1 0 ]. 0 1

Appendix III: Computer Algebra System Codes The lines of Mathematica code in this appendix implement various algorithms presented in the text. If you use a different computer algebra system (CAS), many of these code modules can be adapted to your system. Code 0: Mathematica syntax. In this section we alert the user to frequently used Mathematica syntax that may be different in other CASs. Comments. Surround comments with (* ... *). Lists. Use L[[i]] to refer to element i in list L. If a list has an element that itself is a list, two indices are needed. For example, if L1 is {{2, 3}, {5, 3}}, then L1[[2]] refers to the list {5, 3}, whereas L1[[2, 1]] refers to the element 5. 𝑎 𝑏 Matrices. The 2 × 2 matrix [ ] is stored as a list of lists: mat = 𝑐 𝑑 {{a, b}, {c, d}}. To multiply matrices, use the operation Dot (.), as in mat.mat. If V is a vector, V.V gives the dot product. If V has length 2, Mathematica automatically uses the column vector for V in the calculation of the matrix product mat.V. Logical operators. Use && for “and” and || for “or”. Conditional statements. The phrase If[ expr, stmt ] executes statement stmt if the expression expr is true. Furthermore, If[ expr, stmt1, stmt2 ] is interpreted as, If expr then stmt1 else stmt2. In statement construction, any stmt may be a single statement or a sequence of statements separated with semicolons. Do loop. The expression Do[ stmt, {i, imin, imax} ] executes stmt for integer i values imin to imax. Block. A helpful trick in function construction is to use local variables so that calls to the function avoid unintended consequences. One way to implement this trick is to place the code for a function within a Block, whose syntax is Block[ { varlist }, stmt ]. The variables in varlist are local to the 437

438

Appendix III: Computer Algebra System Codes

block and are separated with commas. These variables may be initialized within the list, such as Block[ { a, b, c = 5 }, ... ]. Functions. Define functions using funcname[ inputvariablelist ] := body;. By custom, user-defined function names usually start with a lowercase letter so as to avoid conflicts with standard Mathematica commands and functions. The input variable list determines the variables to be given in a call to the function. Each variable name must end in an underscore—for example, pattern[ L_, R_ ] := Block[ body ];. In the body of the function, variables L and R are used without the underscore. ♢ Code 1: Built-in functions. Many of the Mathematica commands we use in the codes of this appendix are standard functions or routines that may have similar counterparts in the CAS of your choice. Here are a few examples. • To find the decomposition of any given integer 𝑛 into a product of powers of its prime divisors, use FactorInteger. L1 = FactorInteger[n] (*L1 is the output of this list*) When n = 1000, FactorInteger returns L1 as {{2, 3}, {5, 3}}, which is interpreted as 23 ∗ 53 . • To find 𝜙(𝑛), the Euler phi function or the totient of Chapter I, use EulerPhi[n]. For example, EulerPhi[10] returns the integer 4. • Prime[n] returns the 𝑛th prime. For example, Prime[4] returns 7. • PrimeQ[n] tests whether 𝑛 is prime, returning True if it is prime and False otherwise. • BaseForm[m, n] writes the integer 𝑚 in base 𝑛. Thus BaseForm[45, 2] returns (101101)2 . • FromDigits[string, m] accepts a string of alpha-numeric symbols in base m and returns the value as a decimal integer. For example, FromDigits["1AB", 16] returns the decimal number 427. ♢ Code 2: Nim addition. The following functions perform nim addition. The function nimAdd outputs the nim sum of two nonnegative integers. nimAdd[m_,n_]:= Block[{k = 1,y = Min[{m,n}], z=Max[{m,n}], sum=0}, While[k < z, k = 2*k]; (*obtain power of 2 at least z*) While[k >= 1, (*loop *) If[(y