Prelude to calculus and linear algebra


149 29

English Pages 332 [360] Year 1968

Report DMCA / Copyright

DOWNLOAD PDF FILE

Recommend Papers

Prelude to calculus and linear algebra

  • Commentary
  • Scanned book from archive.org
  • 0 0 0
  • Like this paper and download? You can publish your own PDF file online for free in a few minutes! Sign Up
File loading please wait...
Citation preview

JOHN R. HUBBARD AT DEPARTMENT OF MATHEM ALLEGHENY

COLLEGE

ARD JOHN R. HUBB ICS AT DEPT. of MATHEM EGE

- COLUMBUS

COLL

ft

Digitized by the Internet Archive in 2023 with funding from Kahle/Austin Foundation

https://archive.org/details/preludetocalculuO000john

PRELUDE

AND

TO

CALCULUS

LINEAR

ALGEBRA

The

Appleton-Century

Mathematics

Series

Raymond W. Brink and John M. H. Olmsted, Editors

A First Year of College Mathematics, 2nd ed., by Raymond W. Brink Algebra—College Course, 2nd ed., by Raymond W. Brink

Analytic Geometry, rev. ed., by Raymond W. Brink College Algebra, 2nd ed., by Raymond W. Brink Plane Trigonometry, 3rd ed., by Raymond W. Brink

Spherical Trigonometry by Raymond W. Brink An Introduction to Matrices, Vectors, and Linear Programming by Hugh G. Campbell Modern Basic Mathematics by Hobart C. Carter Elementary Concepts of Modern Mathematics by Flora Dinkines Parts also available individually under the following titles: Part Part Part Introduction

I, Elementary Theory of Sets II, Introduction to Mathematical Logic III, Abstract Mathematical Systems to the Laplace Transform by Dio L. Holl, Clair G. Maple, and Bernard

Vinograde Introductory Analysis by V.O. McBrien

College Geometry by Leslie H. Miller Advanced Calculus by John M. H. Olmsted Calculus with Analytic Geometry (2 volumes) by John M. H. Olmsted Intermediate Analysis by John M. H. Olmsted Real Variables by John M. H. Olmsted Solid Analytic Geometry by John M. H. Olmsted The Real Number System by John M. H. Olmsted Analytic Geometry by Edwin J. Purcell Calculus with Analytic Geometry by Edwin J. Purcell

Analytic Geometry and Calculus by Lloyd L. Smail Calculus by Lloyd L. Smail The Mathematics of Finance by Franklin C. Smith

PRELUDE

AND

TO

CALCULUS

LINEAR

ALGEBRA

John

M.

H. Olmsted

Southern

sje

\New York

Illinois University

{AAC} APPLETON-CENTURY-CROFTS C

DIVISION

OF

MEREDITH

CORPORATION

Copyright © 1968 by MEREDITH CORPORATION All rights reserved

This book, or parts thereof, must not be used or reproduced in any manner without written permission. For information address the publisher, Appleton-Century-Crofts, Division of Meredith Corporation, 440 Park Avenue South, New York, N. Y. 10016. 628-1

Library of Congress Card Number: 68-14040

Copyright © 1966 by Meredith Publishing Company under the title of Calculus With Analytic Geometry PRINTED

E 67865

IN THE

UNITED

STATES

OF AMERICA

TO

CYNTHIA

AIMNTMVS

OF:

PREFACE

This volume is the first of a set of three containing the material for a first course in calculus, and the subject matter immediately prerequisite thereto. The other two volumes are entitled Basic Concepts of Calculus and A Second Course in Calculus. The three-volume set is a condensation of the author’s two-volume Calculus with

Analytic Geometry, in this same Appleton-Century Mathematics Series, obtained by deleting from the earlier book all honors sections, most of the especially difficult and sophisticated proofs, and many of the more advanced exercises. For a general discussion of purposes and objectives of the total three-volume set the reader is referred to the Preface to Calculus with Analytic Geometry that is reprinted on the pages immediately following this preface. Compression has not been the only purpose in revising the author’s original text in calculus. A second objective was inspired by the close accord existing between the Calculus with Analytic Geometry and the general spirit of the recommendations of the Committee on the Undergraduate Program in Mathematics (CUPM), of the Mathematical Association of America. This three-volume revision has been fashioned to approximate the first few courses in college mathematics outlined in the 1965 booklet entitled A General Curriculum in Mathematics for Colleges (GCMC). The main deviation from the GCMC recommendations is a more consistent emphasis on

rigor. The GCMC recommends a “spiraling” upward from intuitive discussion to more rigorous content. The present set of books is dedicated to a slightly different form of spiraling, which maintains a substantial flavor of rigor throughout, while progressing steadily from elementary aspects of rigor in the simplest contexts to relatively advanced phases in some of the more difficult settings of analysis. An underlying principle in the present volumes is that it is pedagogically preferable to pursue a joint discussion and development of intuition and rigor, rather than to treat them separately, the intuitive first and the rigorous second. This goal, as explained in the preface of the Calculus with Analytic Geometry, is a realistic one and is accom-

plished by focusing attention first on relatively lightly quantified global concepts, including the Riemann integral, and then undertaking a study of more heavily quantified local concepts, in particular the various forms of the limit idea. This first volume is designed for a 4-hour, 1-semester course of approximately 48 class hours (aside from tests and reviews), although with deletions or reduced em-

phases, or with postponements of certain sections for incorporation into subsequent Vii

Viil

PREFACE

courses, the book can be used for a shorter course, such as a 5-hour, 1-quarter course of around 40 class hours. The first five chapters are sufficient prerequisite for the following volume, Basic Concepts of Calculus, with the exception of a few exercises on the reflecting properties of the conics. The special feature of this volume that should be noted is its unusual suitability as a preparation for modern courses in beth calculus and linear algebra. This is achieved through immediate use of sets and logic and the central role played by the concept of a vector space, exemplified by vector spaces of functions and vector spaces of ordered pairs, triples, and n-tuples, and the concept of an algebra, exemplified by algebras of functions and algebras of matrices. A course based on this volume would replace a traditional course in analytic geometry, while including some topics from college algebra. Such a course would correspond to the Math 0 course of the GCMC,

but at a more strictly college level.

Because of frequent cross references among the three volumes of which this is the first, and the parent two-volume book, we shall use the following abbreviations: CWAG: PCLA: BEG; SCG:

Calculus with Analytic Geometry, Prelude to Calculus and Linear Algebra, Basic Concepts of Calculus, A Second Course in Calculus.

The second volume of this set, BCC, is devoted to functions of a single real variable. The final volume, SCC, contains topics from multivariate calculus, intro-

duced as early as possible, as well as such remaining topics concerning a single variable as special techniques of integration, improper integrals, approximation methods, infinite series, and differential equations. These last two volumes, BCC and

SCC, correspond approximately to the GCMC courses Math 1 and 2, and Math 3 and 4, respectively. Each of the two volumes is designed for a 5-hour, 1-semester course of some 60 class hours, although other arrangements are possible. One alternative distribution of the material of the three volumes, for a sequence of four 5-hour quarters, is as follows: ‘

First quarter:

PCLA, BCC, Second quarter: BCC, Third quarter: PCLA, SCC, Fourth quarter: SCC,

Chapters 1-S, Chapters 1-4. Chapters 5-13. Chapters 6-10, Chapters 1 through §508. §509 through Chapter 12.

For those wishing to use the three-volume set for regular sections of a course, in conjunction with CWAG for honors sections, the following is a list of correspondences: Volume

PCLA

Chapter

1-5 6

7 8 9 10

CWAG

Chapter or Section

Chapters 1-5 Chapter 13 §§2701-2704 §§2801-2806 §§2811-2828 §§2901-2922

PREFACE

BCC

SCC

Chapters Chapters Chapter Chapter Chapters Chapter

§§2701, Chapters Chapter Chapters Chapter

6-12 14-18 21 20, §§2807-2808 30-32 22 2705-2708, 3125-3126 23-24 33 25-26 34

J.M.H.O. Carbondale, Illinois

ne

eqns

| AEE wy nes yale? ~

7. ie

~

enon. i

%.

i

~

=

‘ (a

im

te

oi

: ths

yp.

¢

ViLene Cw

PREFACE

TO

CALCULUS WITH ANALYTIC

GEOMETRY

A perennial problem facing teachers of mathematics is the determination of the proper time for introducing the student to the “delta-and-epsilon” type of mathematical rigor, sometimes referred to informally as “epsilonics.” Experience in recent years has convinced many people that much of the subject matter of “epsilonics” can be effectively taught at a much earlier stage in a student’s academic career than had at one time seemed possible. However, the fact remains that most students do have difficulty when first confronted with the limit concept. What is the best solution for the problem posed above? Should a considerable period of time be invested in a purely intuitive approach before the student is asked to face the harsh realities of mathematical analysis? Or is it possible to present “epsilonics” in such a fashion and with such a timing that the student can assimilate the essential ideas in a natural way and with a reasonably correct intuitive understanding? For purposes of discussion let it be agreed that the limit concept is a complicated and difficult one for most students new to calculus. Explicitly or implicitly the formulation of a limit statement uses not only a universal quantifier and an existential quantifier but also an implication involving two inequalities (or the equivalent in terms of a neighborhood and a deleted neighborhood). One of the principal assumptions determining the character of the present book is that the most appropriate groundwork for the study of limits is a substantial amount of detailed and careful work with inequalities, implications, and quantifiers before these are all combined into the type of compound statement needed for limits. For this reason, and also because of the many variants that involve “neighborhoods of infinity,” the treatment of limits has been placed as late as possible, in Chapter 10. This postponement of limits is made possible, in part, by the earlier discussion, in Chapter 9, of continuity (free from

the complications

of infinities and deleted neighborhoods),

but more

especially by the still earlier focusing of attention on such global concepts as the Riemann integral, in Chapters 6 and 7, and uniform continuity, in Chapter 8. Placing the introduction of the definite integral before that of the derivative has a well-recognized historical background, but it also has a sound logical justification. When the Riemann integral is approached by means of step-functions the definition of integrability involves only two quantifiers and a single inequality, and the definiXI

Xli

PREFACE

tion of the Riemann integral itself is simultaneously the supremum of one set and the infimum of another. The avoidance, at this stage, of implications between pairs of inequalities provides a natural arena for students to get a mathematical workout with quantifiers. Then later, when it is time to struggle with limit statements, the quantified portions have become familiar objects and are no longer a barrier. It is possible that some readers of this preface are not disposed at first to agree with the proposition that the global concept of uniform continuity is a simpler one than the local concept of ordinary continuity. Undoubtedly, if one wishes only to talk intuitively in terms of “approach” or “nearness” with accompanying “handwaving” gestures, uniform continuity is more complicated. However, it is when one

expresses himself in exact terms that this concept becomes simpler. More precisely, it is nearly always simpler to find an explicit 8 in terms of ¢ for a function on a set than for a function at a point. An example may help clarify this statement. For the “squaring” function x — x2 on the compact interval [3, 5] a suitable § can be found quite simply to be ¢/10, whereas for this same function at the point 3, either 8 turns out to be something like min(1, «/7) or else « must be artificially limited in size. In

fine, the position adopted in this book is that to concentrate on relatively simple global concepts before turning attention to point-wise continuity and limits makes good sense, and is preferable in the long run to the more traditional intuitive approach which necessitates an often painful relearning process later. The dominant philosophy of this book emphasizes concepts and structure, with the dual objectives of developing an appreciation for a truly beautiful and well-conceived subject and exploiting the great potentials of these concepts and structures. The concept of vector space should serve to illustrate what is meant. Vector spaces are introduced first for function spaces, with examples such as bounded functions, step-functions, and polynomials. It then becomes natural to develop the Riemann integral as a positive linear functional on the vector space of integrable functions. Then later, many of the standard theorems of continuous functions.and limits find simple expression when cast in the language of vector spaces and algebras of functions. Furthermore, these same ideas become tools when it is shown that the Riemann integral can be obtained as the limit of a sum. Later in the book, generous

attention is again devoted to vector spaces and algebras, this time in the context of matrices and vectors in two and three dimensions. Eigenvalue techniques are used effectively to simplify transformations of equations of the second degree. In order to achieve the goals enumerated above, the first two chapters contain an introduction to sets and logic. Truth sets serve to explain exactly what an implication is. The quantifier symbols V and 3 are used widely, and their role in the formulation of negations is discussed carefully and used frequently. Both the nature of a proof and the meaning of a counterexample are given considerate attention. The groundwork laid in the first two chapters forms the basis for the extensive use of both sets and logic throughout the book. It is felt that the time spent initially in forming a sound foundation for the ideas that are so essential to calculus is well worth the investment, and is amply justified by the ultimate dividends in the form of understanding and intellectual satisfaction. Alternative formulations are sometimes included. For example, several limit

PREFACE

xii

statements are expressed in terms of deltas and epsilons, neighborhoods and deleted neighborhoods, and mappings. These alternatives help provide a broad and varied texture for the underlying flow of ideas, as well as emphasizing a basic unity that pervades many apparently dissimilar settings. Much attention is given to applications. For example, Chapter 7 is devoted entirely to such applications of the Riemann integral as area, volume, work, and distance. Applications of the derivative are scattered through most of the chapters. Differential equations find their first use in Chapter 18, where they are applied to problems involving gravitation, orthogonal trajectories, radioactive decay, bacterial growth, cooling, and mixing. The final Chapter 34 returns to the subject of differential equations, concentrating on linear equations with constant coefficients and their

applications. Extremal problems receive unusually thorough consideration, with full attention given to sufficiency conditions and endpoint extrema. Area and volume are introduced axiomatically, each being a positive additive function on a ring of sets and satisfying a certain completeness axiom. For a function of a single real variable these axioms suffice for establishing the theorem that the ordinate set of a nonnegative Riemann-integrable function f on a compact interval has area equal to the integral of f. Elliptic notation (for example, “the function e* sin 3x”) that the student learns to understand and use both the strict tion. This is important since both notations abound in the of the world. Special attention should be called to the character and sections, suitable for enrichment

content in honors courses.

is discussed clearly so f and the elliptic notamathematical literature

arrangement of honors These honors sections

are planned so as to permit an honors course running concurrently with a standard course to cover the standard as well as the extra material, and also to permit exploratory reading and extra assignments for interested and capable students who are not registered in a special honors class. In order that a student may be able to transfer either out of or into an honors sequence between terms it is desirable that the honors course keep pace with the standard course to which it corresponds. This is made especially feasible for courses using this book by the uniform arrangement of the honors sections, which constitute exactly twenty percent of the sections, as follows: Every chapter contains a multiple of ten sections, including sections of exercises; every section whose number ends with the digit 9 is an honors section, and every section whose number ends with the digit 0 is the accompanying honors section of exercises. All honors material is identified by the letter H. The division of the book into two volumes is done simply to reduce the bulk of the alternative of a single tome. The place of division between Chapters 21 and 22 achieves a local minimum for cross-references between the two volumes. The use of a table of integrals is limited to the second volume. The book is designed for courses of three semesters — or four or five quarters — meeting four or five days a week. Prerequisite is a standard high school mathematical preparation, including trigonometry and college algebra. Review material in trigonometry and such topics as mathematical induction is included. Over 6000 exercises of all levels of difficulty are available, for practice, for chal-

X1V

PREFACE

lenge, and for individual exploration. These exercises form an essential part of the book, and are designed to fortify and deepen, as well as guarantee, learning. Answers to nearly all problems are given in the back portions of the two volumes. Illustrative examples are liberally provided. The author wishes to express his appreciation for the extensive aid and numerous Suggestions given by Professor R. W. Brink. He is also indebted to many others for their helpful comments, given both informally in conversation and more specifically

on paper. J.M.H. O. Carbondale,

Illinois

CONTENTS

ELC CAC Cte

Rae

Sede

RUE

Dh

ear,

Preface to Calculus with Analytic Geometry

1

101 102 103 104 105 106 107 108 109 110 | a 112 ELS 114 115 116

RELATIONS,

AND

es

ANA

....... 0.00 cece ees

Vil Xl

FUNCTIONS

MEVECvETUG (SON ates Fe NP Pre ape Sey be gh osaig sees Gresfaint gens Sra De CSMICTITOCTS TAI SUDSCUS) 5 clog Gieh Sieste cease cued pele Sa a AUS we yslewerisls AG IVE Ese saa aCOMP EINECS bent eke eS con loss ore be Ste Ge wanes as (CUB COTIISE NYS TUT EET EY OTe ve les rae ra ae oan ere NINet sec IONS COMUIMAIONS | 420.0 2. sateen cie eos oon en's Lier eke & Sw Tere SG a Men CIABEAINS TOE StAtCIMeMt(S co. c5e hina ian eae «o's 00>.mS nsete) DG steve 8 SYSLOGTSS? SOUR UTSTaRS © Be aes eek ape Rony em sr ren TEPC) RCPS ES «a ek SihsySin nr RN ea ele cee am eA inG@oduction torvaniables-and TUNCtIONS forall there exists Vr

3

301 302 303 304 305 306 307 308 309 310 311 312 313 314 315 316 one

os

eee

Eos

THE

REAL

NUMBER

SYSTEM

Introductions binary operavionSi.. 46 4 es we aes ee eee AXIOMS. O0 a:lGld 5 time. eae cesnnn eects hocks howtos) Oca se ee PIXON CISES) «SAORI Pe cabole 8 Gare Feo tO RINE SAE See Ee eee ee ee Ordered fields: sierra ee Ne eR Se Ie cow SRaen ae eens Cae eee |S). Co)he) (on Gentian Ra ne a PI NE GSE meant VyCeara sn ar OCS, ) 6)ts Natural’ numibers.2 3 with domain: x € ®, then p(5) is the true statement 5 > 3 and p(1) is the false statement 1 > 3. If p is the formula fis an even function, where the domain of p= is the set of all real-valued functions of a real variable with domain ®, then P(f) is a true statement if fis the function defined by f(x) = x? for x € ®, and p(f) is a false statement if fis the function defined by f(x) = x3 for x € &.

Example 5. The solution sets of the ten formulas of Example 3 are: Hoes Way for (12): @ (there is no solution that is a member of 9v, the set of natural numbers); for (13): the collection of all nonempty subsets of @; for (14): the collection of all finite subsets of @, including Q; for (15): the set of all points (x, y) of the Cartesian plane ® X ® such that x + y = 4; this set corresponds to a /ine in the Euclidean plane £2 (cf. Chapter 4). for (16):

the subset of the solution set for (15) such that x and y are integers;

tony) 2 ds 2)n@r0)i: fOLLs)eml-4)-(2,.3)(254),.(3, 2), (3,.3),.G, 4)}5 for (19): the collection of all even functions with domain = ® and range C ®&; for (20): @ (there is no odd function with domain = ® whose values are ail positive).

Any two formulas with the same domain and the same truth set are said to be equivalent. Example 6. formulas

(24) are equivalent.

If the range of the variable 7 is the set SU of all natural numbers, then the two

A 2a

nS

ELEMENTARY

32 202

NEGATIONS,

DISJUNCTIONS,

LOGIC AND

IN MATHEMATICS

[$202

CONJUNCTIONS

For simplicity of notation, and not out of any basic necessity, we shall limut our considerations in this section to formulas involving a single variable x. We shall also assume (again for simplicity) that all formulas under consideration have a common domain D. In the main it will be convenient to denote the truth set of a formula by the upper case or capital form of the letter used for the formula. Thus the truth set of the formula p will be denoted P and the truth set of the formula q will be denoted Q: (1)

P=

oa

Ce) Isstries.

OF

Geos

tinue.

If Sis any statement, its negation, denoted ~S, is the statement “‘it is false that S,” although in informal practice any statement logically equivalent to “‘it is false that S”’ is called the negation of S and denoted ~S. For example, if x is a particular real number and S is the statement “‘x is negative,’ then ~S can be formulated in the following fashions (among many): “‘it is false that x is negative,” “‘x is not negative,” “x is nonnegative,” ‘‘x is positive or zero,” and ‘‘x is greater than or equal to 0.” In any case, whenever S is a true statement ~S is a false statement, and whenever S is a false statement ~S is a true statement. Finally, S is the negation of its negation (S = ~-~S), and we have:

ae is true if and only if S is false, ~S is false if and only if S is true.

(2) Example 1.

The following are examples of statements and their negations:

GneeNesey o Hil BG

Wo, Be a

OD)

5341,.2) E12,

al:

(5)

S: Every even function is a polynomial, ~S: There exists an even function that is not a polynomial.

In (3), S is true and ~S is false.

In (4) and (5), S is false and ~S is true.

We now extend the concept of negation to formulas, as follows: Definition I. Jf p is any formula, with domain D, its negation, written ~p, is the formula with domain D whose value at any point x of D is the negation of p(x):

(6)

(~p\(x) = ~(p(x))

(domain: x € D).

Equivalently, ~p is the formula with domain D whose truth set is the complement P' of P in the universe or space D:

(7)

{x |(~p)(x) is true} = {x |p(x) is false} = P’ = D\ P.

(Cf. Fig. 202-1.)

§202]

NEGATIONS,

DISJUNCTIONS

AND

CONJUNCTIONS

33

Figure 202-1

Example 2. Ifp is the formula defined

(8)

2x —j)-(domain:

x €

2, 354),5});

the related truth set is P = {1, 2, 3}. The negation ~p of p may be written (9)

2x)

a(domainaex

© | ly 23e4.5)0))

Wwithetruthyset Pp’ — {455}! Example 3.

The formula

(10)

x 23

(domain:

x € @),

with truth set {x € ®| x = 3} has the negation (11)

x 6 (domain:

x € ®).

22.

36

204

ELEMENTARY

LOGIC

IN MATHEMATICS

[§204

IMPLICATIONS, CONVERSES, AND CONTRAPOSITIVES

As in §202 we shall assume for convenience that all formulas involve a single variable x and have a common domain D. Definition I. Let p and q be formulas with common domain D and with truth sets P and Q, respectively. Then whenever P is a subset of Q, P C Q, p is said to imply q, andq is said to be implied by p, with the notation p = q, read “‘p implies q,” or gq=p, read “‘q is implied by p.”’ In short:

(1)

p=9¢=

F.C.

The statement p does not imply q, or q is not implied by p, denotedp + q, or q # p, is the negation of the statement (1):

(2)

(p#q) = ~(P$9)

and is the same as the statement P is not a subset of Q:

(3)

pHq=P¢

Q.

The statement p = q is called an implication, and the statement p # q is called the negation of an implication. Discussion. The implication p = q states that g must be true whenever p is true. If we write the inclusion P C Q inthe form P\ Q = P(\ Q’ = @, we see that the implication p= q can be expressed by the statement that for no point of D is it possible to have p true and q false. This is sometimes expressed as follows: “‘It is impossible forp to be true and q to be false simultaneously.”” The negation p 4 q, on the contrary, states that the set P is not contained entirely within the set Q and therefore intersects the complement of Q: P(\ Q’ # @. In other words, p 4 q means that there must be a point of D for which p is true and gq is false. These observations point up the fact that an implication expresses nonexistence (of members of the set P (\ Q’) whereas its negation expresses existence (of at least one member of P\ Q’). Above all it should be stressed that the negation of one implication is never another implication. (In particular, the negation of the implication p = q is NOT the implication p = ~q!) (Cf. Fig. 204-1.) In proving that an implication p = q isfalse it is necessary to establish the existence of a point x belonging to P and not to Q@. Sucha

ena!

PAA Figure 204-1

LG

§204]

IMPLICATIONS,

CONVERSES,

AND

CONTRAPOSITIVES

37

point x for which p(x) is true and q(x) is false is called a counterexample. Finally, if explicit expressions for p(x) and q(x) are available, it is often convenient to use the notation p(x) = g(x) to mean p= q. This is illustrated at the end of Example 2, below. Example 1.

Let p and q be the formulas with domain ®, given in terms of the variable x:

(4)

JOR

SEO.

GR

Be 5. Therefore the negation can be expressed in the form (x S$ 1) V (« > 5). Since (4) is a disjunction, its negation is the conjunction of the two negations x = O0and x S 10, which can be written in the concise form 0 S x S 10.

Example 7. Prove the equivalence of the two formulas A C B and B’ C A’, where A and B are subsets of a space S and the common domain of the two formulas is the set of all ordered pairs (A, B) of subsets A and B of S:

(39)

AGB

=

B GAS

Solution. The formula on the left of (39) is equivalent to the formula (that is, the ordered pairs (A, B) for which A C B are precisely those for which (0). Similarly, the formula on the right of (39) is equivalent to the formula B’ ()\ Finally, since B’ ()\ (A’)’ = B’()\ A = AT) B’, it follows that the two formulas equivalent.

4/)\B’ = @ 4 () B’ = (A’)’ = @.

of (39) are

If S and T are statements, the converse of the implication S = T is the implication T= S. If p and qg are formulas having a common domain, the converse of the implication p = q is the implication q => p. The converse of a true implication may or may not be true. In fact, any of the four combinations of truth and falsity for an implication and its converse are possible. An implication and its converse are both true if and only if the two formulas involved are equivalent. On the other hand, an implication and its converse may both be false without having equivalent formulas. (Cf. Example 8.) Example 8. The formulas in the following implications are all assumed to have the common domain &: (40)

x ~x

< 4is true; the converse x < 4= x < 2 is false;

$204]

IMPLICATIONS,

CONVERSES,

AND

CONTRAPOSITIVES

(41)

x x? < 2x is true;

(43)

x 2 is false; the converse

x

x >

4]

x < 4is true;

2=> x < 5 is false.

The contrapositive of the implication p => q, where p and q have a common domain D, is the implication ~g => ~p. If P and Q are the truth sets of p and q, respectively, then their complements P’ and Q’ are the truth sets of the negations ~p and ~g, respectively. Since the implication p = q is equivalent to the set inclusion PC Q, and since the implication ~q = ~p is equivalent to the set inclusion Q’ C P’, it follows from Example 7 that the implication p => q and its contrapositive ~q = ~p are equivalent statements whenever p and q are formulas with a common domain:

(44)

Law of Contraposition: (p =q)

= (~q

> ~p).

The equivalence (44) is of great importance in mathematical proof, which is not surprising in view of the extent to which mathematics involves the establishment of implications. The most common method of establishing the implicationp > q is to assume that x is an arbitrary member of the truth set P of p— that is, that p(x) is true — and then to show that x must consequently be a member ofthe truth set Q of q. On frequent occasions, however, it is more practical — or even essential — to make the assumption that g(x) is false and to pursue logical reasoning from that point of departure in order to infer that p(x) must also be false. This method of proof will be referred to as the contrapositive method. A veriant on the contrapositive method of proof is the indirect method, also called proof by contradiction or reductio ad absurdum.

By this method

one assumes

that the implication p = q is false by assuming the existence of a point x such that p(x) is true and q(x) is false, and then draws an inference incompatible with at least one of the two fundamental assumptions: p(x) is true and q(x) is false. The idea of this form of proof lies in the following way of saying that the truth set P is not a subset of the truth set Q:

(45)

Pi\ OFF

@.

The contradiction obtained from the assumption that there exists a point x belonging to P()\ Q’ is usually either of the form p(x) isfalse or of the form q(x) is true. We conclude this section with the following definition of the compound statement p=>q=", where p, q, and r are formulas with a common domain:

(46)

P=qene@rH=QA gG=”).

This is in keeping with the notation A C B C C for the conjunction of A C B and BCC, and 1 < x < 4 for the conjunction of | < x and x < 4. The notation of (46) extends in like fashion to conjunctions involving four or more formulas, It should be observed that there is no p>q>r>s, p=qa>r—>s=>u,:-:. p= q p> (=> 7"), Since both occasion to confuse (46) with either (p => q) => r or has = and g=>r are statements rather than formulas, and the implication symbol formula. been given no meaning when it is placed between a statement and a

[§205

IN MATHEMATICS

LOGIC

ELEMENTARY

42 EXERCISES

205

In each of the following exercises the formulas are assumed to have

Ik; Bae

The common

SR

Bb JOR PPK 58 eR

domain in every case is ®.

=> sh

Wee

2s JOR SES BR GRse

By,

IR ER

domain.

p= q. If p= q explain why. If p * q find a

In Exercises 1-4, determine whether p > q or counterexample.

a common

So

ADD

> 7.

OS Gna

In Exercises 5-8, determine whether S = T or S # T. Gy OIA SS 3 = Ce USO Ss ©.

OR NEO Say

By S245 3

Ss ©: Ch SASS 3 = S38 USO

Ib SRA se Sy = S28 IO SS Oe

In Exercises

9-11, prove the specified law.

9. (34), §204.

10.

Illustrate with Venn diagrams.

(36), §204.

11. (37), §204.

In Exercises 12-16, prove the stated law of logic. 12. Law of the excluded middle:

13. Commutative laws:

14. Law of addition:

p

p V ~p

= ft.

VqeqVp,pANqeq

p= (p V q).

16. Hypothetical syllogism:

Illustrate with Venn diagrams.

p.

15. Law of contradiction:

p A ~p = f.

(p= q) A G@=>r)>(:p=}/r).

In Exercises 17-20, formulate the contrapositive of the given implication. The common domain throughout is ®. Do not establish truth or falsity of any implication. Wh

0

ae

SS 2

19 (c< QV

GS

UY Se SS Geese SS al

HS 3 S07 7

2055

~p(x) is true)

@ (4x € A 9 p(x) is false);

(14)

~ (Ax € A 35 p(x) is true)= (Vx € A, ~p(x) is true) 0, 8x? — 24x +7

Negation of (10):

Vn € MN, 2" S 33.

= 0;

Implications and their negations (§203) are readily expressed quantifiers V and 3:

(15)

P=qo—(Wx € P,x € Q)

(16)

(Pp¥QgeGxeEPdx ¢€ Q) = (4x 5 p(x) is true and q(x) is false).

in terms of the

= (Vx 5 p(x) is true, g(x) is true);

Once more we see why the negation of an implication cannot be another implication: an implication is a universal statement, while its negation is an existential statement. Example 4. The following two implications (one true and one false) and their negations (one false and one true) are expressed with quantifier symbols (the domain in each case is ®): (17)

XD > BX N. In other words, we are seeking an implication of the type x > N= f(x) > M. The next order of business is therefore to find the number N, and we write

(2)

V OV Tans saat 3G 1) er

The entire statement sought is therefore

(3)

VICES

ING

Roe

N=] fo) > MM,

We now seek a formulation of the negation of (3). This we shall do bit by bit. In the first place. the negation of any statement of the form (1) is of the form

(4)

3) WWE G sees,

where the dots - - - represent the negation of the corresponding part of (1). From (2), we see that (4) then becomes

(5)

IMERIVNER,:::

and the negation of (3) is

(6)

IMECRIVNER,

> M. x >NHSf(xX)

Finally, since the negation of an implication is an existential statement, we formulate (6) in positive terms:

(7)

IMECRIVNER,

SM. Jx>NIfx%)

In words, (7) says: ‘There exists a real number M such that for every real number WN there exists a number x greater than N such that f(x) is less than or equal to M.”

IN MATHEMATICS

LOGIC

ELEMENTARY

46

[$208

Example 2. Give precise meaning to the informal statement that there are arbitrarily large values of x for which the values of the real-valued function f with domain & vanish. Then by means of quantifiers formulate the negation of this statement in positive terms, and express the result in words. We first restate the proposition, still informally but with an attempt to give adSolution. ditional meaning to it: ‘‘No matter how large a positive number N we may be given, there must exist a larger (positive) number x such that f(x) = 0.’ As in Example 1, we start:

(8)

IVECO

and continue:

(9)

WIN EW,

slse S IN Soe,

and finally:

(10)

WINE

OC

Six

WY SiGe) = ©.

The negation of (10) is constructed piecemeal:

(11)

BIIN (EG See.

(12)

INE

PIVx>N,:::,

and finally,

(13)

dN SG Ooi Ne

oO.

In words, (13) states: ““There exists a positive number MN such that for all x greater than N, F(x) is nonzero.”

By reasoning similar to that employed in the two preceding examples, we see that the negation of any statement framed in terms of the quantifiers V and J has a ‘“‘dual”’ formulation obtained by replacing each of the two symbols VY and i by the other and each of the symbols 3 and , (that is, the comma) by the other, and replacing the final portion of the statement by its negation.

208

EXERCISES

In Exercises 1-4, express the given statement in terms of either the universal quantifier or (The statements are not necessarily true.)

the existential quantifier.

1. For every positive number x, x2 — x + 4 is positive. 2. For every real number x, 36x2 + 12x is an integer. 3. There exists a natural number 7 such that 3”2 + 5x is odd. 4. There exists a linear function fsuch that f(1) = 2 and f(2) =

1.

In Exercises 5-8, express the negation of the statement of the indicated exercise in positive terms using quantifiers. 5. Exercise 1.

6. Exercise 2.

7. Exercise

3.

8. Exercise 4.

In Exercises 9-16, express the given statement in terms of quantifiers. Then formulate the negation in positive terms. Finally, express the negation in words. (The given statement is not necessarily true.)

9. For every natural number », there exists a natural number m greater than 72.

§208]

EXERCISES

47

10. For every natural number n, there exists a natural number m such that 2m = n. 11. For every positive number M, there exists a positive number N such that for every number x less than N, x + 1 is less than M.

12. For every real number M, there exists a real number

N such that x > Nimplies x2 > M.

13. There exists a real number x such that every natural number #7 is less than x. 14. There exists a real number x such that for every real number y, xy = x.

15. There exists a positive number M such that for every positive number 6, there exists a number x such that 0 < x < dandxM S11.

16. There exists a positive number M such that for every positive number N there exists a number x such that Mxy ~0. =O

— 0.

(xiv) Zero has no reciprocal; that is, there is no number y such that 0-y = 1. (xo) (x = 0) A & # 0) = Gy)

(xvi) (b # 0) A (dS (xviii) (b # 0)

Ad # >

= x1y-!, oF papsenicd xy eay

= g

(xii) (6

0) A (dx 0)

2.5 =

E45= Ce

(xix) (—1)(-1) = 1.

(xxi) (—x)(—y) = xy. (xxi) y= 0=> a = en = =

(xx) (—1)x = —x.

(xxii) —(xy) = (—x)y = x(—y). (xxiv) x(y — z) = xy — xz.

§304]

ORDERED

FIELDS

53

(xxv) (a+ b\(e + d) = ac + ad+ be + bd. (xxvi) If x2 = x- x, x2 — y2 = (x — y\(x+ y). (xxvii) The general linear equation x = —D/a. 303

ax + b = 0, a #0,

has a unique solution

EXERCISES

In Exercises 1— 4, prove the given statement or establish the given equation for an arbitrary field.

1. The additive inverse of a number is unique.

Ay)

= — xy,

. There is only one number having the property of the number 1 of Axiom II(iii), §203. WNee WwW mp

OR

een,

5. Show that the set of all nonzero members of a field with the operation of multiplication (instead of addition) is a group. 6. Show that the set consisting of the distinct numbers 0 and 1 only, with the following addition and multiplication tables, is a field:

Addition 304

ORDERED

Multiplication

FIELDS

Statements involving such phrases as “‘less than,” “greater than or equal to,” and “cannot exceed” depend on what is called the ordering of the real number system. There is more than one way to describe the properties of ® that are based on ordering, but probably the simplest is in terms of a particular subset © of & consisting of those numbers that we shall call positive. In terms of the properties specified for this set @, we shall be able to define what are called relations defined by inequalities. 99

66

Before formulating Axiom IV which, together with Axioms I-III, states that the real number system is an ordered field, we introduce a definition of closure:

Definition I. Jf T is a binary operation on a nonempty set B and if A is a nonempty subset of B, @# ACB,

then A is closed with respect to T (or closed under T) if and only if

(1)

CA xX A = OG)

¢ 4. 1%)

- Example 1. As will be seen (§309), the set of all integers is closed with respect to addition. However, the set of all odd integers is not closed with respect to addition (for example, 3 and

THE

54

REAL

NUMBER

SYSTEM

(§304

5 are odd integers but their sum is not). The statement of Property (x7), §302, can be expressed: The set of nonzero numbers is closed with respect to multiplication. However, the set of nonzero numbers is not closed with respect to addition. (Why not?)

We are now ready for the next axiom: IV. Order There exists a subset & of & such that (i) & is closed with respect to addition:

(x y)E PX CS>X+YVES. (ii) @ is closed with respect to multiplication:

(x, y)E OX

P= xVE@.

(iii) There is no number x such that both x and —x are members of ®. (iv) If x is any nonzero number, then either x or —x is a member of &. Definition II.

An ordered field is a field § with a subset & satisfying Axiom IV.

Definition III.

A number x is positive if and only ifx © &. A number x is nega-

tive if and only if —x

€ @.

Axioms IV(ii/) and (iv) together state that for an arbitrary number x exactly one of the following three statements is true: x € ®, x = 0, —x € &. In other words, exactly one of the following three statements is correct: x is positive, x is zero, and

x is negative. In particular, 0 is neither positive nor negative, and 0 is the only number that is neither positive nor negative. Definition IV. The relations , S, and = are the following subsets of the Cartesian plane ® X B:

VA IA IV

Wai) aera icaen (Cowie aoe Ch), (a YIG-x€ e)V (x= yy}, (yO Sy Sie) eG

In other words,

< y if and only if y — x is positive,

> y if and only ifx — y is positive, be eesS y if and only if eitherx < yorx = y, x 2 y ifand only if eitherx > yorx = y. Each of the four formulas defined by the four sentences x y, x Vv and x 2 y is called an inequality. These are read “‘x is less than y,” “x is greater

than y,”’ “x is less than or equal to y,” and “‘x is greater than or equal to y,” respectively. In each case the domain is the Cartesian plane ® & ®. The word inequality

§304]

ORDERED

FIELDS

55

is also used for a formula or statement resulting from substituting values for variables in any inequality as defined above. Inequalities of the form x < yorx>y (in distinction to those of the form x < y or x = y)are called strict inequalities. It follows from Definition IVthat the two inequalities x < yand y > xare equivalent and that the two inequalities x < y and y = x are equivalent. The sense of an inequality of the form x < y or x S yis said to be the reverse or opposite of that of an inequality of the form x > y or x = y. The simultaneous inequalities x < y and y < z are usually written together, thus: x < y < z, and the simultaneous inequalities x > y and y > z are usually written Xe Vo aes Lhat,as, (2)

RZ

and =. Since, in formulas (2)-(6), three variables are involved, the domain in each case is implicitly assumed to be the Cartesian product R X R X R. If either (2) or (3) holds for three numbers x, y, and z, the number y is said to be between x and z. Two inequalities of opposite sense should not be combined in the manner of (2 )(6). For example, x < y > zand x > y < zare both combinations of symbols to which we shall not attempt to assign meaning. Example 2. Solution.

Prove thatx

>0@ x € @.

Since, from Definition IV, ce SVeraO

Ee ©,

we have only to show that x — 0 = x, for every x € ®. ties of the type illustrated in §302:

But this follows from field proper-

x—-O=x+(-0)=x+0=x.

Notice that for any x the statement x > 0 is another way of saying “‘x is positive.” Similarly, the statement x < 0 is another way of saying ‘‘x is negative.” Example 3.

Establish the transitive law for each of < and >:

(7)

«Kx=-ZzEP.

THE

56

REAL

[§304

SYSTEM

NUMBER

Each of these is a consequence of the closure of ® with respect to addition. (9), since

(11)

For example, in

GV—-xy+@-y=yt(-—)+2+(-y) = 74+ (—-x) =2-x,

it follows that whenever

y — x and z ~ y are members of @, z — x must also be.

Example 4. Establish the law of trichotomy: following statements holds. (12)

SERS WA

For any numbers x and y exactly one of the

Be NE

Oe IY

Solution. Define z= y — x. Then, from the remarks following Definition III, we can infer that exactly one of the following three statements holds for z:

(13)

“ss,

s=0,

2
y, and x = y, each with domain ® X R:

AX LY)

xX 2 Vs

(x Sy)ax>y, “x>yoxsy, mx

2 y)ex 0.

(vi) The reciprocal of a positive number is positive and the reciprocal of a negative number is negative.

§305]

EXERCISES

57

(vii) The quotient of two numbers is positive if and only if the two numbers are either both positive or both negative. (viii) The quotient of two numbers is negative if and only if one of the two numbers is positive and the other is negative.

(ix) x < Oif and only if x is negative. (x) The sum of any two negative numbers is negative. (xi) Multiplication or division of both members of an inequality by a positive number preserves the order relation: If z > 0, then x < y implies xz < yz and x/z < y/z, and x S y implies xz S yz and x/z S y/z; similarly for Sands.

(xii) Multiplication or division of both members of an inequality by a negative number reverses the order relation: If z < 0, then x < y implies xz > yz and x/z > y/z, and x S y implies xz = yz and x/z = y/z; similarly for >and

305

=.

EXERCISES

In the following exercises, prove the given statement, where all letters designate real numbers. 1. The equation x2 + 1 = 0 has no real root; that is, no member x of ® can satisfy this equation. 2

1 1 Zs — VS — GS

Vier VisS aXe

4.a 0. (ii) If n is a natural number, then n = 1; that is, 1 is the least natural number. (iii) If m and n are natural numbers, then m + nis a natural number; that is, N is closed with respect to addition. (iv) Ifm and n are natural numbers, then mn is a natural number, that is, Nis closed with respect to multiplication. (v) If m and n are natural numbers and m n+

1.

By the induction assumption, and the fact that m => 2, we have m'tl=m-m>nemeand+1)=n+n2n+1,

and therefore (20) is true whenever (19) is. By the fundamental induction (Theorem II, §306) the proof is complete. Example 3.

theorem of mathematical

Prove that ® (or, in general, any ordered field S$) is infinite.

Solution. Assume that & is a finite ordered field, and let x denote its largest member, which exists by Theorem VI. Since 1 > 0, we have by addition of x to both members: x + 1 > x + 0 = x, and consequently $ contains a member x + 1 greater than its greatest member x. This is the desired contradiction to the assumption that $ is finite, and therefore 5 must be infinite. The argument just given establishes the fact that there is no largest real number. In a similar fashion (cf. Ex. 13, §308) it can be shown that there is no largest natural number.

308

EXERCISES

In Exercises 1-6, use mathematical induction to establish the formula for n € 9.

eS wah28 gy waited elegy, Hl

263

Pree

Pea

miriatel) pieced

fail Sieseal is Qn FL" One Cneaa

3.5

3. The sum of the first n odd natural numbers 1, 3,---, (2n — 1) is n?.

4, 124+ 42 +---+ (3n — 2)? = 3n(6n? — 3n — 1). §.1-24+2-34---+

nant

1) = inst

I(r

2).

6. 1-34+2-4+---+

n(n 4+ 2) = n(n t+ 1)(2n+ 7).

7. Prove the following form of the fundamental theorem of mathematical induction (Theorem II, §306): Let k be a fixed natural number and let p be a formula with domain Mt \ {1, 2,--+, k} such that (i) p(k + 1) is true,

q

(ii)

Then

p(n) is true

(OU eeepc

pt.

a

i

p(n+ 1) is true.

That is, p(n) is true for every n © SONAL, Qyeres ck}.

THE

64

REAL

[§309

SYSTEM

NUMBER

8-11, establish the given inequality for the specified values of n € MN (cf.

In Exercises Ex.'7):

Saale 2 nea

OS ag ss Sei

10. 27 > n2,n = S.

o)

11. 3" > 3, n = 4.

12. Illustrate the law of Pascal’s triangle (Theorem XI, §307) forO

Sr 0 for all x of its domain is called positive-valued or strictly positive, and a function fsuch that f(x) < 0 for all x of its domain is negativevalued or strictly negative. Example 3. Let f be the greatest integer function (f(x) = [x]), let g be the absolute value function (g(x) =

|x|), and let / be the zero function (A(x) = 0), all restricted to the domain D = [—2, 2] (cf. Example 2). Then the following inequalities are true:

(21)

if

SG See

The following inequalities are false:

(22)

fah

hss.

Finally, the absolute value of a (real-valued) function fwith domain D is denoted |f| and defined to be the nonnegative function with domain D:

(23)

If] () = [f@)I, x € D.

§505]

EXERCISES 133 Equivalently, ifg is the absolute-value function g(x) = |x| with domain @, then lf] is defined to be the composite of f by g:

(24)

lf| = g°f, domain

Example 4, Since the function f(x) = x2 — x —2 df+ ug, where \ and uw € &.

More generally, a linear combination of the n functions fi, f2,: + , of the form

(4)

fn is any function

Mfi + A2f2 + °°: +Anfns where 1, 2, °°° 5 An E RB.

The first two of the following three theorems are almost immediate consequences of the preceding definitions. The third requires mathematical induction. Theorem I. Every vector space V is closed with respect to (i) the formation of negatives and (ii) subtraction, and contains the zero function:

(aS

Ui

nen,

@) (f,8) © UX U>f —2 = 0, (iii) O€ V. Proof. Part (i) is obtained by letting \ = —1in(2). Iffand g belong to UV, then from part (7), both fand —g belong to VU, and since V is closed with respect to addition, f + (—g) € UV. Since f— g = f + (—g), part (ii) is proved. Part (iii) can be obtained by letting fand g be the same arbitrary member of U(‘U is nonempty) and using (ii).

Theorem II. A nonempty function space © of functions having a common domain D is a vector space ifand only if it is closed with respect to the formation of linear combinations of pairs of its members. (5)

Q)#) © GX

Qand

GF, sre Ul

U0

fe

enn.

§507]

BOUNDED

FUNCTIONS

137

Proof. Assume first that U is a vector space, that \ and uw € ®, and that fand g € U. By (2), both \f and yg € V, and hence by (1) their sum Af + ug € VU. Now assume (5). If \ and yu are both set equal to 1, then (5) states that (1) holds, while if

\ is permitted to be arbitrary and yp is set equal to 0, (5) states that (2) holds. fore is a vector space.

There-

Theorem III. A nonempty function space V0 of functions having a common domain D is a vector space if and only if it is closed with respect to the formation of arbitrary finite linear combinations of its members:

(6)

Nido,An GO and fi,f2,2- >; fs © O NM tan? ae

a An fr 1 0s

Proof. Since (6) implies (5) and (5) implies that U is a vector space, we have only to show that every vector space VU 1s closed with respect to finite linear combinations. Let A be the set of all natural numbers n such that the implication (6) is true. We already know that 1 € A and 2 € A, and wish to establish (7)

NnEASnN+LIEA.

Accordingly, we assume that the implication (6) holds for some particular natural number # and seek to prove as a consequence that this same implication holds for n + 1, or: (8)

M,

WIG? ep

Sift

Nae ntl

ABf2



R

oo

AON

[35°

ae Ant

Ss}

ns et

Anti fn

€ x0)

© OU

Since the right-hand sum in (8) can be written

@)

Mie

et

feet = (ATi a>

or Ansa) 4 Anel faqs

by the induction assumption this function is the sum of two members of U and is therefore also a member of U. Therefore the implication (7) is established, and by Theorem I, $306, 4 = % and the proof is complete. 507

BOUNDED

FUNCTIONS

A real-valued function is bounded above, bounded below, or Definition. bounded if and only if its range is a set of real numbers that is bounded above, bounded below, or bounded, respectively.

A similar definition applies to a functionf bounded above, bounded below, or bounded on a set A contained in the domain of f; this is obtained by applying the preceding definition to the restriction offto A. In terms of quantifiers, the three conditions for a real-valued functionf with domain

D to be bounded

above, bounded

below, or bounded, take the following

forms, respectively:

()

D> fG)- 1/x, x > 0, is unbounded above. (Cf. Fig. 507-3.) of

2b

i 2b

o|e

Figure 507-3

Solution.

Negating the existential statement (1) gives the universal statement

(7)

VDC

Gx

D

f(x) Sb,

VRC

Ie) > f(a)

or, in positive terms:

(8)

sd.

COMBINATIONS

140

OF

FUNCTIONS

[§508

For the functionf: x > 1/x, x > 0, (8) becomes

(9)

V bie Reale S 0

1) x8.

Since for positive x the final inequality of (9) is equivalent to xb < 1, we can establish the 6 > 0, choose x = 1/26. In either case, truth of (9) as follows: If 6S 0, choose x = 1, andif x 1s positive and xb < 1.

Of interest within the spirit of the present chapter is the following theorem: Theorem II. The set 8 of all bounded real-valued functions with domain equal to a prescribed nonempty set D is a vector space.

Proof. We wish to show that (/) iffand g € ® thenf+ g¢ Band (i)ifAE R and f© ®thend\f € ®B. Ineach case we use Theorem I for simplicity: (/) If |f(x)| S ¢ and |e(x)| < d for all x € D, then by the triangle inequality (Theorem I(vi/), $311), f(x) + g(x)| S |f0d| + |g@)| S ¢ + d for every x € D, and hence by Theorem I, f+g is bounded. (ii) If [fQ)| Sc, then |Af(x)| = [A|-[ fd] S [Al-e, and Af is bounded. Some of the techniques of vector spaces are exhibited in the proof of the following theorem:

Theorem III. Proof.

Every polynomial is bounded on every bounded interval.

We first show that the function x", where n € 9, is bounded if x is. This

follows from the fact that |x| = |x|" S cif |x| S c(cf. Theorem X, §307, Ex. 31, $313). If Gis the space of all bounded real-valued functions with the given bounded interval J as their common domain, then the functions (10)

DHE

Ne

cs

Gal,

are all members of ®. Since @ is a vector space and every polynomial on Jis a linear combination of the functions of (10), every polynomial on J must be a member of ®, and therefore bounded. Note. IfJisa bounded interval, and if II and @ are the vector spaces of polynomials and bounded functions, respectively, on J, then the preceding Theorem III can be expressed by the inclusion statement

(11)

TT ee:

Example 2. Find an upper bound for the values of |f| where fis the polynomial defined as follows, for the indicated interval:

(12)

fC)

ox ee

Solution. Since |x| S$ 3, we have for |f(x):

(13)

[fOd| S [xP + Slax] +2 52741542 = 44.

Therefore 44 is an upper bound for the values of |f]. 508

EXERCISES

In Exercises 1-6, determine whether the given collection of functions on & is a vector space. Justify your conclusion.

§509]

ABSTRACT

VECTOR

SPACES

141

. All polynomials of the form ax* + bx7, where a and b € &. . All linear functions ax + b, where a and b € J.

. All polynomials vanishing at at least one point. . All polynomials vanishing at 0. . All even functions f(x) such that f(0) = 1. WO ff mn a N =.

All even functions f(x) such that f(1) = 0.

In Exercises 7-12, use (8), §507, to show that the given function is unbounded above on the specified set. 7. x?,x € @.

9.

8. 1/x2,x € @.

1+ x)/x,0 XS) = Avs

(iii)

X € § and(f, 2g) € UX

(jv)

QA wWEFXFandfEe

USAC

+H g) =

fH dg,

VLSAt+ywf=AWt

uf

We shall now formulate the concept of /inear functional for a general vector space, although most applications and illustrative examples will be to vector spaces of realvalued functions.

Definition I]. A linear functional* on a vector space 0 over a field § is a function ¢ on VU into & that has the following two properties:

(i) ¢ is additive: (ii)

fandg€0>¢(f+ 8) = o(f) + 4(g),

¢ is homogeneous:

d € § and fe V=> df) = AP(/).

Nore. It will be assumed henceforth that for every vector space under consideration the field § of Definition II is the real number system @&. With § = ®, a linear functional on a vector space ‘U is simply a real-valued function on ‘U having properties (/) and (ii).

A frequently used property of linear functionals is their subtractivity: Theorem. Jf ¢ is alinear functional on a vector space © over a field §, then & is subtractive there:

(1) Proof.

(2)

fandg €

U=¢(f— g) = ¢(f) — o(g).

Since (f— g) +g =f, and ¢ is additive:

Hf) = $(f — g) + 8) = of — 8) + (8),

and the result follows by subtraction of 4(g). *Also called a linear form.

$509]

ABSTRACT

VECTOR

SPACES

143

Example 1. Let U bea vector space of real-valued functions on a nonempty common main D, and let a be an arbitrary but fixed member of D. For anyf € U, define ¢:

(3)

do-

o(f) = f(a).

Then ¢ is a linear functional since f(a) is a real number, and (i) @ is additive: o(f+ g) =

G+ ga) = f(a) + g@) = $f) + (8);

(ii) > is homogeneous:

(Af) = (f(a) = f(a)

= \d(f). For a specific illustration of the functional of this example, consider the vector space II of all polynomials on the interval J = [1, 4], and let a = 2. If fis the polynomial f(x) = x7 + x — 3 then ¢(f/) = 44 2 —3 = 3, and if g(x) = 2x — 10 then ¢(g) = 4 — 10 = —6. (Cf. Fig. 509-1.) ay

Figure 509-1

Example 2. Let U be a vector space of real-valued functions on a common domain D consisting of at least two distinct members, and let a and 4 be any two distinct members of D, to be held fixed. For anyf € U, define ¢:

(4)

of) =f@ +f).

Then ¢ is a linear functional since f(a) + f (4) is a real number, and (/) ¢ is additive:

¢(/+ g)

= (f+ gia) + F+8)6) = f@ + s@ +f) + 8) = F@ + FO) + [g@ + | = o(f) + o(g); Gi) @ is homogeneous: ¢AS) = AN@ + AAO) = A\F@ + AF) = A f(@ + f(d)] = A¢(f). For a specific illustration of the functional of this example, consider the vector space of all linear functions on the interval [1, 3], and let a = 1 and 5 = 2. If f is the function 2x — 3 then $(f) = (2 — 3) + (4 — 3) = 0, and if g(x) = 3x +5

then $(g) = (3 + 5) + (+ 5) = 19. Definition III. Let U be a vector space of real-valued functions. A linear functional ¢ on V is positive, written ¢ > 0, if and only if

(5)

20 f 20>¢(f)

and ¢ # 0, that is, there exists an f such that $(f) # 9. Example 3. The functional ¢ of Example 1 is positive since the inequality f 2 0 implies the inequality f(x) = 0 for all x of D, and hence the inequality ¢(f) = f(@) 2 0; and furthermore, if f is the constant function 1, ¢(f) = 1 > 0. f2 0 implies Example 4. The functional ¢ of Example 2 is positive since the inequality both f(a) = 0 and f(6) = 0, and hence ¢(f) = f@ + f() 2 0. Furthermore, if f is the constant function 1, ¢(f) = 2 > 0.

COMBINATIONS

144 510

THE SIGMA

OF

[§510

FUNCTIONS

FUNCTIONAL

In §509 two examples are presented to show that if, for any vector space of realvalued functions on a set D that consists of at least two points, ¢(f) is defined to be either the value of fon a fixed point of D or the sum of the values of fon two distinct fixed points of D, then ¢ is a linear functional. This idea can be extended to a domain D consisting of at least n distinct points, for any givenn € 9, with the values of f being added at a prescribed set of n distinct points of D. In the present section we shall be concerned with this general idea, but only for the specific case where D is the set of nonnegative integers: D = U {0}. For present purposes let

(1)

D=NU

{0},

and let U be the vector space of al/ real-valued functions with domain D. Let A = {m,m-+ 1,---,n} be any nonempty set of consecutive integers in D and define, for any fe U:

(2)

o(f) = fm) + flm + 1) +°°+ fn).

Then the real-valued function ¢ with domain VU is a linear functional.

(The proof

follows the lines of the two examples of $509, with the aid of the general associative, commutative, and distributive laws of $307.) This linear functional is called the sigma functional, with the notation:

(3)

b=

= De

= ss

AOL:

The notation (3) is called the sigma summation notation, or simply the sigma notation. 6 Example 1. If ¢ = YS and f(x) = x2, then 3

, DO) oa

(4)

SP

5

Gs 2

10

Example 2. If ¢ = >) and f(n) = 2n — 1, then 1

(5)

10

of) =>

Only

21 499455

E19

S100)

n=1

0 Example 3. If ¢ = DS and g(k) = k!, then 0 0

(6)

$(g) ae kt! =O!=1. =O)

There are two special comments .

be made:

.

regarding the linear functional >) that should .

i

o

(i) The letter k that appears in the last part of (3) may be replaced by any other

letter or symbol that is not already being used; it is what is called a dummy variable:

§510]

THE

()

SIGMA

FUNCTIONAL

145

I= S1o= Ssfo= (ii) If fis the constant function 1, then ¢(f) = ¢(1) can be explicitly evaluated:

tet

Dati

(8)

ian—m41.

In obtaining the results of the following examples it will be helpful to have the theorem: Theorem.

For any natural number n,

(0)

Sw ~flk = n=) — FO.

Proof.

The left-hand member of (9) is

Ol

Ol Drs

).— fn = 11,

which can be rewritten

Ce

(i ete |fre Bl) Ke

Dict

=a Lf(1) — fO)I,

or

et

=)

Ee Nf

2) by = 2h

[= fO)],

which is equal to the right-hand side of (9). Several useful summation formulas will now be derived in the following examples,

using the linearity of the functional >>, property (8), and the preceding theorem. Example 4. Derive the formula

(10) Solution.

Dk H=1t24--:+2 = Gn 4 1).

&=1

If f(n) = n?, formula (9) becomes

(11)

> [k2 — (k — 1)7] = ye (2k —4) = nr’. k=1

By the linearity of pes and property (8), (11) can be written: T

(12)

Dy

=

ka n= 1

Solving for >> k gives (10). k=1

Example 5. Derive the formula

(13) :

Yi

k=1

= PE BH

+

= alt

Qn + I.

Solution.

FUNCTIONS

OF

COMBINATIONS

146

(§511

Let f(n) = n3. Then formula (9) gives n

(14)

ss [k3 — (k — 1)3] = > Bk? — 3k +1) = 23, k=1

k=1

and hence, by properties of >) and formula (10): I

(15)

305-3 k=

> k+n=3) k=1

_

kK —Snnt+ lI t+n=e,

k=1

3D)virkt = nif? + 8 + 1) — 1) = tn? k

+ 3n + 1);

_

and‘(13) follows. Example 6. Derive the formula of Example 1, §307:

(16)

12 + 32 +.--+ (2n — 1)? = gn(4n? — 1).

Solution.

The left-hand member of (16) can be written ye, (2k — 1)?, which by the linearity k=1

nr

of Ds can be expressed I

(17)

3 (4k2 — 4k + 1) = 4 kz — 4k +> ie

By formulas (8), (10), and (13), this becomes

(18)

n(n + 1)(2n + 1) — 2n(n + 1) + 2 = Gn(4n2 + 6n + 2 — 6n — 6 + 3),

which simplifies to the right-hand member of (16).

511

EXERCISES

1, Let U be the vector space consisting of all real-valued functions on an interval /, and let a, b, and c be any three distinct points of J, to be held fixed.. For any f€ V, define ¢:¢(f) =f@+f) +f. Prove that ¢ is a positive linear functional.

In Exercises 2-5, evaluate the given sum.

k=2

n=0

5

if

4 t=22

58) Giese) j=l

6. Show that the formula (1), §307, of the general distributive law, Theorem II, §307, can be written n

n

XLV k=1

k=1

eV:

7. Show that the formula (14), §307, of the binomial theorem, Theorem XII, §307, can be written

@+y=)>) k=0

(;)aen—k yk,

§512]

STEP-FUNCTIONS

147

8. Show that the formula (2) of Exercise 14, §308, for the sum S of an arithmetic progression can be written

S= 2, (4 + (k — 1)d) = na + An(n — 1d. 9. Show that the formula (4) of Exercise 15, §308, for the sum S of a geometric progression with r ~ 1 can be written n

S=

>> af!

=

a — ar"

k=1

:

l—r

In Exercises 10-12, derive the formula. n

n

2

10. 5) 3 = 4n%(n+ 12 = (x) ; k=1

11. >> k4 = Ayn(n + 1)(2n + 1)Gnr? + 3n — 1). 12. >) AS = Ayn2(n + 1)2(2n? + 2n — 1). In Exercises 13-15, use the methods of §510 to derive the formula of the specified exercise of §308. 13. Exercise 4.

14. Exercise 5.

15. Exercise 6.

In Exercises 16-19, derive a formula for the given sum.

16. 2? + 524+ 82+.---+

(3n — 1).

17. 1-24+ 2-34+3-4+---+n(n+

1).

18.12 9)4222.3 432-4 et n2(n +1). 19, 1-22 4°2-32'43:42 +--+ n(n +1). 512

STEP-FUNCTIONS

In this section we shall discuss a vector space that is of especial importance in mathematics because of its fundamental role in the theory of integration. This vector space is the function space whose members are step-functions, which are defined on compact — that is, bounded closed — intervals in terms of nets: Definition I.

Jf J is a compact interval [a, b], a net N on I is any finite subset of I

that contains a and b, with the notation: (1)

Nios (do, Gita

2

IV

I,

where ao = a, a, = b, and the points of the net are arranged in increasing order: (2)

C=

dg

Cay

0, then A fis also increasing (or decreasing) on J. Show that if fis increasing (or decreasing) on J and if \ < 0, then Afis decreasing (or increasing) on J. function there.

Solution. If f and g are increasing functions on J, then f+ g is also, since x1 < x2 => (f+ g)(x1) = fxr) + g(x1) S f(x2) + g(x2) = (f+ g)(x2). If, in addition, at least one of these functions is strictly increasing on / a strict inequality < results, and f+ g is strictly increasing. Similar remarks apply to the sum of two decreasing functions: the sum is also decreasing, and if in addition one of the terms is strictly decreasing so is the sum. For the statements concerning scalar multiples a typical case is sufficient. Assume that fis increasing on J and

§514]

MONOTONIC FUNCTIONS

153

that \ < 0, and let x; < x2. Upon multiplying both members of the inequality f(x1) S f(x2) by the negative number

)\ we obtain Af (x1) 2 Af (x2) (cf. property (xii), §304), and hence

OPS)1) = OAf)(x2) and df is decreasing on I.

Although monotonic functions are important individually in mathematics, they lack a completely satisfactory structure as a class. For example, the space of monotonic functions is not a vector space since it is not closed under addition, as the following example shows: Example 3. Show that although both x? + x and —2x are monotonic on the interval (0, 1], their sum is not.

Solution. By Examples 1 and 2, x? + x is a strictly increasing function on [0, 1], and by elementary considerations —2x is strictly decreasing there. However, the sum of these two functions is f(x) = x? — x, which is neither increasing nor decreasing on (0, 1] (cf. Fig. 514-2): x? — x is not increasing, since 0 < 4 and f(0) = 0 > f(4) = —1; x2 — x is not decreasing

since 3 < 1 and f(4) = — i < f(1) = 0.

af

O



i os ASees 4

x

1

2

ON seh 7

Figure 514-2

Example 4. Show that the product of two increasing functions on an interval may not be monotonic there. Show that the product of two nonnegative increasing functions on an interval is increasing there. Comment on the product of two nonnegative decreasing functions. Solution. The function x and the function x are both (strictly) increasing on the interval [—1, 1], but their product x? is neither increasing nor decreasing there. Assume now that both fand g are increasing on J and are nonnegative there. If x1 < x2 (x1 and x2 € J), then multiplication of both members of the inequality f(x1) S f(x2) by the nonnegative number 2(x1) gives f(xi)g(x1) S f(x2)g(x1). Similarly, multiplication of both members of the inequality g(x1) < g(x2) by the nonnegative number f (x2) gives f(x2)g(«1) S f(x2)g(x2). Combining these inequalities gives f(x1)g(x1) S f(x2)g(x2), as desired. A similar proof shows that the product of two nonnegative decreasing functions is decreasing.

Norte 2. If ¢ is a positive linear functional on a vector space VU of real-valued functions, then ¢ has the following property:

(6)

(f,8)€

UX Vandfs

g>P/(/) S G(Q).

(The reason is that if ¢ is positive andf S g, then ¢(g — f) = ¢(g) — ¢(f) 2 0.) It can also be shown (cf. Ex. 21, §516) that a nonzero linear functional ¢ satisfying (6 ) is positive. For this reason, by analogy with (1) of this section, positive linear functionals are also called in-

_creasing linear functionals.

COMBINATIONS

154

OF

[$514

FUNCTIONS

Example 5. Find a lower bound and an upper bound ) x? — 4x + 11 on the interval {—1, 2].

for the values of the function

Solution. We analyze the given function term by term. In the first place, the function x? is monotonic on (0, 2], and 0 S x? S 4 there, and x? is monotonic on [—1, 0], and 0 S Xe there. Therefore, on the interval [—1, 2], 0 < x2 S 4. The function —4x is monotonic on (—1, 2],and —8 < —4x < 4there. We therefore have the following sets of inequalities, valid on [—1, 2]: O) See! ss 4 —8

s —4x

$4,

job es iiss i, Addition of corresponding members of these inequalities gives

Seek

a

Lies 9:

Therefore 3 and 19 are a lower bound and an upper bound, respectively, for the values of the

given function on the interval {—1, 2].

Example 6. Find an upper bound for the values ofthe function |x3 — x — 8| on the interval

Pi

Solution. Since the functions x3, —x, and —8 are all monotonic on the interval [—1, 2], we have the inequalities

and therefore —ll*

|f@)issie-and.|e(x)|' Sd.

Therefore x € D= |(fg)(x)| = |f@)g(x)| = [fC)|-le@)| S cd, and fg € B. (Cf. Theorem I, $507.)

Theorem II. algebra.

Jf I is a compact interval, the space § of all step-functions on I is an

156

OF

COMBINATIONS

[§515

FUNCTIONS

Proof. If f and g are any two step-functions on J, let $ be a net for which bothf and g are step-functions. Then for any open interval of $, since both f and g are constant there, so is their product fg. Example.

If fand g are the step-functions of Example 2, §512, given by the tables:

Occasionally it is convenient to have available the following theorem as an aid in establishing the fact that a certain vector space of functions is also an algebra:

Theorem III. 4 vector space V of functions is an algebra if and only if it is closed under squaring; that is, if and only if (3)

i SO fA

©.

Proof. Since (3) is a special case of (1) with g = f, any algebra of functions is closed under squaring. Now assume (3) and write one-fourth the difference of the

expansions (f-+ g)? = f2 + 2fg + g? and (f — g)? =f? — 2fg + g2 in the form

(4)

Ser

8) rea

hea

2

Since U is assumed to be a vector space that is closed under squaring we have the two implications

(5)

f and.g © UR

Gd 2g)2-6.0,

(6)

fandge€v>(f-

gyre.

Consequently, whenever f and g belong to U, so does the right-hand member of (4), and hence U is closed under multiplication and the proof is complete. Norte. Much as the concept of vector space can be abstracted (§509), so can that of algebra. An algebra is a vector space UO over a field $ of scalars on which there is defined a binary operation of multiplication, where the product of members f and g of © is denoted fe, such that the following homogeneous and distributive laws hold:

(i)

(Au) € FX F and (f,g)E

UVXV>Af)ug)

= Qw(fe),

§516]

EXERCISES

i)

fadhena

157

vet Daf th (f+ gh = fh + gh.

An associative algebra is an algebra © for which the following associative law holds:

(ii)

f, g, andh € U0 =

f(gh) = (fg)h.

A commutative algebra is an algebra ‘© for which the following commutative

(iv)

law holds:

fand g€ U= fe = gf.

516

EXERCISES

In Exercises 1-4, show that the given function is monotonic on the given interval.

1. sgn x (cf. Example 2, §115), [a, 5].

2. [x] (cf. Example 3, §115), [a, 5}.

3. x2 + 5x — 6, [0, 6].

4. —2x3 — 7x + 12, (—3, 2).

In Exercises 5-8, show that the given function is not monotonic

on the given interval.

5. |x|, [—8, 8].

6. x2 — 12x — 6, (5, 8).

7. |x2 — 8x|, [6, 10].

8. x3 + 100x2 — x, (0, 100).

In Exercises 9-12, find a lower bound and an upper bound for the values of the given function on the specified interval.

9. x3 — 4x + 20, [1, 4].

10. x3 — 5x — 30, (1, 3).

11. x? + 3x — 10, (—2, 1).

12. 25 — 2x — x2, [—2, 3].

In Exercises 13-20, find an upper bound for the values of the given function on the specified interval.

13. |x? — 8x + 10], [1, 4]. 15. |x3 + 5x2 — 5], (—2, 1).

14, |x2 — 3x + 1], (1, 3). 16. [14x — 3x2 — x3|, [—2, 3].

17.

at

[1, 4] (cf. Exs. 9, 13).

18.

et

(1, 3) (cf. Exs. 10, 14).

19. a.

(—2, 1) (cf. Exs. 11, 15).

20. 2)

[—2, 3] (cf. Exs. 12, 16).

21. Prove that a nonzero linear functional ¢ satisfying (6 ), §514, is positive. 22. Use mathematical

induction

to prove

that if U

is an algebra

of functions,

then

In Exercises 23-26, describe the step-function fg, where f and g are the step-functions defined in the indicated exercise of §513, by means of a table. Draw graphs of f, g, and fg. 23. Exercise

1.

24. Exercise 2.

25. Exercise 3.

26. Exercise 4.

158

COMBINATIONS

OF

FUNCTIONS

[§516

In Exercises 27-30, prove the given statement about the composite fo g of two monotonic functions, where the range of g is a subset of the domain off. Show that in each case the adverb strictly can be included. 27. Iff and g are increasing, then fo g is increasing. 28. If fand g are decreasing, then fo g is increasing. 29. If fis increasing and g is decreasing, then fo g is decreasing. 30. If fis decreasing and g is increasing, then fo g is decreasing.

Conic

601

Sections

INTRODUCTION

The mathematical curves known as conic sections, or simply conics, were originally defined and studied by the ancient Greeks in terms of the literal interpretation of the words conic and sections, that is, as (plane) sections of (right circular) cones. The principal types of curves obtained in this fashion are known as parabolas, ellipses, and hyperbolas, although circles as special cases of ellipses, and pairs of straight lines as limiting cases of the other curves, are usually considered to be members of the general family of conics. Since their original introduction, conic sections have proved to be of great interest in themselves, and of the utmost importance scientifically. We mention in passing the central role of conics in planetary, satellite, and stellar motion as studied in astronomy and space physics, and the reflecting properties of these curves and their uses in optics. In order to obtain results more quickly and efficiently than is possible when conic sections are defined and treated as sections of cones, we shall depart from the classical approach and instead define and study the various conic sections as particular types of Joci. For a proof of the equivalence of these two approaches to conic sections, see C. B. Morrey, Jr., University Calculus with Analytic Geometry (Reading, Massachusetts, Addison-Wesley Publishing Co., 1962), pp. 268-270. In this chapter all loci will be assumed to be in the Euclidean plane. 602

THE PARABOLA

Definition. 4 parabola is the locus of a point that is equidistant from a fixed line D called the directrix of the parabola and a fixed pointf not on D called the focus of the parabola.

159

160

CONIC

SECTIONS

[$602

+c = 0 and if the If the directrix D of a parabola C has the equation ax + by C consists of all parabola focus is the point (x1, y1), then by (1) and (3), §413, the points (x, y) such that lax + by + c¢| = V(x — x1)? +O — yi)? Va2 + b2

(1)

Since equation (1) is cumbersome to handle, we choose, for a given parabola C, a coordinate system that gives a much simpler equation. A particularly appropriate set of axes is obtained by letting the x-axis be perpendicular to the directrix D and pass through the focusf, directed positively from D towardf, and by letting the y-axis be parallel to D and halfway between D andf. (Cf. Fig. 602-1.)

Figure 602-1

Letting the coordinates of f be (c, 0), where c > 0, the equation of D becomes

x = —c,orx+cz=0.

Therefore the distance of a point (x, y) from D is |x + c|

and its distance fromf is \(x — c)? + y2, so that equation (1) in this coordinate system is

(2) By Exercise

Poaebetale

Oe au

7, §305, equation (2) is equivalent to that obtained by squaring:

OnE) Pe

0) Taye,

which simplifies to (3)

y? = 4cx.

The parabola of Figure 602-1 is said to “open to the right.” Similar derivations hold in the following cases, where c is assumed to be positive: (i)f= (—e, 0) and D is the line x — c = 0 (cf. Fig. 602-2a); (ii)f= (0, c) and D is the line y + c = 0 (cf. Fig. 602-25); (iii) f= (0, —c) and D is the line Vi C20 (cf. Fig. 602-2c). (Give derivations of the equations appearing in Figure 602-2, in Ex. 23, §604.)

$603]

VERTICES

DIFFERENT

y

FROM

THE

ORIGIN

161

ey

Vy

D

f (0, c) ,0 (= ¢,{ ) Vo

x

v

3

f x=c

y=-c

D

y?= —4cx

x= dey

x? =—4cy

Parabola opening to the left

Parabola opening up

Parabola opening down

(a)

(b)

(c)

Figure 602-2

For any parabola the line through the focus and perpendicular to the directrix is called the axis. From (3) we infer that the axis of any parabola C is a line of symmetry of C. The point on the axis of a parabola halfway between the focus and the directrix is called the vertex of C. In the parabola (3) and in the parabolas of Figure 602-2 the vertex is the origin in each case. Example 1.

Find the focus and the directrix of the parabola x2 = 20y.

Solution. The equation x2 = 20y is of the type x? = 4cy of Figure 602-245, with c = 5S. Therefore the focus is (0, 5) and the directrix is y = —5. The parabola ‘‘opens upward.” Example 2.

Write the equation of the parabola having focus (— 3, 0) and directrix x = 3.

Solution. This parabola is of the type illustrated in Figure 602-2a, therefore has equation y2 = —12x.

603.

VERTICES

DIFFERENT

FROM

with c = 3, and

THE ORIGIN

Consider the parabola C with vertex at the point (A, k), with focus c units to the right at the point (h + c, k), and with directrix vertical and c units to the left with equation

x =

h—c,orx—h+c=0.

(Cf. Fig. 603-1.)

Then from (1), $602,

the equation of C is

(1)

Pie) — NG Se ee (irk).

Squaring gives

Ce

2c)

= (x hh)? — 2x — hb) ce? + (y — ky,

or, equivalently,

(2) for the equation of C.

(y — k)? = 4c(x — h)

162

CONIC

[§603

SECTIONS

oy

ie (hk) (ht c,h) x=h—c

(y — k)’we l| 4¢(x —h) Figure 603-1

Similar derivations hold for parabolas opening to the left, up, or down, as indicated in Figure 603-2, where c is assumed to be positive in each case. (Give derivations of the equations appearing in Figure 603-2, in Ex. 24, § 604.) Example 1.

Write the equation of the parabola with focus (5, —2) and directrix y = 4.

The vertex v must be on the vertical line x = 5, with y-coordinate the average of Solution. —2and 4. Thereforev = (h, k) = (5, 1). Since c must be equal to 3, the equation of Figure 603-2c gives (x — 5)? = —12(y — 1), or x? — 10x + 12y +13 = 0. (Cf. Fig. 603-3.)

If equation (2) or any of the equations of the parabolas of Figure 603-2 is expanded the resulting equation in the two variables x and y has the following two properties: (i) (ii)

There is no term involving the product xy. The equation is quadratic in one of the variables and linear in the other. yy

PS

y

ipa e

(h, k + c)

; y=kr+e

/

t

v|(h, k) U\(h, k)

~G

Oe

x

iS

O

words

(hk —c)

(y—k)?=—4c(x—h)

(x—h)?=4e(y—k)

; (x—h)°=—4c(y—k)

Parabola opening to the left

Parabola opening up

Parabola opening down

(a)

(b)

(c)

Figure 603-2

x

$603]

VERTICES

DIFFERENT

FROM

THE

y

ORIGIN

163

y

(a)

(b) Figure 603-3

In Chapter 9 (cf. §910) we shall study conic sections by means equation of the second degree, having the form: (3)

of the general

ax? + 2cxy + by2 + 2px + 2qyv+d=0.

We can now state a converse to the preceding statement about (7) and (ii). If an equation of the form (3) satisfies (/) and (ii) its graph is a parabola. The technique of showing this combines completing a square (cf. §413) and factoring a constant, and is illustrated in the two examples that follow. Example 2. Solution.

Describe in detail the graph of the equation x? — 6x — 8y — 7 = 0. We complete a square with the variable x:

(4)

Cer

Ot)

=e)

and factor the constant 8 on the right:

(5)

Ca 32 = 80

2):

By comparison with the equation for Figure 603-24, this represents the parabola that has vertex (3, —2) and opens upward (8 is positive) with the distance between the vertex and either the focus or the directrix equal to 2(8 = 4c). Therefore (cf. Fig. 603-35), the focus is (3, 0) and the directrix is y = —4. Example Solution.

3.

Describe

in detail the graph

of the equation

3y? + 15y — 9x —1=0.

Division by 3, completing a square with y, and factoring out the constant 3 give

(6)

(y? + Sy + 48) = 3x +34 38,

(7)

(ye)? = 3c +38).

Therefore the graph is the parabola that has vertex (—4%, — 3) and opens to the right with

c = tr: 8. Its focus is thus so) 1

aleo

(—$2 + 3, — $) = (—1, — 8) and its directrix is x = —3% — 2

[$604

SECTIONS

CONIC

164

The following facts are almost immediate consequences of the basic principles set Norte. forth above: The graph of the quadratic function y = f(x) = ax? + bx + c, wherea ¥ 0, isa parabola. This parabola opens up if a > 0 and opens down if a < 0. The vertex of this : iy b b 4ac — b? ee parabola is the point sare — = - Similar statements hold when

2a

2a ssa

the variable x is given as a quadratic function of the variable y.

604

EXERCISES

A graph should accompany each of the following exercises.

In Exercises 1—4, find the focus and the directrix of the given parabola. Ire

0y:

In Exercises

We WE = Sy. 5-16, write the equation

3. y? =

=x.

of the parabola

4. x2 =

—12y.

satisfying the given conditions.

5. Focus (0, 6), vertex (0, 0).

6. Focus (—5, 0), vertex (0, 0).

7. Directrix x =

8. Focus (0, —4), directrix y = 4.

—1, vertex (0, 0).

9. Vertex (0, 0), focus on the x-axis, passing through (2, 5). 10. Vertex (0, 0), horizontal directrix, passing through (2, —5).

11. Vertex (—1, 4), focus (2, 4).

12. Vertex (3, 1), directrix

13. Focus (2, —1), directrix x = 6.

14. Focus (—5, 3), directrix

y = 0. y =

—1.

15. Vertex (—3, 1), vertical directrix, passing through (0, 0). 16. Vertex (2, 4), focus on the line x = 2, passing through (0, 0). In Exercises 17-22, find the vertex, focus, and directrix of the given parabola.

17.

y2 — 20x + 14y — 11 = 0.

18. x2 + 16x — 36y — 260 = 0.

19. x2 + 18x + 14y + 67 = 0.

20. y2 + 22x

— 8y + 60 = 0.

21. 3y2 + 16x + 36) + 92 = 0.

22. 5x2 — 100x + 28y + 556 = 0.

23. Derive the equations of Figure 602-2. 24. Derive the equations of Figure 603-2. 25.

Prove

the statements

of the Note,

§603,

concerning

the function

ax? + bx +c.

In Exercises 26 and 27, use the definition of a parabola and equation (1), §602, to find the equation of the parabola with the given focus and directrix. Simplify the equation to a form without radicals. 26. Focus (4, 4), directrix x + y = 0. 21. Focus (—%, —+%), directrix x-+ y = 4.

28. A cable of variable density hangs in the form of a parabola with vertical axis of symmetry. If the ends of the cable lie at the endpoints of a horizontal segment 300 feet long and the lowest point of the cable is 100 feet below the midpoint of this segment, what is the vertical distance to the cable from the points of trisection of this segment?

§605]

THE

ELLIPSE

165

29. A parabolic arch with vertical axis of symmetry has a horizontal base of length 2a and altitude 4. Find the equation of the arch in an appropriately chosen coordinate system. 30. The latus rectum of a parabola C is the chord of C through the focus of C perpendicular to the axis of C. Prove that the length of the latus rectum of a parabola is twice the distance between its focus and directrix. 31. Prove that a parabola with the origin as focus and the x-axis as axis of symmetry must have an equation of the form y2 = 4c(x + c),c ¥ 0.

605

THE ELLIPSE

Definition. An ellipse is the locus of a point the sum of whose distances from two fixed points is a constant. The two fixed points are called the foci of the ellipse.

If the two foci of an ellipse are identical, the ellipse is a circle.

We shall assume

henceforth in this chapter that the ellipses under discussion are noncircular, that is,

that the two foci are distinct. For simplicity we shall choose a coordinate system so that the two foci lie on the X-axis equidistant from the origin, with designations:

fi:(c,0), f2:(—c, 0), where c > 0. (1) Since for any point p in the plane other than a point on the x-axis between f; and f2, the sum of the distances of p from fi and from /2 is greater than the distance 2c between these two foci, we shall denote the constant sum of the two distances in

the definition of an ellipse by 2a, where

(2)

Gc.

(By means of (2) we make explicit our assumption that an ellipse is not simply the line segment joining the two foci.) Accordingly, the equation of an ellipse in the coordinate system chosen assumes the form

(3)

Vx — 0)? + y2 + VO + €)? + y? = 2a.

(Cf. Fig. 605-1.)

Figure 605-1

[$605

SECTIONS

CONIC

166

It is our purpose now to simplify equation (3) to a form without radicals. The simplest procedure manipulatively is to transpose a radical, square, and after simplifying the resulting equation, to arrange terms with the single remaining radical on one side, and square again. We shall not follow this procedure since, in general, squaring does not yield a result equivalent to the equation (or inequality) upon which the squaring is applied, and the justification fer squaring for this particular method is difficult. Although the algebraic manipulations are more awkward, we shall proceed by squaring the two members of (3) as they stand. This first squaring step is justified since the two members of (3) are nonnegative (cf. Ex. 7, $305). The result can be written

(x2 — 2ex + €? + y2) + (x2 + 20x +e? + y?)

aPC? Mee

a eo

od

Cp

a eye

= len,

or after simplification, (4)

4c2x2 =

VO? + y2 + c?)2 —

2a? —

(x? + y? 4+ c?).

Squaring the two sides of (4) gives, after simplification:

a(x2 + y2 + ¢2) = c2x2 + af,

(>)

We pause now to show that equations (4) and (5) are equivalent. In the first place (4) implies (5) since u = v always implies u2 = v2. To show that (5) implies (4) we show that (5) implies the inequality (x? + y? + c2) < 2a? in order that we may infer that the squaring of (4) involves only squaring of positive quantities. Accordingly, we assume (5) in the form (a? — c2)x2 — (a? — c?)a2? = —a?’y? to infer (a2 — c?)x? S (a? — c*)a2, and consequently, since a2? — c2? > 0, x? S a2. Therefore, from (5), a2(x2 + y2 + c?) = c?x2 + a4 S c?a2 + a4, and hence

tee) eel ee

Coe

eed tn,

as desired. We continue by writing (5) in the form

(6)

(a? — c2)x? + ary? = aX(a? — 2),

and therefore, with

(7)

b=

GPa?

bb? = a2 = e2, 2b2 4 et =a

the equation of the ellipse simplifies to

(8) or

9 (9)

b?x? + ary? = a2b?2,

tone ptt

|

It is clear from the form of equation (9) (cf. §115) that the ellipse (9) is symmetric with respect to each coordinate axis and the origin. The points (—a, 0) and (a, 0) are called the vertices of the ellipse (9). The segment joining the vertices is called

$605]

THEDELEIPSE

167

the major axis, the segment joining the points (0, —b) and (0, b) is called the minor axis, and (0, 0), the common midpoint of the major and minor axes is the center of (9). The terms major axis and minor axis are also used to designate the lengths of the corresponding segments just defined, 2a and 2, respectively. The numbers a and 6 are called the major semiaxis and the minor semiaxis, respectively. The Pythagorean relationship among the numbers a, 5, and c (a being the hypotenuse) is shown in Figure 605-2a. If the foci are taken on the y-axis, at the points (0, c) and (0, —c), a derivation similar to the one given above leads to the equation (cf. Fig. 605-2): tv

Hy) to

(0, b)

vertex / focus @

2 2 ae

ty

(0, y —b)

vertex

(a)

|(0, —a)

(b) Figure 605-2

The numbers a, b, and c are again

In this case the vertices are (0, a) and (0, —a).

related by (7), and the major and minor axes and semiaxes are defined by analogy with the definitions given above. Example 1. axis 6. Solution.

Find the equation of the ellipse having foci (2, 0) and (—2, 0) and major

With the notation used above, c = 2,a = 3, and b? = at —c?

2

2

=9 —4=5.

é

Therefore the equation is = + = = 1. (Cf. Fig. 605-32.)

Example 2.

Solution. (11)

Find the vertices, major axis, minor axis, and foci of the ellipse 6x2 + 5y? = 90.

Division by 90 puts the equation in the form x2 2 1s + ee

1

CONIC

168

[$606

SECTIONS

y

oy,

V5

x

—3

x

3

=Vi5

—V5 oe

ce

[Ope

5

(a) Figure 605-3

Either from (11) or from the fact that the x-intercepts are +15 and the y-intercepts are se vi8 (as can be seen directly from the original equation), we know that the foci are on the

y-axis, and that a = 18 and 6 = 15. Therefore the vertices are (0, + «/18), the major axis is the segment (0, —18)(0, 18), and the minor axis is the segment (—V15, 015, 0). To find the foci we use (7) and get c2? = a? — b? = 18 — 15 = 3,andc = V3. Therefore the

foci are the points (0, + 3). (Cf. Fig. 605-35.) Norte. The definition of an ellipse given in this section is based on the following property: If two pins are placed in a sheet of paper at a distance 2c, if a closed loop of string of length 2c + 2a is placed around the pins, and if a pencil point is placed within the loop, then the curve traced out by the pencil moving so as to keep the loop taut is an ellipse with major axis 2a and foci at the two pins.

606

ECCENTRICITY

The eccentricity e of an ellipse is defined to be the ratio of (/) the distance between its foci to (i) the length of its major axis. In the notation of §605,

(1)

pags 2a

1s

a

a

a

Since 0 < c < a, e satisfies the inequalities

(2)

O

oral:

A circle can be considered as a special case of an ellipse with eccentricity zero. In this chapter we are eliminating circles from consideration, and therefore we shall adhere to the strict inequalities (2).

$607]

CENTERS

Example 1.

DIFFERENT

FROM

x2

2

5

7

Solution. In the notation of (10), §605, e? = c*/q? = 2/7. Therefore e = 2 = 114.

Solution.

ORIGIN

eal a2 =7

and

42 = 5, so

that

c2 =2

and

Write the equation of the ellipse with eccentricity } and foci (+3, 0).

In the notation

of §605,

c = 3, and therefore

b? = a? — c? = 36 — 9 = 27, the equation can be written

607

169

Find the eccentricity of the ellipse

(3)

Example 2.

THE

CENTERS

DIFFERENT

FROM

a = c/e = 3/3 = 6. D + = = Il

Since

THE ORIGIN

If an ellipse has center at the point (A, k) and foci on a line through (A, k) parallel to one of the coordinate axes, then derivations similar to those of $605, with x and y replaced by x — A and y — k, respectively, lead to the equations given in Figure

607-1. If either of the equations given for the ellipses of Figure 607-1 and put in the form (1)

ax? + 2cxy + by? + 2px + 2qvy

is expanded

+d=0

of the general equation of the second degree (the coefficients a, b, and c in (1) are not to be confused with the constants a, b, and c of Figure 607-1) the following three properties of the resulting equation appear:

vertex | (h, k + a)

focus

| SN

(Ah, k +)

vertex

(h + a, k)

(h +¢,k)

(h—a,k) O

x

b

ee

(h, k —c)

k — a) vertex! (h,

O

(x —h)? ie (y—k)? Mb a2

focus

(x

oo

ib) res

y=

(a)

ee ZA

a

(b) Figure 607-1

ee

x

CONIC

170

(‘) (ii) (iii)

[$607

SECTIONS

There is no term involving the product xy. The equation is quadratic in each of the variables x and y. The coefficients of the terms in x? and in y? are unequal and of the same sign.

As in §603, for parabolas, there is a converse to the preceding statement: If an equation of the form (1) satisfies (/)-(i/i), and if its graph consists of more than one (real) point, then its graph is an ellipse. The procedure is again that of completing squares, and is illustrated in Example 2, below. Example 1. Write the equation of the ellipse with center (—2, 5), one focus at (—2, 7), and one vertex (—2, 2).

Solution.

With the notation of §605,

adapted to the present case, a = |5 — 2| = 3 and

c = |7 — 5| = 2. Therefore b? = a? — c? = 9 — 4 = 5, and the equation of the ellipse is :

;

given by the equation of Figure 607-16:

as

2)2 — 5) . ) aa se 9 ) = 1. (Cf. Fig. 607-2a.)

(—2, 2) ie.

x

(a)

O

f

(b) Figure 607-2

Example 2.

Describe the curve

(2) Solution.

(3)

x? + 5y? + 6x — 20y — 151 = 0. We

prepare

CZ

to complete

Ow

squares,

factoring

Ary

= Ay

5 from

both terms

involving y:

) = 151.

Supplying the missing terms 9 and 4 within the parentheses is equivalent to adding 9 + 5:4 = 29, and therefore the quantity 29 must be added to the right-hand member of (3) as well, giving

(4)

(53)?

5 =)

ed)

Division by 180 gives an equation of the form of that of Figure 607-la, with a= 180 = 13.42 (to two decimal places), b = 6, h = —3, and k = 2. Therefore (2) is an

$608]

EXERCISES

171

ellipse with center (—3, 2), vertices (—3 + 180, 2), major semiaxis 180 = 65, and minor semiaxis 6. Since c? = a? — b? = 180 — 36 = 144, c = 12 and the foci are (33

"0:

Again, we make explicit our assumption that a hyperbola does not consist simply of points on the line through the foci by denoting the absolute value of the difference of the two distances in the definition of a hyperbola by 2a, where (2)

ORGGEAG:

Accordingly, the equation of a hyperbola in the coordinate system chosen assumes the form

(3) (Cf. Fig. 609-1.)

WG =o? + 2 — Va + 0)? + | = 2a.

$609]

THE

HYPERBOLA

173

y

f,: (c, 0)

Figure 609-1

The algebraic simplification of (3) is similar to that of (3), §605. squaring the two nonnegative members of (3) and simplifying:

(4)

We start by

Vx? + y2 + €2)2 = 4c2x2 = (x2 + y2 + c2) — 222.

Squaring the two members of (4) gives, after simplification: (5)

a*(x2 + y2 + c2) = c2x2 + a4.

In order to show that (4) and (5) are equivalent we need to show that (5) implies the inequality (x? + y2+ c?) > 2a2. Accordingly, we assume (5) in the form (c2 — a)x? — (c? — a*)a2 = a?y? to infer (c?2 — a?)x? = (c2 — a?)a2, and consequently, since c? > a2, x2 2 a2. Therefore, from (5), a%(x2 + y2 + c2)2 c2a2 + a‘, and hence x2 + y? + c? 2 c2 + a? > a2 + a? = 2a?, as desired.

We continue by writing (5) in the form

(6)

(c2 — a2)x? — ay? = a%(c? — a2),

and therefore, with

(7)

b=V2—a@,

Bb =2-—a@,

the equation of the hyperbola simplifies to

(8)

b2x2 — a2y2 = a2b?,

or

0)

a Siig=

a+b?

= e?,

174

CONIC

SECTIONS

[$609

It is clear from the form of equation (9) (cf. §115) that the hyperbola (9) is symmetric with respect to each coordinate axis and the origin. The points (—a, 0) and (a, 0) are called the vertices of the hyperbola (9). The segment joining the vertices is called the transverse axis, the segment joining the points (0, —b) and (0, b) is called the conjugate axis, and (0, 0), the common midpoint of the transverse and conjugate axes is the center of (9). The terms transverse axis and conjugate axis are also used to designate the /engths of the corresponding segments just defined, 2a and 2b, respectively.

The numbers a and 6 are called the transverse semiaxis

and the conjugate

semiaxis, respectively. The Pythagorean relationship among the numbers a, 6, and c (c being the hypotenuse) is shown in Figure 609 -2a.

focus ¢ (0, c) vertex

(0, a)

vertex | (0, —a)

focus ¢ (0, —c)

(0) Figure 609-2

§610]

ASYMPTOTES

175

If the foci are taken on the y-axis, at the points (0, c) and (O, —c), a derivation

similar to the one given above leads to the equation (cf. Fig. 609-26):

10

Ape pated

10)

a2

b2

L.

In this case the vertices are (0, a) and (0, —a). The numbers a, b, and c are again related by (7), and the transverse and conjugate axes and semiaxes are defined by analogy with the definitions given above. Example 1. Find the equation of the hyperbola having foci (0, +4) and conjugate axis 6. (For a graph, cf. Example 1, § 610.)

Solution.

The equation is of the form (10) with

6 = 3. Since c = 4,a =

and the equation becomes

16

9 = v7,

Example 2. Find the vertices, transverse axis, conjugate axis, and foci of the hyperbola 3x2 — 5y? = 7. (For a graph, cf. Example 1, § 610.)

Solution.

Division by 7 puts the equation in the form

(11)

pasa md 3

5

We therefore have, in the notation of (9), a? = and 6? = 7, and hence c? = $+ 7 = 38. The vertices are thus (+24; 0) (+2721, 0), the transverse axis is the segment

(—321, 0)-y21, 0), the conjugate axis is the segment (0, — 21V35)(0, 235), and the foci are (+ V$$, 0) = (+7210, 0).

610

ASYMPTOTES

The straight lines that contain the diagonals of the rectangles in Figure 609 -2 have the property that if « is an arbitrary positive number, then any point of the hyperbola concerned that is sufficiently remote from the origin is within a distance less than « from some point on one of these lines. (We shall prove this statement presently.) For this reason, these lines are called asymptotes of the corresponding hyperbolas. To be specific, the equations of the asymptotes of the hyperbolas in Figure 609-2 are as follows: 5 al oa OM

or

Fae

bar

oe

a 0

and

niga= 9 rie pe a = meee 3 — Tee Asymptotes of Wes

OD

he

ak aeem O

and

2

Asymptotes fam

Aa

=e

5 Oe

ee

toe

aah: =)

Us, slegeie = 0 moa

It should be noted that for either type of hyperbola, the equation of the pair of asymptotes is given by replacing the constant term | by the constant term 0, and the - individual equations of the asymptotes are given by factoring.

176

CONIC

In order to prove the statement about of Figure 609-2a, we use symmetry in quadrant, and choose on the hyperbola same x-coordinate. The problem, then,

SECTIONS

[§610

the asymptotes of the hyperbola (9), §609, order to focus our attention on the first and the line y/b = x/a points having the is to prove that the difference

byh = CNb —=— D ee = at Nx PE a

(1)

can be made less than any preassigned positive number e (however small this number may be) by requiring x to be sufficiently large. The quantity (1) can be rewritten as follows:

Fi ope a

Ne? l

Oe EN xtye—@

ee ee @xtye-a@

ab x4+yP-a

If x> a, then yx? — a2 > O and x + yx? — a? > x, and therefore

(2)

D=

CT ce

x+yx2—a2 ~ x

We now insist that x be so large that the quantity ab/x is less than e. Since the inequality ab/x < « is equivalent to x > ab/e, we have only to guarantee that the number x be greater than both a and ab/e to be certain that the difference D of (1) is less thane. This completes the proof for the hyperbola (9), $609. The corresponding proof for the hyperbola (10), $609, is similar and will be omitted. Note. Asymptotes provide a useful aid in drawing hyperbolas. After the numbers a and 5 in equations (9) and (10), § 609 , are found, the most convenient steps in drawing an approximate graph of either of these equations are usually: (i) draw the rectangles indicated in Figure 609 -2, (ii) draw the asymptotes, (i//) plot the vertices, and (iv) sketch the hyperbola. 2 he Example 1. Draw graphs of the hyperbolas ss— 9 1 and 3x2 — S5y2 = 7 of Examples 1 and 2, §609.

Figure 610-1

§611]

ECCENTRICITY

177,

Solution. For the hyperbola of Example 1, §609, we havea = y7 = 2.65 (to two decimal places) and b = 3, and thus the rectangle and vertices shown in Figure 610-la. The hyperbola itself is shown in Figure 610-15. For the hyperbola of Example 2, §609, we have a= 4421 = 1.53 (to two decimal places) and 6 = 435 = 1.18 (to two decimal places), and thus the rectangle and vertices shown in Figure 610-2a. The hyperbola itself is shown in Figure 610-24. Example 2. and y = —2x.

Find the equation of the hyperbola with foci (+3, 0) and asymptotes y = 2x

Solution. From Figure 609-2a we know that the absolute value of the slopes of the asymptotes is b/a, and therefore we have in this case b/a = 2, or

(3)

b = 2a.

Since c = 3, we also have, from the equation c? = a? + 62,

(4) From

(PPD (3) we have b* = 4a?, and hence from (4), Sa? = 9, or a? = 2, and consequently

b2 = 38

Therefore

the equation

of the hyperbola

can

2 2 be written ~ — a = 1, or a. a

202 —5y* = 36.

Figure 610-2

611

ECCENTRICITY

The eccentricity e of a hyperbola is defined to be the ratio of (7) the distance between its two foci to (ii) the length of its transverse axis. In the notation of §609, (1)

c Va + b2 es DC Aenea a =O i amy, a

Since c > a, e satisfies the inequality (2)

Cal.

pe bbee +2

CONIC

178 Example.

Find

the equation

[§612

SECTIONS

of the hyperbola

of eccentricity

2 and vertices (0, +5).

Since a = 5 and e = 2, equation (1) gives c = ae = 10, and Solution. b2 = c2 — @ = 100 — 25 = 75. By (10), §609, the equation can be written

consequently

Oe: Sees 2

Ap)

O55

is the slope of either of the asymptotes of the hyperbola of Figure 609-2a, Note 1. If then the eccentricity e of this hyperbola is equal to e = yi + m2. For the hyperbola of

Figure 609-26 the relationship is e = V1 + (1/m)? = V1 4 m?/m.

Note 2. An equilateral hyperbola is one whose transverse length, or equivalently, in the notation of §609,a = b. Each is necessary and sufficient for a hyperbola C to be equilateral (i) the asymptotes of C are perpendicular, (ii) the eccentricity

612

CONJUGATE

and conjugate axes are of equal of the following two conditions (give a proof in Ex. 37, §614): of C is y2.

HYPERBOLAS

The hyperbolas

Sedeer cue ae 2s alan l a 2 2 pe

(1)

are said to be conjugate. The conjugate axis of each is the transverse axis of the other.

They have identical asymptotes, and their foci are at the same distance from their common

center (cf. Fig. 612-1).

Example.

613.

The hyperbolas 3x2 — 5y? = 2 and 3x2 — Sy? =

CENTERS

DIFFERENT

FROM

THE

—2 are conjugate.

ORIGIN

If a hyperbola has center at the point (A, k) and foci on a line through (A, k) parallel to one of the coordinate axes, then derivations similar to those of §609, with x and y replaced by x — hand y — k, respectively, lead to the equations given in Figure 613-1.

If either of the equations for the hyperbolas of Figure 613-1 put into the form

(1)

ax

26x) 4 Dy?

is expanded and

2px + 2qy -- a = 0

of the general equation of the second degree (the coefficients a, b, and c in (1) are not to be confused with the constants a, b, and c of Figure 613-1)

the following three

properties of the resulting equations appear: (i) There is no term involving the product xy. (ii) The equation is quadratic in each of the variables x and y. (iii) The coefficients of the terms in x? and in y? are of opposite sign.

$613]

CENTERS

DIFFERENT

FROM

Figure 612-1

Figure 613-1

THE

ORIGIN

W729

[§613

SECTIONS

CONIC

180

As in §§ 603 and 607, there is a converse to the preceding statement: If an equation of the form (1) satisfies (/)-(iii), then its graph is either a hyperbola or a pair of intersecting straight lines. The procedure once more is that of completing squares, and is

illustrated in Examples 2 and 3, below. Write the equation of the hyperbola with foci (3, 10) and (3, —2), and eccen-

Example 1. tricity 2.

c= 6. By (1), §6l11, Since the distance between the foci is 2c = 12, Solution. a = c/e = 6/2 = 3. Therefore b? = c? — a? = 36 — 9 = 27. Since the center is midway between the two foci its coordinates are (3, 4), and the equation of the hyperbola —

ee

9

2

i

eee

2

BACH eels)

27

y.

ny.

focuse (3,10) See ee

(3,7)

vertex

ee (5,—4) e focus

center Peay eet

(O94)

vertex (3, 1)

oe

OT focus ¢(3, —2)

(a)

(b) Figure 613-2

Example 2.

Describe the curve

(2) Solution.

(3)

3x2 — Sy? — 12x — 40y — 95 = 0. We prepare to complete squares:

3(x2 — 4x

) — 502 + 8y

=

The next step adds 12 — 80 =

—68 to each side:

(4)

3(x — 2) — Sy + 4)? = 27.

&,

Division by 27 gives an equation of the form of that of Figure 613-la, with a = 3, be V2 = 3/15 = 2.32 (to two decimal places), A = 2, and k = ae Therefore (2) is a hyperbola with center (2, —4), vertices (—1, —4) and (5, —4), transverse semiaxis 3, and

conjugate semiaxis 215.

Since c? = a? + 6? = 9 + 22 = 22, ¢ = 6\2 = $y10 = 3.79 (to

two decimal places), and the foci are (2 + 3.79, —4), or (—1.79, —4) and (5.79, —4). The eccentricity is equal to e = c/a = 24/10 = 1.26 (to two decimal places). The asymptotes are

given by 3(x — 2)? — S(y + 4)? = 0, or

(5)

V5(y + 4) = ++3(x — 2),

and are the two lines through (2, —4) with decimal places). (Cf. Fig. 613-26.)

slopes

«4/2 =

a5 ATS =

+ 0.77 (to two

§614]

EXERCISES

Example 3.

18]

Describe the curve

(6)

3x? — Sy — 12x — 40y — 68 = 0.

Solution.

Completion

of squares as in the solution of Example

(7)

2 gives the equation

3(x — 2? — Sy + 4 = 0.

Therefore the ‘“‘curve” (6) consists of the two asymptotes (5) of the hyperbola (2).

614

EXERCISES

A graph should accompany each of the following exercises. In Exercises 1-12, show that the hyperbola satisfying the given conditions has its center at the origin, and find its equation. fem Vertexa(l10)stoci(=-2.0)) . Vertices (0, +1), eccentricity 3.

. Foci (0, +7), conjugate axis 10. . Vertices (+4, 0), through (5, 3). . Vertex (6, 0), nearer focus (8, 0), eccentricity 4. . Vertex (0, 12), farther focus (0, —13), conjugate semiaxis 5. . Vertex (0, 3), focus (0, 4), transverse semiaxis 3. . Vertex (6, 0), farther focus (—9, 0), eccentricity 3. MN NH eN © & N W . Asymptotes — i—).

y2 = 4x2, conjugate axis 8.

Asymptotes x2 = 7y2, focus (0, 4).

. Asymptotes y* = x2, through (3, 4).

—_a"

12. Difference of distances from (+5, 0) is equal to +4.

In Exercises 13-18, find the vertices, lengths of the transverse and conjugate axes, foci, eccentricity, and asymptotes of the given hyperbola.

x2 Los 15. 9y2 —

yp Got 16x? = 1.

17. x* — 3y? = 15. In Exercises

we 14.

ee 5

16. 25x? — 144)? = 81.

18. 2y2 — x? = 10.

19-30, find the equation of the hyperbola satisfying the given conditions.

19. Foci (—1, 4) and (9, 4), transverse axis 4. 20. Foci (—2, 2) and (—2, 12), eccentricity 3.

21. Vertices (5, —6) and (5, 6), conjugate axis 20. 22. Vertices (—1, —1) and (7, —1), slopes of asymptotes 23. Center (0, 3), vertex (—5, 3), focus (—8, 3).

24. Center (4, 2), vertex (4, 5), focus (4, —3). 25. Center (3, —2), vertex (3, —1), conjugate semiaxis 5.

+3.

CONIC

182

[$615

SECTIONS

26. Center (—2, —1), focus (4, —1), eccentricity 3.

27. Difference of distances from (—2, 7) and (10, 7) is equal to +6.

28. Asymptotes 3x? = 4y?, through (J, 2). 29. Vertex (10, 3), nearer focus (10, 7), conjugate axis 6. 30. Vertex (—7, 1), farther focus (11, 1), conjugate axis 12.

In Exercises 31-36, find the center, vertices, lengths of the transverse and conjugate axes, foci, eccentricity, and asymptotes of the given hyperbola. 31. 9x2 —

16y? — 54x — 64y —

33. x2 — 4)? —

10x —

35. x2 — 2y2 + 12x — 37.

loy

127 = 0.

+ 13 |=

0.

12y + 16 = 0.

32. 16x2 — 9y2 + 32x + 72y + 16 = 0.

34. 9x? — y2 + 54x + 2y + 71 = 0. 36. 3x2 — y2 — 24x + 16y —

13 = 0.

Prove the statement of Note 2, §611.

In Exercises 38-41, write the equation of the hyperbola conjugate to the given hyperbola.

a

See = hye = 3

39, 3y2 — 2x2 = 13.

40. 2x2 — 2 — 8x + 6y — 29 = 0.

41. x2 — 3)? — 8x — 6y + 16 = 0.

42. Prove that the distance between a focus and an asymptote of a hyperbola is equal to the conjugate semiaxis.

43. Prove that the product of the distances from any focus of a hyperbola to the two vertices is equal to the square of the conjugate semiaxis. 44. Prove that the distance between a focus of ahyperbola and the nearer vertex is less than the conjugate semiaxis. 45. A latus rectum of a hyperbola C is a chord of C through a focus of C perpendicular to the transverse axis of C. Prove that the length of a latus rectum ofa hyperbola with transverse and conjugate semiaxes a and J, respectively, is equal to 2b2/a.

615

LOCUS

PROBLEMS

The idea of /ocus problem was defined in §415. We are now able to consider and resolve many new locus problems, with the aid of the conic sections, parabolas, ellipses, and hyperbolas. Once again, there are two principal categories of locus problems, those framed within a given coordinate system, and those for which a coordinate system may be introduced. Example 1. Find the locus of a point p moving so that the sum of the squares of the distances ofp from the origin and from the x-axis is equal to 4. Solution.

If the point p has coordinates

(x, y), then

the required condition can

be ex-

pressed by means of the equation

(1)

(x2 + y*) + 7) = 4.

Therefore the locus is an ellipse with center (0, 0), vertices (+2, 0), and minor axis 2,2. Example 2.

Find the nature of the locus of a point moving in the Euclidean plane so that

its distance from a fixed point is equal to its distance from a fixed line.

§616]

EXERCISES

183

Solution. There are two cases. We first consider the case where the fixed point lies on the fixed line. For simplicity, we choose a coordinate system such that the fixed point is the origin and the fixed line is the y-axis. The given condition is then expressed by the equation New + y2 = |x|, or equivalently, y2 = 0, and the locus consists of the x-axis.

We now consider the case where the fixed point does not lie on the fixed line. We choose a coordinate system such that the fixed point is the origin and the fixed line has the equation x + y = 0, where y > 0. The given condition now is expressed by the equation yx? + y2

= |x + y|, or equivalently, y2 = 2yx + y? = 2y (x + 4y), and the locus is a parabola.

616

EXERCISES

A graph should accompany each of the following exercises. In Exercises 1-8, write the equation of the locus of a point p : (x, y) in the Euclidean plane subject to the given restriction. Then describe the graph. 1. The distance of p from the line x = 3 is twice the distance of p from the point (0, 0). 2. The distance ofp from the line y + 2 = 0 is three times the distance ofp from the point (0, 2). 3. The distance of p from the point (—9, 0) is three times the distance of p from the line x+1=0.

4. The distance of p from the point (0, —3) is twice the distance of p from the x-axis.

5. The sum of the squares of the distances of p from the two x — 3y = 0 is equal to 45.

lines x + 3y = 0 and

6. The square of the distance ofp from the line 2x + y = 0 minus the square of the distance of p from the line x + 2y = 0 is equal to 15. 7. The sum of the squares of the distances of p from the point (2, 0) and the line x + 2 = 0 is equal to 10. 8. The square of the distance ofp from the point (2, 0) minus the square of the distance ofp from the line x + 2 = 0 is equal to 16.

In Exercises 9 and 10, find the nature of the locus of the point p in the Euclidean plane subject to the given restrictions. 9, The sum of the squares of the distances of p from two fixed intersecting lines is a positive constant.

10. The sum of the squares of the distances of p from a fixed point and a fixed line is a constant greater than half the square of the distance between the point and the line. (The point may or may not lie on the line.) 11. A moving circle passing through a fixed point remains tangent to a fixed line not containing the point. Find the locus of the midpoint of the circle.

12, In Example 2, §615, in the case where the fixed point does not lie on the fixed line, prove that the origin is a focus of the parabola.

Polar

701

INFORMAL

REVIEW

Coordinates

OF TRIGONOMETRY

We begin this chapter with a brief review of some of the elementary properties of the standard trigonometric functions. A short description of an analytic development of these properties is given in Chapter 11 of the sequel volume, Basic Concepts of Calculus. A more complete treatment is to be found in Chapter 19, CWAG. We shall now merely list metric functions, show their measure, state fundamental basic approach at present is

the basic relations among the standard six trigonographical representations in terms of angles in radian identities, and give a few illustrative examples. The intuitive and geometrical.

We shall consider the sine and cosine functions as fundamental, connected by the identity (1)

sin2 @ + cos? @ =

l,

and therefore limited in values by the inequalities (2)

=WeStsin ots! if

—l sScoséS1.

The relationship of sin 6 and cos @ to the unit circle x2 + y? =

1, with the angle @in

“standard position” is shown in Figure 701-1, with indications of the four quadrants and the signs of these two functions in each of the quadrants. Because of the relationship exhibited in Figure 701-1, the trigonometric functions are also called the circular functions. 184

§701]

INFORMAL

REVIEW

OF

TRIGONOMETRY

(cos 57,sin 37) =

et y=1

185

(0, 1)

(cos 0, sin @)

Quadrant| 2

Quadrant il

Quadrant}

Quadrant

3

0

4

ant)

(sey'ar)

(=a)

(+,-)

>

>

(cos 0, sin 0)

;

(cos7r, sin7r) = (= 1,0)

=

(1, 0)

ee3 | \= (0, —1) COS oe 5 7, sin

6, sin 0) (cos

Figure 701-1

The graphs of y = sin x and y = cos x in rectangular coordinates are shown in Figure 701-2. These functions are odd and even, respectively: sin (—x) = —sin x, cos (—x) = cos x.

Sea

ae

i

Figure 701-2

The four other trigonometric functions can be defined in terms of sin x and cos x: (3)

tan x=

sin x , COS x

cot x =

cos x —— sin x

(4)

SEC

1 ) COs x

csc x =

1 —— sin x

=

for values of x where the denominator does not vanish, and are related by the identities:

(5) (6)

cot x

=

tan x

)

tan2 x + 1 = sec? x,

tan x

=

cot x

)

cot2 x + 1 = csc? x,

- for values of x where the denominators are nonzero.

of

[§701

COORDINATES

POLAR

186

4

(ese 6, sec @)

(cot 6, tan @)

(b)

(a)

Figure 701-3

The relationships of these functions to the unit circle are shown in Figure 701-3, and their graphs are shown in Figures 701-4 and 701-5. y

y

3 ou

+

|

-

al —

Omi

als.

{

DSN



3

3)

Oe

gt

y = tan x

y = cot x Figure 701-4

§701]

INFORMAL

REVIEW

OF

TRIGONOMETRY

Y

187

y

|

| |

|

||

||

| 4 3

O

757

ae

ee

$1

ae Z

2

i

x

i

1

Hi

'

t

sm

7

br

3

ai

Ie

Db:

T

0

5m

TT

|

|

|

|

Vi

SCC XL

y = csc x

Figure 701-5

The addition and subtraction formulas for sine and cosine are

(7)

sin(x + y) = sin x cos y + cos x sin y,

(8)

COs(x =- yp) =(cOS x COS y — sin x sin y,

where either the upper sign is to be used on both sides of the equation, or the lower sign is to be used on both sides. These are valid for all real numbers x and y. From these can be derived the addition and subtraction formulas for the other functions. For example, Ohman

Ge s> y)=

sini _ y) . sil X COS y-== cos x siny cos(x +y) cosxcosy + sin x sin y sin x cos y

cos x sin y

cos xX COS y

cosxcosy

tanx =a tany

_sinxsiny

1 -+tanxtany

~ cCosxcosy COS.X—

y = 2 cos

~F» cos

y = —2 sin ~

ee =a

sin 2.

EXERCISES

In Exercises 1-8, derive the indicated formula or verify the indicated figure of §701.

1. Figure 701-1.

2. Figure 701-3a.

3. Figure 701-30.

4. (9).

5. (10)+(12).

6. (13).

7. (14)-(17).

8. (18)-(21).

9. Derive the identity cot (x + y) =

COURACOLAeeal cot

y + cot x

10. Derive the double-angle identity cot 2x =

cot? x —

|

ZICOUnX

In Exercises 11-14, derive the triple-angle identity. 11° sin 3x = sin x@ cos? x — 1) = 3 sin x — 4 sin?’ x.

IP2, (COS She = ANeose — Si Gos se 133 tans:

is:

3 tan x — tan3 x 1 — 3 tan2 x

14. cot 3x =

3 cot x — cot? x 1 — 3 cot? x

In Exercises 15 and 16, prove the identity.

15. 8 sin* x = 3 — 4 cos 2x + cos 4x.

16. 8 cost x = 3 + 4cos 2x + cos 4x.

§703]

CURVES

IN POLAR

COORDINATES

189

In Exercises 17-27, prove the identity. 17. sin(@r += x) = cos x.

18. cos(47 + x) =

= sin x.

19. sin(ia + x) = 3\/2(cos x + sin x).

20. cos(}r + x) =

34/2(cos x * sin x).

21. tan(x + 7) = tan x.

22. cot(x + 7) = cot x.

233

tan

Xl Ain D2

1 + tan x

25. Bata

1 — tan x

COST% 5 sin x



sin x a 1+ cos x

x oy a

1+cosx = sin x



; ae a SI

OG ir (x + in).

ies

sinx =

i

°

1 — cosx z

LoSes Peal ar)

sinx — siny

tan 3(x — y)

27. sin? x — sin? y = sin(x + y) sin (x — y). 28. Use mathematical induction to prove that for every natural number 1, it is possible to express COS nx as a polynomial in cos x, and sin nx as sin x times a polynomial in cos x.

703

CURVES

IN POLAR

COORDINATES

If (p, 6) is an arbitrary ordered pair of real numbers, then (p cos 6, p sin @) is an ordered pair of real numbers and therefore determines a point p in the plane with coordinates

(1)

x =p cos 6,

=) pisini’:

The usual geometric interpretation of (1) is given in Figure 703-1, where p is shown as either a positive or a negative number and @ is the measure of an angle in standard position (having its initial side coincide with the oriented positive half of the x-axis. The numbers p and @ are called polar coordinates of the point p; p is called a radial coordinate and @ an angular coordinate of p. This relationship will be written p:[p, 6], with square brackets. The origin, given by p = 0, is called the pole of the polar coordinate system. Let us consider now the reverse problem of finding pairs of polar coordinates for a given point p : (x, y). If we square both members of the equations (1) and add the results we obtain

(2)

pe ES

eee

If p is the origin, or pole, there is no problem; p = 0 and 6 is completely arbitrary. Otherwise, if x2 + y2 > 0, there are precisely two determinations for p, one positive and one negative. For either one of these determinations we have y

ay

(pcos@, psin 8)

(|p| cos6, |p| sin6)

o


0, and 6 ranges from 0 to 27, including 0 but not including 27, then the point (a cos 6, a sin 6) moves around the

circle x? + y? = a?, occupying every point of the circle exactly once. If a < 0, the point (a cos 6, a sin @) is diametrically opposite the point (—a cos 6, —a sin 6), and the graph is therefore again the circle x? + y? = a? (cf. Fig. 703-4a). y

(a cos 6, asin 6)

(—acos@, —asin@)

(a)

(b) Figure

Example 4.

703-4

Describe the graph of the equation @ = a, where a is a constant.

Solution. The graph is a straight line through the origin, the points on one side being given by positive p, those on the other side by negative p, and the origin by p = 0. Ifa = 0, +a, +27,--- the graph of 6 = a is the entire x-axis, called the polar axis. Example 5.

Transform

to polar coordinates:

(i) x = a; (ii) y = 6, where a and 6 are

constants.

(i): Direct substitution from (1) gives the equation p cos 6 Se If a = 0, Solution. this can be written more simply @ = 37. If a ¥ 0, then pcos 6 ¥ 0 for points on the line x = a, and therefore cos 6 # 0 and the equation can be written

p=asec0,ax#0.

(4)

(ii): The equation y = b becomes p sin@ = 4 in polar coordinates. written more simply 0 = 0. If 6 ¥ 0, this equation is equivalent to (5)

p = bcesc 0, b# 0.

Most equations in polar coordinates take the form

(6)

p = f(),

or, occasionally,

(7)

pu

(0):

If 5 = 0 this can

be

[§703

COORDINATES

POLAR

a2

In either case the graph can be found by computing values of the function f(#) and setting up a table of values. In practice, however, it is more expedient to plot only a few “key points,” or to graph a portion of the curve and deduce the full curve from that portion. One feature of a curve that is frequently helpful is the set of values of 6that correspond to p = 0, since these usually disclose the behavior of the curve near the origin.

Example 6. Solution.

Graph p = asin 6, where a > 0, and show that the graph is a circle.

The table of values

gives the points shown in Figure 703-5a. The fact that p = 0 for integral multiples of 7, where sin 6 does not attain extreme values, tells us that the x-axis is a tangent to the graph. The maximum value of p is clearly a, assumed when 6 = 37. Multiplication by sin @ gives p sin @ = a sin? 6 = 0, so that the graph lies entirely above the x-axis, except for the origin. The complete graph is shown in Figure 703-55. To prove that the graph of p = asin @ isa circle, we consider the related graph obtained by multiplying by p: p? = apsin 6, which in rectangular coordinates can be written

(8)

Xe ey?

ay

Or

2X? =) = 3d)

y

Ga). ¥y

ae

(0, a)

p=asin@ x

x

>

(a)

(b) Figure 703-5

The graph of (8) is a circle of radius 3a and center (0, 3a). By factorization of p? — ap sin@ = 0 we see that this circle is the union of the two graphs (/) p = 0 and (ii) p — asin@ = 0. Since the graph of (/) consists of the origin only, and since the origin is a point of the graph of (i), we conclude that the graph of (i) alone is the circle of Figure 703-5d.

Nore. The graph of p = asin@ is a circle whether a > 0 or a < 0 (cf. Ex. 38, §704). The graph of p = acos @ is also a circle, whether a > 0 or a < 0 (cf. Ex. 39, §704).

§703]

CURVES

Example 7. Solution.

IN POLAR

COORDINATES

193

Graph the cardioid p = a(1 — cos 8), where a > 0.

The table of values

6 || 0

an

$1

el] 0 | 3(22— \3a | 3(2- 2a gives the points shown in Figure 703-6a. symmetric with respect to the x-axis.

a4

y

y

ea

@

os 2a

The complete graph, as shown in Figure 703-64, is

t

x

x 2a

a

p = a(1-—cos6) (a)

Example 8. Solution.

Figure 703-6

b

i

Graph the four-leaved rose p = a sin 20, where a > 0.

The table of values

An extension of this table for 37 VAN ) IIA ) go < o n ¥y

at

ail

p= asin 20 Figure 703-7

(b)

POLAR

194

[§704

COORDINATES

negative values for p, and therefore a portion of the curve located in the fourth quadrant. Similarly, for 7 < 6 < 32, 2x S 20 S 32, asin 20 = 0, and this portion of the curve lies in the third quadrant, and for $4 S 0 S 2, 3m S 26 S 4n, a sin 20 S 0, and this portion of

the curve lies in the second quadrant. The fact that the function a sin 26 vanishes at integral multiples of 42 is indicated in Figure 703-7a by short vertical and horizontal line segments

through the origin. The extreme values of a sin 20 at @ = jm, 3a, jm, and 77 give points shown in Figure 703-7a.

Example 9. Solution.

The entire curve is shown in Figure 703-74.

Graph the lemniscate p* = a* cos 26, where a > 0.

The graph is real if and only if cos 20 = 0, and therefore for —ja S 0 S ir

and for 24 < 6 < 2, and is imaginary for +7 < @ < ?n and for ?a < 6 < fa. The vanishing of cos 20 at 6 = {7, {7, 5, and 77 is indicated in Figure 703-8a by short line segments through the origin. Extreme values of p = +a are given by @ = 0 and zw. These points and those corresponding to 6 = +47, 27, and Z7, where p = +42a, are also shown in Figure 703-8a. The entire graph is given in Figure 703-8).

p?= a’cos 20

(a)

(6) Figure 703-8

704

EXERCISES

In Exercises 1-6, one set of polar coordinates of a point is given. Find another set [p, 9] with 0 S @ < 2m, graph the point, and find the rectangular coordinates of the point. 1. [4, $7].

23" al:

5. [V13, Arccos (—3/113)].

3. [—6, 3a].

4. [—2, gn].

6. [V17, 2m — Arctan 4).

In Exercises 7-12, the rectangular coordinates of a point are given. find two sets of polar coordinates [p, 6] with 0 < @ < 2r.

70g) 10. (12, 0).

8. (—5, 53). Tine):

Graph the point and

9, (—4y3, —4). 1255; 1).

§705]

POLAR

In Exercises

EQUATIONS

OF

CONICS

195

13-18, transform the given equation to rectangular coordinates, and graph.

13. 6 = in.

14. p = 2sec 0.

15°"

16.

17.

18. p = 8cos0.

p = —3.

p = —6sin@.

=" —4 esc 0;

In Exercises 19-24, transform the given equation to polar coordinates, and graph. 19k a

=O)

22. x2 ++ y2 = 16.

20. x = —3.

Z1.y = "1.

23.

24

x2 + y2 = 10x.

ny

1)

Ie

In Exercises 25-37, draw a graph of the given equation. 25.

Cardioid,

p = a(l — sin @), a > 0.

26. Cardioid,

p = al + cos @), a > 0.

27. Three-leaved rose, p = asin 36,a > 0.

28... Three-leaved rose, p = acos 36,a > 0.

29.

30. Five-leaved rose,

p = acos 50,a > 0.

31. Lemniscate, p? = a? sin 26, a > 0.

32. Lemniscate, p? =

—a? cos 26,a > 0.

33.

Limacon, p = 3 — 2 cos 0.

34. Limagon, p = 3 — cos 6.

35.

Limacon, p =

36.

Four-leaved rose,

p = acos 26,a > 0.

1 — 2cos @.

37. Spiral of Archimedes,

Limagon, p =

p = a0,a > 0. (Take a =

1 — 2sin 6.

1.)

38. Draw the graph of p = a sin 6, a < 0, and show that the graph is a circle. equation of this circle in rectangular coordinates.

Write the

39. Draw the graph of p = acos 6, a ¥ 0, and show that the graph is a circle. Consider the two cases a > 0 and a < 0. Write the equation of this circle in rectangular coordinates.

705

POLAR

EQUATIONS

OF CONICS*

The conic sections — parabolas, ellipses, and hyperbolas — are graphs of equations that assume a particularly simple form when expressed in terms of polar coordinates, with a focus at the origin or pole. As we shall see, the equation can be put in a form where p is a simple rational function of either cos @ or sin 9, and all three types of conics can be treated by a single unifying form of expression. This polar form is especially suitable for study of space trajectories and celestial mechanics. We consider the three classes of conics in turn. I.

Parabola.

As shown in §602, the equation of the parabola with vertex at the origin, focus at (c, 0), and directrix x + c = 0, where cc > 0, is y? = 4cx. By §603, the equation of the “shifted” parabola with vertex (—c, 0), focus (0, 0), and directrix x + 2c = 0

is

(1)

y? = 4c(x + ¢)

*This section, together with the Exercises of §706, can be omitted without disturbing the continuity of the remaining material. It is not prerequisite to any portions of this book or of the sequel volumes, Basic Concepts of Calculus and A Second Course in Calculus.

196

POLAR

COORDINATES

(cf. Fig. 705-1). Equation (1) can (x + 2c)?, or with p? = x2 + y?,

(2)

be

written

[§705 x? + y2 = x? + 4cx + 4c? =

p? = (x + 2c).

Extracting square roots and using polar coordinates to express x = pcos 6, we observe that the parabola (1) is the union of the graphs of the two equations p = pcos@+ (3)

2c and p = Pre

=

2c Cane—

—p cos 6 — 2c or, equivalently, of the two equations iS GOS = lls fp

|

—2c Meena

SS)

cos 6 #

2

il.

We now make a simplifying observation: whenever [p, 6] is a pair of polar coordinates satisfying either one of the equations (3), then [—p, 6 + 7] is a pair of polar coordinates satisfying the other equation, and since the two points [p, 6] and [—p, 9 + 7] are the same point, the two equations (3) have the same graph. For simplicity we choose the first equation of (3), for which p > 0, and with the notation p = 2c for the. (positive) distance between the focus and directrix (cf. Fig. 705-lc), we have for the equation of the parabola (1) in polar coordinates: ye P ray

(4)

6~2nnr,

y

n€

J,

y

y

D

x+2c=0

v

=e

f

/

1(0,0)

(—c,0)\) t—

(a)

De

x

vy?=4e(x +c) —}

(b)

(c)

Figure 705-1

Example 1. Write the equation in polar coordinates of the parabola with focus at the origin and vertex on the x-axis 3 units to the left. Solution. tion

Since the vertex is midway between the focus and the directrix, p = 6, and equa6

(4) iS Di=

II.

i

ee

Ellipse.

The equation of the ellipse with foci (—c, 0) and (c, 0) and major and minor semiaxes a and 6, respectively, is (x2/a?) + (y2/b2) = 1, with a2 = b2 + ¢

§705]

POLAR

EQUATIONS

OF

CONICS

197

(cf. $605). Therefore the equation of the “shifted” ellipse with foci (0, 0) and (2c, 0) and center (c, 0) is (§607): _

(5)

a9

2

2

+5-1

@=R +e

(cf. Fig. 705-2a). Equation (5) can be written a2x2 + a2y2 = c2x2 + 2b2cx + b4 or, with p2 = x? 4+ y2,

(6)

a2p? = (cx + b2)2.

1(0,0)

(0)

(2c, 0)

x is

p

_._ €P ~ 1-ecosé

(0) Figure 705-2

The graph of (6) is the union of the graphs of the two equations ap = cx + b? and ap = —cx — b?, but since these two graphs are identical, for the same reasons as those used in connection with equations (3), we choose one, ap = cx + b? = cp cos @ + b2, or

(7)

p=

b2 a—ccosé@

It is not hard to show that (7) gives positive values to p for all values of #6. Recalling from §606 the definition of the eccentricity e of an ellipse: dé;

C=

e=>-

(8)

we can rewrite equation (7) by dividing by a and using (8):

,

@)

eee

P=

1 —(c/aycos@

1—ecosé

and therefore, after multiplication by 1 — ecos@: (10)

p — ep cosé = p — ex = b?2/a,

from which we have p = ex + (b2/a) = e(x + (6?/ae)), or

(11)

oe

POLAR

198

[§705

COORDINATES

Interpreting equation (11) as a description of a locus, we have an interesting new formulation of an ellipse as the locus of a point moving so that the ratio of its distance from a fixed point to its distance from a fixed line not passing through thefixed point is a constant.

The fixed point is a focus, the fixed line is a directrix, and the constant

ratio is the eccentricity of the ellipse. From (11), the equation of the directrix is x + (b2/ae) = 0, so that the directrix is a vertical line located at a (positive) distance b2/ae to the left of the origin or focus. If we label by p this distance between the

directrix D and the focusf, we have

(12)

Pema

wes ae cle

so that the equation (9) of the ellipse can be put in the form

pe ORS oat Ea cond

(9)

Example 2. Show that for the ellipse (13) the distance between the center and a directrix is a/e and the distance between a vertex and the corresponding directrix is (a — c)/e.

Solution.

The distance between the center and a directrix is c + p (cf. Fig. 705-26), b2 be a ap : and by (8) and (12),c + p=c+ ae -.ne = = - Therefore the distance between a :

Cs

peta

2!

vertex and the corresponding directrix is — — a = -

=

e

Example 3.

Find the rectangular coordinates of the foci and vertices of the ellipse

20

14

Cy

p

Beet eg Tet 3 — 2cos 0

Also find the major and minor axes and the eccentricity. Solution.

Substitution

p = 4: (20, 0), (—4, 0).

of 6 = 0 and

@ = m gives the two

vertices,

with

p = 20 and

(Cf. Fig. 705-3.) Therefore the major axis is 2a = 24, anda = 12. 20

Division in (14) by 3 gives p =

1 — 3 cos 0

» and e = 3. Therefore c = ae = 8. Since the

center is (8,0)the foci are (0, 0) and (16, 0). Therefore 6? = a? — c? = 144 — 64 = 80,b = 45, and the minor axis is 8Y5. y

(—4, 0)

Figure 705-3

§705] III.

POLAR

EQUATIONS

OF

CONICS

199

Hyperbola.

The equation of the hyperbola with foci (—c, 0) and (c, 0) and transverse and conjugate semiaxes a and J, respectively, is (x2/a2) — (y2/b2) = 1, with c2 = a2 + b2 (cf. $609). Therefore the equation of the “shifted” hyperbola with foci (—2e, 0) and (0, 0) and center (—c, 0) is (§ 613):

(15)

iar cea ei aera

Mies

(cf. Fig. 705-4a). |Equation (15) can be simplified in much the same way that (5) was, to the same form a’p? = (cx + b2)? as (6). (Give details in Ex. 15, §706.) By the same reasoning as that used for (6) and (7) we can extract a single square root to write the equation of the hyperbola (15) in the form (7) or, with the notation e = c/aand p = b?/ae, in the form (13). The only difference in the present case is that p, given in (7) or (13), is no longer restricted to positive values. In fact, positive values of p (for which cos 6 < a/c) correspond to points of the right-hand branch of the hyperbola and negative values of p (for which cos @ > a/c) correspond to points on the left-hand branch. The asymptotes correspond to cos @ = a/c. The italicized statement above defining an ellipse as a locus applies in equal measure to a hyperbola, it being understood that absolute values should be taken in order that the distances involved are positive.

Za

p

=

ep

~ 1—ecos 0

(b) Figure 705-4

Example 4. Write the equation in polar coordinates of the hyperbola with a focus at the origin and with vertices at the points whose rectangular coordinates are (— 12, 0) and (—2, 0). The major axis is 2a = 10, and a = 5. The center is (—7, 0) and therefore Solution. 3: tA, CAS , From the equation c = 7 and the eccentricity See 1

(16)

p : wel

ep 3P te Tp "e.cogdemels—=* 7 cos:-0

we find p by substitution of 6 = 7:

p=2=

Tp

ae

:

Tp = 24, p = 24/7, and the equation

sought is p = 24/(5 — 7 cos 6). An easy check on our work is given by substitution of 6 = 0 in this last equation: p = 24/(5 — 7) = —12, in agreement with the vertex (—12, 0).

[§705

COORDINATES

POLAR

200 IV. Synthesis.

If the eccentricity e of a parabola is defined to be 1, then the locus definition given for an ellipse, as well as equation (13), applies to all three classes of conic sections. The three types of curve, restricted to points near the focus in question, are indicated in Figure 705-5.

alo)

Ey

Oe

6=0

py ae

6=0

O

al

P~1—ecos6’ ©

UWE. :

d=0

O

a

2p

UGE

iep ere

>

P=

cp

=

T=ecos6’©

Ellipse

Parabola

Hyperbola

(a)

(b)

(c)

Figure 705-5

For conic sections with different orientation relative to the focus — that is, opening to the left, upward, or downward instead of to the right in the neighborhood of the focus located at the origin — equations similar to (13) apply. These can be obtained from (13) by replacing @ in turn by 6 + 7, 6 — 4m, and 6 + 47m. (Cf. Fig. 705-6.)

To the left

ne

ep 1+ ecos 6 (a)

Upward

Be

EBs 1—esin# (b)

Downward

Ae

ep_ 1+esin#é (c)

Figure 705-6

Example 5. Write the equation in polar coordinates of the ellipse with focus at the origin, eccentricity 3, and directrix y + 2 = 0.

§706]

EXERCISES

Solution.

There are two cases.

at the origin, then 5

Poin 0

(i) If y+ 2 = 0 is the directrix corresponding to the focus

from Figure

bole

P=

2 —sin

201

705-64,

with p = 2, the equation

is p = ee as = 1 —esin 0

r (ii) If y+ 2 = Ois the directrix corresponding to the focus that is

not at the origin, then the equation of Figure 705-6c is appropriate. In this case, the distance from one focus to the directrix of the opposite focus is equal to the sum of the distance between the foci and the distance p between a focus and the corresponding directrix.

this equation takes the form

U7)

In this case

2=2c+p=2ae+tp=actp.

From the equation p = ep/(1 + e sin @) we find the two vertices by substituting @ = 32 and § = $m, the corresponding values of p being

(18)

4

=

ep l+e

;

=

Z

ep l—e

Since the major axis is the sum of these distances, 2a =

2P

2p

coy

Ee

l+e

+

Sa

1—e

aid

ee

1 — e

From (17) we have 2 = 2p + p = 3p, p = $. The equation sought is therefore

p

706

:

ip eS

ep

et

ee

35

ee

1+esn@

1+4sin0

$5

=

=

2+sin 0

6

.

10 + 5 sin6

EXERCISES

In Exercises 1—6, draw the graph of the given polar equation. Give the rectangular coordinates of each focus and vertex. Locate the center of each central conic, and draw the asymptotes of each hyperbola.

‘a

Cetel i Tree ea 9 Li ag prey: 2

baipartemuanet > ee ee 6 ee ee eee Peps Tan Sat

Rr e e e Ser = Riemer aeatG 10 686 eee DN ere 6

In Exercises 7-12, find the polar equation of the conic section having one focus at the origin (pole), and satisfying the additional given conditions. Draw a graph. All points are specified in rectangular coordinates. 7. Parabola, vertex (0, 6).

9. Vertices (0, 7), (0, —3). 11. Focus (—8, 0), vertex (—S, 0).

8. Eccentricity 1, directrix

x + 8 = 0.

10. Focus (—3, 0), vertex (2, 0). 12. Directrices

y = 4, y = 6.

In Exercises 13 and 14, find the polar equation of a conic having one focus at the origin (pole), and satisfying the additional given conditions. Draw a graph. All points are specified in rectangular coordinates. (In each case there are two solutions.)

13. Eccentricity 4, vertex (0, 3).

14. Eccentricity 2, vertex (—4, 0).

15. Give the details in the reduction of equation (15), §705, to the form a*p? = (cx + 6*). 16. Show that for the hyperbola p = ep/(1 — e cos 6) the distance between the center anda directrix is a/e and the distance between a vertex and the corresponding directrix is (c — a)/e.

Vectors

801

VECTORS

IN TWO

in the

Plane

DIMENSIONS

The concept of a two-dimensional vector is presented in the following Definition. All vectors considered in this chapter will be assumed to be two-dimensional.

In

the following Chapter 9, n-dimensional vectors are introduced so that they may be used in connection with systems of linear equations. Applications are then made to the two-dimensional Euclidean plane in the latter sections of Chapter 9, and to three-dimensional Euclidean space in Chapter 10. Definition.

4 two-dimensional vector is an ordered pair (h, k) of real numbers

h and k. The numbers h and k are called the components or coordinates of the vector (h, k); h is the x-component or x-coordinate and k is the y-component or y-coordinate. Two vectors (h, k) and (c,d) are equal, written (h, k) = (c, d), if and only if their corresponding components are equal: h = c andk = d. Otherwise they are unequal, written (h, k) ¥ (c,d). The vector (h, k) is represented by the directed line segment joining any point (a, b) to the point (a +h, b + k) (cf. Fig. 801-1). A vector may also be denoted by a single letter with an arrow, thus: 6. The magnitude or absolute value of a vector 6 = (h, k) is denoted and defined:

(1)

|| = |(A, k)] = VA? + 2.

The zero vector is the vector 0 = (0, 0) having zero magnitude.

A unit vector is

a vector with magnitude equal to 1. A unit vector is also called a direction. If p: (x1, yi) and q : (x2, y2) are any two points of the plane, the term the vector pq 202

$802]

LINEAR

COMBINATIONS

OF

VECTORS

203

means

the vector (x2 — x1, y2 — yi) represented by the directed line segment pq. The radius vector ee the point p : (x1, y1) is the vector Op = (%1, 1), where O is the origin. The vector pq, as represented by the directed line segment pq, is said to have initial point p and terminal point q (cf. Fig. 801-1).

(h, k)

q

|

| (0,ieee

The vector (h, k)

Dp

The vector pq

(a+th,b+k)

y D

;

fee 7

O (a, b) The vector (h, k)

The radius vector Op Figure

Example.

801-1

If p = (3, —1),¢ = (1, 5), r = (6, 4), and s = (4, 10), then

pq = rs = (—2, 6). Nore: The preceding example illustrates the fact that two directed line segments represent the same vector if and only if they are parallel, similarly directed, and of equal length.

802

LINEAR

COMBINATIONS

OF VECTORS

Addition of vectors and multiplication of a vector by a real number are denoted and defined:

(1)

(u1, U2) + (v1, v2) = (ur + U1, U2 + v2),

(2)

Mui, U2) = (Aut, Au2).

That is, if # = (wi, u2) and 6 = (v1, v2) are any two vectors, then the sum @ + 6 of ii and @ is the vector W = (w1, W2), where wi = uw + 1 and w2 = u2 + v2, and the product \i is the vector whose coordinates are Au; and du2. Any real number used in the context (2) is called a scalar, and the vector Au is called a scalar multiple of a.

204

IN THE

VECTORS

[§802

PLANE

Vector addition can be represented either by means of the diagonal of a parallelogram (Fig. 802-la) or the completion of a triangle (Fig. 802-15). Multiplication of a vector ii = pg by a scalar gives a vector represented by a line segment parallel to the original segment pq (cf. Fig. 802-1c).

(b) Figure

(c)

802-1

Since vectors represented by parallel directed line segments can also be represented by directed line segments on the same line, such vectors are called collinear. Thus, for all scalars \, the vectors # and dw are collinear. If \ > 0, % and du are similarly directed; if \ < 0, # and Xu are oppositely directed. If \ = —1, the vector (—1)u, also written —i, is called the negative of uw. Any vector and its negative are therefore oppositely directed collinear vectors of the same magnitude. Vectors that are not collinear are called noncollinear. Subtraction of vectors is denoted and defined:

(3)

a—b=a+4+

(3D).

If u and 6 are represented by directed line segments with a common initial point, the vector “ — & is represented by the directed line segment from the terminal point of & to the terminal point of u (cf. Fig. 802-2).

Figure

Division of a vector by a nonzero scalar is defined: 4 (4)

ae

|

£ = -0Dv. ear

The result of dividing a nonzero vector by its magnitude:

(5)

oe

|O|

802-2

§803]

LINEAR

DEPENDENCE

AND

INDEPENDENCE

205

is a unit vector collinear with it, and similarly directed. The vector (5) is the result of normalizing 3. Vector magnitude has some of the properties of absolute value in the real number system (Theorem I, §311). Proofs of the following properties, for two-dimensional vectors, are requested in Exercises 32-35, §804, where suggestions are provided: Properties of magnitude

(Z)

|é| = 0; |é| = Oif and only if 6 = 6.

(ii) [dO] = |A|6], A € AB. (iii) al=_ [al Rp d€ B\ 10) (iv)

The triangle inequality* holds:

| + 6| < |a| + |al.

(oe) |e] = lol; la — o| = |6 —al.t

(vi)

|a — 6] S |a| + fd}.

(vii)

||a| — |é|| S la — dj.

The vector \u% + ub, where d and yw are scalars, is called a linear combination of the vectors «7 and 6. More generally, a linear combination of a set of vectors 01, 02,°**, 0, is a vector of the form

(6)

NET

A202

NO

where Aq, A2,°°*, An are scalars. Note. the set of addition, MG + ©)

It is not difficult to see that the conditions of Definition I, $509, are satisfied by U2 of all two-dimensional vectors (U2 is a commutative group with respect to 16 = 6, and the homogeneous law (ui) = (Aw)éi and the distributive laws = AX + AD and (A + pw)#* = AX + pH hold), and therefore U2 is a vector space

over §&.

Example.

(3, 5) + (—1, 4) = (, 9),

Gi.)

— 4, 4) = a1),

62; =3) = (12, =18): =—(=3,

i= G01),

|(2, —6)| = V4 + 36 = 2y10, |3(1, —2)| = |G, —6)| = 45 = 3V5 = 3|(1, —2)J, iat) + (1, 4)

803

LINEAR

DEPENDENCE

14, 3)| = 10'S V5 + yt7 = (2, —1)| + (1, 4).

AND

INDEPENDENCE

The following fundamental definition applies to ah arbitrary space U (of vectors) over a field of scalars 5: *In geometrical terms, no side of a triangle can exceed the sum of the other two (cf. Fig. 802-14). +The absolute value of the difference between two vectors having a common initial point is the dis‘tance between their terminal points (cf. Fig. 802-2).

[§803

VEGTORSAINSTHERRLANE

206

Definition I.

The vectors 61, 62,:: +, 6, are linearly dependent if and only if there

exist scalars \1, \2,°**, \n not all equal to 0 such that

(1)

181 + dade ++ +++ Andn = 8.

If no such scalars exist, that is, if (1) holds only for scalars \1, X2,°**, Xn all of which

are zero, then the vectors 61, 62,:**, 6, are linearly independent.*

We begin our discussion of linear dependence and independence by considering sets {01, d2,--°, 8,} Of two-dimensional vectors for the special cases n = 1, n = 2, and n = 3,inturn.

I.

We start with n =

1.

A single vector 6 is linearly dependent if and only if it is the zero vector: b = 0.

Proof. If 6 = 0, then there exists a nonzero scalar 1 such that ld = 6 = 0. Conversely, if there exists a nonzero scalar \ such that \d = 0, then division by d gives ee = xOr2)

~1%

| : (;\)e

; Ila Daas es

.

and 6 must be the zero vector.

II. Two vectors ui and 6 (in two dimensions) are linearly dependent if and only if they are collinear or, equivalently, if and only if one is a scalar multiple of the other. If one of the two vectors i and b is linearly dependent — that is, equal to the zero vector — then the two vectors tu and 0 are linearly dependent. If tu is a linearly independent (nonzero) vector, and if ti and t are linearly dependent (6 is a vector collinear with u), then 0 is uniquely representable as a linear combination (scalar multiple) of i. Proof. Assume first that uw and 6 are linearly dependent, that is, that there exist scalars \ and uw not both of which are zero such that

(2)

hu + pd = O.

If « + 0 we can solve (2) for 3b:

(3)

b= —*G, bb

and w and 0 are collinear (even in the extreme case when d = 0). Similarly, if \ ~ 0, we can solve for w in terms of 6. Conversely, if i and 6 are collinear it is possible to write one as a scalar multiple of the other, say: (4)

D =

pi

where vy is an appropriate scalar multiplier depending on the lengths and relative sense of u and 8, and consequently w and 6 are linearly dependent:

(5)

vi + (—1)6 = 0,

*It should be appreciated that the concepts of linear dependence and linear independence apply to sets of vectors rather than to the vectors as individuals. Our terminology is well established in general mathematical usage. For example, we speak of “two disjoint sets” (although neither set is “disjoint” by itself), “‘similar triangles” (although no triangle is “similar” by itself), and “perpendicular lines” (although no line is “perpendicular” by itself). In contrast to these examples displaying properties of sets are the following examples displaying properties of individuals: “bounded sets,” “equilateral

triangles,” and ‘‘vertical lines.”

$803]

LINEAR

DEPENDENCE

AND

INDEPENDENCE

207

where at least the coefficient of 6 is nonzero. Next, assume that one of the vectors u and 0 is the zero vector — for definiteness let z = 0. Then

(6)

la +0-5=

1:04+06—6,

and uw and © are linearly independent since at least one (exactly one in this case) of the two scalars | and 0 in (6) is nonzero. Finally, if # and é are collinear and a # 0, then u in equation (2) must be nonzero, so that it is possible to write é as a scalar multiple of u: 6 = vi. To show that this representation is unique, we write 6 = ru = y2u, and

by subtraction

and

a distributive

law (cf. the Note, §802)

obtain

(v1 — v2) = 0 and consequently, since # # 0, ») — v2 = 0, and ») = 12. III. Three vectors ty, 6, and w (in two dimensions) are always linearly dependent. For any three vectors it is always possible to express one as a linear combination of the other two. If u and 6 are any two linearly independent (noncollinear) vectors, then any vector W is uniquely representable as a linear combination of i and 6b.

Proof. We shall prove the third statement first, using a geometrical argument (for a purely algebraic proof, cf. Example 3, $907). Accordingly, let a and 6 be any two noncollinear vectors, and let w be an arbitrary vector. Since u@ and 0 are noncollinear, both are nonzero and can be represented by nonzero noncollinear directed line segments having their initial points at the origin O (cf. Fig. 803-1). If # is also represented by a directed line segment from the origin, let lines L and M be drawn through the terminal point of w parallel to the two vectors 6 and 4d, respectively. Let U and V be the vectors collinear with # and b, respectively, having as initial point the origin and as terminal points the appropriate intersections of the =

3

f

lines L and M with the lines containing # and 6. Then U and V can be written as scalar multiples of # and 3d, respectively:

)

=

have w represented as a linear combination of a and 0:

(7)

E

U = \u, V = wo, and since

Ww = Ad + po.

Figure

803-1

Y

w =

>

>

U + V, we

IN THE

VECTORS

208

[§803

PLANE

To prove the uniqueness of the representation (7), we assume written:

(8)

that w can be

W = iW + w1H = 2d + pad.

Then, by subtraction and factoring by a distributive law, we have (A; — d2)u + (u1 — y2)d = O, and since w and 6 are linearly independent (by II, above), A1 — A2 = uw) — w2 = 0, A1 = A2, and yw = pz, and the representation (7) is unique. Furthermore, as a result of (7), we can say that a, 6, and W are linearly dependent since (7) can be put in the form

(9)

hi + pb + (—1)¥ = 0,

and at least the scalar coefficient —1 is nonzero. Now let i, 6, and w be any three vectors. If # and @ are linearly independent we have shown in (9) and (7) that

i, b, and Ww are linearly dependent and that w can be expressed as a linear combination of u and o. Assume finally that # and 6 are linearly dependent, and let \ and yu be scalars, not both zero, such that \w + yo =

(10)

0. Then,

hi + pb + Ow = O,

and since X, uw, and O are not all equal to 0, uw, 6, and w are linearly dependent. Furthermore, it is possible to solve for a or 6 or both (depending on the location of

a nonzero coefficient) in terms of the other two vectors. Definition I].

Let S =

This completes the proof.

{01, b2,---,6,} be a set of vectors in a vector space V over

a field of scalars §. Then the vectors ofS are said to span the space V if and only if every vector b ofU is expressible as a linear combination of 61, 62,° ++, bn: UO =

{drb1

+ Azd2

+--+ ++ Anda | Ar € F, A2 € Fy +, An E SF}.

The set S is a basis or a base set of vectors for O if and only if the vectors 61, 62,°*-, 0, span ‘0 and are linearly independent. Owing to the property expressed in the third sentence of III, above, any set of two linearly independent (noncollinear) vectors % and @ in the plane is a basis for the space U2 of two-dimensional vectors. The two linearly independent vectors # and é span the space U2, since {Az + wo|r and wp € R} = V2. Similarly, any single nonzero

vector % spans the space Uj; consisting

of all scalar

multiples \w of i,

and {iz} is a basis for the space U1. In the two-dimensional vector space U2, the standard basic pair of unit vectors are denoted and defined:

(11)

i= (1,0), HO).

The vectors i and j are also called the standard coordinate vectors of U>.

For this

basis the representation of an arbitrary vector 6 = (uv, v2) is particularly simple: (12)

v=

vi AF va].

We conclude this section by establishing three fundamental theorems that underlie the entire theory of linear dependence and independence. These incorporate

$803]

LINEAR

DEPENDENCE

AND

INDEPENDENCE

209

some of the results derived above, but are so expressed that they are available for vector spaces in general. Theorem I. /f the vectors of a finite set S are linearly independent, then the vectors of any subset R of S are also linearly independent. Equivalently, if the vectors of a finite set R are linearly dependent, then the vectors of any finite superset S of R are also linearly dependent. Proof. Let R consist of the m linearly dependent vectors 61, d2,---, im, and let S consist-of the--vectors 01, 02;-* >, Gm,°>:, On, wWhetre-n 2 m. If \1, 2,-**; Am are

scalars not all of which are 0 such that \18; + A2d2 +---+ Anim = 6, then 0’s can be adjoined to produce the set of numbers \j, \2,°°-, Am, 0,°--, 0 not all of which are O such that didi + Azb2 +°°>+ AmOm + Odme1 +:--+ 05, = 0, and therefore the vectors of S are linearly dependent. Theorem II.

The vectors 61, 62,:: +, tn, where n > 1, are linearly dependent if and

only if one of the vectors is a linear combination of the others. Proof. “If: Assume that the vector combination of the others:

(13)

Ue = MOT

0, where

Nea iteet

k € {1, 2,---,n},

Ang bet

is a linear

PA, 0.

Then since

(14)

Nor

Bees + Ag tbe

+ (Lb

+ Nee tBee1 + °°

+ Ande = 8,

and not every coefficient in (2) is equal to 0 (—1 # 0), the vectors are linearly dependent. “Only if’: Assume that the vectors 01, 52, :-: , 0, are linearly dependent, and in the equation

(15) assume

A181 + Azd2 + +++ + Anon = O, that d, is a nonzero

Nor fees

Ht Agr beer

coefficient.

Then

+ Angi Oe41 + °°* +Andn,

from (15) we can

and by dividing

write

—A,d, =

by —)x

solve

for ox:

(16)

; (BY

Theorem III.

eee (2s tice dett7 et

The vectors

Ha

Sse A

Vk+1

ee

01, 62,°::, 6, are linearly independent

i Dp.

if and only if

whenever a vector b is represented as a linear combination of 01, 62,°**, Un, this representation is unique.

Proof. “If”: The vector 0 can be represented as a linear combination (1) with all coefficients equal to 0. If this representation is unique, then (1) holds only for scalars \1, \2,°°*, An all of which are 0, and the vectors &1, d2,---, 5, are linearly independent by definition. “Only if”: Assume that 01, 02,°°*, Un are linearly independent, and that

+--+ + Ande = wib2 + w2b2 + °°

B= dyO1 + Azd2

(17)

[§803

PLANE

IN THE

VECTORS

210

+ Madr.

By subtraction we can write

(18)

(Ar — mi)dir + (2 — pa)i2 +--+

+ On = Ma)on = 0,

and by the linear independence of 61, 52, ---, d, every coefficient on the left of (18) must be 0.

Therefore

\1 = ui, \2 = #2,

°°°, An = Mn, and the representation (17)

as a linear combination of 1, 2, -+-, 0, is unique.

of

Example 1. The vector (0, 3) is linearly independent since it is nonzero. The vectors (3, 9) and (2, 6) are linearly dependent since 2(3, 9) + (—3)(2, 6) = (0, 0). The vectors (2, 5) and (1, 3) are linearly independent since neither is a scalar multiple of the other; for example, there is no real number X such that (2, 5) = A(1, 3), or 2 = \ and 5 = 3X. If (x, y) is an arbitrary two-dimensional vector, then (x, y) is a unique linear combination of (2, 5) and (1, 3):

(19)

(xy y= ACPD)

ny 3);

since the corresponding system in ) and yn,

2r SUG iptcee = teed 56,

(20)

SA + 3u = y, has the unique solution \ = 3x — y, uw=

Example 2.

—5x + 2y.

Prove that the diagonals of a parallelogram bisect each other.

First solution.

Let ABCD

(Fig. 803-2a)

midpoint of the diagonal from A to C.

be the given parallelogram, and let M

By assumption, 4M

= MC.

be the

We wish to prove that

M is also the midpoint of the diagonal from D to B or, in vector notation, that DVM = MB. —>

Using the equality CB = DA we have —

DM



MB

aa



2



= DA + AM, =





MC + CB = AM

+ DA.

By the commutative law (Note, §802), the desired equality is obtained.

A

B

Ee

(a)

(b) Figure 803-2

Second solution.

Let P be the point of intersection of the diagonals (Fig. 803-24). _—







—>

—-



—>



Then



AB = AP + PB = DC = DP + PC,

and consequently by subtraction,

(21)

MP = IKE = IDP = JB.

Since the vectors on the two sides of (21) are collinear with AC and DB, respectively, and since —_ 4 Z AC and DB are linearly independent, each of the two differences in (21) must be the zero > ~ —> — _ 4 vector 0, and hence 4P = PC and DP = PB. In other words, P is the midpoint of each diagonal.

—_

§804]

804

EXERCISES

211

EXERCISES

In Exercises 1-6, find the point s from the given data.

1. pg = 73,p= (3, 8),q = (4, 5), r= (6, 7). 2. Pg = =7rs,p= (1, 2),q= (6, —5), 7 = G, 4). SoA 3%

Pq

— =

Os, P =

Es 4.

Op

(—7,

THA) Ng

(—7,

11), q
|3; —4)|:

a

6i+4j|

a,

—————

|

14. |—4(5i + f)|= |—4]-|5i +J].

|6i+ 4y|

| (3, —4)

ry

16.

iG, —4)|



1

17. |(4i — 7/) — 2 + 3/)| = \(2i + 37) — Gi - FI. Teo a 7) (Sal) 1G — 7), 19. |(5i — 6j) + (i+ 2j)| S |5i — Oy] + [37+ 2i].

20. ||(6, 2)! — |G, 3)l] S |, 2) — @, 3). In Exercises 21-26, determine whether the vectors are linearly dependent or linearly independent. Give reasons for your conclusion.

21. 3i. pan lB) (3). — 9):

22. (0, 0). 2A Gi 5y) 0:

7500.15), (255):

26. i+), 21+ 4), 31+ 9.

In Exercises 27-29, express one of the vectors as a linear combination of the remaining vectors.

Be; = 30), — 181-1527):

28. (0, 0), (6, 17).

py

OFA. Vie ae

In Exercises 30 and 31, use vector methods to prove the given geometrical theorem.

30. The segment joining the midpoints of two sides of any triangle is parallel to the third side and half as long. 31. The medians of a triangle (that is, the line segments joining the vertices to the midpoints of opposite sides) intersect in a point of trisection of each.

In Exercises 32-35, prove the specified properties of magnitude of vectors, given in § 802.

32. (i)-(iii).

Suggestion for (iii):

1D)

Use (ii) and |é| = Ari

33. (iv).

lent

to

Suggestions:

By Exercise

[$805

PLANE

IN THE

VECTORS

212

7, §305, if a = (ui, u2) and 6 = (v1, v2), this is equiva-

(ua + v1)? + (v2 + v2)? S ui? + up? + 2Vur2 ae uz2\o12 + v22 + vi? + 22,

uv) + u2v2 S Vui2 ae uy

Vo12 + 22.

|u101 4 4202| and (ujv2 —

u2v)? = 0.

34. (v)(vi).

35. (vii).

Suggestion for (vi):

Suggestion:

From

Square

again,

making

use

of

mv

+ u2v2

OF S

|a — 6| = |a+ (—d)| S [a] + |—-F.

(iv),

|a| = |(@ — 6) + 8] S |a— 8] + jd]

and

[a] =

(6 — @) + a| S |a — b| + [a]. Use Theorem I(v), §311. 36. Prove that if p is the midpoint of a line segment pip2, and if O is the origin (all in the Euclidean plane), then —

=>

OPiri Op? Op = PS.

a

In particular, deduce that the radius vector of the midpoint p of the segment joining the two points (x1, v1) and (x2, y2) is

On 2: (3 ar x2 V1 i ). 2 2 37. Prove that the sum of any number of vectors (in the plane) is the zero vector if and only if they can be represented by the sides of a closed polygon (possibly self-crossing) directed in cyclic sequence.

805

DIRECTION

ANGLES,

COSINES,

AND

NUMBERS

For certain purposes it is more convenient to study a general line or directed line in the plane in terms of two angles, rather than the single angle known as the inclination whose tangent is the s/ope of the given line or directed line. For our present objective we must redefine the measure of an angle. If Land M are any two directed lines in the plane, we shall define the measure of the angle 6 between L and M as follows: If L and M are parallel and similarly directed, then 6 = 0; if L and M are parallel and oppositely directed, then 6 = 7. Otherwise, @ is the least positive measure of the angle between the rays issuing from the point of intersection of L and M, in the directions ofL and M (cf. Figure 805-1). In any case, then, the angle 0 between two directed lines is symmetric — the angle between L and M is equal to the angle between M and L — and satisfies the inequalities:

(1)

Ors" 7:

Figure

805-1

§805]

DIRECTION

ANGLES,

COSINES,

AND

NUMBERS

213

Any directed line L in the plane has two direction angles: (7) the angle a between L and the (positively) directed x-axis, and (ii) the angle 6 between L and the (positively) directed y-axis. (Cf. Fig. 805-2, where the line L is taken through the origin for convenience.) The cosines of these directions angles, (2)

A=cosa,

wu = cos 8,

are called the direction cosines of L.

E

1

+e)

&

Ye

First quadrant

Second quadrant

lid

ie R

YR

Third quadrant

Fourth quadrant Figure

805-2

Direction angles and direction cosines of any nonzero vector or directed line segment are defined as with a directed line. If d is a nonzero vector represented by the directed line segment from the point p: : (x1, y1) to the point p2 : (x2, y2), and if d is the (positive) distance between p; and pz, then the direction cosines of 6 are equal to

h=cosa==—

(3)

"|, p= sos 6 =



=

=— (01, 02) = (2 = X1, y2 — Vi):

OreiromessO2,

ie

(4)

= cOS a = A oo

VI =) a

v2 a (= = COS [83 =5—,

D Fi (A,Ns 2) =, ae,

(Cf. Fig. 805-3.) Since d2 = (x2 — x1)? + (v2 — y1)?, we have from (3):

(5)

2 + p2 = 1.

That is, for any directed line L in the plane, the sum of the squares of the direction cosines is equal to 1. In other words, if \ and yp are the direction cosines of a directed line L, then (d, ») is aunit vector. This unit vector (A, ~), sometimes called the direction of L, is collinear with L and similarly directed.

VECTORS

214

Py: (%4, 4)

IN THE

[§805

PLANE

Do: (Xo, Yo)

Po:(Xo, Y2)

e

e

Uy= X_— Xy

Dy: (x1, yy)

Do: (X9, Yo)

Pg: (Xo, Yq) Figure

805-3

Any ordered pair (/, m) of numbers / and m not both of which are zero is a set of direction numbers of a directed line or line segment L if and only if the vector (/, m) is collinear with L, or equivalently, if and only if the vector (/, m) is collinear with the vector (\, ). This condition of collinearity holds if and only if there exists a nonzero constant k such that (/, m) = kQ), uw) = (kA, ku), or 1]= kX and m = ky. From (3), with k = d, we see that x2 — x; and y2 — y; are direction numbers of the directed line segment from (x1, yi) to (2, y2). Since a vector (/,m) collinear with a direction (A, u) is also collinear with the opposite direction (—A, — yz), / and m are called direction numbers of the undirected line L as well as of the directed line L. Example. The vector (—2, 3) represented by the directed line segment joining the point (—1, 1) to the point (—3, 4) has direction cosines

(6) (Cf. Fig. 805-4.)

—p) \ = cosa= ae

fi

y13

cos B i :

y13

With the notation used above, d = v13, identity (5) becomes ;“5 + 7°5 = 1, y

Figure

805-4

$806]

THE

SCALAR

OR

INNER

OR

DOT

PRODUCT

215

and the following are ordered pairs of direction numbers for the line L through (—1, 1) and (—3,4):* (—2, 3), 2, —3), (46); (6,-—9), (—20, 30). The two directions for this line are 3

( ee =.) a( 2 Ve) icy aa Eien eye

806

THE

SCALAR

OR

INNER

OR

DOT

PRODUCT

The scalar product or inner product or dot product of two vectors D = (Yj, v2) is denoted and defined:

(1)

.

ab

T S =

+

=

# = (uw, w2) and

S

In this section we shall discuss the geometric significance of this concept.* If either @ or o is the zero vector, w-6 = 0. Assuming for present purposes that neither @ nor 0 is the zero vector, let 6 be the angle between w and 6, and let (Ai, 1) and (A2, u2) be the unit vectors whose components are the direction cosines of a and 0, respectively. Applying the law of cosines to the triangle whose vertices are the origin O and the two points pi : (Ai, u1) and pz: (A2, uw2), and denoting by c the distance between p; and pz (cf. Fig. 806-la), we have:

eee Ce

Ee

eGo

2A

© 2A? ==

Fait)

ss

te A) rats (A

or after simplification (since \1;2 + wi2 = A22 + p22 =

(2)

D

Oe? 5

22)

22) = 201A2- + wha)

1):

COS 6 = d1A2 + pie.

y

(a) Figure

From

806-1

(4), §805, we have wm = dla, u2 = mila, v1 = d2[6|, and vz = u2|6|, and

hence wv;

+ w2v2

=

(AiA2 + uim2)|i|

|.

By (2), then, wv;

+ u2v2 =

|| - [5| “COS 0.

If, for the case where either @ or & is the zero vector we permit the indeterminate angle @ to be arbitrarily assigned, we have from (1) the general theorem: *This product is called a scalar product because the result is always a real number.

VECTORS

216

Theorem I.

IN THE

[$806

PLANE

The scalar product of two vectors i and & is equal to the product of

their magnitudes and the cosine of the angle between them:

ub = |ii|-|b|-cos 6.

(3)

If we agree to say that the zero vector is orthogonal (perpendicular) to every vector, we have the following corollary to (3):

Theorem II.

Two vectors are orthogonal if and only if their scalar product is zero.

Example 1. The vectors (10, —15) and (12, 8) are orthogonal since 10-12 — 15:8 = 0. The vectors (11, —5) and (6, 13) are not orthogonal since 11:6 — 5-13 = 1 # 0.

We now assemble a number of facts concerning the scalar product, whose proofs are left to the exercises (Exs. 38-41, $808):

(i) The dot product of a vector and itself is often denoted by means exponent 2. In this special case (% = 6), formula (3) gives

of an

w= ii = |al?.

(4) (ii)

If the component of w in the direction of 6, denoted comp, wu, is defined to be

the number || cos 6 (cf. Fig. 806-16),

then z:-6 = (comp,a)|6|.

In particu-

(iii)

lar, if 6 is a unit vector, w7:-6 = comp,w, and if u@ = (uw, u2), comp; and compjt = uz. (Cf. (11), $803.) The following commutative law holds for all vectors # and ob:

(iv)

The following two distributive laws hold for all vectors w, 6, and w:

= m4

uv = Bu.

u(6+ WW) =a (i + 3):0 | (v)

b+ a-w, uw + vw.

The following homogeneous law holds for all scalars i and ob:

\ and yw and vectors

(Ad): (ub) = (Au)(Ud). (vi) (vii)

(5)

The associative law for scalar products is meaningless product of two vectors is not a vector.

since the scalar

If % = (a, b) is any nonzero vector, if 7; = (x1, y1) is the radius vector of a fixed point (x1, y1), and if 7 = (x, y) is the radius vector of a general point (x, y), then the equation of the line L through (x1, y1) orthogonal to a can be written

u-(F — 71) = Ax — x1) + 460 — yi) = 0. Consequently, if ax + by +c =0 is the equation of any line L, the coefficients of x and y are the components of a vector orthogonal to L. (Cf. Fig. 806-2a.)

(viii)

If 6 = (J, m) is any nonzero vector, if 7; = (x1, y1) is the radius vector of a fixed point (x1, yi), and if 7 = (x, y) is the radius vector of a general point (x, y), then the point (x, y) lies on the line M through (x1, y1) parallel to é

$806]

THE

SCALAR

OR

INNER

OR

DOT

y

PRODUCT

217

y

u

3

Se

L

(x1, Y;)

M

(a, 92)

(X4,1)

=

x

x

(a)

(b) Figure

806-2

if and only if 7 — 7; and 6 are linearly dependent; that is, if and only if the components of 7 — r; and those of é are proportional: Of 2,

(6)

Sr

=

Va

say

ere

where a zero denominator in (6) is to be interpreted to mean corresponding numerator is also zero. (Cf. Fig. 806-20.)

that the

Example 2. The equation of the line through the point (3, 4) perpendicular to the vector (5, —2) can be written (5, —2)-(x — 3, y— 4) = 5(x — 3) — 2(y — 4) = 0, or 5x — 2y = 7. Example 3.

The equation of the line through the point (3, 4) parallel to the vector (5, —2) x-—-3 y-—4 can be written » or 2x + Sy = 26. The equation of the line through the point



=2

. Ray (3, 4) parallel to the vector (5, 0) can be written a

oa Sane

or

y—4=0,

or y = 4.

Note. A vector, as defined in $801, is an ordered pair of real numbers. This ordered pair is necessarily associated with a particular rectangular coordinate system. A change of coordinate system may produce a change in the ordered pair. (An example is indicated in Figure 806-3.) However, since the magnitude of any vector and the angle between any two

v2, -v3)

Vibe (a)

(b) Figure

806-3

IN THE

VECTORS

218

[$807

PLANE

vectors are unchanged by any rectangular coordinate transformation, Theorem I establishes the fact that the scalar product of any two vectors is independent of the rectangular coordinate system. (In Figure 806-3, the magnitudes of # and @ are 2 and 2, respectively, in either coordinate system, the angle 6 is 47 in either coordinate system, and the scalar products are

2:14+0-:1=2

and

y2:y2— y2-0 = 2,

in coordinate systems (a) and (4), respectively.)

Example 4. Simple Simon defines what he calls the circle product of two vectors u = (u1, u2) and 6 = (v1, v2), thus: + u20Y. HO’ = uv2

(7)

Show that the circle product is not independent of the coordinate system. Solution. The vectors # and 6 of Figure 806-3a have the circle product 2, while the vectors a and t of Figure 806-35 have the circle product —2.

Example 5.

Discuss the cancellation law for scalar products:

(8)

(@@-6 = 0-0) > db = W.

Solution. The example di = i+j, 6 = i,and v=] shows that (8) is false as it stands, since (i+ 7): =i-i+/7-i=1 and @+/)-j =i-j+ 7-7 = 1. However, if the equation u-b = a-w holds for a// vectors 4, then 6 and W must be equal. This follows from the fact that if 7-6 = u-W for all a, then #-(6 — Ww) = 0 for all @ In particular, this last equation holds when u# = 6 — wW, and therefore, by (4), b — W =

807

: 0, and

(i — w)-(6 — w) = |b — we = 0,

d = Ww.

EXERCISES

In Exercises 1-3, find the direction angles of the directed line segment from the first point to the second.

1 ON4) C7):

(Os

DE(HZ, 3).

3. (3, —2, (—1, 2).

In Exercises 4-8, find the direction cosines of the given directed line or line segment

4. Positively directed x-axis.

5. Negatively directed y-axis.

6. From the origin into the first quadrant, making equal acute angles with the coordinate axes. Thy Jertoan (SO) ior, 3):

8. From (1, 5) to (4, 4).

In Exercises 9-11, use (5), §805, to show that the two given angles do not form a pair of direction angles for any directed line.

7, at47. 9. zis

1 10. 0, 4r.

ie

In Exercises 12-14, find three distinct sets of direction numbers for the directed line segment from the first point to the second.

1b4, @, 3), @& 12).

13. (—1, 6), G, 1).

14. (2, 4), (—5S, 5).

In Exercises 15-20, find the scalar product of the two given vectors. 15. i and i.

16. i and j.

ig fy 7 and i — i

$807]

EXERCISES

18. (3, —5) and (2, —3).

219

19. 61 — j and 2i + 7/.

20. (—3, 1) and (2, 4).

In Exercises 21-23, use (3), §806, to find the cosine of the angle between the two vectors.

2A Oe

f) (= 21).

9 Fah) SE CRG 6

BEES

2G Op.

In Exercises 24-26, determine whether or not the two vectors are orthogonal.

2AM(G7410), (6.13):

26. (7, 9), (14, —11).

255121 sy, 121 4.9),

In Exercises 27-29, find the two unit vectors that are orthogonal to the given vector.

Sie

2a).

29:0, —6).

ONE

In Exercises 30 and 31, find the component of the first vector in the direction of the second.

207687).

1.

BI 6

ee:

32. Verify the two distributive laws (/v), §806, for the three vectors # = (2,

—4), 6 = (—3, 1),

im = (UO)

33. Verify the homogeneous law (v), §806, for the given scalars and vectors. (A OL Di

\ =

—3,

In Exercises 34 and 35, use (vii), §806, to obtain the equation of the line through the given point (x1, yi) and orthogonal to the given vector i.

34. (x1, v1) = (2, 0), &@ = 5i+ 7).

35. (x1, y1) = U, —4), @ = (8, —3).

In Exercises 36 and 37, use (viii), §806, to obtain the equation of the line through the given point (x1, v1) and parallel to the given vector 0.

36. (x1, 11) = (—2, 1), 6 = ©, 4).

37. (Hi, yi) = 3,4), b = 21 —j.

In Exercise 38-41, prove the indicated facts listed in §806. 38. (i)-(ii).

39. (éii)-(iv).

40. (v)-(vi).

41. (vii)-(viii).

42. Give an algebraic proof of the inequality

(1)

|a-6) < |a|- la], or luv + wove] S Yur? + u2? Vor? + m2,

and relate the result to the trigonometric inequality |cos 6| < 1. Suggestion: suggestion given in Ex. 33, §804.

Use the last

43. Prove that equality in (1) holds if and only if the vectors # and 0 are linearly dependent.

44. Prove the general triangle inequality:

jo: + bo +---+8,|

S bi] + leo] +--++ lel.

45. Prove that any two nonzero orthogonal vectors are linearly independent.

46. Prove the parallelogram law for two-dimensional vectors: The sum of the squares of the diagonals of any parallelogram is equal to the sum of the squares of the four sides,

@ + ty + a — 0? = 207

0).

47. Simple Simon defines what he calls the square product of two vectors 4 = (u1, u2) and @L] 6 = wuz + viv. Show that the square product is not independent of b = (v4, v2) thus: the coordinate system.

Matrices

901

MATRICES

AND

THEIR

LINEAR

COMBINATIONS

Two important generalizations of the concept (ordered pair of real numbers) are: (i)

of a two-dimensional

vector

ann-dimensional vector, or ordered n-tuple of real numbers, written Vi v2

(1)

Di

(C0On)

On



;

)

Un

consisting of the m components or coordinates v1, v2,:--, U, € ®, arranged either ina horizontal row or in a vertical column;

(ii)

(2)

anm

Xn (read “‘m by n’’) rectangular matrix

A

=

a\1

ai2

ae

Aln

a21

a22

sarees

a2

Ami

Am2

ah

fa

:

)

Amn

consisting of the components or coordinates of the m row vectors (a11, @12,°**, Gin), (21, G22, ***, G2n), ***, (Ami, Gm2,°**, Amn) Or equivalently of the components coordinates of the n column vectors 220

or

§901]

MATRICES

AND

THEIR

LINEAR

COMBINATIONS

a1

ai2

Ain

Ami

m2

Gran

221

The double subscript notation a;; of (2) indicates the element of 4 in the ith row and jth column, the matrix A also being denoted 4 = (a;,) if the numbers m and n are otherwise indicated or clear from context. If a matrix A is denoted (a;,), the element ai; 18 called the general element of 4. An n-dimensional row vector can also be thought of as a 1 X n matrix, and an m-dimensional column vector can also be thought of as an m X 1 matrix. The row vectors and column vectors of a matrix are also called rows and columns for simplicity. Two matrices have the same shape if and only if they have the same number of rows and the same number of columns. An m X n matrix is square if and only if m = n. Equality between two matrices A and B, written A = B, holds if and only if A and B are identical element by element: A and B are of the same shape, m X n, and a;; = 6;; fori = 1, ---, m andj = 1, *++, n. Inequality, written A ~ B, means that equality A = B is false. The main or principal diagonal of a square matrix A = (a;;) consists of the elements aj;, i= 1,---,n. The order of a square nm X n matrix is n. Although we shall obtain results for general m X n matrices, most of the concrete

applications of the general theory will be to matrices where m and n are small, usually less than or equal to 3 in this chapter, and less than or equal to 4 in the following Chapter 10. All that is important in the present section is that the matrices being considered have the same shape. We now State the fundamental definition that permits us to form linear combinations of matrices having the same shape, and to regard the set of all such matrices as a vector space: Definition. Jf A = (a;;) and B = (b;;) are any two m X n matrices and if d is any real number, called a scalar, then the sum C = A + B of A and B and the scalar multiple D = \A of\ andA are them X nmatrices C = (ci;) and D = (dij) defined by (3)

Ciy

= ay

+

Oj,

Oi) =

Gry,

if =

l1,225

(pas

Ohi e

or, in more complete form:

Qi1*** Gin Dut

if

ees ine

bi - + Din Dr


u%,, the vector Ayu) + A2w2 +--+ + Anu, Can be the zero vector all of whose components are 0 only incase \; = A2 =:::=A, = 0. The n vectors (5) are the standard basic unit vectors of U,1.

902

LINEAR

TRANSFORMATIONS

AND

PRODUCTS

OF MATRICES

Let T be a function whose domain is a vector space 0 and whose range is a subset of a vector space WW. Such a function is called a linear transformation of 0 into W if and only if it has the following two properties:

(1)

Tis additive:

wandobc

U>

Ti + &) = TH) + To), (2)

T is homogeneous:

\ € ® and #@ €

VU=> T(Au) = AT(U).

For reasons of notational simplicity rather than of conceptual necessity, we shall now particularize both spaces U and ‘W to the space U2 of two-dimensional vectors. Let T represent an arbitrary linear transformation of U2 into U2. We shall show that T corresponds in a specific fashion to a unique 2 X 2 matrix A. It turns out to be more convenient if the space U2 is thought of as the space V2: of two-dimensional column vectors than it is if U2 is regarded as the space U12 of two-

dimensional vector:

row vectors.

ae [Ves Thus, we shall write 7 = (5):j=

0 (‘) and a general

[§902

224

MATRICES

(3)

= wi + Wj = a0) pee 8

%

_

0

(3) a fe 5s(":)

We now construct the matrix 4 as follows: Let the first column of A be the image under T of the vector i, and let the second column of A be the image under T of the vector /: a

.

isa

l

Q2\

0

a

i) a r(4) a 7: r(?)

ail

a12\

a ie i

Using the linearity of T we can write out the image 6 of u = ui + ujin terms of the elements of A:

(5)

6 = T(i) = AG) = T(mi + uj) = wT) + wT) ey

ail a2\

ahs ie a22

Cie uia21

a ee u2Q22

a (ae =. ai2u2\_ a21u, + anu)

In other words, if T(@@) = 6 = on ae v2J, then

(6)

Vr = a1

+ Qy2uW2,

v2 = a2iu1 + a22U2.

Since the basic unit vectors i andjdetermine the elements of the matrix A given by (4), the matrix A that defines the mapping 7 according to the equations (6) is unique. To express this idea more specifically, assume there is a 2 X 2 matrix B = (6;;) such that if 6 = T(%) then (7)

Oy = biiuy + Di2t2,

v2.= bait) + boar.

Then if uw; = 1 and w2 = O equations (6) and (7) give ai1 = bi; and az; = 621, while if uy = O and uz = | equations (6) and (7) give diz = bj2 and a22 = b22, whence

AmB. The converse of the preceding result is immediate: If A is any 2 X 2 matrix, then the transformation T defined by (6) is a linear transformation of U2 into U2. In other words, the linear transformations of U2 into U2 and the 2 & 2 matrices are in

one-to-one correspondence. Let us now look at the composite of two linear transformations T and U, where the matrix corresponding to Tis A = (a;;) and that corresponding to U is B = (b;,). That is, let the function

W = ToU

be defined:

(8)

W(u) = T(U(@)) for every i € U2.

In the first place, W is a linear transformation of U2 into itself:

[ W is additive: (9)

Wi + 6) = T(U(a + 8)

= T(U(4) + Ul) = TUM) + T(UG)) = Wi) + WOE); W is homogeneous:W(\ui) = T(U(U))

= T(AU(a)) = AT(U@)) = WH). Therefore W is given by a2 X 2 matrix C = (c;;) whose elements we shall now find. The first column of C is the image of i:

oe) me w (6) a a) C21 0 bay

=

aiibi1 + ai2b21 d21b11 + ar2b2,

§902]

LINEAR

TRANSFORMATIONS

AND

PRODUCTS

OF

MATRICES

225

and the second column of C is the image ofj:

ee a w(?) 4 ave C22 l b22

_

(411b12 + a12b22 a21b12 + a22b22

so that

fie an wT (ae ai2b21)

C=

(10)

C21

€22

421611 + a22b21

axib1> + eee) a21b12 + ar2b22

In other words, the composite T° U of the two linear transformations T and U has a matrix C = (c;;) whose general element c;; is obtained from the elements of the matrix A = (a;;) of T and those of the matrix B = (b;,) of U by the formula: 2

Gh)

cj=>>

daar

Oe

1

2.

If A and B are two given 2 X 2 matrices, the matrix C defined by the formula (11) is called the product AB of the matrices A and B, and we have the theorem:

Theorem. Jf T and U are any two linear transformations of U2 into V2, with matrices A and B, respectively, then the composite T° U of T and U is a linear transformation of 02 into 02 whose matrix is the product AB of A and B. Nore 1. The rule for forming the product AB of two matrices A and B can be remembered in terms of dot products of row vectors of A and column vectors of B: the element in the ith row and jth column of the product AB is the dot product of the ith row of A and the jth column of B. This is shown schematically:

su toed) Matlaan 3) nc) Mools:) (oo) s) PNOZOYNO@

pee

pe

le oon

2.

QPOU

(521456

NE ©

G) © (19. 22).

Nad JT eR oe Se gh tteee 43 50 SIME TN (ET? 0\ R/ she MG MAN 5% 2\" 1"

Note

TF

92)\ 284) PONS 4 ees

—3

Matrix multiplication is not in general commutative.

10 That is, if A and B are

general 2 X 2 matrices, the two products AB and BA are not the same:

(13)

AB # BA.

This can be shown by a particular example, thus: ee

(14)

Nie52 6\ . (198922

€ Ae

fee

:)ia Ge 4 is a

eee

=) 7 ( 8/\3

|)

4

Suppose now that 7 is a linear transformation from the n-dimensional vector space U, = Uni into the m-dimensional vector space Um = Umi. If i, t2,°**, Un denote the n unit column vectors of (5), §901, constituting the standard basic n-tuple for V,, we can define in the manner of (4), above, n column vectors of Um:

[$902

MATRICES

226 aii

i

a2

0

Qin

0

art

0)

a22

1

Oy

0

l

Qmn

0

QAm2

0

Qm1

(led bagcarl peweabel hee

=v fa

Y

ape?

Seapensi

(15)

As in (5), it follows from the linearity of 7 that if 4 is the matrix (a;;) whose column vectors are given by (15), and if

ii =

:

is an arbitrary vector of Up,

Un

then & = T(i) is the vector of U,, represented:

(16)

U1 v2 U3

2 j=

Opi Wate Cle ates SUS eet et (OCaten ODUM A222 OpsUs alate ae Gants AS1UT G3 22 te 3 3Usy tae a) cia alla .

=

bs

Um

Ami Uy +

Qm2u2

ei QAm3U3

—- ca,

AP AmnUn

n

Or i = > ajjuj,i = 1, 2, --+, m. That is, every linear transformation from 0, into j=1 Um corresponds uniquely toanm X nmatrix A = (ai;) by means of the equations (16).

Let us examine the composite of two linear transformations T and U, where U maps U, into U, and T maps VU, into Un. That is, W = ToU is the linear transformation mapping VU, into U, defined: (17)

W(a) = T(U(@)) for every @ € U>.

Then, as in (9), W is additive and homogeneous, so that all three transformations T, U, and W correspond to rectangular matrices, thus: aii

(ig) 7

ai2

se

Qivd22

Ain

4" a

Ami Gm2

***

gl

bi

biel Oi

Amn

b12

see

baba

bip

ers be

DrinDam*

P|,

Ww:

Dnp

(Ol

C{2

see

C21

C22

°°" C

oF a

Cm1

Cm2°**

Clip

*P

Cmp

By the same reasoning that led to equation (11) we infer from (16) that the matrix C = (c;;) 1s given by (19)

hy,

=

> ODEs k=1

i=

Il, ROP

ONTTE

i

l, Dea

aD

The m X p matrix C = (c¢;;) defined by (19) is called the product of the m x n matrix A = (a;;) and the n X p matrix B = (6;;), written C = AB. Since the ele-

$903]

ALGEBRA

OF

MATRICES

227

ment c;; of C = AB in the ith row and jth column is the scalar product of the ith row vector of A and the jth column vector of B, the product AB of two rectangular matrices can be formed only when the number of columns of the first factor A is equal to the number of rows of the second factor B. Equations (16) defining the linear transformation 7 can be written in the form of the product of two matrices, where the second column vector: U1 (20)

v2

Um

=

/

factor is a one-column

Qit

Giz

°°

24h

ut

a21

a22

=

Q2n

u2

aml

Am2

ay

Amn

Un

matrix, or

;

or more compactly, 6 = Au. In particular, in the case of a linear transformation of U2 into U2, the equations given by (5) become

(21)

UA) v2

iB

Gates a21

012 Ur \ _ ( @iits + ai2u2 azz 7\ U2 a21u, + a22U2 iL QNAS € :) (5) =

Example 2.

\

17 (35)

1 1 3

Nv OA =

903

ALGEBRA

|

|

a) |

WwW — ie) aS

a |

=ee)

Ss

NO Ww MO

OF MATRICES

We first consider in this section the collection 9%2 of all 2 X 2 matrices. We have already seen ($901) that S112 is a vector space. We shall now see that it is in addition an associative algebra (cf. Note, §515), that is, that the following laws hold for all 2 X 2 matrices A, B, and C and real numbers ) and yu: (1)

Homogeneous law:

(2)

Distributive laws:

(3)

Associative law:

(AA)(uB) = (u)(AB).

A(B + C) = AB+ AC, (A + B)C = AC+ BC. A(BC) = (AB)C.

Establishing laws (1) and (2) gives no difficulty, and the proofs are left to Exercise 31, $904. The associative law (3) is more difficult and we give two proofs:

[§903

MATRICES

228

First proof of (3): If A = (aij), B = (b:)), and C = (c;,), then the element in the ith row and jth column of A(BC) is 2 (4)

ye Qik k=1

2 SS DnCns m=1

=

ain(biic1;

SF b12¢2;)

a

aj2(b21¢1 ; ae b22¢2/).

The element in the ith row and jth column of (AB)C is 2 2 OS & aubin) Cmj = (aitbi1 + aizb21)c1; + (aitb12 + ai2b22)c2;. k=1 m=1

(5)

Since the right-hand members of (4) and (5) are equal, (3) is proved.

Second proof of (3): Let T, U, and V be the linear transformations of U2 into U2 that correspond (uniquely) to the matrices A, B, and C, respectively. Then the associative law (3) corresponds to the associative law To(UcoV) = (ToU)oV of transformation composition. Since al/ transformations obey the associative law whenever the composites involved are defined (Example 5, §501), (3) is established. Norte. The laws (1), (2), and (3) always hold for a// rectangular matrices and real numbers \ and uw whenever the sums and products involved exist. Consequently, the set SI, of all n X nmatrices is an algebra, called a matrix algebra, for every n € 9t. Since the commutative law for multiplication of square matrices fails when there is more than one row or column (Note 2, $902), the matrix algebra 91, for n > 1, is noncommutative. This is in contrast to the function-space algebras considered earlier in this book, all of which are commutative. All algebras treated in this book are associative.

Two special matrices play distinctive roles in the algebra Sz of 2 & 2 matrices: (6)

; The zero matrix

7 0 =

al

(9

(7) The identity matrix J = &

:)

These matrices obey the following laws in M2 (cf. Ex. 32, $904):

(8)

The zero matrix:

0--

(9)

The identity matrix:

A= 4-0 {4 ="Ar=

= 4, 0A=

40 = 0,

A)

The matrices 0 and / therefore correspond to the numbers 0 and 1 in the real number

system.

The linear transformation that corresponds to the identity matrix is the

identity transformation that maps every point of U2 onto itself (Example

13, §111).

The determinants of 0 and / are 0 and 1, respectively: |0| = 0, |Z| = 1. (Cf. §905.) Statements similar to the preceding ones apply to the algebra o1,, for any positive integer n, where the zero and identity matrices, denoted 0 and J, respectively, are defined: rere 0 0:--0

(10)

ON

eee 0. OPEeG

(Cf. Ex. 32, $904.)

eee Goro

Het

§904]

904

EXERCISES

229

EXERCISES

In Exercises 1-4, form the indicated linear combination.

1. 6(7) (_}).

2 a} ‘)- (3 =)

Bhs 08) —84(-4910),

4. (4 : ;)= ae es A

In Exercises 5-8, show that the vectors are orthogonal, and find the magnitude of each.

5. Gd; 3, —2), (—7,-5-4).

6. (3, —5, 1), (4, 2, —2).

7.. 6, 25—4, 5G,

8. (1, —2, 7, —6), (—3, 3, 3, 2).

=1, 3,1).

In Exercises 9 and 10, Tis a linear transformation ofU2 into U2 that transforms the vectors (V)and (5)as indicated.

Find the 2 < 2 matrix that represents T.

+0)-() QQ)

)-()70)-0)

In Exercises 11 and 12, T is a linear transformation of U2 into U2 that transforms the vectors (‘)and a) as indicated.

Find the 2 x 2 matrix that represents 7.

at) eG) ee ser) (27) O\nfl

3

\iea

oN

ON

ees

3

ye)

3\_

4.0\;

In Exercises 13-15, find the indicated product. 2

-1\/3

Aye (ies 6

5

1

NG s)

2

ri € pa

i

1\/2

-8

aes it NG ef)

©

te )

In Exercises 16 and 17, Tis a linear transformation of U2 into ‘U3 that transforms the vectors (5)and ( . as indicated.

1 16. (5) =

Find the 3 x 2 matrix that represents 7.

1 2 1 r(_7) =

3 2 |:

1

1

1 17. r(5) =

0) 2 1 7(_1) = 4

3 0 | =

In Exercises 18 and 19, Tis a linear transformation of 03 into U2 that transforms the vectors 1 2 3 0,11), and{ 2 | as indicated. Find the 2 x 3 matrix that represents 7.

0

0

1

[§905

MATRICES

230

In Exercises 20-23, find the indicated product.

1

1

avn

HK:

3

23. |=2

OG

0}:

1 35 5 N/a

6

Ts gs ae ee

dob

a

@):

ly)

20. (5c

=a Oya

YO

ee

eo ee

Onna

In Exercises 24 and 25, verify the homogeneous law for the indicated product.

L(t IG ~o)} |

2

2

[a(t 3)[PG =)

-1

5

©

1

3

In Exercises 26 and 27, verify the distributive law for the indicated product.

(5 OG Ge vl ia IG SIS) 6

—4

3-4

—2

3

—4

7

5

-2

Sek GN.

In Exercises 28 and 29, verify the associative law for the indicated product.

4

0\/2

28. (1 ale

—3\/-1

0

ml 2 at

—3

am" &

2\/1

6

ale gee

2

-—4

5)

30. Prove properties of magnitude (7), (i/), (iii), and (v) $802, for n-dimensional vectors. Assuming the triangle inequality (iv) (cf. Ex. 5, H$520, CWAG), prove properties (vi) and (vii). 31. Prove the homogeneous and distributive laws (1) and (2), $903, for 2 < 2 matrices. 32. Establish the laws (8) and (9), $903, and their analogues for 91,,.

905

DETERMINANTS

In our further work with vectors and matrices we shall find it helpful to use determinants. Although it will be assumed that the student has already had some previous experience with determinants, we shall review the basic definitions and properties sufficiently to permit anyone familiar with the subject to proceed in the text. Most of the determinants to be used in this chapter will be of order 2 or 3, although we shall make occasional contact with determinants of order higher than 3. Let us recall first the expansion of a determinant of order 2, written in the form of a 2 X 2 matrix except that vertical lines are used in place of parentheses:

(1)

4i1

ai2

a21

422

= 411422 —

412421.

The number @11@22 — ai2a21 on the right-hand side of (1) is called the determinant :

of the matrix

E xamplele 1.

ait

aj2

a21

422



Lie ; i =4-6=— 4

2.

0 -—3 (oe x 15. ; 3 (ee 0 — (—15) =

$905]

DETERMINANTS

231

The expansion of a determinant of order 3 has 6 terms:

(2)

Example 2.

Gite

Gio.

ais

421 431

422 @32

423) a33

= @11022033 + a12423431 + a13a21a32 — 411023032 — @12421433 — Q13022431.

1

2

3

0

4

5} = 1-4-6

Oe 3

2—

0

gle

—I

+ 2-5-0

+ 3-0-0

00.68 3-4-0 4

— 1-5-0

0

5 4) = 3-5-1 + (—1)(—4)(=4) 4-0-2-2 = 3-(—4).2

—4

y

1



(—1)-2-1

— 0-5-(—4)

= 15

— 164+ 24+

2 = 25,

Note 1. The “arrow” method indicated below can be used as an aid in evaluating 2 x 2 and 3 X 3 determinants, but is not appropriate for any determinants of order higher than 3:

The word determinant in its strictest sense means a certain real-valued function on the set of all square matrices. The determinant of a matrix A is written det(A), det A, or |A|. We shall also find it convenient to use the word determinant to mean the ordered pair (A, det A), to call the number det A its value, and to interpret the notation of the elements of a matrix contained within vertical bars in this sense. Thus,

@11422 — ai2a21

1s the value of the

determinant

ai1

12

a22

of the matrix

ait 412 ies oo) Listed below are nine properties of determinants, usually proved in any good course in college algebra, which we shall assume. Before listing these properties let us recall the two related concepts of minor and cofactor of an arbitrary element of a determinant. The minor of any element of a determinant of order n is the determinant of order n — 1 (or its value) obtained by removing all elements of the original determinant that are located in the same row or column as the element in question.

Example 3.

wa

i

cats

a

The

3

minors

of the

elements

BALE

: are, in left-to-right

in

the

second

row

i iziee=36 |=33, 81 -36

order:

7

of the

determinant

2

If an element of a determinant is located in the ith row and jth column, then its cofactor is equal to the product of its minor and the number (—1)*”.

232

MATRICES

[$905

Example 4. The cofactors of the elements in the second row of the determinant in Example 3 are, in left-to-right order:

(—1)2+!.33 = —33, (—1)?*?-30 = 30, (—1)?*3-(—9) = 9.

Nine Properties of Determinants:

(i) The value of a determinant is not changed if the rows and columns are interchanged. (ii) If the elements of one row (or column) of a determinant are all zero, the value of the determinant is zero. (iii) If the elements of one row (or column) of a determinant are multiplied by the same constant factor, the value of the determinant is multiplied by this factor. (iv) If one determinant is obtained from another by interchanging any two rows (or columns), the value of either is the negative of the value of the other. (v) If two rows (or columns) of a determinant are identical, the value of the determinant is zero. (vi) If two determinants are identical except for one row (or column), the sum of their values is given by a single determinant obtained by adding corresponding elements of the dissimilar rows (or columns) and leaving unchanged the remaining elements. (vii) The value of a determinant is not changed if to the elements of any row (or column) are added a constant multiple of the corresponding elements of any other row (or column). (viii) The value of a determinant is equal to the sum of the products obtained by multiplying each element of any row (or column) by its cofactor. (ix) If each element of any row (or column) of a determinant is multiplied by the cofactor of the corresponding element of a different row (or column), the sum of these products is zero. Nore 2. When (viii) is used for the evaluation of a determinant, the process is called expansion by, or with respect to, the elements of the row (or column) under consideration.

Example 5.

The matrix of cofactors of the elements of the determinant

1 |—-4 8

2 -3 —9 11 if 6

of Example 3 (with elements in corresponding positions) is as follows:

—131 —33 —5

112 30 |

44 9 |-

Evaluation of the determinant by the elements of the first row gives —131 + 224 — 132 = —39. Similarly, evaluations with respect to the elements of the second and third rows are 132 — 270 + 99 = —39 and —40 + 7 — 6 = —39, respectively. The corresponding evaluations by columns are, from left to right, —131 + 132 — 40 = —39, 224 — 270 +7 = —39, and —132 + 99 — 6 = —39. The value of the determinant by any one of the six preceding evaluations is therefore —39.

$905]

DETERMINANTS

233

Nore 3. The pattern of factors (—1)‘+/ that distinguish cofactors from minors is similar to that of a checkerboard. For a 3 X 3 determinant:

+



+

+-— + The following example illustrates the manner in which (viii), in the preceding list of properties, can be used to simplify the evaluation of a determinant. This is the method that should be used for evaluating any determinant of order greater than 3. Example 6. By (vii), in the preceding list of properties, the value of the determinant of Examples 3, 4, and 5 is the same as that of the determinant obtained by adding to the second column minus twice the first:

1 =G 8

(3)

2 SMa =

23 & 1 NG 6

1 Do ee =e

8 tae 6

It is also equal to the value of the determinant obtained from (3) by adding to the third column three times the first:

1 Om Ate Sicko

(4)

405513 ela 6+ 24

eee = Se

6 a 95030

Expanding (4) with respect to the elements of the first row gives

Example 7.

=)

Sn

rie9

39

alge aed

ei

8

Rae

9

= =32:

Evaluate the fourth-order determinant

it i i il ee ee i235 23h) Ly 2a band Solution.

Subtraction of the first column from each of the others, followed by expansion

with respect to the elements of the first row, gives 1 heer ty te?

1 Zien 2 en

hea! ie tk epee Breads oe ioatmed este lhe 4

i

oO @ @ 111 led at De lg eh HET IT BE Ps iby 3 del ee 3

Again, subtraction of the first column from each of the others, followed by expansion with respect to the elements of the first row, gives

me 2 ee

ais—sl pty i 2S = 1 23> J 38

Le'O:0 11 =|) = |) tse ls 32

The following theorem is a basic and important one in the algebra of matrices. - We shall give a proof only for the 2 X 2 case. For a general proof see §2815, CWAG, or a good book on linear algebra.

[$906

MATRICES

234

The determinant of the product of two n X n matrices A andB is equal

Theorem.

to the product of their determinants: |AB| = |A|-|B|. Proof forn = 2. Let A = (aij), B = (6;;), and C = (cij) = AB. Then routine expansions of the two members of the following equation establishes the desired equality:

aiibi1 + ai2b21 =ai1bi2 + ai2b22 = (11422 — a12a21)(b11b22 — bi 2b21). azibi1 + az2b21 a21bi2 + a22b22 This verification is left to the reader. Example 8.

Verify the preceding Theorem for

a

4=(}

: Solution. i

ak:3)and

e/a ve (2 2)

B =

5A ily ©) : The matrix AB is Ge a whose determinant

‘ = 102 — 32 = 70.

17 19—17| _ is equal to ieee ia =

This is equal to the product of the determinants of A and B,

which are, respectively, equal to 10 and 7.

A corollary that will be found useful in later sections is based on the definition: Definition. A matrix is nonsingular if and only if it is square and its determinant is nonzero. Any other matrix is singular. Corollary.

906

Any product of nonsingular matrices is nonsingular.

EXERCISES

In Exercises 1-8, evaluate the given determinant. 1

a+b

Gap

“la—b

3.

5

a+b

@—-—B

3 4 —1

=7 1 0

5

2,

3,

1

1 =f —

—4 3 1 ae Ce

= 6

10 CES) 6

5). 6 =I

1

20 iS 16

30 WS 26

Oy

th

tke

al

os

i | |

@ 2 3

2 @ 4

8 a ©

85)

i a il i al laee3t 9 Oe ah IG iS) PS)

ae al GE)

Tellme Dene S se8ipe 23): it 2 3 a Al i @ B ae &

@a—b

l= BO O83 SB)

1 1 SS ilé peli GA DEE 1B G25

$907 |

SYSTEMS

OF

LINEAR

EQUATIONS;

CRAMER'S

RULE

235

. In Exercises 9-16, verify the Theorem, §905, for the product of matrices given in the indicated exercise of §904, or for the product given here. 9. Exercise 13. ee



1}, 10)

eed

Ne

7

11. Exercise 15.

O

QYENG

pea)

0

14. | —1

4

ipsa! s

the a

i

Ola

@W° BD 3

Onn

esan 4)

SYSTEMS

OF

Nee

1

—-1

—3}{ -1

—3

4

0)

29

0-08

3

O

EQUATIONS;

©

©

Le

0O

2

O}:

0

4

7

| A2eee Ay0/1098

1G) | 0e6— 7

S84

LINEAR

1 2

1

PONY ets at

15.

12. Exercise 23.

9

3% SiO—l > S6\5

0)

907.

10. Exercise 14.

5, 4 0-6

i

0) ey Dh

Oo ©

©

i

RULE

CRAMER’S

A system of m linear equations in n variables or unknowns

x1, X2,°°°, Xn has

the form

QW api22 tt A inknt = KI, Q2xiX1 4- G22%2 °° + ad5X, =k,

(1)

Lami X1

4

QAm2X2

ar hes =F AmnXn

=

kee

or, in matrix form,

(2)

AX = K,

where A is the coefficient matrix (a;;), i = the n- and m-dimensional column vectors

(3)

EST

1,--:,m,j =

X1

ky

x2

k2

POR ene Xn

1,-:-,”, and X and K are

er Kin

A set of nm numbers x1, X2,:**, Xn that satisfies all m equations (1) — or equivalently a column vector X satisfying (2) — is called a solution of the system of equations (1). In this section we shall be interested first in the case m = n, that is, where A is square and the number of equations in (1) is equal to the number of unknowns. For such systems we now state Cramer’s rule, usually included with work on determinants in courses in college algebra.

Theorem I.

Cramer’s Rule.

Jf the coefficient matrix A of the system (1), with

m = n, is nonsingular, then the system has one and only one solution. :

In this solution

* After the Swiss mathematician Gabriel Cramer (1704-1752). Cramer’s rule is important more for

its theoretical content than as a practical means of obtaining solutions to systems of linear equations.

[$907

MATRICES

236

the value of each variable is equal to the quotient of two determinants; the denominator is the determinant |A\, and the numerator is obtained from it by replacing the column of coefficients of the variable in question by the column K:

xy =

(4)

Example 1.

Ai

Gree Kt vs Gi,

Qo)

Gia

Gy

Gn2)

CU

TEN Ie © 2

a

G21 G22

je

An1

°°

Qn2

Ky

07,

k,

Sana



feel

PO

(8

OSL o> = ili

2p *° Anj

C2; °°"

Gnn

Solve for x in the system Sx -— yt

z= 2,

3x + 4y.— 2z = 7,

x — 2y — 4z = 6. Solution.

By Cramer’s rule, and properties of determinants, §905,

ee

2 healt (oy ie se (2s PS pe ice ee eee ea eer see yl (vine 232), = 120660 A) hd 9 23: dl aview=94 6 I SO a

In case all of the constants ki, k2,---, km are equal to 0 the equations of (1) are called homogeneous. A system of homogeneous linear equations always has the so-called trivial solution, which is the zero vector 0 every element of which is equal to 0: x1 = x2 =:**= X, = 0. The big question here is, ‘‘Does a system of homogeneous linear equations have any nontrivial solutions, that is, solutions other than the trivial solution?” The following theorem and its corollary give the general statements. We present the proofs for the cases nm= 1 and n = 2. For the cases n > 2, see $2816, CWAG. Theorem II.

The system

Gi ier cle Oiahs oe (5)

Q21

ile anne Shey

aie iO) Sean,

IO)

Lanixt + ai2xr +:°++ danXn = 0 of n homogeneous linear equations in the n unknowns x, X2,°**, Xn has a nontrivial solution if and only if the coefficient matrix A = (a;,) is singular:

(6)

PA "or

20:

§907]

SYSTEMS

OF

LINEAR

EQUATIONS;

CRAMER’S

RULE

237

More generally, the system

MINN GHAI

(7)

At2k2) 4 2 22X20 4

Gmi Xie + Gm2X7 +° ofm homogeneous

1 Cink, = 0; a7 X= 0, "+ dmnxX, = 0

linear equations in the n unknowns x1, X2,‘**, Xn, where m = n,

has a nontrivial solution if and only if every n X n determinant whose rows are an arbitrarily selected set of n rows of the coefficient matrix A = (a;;) vanishes. Proof forn = 1: Ifn = 1, the statement of Theorem II simplifies to the following: The system of equations a\x = 0, a2x = 0,°+-, amx = 0 has a nontrivial solution if and only if every coefficient a1, a2,°**, Am is equal to 0. This is obvious. Proof for n = 2: We know from Cramer’s rule that if there exists a nontrivial solution for the system (7), then the subsystem made up of any two of the equations from (7) must have a singular coefficient matrix (if the coefficient matrix were nonsingular there would be only the trivial solution x1 = x2 = 0 found by substituting 0 for the two k’s in (4)). The “‘only if” part follows immediately. For the “‘if” part, ,

Qj

for n = 2, we assume that every determinant of the form (:

aj

2

to the additional assumption that the system (7) has only the trivial solution x1 = x2 = 0. It follows, then, that the related system of m equations in the single unknown x1,

(8)

aiiX1 = ax1

=*** = Amx1 = 0,

has only the trivial solution x; = 0, for otherwise, if x; were a nontrivial solution of (8), (x1, 0) would be a nontrivial solution of (7). By the present theorem for the case n = 1, there must exist an integer j € {1, 2,---,m} such that aj, + 0. We now define X1 = 4j2,

(9)

x2 =

—aj1.

Then (x1, x2) is a nontrivial solution of (7) since for every i =

. oe aiix1 + ai2x2

(10)

ait

nay

= Ai2

ey

1, 2,°°-, m:

= 10)

This is the desired contradiction, and the proof for n = 2 is complete. The following corollary is an easy consequence of Theorem II: Corollary.

(1)

The system aiiX1 az1x1

+ ai2x2 +:°++ atnxn = DO, + a22xX2 +°°:+ arnx, = 0,

AmiX1

ae Am2X2

++ aN: ai AmnXn

=

0

6

es*| where i and/

jl @j2 are distinct integers of the set {1, 2,---, m}, is equal to 0, and seek a contradiction

[$907

MATRICES

238

of m homogeneous linear equations in the n unknowns X1, X2,° °°, Xn has a nontrivial

solution if m Tp = 5: Dee

7S

Za

4y +

4.

3x — Ty

=

il,

In Exercises 5- 8, use Theorem II, § 907, to show that the system of homogeneous linear equations has a nontrivial solution, and obtain the solutions by solving for x in terms of y in Exercises 5 and 6, and for x and y in terms of z in Exercises 7 and 8. 5 ~

8x + 12y = 0, {@ess Gy = 0. 5x + 3y — 4z = 0, 3x +2y+ z=0,

Ue

Ilx

6

20x — 15y = 0, 165 — 12)7 = 0:

6x — Sy — 15z = 0, 8. < 4x — 3y — 1lz = 0,

+ 7y — 2z = 0.

ee = U3)

era

(0)

In Exercises 9-12, use Example 2, § 907, to determine whether the two given vectors are linearly independent or linearly dependent.

‘ ¢

10

GN

AG BP orog 11. (17, 13), (0, 0).

oe

10.5 \36) Nor 32) 12912513) it):

In Exercises 13-18, find the inverse A~! of the given matrix A, and check your answer by forming either the product AA~ or the product A! A.

Biba

15. | —3 1

eed)

6

—-ll

pe 0

{) 4}-

5) ©

a)

Ane

16.

2 0 =>)

0 4 |

—3 6 |1

$910] 17.

THE

GENERAL

Lapel 50" One ise? De Ose.) O00

EQUATION

oO 0) 33 2

OF

THE 18.

SECOND 0 0 Oma —1 1 2 1

DEGREE -1 1 3 0

243

1 4 0 0

In Exercises 19-22, find 4~!, B-', AB, BA, (AB)~', and (BA)~', and check your answer by verifying that (4B)-! = B-!4~! and (BA)! = 4-!B"!,

9.

{34 4 3-7 4= (5 pB=(_3 3)

7h,

Ales

Ll) 25 1 OQ iWeb 73) = ||O Omer 0

-3 1 0

j= 20.4 =(_3 -—4 —6 1

22.

2 A =| —3 1

3 B= =4 1 0

Smez (4 3)

1 OP B= 0

Oo @ il (0 1 4 1225

In Exercises 23-26, solve the system of equations of the indicated exercise by use of matrices and equation (5), §908. 23.

Exercise 1.

24. Exercise

2.

25. Exercise 3.

26. Exercise 4.

27. In the matrix algebra 312, prove that the additive cancellation law A ++ B= A+C3 B=C holds. Prove that the multiplicative cancellation law in the form (A #0) A

(AB = AC) => B = C fails, but that this law in the form (|A| # 0) A (AB =

AC) SB=C

holds.

0 28. Prove that the set of all diagonal 2 * 2 matrices of the form (5 y)where x and y € ®, is an algebra, but not a field. 29. Prove that the set of all nonsingular 2 X 2 matrices with the operation of matrix multiplication is a noncommutative group (cf. Definition II, §302).

910

THE GENERAL

EQUATION

OF THE SECOND

DEGREE

The general equation of the second degree in the two variables x and y can be written in the form (1)

S (x y) = ax? + 2cxy + by? + 2px + 2qy +d = 0,

where a, b, and c are not all equal to zero. In the remaining sections of this chapter we shall study this equation and learn how its graph is related to the graphs of conic sections studied in Chapter 6. In this study we shall make repeated use of techniques of matrix algebra. The function on the left of (1) is called a quadratic polynomial in x and y. To see how matrices are related in a natural way to equation (1), we write the left-hand member of this equation in the form x(ax + cy + p)

(2)

(ny)

eyex SD

q) by Gyaate@).

Guided by the suggestive form of (2) we define two matrices:

[$910

MATRICES

244

beri a= owpiap

(3)

Se be ae: Be Ota:

The matrix E is called the matrix of the equation (1) and e is called the matrix of the second degree terms of (1). Example 1.

For the equation 5x? — 7y? — 12 = 0, the matrices of (3) are

5 (fo

0

5

‘i a1),

0

0

Q

=122

Ee,

0

0}:

For the equation x2 — 12xy + 8y? — 6x + S5y + 11 = 0, the matrices of (3) are 1 e=

—6

6 4;

8

1 )

Jos

—6

-6

—3

8

ee

3

eres6

As an aid in AT of anm X n and jth column n,j = 1,:-+,m.

our study of equation (1) we give two definitions: The transpose matrix A = (a;;) is then X m matrix whose element in the ith row is the element a;; in the jth row and ith column of A, i = l,:--, That is, to put it informally, the transpose of a matrix is obtained by changing its rows into columns and its columns into rows. A matrix A is symmetric if and only ifit is (square and) equal to its transpose: A = A’. The matrices e and E of (3) are thus both symmetric matrices. Example

2.

1

2N\2

lies 1)

(; :) -(

1 8

i —-1 5)

8

a:

— ey Wey

Ip

(*) =e

5

The second degree part, ax? + 2cxy + by?, of the expression f(x, y) of (1) and (2) can be given a more specifically matrix formulation in terms of the two-dimensional column vector

o

Pe ()

With the notation of(3) and (4), the second degree part of f(x, y) is equal to (5)

ax + 2cxy + by? = XT eX,

since (x (2 ‘).) = (x (2 Pe = x(ax + cy) + ylex ++ by) = ax? + 2cxy + by?.* “Technically, X7e X isa 1 X 1 matrix rather than the single element ax? + 2cxy + by? contained within it. However, for simplicity of notation we shall not attempt to distinguish between a singleelement matrix (A) and its lone member \. It is for this reason that we avoid placing parentheses around the left-hand member of equation (5).

§910]

THE

GENERAL

EQUATION

OF

THE

SECOND

DEGREE

245

In general, if A = (a;;) is a symmetric n X n matrix and X is an n-dimensional column vector with elements x1, x2,---, x, in its n rows, then the expression

(6)

XTAX

is called a quadratic form in the n variables x1, x2,-**, Xp. In later sections we shall make use of the uniqueness of the representation of any quadratic form: Theorem

I.

Jf A and B are symmetric n X n matrices, and if the corresponding

quadratic forms have identical values: AT AX = XT BX

(7)

for all n-dimensional column vectorsX with elements x1, X2,°**, Xn, then the matrices A and B are equal: A = B.

Proof forn = 2:

Let A and B be written: Gere

2

aac

a (2 p Bare (¢ oy)

Then, ifX =

Gh we have the equation

(9)

ax? + 2cexy + by? = a’x? + 2c’xy + b'y?,

true for all numbers x and y. With x = 1, y = 0, we have a = a’, with x = 0, y = 1, we have b = b’, and with x = 1, y = 1, we havea + 2c + 6 =a’ + 2c’+ bi, and hence e-= c’. We close this section with a fundamental property of transposes: Theorem II. Jf the product AB of the two matrices A andB can be formed, then the transpose of the product is equal to the product of the transposes in reverse order:

(10)

(AB)? = BTA,

This rule extends to products of any number of factors: (ABC)! = CTBTAT, (ABCD)? ERDLGUBTATA wri.

Proof. In the first place, since the number of columns of A is equal to the number of rows of B, the number of columns of B7 is equal to the number of rows of AT and the product B7A7 exists. Let A be the m X n matrix (a;;), let B be the n X p matrix (b;;), let G = (gi;) be the p X n matrix BT’, and let H = (h;;) be the n Xm matrix AT. Since the element in the ith row and jth column of (AB)? is the element in the jth row and ith column of AB:

a

k=1

oy eee

a P= 1 een.

and the element in the ith row and jth column of B’A7 = GH is de Bik k=1

hy

=

> k=1

bxi a jk

=») k=1

jk

be, J =

1G Wa? Oh

Maal

fleas lab Ake Soa

sy

[$911

MATRICES

246

the desired equation (10) is established. The rule for more than two factors follows from that for two by the associative law. For example, (ABC)! = [(AB)C]? = CYABYE = CBA) = CBA. Example 3.

a=

it

2

(j :)and

B=

ie

Sy

(5 4

g) then

5 i ueAVAlacE CMe mesic: 2IAE caay =|(5 Al 4 ®)| “(i 31 “i = Dist arool ee jal Brat =| 5 4}; > 133! 9 6 215i vf

911

TRANSLATIONS

AND

=

—_

ROTATIONS

If two distinct rectangular coordinate systems in the plane are given, as indicated in Figure 91l-la, then any point p in the plane has two sets of coordinates, one for each coordinate system. The problem of expressing each set of coordinates in terms of the other presents itself. We shall resolve this problem by reducing ay

vy

(a)

ay

(b) Figure 911-1

it to two simpler parts by means of introducing a third coordinate system with axes parallel to those of one of the original coordinate systems and with origin coincident with that of the other system, as shown in Figure 911-15 and c. The transformations corresponding to sets of axes related in these two ways will be called translations and rotations, respectively. Thus, in Figure 911-15 we think of first translating the xy-axes into coincidence with the x’’y’’-axes, and then rotating the x’’y’’-axes into coincidence with the x’y’-axes. In Figure 911-lc we reverse the order, first rotating the xy-system into the x’’y’’-system and then translating the latter to the x’y’-system. I.

Translations

Let the x’-axis be parallel to the x-axis and similarly directed, and let the y’-axis be parallel to the y-axis and similarly directed (cf. Fig. 911-2), and let the origin O’

$911]

TRANSLATIONS

AND

ROTATIONS

247

Figure 911-2

of the x’y’ coordinate system p have coordinates (x, y) in system. Then its two radius systems are similarly denoted regarded as the radius vector

(1)

have coordinates (A, k) in the xy-system. Let a point the xy-system and coordinates (x’, y’) in the x’y’vectors (or radii vectores) in these two coordinate (x, y) and (x’, y’), respectively. Since (h, k) can be of O’ in the xy-system, we have by vector addition

(x,y) = (x,y’) + (, ),

or in terms of numbers, Xe

@

th,

e = Vat K-

The equations (1) or (2) are called the equations of the translation represented in Figure 911-2. Solving (1) and (2) for x’ and y’ in terms ofx and y, we have the equations of the inverse translation: x =x

(3)

—h,

x’,y’) = (% y) — A, &) ‘ ee)

ee

/

In §§603, 607, and 613 we obtained equations of parabolas whose vertices are not at the origin, and of ellipses and hyperbolas whose centers are not at the origin. In every case the transformation of the equations can be considered as the result of a translation of coordinates. In cases where the nature and graph of a conic section are determined by the process of completing a square, we can think of the result as a simplification by translation. We illustrate this idea by two further examples. Example 1.

Solution.

Sketch a graph of the equation y3 — 3y? + 3y = x? + 4x + 5.

Completion of a cube and of a square give

y3 — 3y2 + By —1 =x? 4+4e4+5-1L=?+4e4+4,

vy— 18 =

4+ 2),

or yi

Therefore

the graph

=x",

is a semicubical

X=

eX

parabola

2,

Yo=y



1,

with cusp at (—2,

1).

(Cf. Fig. 911-3.)

[$911

MATRICES

248

G1)? (x42) ye=x?

(—2, 1)

1

+

Figure 911-3

Example 2. Sketch a graph of the equation xy — 5x + 6y + 2 = 0 by means of a simplifying translation.

Solution. Since (x + 6)(y — 5) = xy — 5x + 6y — 30, the given equation can be written (x + 6)(y— 5) + 32 = 0, or SE

BY

The graph is shown in Figure 911-4.

ee se St,

SW ey = 5,

(Cf. Example 6.)

(x + 6)(y—5) = —382 xy = —32

(—6, 5)

Figure 911-4

x!

$911]

II.

TRANSLATIONS

AND

ROTATIONS

249

Rotations

Let the x’y’ coordinate system be obtained from the xy coordinate system by a rotation about the origin O through an angle 6, and let the standard basic unit vectors in the x’y’- and xy-systems be /’, ib i, and i as shown in Figure 911-5. Ifp is an arbitrary point, with coordinates (x, y) in the xy-system and coordinates (x’, x’) in the x’y’-system, then the radius vector Op can be simultaneously represented:

(4)

Op = xit y= x’ + yf.

Figure 911-5

The vector i’ has x- and y-components cos @ and sin 6, respectively, and the vector j', similarly, has x- and y-components cos(@ + 37) = —sin @ and sin(@ + 37) = cos 6, respectively, so that (5)

i’ = cos 6i+ sin 6),

iy =

—sin 6i + cos 6 j.

Substitution of (5) into the right-hand member of (4) gives

(6)

xi + yj = x(cos 0 i+ sin 6/) + y(—sin 6 i + cos 0 /) = (x cos 6 — y sin 6)i + (x’ sin 6 + y’ cos 6)j.

Since i and j are linearly independent vectors, the representation in (6) is unique (cf. Theorem III, $803), and consequently

7 (7)

=x

cos 6 = yy sin 6,

y = x’ sin 6 + y’ cos 8.

Solving (7) for x’ and y’ in terms of x and y, or by independent derivation, we have

(8)

x’ = xcosé+ysin8,

‘ y'

=

—xsiné@+ y cos 6.

The linear coordinate-transformation (7) is called a rotation about the origin, and its matrix

[$911

MATRICES

250

—sin 6

cos@

©)

a

is called a rotation matrix.

& On

ghacos )

With the notation X¥ =

ee the trans-

G and x =

formation (7) and its inverse (8) can be written in matrix form: (10)

ms

Kw

Me == TEX,

Formulas (9) and (10) point up the following fact about rotation matrices: Every rotation matrix r is nonsingular, and its inverse is equal to its transpose: Chie)

ase

(The matrix r is nonsingular

since its determinant is equal to cos? @ + sin? 6 =

I,

and equation (11) follows from

wt = (6088

~

\sing

—sin 0

cos?

cosé/\—siné@

«sin?\ —

fl #0

cosé/

\O 1) |

ou,

(cf. Theorem I, §908).) It should also be observed that the two rows of r are orthogonal unit vectors, and the two columns of r are orthogonal unit vectors. Example 3. gm are

The rotation equations for a rotation of coordinate axes through an angle of

— y), x= Uy3x' y3y¥), \y=40'+

2s

ee ly = i—x + v3»).

Example 4. Simplify the equation 13x? — 8xy + 7y2 = 30 by means of a rotation through the angle 6 = Arctan 2. Then identify the graph. Solution. 1



NG

Since sin @ = 2/5 and cos @ = 1/15, the rotation 1

equations (7) are x =

(x’ — 2y’), y = —=(2x’ + y’), and the given equation becomes

V5

7

AB(x!2 — 4x'y! + 4y"2) — $x"? — 3x'y! — 2y) + FAX? + Ax'y! + y'2) = 30, 5113 — 16 + 28)x’2 + (—52 + 24 + 28)x’y’ + (52 + 16 4 7)y’2] = 30, 5/27) 15/2230) x’2 + 3y'2 = 6,

The graph is an ellipse with major semiaxis 6 and minor semiaxis \/2. Example 5.

By means of the equations for a rotation through an angle of 17,

;x = 4y2("' — y’), ;x’ = yx + y), y= W20'+y),

ly’ =W2C% 4»).

the equation (12)

SVS,

WES)

is transformed to 3(x’2 — y’?) = X, or

(13)

x'2 — yl2 = 2).

§912]

THE

GENERAL

EQUATION

AND

CONIC

SECTIONS

Since (13) is an equilateral hyperbola (cf. Note 2, §611) with asymptotes y’ = equilateral hyperbola with asymptotes x = 0 and y = 0. Example 6.

251 +x’, (12) is an

Describe the graph of

(14)

(Ce = Ip = 1) = Xs,

NO.

Solution. A translation to the point (4, k) followed by a rotation through 17, as in Example 5, shows that (14) is an equilateral hyperbola with asymptotes x = A and y = k. Nore. The remarks made concerning equation (11) and the rotation matrix r extend to n dimensions, where an orthogonal matrix is defined as any nonsingular matrix whose inverse is

the same as its transpose. If A is an orthogonal matrix, then since the determinant |A7| of the transpose is equal to that of A, it follows from the equation AA” = I that |AA?”| = |.4|-|A7| = |A|? = |/|.= 1, and the determinant |A| of A is equal to +1. The linear transformations whose matrices are orthogonal with determinant | are called rotations, and those whose matrices are orthogonal with determinant —1 are called skew-rotations.

912

THE GENERAL

EQUATION

AND

CONIC

SECTIONS

Before stating our goals and outlining our program for the remainder of this chapter, we should first understand exactly what are meant by a degenerate graph and an imaginary graph.” \f f(x, y)is a nonconstant polynomial in the two variables x and y, then the graph of the equation f(x, y) = 0 is degenerate if and only if f(x,y) can be expressed as the product of two nonconstant polynomials, St (x, y) = 2(x, y)h(x, y), where the coefficients in the terms of g(x, y) and A(x, y) may be either real or imaginary. In this case the degenerate graph of f(x, y) = 0 consists of the graphs of the two equations g(x, y) = 0 and A(x, y) = 0. If there are no ordered pairs (x, y) of real numbers x and y such that f(x, y) = 0, the graph of f(x, y) = 0 is imaginary. Example 1. andx+y=

The graph of x2 — y? = O is degenerate, consisting of the two lines x — y = 0 0.

Example 2. The graph of x? + y? = 0 is degenerate, consisting of the two “imaginary lines” x — iy = Oand x + iy = 0. Its real graph consists of one point, (0, 0). It is sometimes called a ‘‘point-circle with radius 0.” Example 3. The graph of x? + y? + 1 = Ois imaginary. One of its “imaginary points” is (0, i). Its real graph is empty. It is sometimes called an “imaginary circle.”

We shall show in this section that the graph of any equation of the second degree is a conic section, or is degenerate or imaginary. In the following section we shall find means of determining the general nature of the graph of a given second degree equation, that is, of determining whether the graph is an ellipse, hyperbola, parab*In the remaining sections of this chapter we shall find occasional use for both real and imaginary numbers. The properties of complex numbers needed for present purposes are no more than those normally studied in high school algebra. For a more formal and extended treatment of complex numbers, see Chapter 12 of A Second Course in Calculus.

ola, degenerate, or techniques, with the of the graph of any Our first task will

(1)

[$912

MATRICES

252

we shall develop refined imaginary. In §§$915 and 916 exact location and shape the finding aid of matrix algebra, for second degree equation in x and y. be to show that if

fy) = ax? = exy + by7 a 2px

gy

ed

0

is an arbitrary equation of the second degree (a, b, and c are not all equal to zero), there exists a rotation r such that the equation in the new variables x’ and y’, (2)

g(x’, y’) =

q’x'2

a

2c'x'y’ ae b’y'2 ans pix

at 2q'y’ = des

0,

has no term in x’y’, that is, c’ = 0. If c in equation (1) is zero, no change need be made, so we assume now that c # 0, and substitute into (1):

(3)

x = x’ cos 6

—.ysin.0,

y=x'sinéd+ y’ cos 6.

The result of this substitution is

(4)

a(x’2 + + +

cos? 6 — 2c(x’2 sin b(x’2 sin? 2p(x’ cos

2x’y’ sin 6 cos 6 + y” sin? @) 6 cos 6 + x’y’(cos? 6 — sin2 0) — y’”2 sin 6 cos 6) 6+ 2x’y’ sin 6 cos 6 + y’2 cos? @) 6 — y’ sin 6) + 2q(x’ sin @ + y’ cos 0) + d= 0.

The coefficient a’ of the terms of (4) in x’? is equal to a cos? 6+ 2c sin @ cos 6 + b sin? 6, and similarly for the coefficients of the terms in x’y’, y’2, x’, and y’, and for the constant term d’. The full set of coefficients is: (a’ a cos? @ + 2c sin 6 cos 6 + 6 sin? 6, b’ = asin? 6 — 2c sin 6 cos 6 + b cos? 4,

(9)

c’ = —(a — b)sin @ cos 6 + c(cos2 6 — sin? 6), p’ / = pcos é+qsin 6, q’ = —psinéd+qcos0,d' =d.

The third equation of (5) involving c’ can be written c’ = —3(a — b)sin 26 + c cos 26, so that c’ is zero for any angle @ such that 2c cos 26 = (a — b)sin 26, or for any angle @ such that

(6)

Colne —b DE

Before continuing, we pause to draw a useful inference from the first three of equations (5). It is clear from the form of these equations that if a = 6 = c = O then a’ = b’ =’ = 0, and therefore that whenever equation (1) fails to be of the second degree, then equation (2) also fails to be of the second degree. In other words, by the

contrapositive, if equation (2) is of the second degree, then so is equation (1). The converse is also true since equation (1) can be obtained from (2) by means of the

rotation 7~! that is the inverse of the rotation r, and the argument just presented applies in equal measure to this inverse rotation: Equation (2) is of the second degree whenever (1) is. (Cf. Ex. 37, $917.) Since equation (1) was assumed to be of the second degree we can now proceed with the knowledge that (2) is of the second degree too.

§912]

THE

GENERAL

EQUATION

AND

CONIC

SECTIONS

253

We now drop primes from equation (2), for simplicity of notation, and assume that a coordinate system has been chosen such that c = 0:

(7)

ax? + by? + 2px + 2qy + d= 0.

There are two main cases to consider:

Case (i): ab ¥ 0. If a and b are both nonzero, then the method

of completing

squares discussed in §§607 and 613 permits the application of a suitable translation that simplifies the equation (7) to a form where the coefficients of the variables x and y vanish:

(8)

ax? + by2+d=0.

If a and b are of one sign and d is of the opposite sign, then the graph of (8) is an ellipse (possibly a circle). If a, b, and d are all of the same sign, then the graph of (8) is imaginary (sometimes called an imaginary ellipse). If a and 6 are of one sign and d = Q, then the real graph of (8) consists of one point only (the origin), and the graph is called degenerate (if imaginary quantities are permitted, the graph of ax* + by? = 0 reduces to two imaginary lines meeting in the real point (0, 0); for

instance, if a and b are positive, ax? + by? factors into (Jax + ivby)(yax — ivby), and the two “imaginary lines” are given by the two equations Vax + If a and b have opposite signs and d # 0, then the graph of (8) is a Finally, if a and b have opposite signs and d = 0, then the graph of (8) two intersecting straight lines (for example, if a > 0 and 6 < 0, then

ivby = 0). hyperbola. consists of ax? + by?

factors into (Vax + —by)(Vax — \—by), and the two intersecting lines are given by the two equations Vax + \—by = 0). Case (ii): ab = 0. Since not both a and b can be equal to zero (the equation (7) is quadratic rather than linear), we shall assume for definiteness that a # O and b = 0, and by means of a translation (found by completing a square) we can assume further that p = O so that (7) reduces to

(9)

ax? + 2qgy

+d=0.

If g # 0, an additional translation simplifies the equation to one without a constant term (cf. § 603): (10) whose graph is a parabola.

(11)

ax2 + 2qy = 0, Ifq = 0, (9) becomes

ax2+d=

If a and d have opposite signs, then the graph sisting of the two parallel lines x = +\/—d/a. graph of (11) is imaginary (also degenerate and nary lines x = +\/—d/a). Finally, if d = 0, consists of the two identical lines x = 0 and x -

0. of (11) is If a and consisting the graph

the degenerate graph cond have the same sign, the of the two parallel imagiof (11) is degenerate and

= 0.

Example 4. In Example 4, § 911, for the equation 13x? — 8xy + We = 30 SQ, @ Sal b =7,andc = —4. With @ = Arctan 2, from (12), §701,

[$913

MATRICES

254 76

1

1 — tan2 6

CCl et arte)

il = al

AG

Sed idee

4

Ait

De

in agreement with (6).

913.

EXERCISES

In Exercises 1-4, write down the matrix E for the given quadratic polynomial. 1. 3x2 —

2. 5x2 + y?.

5)? — 2.

4. 6xy + 3y2 — 2x 4+ 12y 4+ 9.

3. x2 — 4xy + 10x — 7.

In Exercises 5 and 6, write down as a quadratic polynomial the quadratic form given in matrix form.

so o(3 aC)

ac a(t 4)())

In Exercises 7-10, verify that (AB)” = BTAT for the given product AB.

8.5 -9(} ~9)

~G ~)G) 2

—5\/6

3

acer ee te aA 9, la Tr s) = eo aes F bu Lie 3

—4

Db = in ees 10204 6 52 Tiel ii

pass) peak NA |

In Exercises 11 and 12, write down the equations for the translation to parallel axes passing through the given point (A, k) in the original coordinate system. Also give the equations for the inverse transformation.

1G

1k) = 4, =7):

12. (h, k) = (—3, 10).

In Exercises 13 and 14, write down the equations for the rotation about the origin through the given angle. Also give the equations for the inverse transformation.

13. Arcsin (— #).

14. Arccos (— 2).

In Exercises 15-18, with the aid of the formulas cot 20 = (2 tan 6)/(1 — tan? 4) or cot 20 = (cot? @ — 1)/2 cot 6, show through the given angle @ would transform the given equation the term in x’y’ is 0. Do not actually perform the rotation

(a — b)/2c and either tan 20 = that a rotation about the origin to one in which the coefficient of substitutions.

15. @ = Arctan 3; 3x2 — 6xy — S5y2 — 20 = 0. 16. 6 = Arccot 3; 7x? + 6xy — y2 — 15 = 0.

17. 6 = Arccot3; 8x2 + 12xy + 3y2 — 12 = 0. 18. 6 = Arctan 3; 7x? — 12xy + 2y? — 18 = 0.

914

INVARIANTS

Whenever a translation degree, of the form

or a rotation is applied, any equation of the second

§914]

INVARIANTS

(1)

255

f(x, y) = ax? + 2cxy + by? + 2px + 2qy +d =0,

with matrix

GG E=\|{c

(2)

Fe 6b qh

pqed is transformed into a new equation of the form (3)

2(x’, y’) =

a’x!2

aL NEC Ue +

b’y’2 ae 2p’x!

ae 2q'y' ae d' =

0,

with matrix /BY

(4)

=

a’

c’

p’

c’

b’

q’

p’

q’

al

B

The new coefficients a’, b’, c’,--- may in some cases be entirely distinct from their

original counterparts a, b, c,---, and in other cases be only partially altered. Example 1.

In Example 2, § 603, the equation

(5)

x? — 6x — By —7=0

was altered in form by a completion of a square and factoring, to (x — 3)? = 8(y + 2). In other words, by means ofthe translation x’ = x — 3, y’ = y + 2, or equivalently, x = x’ + 3, y = y’ — 2, equation (5) is simplified to (6)

Xe

The matrices E of (5) and E’ of (6) are

(7)

E=

a

1

053

0

0

rae

—4)

1 E’=]0

Oey

0

0

QO

—4}-

On =a)

20

In this example the coefficients a, b, c, and g are unchanged; that is,a = a’ = 1,b = b’ = 0, c=c' = 0, and g =q' = —4. However, p = —3 is changed to p’ = 0, and d = —7 is changed to d’ = 0.

Example 2.

In Example 2, §607, the equation

(8)

x? + 5y2? + 6x — 20y — 151 = 0

was altered by two completions of squares to (x + 3)? + S(y — 2)? = 180. In other words, by means of the translation x’ = x + 3,y’ = y — 2, or equivalently, x = x’ — 3,y = y’ + 2, equation (8) is simplified to

x'2 4+. 5y’2 = 180.

(9)

The matrices E of (8) and E’ of (9) are

1 (10)

E=|0 3

0 5 —10

3 —10} —151

i

E’=[{0

@ 5 02-0

0 0 |}: 2150

In this example the coefficients a, 6, and c are unchanged: a = a’ = 1, Db = bY = 5, aval c = c' = 0, andp = 3 is changed to p’ = 0, g = —10 is changed to q’ = 0, and d = —151 is changed to d’ = —180.

[$914

MATRICES

256 Example 3.

In Example 4, §911, the equation

(11)

13x2 — 8xy + 7y? = 30

is transformed by a rotation to

5x!2 + 15y’2 = 30

(12)

(and then by division by 5 to x’? + 3y’2 = 6). The matrices E of (11) and E’ of (12) are

13 (13)

—4

E=\-4

0

7 0

0

Ol; —30

J

5

0

0

=O 0

is 0

0 }—30

In this example, a = 13 is changed to a’ = 5, b = 7 is changed is changed to c’ = 0. The other coefficients are unchanged:

to b’ = 15, and c = —4 p = p’ = 0, g=q' = 0,

Gl = Gh es 30):

The three preceding examples illustrate the idea of invariance when an equation is transformed by a change of coordinates. In Examples | and 2 the coefficients a, b, and c of the terms of second degree are all individually invariant. In Example 3, every coefficient except for a, b, and c is invariant. In all three of these examples there are further invariants that involve the coefficients in combination rather than individually. We mention one of these now: the determinant of the matrix E is

invariant.

That is, |E| = |E’| in every case:

3 0 3 NB —4 0

0

OF QO

3 —4

dT

0 Sa LOM

3 ior

ees 0 5 7 O}=)0 O30 0

=

I (0

0 0)

Oy

fo Ol ON 0

0 = 4) =

ee 00

ae0 ee 0

0 Oo = —180

0 0 -15 Oe 030

16,

=900,

71530

2250;

Our primary goal in this section will be to establish certain invariants in general, for both translations and rotations. We start by observing from the last equation of (5), $912, d = d’, that d is an invariant for rotations. More precisely: Theorem I. For any rotation about the origin, the constant term d in the general equation of the second degree is an invariant: d= d’. (in Examples3,.d = "ds — -430)) We

now

turn

our

attention

to translations.

If the

translation

x = x’ + 4h,

y = y’ + kis applied, substitution into the general equation of the second degree, (1), followed by expansion and collection of terms, produces a set of relationships between the primed and the unprimed coefficients, somewhat similar to the equations (5), $912, for a rotation. (Give a proof of the following formulas in Ex. 35

§917.)

:

§914]

INVARIANTS

257

(a’ = a, Di=b, Cr & p’ I =ah+ck+p, q’ =ch+bk +4,

14 Ge,

|d’ = ah? + 2chk + bk2 + 2ph + 2gk +. We conclude:

Theorem If. For any translation applied to the general equation of the second degree, the following are invariant: a, b, and c. In other words, the matrix e of the second degree terms is invariant:

a o

¢ Pisa Le s)=e 26

2a

ccs ih

The simplest invariant involving more than one coefficient is the sum a + 6 of the terms on the main diagonal of the matrix e, called the trace of e: Theorem III. of e is invariant:

For any translation and for any rotation about the origin, the trace

(16)

at+b=a

+5

Proof. The fact that a + 6 is invariant with respect to translations follows from the fact that a and 6 are individual invariants, a = a’ and 6 = b’. To prove that a + 6 is invariant with respect to rotations is only slightly more difficult. If we add the first two lines of (5), $912, we get (17)

a’ +b’ = (acos26 + 2c sin 6 cos 6 + b sin20) + (a sin20 — 2c sin 6 cos 6 + 6b cos?@)

= a(cos26 + sin2@) + b(sin2@ + cos?6) = a+ b, and the invariance of a + b is established. This particular invariant will be exploited especially in §916. (In Example 3,@ +b = 134+7=a' +0’ =5+ 15.) Finally, two invariants that will be of more immediate concern are the determinants of the two matrices e and E. We shall establish the invariance of the determinant D of e. Proof of the invariance of the determinant A of the matrix E is given in §2824, CWAG. Theorem IV. For any translation and for any rotation about the origin, each of the following determinants is invariant:

(18)

D=lel\=[ 5

Proof for D for translations:

(Oh

XG

Since e is invariant so is D.

Proof for D for rotations: With the notation X7 e X of (5), $910, and with that for the rotation r, given by (10), §911, X = rX’, we have by substitution, and Theorem II, $910,

[§914

MATRICES XT eX = (PX e (rX’) = XT ev)X’,

258 (19)

and therefore, by the uniqueness ofthe representation of a quadratic form (Theorem I, § 910), the transformed matrix of e is

ol bi)

(29)

|

sin@\/a

cos@

en

ot eee

CN

0

—sin 6)

c\/cosé

\sinie | aces2

\eib)

©" | \ sin 6 cos)

and consequently, when we take determinants (§ 905): (21)

D

pret a c!

S|) Senet b! ci |7 | le|

Ir|

Mr Ley

pee

C

el = b

D,

and the invariance of D is established.

Before drawing up our conclusions that enable us to identify graphs of second degree equations we need to show that two additional properties are invariant, this time with respect to multiplying all members of an equation by a nonzero constant (give a proof in Ex. 36, $917): Theorem V. Jf the equation (1) is altered by multiplying both members by the nonzero constant k, so that

(22)

a! = ka, 6! = kb, c! = ke, p! = kp,’ = ka, d’ = kd,

then the following are invariant:

(1) (ii)

the sign of D, whether A vanishes.

We are now and V:

ready to put to work

the invariants identified

in Theorems

IV

Theorem VI. Depending on the sign of D = ab — c? and the vanishing of A, as defined in (18), above, the nature of the graph of the equation (1) of the second degree is given in the table: ID = Gh) =

(23)

A #0 A=

€ > OID

=

oy

=

© 0 and D < 0. It should be appreciated that both

§914]

INVARIANTS

299

real and imaginary ellipses are included under D > Oand A # 0. The more delicate analysis of -§916 is needed to distinguish between real and imaginary ellipses. We now consider the case D = 0, which corresponds to case (ii), §912. Since the matrices of (10) and (11), §912, are

0810 quo"0 On 0F gle and s1.0-70 0); Oman 00° od

(24)

respectively, the case A # 0 corresponds to a parabola, and the case A = 0 corresponds to a pair of parallel (possibly identical) lines. Corollary. A necessary and sufficient condition for the graph of the equation (1) to be degenerate is that the determinant A = |E| vanish. Note 1. Because of Theorem VI, a point (or pair of intersecting imaginary lines) is also called a degenerate ellipse, a pair of (real) intersecting lines is also called a degenerate hyperbola, and a pair of parallel lines is also called a degenerate parabola. Furthermore, in this same spirit, the graph of any equation of the second degree in x and y is often called a conic section. Example 4. the form:

The invariance of the determinant

(25)

13

=

Solution.

7),P 10) 15),

Identity the conic section 2x? — 4xy + 3y? — 6x + 10y —1=0. Since

puuliea A 2

the curve is an ellipse.

—2

Since

SSO

2

—2

the ellipse is nondegenerate.

-3

rey re

—3

Example 6.

3 takes

S)e4)5.40

—4> Example 5.

D of the matrix e for Example

5s

ee Cay

1

For further information see Example 3, §916.

Identify the conic section 2x2 — xy — 6y? + Sx + lly — 3 = 0.

We first double all coefficients in order to avoid fractions in the matrices e and E: Solution. 4x2 — 2xy — 12y2 + 10x + 22y — 6 = 0. Since

4

-—-1

lie Is S eet the curve is a hyperbola.

Since

4 A=j=1 5

—-1 —=12 Li

5 11)=0, 6

[$915

MATRICES

260

the hyperbola is degenerate, and consists of two straight lines. lines, see Example 3, §915.

Identify the conic section x? — 6xy + 9y? + 8x — 4y +5

Example 7. Solution.

For determination of these

= 0.

Since

1

be e the curve is a parabola.

—3

A =i

Since

= the parabola is nondegenerate.

1 |-3 4

—3 9 -2

4 —2) = —100 ¥ 0, 5

For more information, see Example 4, §916.

Note 2. As illustrated in Example 7, a second degree curve is a parabola (possibly degenerate) if and only if the second degree terms form either a perfect square or a constant multiple of a perfect square. The reason for this is that the simplified equations represented by the matrices (24) have only the single term ax’? of the second degree. The second degree part of the original equation, then, must be equal to the product of the constant a and the square of the linear expression x’ in terms of x and y.

915

SIMPLIFICATIONS

BY TRANSLATIONS

Ellipses and hyperbolas are called the central conics because each has a center of symmetry (§§605, 609). In order to study central conics in the context of the general equation of the second degree, (1)

ax? + 2cxy + by? + 2px + 2qgy +d =

0,

we see first of all what it means for the origin to be a center of (1). In §115 we defined the origin to be a center, or point of symmetry, of a graph I if and only if whenever the point (xo, yo) € T, then the point (—xo, —yo) € I also. We retain this definition, but must now permit the numbers xo and yo to be complex, in order to include the degenerate and imaginary graphs that may arise. With this under-

standing we have the following theorem, for which we shall present here simply the “if? part of the proof.

Theorem I.

(For the “only if” part, cf. $2826, CWAG.)

The origin is a center of the graph of (1) if and only if p= q = 0.

Proof of “if” part:

Ifp = q = 0, then the origin is a center of (1) since

(2) a(—xo0)? + 2c(—x0)(— yo) + b(—yo)? + d = axo? + 2cxoyo + byo? + d. We now have the means of setting up a procedure for finding the coordinates of the center of any central conic:

Theorem II. The point (h, k) is a center of the graph of (1) ifand only if the following two equations obtained from the first two rows of the matrix E, of (3), §910, are satisfied:

§915]

SIMPLIFICATIONS

(3)

BY TRANSLATIONS

Ne

261

micah

ch + bk +q = 0. Furthermore, if (h, k) is a center of (1), and if a translation to this point as the origin of an x'y'-coordinate system is made, the new constant term d’ of the transformed equation is obtained from the third row of E as follows:

(4)

d' = ph+qk +d.

Proof. If a translation to the point (h, k) (whether (A, k) is a center or not) is made, the coefficients of x’ and y’ in the transformed equation relative to the coordinate system having (A, k) as origin, are 2p’ and 2q’, respectively, where p’ and q’, as given by (14), §914, are the left-hand members of the equations (3) above. Therefore, by Theorem I, this point (A, k) is a center if and only if p’ = q’ = 0; that is, if and only if the system (3) is satisfied. Assume now that (A, k) is a center of (1), and write the last equation of (14), §914, as follows: (cf. (2), §910):

(5)

d= h(ah + ck p)'-+ k(ch + bk + q)-+ (ph + qk + d).

By (3), the quantities in the first two pairs of parentheses are zero, and the righthand member of (5) reduces to (4). This completes the proof. Example 1. Simplify the equation x2 + Sy? + 6x — 20y — 151 = O of Example 2, § 607, by means of a translation to its center.

1 Solution.

0

Since E = | 0

3

5

3

-10

—10

}, equations (3) for the center become

—151 h+

(6)

3=0,

Ne eee

with the solution h = —3,k = 2. The new constant term d’ is equal to d’ = 3(—3) — 10(2) — 151 = —180, and the simplified equation is x’* + Sy’? = 180 (cf. Example 2, § 914). Example 2. Simplify the equation 2x? — 4xy + 3y? — 6x + 10y — 1 = 0 of Example 5, §914, by means of a translation to its center.

2 Solution.

Since

E =|

—2

—2

3

—3

5

-3 5 }) the equations (3) for the center become —-1

2h — 2k —3

7)

= 0,

Ey eeket 0,

with the solution

A=

—4,k

=

—2.

The new constant term is d’ = (—3)(—5) + 5(—2),-

1

= — 4, and the simplified equation is therefore (8)

Qx'2 — 4x'y! + 3y’2 = 412.

Example 3. Simplify the equation 4x2 — 2xy — 12y? + 10x + 22y —6=0 6, §914, by means of a translation to its center. Solution.

Since E =

4

—-1

5

{—1 5

—12 11

11 —6

of Example

J, the equations (3) for the center become

[$916

MATRICES

262

4h— —h—

(9) with the solution

A=

k+ 5=0, 12k + 11 = 0,

—1, k = 1. The new constant term is d’ = 5(—1) + 1101) — 6 = 0,

and the simplified equation is 4x’2 — 2x’y’ — 12y’ = 0, or with x’ = x + land y’ = y — (10)

2x’2

Therefore 2x + 3y

916

—x’y’ — 6y'2 = (2x44 3y)Q! — 2y’) = Qe + 3y —

the degenerate hyperbola of Example x —2y +3 =0. —1 =Oand

SIMPLIFICATIONS

6, §914,

Ix — 2y $3)

consists

of the

1,

= 0.

two

lines

BY ROTATIONS

In order to simplify an equation (1)

ax? + 2cxy + by? + 2px + 2qy

+d = 0,

where c 0, by means of a suitable rotation, we may use equation (6), $912, in conjunction with the formula (cf. (12), $701) for the tangent of 20 in terms of ¢t= tan@,

(2)

cot.20 =

Cs

Xe

> Dele9

aye

and solve the resulting quadratic equation in /: (3)

ct?

— (6 — a)jt—c = 0.

Since the product of the roots of (3) is —1, one root must be positive and one negative. For definiteness we shall denote by the letter ¢ the positive root of (3). In other words, we shall consider a rotation through a /first-quadrant angle 6 such that (4)

tan 6 = ¢ = the positive root of (3).

From this point we could sin 6 = t/y1 + 12), and x = x’ cos@ — y’ siné, y ous and cumbersome and,

proceed by finding cos @ and sin @ (cos 6 = 1/y1 + #2, substitute into (1) the rotation formulas ((7), §911), = x’ sin @ + y’ cos 6. However, this procedure is laborifortunately, unnecessary.

Associated

with the matrix e = (: 5)is the following quadratic equation, b known as the characteristic equation of e:

(5)

PanelGime

ats C= =O.

where D = |e| = ab — c?. The important property of the characteristic equation (5) that will be of value to us is that it is invariant with respect to both translations and rotations, that is, the two coefficients —(a + 6) and D are both invariant (Theorems III and IV, §914). To put it in more specific terms, if equation (1) is transformed into a new equation as the result of any combination of translations and rotations, and if the notation for the transformed equation is the same as for (1) except for the use of primes, then for every real number £,

(6)

le-H=P-@+b}%E+D = P-@tbE+D=le— Hl,

where D’ = |e’| = a’b’ — c’2.

$916]

SIMPLIFICATIONS

BY ROTATIONS

263

Now suppose that equation (1) is transformed by the rotation about the origin through the angle @ prescribed in (4), so that c’ = 0. Then, in terms of the new primed coefficients, the characteristic equation of e’ becomes

(7)

ei

2 — (a

be + ab’ = (6

aye — bY) = 0,

and a’ and b’ are the roots of the characteristic equation (5). These roots are called the characteristic roots or eigenvalues” of the matrix e (and also of the transformed matrix e’). Thus, it appears that in order to obtain the simplified matrix e’ all that is necessary is to solve the characteristic equation (5), and place the two roots Ie a 0 oh , . a’ and b’ on the main diagonal: e’ = ( ) However, it is not quite that simple

Ob’ because we do not yet know which eigenvalue is a’ and which is 5’! The answer to this question is given in the following theorem, whose proof is givenin §2827 CWAG:

Theorem. /f equation (1) is transformed by the rotation about the origin through the angle 6 prescribed in (4), then the coefficient a’ of x'? is the larger of the two characteristic roots of the matrix e in case c > 0, and is the smaller of the two roots in case c < 0. Furthermore, the value of t = tan 0 of (4) is equal to (8)

i

a—a' Cc

pat ~

change in either diagonal element C

C

We summarize procedures for simplifying equations of the second degree. There are two principal cases.

I. Central conics. matrix E to the form (9)

First translate axes to the center in order to simplify the

Fa

@

@¢@

C 0

De} © @

W

Then solve the characteristic equation &2 — (a + b)§ + D = 0, and denote by a’ the larger or smaller root according as c is positive or negative. The simpli-

fied equation is then

(10)

a’x’2 + b’'y’2 +d =),

obtained by rotation through the first-quadrant angle Arctan 7, where / is given by

(8). II. Parabolas and parallel lines. First rotate axes through Arctan /, where f is given by (8). The algebraic details can be simpiified by using the fact that the second degree part ax? + 2cxy + by? of (1) — at least after multiplication of (1) by a suitable constant — is a perfect square. After the rotation simplification is complete, a suitable translation can be found by the method of completing squares in order to place the vertex at the new origin in case the curve is nondegenerate. (meaning *The word eigenvalue is an interlingual hybrid, made up of the German word eigen characteristic, proper, peculiar, or own) and the English word value.



[$916

MATRICES

264

Example 1. Simplify the equation 13x? — 8xy + 7»? = 30 of Example 4, §911, by means of the eigenvalue techniques of the present section.

13 The matrix e is ee

Solution.

—4 ne } : ‘)and the characteristic equation of e is & — 20 +

75 = 0. Since the characteristic roots are 5 and 15 and since c = —4 < 0, we place 5 and 15 on the main diagonal of the new matrix e’ with the smaller root 5 appearing first:

SX agit : : Lge is) Since the constant term d is invariant under rotations about the origin, the 0 simplified equation is 5x’? + 15y’2 — 30 = 0, or x’2 + 3y’2 = 6. The acute angle of the 13 —5 7—15 = 2. The graph is shown in rotation, by (13), has tangent equal to 4 4 Figure 916-la. Y=

(

Example 2.

Simplify the equation 2x2 + 4xy — y2 — 12 = 0 by means of the eigenvalue

techniques of the present section.

Solution.

SAR i 2 met ; : The matrix e is (; ee:)and the characteristic equation ofe is



§ -6 = 0.

Since the characteristic roots are 3 and —2 and since c = 2 > 0, we place 3 and —-2 on the a . ; 3 0

main diagonal of the new matrix e’ with the /arger root 3 appearing first: e’ =

0

oh.

Since the constant term d is invariant under rotations about the origin, the simplified equation

is 3x’ — 2y’2 = 12. The acute angle of the rotation, by (13), has tangent equal to —1l+2 2

1 =D

7|-

The graph, as shown in Figure 916-14, is a hyperbola with transverse axis 4

and conjugate axis 2/6.

(a) Figure 916-1

Example 3. Complete the analysis of the ellipse 2x2 — 4xy + 3y2 — 6x + 10y —1=0 of Example 5, §914, and Example 2, §915, by means of a rotation about the origin in the x’y'-system. Draw a graph of the curve, showing all three coordinate systems.

$916] Solution.

SIMPLIFICATIONS

BY

ROTATIONS

265

We first simplify the equation by a translation to the point (— 3, — 2), as in

Example 2, § 915: 2x’2 — 4x’y’ + 3y’2 = 4%. The characteristic equation is

— 5+ 2 = 0,

and the eigenvalues are therefore 3(5 + 17) = 0.44 and 4.56, to two decimal places. Since c = —2 < 0 we place 0.44 and 4.56 on the main diagonal of the new matrix e’’ with the smaller root 0.44 appearing first: e’’ = ( es

new third coordinate system.

hee where double primes are used for the

Since the constant term is invariant under rotations about the

origin the simplified equation is $(5 — V17)x’”? + $65 + V17)y'? = 42, or approximately 0.44x'’2 + 4.56y’2 = 4%. The acute angle 6 of the rotation has tangent equal to wee! = 0.78 (approximately), and @ = 38° (to the nearest 10’). The major and minor semiaxes are approximately 4.65 and 1.44, respectively, to two decimal places. The graph is shown in Figure 916-2a. Example 4. Use the eigenvalue techniques of this section to find the vertex and axis and simplified equation of the parabola x2 — 6xy + 9y? + 8x — 4y + 5 = 0 of Example 7, §914. Draw its graph. : 5 ae a, 1 -3 Solution. We start with a rotation of axes about the origin. The matrix e is (ne a:

the characteristic equation is £ — 10 = 0, and the eigenvalues are 10 and 0. Since c = —3 < 0 we place 0 and 10 on the main diagonal of the new matrix e’ with the smaller 0 @ ima root 0 appearing first: e’ = ( io) The acute angle of the rotation is 6 = Arctan 3, so that the x’-axis is the line x — 3y = 0 and the y’-axis 916-26). The rotation equations (7) and (8), § 911, are

(

if (

| )

3x’ — y’

10

a

c

e tenry



*i x!

+

, )

xXx

3y’

pe

3x+y =

ee.

a

(a) Figure 916-2

is the line 3x + y = 0 (Cf. Fig.

10 —

=e

a

meio

3y

[$917

MATRICES

266

The original equation can now be written

(x — 3y)2 + 8x — 4y +5 = (10 y’P +

20x’ — 20y’

Vio

+5=0,

or

Cae)

rl a)

or approximately,

(y’ — 0.32)? = —0.63(x’ + 0.63). With x” = x’ + +10, or x’ + 0.63, approximately, and with yp” = y’ — zovi0, or y’ — 0.32, approximately, the simplified equation can be written

yee approximately.

—sNtOx

or

3/2 =) — 0.63 x”,

The vertex of the parabola in the x’y’-coordinate system is at the point

x’! = — $\10, y’ = Zoy10. In the original xy-coordinate system, by (16), the vertex is at the point x = — 75,» = yy. The axis of the parabola is the line through the vertex with slope 3, and therefore (in the original coordinate system) is the line with equation y — 75 = 4(x + 75), or x — 3y + 1 = 0. The graph is shown in Figure 916-26.

917

EXERCISES

In Exercises 1 and 2, transform the given equation by means of the given translation, and verify the invariants described in Theorems II and IV, §914. 1. 3x2? — 4xy

+ y?+2x

—4V4+5=0;

2. 2x? = Dey a 3y? — 4x 2 2y

x =2%'4+3,y=y' -—4.,

6 = 0; 4 =x

— 2

= yy 3:

In Exercises 3 and 4, transform the given equation by means of a rotation about the origin through the given angle, and verify the invariants described in Theorems I, III, and IV, §914.

3. 7x2 + 6xy — 4y? — 10 = 0; Arctan 2.

4, 3x2 — 2xy + 2y? — 7 = 0; Arctan $. In Exercises 5-12, use Theorem VI, §914, to identify the given conic section. 5, 2x2 + 6xy

+ y? —4x —2y+12=0.

7. 9x2 — 12xy + 4y? 4+ 2x 4+ 2y = 0. 9, 3x? — 4xy + 4y? + 2x — 6y — 30 =0. 11. x2 — 4xy + 4y? + 2x — 4y = 0.

6. 2x? + 6xy + 5y? — 2x — 4y — 20 = 0.

8. 3x2 + 2xy — y2 + 16x 10. 3x2 +

—4y +5 =0.

2xy — 2y2 — 6x + 2y + 10 =0.

12. 3x2 + 12xy + 12y? — 2x —6y

+5 =0.

In Exercises 13-16, find the center, and simplify the given equation by means of a translation to this center as the new origin.

13. 2x2 — 4xy + 5y? — 12x + 24y + 24 =0.

§917]

EXERCISES

14, x? + 4xy

+ y? + 10x

15, x2 — 2xy

—y?— 2x 4+10y-7=0.

267

+ 8y + 16 = 0.

16. 3x2? + 2xy + y2?-—4y+4=0.

In Exercises 17-26, use eigenvalue techniques to simplify the given equation by means of a rotation about the origin through an acute angle 6. Identify the conic section. Find tan 6, and draw a figure, showing both sets of axes and a graph of the conic.

17. 2x2 — 4xy + 5y? 19. x2 — 2xy

—6 = 0.

— y? = 0.

18.

x2 ++ 4xy

+ y?+3=0.

20. 3x2 + 2xy + y2?—2=0.

21. 10x? — 12xy + 5y? — 14 = 0.

22. 8x2 + 8xy — Ty? = 0.

23. x2 — 4xy + 3y?+1=0.

24.

25. 11x? + 10xy — 13)? — 84 = 0.

26. 30x? — 20xy + 9)? = 0.

x2 + 2xy + 5y? —4=0.

In Exercises 27—30, draw a graph of the conic section in the specified exercise with the aid of the simplification achieved in the exercise indicated within parentheses. Show three sets of axes. DPE XCleisenlon(E

xen li):

28. Exercise 14 (Ex. 18).

29. Exercise 15 (Ex. 19).

30. Exercise 16 (Ex. 20).

In Exercises 31-34, use both a rotation about the origin and a translation to simplify the given equation. Identify the conic section. Give the tangent of the acute angle of rotation 0. In case of a parabola, write the equation of the axis of symmetry and give the coordinates of the vertex in the original coordinates. In case of degeneracy, write the equations of the two lines in the original coordinates. Draw a figure, showing three sets of axes and a graph of the conic.

31. x2 — 2xy + yp? — 8x — 8) + 16 = 0.

32. 4x2 + 12xy + 9y? — 20x — 30y + 25 = 0. 33. 9x?

— Oxy

34. 4x2 + 4xy

+ py? + 6x — 2yv —3 = 0. + y? — 14x —

12y + 22 = 0.

35. Prove formulas (14), §914.

36. Prove Theorem V, §914. 37. Show that if r is the rotation matrix (9), §911, then its inverse r~! is given by a matrix of the same form, where @ is replaced by —@.

38. Prove that the set of all rotation matrices of the form of (9), §911, with the operation of matrix multiplication, form a commutative group (cf. Definition I, §302). cos 0 sin 0 )is orthogonal with determinant —1 for every 39. Prove that the matrix ( sin@ —cos@ value of 6. Show that the set of all matrices having this form do not forma group. (Cf. Ex. 38.)

10

Vectors

1001

RECTANGULAR

COORDINATES

in Space

IN SPACE

For the analytic study of figures in Euclidean space £3, it is helpful to introduce a rectangular coordinate system, somewhat similar to that of the plane, but consisting of three mutually perpendicular directed lines meeting in a point O called the origin. Let a uniform number scale be assigned to each axis, with the number 0 corresponding to the origin in each case. Although it is not essential for all purposes to have

equal scales on the three axes, we shall assume unless a statement to the contrary is specifically made that the three number scales have equal units. In close analogy with the plane, where there exists a one-to-one correspondence between points and ordered pairs of numbers, in space there exists a one-to-one correspondence between points and ordered triplets or triads of numbers, called the coordinates of the points. If the coordinate axes are labeled x, y, and z, the corresponding coordinates of any

point p are called the x-coordinate, the y-coordinate, and the z-coordinate, respectively, ofp and defined as follows: The x-coordinate (or y-coordinate or z-coordinate) ofp is the number of the number scale on the x-axis (or y-axis or z-axis, respectively) that corresponds to the point of intersection of that axis and the plane through p parallel to the other two axes. This correspondence is expressed symbolically thus: p: (x, y, z) means thatp is the point with coordinates x, y, and z. This point is also labeled (x, y, z) for simplicity. (Cf. Fig. 1001-1.) The coordinate planes are the three planes each of which contains two of the coordinate axes, the xy-plane containing the x- and y-axes, the xz-plane containing the x- and z- axes, and the yz-plane containing the y- and z-axes. Orthogonal projections of the point (x, y, z) are defined as follows: the orthogonal projection of (x, y, z) on the xy-plane (or xz-plane or yz-plane) is the point (x, y, 0) (or (x, 0, z) or (0, y, z), respectively). The orthogonal projection of (x, y, z) on the 268

$1001]

RECTANGULAR

COORDINATES

IN SPACE

269

X-axis (Or y-axis or z-axis) is the point (x, 0, 0) (or (0, y, 0) or (0, 0, z), respectively).

(Cf. Fig. 1001-1.) Although the following definition has no absolute mathematical meaning* it is a convenient one to have because ofits close relation to physical pictures or models of coordinate systems and its help in visualization: Definition. A rectangular coordinate system is right-handed if and only if the axes are arranged so that if the xy-plane is viewed from a point on the positive half of the z-axis, a counterclockwise ninety-degree rotation in this plane about the origin carries points on the positive half of the x-axis into points on the positive half of the y-axis. The system is left-handed if and only if it is not right-handed.

(0, y, 2)

(0, y, 0)

y

Figure 1001-1

Note 1. Using the concept of a right-handed screw (that is, one way that when it is turned about its axis in a clockwise sense it moves we can describe a right-handed rectangular coordinate system as one screw whose axis coincides with the z-axis advances in the positive through the right angle xOy.

that is threaded in such a away from the observer), for which a right-handed z direction when turned

Nore 2. The relationship among the axes expressed in this definition is a cyclic one. The x-, y-, and z-axes could be replaced (in the definition) by the y-, z-, and x-axes, respectively or by the z-, x-, and y-axes, respectively, without changing the effect of the definition.

Examples of right-handed coordinate systems are shown in Figure 1001-2. *Without reference to the physical universe it would be impossible to explain to an extraterrestrial being what we mean by clockwise motion, a right-handed screw, or a southpaw pitcher.

fra bie VECTORS

[$1002

IN SPACE

x

Figure 1001 -2

The open first octant is x > 0, y > 0,andz of whose coordinates are words first octant usually tive:

1002

VECTORS

the set of points (x, y, z) all of whose coordinates are posi> 0. The closed first octant is the set of points (x, y, z) all nonnegative: x = 0, y = 0, and z = 0. The unmodified refer to the open first octant.

IN SPACE

We incorporate the ideas of the Definition, $801, and some of those of $901, into a new formulation designed especially for the space £3: Definition. 4 three-dimensional vector 6 is an ordered triad or triplet (v1, v2, 03) of real numbers. The numbers v4, v2, and v3 are called the components or coordinates of the vector 6 = (v4, 02, v3); v1 is the x-component or x-coordinate, v2 is the y-component or y-coordinate, and v3 is the z-component or z-coordinate. Two vectors t = (U1, U2, U3) and 6 = (U4, v2, v3) are equal, written t = 6, if and only

if their corresponding components are equal: uy; = U1, Uz = U2, and u3 = v3. Otherwise they are unequal, written t # 6. The vector (v1, v2, 03) is represented by the directed line segment joining any point (a1, a2, a3) to the point (a1 + v1, a2 + 02, a3 + v3). The magnitude or absolute value of a vector 6 = (U1, v2, v3) is denoted and defined

|5] = |i, v2, v3)| = Yor? + v22 + 032.

(1)

The zero vector is the vector 0 = (0, 0, 0) having zero magnitude.

A unit vector

is a vector with magnitude 1. A unit vector is also called a direction. If p: (x1, ¥1, 21) andg : (X2, y2, Z2) are any two points of space, the term the vector pq means the vector (x2 — X1, y2 — yi, Z2 — 21) represented by the directed line segment pq. The radius vector of the point p: (x1, ¥1, 21) is the vector Op = (x1, y1, 21) where O is the origin. The vector PQs as represented by the directed line segment pq, is said to have initial point p and terminal point q (cf. Fig. 1002-1).

Unless a statement to the contrary is made, all vectors considered in this chapter will be assumed to be three-dimensional. Notre. As in E2, two directed line segments represent the same vector if and only if they are parallel, similarly directed, and of equal length. Example

1.

If p aa qe

0, 6), C=



(6, 35 4), Iie =e

>

(5, =1;, 4), and

jag, = mR = (GS, 3 SZ

5

=

(10,

2. 2»). then

)

Related to the definition (1) of the magnitude of a vector is the following extension to three dimensions of the distance formula (x2 — x1)? + (v2 — yi)? of

§1002]

VECTORS

IN SPACE

271

2o—

2]

42 — Xy

x

Radius vector Op

Vector pq

D:(%1,Y4) 2)

DX

Vee)

(a)

C= Xo; Vos 25)

(b) Figure 1002-1

the plane. This formula states in particular that the magnitude of any vector is equal to the length of any line segment that represents that vector. Distance formula.

(2)

The distance between

the points (x1, y1, 21) and (x2, y2, Z2) is

d = Ve — 11)? + G2 — ni? + @ = 21).

Proof. Formula (2) is a consequence of the Pythagorean theorem applied twice. Let a, b, and c be the lengths of the edges of the parallelepiped R illustrated in Figure 1002-15 having (x1, 1, 21) and (x2, y2, z2) as Opposite vertices:

(3)

ee

eX

oe yy ec = |72'— v2|,

The problem is to show that

(4)

ad = a2 +b? + c?.

If a diagonal of length e is drawn in the base rectangle of R, as shown in Figure 1002-2, then (4) is a consequence of the two Pythagorean equations

(5)

a2+$62=e2,

¢2 = d, e+

the first related to a right triangle in the base of R, and the second related to the right triangle marked with heavy lines in Figure 1002-2.

b Figure 1002-2

IN

VECTORS

272

[$1003

SPACE

The distance between the points p : (6, 3, —1) andgq: (Gains) as

Example 2.

+6 — (—-))? = V4 + 16 + 36 = 2y14,

V4— 62 + (7 — 32

and is equal to the magnitude of the vector pq = (—2, 4, 6).

1003.

LINEAR

COMBINATIONS

OF VECTORS and

Nearly all the definitions and statements of §§802

803

have their counter-

parts in three-dimensional space. We give a few of these, but shall feel free to use three-dimensional analogues of the plane cases when the meaning is clear from context or close parallel. For the sake of easy reference and completeness, we permit present definitions to particularize (and hence duplicate) some of those of §§901 and 902, with n = 3. Addition and multiplication by a scalar are defined as follows:

(1) (2)

(ui, 43, U3) aa (Ui, 02,03) = (Ue 01, te 7 02, U3

03),

A(u1, U2, U3) = (Aut, AU2, AU3).

The vector (—1)i, also written — i, is called the negative of 7. Subtraction of vectors

is denoted and defined as follows:

(3)

Gee)

Sums, differences, and scalar multiples of vectors in space are represented in terms of segments, triangles, and parallelograms in the same fashion as sums, differences, and scalar multiples of vectors in the plane, as indicated in Figures 802-1 and 802-2. The vectors of a set are coplanar if and only if they can be represented by directed line segments lying in a plane. Vectors that are not coplanar are called noncoplanar. Vector magnitude for vectors in space satisfies the seven properties (i)-(vii) listed in § 802. A linear combination of a set of vectors 01, 62 ,:--, 6, is a vector of the form

(4)

NiBina= AaDs SPP

sano

where di, A2,°°°, An are scalars.

The set U3 of all three-dimensional vectors is a vector space over &. The vectors 01, &2,°--+, 0, are linearly dependent if and only if there exist scalars

M1, A2,°°*, An not all equal to O such that (5)

A101

+

A202

+-:---+

},v,



0.

If no such scalars exist, that is, if (5) holds only for scalars \1, \2,:-*, Xn all of which are zero, then the vectors 01, d2,:--, 6, are linearly independent. The three general theorems of §803 apply without change to the space 3 of three-dimensional vectors. For purposes of concrete clarification, as we did in $803, we shall consider sets {&1, 62,---, ,} of three-dimensional vectors for various special values of n. In this case, we take n = 1, 2, 3, and 4, in turn. We start with Ves

$1003]

I.

b= 0:

LINEAR

COMBINATIONS

OF

VECTORS

273

A single vector 6 is linearly dependent if and only if it is the zero vector:

Proof.

The proof is the same as with I, §803.

II. Two vectors # and 6 (in three dimensions) are linearly dependent if and only if they are collinear or, equivalently, if and only if one is a scalar multiple of the other. If one of the two vectors u and 6 is linearly dependent — that is, equal to the zero vector — then the two vectors w and 6 are linearly dependent. If % is a linearly independent (nonzero) vector, and if # and 6 are linearly dependent (é is a vector collinear with u), then 6 is uniquely represented as a linear combination (scalar multiple) of i. Proof.

The proof is the same as with II, $803.

III. Three vectors u#, 6, and w (in three dimensions) are linearly dependent if and only if they are coplanar. If @ and 6 are linearly independent and if is any vector coplanar with w and 8, then w is uniquely represented as a linear combination of @ and 0. Proof.

Let u, 6, and

w be any three coplanar

vectors,

and consider

the two

vectors # and v. If w and o are linearly dependent, then so are i, 6, and W by Theorem I, $803. If and 6 are linearly independent vectors, then the geometrical argument given in the proof of III, $803, and illustrated in Figure 803-1, can be repeated to give a representation of w as a linear combination of % and 6, and hence a proof that uw, 0, and Ww are linearly dependent. Conversely, if %, 6, and w are any three linearly dependent vectors, then one (say wW to be specific) must be a linear combination of the other two (# and 6) and hence is representable by means of a directed line segment lying in the same plane as segments representing w and 0. Finally, if a and & are linearly independent and d, 0, and w coplanar, then u, 6, and w are linearly dependent so that there exist constants a, 8, and y not all zero such that ai + Bb + yw = 0. In this equation y cannot be 0 since a and 6 are linearly independent, and therefore # = (—a/y)iu + (—6/y)é. This representation of as

a linear combination of u@ and 6 is unique by Theorem III, $803. IV. Four vectors i, 6, W, and £(in three dimensions) are always linearly dependent. If a, 0, and w are any three linearly independent (noncoplanar) vectors, then any vector £ is uniquely representable as a linear combination of i, 0, and W.

We shall give an algebraic proof for the following statement, leaving the Proof. remaining few details as an exercise: Jf i, 0, and W are linearly independent vectors, then any vector & is representable as a linear combination of u, 6, and w. (Give a geometric proof of this same statement, similar to that of III, §803, iO EX. 39, §1004.)

Let a = (m1, u2, U3), D =

(U1, V2, U3), W = (w1, W2, w3), and — = (&, £2, £3),

and consider the set of three homogeneous equations in the four unknowns x1, x2, 3, XA: UyxX1 + Vix2 + wix3 + £1x4 I

(6)

u2X1 + v2x2 + wox3 + b2Xx4 | U3x1 + 3x2 + W3x3 + §3%4

0, 0, 0.

IN

VECTORS

274

§1003}

SPACE

By the Corollary to Theorem II, §907, this system, which is equivalent to

+ x4é = 0,

+ x26 + x3

xii

(7)

has a nontrivial solution. Since a, 6, and W are linearly independent, x4 ~ 0, and (7) can be solved for — as a linear combination of i, 6, and w.

Owing to the property expressed in the second sentence of IV, above, any three linearly independent (noncoplanar) vectors in space form a basis for the space The three vectors #, 6, and w of IV span the space

03 of three-dimensional vectors.

U3 since {Ai + wd + vw |, w, and »y € R} =

U3.

In the vector space U3 the standard basic triad of unit vectors are denoted and defined:

(8)

P=

ONO) (0,150)

ke—=1

(O50 1).

The vectors i,j, and k are called the standard coordinate vectors of U3. For this basis the representation of an arbitrary vector 6 = (v1, v2, v3) is particularly simple:

(9)

b= vi + voj + o3k.

Example 1. Prove that the two vectors #@ = (uw, u2, 43) and 6 = (v1, v2, v3) are linearly dependent if and only if

(10)

U2

U3

U3

«COU

U2

U3

U3

Uj

UL

U2

Solution. Linear dependence of u# and ¢ is equivalent to the existence of scalars \ and p not both 0 such that Aw + yo = 0 or, equivalently, to the existence of a nontrivial solution (A, ) for the system uid =

(11)

Uh

=

0,

u2d + van = 0,

U3 + v3 = 0, of three homogeneous linear equations.

By Theorem II, §907, where m = 3 and n = 2, the

system (11) has a nontrivial solution if and only if the three 2 * 2 determinants of (10) vanish. For example,

3

[ome

Die 7

Il

as

il

the vectors (1, 2, 3) and (4, 8, 12) are linearly dependent

2 esi 0, whereas the vectors (1, 2, 3) and (1, 7, 3) are linearly independent and

3 d ie? 3) and], | are nonzero Example

and ; Ul =

2.

Prove

that

although

the

three

3 il za4bs vectors

0 )-

w=

(uj, u2 , 43),

© = (v1, v2,

v3), and

W = (1, W2, w3) are linearly independent if and only if

(12)

uj

U2

U3

U1

v2

v3)

Wi

W2

W3

~ 0.

Solution. The vectors 4, ¢, and w are linearly independent if and only if the only scalars r, mu, and y such that Ad + wo + vw = OareX = uw = v = 0, or, equivalently, if and only if the only solution of the system

§1004]

EXERCISES mA + vip + wy

(13)

275 = 0,

u2r + vy + wo = 0, u3A + v3 + w3y = 0,

of three homogeneous linear equations in \, u, and » is the trivial solution \ = je = jp By Theorem II, §907, the system (13) has only the trivial solution if and only if the coefficient determinant — whose value is equal to that in (12) — is nonzero. For example, the three vectors (1, 2, 3), (0, 1, 4), and (0, 0, 1) are linearly independent since

1? 2as3 01 4=1+0 = net

(14)

whereas the three vectors (3, —1, 2), (5, 4, —1), and (11, 19, —10) are linearly dependent since

cs | 2| See he) i ett9 —/10

(15)

Example 3. Show that the three vectors (3, —1, 2), (5, 4, —1), and (11, 19, —10) discussed in connection with (15) are linearly dependent by representing (11, 19, —10) as a linear combination of (3, —1, 2) and (5, 4, —1). Solution. The problem is to solve the system

3X + Su = 11, —\ + 4u = 19,

(16)

2.—

pw =

—10

for \ and yp. From the first two equations of(16) we find \ = —3 and yp = 4. Substitution in the third equation of (16) verifies that \ = —3 and » = 4 isa solution of that equation as well, and hence of (16). Therefore

(11, 19, —10) =

1004

—3(3, —1, 2) + 4(5, 4, —1) = (—9, 3, —6) + (20, 16, —4).

EXERCISES

A figure should accompany each of Exercises

1-6.

1. Find the orthogonal projection of the point (5, —1, 6) on each coordinate axis.

2. Find the orthogonal projection of the point (5, —1, 6) on each coordinate plane.

3. A rectangular parallelepiped with faces parallel to the coordinate planes has opposite vertices at the origin and the point (5, 4, —3). Find the other six vertices. 4. A rectangular parallelepiped with faces parallel to the coordinate planes has opposite vertices at (2, 3, 5) and (8, 5, 7). Find the other six vertices. 5. Find the coordinates of the point where the line through (— 3, 4, 1) parallel to the X-axis meets the plane through (7, 3, —1) parallel to the yz-plane.

6. A plane parallel to one of the coordinate planes passes through three of the following points:

Pi:

G; 4, =),

P2:

©; 3% 6),

P3+

(7,

3, =),

P+:

(=,

Find the three points and name the parallel coordinate plane. In Exercises 7-10, find the point s from the given data.

7. pg = 7s, p =

= ©, 3, 2), r = (0,8, 4): (2,7, 1),¢

4, 6),

and

JOS. (=o: 3; 2).

VECTORS

276

[$1004

IN SPACE

8. = sp = 3; —2, —5),9 =(G 7p slr = 9. Op = gs,p = (1, 4, —1),¢ = (6, —2, —3). 10. ps = 5g, p = (8, 0, 5), q = (2, 6, —3).

4 1,2):

In Exercises 11-14, find the distance between p and q in the indicated exercise. 14. Exercise 4.

13. Exercise 3.

12. Exercise 2.

11. Exercise 1.

In Exercises 15-18, perform the indicated steps.

1556558) 2G) 1-2): 17.12) 5j — 7) — G8

16.74; = j = 3k). 18) 342 — 20) S42

40).

a:

In Exercises 19-24, verify the given equality or inequality.

19. |-5G —f + &] = |—5]-|i -—F+ &l.20. |(4, 7, —2) — 3, —1, ©| = |G, —1, © — , 7, —2)]. 21.

‘s48) =

=

2G pe See 22. |(27 — 27 4- kl

1G; 4, 8)! |—2|

23. |3i+ 4) + (57 — 2k)| S [31+ &| + [57 — 24].

In Exercises 25-31, determine whether the given vectors are linearly dependent or linearly independent. Give reasons to validate your conclusion.

25. 0.

26. (0, 0, 1).

PT OMA GYR (1 e259):

28) 12/5 432)

29, (15.159),

3.0) 1) Onl):

BI) i 4 2y.3)

4k 5k

ee

30. (1,,—1,,.0) 20) 0, — 1) (Oe):

6 Ti = 8)

Ok.

In Exercises 32-34, express the first vector as a linear combination of the remaining vectors.

32. (1, —1, 0), (1, 0, —1), (0, 1, —1).

B30 ak 349 27) 2374 Sie ty

eee Ane ee ka

eee

In Exercises 35-38, determine whether the vectors are (/) collinear, (i) coplanar but not collinear, or (iii) noncoplanar. 35.

(ile il. =).

CG

=I,

4), G; ile sal

36; 10; 2 Sj 15, 377614310). 4h i =), 4 38.

(5, eh

7), (3, Ss)

4), C1

ah

6) 3)

Oe

eee naa i 72)).

39. Give a geometric proof of the statement set forth at the beginning of the proof of

IV, §1003.

40. If p is the midpoint of the segment joining the points p; : (1, y1, z1) and p2 : (x2, y2, Z2), and if O is the origin, prove that

$1005]

DIRECTION

Op =

()

COSINES;

THE

al om yi

x1

Opi + Op2 =

SCALAR

a

,

+

PRODUCT

y2 ZI

5)

277

on Z2

D

(Cf. Ex. 36, §804.) In Exercises 41 and 42, use (1) to find the midpoint of the segment joining the two points. (GiExe405)

ALS G7i=12)'8), (4.6), 1005

DIRECTION

COSINES;

42. (6,4, —11), 0, —9, —3). THE SCALAR

PRODUCT

If L and M are any two directed lines in space that pass through the origin, then a unique 6 between L and M and satisfying the inequalities (1)

OS

657

is determined as in §805. If L and M are arbitrary directed lines in space, not necessarily passing through the origin, then the angle between L and M is defined to be the angle between the lines L’ and M’ that are parallel to L and M, respectively, and are similarly directed, and that pass through the origin. In space any directed line L has three direction angles: (i) the angle a between L and the (positively) directed x-axis, (ii) the angle 8 between L and the (positively) directed y-axis, and (iii) the angle y between L and the (positively) directed z-axis (cf. Fig. 1005-1a). The cosines of these direction angles, (2)

X=cosa,

»=cosf,

v =cos 7,

\z

z

y

ey,

Vector pq

Directed line

Ge

YA

Di (X45.Nr 24)s @: (Kop Voie)

(a)

(b) Figure 1005-1

are called the direction cosines of L. Direction angles and direction cosines of any nonzero vector or directed line segment are defined as with a directed line. The direction cosines of the directed line segment from p : (x1, y1, 21) tog : (X2, 2, 22), of length d (cf. Fig. 1005-15), are equal to (3)

X=

sa

==

mos

ba = cos

B =

Dae

yl

=

Cs

2)

7 =

ee!

7

VECTORS

278

[$1005

IN SPACE

or, if 6= (v1, v2, v3) = pg = (x2 — x1, y2 — Vi, 22 — 21): (4)

A = COSo

Dv

= Joy # = COS B=

y)

D

aly = (G08 y=

|S Lert}

Since d2 = (x2 — x1)?

+ (2 — yi)? + (22 — 21)%, we have from (3):

(5)

Mf yet?=1.

That is, for any directed line L, the sum of the squares of the direction cosines is equal to 1. In other words, if \, u, and v are the direction cosines of a directed line L, then (A, uw, v) is a unit vector. This unit vector (A, u, v), sometimes called the direction of L, is collinear with LZ and similarly directed. If (J, m, n) is a nonzero vector collinear with (A, u, v), where A, u, and v are the direction cosines of a directed line L, then /, m, and n are a set of direction numbers of L. In this case there exists a nonzero number k such that / = k\, m = ku, and n = ky. From (3) with k = d, we see that x2 — x1, y2 — y1, and zz — z are direction numbers of the directed line from (x1, y1, 21) to (x2, y2, 22). Since a vector (/, m, n) collinear with a direction (A, », v) is also collinear with the opposite direction (—\, —, —yv), /, m, and n are called direction numbers of the undirected line L as well as of the directed line L. Example 1. The vector (8, —1, —3), represented (—5, 4, 1) to (3, 3, —2), has direction cosines

\ = cOS

@ =

8 ——= p = cOS 8B =

74



by the directed

line segment

from

1 3} ——] vy = COSY = — ——:

v74

74

With the notation used above, d = 74, (5) becomes $$ + 7; + 7% = 1, and the following are sets of direction numbers for the line L through (—S, 4, 1) and (3, 3, —2): (8, —1, —3), (—8, 1, 3), (24, —3, —9), (—40, 5, 15). The two directions for L are + Arv74 (8, —1, —3). The scalar product or inner product or dot product of two vectors i = (u1, u2, U3) and 6 = (v1, v2, v3) is denoted and defined: (6)

u: B= mv, + u2v2 + U303.

Most of the statements concerning the scalar product given in §806 transfer without change. The proofs involve such minor alteration that they are left as exercises .

Theorem I. The scalar product of two vectors i and & is equal to the product of their magnitudes and the cosine of the angle between them:

(7)

ib =

ii|-|5|- cos 0.

In particular, if (1, “1, v1) and (X2, 2, v2) are unit vectors,

(8) Theorem II. Example 2. 2740)

cos 8 = Airh2 + mime + 172. Two vectors are orthogonal if and only if their scalar product is zero. The vectors (3, 7, —2) and (5, —1, 4) are orthogonal since 3-5 — 7-1 —

§1006]

EXERCISES

279

Note 1. Theorem II shows that orthogonality as defined after (3), §901, for n-dimensional vectors is consistent with Euclidean orthogonality when n = 3.

Further properties:

OR (ii) (iii)

= 4-4-= ul: If comp, &@ = |u| cos 6, then w-6 = (comp, #)|dl. Commutative law: i-6 = 6-i.

(iv)

Distributive laws:

(v)

Homogeneous law:

w-(6 + W) = a-6 + -w,(4 + 6)-w = aw

(Ai)-(uo) = (Au)(-6),

+ ow.

X and p € @.

(vi)

The associative law for scalar products is meaningless.

Note

2.

The scalar product of any two vectors is independent

of the rectangular co-

ordinate system.

Example 3.

Show that the only vector orthogonal to each of three linearly independent

vectors i, 6, and W is the zero vector 0. Solution.

Assume the contrary, and let ¢ be a nonzero vector orthogonal to each of 4, ¢,

and w: (-4 = ¢-6 = €-w = 0. Then € is orthogonal to every linear combination of 4, 6, and w:

&-(Qa + wi + vw) = AE-4 4+ we-b + ve-wW = 0, A, wu, andy € R. By IV, §1003, ¢ itself can be represented as a linear combination of 4, 6, and w, and therefore

¢€ is orthogonal to itself: ¢-¢ = ||? = 0. Consequently, |¢| = 0 and ¢ = 0.

1006

EXERCISES

In Exercises 1-3, determine which ordered triads are sets of direction angles.

1. 67, 47, 37. 1

i:

i

2. ET, 27, BT. 1

at

3. 27, 0, a7.

2

if

3

In Exercises 4— 9, find the direction cosines of the given directed line or line segment. 4. Positively directed x-axis.

5. Negatively directed x-axis.

6. Positively directed z-axis.

7. From the origin into the first octant (cf. $1001), making equal angles with the coordinate axes.

8. From (3, 8, 1) to (/, 2, 10).

9. From (0, —5, 13) to (6, 4, —5).

In Exercises 10-12, find three distinct sets of direction numbers for the line segment joining the two points. 10.

ce il, 0), as 0, =i):

11. ry

il —1), (1,

1

12. (9, 1, —7), (4, —8, 2).

In Exercises 13-20, find the scalar product of the two given vectors.

13. jand i.

14. Kand k.

1).

15. jand k.

16. i and k. 18. 2) — 7 — 7k and —6i + 5; — k.

17, 4,4, —7) and 6, —4, —2)

20. (6, 0, —3) and(4, —7, 1).

ya

sian 3)

1990)

[$1007

IN SPACE

VECTORS

280

In Exercises 21-23, find the cosine of the angle between the two vectors.

DING 23.

Q,

04 0)) (dno a): 35 )5 G=3;

DoS]

ed

era

D5 2).

In Exercises 24-26, determine whether or not the two vectors are orthogonal.

24. (2, 8, 3), (5, 1, —6).

25.31 =) 40k, Sho

26. (—7, 5, 11), (—3, —13, 4). In Exercises 27 and 28, find the dot product, cos 6 (where @ is the angle between the two vectors), and the component of each vector in the direction of the other.

27. 4 = (6, 3,,—3), b = (—6, 2, 9).

28.-4= (=2,.3, —1), 0 = Q; 27 —1):

29. Let a, 6, and y be three angles such that O SaS7,05 687, and0S yr. Prove that a, 8, and y are direction angles for some directed line in three dimensions if and only if cos? a + cos? B + cos? y = l.

30. Give an algebraic proof of the inequality

[uz-t] < |a|-[é ,

or

juror + 202 + U303| Ss Vur2 + U2? + U3? Vor? + v2 + 032,

and relate the result to the trigonometric inequality |cos 6] S 1. (Cf. Ex. 42, §808.) 31. Prove that the vectors of any set of mutually orthogonal nonzero independent.

1007

EQUATIONS

OF PLANES

AND

vectors are linearly

LINES

Let II be an arbitrary plane, let p : (xo, yo, zo) be any fixed point of II, and let f = (a, b, c) be any nonzero vector normal (perpendicular) to Il. Then a point q: (x, y, z) lies in the plane I if and only if the vector Pq = (x — x0, y — yo, Z — Zo) iS perpendicular to the vector fi (cf. Fig. 1007-1). Since two vectors are perpendicular if and only if their scalar product is zero (Theorem II, §1005) and since hi pq = a(x — xo) + b(y — yo) + c(z — zo), we have established the theorem:

SY

Q1

Figure 1007-1

$1007]

EQUATIONS

OF

PLANES

AND

LINES

281

Theorem I. The plane through the point (xo, yo, zo) and perpendicular to the nonzero vector (a, b, c) is the graph of the equation (1)

ax — xo)

+ by — yo) + c(z — zo) = 0.

If equation (1) is expanded and if the substitution d = —axo — byo — czo is made, we also have the first part of the theorem: Theorem II. (2)

Every plane 1 is the graph of a linear equation of the form ax + by+cz+d=0,

in which the coefficients a, b, and c form a set of direction numbers for the lines normal to II. Conversely, the graph of every equation of the form (2), where a, b, and c are not all zero, is a plane perpendicular to the vector (a, b, c).

Proof. The only part remaining to be proved every coefficient is zero, and therefore we can find for (2) (for example, if a # 0, we can take xo = plane II through (xo, yo, zo) and perpendicular to hence of (2) as well.

is the last. By assumption, not a solution x = x0, y = yo, Z = Zo —d/a, yo = 0, and zo = 0). The (a, b, c) is the graph of (1), and

Nore 1. Although the equation (2) is not the only linear equation whose graph is II (the equation could be multiplied by an arbitrary nonzero constant, for example), we shall find it convenient to refer to (2) as “‘the equation of II.’”’ We shall also speak elliptically of “the planeax + by + cz+d= 0” to mean the plane that is the graph of ax + by + cz+d=0.

Example 1. Find the equation of the plane parallel to 3x — 2y + 4z + 3 = 0 passing through the point (1, 1, —2). First solution.

By (1) the equation is 3(x — 1) — 2(y — 1) + 4(z + 2) = 0, or 3x — 2y +

4z+7=0.

Second solution. The equation can be written in the form 3x — 2y + 4z + d = 0. Substitution gives 3 — 2 —8+d=0,ord=7. Example 2. Find the equation of the plane p2:(2, 7, —1), and p3 : (—4, 5, 3).

through

the three

points p: : (1, 12, 1),

Let the equation of the plane be ax + by + cz +d = 0. Substitution of the Solution. coordinates of the three points gives the following system of three equations in the four unknowns 4a, b, c, d:

(3)

a+12b6+ c+d=0, 2a+ 7b—- c+d=0, —4a+ 564+ 3c+d=0.

Elimination of d by subtraction (the first and second equations and the first and third equations) gives the system 1 a — 56 — 2c = 0,

(4)

5a

Th

2c = 0.

VECTORS

282

IN SPACE

[$1007

Subtraction once more gives 4a + 125 = 0. Therefore a solution of (3) is given by a = 3, b = —1,c = 4, and d = 5, and the equationis 3x — y+ 4z + 5=0. Another solution is given in Example 4, $1010.

Example 3. Show that the plane ax + by + cz + d = Ois (i) parallel to the x-axis if and only if a= 0, (ii) parallel to the y-axis if and only if b= 0, and (iii) parallel to the z-axis if and Onilyather— 0: Solution. Saying that a plane II and a line L are parallel is equivalent to saying that lines perpendicular to II are perpendicular to L. Therefore the plane ax + by + cz + d = 0 is parallel to the x-axis if and only if the vectors (a, 6, c) and (1, 0, 0) are orthogonal, and these vectors are orthogonal if and only if their dot product 1-a¢ + 0 + 0 = ais equal to 0. This establishes part (i). Parts (ii) and (iii) are proved similarly with the vector (1, 0, 0) replaced by the vector (0, 1, 0) or the vector (0, 0, 1).

Norte 2. The result proved in Example 3 can be stated in condensed form: A plane is parallel to a coordinate axis if and only if the variable of that axis is missing from its equation.

Now let L be an arbitrary line in space, let p : (xo, yo, Zo) be an arbitrary point on L, and let (/, m, n) be a set of direction numbers of L, that is, a nonzero vector collinear with L. Then a point qg : (x, y, z) lies on L if and only if the vector Pq is collinear with L, or equivalently, if and only if Pq = (x — Xo, y — yo, Z — Zo) and (/, m, n) are collinear. One way to express this is as follows: Theorem III.

A set of equations of the line L through the point (xo, yo, zo) and

having direction numbers 1, m, and n is

(5)

xo)

= (V — yom

= @ — zo) 4

or 2S mee)

)

A

We

RD)

eC

Os

where both (5) and (6) are to be interpreted as statements of proportionality. In other words, any vanishing denominator in (6) means the vanishing of the corresponding numerator, and not “division by zero.” Note 3. Whereas a plane is specified by means of a single linear equation in x, y, and z, a line requires more than one such equation. Equations (5) or (6) can be regarded as equivalent to the system of three equations

(7)

m(x — xo) = ly — yo), n(x — xo) = Kz — zo), nly — yo) = m(z — zo),

or to an appropriate pair of (7). Each equation (7) is an equation of a plane containing L. Equations (6) are called equations of L in symmetric form.

Example 4. can be written

Equations of the line L through (6, 7, —3) parallel to the vector (4, —5, —1)

§1007]

(8)

EQUATIONS

OF

PLANES

AND

LINES

283

—5x — 6) = 4 — 7), —@ — 6) = 42 + 3), -V — 7) = —S(z + 3),

or any pair of these last three, such as (simplified)

(9)

5x + 4y — 58 =0,

x+4z+6=0.

Equations (8) are equations of three planes containing L, each of which is parallel to one of the coordinate axes. Equations (9) are equations of the two that are parallel to the z-axis and the y-axis, respectively. Example 5. Write equations of the line L through (1, 5, —2) perpendicular to the plane 11 = 0.

7x — 8z+

Solution. Since (7, 0, —8) is a vector perpendicular to the given plane it is parallel to L. Therefore the equations of L can be written Meee

i,

es

ee

> ee

ee:

)

or x — 1 7

z+2 =

koe

=o

=O}

or (10)

8& +7z+6=0,

y—S5=0.

The first plane of (10) is the plane through L parallel to the y-axis. The second plane of (10) is the plane through L perpendicular to the y-axis or, equivalently, parallel to both the x-axis and the z-axis. The line L is parallel to the xz-plane since the plane y — 5 = 0 is parallel to the xz-plane.

Example 6. 7) O), Bs 10 Solution.

Write equations of the line L through the two

points p:(9,

5, —1) and

Since Pa = (0, 0, 2), the equations of L can be written Kee?

Vie

he Z er,

(Poe Ka ee or (11)

x—-9=0,

y=i=0.

The planes (11) are the planes through L perpendicular to the x-axis and the y-axis, respectively. Example 7.

The equation of the plane through the point (11, 6, —7) and perpendicular to

the line Mir 20. sie

Ones

7

ey

—4

can be written 0(x — 11) + 5(v — 6) — 42 + 7) = 0, or Sy — 4z —

58 = 0.

A second way to express the statement that the point q : (x, y, z) lies on the |line L through p:(xo, yo, Zo) with direction numbers /, m, and n if and only if Pq and (1, m, n) are collinear is that there exists a real number ¢ such that Pq ===), Tit), cOL:

VECTORS

284

IN

[$1008

SPACE

(x — x0, y — Yo, Z — 20) = (It, mt, nt).

(12)

Solving (12) for x, y, and z in terms of #, and regarding ¢ as a parameter for the line L (cf. §411), we have the following parametric equations for L:

Theorem IV. A set of parametric equations of the line L through (xo, yo, Zo) and having direction numbers 1, m, and n is

the point

x = xo + I, mt, y=yot+ Z = Zo + Nt.

(13)

Example 8.

The equations of the lines of Examples 4, 5, and 6 can be written, respectively,

(14)

SOQ eh,

Wey — si,

BS SS = iE

yo

eS he =

=

fe

ek ish OE yr . Se

SP

Sp

The parametric form (13) for the equations of a line is often convenient for finding points of intersection. Example 9.

Find the point where the line

(15)

es

es

meets the plane

(16)

3x —y+7z+5=0.

Solution.

We write equations (15) in parametric form and substitute in (16):

3(3 + 21) — (-2 + 5) + 7-1

—- 490 +5 =0,

or 9 — 271 = 0. Therefore r = 4, and the point is

1008

DISTANCE

BETWEEN

A PLANE

AND A POINT

The analogue of Theorem II, §413, for three dimensions is the following: Theorem. Jf ax + by +cz+d=0 is the equation of a plane 1 and if pi: (x1, 1, Z1) is any point, then the distance d between I and p, is given by the formula

(1) Proof.

(2)

d — ax t by + cz1 +d) Va? + B+ 0? The parametric equations of the line L through p: perpendicular to II are

X=X+at

y=yt+bt

z=21 + ct.

§1009]

EXERCISES

285

To find the point p : (x, y, z) where the line L meets II we substitute the expressions (2) for x, y, and z in ax + by + cz +d = 0, and solve for f:

(3)

a(x1 + at) + bG1 + bf + c(z1 + ct) + d = 0,

whence ¢ = —(ax1 + by: + cz; + d)/(a2 + b2 + c2). following expression for d?:

(4)

A(X

From

(2),

we

have

the

— x1)? te) — yi)? + @ — 21)

Se? (Ce

2,2 ye

2

— (ax1_teee by + cz + d)

and (1) follows immediately. Example.

The distance between the plane 5x — y — 6z —

11 = Oand the point (1, —8, 2)

is

[5-1 + (—1)-(—8) + (—6):2 — 11] oe

425 -F 1e-36

1009

62

EXERCISES

In Exercises 1 and 2, find the equation of the plane through the given point parallel to the given plane. 1. (3, —1, 8); 2x

—7y —z+11 =0.

2. (4, —7, —2);

x + 3y — 4z

—17

= 0.

In Exercises 3-6, find the equation of the plane through the three given points. 3.

(1, 0, 0), (0, V3 0), (0,

Ss

(1, 2, 0), (3;

=,

=i \ 6).

0), (0, De

4.

(3, 0, 0), (0, —

6.

a25

—4).

0,

=s)}

05 0), (0, oF 5).

(0, =e,

2), (0, 4,

=I).

In Exercises 7 and 8, show that the plane is parallel to a coordinate axis, and identify that axis.

7. Ix

— 13y +3 = 0.

8. 2y + 9z — 10 = 0.

In Exercises 9 and 10, show that the two given planes are perpendicular. 9, 3x + 2y + 7z — 10 = 0, 5x + 3y — 3z = 0. 10. 6x — 3y — 4z4+ 15 = 0, 5x — 2y + 9z 4+ 16 = 0. In Exercises 11-13, find the equation of the plane through the given point perpendicular to the given line or vector. Sha al le

(Qt U, —1);

13. (3, —1, —5);

3

Ze —|

4

12.

(3, 0, 4);

x 5 a

oe

=

radius vector of (3, —1, —5).

In Exercises 14 and 15, find the equation of the plane that is the perpendicular bisector of the segment joining the two given points.

14, (6, —1, —4), (2, 7, 8).

15. (9, 2, —5), (—3, 4, 1).

[§1009

IN SPACE

VECTORS

286

In Exercises 16 and 17, find the equations of the line through the given point perpendicular to the given plane. Write these equations in symmetric form, and also in the form of a pair of linear equations corresponding to two planes.

16. (4,6, —3);

2x —-y —5z+7=0.

17. (1, —2, 4); 4x

—2y

+ 3z

-—8=0.

In Exercises 18-20, find the equations of the line through the two given points. Write these equations in symmetric form, and also in the form of a pair of linear equations corresponding to two planes. 18. @G; i

—5), (2; ip 3) f

19.

(8, 2, —4), ©; a

=i):

20. (7, 6, 8), (2, 6, 8). In Exercises 21 and 22, write the equations in symmetric form of the line through the given point that is parallel to the line whose equations are given in parametric form.

21. (—1,0, 3); x = —8+ 44,y = —1 22520450) ec — Oa

a

— 24,2

= 44 7t.

Oe

In Exercises 23-29, write the equations in parametric form for the line whose equations are requested in the specified exercises. 23. Exercise 16.

24. Exercise 17.

25. Exercise 18:

DT WEXeLcise 20:

28. Exercise 21.

29. Exercise 22)

In Exercises 30 and 31, show that the two lines are parallel. same or distinct. 305)

26. Exercise 19.

Determine whether they are the

ae

de Pe ag

1

—3

ee:

—2’

Vin Ome Z

-1

3

2

In Exercises 32 and 33, show that the two given lines are perpendicular. Mi

Bc

i

3

eee

—2

33, x — 6-46

ta ee

—4°

y=

—74 2 =

y+3

Zz

—4

5

4 —7 + 3ts

x =

—of, yy = 10 — 247 =

1a

2e

In Exercises 34 and 35, find the point where the line meets the plane.

34.

XZ

a

Ag

5

Opeizeta)

Sey

35, oe GaGa

baits 7 .

aad

j

wi

}

vail’

ire

P

eit >

b

, ue

he

ein.

ae i) sae 4

9@.

tal : aber)

Ainadia

Uh

aidtins wham :



of 5

bale

AMIE

Ale



ii

FPR

lt

ups,

rue rT, uF

.

(

@')

“yt

(a

3B

re Ha

nein 4+H4

ont. (a >

ly

On

4+ 42 + 16n >

+13

=

3n2 + 3n + 1.

. Hints: Extend the domain of f to 9t by means of the formula (1), §308, and show that (2)

holds for all € 9U. Part of the problem is to show that (na + $n(n — 1)d) + (a +- nd) = (n + loa + 3(n + Ind. ; a — ar"

15. Hints: Cf. Ex. 14, §308.

If r #1, part of the problem

is to show

A + ar" =

al A

a — ar'tl

ab =geu

16. 1.

Hint: x"*! — yt) = x(x — y)

+ yx" — y"). Hint: Let p be the formula with domain 9 such that p() is the statement of the exercise for eachn € MN.

306

ANSWERS

§313. 1.

union of sets intersection of sets slash, set difference the set of all---such that: -ordered pair Cartesian product 14, Cartesian plane relation domain of p, range of p function domain of f, range of f value of f at x function one-to-one correspondence n factorial image of A underf identity function on A characteristic function of A value of fat (x, y) elliptic notation for a function point p with coordinates x and y 319

SPECIAL

SYMBOLS

Meaning

signum function greatest integer or bracket function formula truth set statement

negation disjunction conjunction implies (does not imply) is (is not) implied by is (is not) equivalent to true formula false formula universal quantifier existential quantifier such that sum of two numbers additive identity additive inverse of a number difference of two numbers product of two numbers

multiplicative identity multiplicative inverse of a number quotient of two numbers field square of inequality minimum maximum nth power

a number symbols for numbers of a finite set of a finite set of a number

binomial coefficient

Page

7, oi 31 31 31 ZO) 33 33 36, 38 36, 38 38, 39 37 37 43 43 43 49 49 49 50 50 50 50 50

51 53, 61 54 61 61 61, 65 62

(a, b), (a, 5], [a, 5), a, 5]

ordered field of rational numbers bounded intervals

+o

plus infinity, minus infinity

65 66-67 67

unbounded intervals absolute value of a number neighborhood of a point max of x (—x) and 0 supremum of A least upper bound of 4 infimum of A greatest lower bound of A square root function

67-68 68 70 74 1 75 76 76 78

Q 5 =

2

(=, 2), (—@, a); (a, + &), [a, +2);

[>|

(= © 5 +o)

Ne

Kx) sup A = l.u.b.(A) inf A = g.l.b.(A)

Vx

sup (A) = L.u.b.4 inf(A) = g.l.b.4

SPECIAL Symbol

SYMBOLS

Meaning

2

square root of 2 line, line segment, or directed line segment directed line segment slope differences x2 — x1, y2 — y1 inclination distance between two points composite of two functions inverse function sum of two functions product of two functions difference of two functions quotient of two functions scalar multiple of a function additive inverse of a function constant function zero function unity function square of a function inequality symbols for functions absolute value of a function

Pip2

Pip2 m

Ax, Ay (84

Api, p2) Pe

pre IV GS IIA

vector space

V i=) oagagnac see

= iS)—

OR x© i tsi cr | ~ wy

SL

£ ces

Ler}



vector space of constant functions vector space of polynomials vector space of real-valued functions vector space of bounded functions linear functional positivity for a linear functional

sigma functional nets on an interval vector space of step-functions point with polar coordinates two-dimensional vector magnitude of a vector zero vector

vector radius vector sum of two vectors scalar multiple of a vector negative of a vector difference of two vectors

321 Page

78 86 86 86 87 91 113 123 126 129 129 129 130 130 130-131 130 130 130 130 32 132 135, 142 135 135 Iso 140 142 143 144

147-148 149 189 202 202 202 202, 270 203, 270 203, 272 203029. 204, 272 204, 272

quotient of a vector and a scalar

204

standard basic unit vectors

208 243

direction angles

SPECIAL

322

Meaning

Symbol A = COS a, wp = Cos B

l,m °

Olewt)

Beh PS RS SS

det(A) = det A = |A| Av E €

AT XT AX

D = |e| A=

|E|

&b = (v1, v2, v3)

| = [Ce, e2, 09)

Or= (020, 0)

MOL a

SYMBOLS

Teh

Nl,

direction cosines direction numbers scalar or inner or dot product circle product of two vectors square product of two vectors n-dimensional vector matrix sum of two matrices scalar multiple of a matrix difference of two matrices set of all m X n matrices n-dimensional vector space magnitude of an n-dimensional vector zero n-dimensional vector composite of two transformations product of two matrices algebra of m X n matrices zero matrix identity matrix determinant of a matrix inverse of a matrix matrix of a second degree equation matrix of second degree terms transpose of a matrix quadratic form determinant of e determinant of E vector in space magnitude of a vector zero vector linear combination of vectors standard coordinate vectors direction angles direction cosines direction numbers

scalar or inner or dot product

Page

213 214 21S 218 219 220 220 221 221 ny | 222 222 227 222 224 225-226 221-228 228 228 Zo 240 244 244 244 245 2d Zon 270 270 270 PUG(8: 274 PAG) Pais) 278 278

INDEX (The numbers refer to pages)

INL IN IL. SP

Associative algebra, 157

Abelian group, 52

Associative law, 5, 8, 49, 50, 60, 125,

Abscissas, axis of, 22 Absolute value, of a function, 132 of a number, 68 of a vector, 202, 222, 270

ISOM owe227, Asymptote of a hyperbola, 175 Asymptotic cone, 298 Axis, 25

properties of, 69 Acute angle between lines, 94 Addition, law of, 42 (Ex. 14)

conjugate, 174, 296, 297 coordinate, 22, 268

Addition, of functions, 129 of matrices 221 of numbers, 49 Of Vectors, 2039221, 272

minor, 167 of abscissas, 22

major, 167

of ordinates, 22 of a quadric surface, 296-297 of revolution, 292

Addition and subtraction formulas of trigonometry, 187 Additive identity, for functions, 131 for numbers, 50 Additive inverse, for functions, 131 for numbers, 50 Additivity for a linear functional, 142 Algebra, abstract, 156 associative, 157 commutative, 157 of bounded functions, 155 of constant functions, 155 of functions, 155-157 of matrices, 227-228 of polynomials, 155 of step-functions, 155 Altitude of a triangle, 119 (Ex. 52) Analytic geometry, 24 Angles, direction, 213, 277 Angular coordinate, 189 Answers and hints, 303 Archimedean property, 77 Arithmetic mean, 57 (Ex. 14)

of symmetry, 25 polar, 191 reali transverse, 174, 296, 297

Base set of vectors, 208, 223, 274

Basis, 208, 223, 274 Between, 55 Binary operation, 49 Binomial coefficient, 62 Binomial theorem, 62 Boldface type, 2

Bound, greatest lower, 76 least upper, 75 lower, 66 upper, 66 Bounded above, 66 Bounded below, 66 Bounded function, 137-140 Bounded interval, 67 323

324 Bounded set, 66

Bracket function, 27

Cancellation, law of, 51, 52, 218, 243 (Bos, 27) Cardioid, 193, 195 (Exs. 25, 26) Cartesian plane, 14 (COnicn, 22. 25) of acircle, 115

of a conic section, 260-262 of an ellipse, 167 of a hyperbola, 174

INDEX Composite of transformations, 224

Composite propositions, 45 Composition of functions, 123 Condition, necessary, 37 necessary and sufficient, 39 sufficient, 37 Cone, asymptotic, 298 quadric, 297 Conic sections, 159-183, 195-201,

251-266 Conicoid, 295-299 Conjugate axis, 174, 296, 297

of symmetry, 22, 25 Central conic, 260 Centroid of a triangle, 112 (Ex. 21) Characteristic equation, 262 Characteristic function, 18 Characteristic roots, 263

Conjugate hyperbolas, 178 Conjugate semiaxis, 174, 296, 297 Conjunction, 33, 34 Consecutive natural numbers, 59 Constant, 12 Constant function, 16, 130

Circles ial

Constant term, 19

in polar coordinates, 192 Circle product, 218 Circular disk, 115 Circular functions, 184-188 Classification of conic sections, 258

Contains, 3 Contradiction, law of, 42 (Ex. 15) Contradiction, proof by, 41 Contraposition, law of, 41 Contrapositive, 41 Converse of an implication, 40

Closed disk, 115

Coordinate, angular, 189

Class, 2

Closed half-plane, 84, 105

radial, 189

Closed interval, 67, 68

Coordinate axes, 22, 268

Closed under an operation, 53

Coordinate planes, 268

Closure, 53 Coefficient, 19 binomial, 62 leading, 19

Coordinates, of an ordered pair, 13 of a vector, 202, 220, 270 polar, 189-201 rectangular, 21, 23, 268

Coefficient matrix, 235

Coplanar vectors, 272

Collection, 2

Correspondence,

Collinear points, 91 Collinear vectors, 204

one-to-one, 16 Cosine of the angle between two

Column, 221

13

vectors, 215, 278

Column vector, 220

Cosines, direction, 213,277

Combination, linear, 136, 205, 222, 272

Counterexample, 37

Commutative algebra, 157 Commutative group, 52

Cramer’s rule, 235 Cylinder, 291

Commutative law, 5, 8, 42 (Ex. 13), 49,

50,60, 125, 130, 157, 216.225, 279 Compact interval, 68 Complement, 4 Complete ordered field, 75 Completeness of the real number system, 74-77 Components of a vector, 202, 220, 270 Composite function, 123

Decreasing function, 151 strictly, 152 Definition by induction, 60 Degenerate conic section, 259 Degenerate ellipse, 259 Degenerate graph, 251 Degenerate hyperbola, 259 Degenerate interval, 67

INDEX Degenerate parabola, 259 Degree of a polynomial, 19

325 Eccentricity, of a conic, 168, 177, 198, 200

De Morgan, A., 8

of an ellipse, 168

De Morgan laws, 8, 39 Dense set, 78

of a hyperbola, 177 of a parabola, 200

of the set of irrational numbers, 79

Density,

Eigenvalue, 263

of the set of rational numbers, 78 Dependence, linear, 206, 272 Dependent variable, 16 Descartes, René, 14, 24

Ellipse, 165-171 degenerate, 259 imaginary, 253 in polar coordinates, 195-201

Determinant, 230-234

Ellipsis, 20

of the product of two matrices, 234 Diagonal, main, 221 principal, 221 Diagonal matrix, 243 (Ex. 28) Difference, of functions, 129 of matrices, 221 of numbers, 50 of sets, 9 of vectors, 204, 272 Directed distance, 22 Directed line, 91 Directed line segment, 86 Direction, 202, 213, 270,278 Direction angles, 213, 277 Direction cosines, 213, 277 Direction numbers, 214, 278 Directrix, of a conic, 198, 200 of a cylinder, 291 of a parabola, 159 Disjoint sets, 9 Disjunction, 33, 34 DiskeelelS

Ellipsoid, 295 Elliptic paraboloid, 298 Empty set, 3 Endpoint, of an interval, 67 of a line segment, 86 leg belniny, IL, 2,3, 3, I, AOR, 271 27 Equation, characteristic, 262 Equation of a line, general, 99-102 intercept form, 102 point-slope form, 97 slope-intercept form, 98 two-point form, 97 Equation of a plane, 281 Equations of a line, 282, 284 parametric, 110, 284 Equivalent formulas, 31, 39 Equivalent inequalities, 70 Equivalent statements, 38 Euclid, fundamental theorem of, 62, 65 Euclidean plane, 22 Even function, 26 Even integer, 65

Distance between a line and a point, 114 Distance between a plane and a point, 284

Even natural number, 62 Excluded middle, law of the, 42 (Ex. 12)

Element of a set, 2

Distance between points, 22, 113, 271

Existential quantifier, 43

directed, 22 Distinct, 2

Exponent, 61, 65 Exponents, laws of, 61, 65

Distributive law, 8, 12, 51, 60, 130, (Sais

7- 21

6,227,279

Division, of functions, 130 of numbers, 50 of a vector by a scalar, 204, 222 Domain, of a function, 14-16

of a relation, 14

Extended real number system, 67 Extension ofa function, 17-18

Factorial function, 18 False formula, 37

Family, 2

of a variable, 12 (footnote) Dot product, 215, 223, 278

Field, 49-51 complete ordered, 75

Double-angle identities, 187-188 Double negation, law of, 39 Dummy variable, 5, 144

finite, 53 (Ex. 6) ordered, 53-57 Finite induction, 60-64

326 Finite set, 60 First octant, 270 Focus, of an ellipse, 165, 198

of a hyperbola, 172, 199 of a parabola, 159, 195 Form, linear, 142 (footnote) quadratic, 245 Formula, 30

INDEX positive linear, 143 sigma, 144 Functions, difference of, 129 product of, 129

quotient of, 130 sum of, 129 Fundamental theorem, of Euclid, 62, 65 of mathematical induction, 59

false oi true, 37

Formulas, equivalent, 31, 39 Function, 12, 15 bounded, 137-140 bracket, 27

characteristic, 18 circular, 184-188 composite, 123 constant, 16, 130

decreasing, 151 even, 26

factorial, 18 greatest integer, 27 identity, 18

increasing, 151 inverse, 126-128 linear, 19 monotonic, 151 multiple-valued, 15 (footnote)

negative-valued, 132 nonnegative, 132 odd, 26 of areal variable, 16 of two variables, 19

on-into, on—onto, 15, 16 polynomial, 19 positive-valued, 132 power, 61, 65 rational, 19 real-valued, 16 signum, 27 single-valued, 15 (footnote) square-root, 78 statement, 30 strictly decreasing, 152 strictly increasing, 152

strictly negative, 132 strictly positive, 132 unity, 130 Function space, 135

Functional, increasing linear, 153 linear, 142

General element of matrix, 221 General equation of the first degree, 99-103 General equation of the second degree, 163, 243-246 Generator of a cylinder, 291 Generatrix of a surface of revolution, 292 Geometry, analytic, 24

Graph, degenerate, 251 imaginary, 251 in polar coordinates, 191

of an equation, 24 of a formula, 81 of a function, 24

of an inequality, 81 of a relation, 81

of a system, 107 Greatest integer function, 27 Greatest lower bound, 76

Group, 52 Abelian (commutative), 52 Half-plane, closed, 84, 105 open, 84, 105

Hints and answers, 303 Homogeneous law, 131, 142, 216,

DT, DYE Homogeneous linear equation, 236 Horizontal line, 83 Hyperbola, 172-181 degenerate, 259 in polar coordinates, 199 Hyperbolic paraboloid, 298 Hyperboloid, of one sheet, 296 of two sheets, 297 Hypothetical syllogism, 42 (Ex. 16) Identities, trigonometric, 185-188 Identity, 3

additive, 50, 131 multiplicative, 50, 131

INDEX Identity function, 18

Identity matrix, 228 If and only if, 39

lieeeathenssi7 Image of a set, 18

Imaginary circle, 251 Imaginary ellipse, 253 Imaginary graph, 251 Implication, 36-38 Inclination, of a directed line, 91 of an undirected line, 92 Include, 3

Incomplete ordered field, 78 Increasing function, 151

strictly, 152 Increasing linear functional, 153 Independence, linear, 206, 272 Independent variable, 16 Indirect method of proof, 41 Induction, mathematical, 59-64

Inductive definition, 60 Inductive set, 58 Inequalities, equivalent, 70 simultaneous, 107 solving, 70-72 Inequality, 54 linear, 105-107 strict, 55 triangle, 69, 74 (Ex. 33), 205, 219 (Ex. 44) Infimum, 76 Infinite set, 60 Infinity, minus, 67 plus, 67 Inner product, 215, 223, 278

Input-output machine, 13 Integer, 65 even, 65 negative, 65 odd, 65 positive, 65 Intercept, 98 of a plane, 237 (Ex. 39)

of a surface, 290 Intercept equation, 102, 237 (Ex. 39) Intermediate-membership property, 77 Intersection of sets, 7-9 of vector spaces, 141 (Ex. 18) Interval, 66-68 bounded, 67 closed, 67-68

327 compact, 68

degenerate, 67 finite, 68 half-open, 67 infinite, 68 open, 66-68 unbounded, 67-68 Intervals of a net, 148 Invariants, 254-260 Inverse, additive, 50, 131 multiplicative, 50 Inverse function, 126-128 Inverse of a matrix, 240 Irrational number, 65 Irrationality of V2, 78

Latus rectum, of an ellipse, 172 (Ex. 39)

of a hyperbola, 182 (Ex. 45) of a parabola, 165 (Ex. 30) Law, associative, 5, 8, 49, 50, 60, 125,

NBO. US 227 cancellation, 51, 52, 2] Ce, 4S} (ae, 227/)) commutative, 5, 8, 42 (Ex. 13), 49,

SO), GO), AS, WSO IS7/, AKG, ALS, 2S distributive, 8, 12, 51, 60, 130, 131,

US UO, 227, LYS homogeneous, 131, 142, 216, 227, 279 of trichotomy, 56

parallelogram, 219 (Ex. 46) transitive, 55, 60 Laws, De Morgan, 8, 39

of exponents, 61, 65 of logic, 39, 41, 42 (Exs. 12-16) Leading coefficient, 19 Least upper bound, 75 Left-handed coordinate system, 269

Lemniscate, 194, 195 (Exs. 31-32) Length of an interval, 67

Limagon, 195 (Exs. 33-36) Line, 81-114

directed, 91

equation of general, 99-103 equations of, 282-284 horizontal, 83 in polar coordinates, 191 intercept form, 102 of symmetry, 25 parametric equations, 110, 284 point-slope form, 97 slope-intercept form, 98

INDEX

328

Member of a set, 2

two-point form, 97 undirected, 91 vertical, 83

Line segment, 86 directed, 86 Linear combination, 136, 205, 222, 272

Midpoint, of an interval, 67 of asegment, 109, 212 (Ex. 36), 276 (Ex. 40) Minor axis, 167 Minor semiaxis, 167

Linear dependence and independence,

Minus infinity, 67

AVG; Linear Linear Linear

Modus tollendo ponens, 39

LUZ equation, homogeneous, 236 equations, systems of, 235-239 expression, 82

Linear form, 142 (footnote) Linear function, 19 Linear functional, 142

function, 151 (footnote)

Monotonically increasing (decreasing ) 151 (footnote)

Multiplication, of functions, 129 of a matrix by a scalar, 221 of numbers, 50

increasing, 153

of a vector by a scalar, 203, 272

positive, 143

Linear inequality, 105-107 Linear space of functions, 135 Linear transformation, 223 Locus problems,

Monotone

Monotonic function, 151

119-122,

182-183

Multiplicative identity, 50, 131 Multiplicative inverse, 50 Multiple, scalar, 130, 142, 203, 221, 272 Multiple-valued function, 15 (footnote)

Logic, laws of, 39, 41, 42 (Exs. 12-16) Lower bound, 66 greatest, 76

Natural number, 58

even, 62

Machine, input-output, 13 Magnitude of a vector, 202, 222, 270 properties of, 205 Main diagonal, 221 Major axis, 167 Major semiaxis, 167 Mapping, 13, 15 Mathematical induction, 59-64

Matrix, coefficient, 235 diagonal, 243 (Ex. 28)

identity, 228

odd, 62 Necessary and sufficient condition, 39 Necessary condition, 37 Negation, 32 of a formula, 32 of an implication, 36 of a quantified statement, 43-46 of a statement, 32 Negative, of a function, 131 of a matrix, 221 of a number, 50 of a vector, 204, 272

of second degree terms, 244

Negative number, 54 Negative reciprocal, 92 Negative slope, 88

of the general equation of the second degree, 244

Negative-valued function, 132 Net, 147

orthogonal, 251 rectangular, 220 rotation, 250, 301 singular, 234

Non-Abelian group, 52 Noncollinear vectors, 204 Noncommutative group, 52

inverse, 240

nonsingular, 234

square, 221

symmetric, 244 ZETOMe eeeS

Matrix algebra, 228 Mean, arithmetic, 57 (Ex. 14) Median of a triangle, 112 (Ex. 21)

Noncoplanar vectors, 272 Nondecreasing function, 151 (footnote) Nonempty set, 3 Nonincreasing function, 151 (footnote) Nonnegative function, 132 Nonnegative number, 56 Nonpositive number, 56

INDEX Nonsingular matrix, 234

329 hyperbolic, 298

Nontrivial solution, 236

Parallel lines, 89-91, 101, 259

Nonzero number, 50

Parallelogram law, 219 (Ex. 46)

n-tuple, ordered, 15, 220

Parameter,

Number, irrational, 65

Parametric equations of a line, 110, 284

110, 284

natural, 58

Partition, 147

negative, 54 nonnegative, 56

Pascal, Blaise, 62 (footnote) Pascal’s triangle, 62

nonpositive, 56

Perpendicular

positive, 54

Plane, Cartesian, 14

rational, 65

equation of, 281 Euclidean, 22

real, 48-80 Number scale, 21

Numbers,

lines, 93, 101

Plus infinity, 67 Point, 2, 21-24, 268

direction, 214, 278

of symmetry, 22, 25 of trisection, 110 Point-circle, 116, 251

Oblate spheroid, 296 Octant, first, 270 Odd function, 26

Point-slope equation, 97 Polaraxis, 191 Polar coordinates, 189-201 and conic sections, 195-201 Pole, 189 Polynomial, 19

Odd integer, 65 Odd natural number, 62 On-—into function, 15 On-into relation, 15 On-onto function, 16 On-—onto relation, 15 One, 50, 130 One-to-one correspondence, :

16

Only if, 37

quadratic, 243 Polynomial function, 19 A Polynomials, vector space of, 135 Ns : Positive linear functional, 143 Positive number, 54

Open disk, 115

Open half-plane, 84, 105 ;

ere: e,

Positiv

88

Positive terms, 44

Open interval, 66-68 ; ; Operation, binary, 49 i Opposite sense, 55 : Order of a square matrix, 221

ae ; Positive-valued function, 132 Power, 61, 65 Power function, 61, 65 Prcipaldiiconal 221

Oe complete: ue ae Ordered n-tuple, 15, 220 : Ordered pair, 13 ‘ ; 5 Ordered triple (triad, triplet), 15, 268 Order for f P 132 We ea,

Product, circle, 218 dot, inner, or scalar, 215, 223, 278 of functions, 129 ; of numbers, 50 : of a scalar and a matrix, 221 of ascalar and a vector, 203, 272 of two matrices, 225, 226

= mere,

is

me ace ee

s

square, 219 (Ex. 47)

Product formulas, trigonometric, 188

eee

Product of matrices, determinant of, 234

ee

oes

DN

a

2 A 268

Orthogonal vectors, 216, 223, 278

Pair, ordered, 13

Parabola, 159-164, 195, 259, 263 Paraboloid, elliptic, 298

Projection, orthogonal, 23, 268

Prolate spheroid, 296

pioorbyconeedicion. 41

Proper extension, 18 Proper restriction, 18 Proper subset, 3 Properties of absolute value, 69

INDEX

330 Quadratic form, 245 Quadratic polynomial, 243 Quadric cone, 297 Quadric surface, 295-299 Quantifier, existential, 43 universal, 43 Quotient, of functions, 130 of numbers, 50

Radial coordinate,

189

Radius, of a circle, 115 of an interval, 67 Radius vector, 203, 270

Range, of a function, 14-16 of arelation, 14 of a variable, 12, 29 Rational function, 19 Rational number, 65 Real axis, 21 Real number system, 48-80 extended, 67 Real-valued function, 16 Real variable, function of a, 16 Reciprocal, 50 Reciprocal, negative, 92

Rectangular coordinates, 21, 23, 268 Rectangular matrix, 220 Reductio ad absurdum, 41 Relation, 14 on-into, on-onto, 15

Representation of a vector, 15 Restriction of a function, 17 Reverse sense, 55 Right-handed coordinate system, 269 Root, characteristic, 263 square, 78 Rose, four-leaved, 193, 195 (Ex. 29) five-leaved, 195 (Ex. 30) three-leaved, 195 (Exs. 27, 28) Rotation, 249-251, 301 simplification by, 262-266 Rotation matrix, 250, 301 Row, 221 Row vector, 220 Ruling of a cylinder, 291

Scalar, 203 Scalar multiple, 130, 142, 203, 221, 272

Scalar product, 215, 223, 278 Second degree, general equation, 163,

243-246 Section of a surface, 289-290 Segment, 86

Semiaxis, conjugate, 174, 296, 297 major, 167 minor, 167 of a quadric surface, 296-297

transverse, 174, 296, 297 Sense of an inequality, 55 Setar bounded, 66 dense, 78

empty, 3 finite, 60 inductive, 58

infinite, 60 nonempty, 3 solution, 31, 70 jnautdel, Sil, 740)

Set-builder notation, 10 Set difference, 9 Set membership, 2 Set subtraction, 9 Sigma functional, 144 Sigma summation notation, 144 Signum function, 27 Simplification, law of, 39 Simplifications by rotations, 262-266

Simplifications by translations, 260-262 Simultaneous inequalities, 107 Simultaneous system, graph of, 107 Sine of the angle between two vectors, 288 Single-valued function, 15 (footnote)

Singular matrix, 234 Skew rotation, 251

Slope, in tangent form, 91-92 negative, 88 of a directed line segment, 86 of a line, 89 of a line segment, 87 positive, 88 Slope-intercept equation, 98 Solution, nontrivial, 236 of a formula, 31

of an inequality, 70 of a system, 107 trivial, 236 Solution set, 31, 70

Solving inequalities, 70-73

INDEX Space, 4, 268 function, 135 Span, 208, 274

Special symbols, list, 319 Spheroid, 296 Spiral of Archimedes, 195 (Ex. 37) Square, of a function, 130 of a number, 53, 61 Square matrix, 221 Square product, 219 (Ex. 47) Square root function, 78 Square root of 2, 78 Statement, 29 Statement function, 30 Step-function, 147-150 Straight line in polar coordinates, 191 Strict inequality, 55 Strictly decreasing function, 152 Strictly increasing function, 152 Strictly monotonic function, 152 Strictly negative function, 132 Strictly positive function, 132 Subnet, 148 . Subset, 3 proper, 3 Subtraction, of functions, 129 of matrices, 221 of numbers, 50 of sets, 9 of vectors, 204, 272 Subtractivity of linear functionals, 142 Sufficient condition, 37 Sum, of functions, 129 of matrices, 221 of numbers, 49 of vectors, 203, 221, 272 Sums and differences of sines and cosines, 187 Supernet, 148 Supremum, 75 Surface, quadric, 295-299 Surface of revolution, 292

Syllogism, hypothetical, 42 (Ex. 16) Symbols, special list of, 319 Symmetric form for equations of a line, 282

Symmetric matrix, 244 Symmetric set, 22, 25 Symmetry, axis of, 25 center of, 22, 25

331 line of, 25 point of, 22, 25 with respect to a line, 25

with respect to a point, 22, 25 Systems of linear equations, 235-239 homogeneous linear, 236-238

Tangent formula for slope, 91-92 Tautology, 39 (footnote) Term, constant, 19 Terminal point, 203, 270 Trace, of a matrix, 257 of a surface, 290 Transformation, 13, 15 additive, 223 homogeneous, 223 linear, 223

Transitive law, 55, 60 Translation, 246-247, 300

simplification by, 262-266 Transpose of a matrix, 244 Transverse axis, 174, 296, 297 Transverse semiaxis, 174, 296, 297 Triangle, Pascal’s, 62

Triangle inequality, 69, 74 (Ex. 33), 205, 219 (Ex. 44) Trichotomy, law of, 56 Trigonometric functions, 184-188 Trigonometry, informal review, 184-188 Triple (triad, triplet), ordered, 15, 268 Trisection, points of, 110 Trivial solution, 236 True formula, 37 Truth set, 31, 70 Two-point equation, 97

Unbounded, 66 Unbounded above, 66 Unbounded below, 66 Unbounded interval, 67-68 Undirected line, 91 Unequal, 2, 3, 13, 202, 221, 270 Union of sets, 5-7 Uniqueness of inverse, 52, 53 (Ex. 1) Uniqueness of unity, 53 (Ex. 3) Uniqueness of zero, 51 Unit vector, 202, 222, 270 Unit vectors, standard basic, 208, 223 74:

332 Unity, 50 Unity function, 130 Universal quantifier, 43 Universal set of a variable, 12

INDEX linearly independent, 206, 272 noncollinear, 204 noncoplanar, 272 orthogonal, 216, 223, 278

unequal, 202, 270

Universe, of discourse, 4 of a variable, 12

Venn, John, 5

Upper bound, 66

Venn diagram, 5-10 Vertex, of an ellipse, 166

Values, of a function, 16 of a variable, 12, 29 Variable, 12, 29

dependent, 16

of a hyperbola, 174 of a parabola, 161 of a quadric surface, 297-298 Vertical line, 83

dummy, 5, 144

independent, 16 Vector, column, 220

n-dimensional, 220 radius, 203, 270 row, 220 three-dimensional, 270 two-dimensional, 202 unit, 202, 222, 270 zero, 202, 222, 270 Vector space, abstract, 141-142 of bounded functions, 140 of constant functions, 135 of functions, 135

Weakly increasing (decreasing), 151 (footnote ) Weakly monotonic, 151 (footnote)

Well-ordering principle, 59

x-axis, 22, 268 x-coordinate, 23, 268

y-axis, 22, 268

y-coordinate, 23, 268

of polynomials, 135 of step-functions, 149 Vectors, collinear, 204

coplanar, 272 equal, 202, 270 linearly dependent, 206, 272

Zero, 50

Zero function, 130 Zero matrix, 228 Zero vector, 202, 222, 270

JOHN R. HUBBAPD DEPT. of MATHEMATICS

COLUMBUS COLLEGs

QA

39 mo 5Y.

Olmsted, J.M.H. Prelude to calculus linear algebra

Columbus College Libragy Columbus, George

and

ieesite