Elements of analytic number theory

165 27 645KB

Russian Pages [89] Year 2013

Table of contents :
Chapter 1. Algebraic and transcendental numbers
§1.1. Field of algebraic numbers. Ring of algebraic integers
1. Preliminary information
2. Minimal polynomial
3. Algebraic complex numbers
4. Algebraic integers
§1.2. Diophantine approximations of algebraic numbers
1. Diophantine approximation of degree
2. Dirichlet approximation theorem
3. Liouville theorem on Diophantine approximation of algebraic numbers
§1.3. Transcendentality of e and
1. Hermite identity
2. Transcendentality of e
3. Symmetrized n-tuples
4. Transcendentality of
§1.4. Problems
Chapter 2. Asymptotic law of distribution of prime numbers
§2.1. Chebyshev functions
1. Definition and estimates
2. Equivalence of the asymptotic behavior of Chebyshev functions and of the prime-counting function
3. Von Mangoldt function
§2.2. Riemann function: Elementary properties
1. Riemann function in 0 Rez>1
2. Distribution of the Dirichlet series of a multiplicative function
3. Convolution product and the Möbius inversion formula
4. Euler identity
5. Logarithmic derivative of the Riemann function
6. Expression of the integral Chebyshev function via the Riemann function
§2.3. Riemann function: Analytic properties
1. Analytic extension of the Riemann function
2. Zeros of the Riemann function
3. Estimates of the logarithmic derivative
4. Proof of the Prime Number Theorem
§2.4. Problems
Chapter 3. Dirichlet Theorem
§3.1. Finite abelian groups and groups of characters
1. Finite abelian groups
2. Characters
3. Characters modulo m
§3.2. Dirichlet series
1. Convergence of L-series
2. Landau Theorem
3. Proof of the Dirichlet Theorem
Chapter 4. p-adic numbers
§4.1. Valuation fields
1. Basic properties
2. Valuations over rationals
3. The replenishment of a valuation field
§4.2. Construction and properties of p-adic fields
1. Ring of p-adic integers and its properties
2. The field of p-adic rationals is the replenishment of rationals in p-adic metric
3. Applications
§4.3. Problems
Bibliography
Glossary
Index

Recommend Papers

Elements of Number Theory

121 43 4MB Read more

Analytic number theory

122 30 2MB Read more

Analytic Number Theory: An Introductory Course (Monographs in Number Theory) 9812389385, 9789812389381

This valuable book focuses on a collection of powerful methods of analysis that yield deep number-theoretical estimates.

122 58 3MB Read more

Number Theory Arising From Finite Fields: Analytic And Probabilistic Theory 0824705777, 9780824705770

"Number Theory Arising from Finite Fields: Analytic and Probabilistic Theory" offers a discussion of the advan

112 15 2MB Read more

Analytic Number Theory: Proceedings of the Japanese-French Symposium Held in Tokyo

381 6 860KB Read more

Ways of Being. Elements of Analytic Ontology 9780231899437

110 93 5MB Read more

Linear Algebra with Elements of Analytic Geometry 5030017895

120 53 9MB Read more

Analytic Theory of Abelian Varieties 9780521205269, 0521205263

The study of abelian manifolds forms a natural generalization of the theory of elliptic functions, that is, of doubly pe

99 54 5MB Read more

$Analytic theory of continued fractions 9780821821060, 0821821067$

Analytic theory of continued fractions 9780821821060, 0821821067

The theory of continued fractions has been defined by a small handful of books. This is one of them. The focus of Wall&#

390 15 3MB Read more

Handbook of analytic operator theory 9781138486416, 1721731741

422 52 2MB Read more

Elements of analytic number theory

Author / Uploaded
Kolesnikov P.S.
Vdovin E.P.

0 0 0
Like this paper and download? You can publish your own PDF file online for free in a few minutes! Sign Up

File loading please wait...

Citation preview

ELEMENTS OF ANALYTIC NUMBER THEORY

P. S. Kolesnikov, E. P. Vdovin

Lecture course

Novosibirsk, Russia 2013

Contents Chapter 1. Algebraic and transcendental numbers § 1.1. Field of algebraic numbers. Ring of algebraic integers 1. Preliminary information 2. Minimal polynomial 3. Algebraic complex numbers 4. Algebraic integers § 1.2. Diophantine approximations of algebraic numbers 1. Diophantine approximation of degree ν 2. Dirichlet approximation theorem 3. Liouville theorem on Diophantine approximation of algebraic numbers § 1.3. Transcendentality of e and π 1. Hermite identity 2. Transcendentality of e 3. Symmetrized n-tuples 4. Transcendentality of π § 1.4. Problems

4 4 4 6 9 11 13 13 15

Chapter 2. Asymptotic law of distribution of prime numbers § 2.1. Chebyshev functions 1. Deﬁnition and estimates 2. Equivalence of the asymptotic behavior of Chebyshev functions and of the prime-counting function 3. Von Mangoldt function § 2.2. Riemann function: Elementary properties 1. Riemann function in Re z > 1 2. Distribution of the Dirichlet series of a multiplicative function 3. Convolution product and the M¨ obius inversion formula 4. Euler identity 5. Logarithmic derivative of the Riemann function

30 31 31

2

18 20 20 23 25 26 29

32 34 35 35 36 37 38 39

Contents

3

6.

Expression of the integral Chebyshev function via the Riemann function 40 § 2.3. Riemann function: Analytic properties 43 1. Analytic extension of the Riemann function 43 2. Zeros of the Riemann function 47 3. Estimates of the logarithmic derivative 48 4. Proof of the Prime Number Theorem 51 § 2.4. Problems 55 Chapter 3. Dirichlet Theorem § 3.1. Finite abelian groups and groups of characters 1. Finite abelian groups 2. Characters 3. Characters modulo m § 3.2. Dirichlet series 1. Convergence of L-series 2. Landau Theorem 3. Proof of the Dirichlet Theorem

56 56 56 58 60 60 60 66 68

Chapter 4. p-adic numbers 71 § 4.1. Valuation ﬁelds 71 1. Basic properties 71 2. Valuations over rationals 74 3. The replenishment of a valuation ﬁeld 76 § 4.2. Construction and properties of p-adic ﬁelds 79 1. Ring of p-adic integers and its properties 80 2. The ﬁeld of p-adic rationals is the replenishment of rationals in p-adic metric 81 3. Applications 84 § 4.3. Problems 85 Bibliography

86

Glossary

87

Index

88

CHAPTER 1

Algebraic and transcendental numbers § 1.1. Field of algebraic numbers. Ring of algebraic integers 1. Preliminary information. Let us recall some basic notions from Abstract Algebra. Throughout we use the following notations: ℙ ℕ ℤ ℚ ℝ ℂ

is is is is is is

the the the the the the

set set set set set set

of of of of of of

all prime numbers; positive integers (the set of natural numbers); all integers; all rational numbers; all real numbers; all complex numbers, ℂ = ℝ + ιℝ, ι2 = −1.

Given a ﬁeld F , symbol F [x] denotes the ring of polynomials in variable x with coeﬃcients in F . If f (x) = a0 + a1 x + · · · + an xn ∈ F [x], ai ∈ F , is chosen so that an ̸= 0, then n is called the degree of f (x), it is denoted by deg f , while an ∈ F is called the leading coeﬃcient of f (x), and if an = 1 then f (x) is called monic. If f (x) = 0 (all coeﬃcients are equal to zero) then the degree of f (x) is said to be −∞. If f (x), g(x) ∈ F [x], g(x) ̸= 0, then there exist unique q(x), r(x) ∈ F [x] such that f (x) = g(x)q(x) + r(x),

deg r < deg g.

(1.1)

These polynomials (quotient q(x) and remainder r(x)) can be found by the well-known division algorithm. If r(x) = 0 then we write g ∣ f (g divides f ). One may easily note the similarity between division algorithms in the ring of integers ℤ and in the ring of polynomials F [x]. Indeed, these are particular examples of Euclidean rings, and there are many common features and problems that can be solved in similar ways for integers and polynomials. In particular, the greatest common divisor (gcd) d of two polynomials f, g ∈ F [x] is deﬁned as a monic common divisor which is divided by every 4

§ 1.1. Field of algebraic numbers

5

other common divisor, i.e., d = gcd(f, g) if and only if d ∣ f , d ∣ g, and for every h ∈ F [x] with h ∣ f and h ∣ g it follows that h divides d. To ﬁnd gcd of f and g, one may use the Euclidean algorithm based on the following observation: If f and g are related by (1.1) then gcd(f, g) = gcd(g, r). Moreover, if d = gcd(f, g) then there exist p(x), s(x) ∈ F [x] such that f (x)p(x) + g(x)s(x) = d(x). Exercise 1.1. Let f1 , . . . , fn ∈ F [x] be a ﬁnite family of polynomials over a ﬁeld F . Prove that there exists a unique monic greatest common divisor of f1 , . . . , fn . Suppose R is a commutative ring with an identity (e.g., R = ℤ or R = F [x] as above). A subset I ⊆ R is called an ideal of R if a ± b ∈ I for every a, b ∈ I, and ax ∈ I for every a ∈ I, x ∈ R. For example, the set of all even integers is an ideal of ℤ; the set {f (x) ∈ F [x] ∣ f (α) = 0} is an ideal of F [x], where α is an element of some extension ﬁeld of F . Since an intersection of any family of ideals is again an ideal, for every set M ⊆ R there exists minimal ideal of R which contains M , it is denoted by (M ). It is easy to note that {∑ } (M ) = xi ai ∣ xi ∈ R, ai ∈ M . i

An ideal I of R is said to be principal if there exists a ∈ R such that I = (a), where (a) stands for ({a}). Recall that a commutative ring R is called an integral domain (or simply a domain) if ab = 0 implies a = 0 or b = 0 for all a, b ∈ R. In particular, ℤ and F [x] are integral domains. An integral domain R such that every ideal of R is principal is called a principal ideal domain. Exercise 1.2. Prove that ℤ and F [x] (where F is a ﬁeld) are principal ideal domains. In particular, if f (x), g(x) ∈ F [x] then ({f, g}) = (gcd(f, g)). If R is a domain, then we can consider the ﬁeld of fractions of R. In order to construct it we start with the Cartesian product R × (R \ {0}) of R (here each pair (a, b) corresponds to fraction ab ). Now deﬁne an equivalence relation (a1 , b1 ) ∼ (a2 , b2 ) ⇐⇒ a1 b2 = a2 b1 . Let Q be the set of equivalence classes of R × (R \ {0}) under this equivalence. Deﬁne the addition and multiplication on representatives by (a1 , b1 ) + (a2 , b2 ) = (a1 b2 + a2 b1 , b1 b2 ),

(a1 , b1 ) · (a2 , b2 ) = (a1 a2 , b1 b2 ).

§ 1.1. Field of algebraic numbers

6

We leave for the reader to prove that all operations deﬁned are correct and that Q is a ﬁeld under these operations. Let I be an ideal of R. Then R is split into a disjoint union of congruence classes a+I = {a+x ∣ x ∈ I}, a ∈ R, and the set of all these classes (denoted by R/I) is a ring with respect to natural operations (a + I) + (b + I) = (a + b) + I,

(a + I)(b + I) = ab + I.

The ring R/I obtained is called a factor ring of R over I. For example, ℤ/(n) = ℤn , the ring of remainders modulo n. A proper ideal I of R is maximal if there are no proper ideals J of R such that I ⊂ J. For example, if R = ℤ then (n) is maximal if and only if n = ±p, where p is a prime natural number; if R = F [x] then (f ) is maximal if and only if the polynomial f is irreducible over F . Note that if I is a maximal ideal of a commutative ring R then R/I is a ﬁeld. Indeed, if a + I ̸= 0 (i.e., a ∈ / I) then J = {xa + b ∣ x ∈ R, b ∈ I} is an ideal of R such that I ⊂ J. Therefore, J = R, and thus all equations of the form (a + I)X = c + I, c ∈ R, have solutions in R/I. Exercise 1.3. Prove that ℝ[x]/(x2 + x + 1) is a ﬁeld isomorphic to the ﬁeld ℂ of complex numbers. 2. Minimal polynomial. A complex number α ∈ ℂ is algebraic if there exists a nonzero polynomial f (x) ∈ ℚ[x] such that f (α) = 0. A non-algebraic complex number is said to be transcendental. An algebraic number α is called an algebraic integer if there exists a monic polynomial f (x) ∈ ℤ[x] such that f (α) = 0. Every rational number α ∈ ℚ ⊂ ℂ is obviously an algebraic one. Moreover, as we will see later, a rational number is an algebraic integer if and only if it is an integer. √ √ 1 3 are algebraic inExercise 1.4. (1) Prove that 2 and + ι 2 2 tegers. (2) Prove that if α ∈ ℂ is an algebraic number then Re α and Im α are algebraic numbers. Whether the same statements are true for an algebraic integer α? (3) Show that the cardinality of the set of all algebraic numbers is countable. Since the cardinality of the entire set of complex numbers is uncountable (continuum), transcendental numbers do exist. However, it is not so easy to show an example of such a number accompanied with reasonable proof.

§ 1.1. Field of algebraic numbers

7

Let α be an algebraic number. Denote by x a formal variable, and let I(α, x) = {f (x) ∈ ℚ[x] ∣ f (α) = 0}. It is clear that I(α, x) is an ideal of the ring ℚ[x]. Since ℚ[x] is a principal ideal domain, the ideal I(α, x) is generated by a single polynomial. Namely, the monic polynomial of minimal positive degree from h(x) ∈ I(α, x) is a generator of the ideal I(α, x), it is called the minimal polynomial for α. Given an algebraic number α, denote its minimal polynomial by hα (x) and say deg hα to be the degree of α. Lemma 1.5. Let α be an algebraic number, and let h(x) be a monic polynomial from ℚ[x]. Then the following conditions are equivalent: (1) h(x) = hα (x); (2) h(α) = 0, and h divides every f ∈ I(α, x); (3) h(α) = 0, and h(x) is irreducible over ℚ. Proof. (1) ⇒ (2) It is obvious by deﬁnition. (2) ⇒ (3) Assume h(x) is reducible over ℚ, i.e., it can be decomposed into nonscalar factors as follows: h(x) = h1 (x)h2 (x),

where

hi (x) ∈ ℚ[x], deg hi ≥ 1.

Then h(α) = h1 (α)h2 (α) = 0, so for either of i = 1, 2 we have hi (α) = 0. Therefore, the corresponding polynomial hi (x) belongs to I(α, x), hence, hi (x) is a multiple of h(x), which is impossible due to deg hi < deg h. (3) ⇒ (1) Let hα (x) be the minimal polynomial for α. Then h(x) ∈ I(α, x) = (hα (x)), so hα ∣ h. Since h(x) is irreducible and deg hα > 0, we have h(x) = hα (x). □ √ For example, if √ α ∈ ℚ then hα = x − α. For α = 2, hα = x2 − 2. The 1 3 number α = + ι satisﬁes the equation α3 + 1 = 0, but its minimal 2 2 polynomial is hα (x) = x2 − x + 1. Exercise 1.6. Prove that a minimal polynomial does not have multiple roots. If α is an algebraic number, and β ∈ ℂ is a root of hα (x) then β is said to be conjugate to α. Therefore, every algebraic number α of degree n has exactly n pairwise diﬀerent conjugate complex numbers α1 , . . . , αn ,

§ 1.1. Field of algebraic numbers

8

including α itself. Moreover, hα (x) =

n ∏

(x − αi ) ∈ ℚ[x].

i=1

Given an algebraic number α, the minimal polynomial hα ∈ ℚ[x] is uniquely deﬁned. If α is an algebraic integer then there also exists a monic polynomial f ∈ ℤ[x] of minimal degree such that hα ∣ f . In order to prove that these f and hα coincide, we need the following observation. Let h(x) ∈ ℚ[x] be a monic polynomial with rational coeﬃcients, p1 pn−1 n−1 p0 + x + ··· + x + xn , gcd(pi , qi ) = 1. h(x) = q0 q1 qn−1 Denote by q(h) the least common multiple of the coeﬃcients’ denominators: q(h) = lcm (q0 , . . . , qn−1 ).

(1.2)

Then for q = q(h) we have qh(x) = a0 + a1 x + · · · + an−1 xn−1 + an xn ∈ ℤ[x], where the gcd of all coeﬃcients is equal to the identity: (a0 , . . . , an ) = 1. Indeed, assume a0 , . . . , an have a common divisor d > 1. Then d ∣ q = an , and for b = q/d ∈ ℤ we have bh(x) ∈ ℤ[x]. Therefore, qi ∣ bpi for all i = 0, . . . , n − 1, so qi ∣ b, and, ﬁnally, q ∣ b = q/d, which is impossible for d > 1. Polynomials with relatively prime integral coeﬃcients are studied in Abstract Algebra, they are called primitive. The following statement is well-known. Exercise 1.7 (Hauss Lemma ). Prove that the product of primitive polynomials is also a primitive polynomial. Proposition 1.8. Let α be an algebraic integer. Then the minimal polynomial hα (x) has integral coeﬃcients. Proof. Suppose f (x) ∈ ℤ[x] be a monic polynomial with integral coeﬃcients such that f (α) = 0. Then Lemma 1.5 implies hα ∣ f , i.e., f (x) = hα (x)g(x),

g(x) ∈ ℚ[x].

Since both f and hα are monic polynomials, so is g. Denote q = q(hα ), q ′ = q(g), where the function q(·) is given by (1.2). According to the remark above, qhα (x) and q ′ g(x) are primitive polynomials in ℤ[x]. However, qq ′ f (x) = (qhα (x))(q ′ g(x)).

§ 1.1. Field of algebraic numbers

9

The Hauss Lemma implies qq ′ f (x) to be primitive, while f (x) has integral coeﬃcients itself. Therefore, qq ′ = 1, i.e., all coeﬃcients of hα and g are integral. □ Thus, there is no need to deﬁne separately integral minimal polynomial and degree for algebraic integers. Let us also note that for every algebraic number α there exists c ∈ ℕ such that cα is an algebraic integer: It is enough to consider c = q(hα )deg hα . 3. Algebraic complex numbers. Let us denote the set of all algebraic numbers by 𝔸. Recall that the Fundamental Theorem of Algebra states ℂ to be an algebraically closed ﬁeld, i.e., every non-constant polynomial over ℂ has a root in ℂ. In this section, we will prove that 𝔸 ⊂ ℂ is the minimal algebraically closed subﬁeld of ℂ, i.e., 𝔸 is the algebraic closure of ℚ. Lemma 1.9. The following statements are equivalent for α ∈ ℂ: (1) α ∈ 𝔸; (2) ℚ[α] := {f (α) ∣ f (x) ∈ ℚ[x]} is a ﬁnite-dimensional vector space over ℚ; (3) ℚ[α] is a subﬁeld of ℂ. Proof. (1) ⇒ (2). Note that f (x) = q(x)hα (x) + r(x) by the division algorithm, deg r < deg hα . Hence, f (α) = r(α), and the latter is a linear combination over ℚ of 1, α, . . . , αn−1 , where n = deg hα . Therefore, dim ℚ[α] ≤ n. Moreover, 1, α, . . . , αn−1 are linearly independent since n is the minimal possible degree of a polynomial over ℚ annihilating α, so dim ℚ[α] = deg hα . (2) ⇒ (1). It is enough to note that 1, α, α2 , . . . are linearly dependent, so there exist a0 , a1 , . . . , an ∈ ℚ such that a0 · 1 + a1 α + · · · + an αn = 0, end at least one of ai is nonzero. Hence, h(x) = a0 + a1 x + · · · + an xn ∈ ℚ[x] is a nonzero polynomial annihilating α. (1) ⇒ (3) Obviously, ℚ[α] is a subring of ℂ. Let 0 ̸= f (α) ∈ ℚ[α], then f ∈ ℚ[x] \ I(α, x). Therefore, hα does not divide f . Since hα is irreducible, we have gcd(f, hα ) = 1. Hence, there exist polynomials u(x), v(x) ∈ ℚ[x] such that u(x)f (x) + v(x)hα (x) = 1.

§ 1.1. Field of algebraic numbers

10

Then for x = α we obtain u(α)f (α) = 1, i.e., f (α)−1 = u(α) ∈ ℚ[α], i.e., ℚ[α] is a ﬁeld. (3) ⇒ (1) Suppose ℚ[α] is a subﬁeld of ℂ. It is enough to consider the case when α ̸= 0. Since α−1 ∈ ℚ[α], there exists f (x) ∈ ℚ[x] such that α−1 = f (α), i.e., h(α) = 0 for h(x) = xf (x) − 1, deg h ≥ 1. □ Corollary 1.10. If α1 , . . . , αn ∈ 𝔸 then ℚ[α1 , . . . , αn ] := {f (α1 , . . . , αn ) ∣ f (x1 , . . . , xn ) ∈ ℚ[x1 , . . . , xn ]} is a ﬁnite-dimensional vector space over ℚ. Proof. For n = 1, it follows from Lemma 1.9. By induction, since dim Q[α1 , . . . , αn−1 ] < ∞ and dim Q[αn ] < ∞, dim Q[α1 , . . . , αn ] ≤ dim Q[α1 , . . . , αn−1 ] · dim Q[αn ] < ∞ (all dimensions are over ℚ).

□

Exercise 1.11. Prove that Q[α1 , . . . , αn ] is a subﬁeld of ℂ provided that α1 , . . . , αn ∈ 𝔸. Theorem 1.12. The set of all algebraic numbers is a subﬁeld of ℂ. Proof. Since 1, 0 ∈ ℂ are obviously algebraic, it is enough to prove the following two statements: (1) If α and β are algebraic numbers then α ± β and αβ are also algebraic numbers; (2) If β ̸= 0 is an algebraic number then 1/β is also an algebraic number. (1) Since ℚ[α ± β], ℚ[α · β] ⊆ ℚ[α, β] ⊆ ℂ, we have dim ℚ[α ± β] < ∞, dim ℚ[α · β] < ∞ by Corollary 1.10. Thus by Lemma 1.9 α ± β, αβ ∈ 𝔸. (2) By Lemma 1.9, β −1 ∈ ℚ[β], so ℚ[β −1 ] ⊆ ℚ[β] which is ﬁnitedimensional over ℚ. Hence, dim ℚ[β −1 ] < ∞ and thus β −1 ∈ 𝔸. □ Theorem 1.13. The ﬁeld 𝔸 is algebraically closed. Proof. Suppose α0 , . . . , αn ∈ 𝔸, n ≥ 1, αn ̸= 0, φ(x) = α0 + α1 x + · · · + αn xn ∈ 𝔸[x]. It is enough to show that φ(x) has a root in 𝔸. Without loss of generality, assume αn = 1. Indeed, by Theorem 1.12 we may divide φ(x) by αn , and the result is still in 𝔸[x]. By the Fundamental Theorem of Algebra, φ(x) has a root β ∈ ℂ. Note that ℚ[β] ⊆ ℚ[α0 , . . . , αn−1 , β].

§ 1.1. Field of algebraic numbers

11

Since β n can be expressed as a linear combination of β k , k = 0, . . . , n − 1, with coeﬃcients depending on αi , i = 0, . . . , n − 1, as β n = −α0 − α1 β − · · · − αn−1 β n−1 , we have dim ℚ[α0 , . . . , αn−1 , β] ≤ n dim ℚ[α0 , . . . , αn−1 ] < ∞. Hence, ℚ[β] is a ﬁnite-dimensional vector space over ℚ, and by Lemma 1.9 β∈𝔸 □ Theorems 1.12 and 1.13 imply that 𝔸 is the algebraic closure of ℚ, which is often denoted by ℚ. 4. Algebraic integers. Suppose K[x1 , . . . , xn ] is the ring of polynomials in several variables over a ring K. For f ∈ K[x1 , . . . , xn ] denote by deg f the maximal sum of the degrees of the variables that appear in a term of f with a nonzero coeﬃcient. Namely, f may be uniquely written as f = f0 + f1 xn + · · · + fm xm n , where fi ∈ K[x1 , . . . , xn−1 ]. Assuming deg fi are deﬁned by induction, set deg f = max (i + deg fi ). i=0,...,m

Theorem 1.14. The set of all algebraic integers is a subring of the ﬁeld ℂ. Proof. Ii is enough to show that if α, β are algebraic integers then α ± β and αβ are algebraic integers as well. We will prove a more general fact: Every number of the form ∑ γ= ckl αk β l , ckl ∈ ℤ, k,l

is an algebraic integer. Let hα (x) = a0 + a1 x + · · · + an−1 xn−1 + xn , hβ (x) = b0 + b1 x + · · · + bm−1 xm−1 + xm . By Proposition 1.8, ai , bj ∈ ℤ. Lemma 1.5 implies that hα and hβ are irreducible polynomials over ℚ, thus they have no multiple roots. Denote by α1 , . . . , αn all complex roots of hα (x), and let β1 , . . . , βm stand for all complex roots of hβ (x). To be more precise, set α1 = α, β1 = β. Consider the polynomial ⎛ ⎞ n ∏ m ∏ ∑ ⎝x − p(x) = ckl αik βjl ⎠ ∈ ℂ[x]. i=1 j=1

k,l

§ 1.1. Field of algebraic numbers

12

It is clear that p(γ) = 0 and the leading coeﬃcient of p(x) is equal to the identity. It remains to show that p(x) ∈ ℤ[x]. It follows from the deﬁnition of p(x) that p(x) = f (x, α1 , . . . , αn , β1 , . . . , βm ), where f ∈ ℤ[x, y1 , . . . , yn , z1 , . . . , zm ]. Namely, ⎛ ⎞ n ∏ m ∏ ∑ ⎝x − f (x) = ckl yik zjl ⎠ ∈ ℤ[x, y1 , . . . , yn , z1 , . . . , zm ]. i=1 j=1

k,l

Moreover, every permutation of the variables y1 , . . . , yn or z1 , . . . , zm does not change the polynomial f , i.e., it is symmetric with respect to yi and with respect to zi . Hence, f (x, y1 , . . . , yn , z1 , . . . , zm ) =

nm ∑

ga (y1 , . . . , yn , z1 , . . . , zm )xa ,

a=0

where every polynomial ga is symmetric with respect to yi and with respect to zi . On the other hand, for every a we have ∑ dm ga (y1 , . . . , yn , z1 , . . . , zm ) = ga,d1 ,...,dm (y1 , . . . , yn )z1d1 . . . zm , d1 ,...,dm ≥1

where every polynomial ga,d1 ,...,dm (y1 , . . . , yn ) is symmetric (with respect to yi ) and has integral coeﬃcients. Lemma 1.15. Let Ψ(y1 , . . . , yn ) ∈ ℤ[y1 , . . . , yn ] be a symmetric polynomial on y1 , . . . , yn , deg Ψ = N , and let α1 , . . . , αn ∈ ℂ be the roots of a polynomial h(x) = a0 + a1 x + · · · + an xn ∈ ℤ[x], an ̸= 0. Then aN n Ψ(α1 , . . . , αn ) ∈ ℤ. Proof. By the Fundamental Theorem of Symmetric Polynomials, there exists a polynomial G(t1 , . . . , tn ) ∈ ℤ[t1 , . . . , tn ] such that Ψ(y1 , . . . , yn ) = G(σ1 , . . . , σn ),

deg G ≤ N.

where σi (y1 , . . . , yn ) are the elementary symmetric polynomials on y1 , . . . , yn . an−k , k = 1, . . . , n. Since The Viet formulae imply σk (α1 , . . . , αn ) = (−1)k an deg G ≤ N , we obtain ( ) an−1 N n a0 aN Ψ(α , . . . , α ) = a G − , . . . , (−1) . 1 n n n an an The latter is an integer number.

□

§ 1.2. Diophantine approximations

13

Now we can use Lemma 1.15 for polynomials ga,d1 ,...,dm (y1 , . . . , yn ) to obtain ga,d1 ,...,dm (α1 , . . . , αn ) ∈ ℤ (in this case, an = 1 since αi are algebraic integers). Hence, ga (α1 , . . . , αn , z1 , . . . , zm ) ∈ ℤ[z1 , . . . , zm ] are symmetric polynomials on z1 , . . . , zm , and by Lemma 1.15 we have ga (α1 , . . . , αn , β1 , . . . , βm ) ∈ ℤ. Therefore, p(x) ∈ ℤ[x].

□

§ 1.2. Diophantine approximations of algebraic numbers It is well-known from the course of Analysis that the set of rational numbers ℚ is a dense subset of the set of real numbers ℝ, i. e., for every p α ∈ ℝ and for every ε > 0 there exists ∈ ℚ such that q | | | | |α − p | < ε. (1.3) | q| Given a natural number N , the set ℚN = {p/q ∣ p ∈ ℤ, 0 < q ≤ N }, has 1 the following obvious property: ∣a − b∣ ≥ 2 for all a, b ∈ ℚN . Hence, for N every α ∈ / ℚN (in particular, for an irrational one), we have min ∣a − α∣ > 0.

a∈ℚN

p should have a q an unboundedly large denominator q. This observation raises the following natural question: How to measure the accuracy of a rational approximation relative to the growing denominator values? Therefore, in order to satisfy (1.3) for small ε, the number

1. Diophantine approximation of degree ν. To estimate the accuracy of an approximation by rationals, we will compare the diﬀerence ∣α − p/q∣ with a decreasing function q −ν , ν > 0. Namely, let us consider the following quantity: |} { | | p || ν| εα,ν (q) = min (1.4) q |α − | , ν > 0. q p∈ℤ,p/q̸=α It turns out that the study of the behavior of εα,ν (q) as q approaches inﬁnity leads to a necessary condition for α to be algebraic.

§ 1.2. Diophantine approximations

14

Definition 1.1. A real number α ∈ ℝ possesses a Diophantine approximation of degree ν > 0 if lim εα,ν (q) < ∞.

(1.5)

q→∞

Let us state a useful criterion that allows to determine whether a given number possesses a Diophantine approximation of a given degree. Lemma 1.16. A real number α possesses a Diophantine approximation of degree ν > 0 if and only if there exists a constant c > 0 such that the inequality | | | | |α − p | < c (1.6) | q | qν p holds for inﬁnitely many rationals ∈ ℚ. q Proof. Denote by M = M (α, c, ν) the set of all pairs (p, q) ∈ ℤ × ℕ such that (1.6) holds. Suppose α possesses a Diophantine approximation of degree ν > 0. By Deﬁnition 1.1, there exists c > 0 such that εα,ν (q) < c for inﬁnitely many q ∈ ℕ. Let us ﬁx this c and note that for every such q ∈ ℕ there exists p ∈ ℤ satisfying the inequality (1.6). Therefore, the set M is inﬁnite, and there is no upper bound for the set {q ∣ (p, q) ∈ M for some p}. It remains to show that the set {p/q ∣ (p, q) ∈ M } is also inﬁnite. Indeed, if it were ﬁnite then the left-hand side of (1.6) has a positive lower bound, but the right-hand side of (1.6) approaches zero since ν > 0 and q may be chosen to be as large as we need. Conversely, suppose there exists c such that (1.6) holds for inﬁnitely many rationals p/q. Then the set M deﬁned in the ﬁrst part of the proof is inﬁnite. Assume the set {q ∣ (p, q) ∈ M for some p} has an upper bound N , i.e., {p/q ∣ (p, q) ∈ M } ⊆ ℚN , where ℚN = {p/q ∈ ℚ ∣ p ∈ ℤ, 0 < q ≤ N }. As we have already mentioned above, for every distinct a1 , a2 ∈ 𝔸 the inequality ∣a1 − a2 ∣ > 1/N 2 holds. Hence, any inﬁnite subset S of ℚN has inﬁnite diameter, i.e., for any d > 0 one may ﬁnd p1 /q1 and p2 /q2 in S such that ∣p1 /q1 − p2 /q2 ∣ > d.

§ 1.2. Diophantine approximations

15

In particular, the set S = {p/q ∣ (p, q) ∈ M } ⊆ ℚN contains p1 /q1 and p2 /q2 such that ∣p1 /q1 − p2 /q2 ∣ > 2c. Since (1.6) holds for pi /qi , i = 1, 2, we have | | | | | | | | |α − p1 | < c , |α − p2 | < c ν | | | q1 q1 q2 | q2ν which implies | | ( ) | p1 1 p2 || 1 | 2c < | − | < c + ν < 2c, q1 q2 q1ν q2 a contradiction. Therefore, the set of denominators {q ∣ (p, q) ∈ M for some p} is inﬁnite, so there exist inﬁnitely many q such that εα,ν (q) < c. Hence, the sequence {εα,ν (q)}q∈ℕ has a ﬁnite accumulation point, and (1.5) holds. □ Exercise 1.17. Whether a rational number possesses a Diophantine approximation of degree 1? 2. Dirichlet approximation theorem. Theorem 1.18 (Dirichlet Approximation Theorem). For every α ∈ ℝ and for every N ∈ ℕ there exist p ∈ ℤ and q ∈ ℕ such that | | | | |α − p | < 1 , q ≤ N. | q | qN Proof. Consider the fractional parts of the numbers kα, k = 0, . . . , N : ξk = {kα} = kα − [kα] ∈ [0, 1). Divide the interval [0, 1) into N intervals of length 1/N as follows: [k/N, (k + 1)/N ),

k = 0, . . . , N − 1.

According to the combinatorial Dirichlet’s Principle, when N + 1 numbers ξ0 , . . . , ξN are set into N intervals, there exists at least one interval which contains at least two of these numbers, i.e., ∣ξk1 − ξk2 ∣ < 1/N for some k1 , k2 , 0 ≤ k1 < k2 ≤ N . Let p = [k2 α] − [k1 α], q = k2 − k1 . Then | | | | |α − p | = 1 ∣α(k2 − k1 ) − [k2 α] + [k1 α]∣ = 1 ∣ξk − ξk ∣ < 1 . 1 | q| q q 2 Nq □

§ 1.2. Diophantine approximations

16

Corollary 1.19. If α ∈ ℝ \ ℚ then α possesses a Diophantine approximation of degree ν = 2. Proof. Theorem 1.18 implies that for every natural number N there exist pN ∈ ℤ and qN ∈ ℕ, qN ≤ N , such that | | | | |α − p N | < 1 . (1.7) | qN | N qN Let us show that the sequence {qN }N ≥1 is not bounded. Assume the converse, i.e., suppose there exists a constant M such that qN ≤ M for all N . Then (1.7) implies | | | p || 1 1 | ≤ → 0, min α − | < q N qN N N →∞ p/q∈ℚM | which is impossible. Therefore, (1.7) holds for inﬁnitely many numbers qN . Since qN ≤ N , we have | | | pN || 2 | α − φ(q), bq

§ 1.2. Diophantine approximations

17

which is impossible for suﬃciently large q. The contradiction obtained proves that S is ﬁnite. □ Corollary 1.21. A rational number α does not have a Diophantine approximation of degree ν > 1. □ The result obtained is in some sence paradoxical: All irrational numbers possess Diophantine approximations of degree 2, but neither of rationals has a Diophantine approximation of degree ν > 1. Therefore, the existence of a Diophantine approximation of degree ν ≥ 2 is a criterion of irrationality. Let us apply the criterion above to the base of the natural logarithm 1 1 1 + ..., (1.9) e = 1 + + + ··· + 1! 2! n! also called the Euler’s number. Proposition 1.22. The Euler’s number is irrational. Proof. Consider the sum of the ﬁrst n + 1 summands in (1.9) and reduce to the common denominator: 1 1 pn 1 = . 1 + + + ··· + 1! 2! n! n! Then ) ( | 1 1 1 pn || | + + ... |= |e − n! (n + 1)! n + 2 (n + 2)(n + 3) ) ( 1 1 2 1 < + 2 + ... = . (n + 1)! 2 2 (n + 1)! Deﬁne a function φ : ℕ → ℝ as follows: φ(1) = 1, 2 for (n − 1)! < q ≤ n!, n ≥ 2. φ(q) = (n + 1)! If q ∈ ((n − 1)!, n!] then qφ(q) ≤ n!

2 2 = , (n + 1)! n+1

so lim qφ(q) = 0. However, q→∞

| | |e − |

| p || < φ(q) q|

§ 1.2. Diophantine approximations

18

for inﬁnitely many p/q = pn /n!, n ≥ 2. Hence, e is irrational by Proposition 1.20. □ 3. Liouville theorem on Diophantine approximation of algebraic numbers. Theorem 1.23 (Liouville Theorem). Let α be an algebraic number of degree n ≥ 2. Then α has no Diophantine approximation of degree ν > n. Proof. First, let us ﬁnd a constant M > 0 such that | | | | |α − p | > M | q | qn

(1.10)

for all p ∈ ℤ and q ∈ ℕ. Set h(x) to be a multiple of the minmal polynomial for α with integer coeﬃcients, e.g., h(x) = q(hα )hα (x). Suppose α1 , . . . , αn ∈ ℂ are the complex roots of this polynomial, and assume α1 = α. Then h(x) = an

n ∏

(x − αk ) = an (x − α)

k=1

n ∏

(x − αk ),

k=2

where an is the leading coeﬃcient of h(x). Denote by M the following quantity: )−1 ( n ∏ (∣α∣ + ∣αk ∣ + 1) , M = ∣an ∣ k=2

and let us show that (1.10) holds. If p and q meet the inequality ∣α − p/q∣ ≥ 1 then (1.10) is valid since M < 1 (∣α∣ > 0, n ≥ 2). Assume p and q satisfy the condition ∣α − p/q∣ < 1. In this case, | | |p| | | < 1 + ∣α∣ |q| and thus | | ∣h(p/q)∣ = ∣an ∣ ||α −

| n | | p || ∏ || p || α − | k q| q| k=2 | | |) | | n ( ∏ | |p| | p || | ≤ |α − | ∣an ∣ ∣αk ∣ + || || < ||α − q q k=2

| p || 1 . q| M

§ 1.2. Diophantine approximations

19

Lemma 1.5 implies h(x) to be irreducible over ℚ, hence, it has no rational roots. Therefore, | | | 1 p pn || | ∣h(p/q)∣ = |a0 + a1 + · · · + an n | ≥ n , q q q and (1.10) follows. Finally, apply (1.10) to show that α has no Diophantine approximation of degree ν > n. Assume the converse: Let there exist ν > n and c > 0 such that | | | | |α − p | < c | q | qν holds for inﬁnitely many p/q ∈ ℚ. Then, as it was shown in the proof of Lemma 1.16, the last inequality holds for inﬁnitely many denominators q ∈ ℕ. Then for suﬃciently large q we have c M < n, qν q in contradiction to (1.10).

□

The Liouville Theorem is a powerful tool that allows constructing explicit examples of transcendental numbers. Namely, we obtain the following suﬃcient condition of transcendentality. Corollary 1.24. Let α be a real number. If for every N ∈ ℕ it possesses a Diophantine approximation of degree ν ≥ N then α is transcendental. Example 1.2. The following number is transcendental: α=

∞ ∑

10−n! .

n=1

Proof. Given N ∈ ℕ, consider N ∑ pN pN = 10−n! = N ! . qN 10 n=1

Note that | | ∞ ∑ | | 2 |α − pN | = 10−n! < 2 · 10−(N +1)! = N +1 . | qN | q N n=N +1

§ 1.3. Transcendentality of e and π Therefore, for every ν > 0 there exist inﬁnitely many rationals

20 pN , N ≥ ν, qN

satisfying | | | | |α − pN | < 2 < 2 . ν | N +1 qN | qN qN Hence, α possesses a Diophantine approximation of any degree. Theorem 1.23 implies α is not algebraic. □ § 1.3. Transcendentality of e and π The Liouville Theorem provides a suﬃcient condition for a real number to be transcendental. However, this condition is not necessary. There exist diﬀerent methods to prove transcendentality of a series of important constants, e.g., e and π. In this section, we are going to study one of these methods known as the Hermite Method. Given an analytic function f (z) on the complex plane, denote by ∫x f (z) dz x0

the Riemann integral of f (z) along the straight segment starting at x0 ∈ ℂ and ending at x ∈ ℂ (the Cauchy Integral Theorem implies that this integral does not depend on the choice of a path with the same endpoints x0 and x, we choose the straight segment for convenience). 1. Hermite identity. Lemma 1.25 (Hermite Identity). Let α ∈ ℂ, α ̸= 0, and let f (x) ∈ ℂ[x], deg f ≥ 1. Then for every x ∈ ℂ we have ∫x f (t)e−αt dt = F (0) − F (x)e−αx , (1.11) 0

where

f (n) (x) f (x) f ′ (x) + + · · · + . α α2 αn+1 Proof. According to the Fundamental Theorem of Calculus (the Newton— Leibniz Formula), ∫x d f (t)e−αt dt = f (x)e−αx . dx F (x) =

0

§ 1.3. Transcendentality of e and π

21

On the other hand, d F (x)e−αx = (F ′ (x) − αF (x))e−αx = −f (x)e−αx . dx Hence, the derivatives with respect to x of the both sides of (1.11) coincide. It remains to compare the values at x = 0 to obtain the desired equality. □ The main idea of the Hermite’s method is to apply the Hermite identity (1.11) to a polynomial of the form H(h(x)) =

1 a(n−1)p xp−1 h(x)p , (p − 1)! n

(1.12)

h(x) ∈ ℤ[x], n = deg h, p ∈ ℕ. Let us establish the properties of H(h(x)). Lemma 1.26. Let h(x) = a0 + a1 x + · · · + an xn ∈ ℤ[x], a0 , an ̸= 0, n ≥ 1, and let β1 , . . . , βn ∈ ℂ be the entire collection of roots of h(x) in which every root of multiplicity k appears k times. Then for every p ∈ ℕ, p ≥ 2, the polynomial f (x) = H(h(x)) deﬁned by (1.12) has the following properties: f (j) (0) = 0, 0 ≤ j ≤ p − 2; f (j) (βi ) = 0, 0 ≤ j ≤ p − 1, i = 1, . . . , n; (n−1)p p a0 ; f (p−1) (0) = an (j) f (0) ∈ pℤ, j ≥ p; n ∑ (5) f (j) (βi ) ∈ pℤ, j ≥ p. (1) (2) (3) (4)

i=1

Proof. It is easy to see from the construction of f (x) that 0 is its root of multiplicity p − 1 and every βi is a root of f (x) of multiplicity at least p. As we know from the Abstract Algebra course, a root of multiplicity k of a polynomial f (x) is also a root of f ′ (x), f ′′ (x), . . . , f (k−1) (x). This implies (1) and (2). To prove (3), let us distribute all brackets in the deﬁnition of f (x) and ﬁnd the term of lowest degree in x, namely, the term is 1 a(n−1)p ap0 xp−1 . (p − 1)! n (n−1)p

ap0 . For all other terms in f (x), Its (p − 1)th derivative is equal to an their (p − 1)th derivatives contain x and thus turn into zero at x = 0. Before we proceed with the proof of the remaining statements, note the following general fact. Given a polynomial Φ(x) ∈ ℤ[x], its jth derivative

§ 1.3. Transcendentality of e and π

22

Φ(j) (x), j ∈ ℕ, belongs to j!ℤ[x], i.e., all nonzero coeﬃcients of Φ(j) (x) contain the factor j!. Indeed, for all m ≥ j we have ( ) m m−j m (j) (x ) = j! x ∈ ℤ[x], j and the jth derivatives of xm , m < j, turn into zero. To prove (4), consider Φ(x) =

(p − 1)! (n−1)p an

f (x) = xp−1 h(x)p ∈ ℤ[x].

According to the remark stated above, Φ(j) (x) ∈ j!ℤ[x] and thus f (0) ∈

j! ℤ ⊆ jℤ (p − 1)!

for j ≥ p, Finally, note that h(x) = an

n ∏

(x − βi ).

i=1

Hence, n ∏ anp anp n n (x − βi )p xp−1 = Θ(x, β1 , . . . , βn ), f (x) = (p − 1)! i=1 (p − 1)!

where Θ=x

p−1

n ∏

(x − yi )p ∈ ℤ[x, y1 , . . . , yn ].

i=1

The polynomial Ψ(y1 , . . . , yn ) =

n ∑ ∂j Θ i=1

∂xj

(yi , y1 , . . . , yn ) ∈ ℤ[y1 , . . . , yn ]

is symmetric with respect to y1 , . . . , yn , deg Ψ = deg Φ−j = np+p−1−j < np for j ≥ p, and all coeﬃcients of Ψ are divisible by j!. By Lemma 1.15 1 applied to Ψ(y1 , . . . , yn ), we obtain j! anp n Ψ(β1 , . . . , βn ) ∈ ℤ. j! Since j ≥ p, anp n Ψ(β1 , . . . , βn ) ∈ pℤ, (p − 1)!

§ 1.3. Transcendentality of e and π

23

and it remains to note n ∑

f (j) (βi ) =

i=1

anp n Ψ(β1 , . . . , βn ), (p − 1)!

which proves (5).

□

Remark 1.3. Upon the conditions of Lemma 1.26, assume that βi ∈ ℤ, i = 1, . . . , n. Then the statement (5) of Lemma 1.26 may be enhanced in the obvious way as f j (βi ) ∈ pℤ,

i = 1, . . . , n, j ≥ p.

2. Transcendentality of e. Theorem 1.27. If α ∈ ℚ \ {0} then eα is a transcendental number. Proof. Suppose β = ea/b is an algebraic number for some a/b ∈ ℚ, a ̸= 0. Then e is a root of the equation xa −β b = 0 with algebraic coeﬃcients. By Theorem 1.13, all roots of such equation (in particular, e) are algebraic numbers. Hence, it is enough to show that e itself is transcendental. Assume e is algebraic, and let he (x) ∈ ℚ[x] be its minimal polynomial. Then there exists bn ∈ ℤ such that bn he (x) = b0 + b1 x + · · · + bn xn ∈ ℤ[x]. It is clear that b0 ̸= 0. Choose a prime number p ∈ ℤ such that p > n and p > ∣b0 ∣ (it is possible to make such a choice since the set of primes is inﬁnite). Consider the polynomial h(x) = (x − 1)(x − 2) . . . (x − n) and construct f (x) = H(h(x)) =

1 xp−1 h(x)p (p − 1)!

as in (1.12), where βi = i, i = 1, . . . , n. For every k = 0, 1, . . . , n write the Hermite identity from Lemma 1.25: ∫k 0

f (t)e−t dt = F (0) − F (k)e−k .

§ 1.3. Transcendentality of e and π

24

Multiply each of these equations by bk ek and add the results: n ∑ k=0

∫k bk

n ∑

f (t)ek−t dt =

bk ek (F (0) − F (k)e−k )

k=0

0

= F (0)

n ∑

bk ek −

k=0

n ∑

bk F (k) = F (0)bn he (e) −

k=0

n ∑

bk F (k).

k=0

Therefore, k

n ∫ ∑

bk f (t)ek−t dt = −

k=0 0

n ∑

bk F (k).

(1.13)

k=0

Recall that F (x) =

∑

f (j) (x).

j≥0

Consider the right-hand side of (1.13). Lemma 1.12 and Remark 1.3 imply n ∑

bk F (k) = b0 f (p−1) (0) + b0

∑

f (j) (0) +

j≥p

k=0

n ∑ k=1

bk

∑

f (j) (k).

j≥p

In the last expression, b0 f (p−1) (0) = (−1)np b0 (n!)p ̸≡ 0 (mod p) by the choice of p, all other summands are integer multiples of p. Hence, for every suﬃciently large prime p the right-hand side of (1.13) is a nonzero integer number. Now, let us estimate the absolute value of the left-hand side of (1.13): | | | | n ∫k n | ∑ |∑ k−t | | b f (t)e dt ≤ ∣bk ∣k max {∣f (t)ek−t ∣} k | | t∈[0,k] | k=0 |k=0 0

≤ (n+1) max ∣bk ∣n max {∣f (t)∣}en ≤ C k=0,...,n

t∈[0,n]

C1p 1 np−1 (n−1)np ≤ C , (p − 1)! (p − 1)! C1p = 0, p→∞ (p − 1)!

where C and C1 do not depend on the choice of p. Since lim | | | | n ∫k | |∑ k−t | bk f (t)e dt|| < 1 | | |k=0 0

§ 1.3. Transcendentality of e and π

25

when p is suﬃciently large, but the right-hand side of (1.13) is a nonzero integer and thus its absolute value is greater or equal to 1. The contradiction obtained proves the theorem. □ 3. Symmetrized n-tuples. Definition 1.4. An N -tuple (β1 , . . . , βN ) ∈ ℂN is called symmetrized if

N ∏

(x − βj ) ∈ ℚ[x].

j=1

It is clear that a symmetrized tuple remains symmetrized after every permutation of its components. If we add (or remove) a rational number to (or from) a symmetrized tuple then the tuple obtained is symmetrized. Also, the concatenation of two or more symmetrized tuples is again a symmetrized tuple. To prove the transcendence of π we need the following properties of symmetrized tuples. Lemma 1.28. Let (α1 , . . . , αn ) ∈ ℂn be a symmetrized tuple, and let σ = σk ∈ ℤ[x1 , . . . , xn ], k ∈ {1, . . . , n}, be an elementary symmetric polynomial in x1 , . . . , xn . Then ( ) N ( α1 ) ∑ n αn βj σ e ,...,e = e , N= , k j=1 where (β1 , . . . , βN ) is a symmetrized tuple. Proof. Recall that ∑

σ = σk =

xi1 . . . xik

1≤i1 0, describing the distribution of prime numbers. Namely, π(x) is the cardinality of the set of all primes p ∈ ℙ such that p ≤ x, x ∈ ℝ, x > 0. 30

§ 2.1. Chebyshev functions

31

The main purpose of this section is to prove the following equivalence: x π(x) ∼ . ln(x) x Exercise 2.1. Prove that ∼ li(x), where ln x ∫x dt li(x) = . ln t 2

§ 2.1. Chebyshev functions 1. Deﬁnition and estimates. Here we will establish important relations between the function π(x) and the following functions deﬁned on all positive real numbers: ∑ • ψ(x) = ln p, where Qx = {(p, m) ∣ p ∈ ℙ, m ∈ ℕ, pm ≤ x}. (p,m)∈Qx

The function ψ is called the Chebyshev function; ∫x ψ(t) ˜ dt is known as the integral Chebyshev function. • ψ(x) = t 1

Note that π(x) may be presented in a similar way as ∑ 1. p∈ℙ,p≤x

Note that (p, m) ∈ Qx if and only if p ≤ x and ln(pm ) = m ln p ≤ ln x. Therefore, the sum ∑ ln p = ψ(x) (p,m)∈Qx

contains each ln p as many times as the count of all m ∈ ℕ such that m ln p ≤ ln x. If x > 1 (i.e., ln x > 0) there exist [ln x/ ln p] of such ms (here [·] stands for the integral part of a real number). Hence, ∑ [ ln x ] ψ(x) = ln p, x > 1. (2.1) ln p p∈ℙ,p≤x

Proposition 2.2. The following statements hold: ˜ (1) ψ(x) ≤ π(x) ln x, ψ(x) ≤ π(x) ln2 x for every x > 1; z z ˜ (2) lim ψ(x)/x = lim ψ(x)/x = 0 for every z ∈ ℂ such that x→∞ x→∞ Re z > 1.

§ 2.1. Chebyshev functions

32

Proof. (1) If we omit [·] in (2.1) then the value of this sum may just increase since [x] ≤ x, i.e., ∑ ∑ ∑ ψ(x) = [ln x/ ln p] ln p ≤ ln p ≤ ln x = π(x) ln x p≤x

p≤x

p≤x

(hereinafter, when we use “p” for summation index, we assume p ranges over prime numbers, as in (2.1)). For the integral Chebyshev function, note that ˜ ψ(x) ≤ ψ(x)

∫x

dt = ψ(x) ln x. t

1

The statement (2) immediately follows from (1): If z = 1+a+ιb, a > 0, then ∣ψ(x)/xz ∣ = ψ(x)/x1+a ≤ π(x) ln x/x1+a ≤ x ln x/x1+a = ln x/xa → 0 ˜ as x → ∞. For ψ(x), the proof is completely similar.

□

z ˜ Exercise 2.3. Prove that lim ψ(x)/x = 0 for Re z > 1. x→∞

2. Equivalence of the asymptotic behavior of Chebyshev functions and of the prime-counting function. Theorem 2.4. The following statements are equivalent: (A1) π(x) ∼ x/lnx; (A2) ψ(x) ∼ x; ˜ (A3) ψ(x) ∼ x. Proof. (A1)⇔(A2) By Proposition 2.2, ψ(x) ≤ π(x) ln x for x > 1. Hence, ψ(x) π(x) ≤ , x x/ ln x and the same inequality holds for upper and lower limits of these functions as x → ∞. Namely, ψ(x) ψ(x) ≤ lim ≤ 1, x→∞ x x x→∞ π(x) π(x) (A2) ⇒ lim ≥ lim ≥ 1. x→∞ x/ ln x x/ ln x x→∞ (A1) ⇒ lim

§ 2.1. Chebyshev functions

33

On the other hand, choose a parameter 0 < a < 1 and consider ∑ S(x, a) = ln p, x > 1. xa 1. (2.4) z n n=1 It is easy to see that the series (2.4) is absolutely converging for Re z > 1. In the semiplane Re z ≥ s, s > 1, the series (2.4) converges uniformly with respect to z. Hence, (2.4) is uniformly converging on every compact subset

§ 2.2. Riemann function: Elementary properties

36

in Re z > 1. By the well-known Weierstrass Theorem, the limit of a sequence of analytic functions which is uniformly converging on every compact subset in a given domain is again an analytic function. Therefore, (2.4) deﬁnes an analytic function in the semiplane Re z > 1. Later we will see how to extend ζ analytically into the semiplane Re z > 0. 2. Distribution of the Dirichlet series of a multiplicative function. Recall some notions from the elementary number theory. An arbitrary map f : ℕ → ℂ is called an arithmetic function. An arithmetic function is called multiplicative if f (1) = 1 and f (nm) = f (n)f (m) provided that n, m ∈ ℕ are relatively prime. The Fundamental Theorem of Arithmetic implies any multiplicative function f to be uniquely determined by its values f (pm ), p ∈ ℙ, m ∈ ℕ. Examples of multiplicative functions are given by: • the function I(n), I(pm ) = 1; • the identity function e(n), e(pm ) = 0 (while e(1) = 1); • the M¨ obius function μ(n), μ(p) = −1, μ(pm ) = 0 for m > 1. Lemma 2.7. Let f be a multiplicative function and let z ∈ ℂ. Suppose ∞ ∑ the series f (n)n−z is absolutely converging. Then n=1 ∞ ∑

−z

f (n)n

=

n=1

∏

(∞ ∑

p∈ℙ

d=0

) d

−dz

f (p )p

.

Proof. It is easy to see that for every p ∈ ℙ the series

∞ ∑

f (pd )p−dz

d=0

contains a part of the initial series and thus converges absolutely. Enumerate prime numbers in the increasing order: ℙ = {pn ∣ n ∈ ℕ},

p1 = 2, p2 = 3, . . . ,

and consider the partial product PN =

N ∏

(∞ ∑

n=1

d=0

) f (pdn )p−dz n

.

Since a product of absolutely converging series is distributive, we may distribute the brackets in the last expression to obtain PN =

∞ ∑ d1 ,...,dN =0

−dN z f (pd11 ) . . . f (pdNN )p1−d1 z . . . pN =

∑ n∈MN

f (n)n−z ,

§ 2.2. Riemann function: Elementary properties

37

where MN ⊂ ℕ consists of all natural numbers of the form pd11 . . . pdNN , di ≥ 0. The least natural number that is not in MN is equal to pN +1 , hence, |∞ | ∞ |∑ | ∑ | | −z ≤ f (n)n − P ∣f (n)n−z ∣ → 0 | N| | | n=p n=1

N +1

as N → ∞ (since pN +1 → ∞).

□

3. Convolution product and the M¨ obius inversion formula. Given two functions f, g : ℕ → ℂ, their convolution product is an arithmetic function deﬁned by ∑ (f ◦ g)(n) = f (d)g(n/d), n ∈ ℕ. (2.5) d∣n

where the summation index d ranges over the set of all divisors of n. Exercise 2.8. Prove that the convolution product is associative and commutative. Exercise 2.9. For every arithmetic function f , show f ◦ e = e ◦ f = f . (This is the reason why e is called the identity function.) The following statement shows a nice relation between Dirichlet series and the convolution product. Lemma 2.10. Let f and g be arithmetic functions such that their Dirichlet series ∞ ∞ ∑ ∑ f (n) g(n) , z n nz n=1 n=1 are absolutely converging for some z ∈ ℂ. Then )( ∞ ) (∞ ∞ ∑ g(n) ∑ ∑ f (n) (f ◦ g)(n) = . z z n n nz n=1 n=1 n=1

(2.6)

Proof. Since the product of absolutely converging series is distributive, we may write )( ∞ ) (∞ ∞ ∑ ∑ f (n) ∑ g(n) f (n)g(m) = . z z n n nz mz n=1 n=1 n,m=1

§ 2.2. Riemann function: Elementary properties

38

Introduce new indexes d = n, N = nm and change the order of summation (it is possible due to absolute convergence of the product series): ∞ ∞ ∞ ∑ ∑ ∑ 1 ∑ 1 f (n)g(m) = f (d)g(N/d) = (f ◦ g)(N ). z mz z z n N N n,m=1 N =1

N =1

d∣N

Therefore, (2.6) is proved.

□

Moreover, it is known that the set of all multiplicative functions forms a group with respect to the convolution product, i.e., if f and g are multiplicative functions then so is f ◦ g, and for every multiplicative function f there exists a multiplicative function f −1 such that f ◦ f −1 = f −1 ◦ f = e. Lemma 2.11 (M¨ obius inversion formula). If f : ℕ → ℂ is an arithmetic function and f ◦ I = g then g ◦ μ = f . Proof. Note that I ◦ μ = e. Indeed, if the canonical form of n is q1m1 . . . qrmr , q1 , . . . , qr ∈ ℙ, r ≥ 1, then ∑ ∑ ∑ μ(qj1 qj2 − . . . (I ◦ μ)(n) = μ(d) = μ(1) − μ(qj ) + j

d∣n

j1 1 then ∏ ζ(z) = (1 − p−z )−1 (the Euler identity),

(2.7)

p∈ℙ ∞ ∑ 1 μ(n) = , ζ(z) n=1 nz

(2.8)

where μ(n) is the M¨ obius function. In particular, ζ has no zeros in the semiplane Re z > 1.

§ 2.2. Riemann function: Elementary properties

39

Proof. Apply Lemma 2.7 for f = I to obtain ∏ ∏ ζ(z) = (1 + p−z + p−2z + . . . ) = (1 − p−z )−1 , p∈ℙ

p∈ℙ

when Re z > 1. This proves (2.7). To prove (2.8), apply Lemma 2.7 to f = μ. It is possible to do so ∞ ∑ since ∣μ(n)∣ ≤ 1 and thus the series μ(n)n−z is absolutely converging n=1

for Re z > 1. Hence, ∞ ∑

∏

∞ ∑

p∈ℙ

d=0

( μ(n)n

−z

=

n=1

) d

μ(p )p

−dz

=

∏

(1 − p−z ).

p∈ℙ

Thus (2.7) implies (2.8).

□

5. Logarithmic derivative of the Riemann function. Lemma 2.14. For all n ∈ ℕ we have (Λ ◦ I)(n) = ln n. Proof. For n = 1 the statement is obvious. If n > 1 then consider the canonical distribution of n, n = q1a1 . . . qrar . By the deﬁnitions of the convolution product and of the von Mangoldt function, we have (Λ ◦ I)(n) = 0 +

aj r ∑ ∑ j=1 a=1

Λ(qja ) · 1 =

r ∑

aj ln qj .

j=1

On the other hand, ln n = ln(q1a1 . . . qrar ) = a1 ln q1 + · · · + ar ln qr .

□

Theorem 2.15. In the semiplane Re z > 1, the following identity holds: ∞ ∑ ζ ′ (z) Λ(n) =− . (2.9) ζ(z) nz n=1 Proof. As we have already noted, (2.4) converges uniformly with respect to z in the domain Re z ≥ s for every s > 1. Hence (as we know from complex analysis) the series (2.4) allows term-by-term derivation at every point of the semiplane Re z > 1: ∞ ∑ ln n ζ ′ (z) = − . nz n=1 Multiply the expression obtained by (2.8) for 1/ζ(z): )( ∞ ) (∞ ∑ ln n ∑ μ(n) ζ ′ (z) =− . ζ(z) nz nz n=1 n=1

§ 2.2. Riemann function: Elementary properties

40

Since both series in the right-hand side are absolutely converging, we may distribute the brackets and collect similar terms with nz to obtain ∞ ∑ ζ ′ (z) μ(d2 ) ln d1 = ζ(z) (d1 d2 )z d1 ,d2 =1 ⎛ ⎞ ∞ ∞ ∑ ∑ ∑ 1 (ln ◦μ)(n) ⎝ = . (2.10) μ(d) ln(n/d)⎠ z = n nz n=1 n=1 d∣n

By Lemma 2.14, (Λ ◦ I)(n) = ln n. Lemma 2.11 implies ln ◦μ = Λ, and it remains to apply (2.10) to complete the proof. □ 6. Expression of the integral Chebyshev function via the Riemann function. Denote by La (a ∈ ℝ) the vertical line {z ∣ Re z = a} in the complex plane, and consider La as a path of integration in the upward direction, i.e., from a − i∞ to a + i∞. Theorem 2.16. For every a > 1 the following identity holds: ∫ ( ′ ) z ζ (z) x 1 ˜ − dz for x ≥ 1. ψ(x) = 2πι ζ(z) z 2

(2.11)

La

Proof. Let ℐa (x) stand for the improper integral in the right-hand side of (2.11). It follows from (2.9) that | ′ | | ζ (z) xz | 1 |− | | ζ(z) z 2 | ≤ C ∣z∣2 , for z ∈ La (Re z = a > 1), where C is a constant which does not depend on z. Since the integral | | | |∫ ∫∞ | dz | dy | | | z2 | ≤ 2 a + y2 | | La

−∞

converges, ℐa (x) is absolutely converging. Therefore, it can be adequately evaluated via the Cauchy principal value. By (2.9), ∫ ∑ ∞ Λ(n) xz ℐa (x) = lim dz, (2.12) B→∞ nz z 2 n=1 LB a

§ 2.2. Riemann function: Elementary properties

41

where LB a = {z ∣ Re z = a, −B ≤ Im z ≤ B}. In this expression, the integrand series is uniformly converging with respect to z ∈ La since | | | Λ(n) xz | xa ln n | | | nz z 2 | ≤ a2 na for every z ∈ La (recall that the series

∞ ∑

ln n/na converges for a > 1).

n=1

Hence, the integral in the right-hand side of (2.12) can be evaluated in the termwise way: ∞ ∫ ∑ Λ(n) xz dz. (2.13) ℐa (x) = lim B→∞ nz z 2 n=1 LB a

On the other hand, z = a + ιy, and thus partial integrals over the segments LB a may be estimated as follows: | | |∫ | ∫∞ | Λ(n) xz | ( x )a 1 Λ(n) | | Λ(n) ≤ dz dy ≤ Cxa a , | | | nz z 2 | n a2 + y 2 n |LB | −∞ a

where C is a constant not depending on n and B. Since Λ(n) ≤ ln n, the series in the right-hand side of (2.13) converges uniformly with respect to B, hence, the limit as B → ∞ may be evaluated in the termwise way: ∫ ∞ ∑ 1 Λ(n) (x/n)z 2 dz. ℐa (x) = (2.14) z n=1 La

1 has the only singular point at z = 0. z2 The Laurent series for gn,x (z) at z = 0 has the form ( ) ∞ ∑ 1 1 k ln (x/n)z k z −2 (x/n)z 2 = 1 + z k! The integrand gn,x (z) = (x/n)z

k=1

1 2 ln (x/n) + . . . , 2! and thus the residue at z = 0 (the coeﬃcient at z −1 ) is equal to ln(x/n). Let us now evaluate the integrals in the right-hand side of (2.14) by means of the Cauchy integral theorem. Case 1: x ≥ n, x/n ≥ 1. Consider the circle in the complex plane of √ radius R = B 2 + a2 centered at the origin and denote by CaB the arc of this that lies leftward to the line Re z = a. Being combined with an appropriate = z −2 + ln(x/n)z −1 +

§ 2.2. Riemann function: Elementary properties

42

Figure 1. Integration path in Case 1 segment L′ = LB a , this arc forms a closed integration path which includes the origin for suﬃciently large B (see Fig. 1). Then | | | | |∫ | | 2π−θ | | | | ∫ | 1 1 | | | z ιφ z (x/n) 2 2ιφ ιRe dφ|| | (x/n) 2 dz | ≤ | | | | z R e | |C B | θ a

a

∫2π

≤ (x/n)

1 dφ = O(1/R) = O(1/B). R

0

By the Cauchy integral theorem, ⎛ ⎞ ∫ ∫ 1 ⎜ ⎟ z 1 z 1 ⎝ (x/n) 2 dz + (x/n) 2 dz ⎠ = ln(x/n). 2πι z z LB a

CaB

The second summand in the left-hand side approaches zero as B → ∞, and the limit of the ﬁrst summand is the desired integral over La . Case 2: x < n, x/n < 1. Consider the integration path shown in Fig. 2: It consists √ of the segment LB a (as in the previous case) and of the arc ∣z∣ = R = B 2 + a2 which is located to the right of La . This is a closed

§ 2.3. Riemann function: Analytic properties

43

Figure 2. Integration path in Case 2 path which contains no singularities of the integrand (x/n)z z12 . By the same reasons as those used in Case 1, the integral in the right-hand side of (2.14) is equal to zero for x < n. Summarizing Case 1 and Case 2, conclude that { ∫ ln(x/n), n ≤ x, 1 z 1 (x/n) 2 dz = 2πι z 0, n > x. La

Plug in these expressions into (2.14) to obtain ∫ ( ′ ) z ∑ 1 ζ (z) x − dz = Λ(n) ln(x/n). 2 2πι ζ(z) z La

n≤x

˜ By Proposition 2.6, the last expression is equal to ψ(x).

□

§ 2.3. Riemann function: Analytic properties 1. Analytic extension of the Riemann function. Let us ﬁrst state a general observation that will be useful later. Suppose f (u, z) is a function depending on a real variable u ∈ [a, b] and on a complex variable z ∈ D, where [a, b] is an interval and D is a domain in ℂ. Assume that for every

§ 2.3. Riemann function: Analytic properties

44

u ∈ [a, b] the function f (u, z) is analytic in z ∈ D. In addition, suppose that for every ε > 0 there exists δ > 0 such that ∣Δu∣ < δ, u + Δu ∈ [a, b] ⇒ ∣f (u + Δu, z) − f (u, z)∣ < ε

(2.15)

for all z ∈ D, u ∈ [a, b]. Then ∫b F (z) =

f (u, z) du a

is an analytic function in z ∈ D such that ′

∫b

F (z) =

∂f (u, z) du. ∂z

a

Indeed, the integral over [a, b] is a limit of a sequence of Riemann sums ΣN =

N ∑

Δu = ∣b − a∣/N.

f (uj , z)Δu,

j=1

Each of ΣN is an analytic function in z ∈ D. Condition (2.15) guarantees uniform convergence (with respect to z ∈ D) ∫b ΣN

→ F (z) =

f (u, z) du.

N →∞

a

The Weierstrass theorem implies F (z) to be an analytic function such that the derivative of F (z) is the limit of Σ′N with respect to z. Every Σ′N is equal to a Riemann sum for the function ∂f /∂z, and thus ′ lim SN =

∫b

N →∞

∂f (u, z) du. ∂z

a

The following important statement says that the Riemann function deﬁned by (2.4) in the semiplane Re z > 1 may be analytically extended into a wider region in which the series (2.4) is diverging. ˜ Theorem 2.17. There exists a function ζ(z) deﬁned in the semiplane ˜ ˜ Re z > 0, z ̸= 1, such that ζ(z) = ζ(z) for Re z > 1. Moreover, ζ(z) is analytic at all points of Re z > 0 except for a simple pole at z = 1, in which ˜ = 1. Resz=1 ζ(z)

§ 2.3. Riemann function: Analytic properties

45

Proof. Denote ρ(u) = 1/2 − {u}, u > 0, where {u} = u − [u] is the fractional part of a real number u, [u] is the largest integer not greater than u. Let us ﬁx two natural numbers N < M and consider the following expression: M∫+1/2

I(N, M, z) :=

ρ(u) 1 + z z+1 du = z u u

=

1/2 − u + [u] 1 +z du z u uz+1

N +1/2

N +1/2 M∫+1/2

M∫+1/2

M∫+1/2

1−z 1 du + z u 2

N +1/2

M∫+1/2

z uz+1

du +

z[u] du uz+1

N +1/2

N +1/2

N |M +1/2 |M +1/2 ∫+1 1 1 || z[u] 1 || − + = z−1 | du | z u 2 u N +1/2 uz+1 N +1/2 N +1/2

+

k+1 ∫

M −1 ∑

k=N +1 k

z[u] du + uz+1

M∫+1/2

z[u] du uz+1

M

1 1 1/2 1/2 − − + = (M + 1/2)z−1 (N + 1/2)z−1 (M + 1/2)z (N + 1/2)z ) M −1 ( ∑ N k M N k M + + + z − + . − − (N + 1)z (N + 1/2)z (k + 1)z k (M + 1/2)z M z k=N +1

Reduce similar terms to obtain I(N, M, z) = −

) M −1 ( ∑ N 1 k 1 + + − + z−1 (N + 1)z (k + 1)z k z−1 M k=N +1

=−

=−

N +1−1 + (N + 1)z

M −1 ∑ k=N +1

( −

k+1−1 1 + z−1 (k + 1)z k

) +

1 M z−1

) M −1 ( ∑ 1 1 1 1 + + − (N + 1)z−1 (N + 1)z k z−1 (k + 1)z−1 k=N +1

+

M −1 ∑ k=N +1

1 1 + z−1 . (k + 1)z M

§ 2.3. Riemann function: Analytic properties

46

Finally, I(N, M, z) =

M ∑ k=N +1

1 kz

Hence, if Re z > 1 then ζ(z) =

N ∑ 1 + lim I(N, M, z). z M →∞ n n=1

On the other hand, for Re z > 1 (N + 1/2)−z+1 lim I(N, M, z) = +z M →∞ z−1

∫∞

ρ(u) du. uz+1

N +1/2

Therefore, ∫∞ N ∑ 1 (N + 1/2)−z+1 ρ(u) ζ(z) = + du, +z z z+1 n z − 1 u n=1

Re z > 1,

(2.16)

N +1/2

for every natural N . In (2.16), the ﬁrst summand is an analytic function in the entire ℂ, the second one has a unique simple pole z = 1 (in which the residue is equal to 1), and it remains to analyze the third summand. The integral in the last summand of the right-hand side of (2.16) may be considered as a series: N N ∫∞ ∫+1 ∫+2 ρ(u) ρ(u) ρ(u) du = du + du + . . . . z+1 z+1 u u uz+1 N +1/2

N +1

N +1/2

On every interval of integration ([N + 1/2, N + 1), [N + 1, N + 2), and so on), ρ(u)/uz+1 is a continuous function. Moreover, for every s > 0, the last series converges uniformly with respect to z in Re z ≥ s, and the integrand satisﬁes the condition (2.15). Hence, the third summand in the right-hand side of (2.16) is an analytic function in Re z > 0 and ∫∞ ∫∞ d ρ(u) d ρ(u) du = du. dz uz+1 dz uz+1 N +1/2

N +1/2

˜ Thus, the right-hand side of (2.16) is the desired function ζ(z). ˜ In what follows, we will identify ζ and ζ.

□

§ 2.3. Riemann function: Analytic properties

47

2. Zeros of the Riemann function. Lemma 2.18. For every s > 1 and for every t ∈ ℝ we have ∣ζ 3 (s)ζ 4 (s + ιt)ζ(s + 2ιt)∣ ≥ 1.

(2.17)

Proof. Denote A = ∣ζ 3 (s)ζ 4 (s+ιt)ζ(s+2ιt)∣. The Euler identity (2.7) implies ∏( )−1 . A= ∣1 − p−s ∣3 ∣1 − p−s−ιt ∣4 ∣1 − p−s−2ιt ∣ p∈ℙ

Let us evaluate the natural log of the both sides of the last expression using the well-known relation ln ∣w∣ = Re ln w, w ∈ ℂ \ {0}. Then ∑( ) ln A = − Re 3 ln(1 − p−s ) + 4 ln(1 − p−s−ιt ) + ln(1 − p−s−2ιt ) . p∈¶

Recall that the Taylor series for ln(1 − z) in a neighborhood of z = 0 is given by ln(1 − z) = −z −

z3 z2 − − .... 2 3

This series is absolutely converging when ∣z∣ < 1. Apply this distribution for z = p−s , z = p−s−ιt , and z = p−s−2ιt : ln A =

∑ p∈ℙ

) ∞ ( ∑ p−n(s+ιt) p−n(s+2ιt) p−ns +4 + Re 3 n n n n=1 ∞ ∑∑ ) p−ns ( = 3 + 4 Re p−ιnt + Re p−2ιnt n n=1 p∈ℙ

=

∞ ∑∑ p−ns (3 + 4 cos(nt ln p) + cos(2nt ln p)) . (2.18) n n=1 p∈ℙ

Note that 3 + 4 cos θ + cos(2θ) = 2(1 + cos θ)2 ≥ 0, and thus all summands in the right-hand side of (2.18) are non-negative. Hence, ln A ≥ 0, i.e., A ≥ 1. □ Theorem 2.19. Riemann function ζ(z) has no zeros in the line Re z = 1.

§ 2.3. Riemann function: Analytic properties

48

Proof. Let us estimate ζ(s) in a neighborhood of the pole z = 1, namely, in the interval 1 < s < 2. It is easy to see that ∫∞ ∞ ∑ 1 1 2 1 ≤ 1 + dx = 1 + ≤ s s n x s − 1 s − 1 n=1 1

(since a right Riemann sum for a decreasing function is no greater that its integral). Assume ζ(1 + ιt) = 0 for some t ̸= 0. Since ζ is analytic at 1 + ιt, its derivative is bounded in a neighborhood of this point. In particular, | | | ζ(s + ιt) − ζ(1 + ιt) | | ≤ C, 1 < s < 2. | | | s−1 Hence, ∣ζ(s + ιt)∣ ≤ C∣s − 1∣. Moreover, since ζ(z) is analytic, it is in particular continuous on the interval z = s + 2ιt, 1 ≤ s ≤ 2, and thus there exists a constant M , such that ∣ζ(s + 2ιt)∣ ≤ M for 1 < s < 2. Therefore, we have obtained the following estimates of the factors in (2.17) for 1 < s < 2: ∣ζ(s + ιt)∣ ≤ C∣s − 1∣, 2 , ∣ζ(s)∣ ≤ s−1 ∣ζ(s + 2ιt)∣ ≤ M. Then A := ∣ζ(s)3 ζ(s + ιt)4 ζ(s + 2ιt)∣ ≤ C1 ∣s − 1∣ → 0

as

s → 1 + 0,

where C1 is a constant, but Lemma 2.18 implies A ≥ 1 for all s > 1. The contradiction obtained proves the theorem. □ 3. Estimates of the logarithmic derivative. To prove the asymptotic law of distribution of prime numbers, we need an upper estimate of the logarithmic derivative of Riemann function far oﬀ the pole z = 1. Let us start with estimates of ζ(z) itself and its derivative. We have already seen that ∣ζ(s)∣ ≤ 2/(s − 1) for 1 < s < 2. Proposition 2.20. There exist constants C1 , C2 > 0 such that (1) ∣ζ(s + ιt)∣ ≤ C1 ln ∣t∣, (2) ∣ζ ′ (s + ιt)∣ ≤ C2 ln2 ∣t∣ for 1 ≤ s ≤ 2, ∣t∣ ≥ 3.

§ 2.3. Riemann function: Analytic properties

49

Proof. (1) The explicit expression for ζ(s + ιt) is given by Theorem 2.17: ∫∞ N ∑ (N + 1/2)1−s−ιt (s + ιt)ρ(u) 1 + du. (2.19) ζ(s + ιt) = + s+ιt k s − 1 + ιt us+1+ιt k=1

N +1/2

Suppose N = [∣t∣], and estimate the absolute values of all summands. First, | |N ∫N N |∑ 1 | ∑ 1 dx | | ≤1+ = 1 + ln N ≤ 1 + ln ∣t∣. |≤ | | k s+ιt | k x k=1

k=1

1

Next, | | | (N + 1/2)1−s−ιt | 1 | |≤ , | | 3 s − 1 + ιt since s ≥ 1 and ∣s + ιt − 1∣ ≥ 3. Finally, | | | | ∫∞ ∫∞ | du s + ∣t∣ 2 + ∣t∣ (s + ιt)ρ(u) || | du| ≤ (s + ∣t∣) = ≤ ≤C | | | us+1+ιt us+1 s(∣t∣ − 1)s ∣t∣ − 1 | |N +1/2 ∣t∣−1 since N + 1/2 ≥ ∣t∣ − 1, ∣s + ιt∣ ≤ s + ∣t∣, and s ≤ 2. Therefore, the absolute value of the right-hand side of (2.19) does not exceed B + ln ∣t∣ for some constant B (B does not depend on s and t). But B + ln ∣t∣ < (B + 1) ln ∣t∣ since ln ∣t∣ > 1 for ∣t∣ ≥ 3. (2) The derivative of (2.19) may be evaluated in the termwise way since the improper integral in the third summand is uniformly converging. Thus, ∫∞ N ∑ ln k d (N + 1/2)1−z d zρ(u) ζ ′ (z) = − + + du. kz dz z−1 dz uz+1 k=1

N +1/2

As above, let z = s + ιt, N = [∣t∣]. The second and third summands in the right-hand side of the last expression are bounded (as it was shown in the proof of (1)). To estimate the ﬁrst summand, consider the integral over ln x [3, ∞) of the function , which is decreasing on x ≥ 3: x N ∑ ln k k=1

ks

N

≤

ln 2 ∑ ln k + ≤ C ln2 N. 2 k k=3

§ 2.3. Riemann function: Analytic properties

50

Hence, there exists a constant C2 > 0 such that (2) holds.

□

Proposition 2.21. There exist constants T0 ≥ 3, C3 , C4 > 0 such that −31/4 (1) ∣ζ(s ∣t∣ > C3 ln−8 ∣t∣, | ′ + ιt)∣ ≥ | C3 ln | ζ (s + ιt) | | ≤ C4 ln10 ∣t∣ (2) || ζ(s + ιt) | for 1 ≤ s ≤ 2, ∣t∣ ≥ T0 . Proof. (1) Recall the inequality deduced in the proof of Theorem 2.19: ∣ζ(s)3 ζ(s + ιt)4 ζ(s + 2ιt)∣ ≥ 1. It holds for s > 1 and for all real t, in particular, for ∣t∣ ≥ 3. Moreover, ζ(s) ≤ 2/(s − 1) when 1 < s ≤ 2. By Proposition 2.20(1), ∣ζ(s + 2ιt)∣ ≤ C1 ln(2∣t∣) ≤ 2C1 ln ∣t∣ for s ≥ 1, ∣t∣ ≥ 3. Hence, )−3/4 ( 2 −3/4 −1/4 ln−1/4 ∣t∣ ∣ζ(s + ιt)∣ ≥ ∣ζ(s)∣ ∣ζ(s + 2ιt)∣ ≥C s−1 for 1 < s ≤ 2, ∣t∣ ≥ 3, where C is a constant which does not depend on s and t. Let us ﬁx t, ∣t∣ ≥ 3, and consider the interval 1 + δ ≤ s ≤ 2, where 2 δ = 10 ln ∣t∣ In this interval, ∣ζ(s + ιt)∣ ≥ C ln−31/4 ∣t∣. It remains to estimate from below the quantity ∣ζ(s + ιt)∣ in the interval 1 ≤ s ≤ 1 + δ. Proposition 2.20(2) allows to estimate an increment of ζ(z) in this interval: | 1+δ | |∫ | | | ∣ζ(1 + δ + ιt) − ζ(s + ιt)∣ = || ζ ′ (z) dz || ≤ δ max ∣ζ ′ (z)∣ Im z=t,s≤Re z≤1+δ | | s

≤ 2 ln−10 ∣t∣C2 ln2 ∣t∣ = 2C2 ln−8 ∣t∣. Therefore, ∣ζ(s + ιt)∣ ≥ ∣ζ(1 + δ + ιt)∣ − 2C2 ln−8 ∣t∣ ≥ C ln−31/4 ∣t∣ − 2C2 ln−8 ∣t∣. Since −31/4 < −8, for any constants C and C2 the second summand in the right-hand side becomes negligible for suﬃciently large ∣t∣. Hence, there exists T0 ≥ 3 such that ∣ζ(s + ιt)∣ ≥ (C/2) ln−31/4 ∣t∣,

§ 2.3. Riemann function: Analytic properties

51

for ∣t∣ ≥ T0 . (2) This estimate immediately follows from (1) and Proposition 2.20(2). □ 4. Proof of the Prime Number Theorem. Now we are ready to complete the proof of the main statement of this chapter. Theorem 2.22. The function π(x) is asymptotically equivalent to x/ ln x. Proof. By Theorem 2.4, it is enough to show that lim x→∞ Theorem 2.16 implies ∫ ( ′ ) z−1 ˜ 1 ζ (z) x ψ(x) dz = − x 2πι ζ(z) z2

˜ ψ(x) = 1. x

La

for every x > 1, a > 1. Here La stands for the line Re z = a, integration is made from a − ι∞ to a + ι∞. Choose real numbers U > T > T0 , where T0 is the constant from Proposition 2.21. Since z = 1 is a simple pole, there exists a neighborhood Uδ (a circle ∣z − 1∣ < δ) in which ∣ζ(z)∣ > 0. Note that the region W = {z ∈ ℂ ∣ 1/2 ≤ Re z ≤ 1, ∣ Im z∣ ≤ T, ∣z − 1∣ ≥ δ} ⊂ ℂ may contain only a ﬁnite number of zeros of the Riemann function. Otherwise, if there are inﬁnitely many zeros in a closed bounded region W , the set of zeros has a condensation point in the same region W . Then the well-known uniqueness theorem of an analytic function implies ζ to be zero on W , which is not the case. Therefore, for every T > 0 there exists η > 0 such that S(T, η) = {z ∈ ℂ ∣ 1 − η ≤ Re z ≤ 1, ∣ Im z∣ ≤ T } contains no zeros of the Riemann function. Consider the integration path Γ shown at Fig. 3. It depends on three parameters a, U , and T , where 1 < a < 2, U > T > T0 , and on the quantity η > 0 which is chosen in such a way that the interior of Γ (and Γ itself) does not contain zeros of the Riemann function.

§ 2.3. Riemann function: Analytic properties

52

Figure 3. Integration path Γ Consider the integral 1 I= 2πι

∫ ( ′ ) z−1 ζ (z) x − dz ζ(z) z2 Γ

The only singularity of the integrand inside the closed path Γ is at the point z = 1, and ( ′ ) z−1 ζ (z) x Resz=1 − = 1. (2.20) ζ(z) z2 By the Cauchy integral theorem, I = 1 for all a, U , T . Let us now present the integral I as a sum of ﬁve integrals over straight segments (1)–(5) of the path Γ (see Fig. 3): I = I1 + I2 + I3 + I4 + I5

(2.21)

(for k = 2, 3, 4, Ik contains integrals over both segments (k)). Let us consider these segments separately. Segment 1. As we have already mentioned, lim I1 =

U →∞

˜ ψ(x) . x

§ 2.3. Riemann function: Analytic properties

53

Segment 2. By Proposition 2.21(2), 1 ∣I2 ∣ ≤ 2 2π

∫a

xa−1 ds s2 + U 2

C4 ln10 U

1 10

Hence, ∣I2 ∣ = O(ln

2

U/U ), i.e., lim I2 = 0.

U →∞

Segment 3. By Proposition 2.21(2), 1 ∣I3 ∣ ≤ 2 2π

∫U

C4 ln10 t dt ≤ C 1 + t2

T

∫∞

dt t3/2

= 2C

1 T 1/2

T

10

(since ln t/t1/2 is a bounded function). Note that the constant C does not depend on x. Segment 4. Segments (4) and (5) form a compact set, on which ζ(z) is a nonzero analytic function. Hence, its logarithmic derivative is continuous on these segments. By the Weierstrass theorem on a continuous function, there exists a constant M = M (T, η) such that | ′ | | ζ (z) | | | | ζ(z) | ≤ M (T, η). Then 1 ∣I4 ∣ ≤ 2 2π

∫1

xs−1 M (T, η) M (T, η) 2 ds ≤ s + T2 πT 2

1−η

∫1

xs−1 ds ≤

C¯ . ln x

1−η

The constant C¯ in this expression depends on T and η, since it is proportional to M (T, η). Segment 5. By the same reasons as for the previous segment, ˜ −η , ∣I5 ∣ ≤ Cx where C˜ depends on T and η. Now, evaluate the limit of (2.21) as U → ∞: I=1=

˜ ψ(x) + 0 + J3 + J4 + J5 , x

§ 2.3. Riemann function: Analytic properties

54

where Jk = lim Ik . As it was shown above, U →∞

1 , T 1/2 ¯ C(T, η) ∣J4 ∣ ≤ , ln x ˜ ∣J5 ∣ ≤ C(T, η)x−η . ∣J3 ∣ ≤ 2C

Hence, for every small ε > 0 one may choose T such that ∣J3 ∣ ≤ ε/2. For a given T , the quantities C¯ and C˜ are ﬁxed, so ∣J4 ∣ + ∣J5 ∣ ≤ ε/2 for a suﬃciently large x. Finally, | | | ˜ || ψ(x) | | ≤ ∣J3 ∣ + ∣J4 ∣ + ∣J5 ∣ < ε |1 − | x | for a suﬃciently large x.

□

Corollary 2.23 (Asymptotic formula for nth prime). Let pn , n = 1, 2, 3, . . . , stand for the nth prime number (p1 = 2, p2 = 3 and so on). Then pn ∼ n ln n. Proof. By the deﬁnition of π, π(pn ) = n. Moreover, the Euclid theorem implies pn → ∞. By Theorem 2.22, n→∞

n = π(pn ) =

pn (1 + Rn ), ln pn

lim Rn = 0.

n→∞

Then ln n = ln π(pn ) = ln pn − ln ln pn + ln(1 + Rn ). Multiply these expressions to obtain n ln n = pn (1 + Rn )

ln pn − ln ln pn + ln(1 + Rn ) . ln pn

It is easy to see that ( ) n ln n ln ln pn ln(1 + Rn ) = (1 + Rn ) 1 − + → 1. n→∞ pn ln pn ln pn □

§ 2.4. Problems

55 § 2.4. Problems

(1) Prove that ∑ p∈ℙ,p≤x

ln p = ln x + O(1). p

(2) Show that there exists a constant C > 0 such that ∑ 1 = C + ln(ln x) + O(1/ ln x). p p∈ℙ,p≤x

(3) The famous Riemann hypothesis states that all zeros of ζ(z) in the semiplane Re z > 0 lie in the line Re z = 12 . Assuming Riemann hypothesis is true, prove ψ(x) = x + O(xε+0.5 ), for every ε > 0.

π(x) = ln(x) + O(xε+0.5 )

CHAPTER 3

Dirichlet Theorem In this chapter we prove the famous Dirichlet theorem on the number of primes in an arithmetic progression with coprime diﬀerence and the ﬁrst member. We start with the structure and properties of ﬁnite abelian groups. § 3.1. Finite abelian groups and groups of characters 1. Finite abelian groups. Recall that the set G with an algebraic binary operation “·” is called a group (we often omit “·” and write g1 g2 instead of g1 · g2 ), if the following axioms are satisﬁed: (a) for every a, b, c ∈ G the identity (a · b) · c = a · (b · c) holds; (b) there exists e ∈ G such that for every a ∈ G we have a·e = e·a = a, such element e is called the identity element or the neutral element; (c) for every a ∈ G there exists a−1 ∈ G such that a·a−1 = a−1 ·a = e, such a−1 is called the inverse element. If an additional axiom of commutativity holds: (d) for every a, b ∈ G we have a · b = b · a, then the group is called abelian. If G is abelian, then the additive notation is used very often, so the operation is denoted by “+”, the identity element is denoted by “0”, and the inverse element is denoted by “−a”. By ∣G∣ we denote the cardinality of G. A one-generated group is called cyclic, i.e. a group G is called cyclic if there exists a ∈ G such that G = {an ∣ n ∈ ℤ} (in additive notation, G = {na ∣ n ∈ ℤ}. Notice that for every group G and for every g ∈ G the set {g n ∣ n ∈ ℤ} appears to be a subgroup of G. This subgroup is called the cyclic subgroup generated by g and is denoted by ⟨g⟩. The cardinality ∣⟨g⟩∣ is called the order of g and is denoted by ∣g∣. Clearly ⟨ℤ, +⟩ is an inﬁnite cyclic group and, for every n ∈ ℕ, ⟨ℤn , +⟩ is a ﬁnite cyclic group of order n. Exercise 3.1. Let G be a group and g be an element of G. Then the following hold. 56

§ 3.1. Abelian groups

57

(1) Either ∣g∣ = ∞, or ∣g∣ is the minimal positive integer k such that g k = e. (2) If ∣g∣ = ∞, then ⟨g⟩ ≃ ℤ; if ∣g∣ = n, then ⟨g⟩ ≃ ℤn 1. (3) If g m = e, then ∣g∣ divides m. (4) If g1 , g2 ∈ G are chosen so that g1 ·g2 = g2 ·g1 , and ⟨g1 ⟩∩⟨g2 ⟩ = {e}, then ∣g1 · g2 ∣ = lcm(∣g1 ∣, ∣g2 ∣). In particular, if ∣g1 ∣, ∣g2 ∣ are coprime then ∣g1 · g2 ∣ = ∣g1 ∣ · ∣g2 ∣. If G1 , . . . Gn are groups, then G1 × . . . × Gn = {(g1 , . . . , gn ) ∣ g1 ∈ G1 , . . . , gn ∈ Gn } with coordinate-wise multiplication is called the direct product of G1 , . . . , Gn . If G is abelian and G1 , . . . , Gn are subgroups of G, then the set G1 ·. . .·Gn = {g1 · . . . · gn ∣ g1 ∈ G1 , . . . , gn ∈ Gn } forms a subgroup of G. Exercise 3.2. Let G be an abelian group. Assume that subgroups G1 , . . . , Gn of G satisfy to the following (1) G = G1 · . . . · Gn . (2) for every i we have Gi ∩ (G1 · . . . · Gi−1 · Gi+1 · . . . · Gn ) = {e}. Then G ≃ G1 × . . . × Gn . In such case G is also called a direct product of subgroups G1 , . . . , Gn . Hint: Prove that conditions (1) and (2) are equivalent to the statement “for every g ∈ G there exist unique g1 ∈ G1 , . . . , gn ∈ Gn such that g = g1 · . . . · gn ”. The proof of the following theorem can be found in many algebra textbooks and we do not provide the proof here. Theorem 3.3. Let G be a ﬁnite abelian group. Then there exist d1 , . . . , dn ∈ ℕ such that G ≃ ℤd1 × . . . × ℤdn . Moreover such d1 . . . , dn can be uniquely determined by the following condition: for every i = 1, . . . , n−1, di divides di+1 . Recall that by ℤ∗n we denote the (multiplicative) group of invertible elements in ℤn . Exercise 3.4. Prove that ∣ℤ∗n ∣ = φ(n), where φ(n) is the Euler funcαk 1 tion. If n = pα 1 · . . . · pk is the canonical decomposition of n into the αk 1 product of primes, then φ(n) = φ(pα 1 ) · . . . · φ(pk ) and for every prime p k k k−1 and positive integer k, φ(p ) = p − p . 1Recall that groups G, H are isomorphic if there exists a bijection φ : G → H preserving operation, i.e. φ(g1 · g2 ) = φ(g1 ) · φ(g2 ).

§ 3.1. Abelian groups

58

Hint: Prove that φ ◦ I = E, where I(n) = 1, E(n) = n for all natural n. Here ”◦” stands for the convolution product of arithmetic functions mentioned in Chapter 2. 2. Characters. Let G be a ﬁnite abelian group. A homomorphism2 χ : G → ℂ∗ is called a character of G. The set of all characters of G is ˆ Deﬁne a multiplication on G ˆ by denoted by G. ˆ and g ∈ G we have (χ1 · χ2 )(g) = χ1 (g) · χ2 (g). for every χ1 , χ2 ∈ G ˆ is an abelian group. Moreover, G and G ˆ are isomorTheorem 3.5. G phic. ˆ is clearly an algebraic operation, and Proof. The multiplication on G it is associative and commutative (we leave the proof for the reader). Denote by χe the principal character, it is deﬁned by χe (g) = 1 for all g ∈ G. It ˆ For every χ ∈ G ˆ deﬁne is immediate that χe is the identity element of G. −1 −1 −1 χ by χ (g) = 1/χ(g). Evidently, χ is the inverse element for χ. Thus ˆ is an abelian group. G In view of the theorem about the structure of ﬁnite abelian groups (Theorem 3.3) we have G = ⟨g1 ⟩ × . . . × ⟨gr ⟩ for suitable g1 , . . . , gr ∈ G. Denote ∣gk ∣ by hk , let εk = exp( 2πι hk ). For k = 1, . . . , r deﬁne the map χk : G → ℂ∗ by χk (g1a1 . . . grar ) = εakk . It is straightforward that χk is a character of G for k = 1, . . . , r. It is also clear that χk , χ2k , . . . , χhk k = χe are distinct characters, since their values on gk are distinct. ˆ deﬁned by φ : g a1 . . . grar ⊦→ χa1 . . . χar r . Consider the map φ : G → G 1 1 By the deﬁnition, φ preserves the multiplication. The map φ is clearly injective: if e ̸= x ∈ G, then x = g1a1 . . . grar and there exists ak such that gkak ̸= e; therefore ( ) 2ak πι a1 ar φ(x)(gk ) = (χ1 . . . χr )(gk ) = exp ̸= 1. hk In order to prove that φ is surjective note that gkhk = e implies χ(gkhk ) = ˆ Hence, χ(gk ) is a complex root of unity of order χ(gk )hk = 1 for every χ ∈ G. hk , and thus χ(gk ) = εakk for appropriate ak . It is straightforward to check that χ(g) = (χa1 1 . . . χar r )(g) for all g ∈ G. Therefore, χ = χa1 1 . . . χar r = 2Homomorphism of groups is a map, preserving the operation. Namely, the map φ : G → H is called a homomorphism, if for every g1 , g2 ∈ G we have φ(g1 · g2 ) = φ(g1 ) · φ(g2 )

§ 3.1. Abelian groups

59

φ(g1a1 . . . grar ), and φ is surjective. This completes the proof of the theorem. □ Exercise 3.6. Check all technical statements in the proof. Remark 3.1. Theorem 3.5 is a particular case of Pontryagin duality between discrete and continuous abelian groups. The group of characters ˆ can be considered as a dual group for G and χ1 , . . . , χr is the dual basis G for g1 , . . . , gr . In the proof of the theorem we also derive the following ˆ such Corollary 3.7. For every nonidentity g ∈ G there exists χ0 ∈ G that χ0 (g) ̸= 1. In the theory of characters for ﬁnite groups the following proposition plays an important role. Proposition 3.8. (Orthogonality relations) The following hold ˆ we have (1) for every χ ∈ G { ∑ ∣G∣, if χ = χe ; χ(g) = 0, if χ ̸= χe . g∈G

(2) for every g ∈ G we have { ∑ ∣G∣, χ(g) = 0,

if g = e; if g = ̸ e.

ˆ χ∈G

Proof. Clearly we need to proof the ﬁrst identity in case χ ̸= χe , and the second in case g ̸= e. (1) If χ ̸= χe , then there exists g0 such that χ(g0 ) ̸= 1. Then ⎛ ⎞ ∑ ∑ ∑ χ(g) = χ(gg0 ) = ⎝ χ(g)⎠ χ(g0 ), g∈G

so

∑

so

∑

g∈G

g∈G

g∈G χ(g) = 0. (2) If g ̸= e, then Corollary 3.7 implies that the existence χ0 such that χ0 (g) ̸= 1. Then ⎛ ⎞ ∑ ∑ ∑ χ(g) = (χ0 χ)(g) = ⎝ χ(g)⎠ χ0 (g), ˆ χ∈G ˆ χ∈G

χ(g) = 0.

ˆ χ∈G

ˆ χ∈G

□

§ 3.2. Dirichlet series

60

3. Characters modulo m. Consider G = ℤ∗m . Denote by n ¯ the residˆ ual of n modulo m. Then each character χ ∈ G can be extended to an arithmetic function χ : ℕ → ℂ by { χ(¯ n), if gcd(n, m) = 1; χ(n) = (3.1) 0, otherwise. This construction lead us to the notion of a character modulo m. More formally, an arithmetic function χ : ℕ → ℂ is called a character modulo m, if the following conditions are satisﬁed: (1) χ(n) ̸= 0 if gcd(n, m) = 1; (2) χ(n) = 0 if gcd(n, m) > 1; (3) χ(n1 ) = χ(n2 ) if n1 ≡ n2 (mod m); (4) χ(n1 n2 ) = χ(n1 )χ(n2 ) for all n1 n2 ∈ ℕ. Deﬁne the set of all characters modulo m by Gm . It follows by deﬁnition that each character modulo m is a character of ℤ∗m extended to all positive integers by using (3.1). So the following proposition is an immediate corollary to the properties of characters of ﬁnite abelian groups obtained above. Proposition 3.9. The following hold (1) There exist exactly φ(m) distinct characters modulo m. (2) All characters modulo m forms a group Gm under usual multiplication: (χ1 · χ2 )(n) = χ1 (n) · χ2 (n). (3) If Ωm is a full system of residuals modulo m, then { ∑ φ(m), if χ = χe , χ(n) = 0, otherwise. n∈Ωm

(4) For every n ∈ ℤ we have { ∑ φ(m), if n ≡ 1 (mod m), χ(n) = 0, otherwise. χ∈Gm

§ 3.2. Dirichlet series 1. Convergence of L-series. Given χ ∈ Gm deﬁne an L-series L(z, χ), where z ∈ ℂ, by ∞ ∑ χ(n) . (3.2) L(z, χ) = nz n=1 The series L(z, χ) is called an L-series of character χ.

§ 3.2. Dirichlet series

61

Theorem 3.10. Let χ be a character modulo m. Then the following statements hold. (1) The series L(z, χ) is absolutely converging for Re z > 1. (2) If χ = ̸ χe , then L(z, χ) is uniformly converging in every compact domain of the semiplane Re z > 0. (3) If χ = χe , then L(z, χ) is uniformly converging in semiplane Re z ≥ s for every s > 1. Moreover, L(z, χ) possesses an analytic continuation in domain Re z > 0, z ̸= 1 with the unique simple pole z = 1. Proof. (1) is evident, since ∣χ(n)∣ ≤ 1. (2) Consider the series

∞ ∑ χ(n) , L(z, χ) = nz n=1

∑ set s(x) := n≤x χ(n). The main idea of the proof comes from the harmonic series. In view of Proposition 3.9(3) we have

s(m) =

m ∑

χ(k) =

Moreover, for every q ∈ ℕ we have each N ∈ ℕ the inequality

N ∑ k=1

holds.

¯ = 0. χ(k)

¯ ∗ k∈ℤ m

k=1

∣

∑

∑m

k=1

χ(qm + k) = 0. Therefore, for

χ(k)∣ < m

(3.3)

§ 3.2. Dirichlet series

62

Now consider χ(n) = s(n) − s(n − 1). We obtain N N ∑ s(N ) s(N − 1) χ(n) ∑ s(n) − s(n − 1) = = − + z z n n Nz Nz n=1 n=1

s(N − 1) s(N − 2) s(2) s(1) s(1) − + ... + z − z + = (N − 1)z (N − 1)z 2 2 1 [ ] N −1 1 1 s(N ) ∑ − s(n) · − = Nz (n + 1)z nz n=1 ∫ n+1 N −1 s(N ) ∑ z + dx = s(n) · z z+1 N x n n=1 ∫ n+1 N −1 s(N ) ∑ s(x) + dx, z· z N xz+1 n n=1 in the last step we use the identity s(x) = s(n) for x ∈ [n, n + 1). Set ∫ n+1 s(x) In (z) := z · dx. z+1 x n Now we can bound ∣In (z)∣. In view of (3.3) we have ( ) ∫ n+1 m∣z∣ 1 1 m dx = · − ∣In (z)∣ ≤ ∣z∣ · . xs+1 s ns (n + 1)s n Therefore N −1 ∑

∣In (z)∣
0 by ⎛ ⎞ ∏ L(z, χe ) = ζ(z) · ⎝ (1 − p−z )⎠ , p∈ℙ,p∣m

whence item (3) of the theorem.

□

Lemma 3.11. Set L(z, Gm ) =

∏

L(z, χ).

χ∈H

Then for Re z > 1 the series L(z, Gm ) can be written in the following form ∞ ∑ an , nz n=0

where every an is a nonnegative integer and, moreover, if n = k φ(m) and (k, m) = 1, then an ≥ 1. Moreover, L(z, Gm )(k) = (−1)k

∞ ∑ an (ln n)k , k = 1, 2, . . . nz n=1

Proof. The statement about derivations follows from the fact that the series ∞ ∑ an nz n=0 is absolutely converging for Re z > 1 (as a product of a ﬁnite number of absolutely converging series). Thus we need to show that L(z, Gm ) =

∞ ∑ an . nz n=0

Since χ is multiplicative for every χ ∈ Gm , Lemma 2.7 implies that )−1 ∏( χ(p) L(z, χ) = 1− z . p p∈ℙ

Therefore, L(z, Gm ) =

∏ ∏( χ∈H p∈ℙ

1−

χ(p) pz

)−1 =

∏ ∏ ( p∈ℙ χ∈Gm

1−

χ(p) pz

)−1 ,

§ 3.2. Dirichlet series

64

where we change the order of multiplication, since Gm is ﬁnite and )−1 ∏( χ(p) 1− z p p∈ℙ

is absolutely converging. For every p ∈ ℙ denote by fp the multiplicative order of p in ℤ∗m , i.e. the minimal positive k with pk ≡ q (mod m). Then 1 = χ(1) = χ(pfp ) = χ(p)fp , so χ(p) = exp(2πιk/fp ) for some k = 0, 1, . . . , fp = 1. It follows that the map ψ : Gm → ℂ∗ acting by ψ : χ ⊦→ χ(p) maps Gm into {exp(2πιk/fp ) ∣ k = 0, 1, . . . , fp − 1}. Clearly ψ is a homomorphism. Notice that ψ is surjective. Indeed, we can deﬁne a character χp on a power of a prime rs by ⎧ ⎨ exp(2πι/fp )s , if r = p s 1 if r ̸= p and (r, m) = 1 χp (r ) = ⎩ 0 if r divides m, and extend it on all integers by multiplicativity. Then ⟨χp ⟩ is a subgroup of Gm and ψ(⟨χp ⟩) = {exp(2πιk/fp ) ∣ k = 0, 1, . . . , fp − 1}. So the kernel of ψ has order ∣Gm ∣/fp = φ(m)/fp =: gp . It follows that for every k = 0, 1, . . . , fp − 1 there exists exactly gp characters χ ∈ Gm with χ(p) − exp(2πιk/fp ). Therefore, a polynomial ∏ (1 − χ(p)t) χ∈Gm fp gp

equals (1 − t ) . Thus ∏

L(z, Gm ) =

(1 − p−fp z )−gp .

p∈ℙ;(p,m)=1

Now the Taylor series for (1 − z)−g equals ∞ ∑ (g + k − 1)! k=0

(g − 1)!k!

zk ,

§ 3.2. Dirichlet series

65

and the series is absolutely converging for ∣z∣ < 1. Since ∣p−fp z ∣ < 1, we can apply the Taylor series to the expression of L(z, Gm ). We obtain (1 − p−fp z )−gp =

∞ ∑ (gp + k − 1)! k=0

(gp − 1)!k!

p−fp kz =

∞ ∑ up,k k=0

pkz

,

where { up,k =

0,

if k is not divisible by fp , if k = r · fp .

(gp +r−1)! (gp −1)!r! ,

Since for every p ∈ ℙ the series ∞ ∑ up,k k=0

pkz

is absolutely converging, we obtain the following identity for every N : ∏

(1 − p−fp z )−gp =

p≤N ;(p,m)=1

∞ ∑ an , nz n=1

where { an =

0, up1 ,k1 · . . . · upl ,kl ,

if (n, m) > 1, if (n, m) = 1 and n = pk11 · . . . · pkl l .

Consider 1 < s ∈ ℝ. Then L(s, Gm ) is converging. On the other hand, for every M , M ∑ an ≤ L(s, Gm ), ns n=1 so the series ∞ ∑ an ns n=1

is converging, and so the series ∞ ∑ an nz n=1

is absolutely converging for every z with Re z > 1. It follows that L(z, Gm ) =

∞ ∑ an , nz n=1

§ 3.2. Dirichlet series

66

where the coeﬃcients an -s are deﬁned above. From the deﬁnition we obtain that an are nonnegative integers. Since fp divides φ(m), we obtain that an ̸= 0, if n = k φ(m) for (k, m) = 1. □ 2. Landau Theorem. Consider L(z) := L(z, Gm ) = L(z, χe ) ·

∏

L(z, χ).

χ∈Gm \{χe }

Theorem 3.10 implies that L(z, χe ) is analytic in semiplane Re z > 0 with a unique simple pole z = 1, ∏while each L(z, χ) for χ ̸= χe is analytic in the semiplane Re z > 0, i.e. χ∈Gm \{χe } L(z, χ) is analytic in the semiplane Re z > 0. Thus L(z) is analytic in the semiplane Re z > 0 with one possible simple pole z = 1. In order to prove that z = 1 is indeed a simple pole for L(z) we need to prove that for every χ ∈ Gm \ {χe } we have L(1, χ) ̸= 0. The next theorem helps us to prove the desired statement. Theorem 3.12. (Landau Theorem) Assume that a function F (z) is analytic in Re z > 0 and suppose that for Re z > 1 we can write F (z) as a series ∞ ∑ an , (3.5) F (z) = nz n=1 where an ≥ 0 for every n. Assume also that in the semiplane Re z > 1 we have ∞ ∑ an (ln n)k F (k) (z) = (−1)k , nz n=1 i.e. series (3.5) can be diﬀerentiated term by term in the semiplane Re z > 1. ∑∞ Then the series n=1 anns is converging in the interval s ∈ (0, 2). Proof. We consider the Taylor series for F (s) about s = 2: F (s) =

∞ ∑ F (k) (2)

k!

k=0

(s − 2)k .

(3.6)

The Taylor series of an analytic function is converging in the circle of radius r centered at s0 , where r is the distance to the nearest singular point, i.e. series (3.6) is converging for every s ∈ (0, 2). Now F (k) (2) =

∞ ∑

(−1)n

n=1

an (ln n)k , n2

§ 3.2. Dirichlet series hence F (s) =

67 ∞ ∑ ∞ ∑ (−1)k an (ln n)k · (s − 2)k . · 2 k! n n=1

k=0

The series is converging, moreover, (−1)k and (s−2)k for s ∈ (0, 2) have the same sign, so all terms in the series are nonnegative. Therefore the series is absolutely converging and we can change the order of summation. So we obtain F (s) =

∞ ∑ ∞ ∑ (−1)k n=1 k=0

an (ln n)k · (s − 2)k = k! n2 (∞ ) ∞ ∑ an ∑ (ln n)k (2 − s)k = n2 k! n=1 ·

k=0

// the inner sum is equal to exp((2 − s) ln n) = n2−s // = ∞ ∞ ∑ an 2−s ∑ an · n = , n2 ns n=1 n=1

and the theorem follows.

□

Now we can prove that for every χ ∈ Gm \ {χe } we have L(1, χ) ̸= 0. Corollary 3.13. If χ ∈ Gm \ {χe } then L(1, χ) ̸= 0. Proof. Assume that L(1, χ) = 0. Then L(z) = L(z, Gm ) is analytic in the semiplane Re z > 0. Moreover, ∞ ∑ an L(z) = , nz n=1 where an ≥ 0 and further an ≥ 1 for n = k φ(m) and gcd(k, m) = 1. Every series L(z, χ) is absolutely converging in the semiplane Re z > 1, so L(z) is absolutely converging for Re z > 1. Therefore the series L(z) can be diﬀerentiated term by term any number of times. By the Landau theorem the series ∞ ∑ an ns n=1 is converging for 0 < s < 2. For n = (km + 1)φ(m) we have an ≥ 1, so N

[ m −1] N ∑ ∑ an 1 1 ≥ //for s = // ≥ s n φ(m) km +1 n=1 k=1

→ ∞,

N →∞

§ 3.2. Dirichlet series

68

a contradiciton.

□

3. Proof of the Dirichlet Theorem. First we recall the statement of the theorem. Theorem 3.14. (Dirichlet Theorem) Assume that a, m are natural coprime numbers. Then there exist inﬁnitely many primes of the form a + km, or, equivalently, there exist inﬁnitely many primes p such that p ≡ a (mod m). Proof. Consider the series L(z, χ) =

∞ ∑ χ(n) , nz n=1

where χ ∈ Gm and Re z > 1. First we prove that ∞ ∑ 1 χ(n) · μ(n) , = L(z, χ) n=1 nz

(3.7)

∑∞ where μ(n) is the M¨ obius function. Indeed, the series n=1 χ(n)·μ(n) and nz ∑∞ χ(n) n=1 nz are absolutely converging, therefore by Lemma 2.10 we have (∞ ) (∞ ) ∞ ∑ ∑ χ(n) ∑ χ(n) · μ(n) (χ ◦ (χ · μ))(n) · = . z z n n nz n=1 n=1 n=1 Now χ ◦ (χ · μ) = (χ · I) ◦ (χ · μ) = χ · (I ◦ μ), and the M¨obius inversion formulae (Lemma 2.11) implies that I ◦ μ = e, so χ · (I ◦ μ) = χ · e = e, so we get (3.7). For Re z > 1 we have ∞ ∑ χ(n) ln n L (z, χ) = − . nz n=1 ′

Thus, L′ (z, χ) = L(z, χ) −

∞ ∑ χ(n) · μ(n) nz n=1

(

∞ ∑ χ(n) ln n − nz n=1

) ( ·

) =

∞ ∞ ∑ ∑ ((χ · ln) ◦ (χ · μ))(n) (χ · (ln ◦μ))(n) = − = z n nz n=1 n=1

//Lemmas 2.11 and 2.14// = −

∞ ∑ (χ · Λ)(n) , nz n=1

§ 3.2. Dirichlet series

69

where Λ(n) is the von Mangoldt function, i.e. we obtain the identity ∞ ∑ L′ (z, χ) (χ · Λ)(n) . =− L(z, χ) nz n=1

By deﬁnition we have χ(pk ) = χ(p)k and χ(p) ̸= 0 if and only if gcd(p, m) = 1. Therefore ∞

−

L′ (z, χ) ∑ ∑ χ(p)k · ln p = = L(z, χ) pkz p∈ℙ k=1

//the series is absolutely converging for Re z > 1, so we can change the order of summation// = ∞ ∑ ∑ χ(p)k · ln p

pkz

k=1 p∈ℙ

∑ χ(p) · ln p

=

p∈ℙ

pz

+

∞ ∑ ∑ χ(p)k · ln p k=2 p∈ℙ

pkz

.

Denote the second summand by R(z, χ). We show that R(z, χ) is absolutely converging for Re z ≥ 12 + ε for every ε > 0. Indeed, we need to show that the series )k ∞ ( ∑ ∑ χ(p) ln p pz p∈ℙ

k=2

is absolutely converging, since in this case we can change the order of sumk ∑∞ ∑ mation and derive that R(z, χ) = k=2 p∈ℙ χ(p)pkz·ln p is absolutely converging as well. Now we have ∑ p∈ℙ

Since ∣1 −

χ(p) pz ∣

ln p

)k ∞ ( ∑ χ(p) k=2

≥ ∣1 −

pz

1 ∣, p1/2

=

∑ p∈ℙ

ln p ·

1 χ(p)2 · . 2z p 1 − χ(p) pz

we have

| | | | 1 ln p 1 χ(p)2 4 ln p | | ≤ 1+2ε , | ≤ 1+2ε · |ln p · 2z · 1 χ(p) | √ | p p p ∣1 − ∣ 1 − pz p and thus we obtain that the series R(z, χ) is absolutely converging. By the condition of the theorem gcd(a, m) = 1, so we can choose b so that

§ 3.2. Dirichlet series

70

b · a ≡ 1 (mod m). Then we have ( ) ∑ L′ (z, χ) χ(b) · − = L(z, χ) χ∈Gm

∑ ∑ ∑ χ(b) · χ(p) · ln p + χ(b) · R(z, χ) = pz χ∈Gm χ∈Gm p∈ℙ ⎛ ⎞ ∑ ∑ ∑ ln p ·⎝ χ(b) · χ(p)⎠ + χ(b) · R(z, χ) = z p χ∈Gm χ∈Gm p∈ℙ ∑ χ(bp) equals φ(m) //recall that χ∈Gm

if bp ≡ 1 (mod m) and 0 otherwise // = ∑ ∑ φ(m) · ln p χ(b) · R(z, χ). + z p χ∈Gm

p∈ℙ,p≡a (mod m)

1 2.

The second summand is analytic for Re z > If the theorem is false, then the ﬁrst summand is ﬁnite and therefore it has a precise value for z = 1. On the other hand, ( ′ ) ( ′ ) ∑ ∑ L (z, χ) L′ (z, χe ) L (z, χ) χ(b) · = + χ(b) · . L(z, χ) L(z, χe ) L(z, χ) χ∈Gm

χ∈Gm \{χe }

By Theorem 3.10 and Corollary 3.13 we obtain that the second summand is bounded for z = 1, while for the ﬁrst summand we have L′ (z, χe ) = ln(L(z, χe ))′ , L(z, χe ) and ln(L(z, χe ))′ has a pole at z = 1 since L(z, χe ) has a pole at z = 1. □

CHAPTER 4

p-adic numbers § 4.1. Valuation ﬁelds 1. Basic properties. Let F be a ﬁeld and v : F → ℝ a map from F to the ﬁeld of real numbers. Then (F, v) is called a valuation ﬁeld, while v is called a valuation of F , if (1) For every x ∈ F we have v(x) ≥ 0 and v(x) = 0 if and only if x = 0. (2) v(x + y) ≤ v(x) + v(y) (triangle inequality). (3) v(x · y) = v(x) · v(y). We collect evident properties of a valuation in the next proposition. Proposition 4.1. Let (F, v) be a valuation ﬁeld. Then v(1) = 1, v(−1) = 1, and, for every x ∈ F ∗ and every k ∈ ℤ we have v(xk ) = (v(x))k . Proof. Since v(x) = v(x · 1) = v(x) · v(1) we obtain that v(1) = 1. Now v(−1)2 = v((−1)2 ) = 1 and v(−1) > 0, whence v(−1) = 1. We also 1 . The remaining have 1 = v(1) = v(x · x−1 ) = v(x) · v(x−1 ), so v(x−1 ) = v(x) statement follows immediately. □ If F is the ﬁeld of rationals, then we can deﬁne the following valuations: { 0, if x = 0, (1) v(x) = the trivial valuation (it can be deﬁned 1, otherwise; over arbitrary ﬁeld). (2) vα (x) = ∣x∣α for 0 < α ≤ 1. (3) vp,ρ (x) = ρνp (x) , where 0 < ρ < 1, p is a prime, and νp (x) ∈ ℤ is given by the identity x = pνp (x) · ab , where p does not divide a · b, and vp,ρ (0) := 0; the p-adic valuation. Exercise 4.2. Check, that all deﬁned above valuations satisfy to the deﬁnition. Can α be greater than 1 in item (2)? Can α be less, than 0 in item (2)? 71

§ 4.1. Valuation ﬁelds

72

Clearly, every valuation deﬁnes a topology on F . Namely, we can deﬁne an open sphere of radius ε ∈ ℝ>0 centered at a ∈ F by Bε (a) = {x ∈ F ∣ v(a − x) < ε},

(4.1)

and consider the family of all such spheres as a basis of a topology. It is clear that F with the topology is a Hausdorﬀ space. Moreover, the operations “+”, “·” (considered as maps F × F → F ) and “−”, −1 (considered as maps F → F ) are continuous. In particular, F is a topological ﬁeld. Exercise 4.3. Prove that all operations are continuous maps. Prove that induced topology is a Hausdorﬀ space. Let (F, v) be a valuation ﬁeld, a ∈ F . A sequence {an }n≥1 is said to converge to a (under the valuation v, we use notation an → a), if (v)

lim v(an − a) = 0.

n→∞

The following basic properties of limits hold. Proposition 4.4. The following identities hold: lim (an ± bn ) = lim an ± lim bn ;

n→∞

n→∞

n→∞

lim (an · bn ) = lim an · lim bn ; n→∞ ( n→∞ )−1 n→∞ −1 , lim an = lim an n→∞

n→∞

where lim an and lim bn are assumed to exist and, in the last identity all n→∞ n→∞ an -s and lim an are assumed to be not equal to 0. n→∞

Exercise 4.5. Prove Proposition 4.4. Let F be a ﬁeld and v1 , v2 be its valuations. The valuations v1 , v2 are called equivalent (v1 ∼ v2 ), if, for every sequence {an }n≥1 , we have {an }n≥1 → a ⇐⇒ {an }n≥1 → a. (v1 )

(v2 )

Lemma 4.6. Valuations v1 , v2 of F are equivalent if and only if for every x ∈ F we have v1 (x) < 1 ⇔ v2 (x) < 1. Proof. Assume that v1 and v2 are equivalent. Then for every x ∈ F the inequality v1 (x) < 1 is equivalent to xn → 0. Since v1 and v2 are (v1 )

equivalent, it follows that xn → 0, so v2 (x) < 1. (v2 )

§ 4.1. Valuation ﬁelds

73

Now we prove the converse statement. If v1 is trivial, then v2 , clearly, is also trivial and the claim is evident. Assume that v1 is nontrivial. Then there exists x ∈ F such that v1 (x) ̸= 1. By Proposition 4.1 we obtain that either v1 (x) > 1, or v1 (x−1 ) > 1. Without loss of generality we may assume that v1 (x) > 1 (and so v2 (x) > 1). Choose a sequence {an }n≥1 such that an → a. Therefore, for every m ∈ ℕ, we have xm · an → xm · a, i.e. (v1 )

(v1 )

v1 (xm an − xm a) → 0. Now denote v2 (x) by α (recall that v2 (x) > 1). n→∞ Then for every ε > 0 there exists M ∈ ℕ such that for every m > M we have α1m < ε. Since v1 (xm an − xm a) → 0, it follows that there exists N ∈ ℕ such n→∞

that for every n > N we have v1 (xm an − xm a) < 1. The condition of the lemma implies that for every n > N the inequality v2 (xm an − xm a) < 1 holds. Thus we obtain that for every n > N the inequality v2 (x)m · v2 (an − a) < 1 holds. Hence, for every n > max{M, N } we have v2 (an − a) < ε, i.e. an → a. The implication an → a ⇒ an → a follows by the symmetry. □ (v2 )

(v2 )

(v1 )

Corollary 4.7. Every valuation vα of ℚ, where 0 < α ≤ 1, is equivalent to v1 (x) = ∣x∣. If p is a ﬁxed prime, then for every 0 < ρ < 1 all valuations vp,ρ are equivalent, the corresponding equivalence class is denoted by vp . If p, q are distinct primes, then vp,ρ and vq,ρ are not equivalent. Moreover, vα and vp,ρ are not equivalent. Exercise 4.8. Prove Corollary 4.7 The topology induced on ℚ by a p-adic valuation is called a p-adic topology. The p-adic valuation of a number grows with the power of p in denominator. For example, if an = pn , then an → 0. Valuation n→∞

ﬁelds (F1 , v1 ) and (F2 , v2 ) are topologically isomorphic, if there exists an isomorphism of ﬁelds φ : F1 → F2 such that for every sequence {an }n≥1 , an → a if and only if φ(an ) → φ(a). In particular, if F1 = F2 = F , (v1 )

(v2 )

then v1 ∼ v2 if and only if (F, v1 ) is topologically isomorphic to (F, v2 ) with isomorphism Id, where Id is the identical map. Exercise 4.9. Prove that (ℚ, vp ) is not topologically isomorphic to (ℚ, vq ) for distinct primes p, q. We say that a valuation ﬁeld (F1 , v1 ) can be embedded into a valuation ﬁeld (F2 , v2 ), if there exists an injective homomorphism φ : F1 → F2 such that the restriction of v2 on φ(F1 ) coincides with v1 .

§ 4.1. Valuation ﬁelds

74

2. Valuations over rationals. Theorem 4.10. (On valuations over rationals) Let (ℚ, v) be a valuation ﬁeld. Then either v is the trivial valuation, or v = vα , or v = vp,ρ . Proof. If v is the trivial valuation, we have nothing to prove. Assume that v is not trivial. Then there exists n ∈ ℕ such that v(n) ̸= 1. Indeed, since v is nontrivial, there exists pq ∈ ℚ such that v( pq ) ̸= 1. Hence either v(p) ̸= 1, or v(q) = v( 1q )−1 ̸= 1. Now p, q ∈ ℤ, hence for some p ∈ ℤ, we obtain v(p) ̸= 1. By Proposition 4.1 we have v(p) = v(−p), so there exists n ∈ ℕ such that v(n) ̸= 1. Now one of the following two cases holds: either there exists n ∈ ℕ such that v(n) > 1, or for every n ∈ ℕ the inequality v(n) ≤ 1 holds. Consider these cases separately. Case 1. There exists n such that v(n) > 1. We have v(n) = v(1 + . . . + 1) ≤ 1 + . . . + 1 = n, ◟ ◝◜ ◞ ◟ ◝◜ ◞ n times

n times

α

so there exists α ∈ (0, 1] such that v(n) = n . By Proposition 4.1 we obtain that v(nk ) = nkα for every k ∈ ℤ. Assume that m ∈ ℕ. Then there exists k ≥ 0 such that nk ≤ m < nk+1 . Consider the expansion of m in base n, we obtain: m = a0 + a1 n + . . . + ak nk , where 0 ≤ ai ≤ n − 1 for i = 0, 1, . . . , k; and ak ̸= 0. Now we have v(m) ≤ v(a0 ) + v(a1 ) · nα + . . . + v(ak ) · nkα ≤ //for every i, v(ai ) ≤ ai ≤ n − 1// ≤ (n − 1)(1 + nα + . . . + nkα ) ≤ [ ] n − 1 (k+1)α n−1 α (n − 1) < n · nkα ≤ C · mα . nα − 1 nα − 1 Thus there exists a constant C such that for every m ∈ ℕ we have v(m) < C · mα . We state that for every m we have v(m) ≤ mα . Indeed, assume that there exists k ∈ ℕ such that v(k) > k α , i.e. D > 1. Then for every s ∈ ℕ we have

(4.2) v(k) kα

=

v(k s ) = Ds → ∞. s→∞ k sα On the other hand, m ∈ ℕ.

v(ks ) ksα

< C, a contradiction. Thus (4.2) holds for every

§ 4.1. Valuation ﬁelds

75

Now nk+1 = m + m1 , where 0 < m1 ≤ nk (n − 1). Therefore v(nk+1 ) ≤ v(m) + v(m1 ), so v(m) ≥ n(k+1)α − v(m1 ) ≥ n(k+1)α − mα 1 ≥ )α ) ( ( n−1 kα α α (k+1)α n (n − (n − 1) ) = n > C1 · mα . 1− n It follows that there exists a constant C1 > 0 such that for every m ∈ ℕ we have v(m) > C1 · mα . We state that v(m) ≥ mα .

(4.3)

Indeed, assume that there exists k ∈ ℕ such that v(k) < k α , i.e. D1 < 1. So for every s ∈ ℕ we have

v(k) kα

=

v(k s ) = D1s → 0. s→∞ k sα s

) On the over hand, v(k ksα ≥ C1 , a contradiction. Combining inequalities (4.2) and (4.3) we obtain that for every m ∈ ℕ the identity v(m) = mα holds. −1 −1 By Proposition 4.1 it follows that (v(−m) ) |=|αv(m) and v(m ) = (v(m)) , | | so for every pq ∈ ℚ, the identity v pq = | pq | holds.

Case 2. For every n ∈ ℕ we have v(n) ≤ 1. Choose n so that v(n) ̸= 1 (hence v(n) < 1). Consider the decomposition of n into the product of αk α1 1 primes n = pα · . . . · v(pk )αk , so there 1 · . . . · pk . Then v(n) = v(p1 ) exists a prime p with v(p) = ρ < 1. We show ﬁrst that for every prime q ̸= p the identity v(q) = 1 holds. Otherwise there would exist q ̸= p with v(q) = μ < 1. Since both ρ and μ are less than 1, there exists k ∈ ℕ such that both ρk , μk are less than 21 . On the other hand, gcd(pk , q k ) = 1, so there exist a, b ∈ ℤ such that a · pk + b · q k = 1. Thus we obtain 1 = v(1) = v(a · pk + b · q k ) ≤ v(a) · v(pk ) + v(b) · v(q k ) ≤ 1 1 + = 1, 2 2 · m1 , n = pα2 · n1 ,

1 · ρk + 1 · μk < α1 a contradiction. Now for every m n ∈ ℚ we have m = p where gcd(m1 · n1 , p) = 1, therefore (m) m v(m1 ) v = ρα1 −α2 = ρνp ( n ) = ρα1 −α2 · n v(n1 )

and the theorem follows.

□

§ 4.1. Valuation ﬁelds

76

3. The replenishment of a valuation ﬁeld. A classical result claim that every topological space admits a replenishment. Now we construct a replenishment for a valuation ﬁeld and prove that the replenishment is unique (up to a topological isomorphism). Let (F, v) be a valuation ﬁeld. A sequence {an }n≥1 , where an ∈ F is called fundamental, if for every ε > 0 there exists N such that for every n, m > N we have v(an − am ) < ε. Lemma 4.11. The following hold: (1) Assume that v1 and v2 are equivalent valuations of a ﬁeld F . A sequence {an }n≥1 is fundamental in v1 if and only if it is fundamental in v2 . (2) If {an }n≥1 is a fundamental sequence in (F, v), then {v(an )}n≥1 is a fundamental sequence in ℝ under valuation v(x) = ∣x∣. Proof. We prove item (1) ﬁrst. If v1 is the trivial valuation, then v2 is also trivial and we have nothing to prove. If v1 is nontrivial, then there exists x ∈ F such that v1 (x) = α > 1. Then for every ε > 0 there exists m ∈ ℕ such that α−m < ε. If the sequence {an }n≥1 is fundamental in v1 , then {xm · an }n≥1 is also fundamental in v1 , so there exists N ∈ ℕ such that for every n, k > N we have v1 (xm an − xm ak ) < 1. Whence v2 (xm an − xm ak ) < 1, and so v2 (an − ak ) < α−m < ε. Now we turn to (2). Since v(an ) = v(an −am +am ) ≤ v(am )+v(an −am ), and, by symmetry, v(am ) ≤ v(an ) + v(an − am ), it follows that ∣v(an ) − v(am )∣ ≤ v(an − am ). □ A valuation ﬁeld (F, v) is called complete if every fundamental sequence in G is converging, i.e. it has the limit in F . If a valuation ﬁeld is embedded into a complete valuation ﬁeld (F , v¯) so that F is dense in F then (F , v¯) is called a replenishment of (F, v). Theorem 4.12. For every valuation ﬁeld (F, v) there exists a unique up to isomorphism replenishment (F , v¯). Proof. We construct a replenishment using the standard construction for the real numbers. Let ℱ = {{an }n≥1 ∣ {an }n≥1 is a fundamental sequence of (F, v)} be a set of all fundamental sequences consisting of elements from F . Deﬁne the addition and multiplication on ℱ by {an }n≥1 + {bn }n≥1 = {an + bn }n≥1 , {an }n≥1 · {bn }n≥1 = {an · bn }n≥1 ,

§ 4.1. Valuation ﬁelds

77

Clearly both {an + bn }n≥1 and {an · bn }n≥1 are fundamental sequence. We prove, for example, that {an ·bn }n≥1 is fundamental provided both {an }n≥1 , {bn }n≥1 are fundamental. We have v(an · bn − am · bm ) = v(an · bn − am · bn + am · bn − am · bm ) ≤ v(an · bn − am · bn ) + v(am · bn − am · bm ) = v(an − am ) · v(bn ) + v(bn − bm )v(am ). Since {an }n≥1 , {bn }n≥1 are fundamental sequences, Lemma 4.11(2) implies that {v(an )}n≥1 , {v(bn )}n≥1 are fundamental sequences as well. Therefore the sequences {v(an )}n≥1 and {v(bn )}n≥1 are uniformly bounded by an absolute constant C. Since for every ε there exists N such that for every ε ε and v(bn − bm ) < 2C . So m, n > N we have v(an − am ) < 2C ε ε v(an − am ) · v(bn ) + v(bn − bm )v(am ) < + = ε, 2 2 hence {an · bn }n≥1 is fundamental. Thus ℱ is a commutative ring with 1. Consider I0 = {{an }n≥1 ∣ an → 0}. n→∞

Clearly I0 is an ideal of ℱ. We show that ℱ/I0 is a ﬁeld. Since ℱ is a commutative ring with 1, we remain to prove that each nonzero element of ℱ/I0 has inverse. Consider {an }n≥1 +I0 ̸= 0+I0 . Since {an }n≥1 ∈ ℱ \I0 , it follows that an ̸→ 0, i.e. there exists ε > 0 and N ∈ ℕ such that for every n→∞

n ≥ N we have v(an ) > ε. Consider the sequence { a1n }n≥N . The sequence is well-deﬁned, since an ̸= 0 for n ≥ N . Moreover, for every n, m ≥ N we have ( ) 1 v(am − an ) 1 v(am − an ) v ≤ , − = an am v(an · am ) ε2 so the sequence { a1n }n≥N is fundamental, i.e. { a1n }n≥N ∈ ℱ. Deﬁne the sequence {bn }n≥1 by: bn = 1 for n < N and bn = a1n for n ≥ N . Then {bn }n≥1 ∈ ℱ and an · bn − 1 = 0 for all n > N . Hence, ({an }n≥1 + I0 ) · ({bn }n≥1 + I0 ) = {an · bn }n≥1 + I0 = {1}n≥1 + I0 , i.e. {bn }n≥1 + I0 = ({an }n≥1 + I0 )−1 . Therefore F = ℱ/I0 is a ﬁeld. The embedding F → F is deﬁned by a ⊦→ {an }n≥1 + I0 , where an = a for all n ∈ ℕ (we use the notation {a}n≥1 below). Deﬁne a valuation on ℱ in the following way: if α = {an }n≥1 + I0 , then v¯(α) = lim v(an ). n→∞

§ 4.1. Valuation ﬁelds

78

Clearly, v¯ does not depend on the representative of a coset {an }n≥1 + I0 , since the identity lim (v(an ) + v(bn )) = lim v(an ) + lim v(bn )

n→∞

n→∞

n→∞

holds. It is also evident that the restriction of v¯ on F coincides with v. Now we check the properties of a valuation. (1) v¯({an }n≥1 + I0 ) ≥ 0 and v¯({an }n≥1 + I0 ) = 0 if and only if {an }n≥1 +I0 = I0 , i.e. an → 0. Indeed, since for each n we have n→∞

v(an ) ≥ 0, it follows that limn→∞ v(an ) = v¯({an }n≥1 + I0 ) ≥ 0. If v¯({an }n≥1 + I0 ) = 0, then limn→∞ v(an ) = 0, so {an }n≥1 ∈ I0 . (2) v¯({an }n≥1 + I0 + {bn }n≥1 + I0 ) = v¯({an + bn }n≥1 + I0 ) = lim v(an + bn ) ≤

n→∞

lim (v(an ) + v(bn )) =

n→∞

lim v(an ) + lim v(bn ) =

n→∞

n→∞

v¯({an }n≥1 + I0 ) + v¯({bn }n≥1 + I0 ). (3) v¯({an }n≥1 + I0 ) · v¯({bn }n≥1 + I0 ) = v¯({an · bn }n≥1 + I0 ) = lim v(an · bn ) =

n→∞

lim v(an ) · v(bn ) =

n→∞

lim v(an ) · lim v(bn ) =

n→∞

n→∞

v¯({an }n≥1 + I0 ) · v¯({bn }n≥1 + I0 ). So v¯ is a valuation. Now we show that F is dense in F . If α = {an }n≥1 is a fundamental sequence, then the sequence {αn }n≥1 , where αn = {an }k≥1 ∈ F (recall that {an }k≥1 is a sequence such that all its members are equal to an ), is clearly converging to α. We remain to show that F is complete. Assume that {αn }n≥1 is a fundamental sequence in F . Since F is dense in F , for every n we can choose an ∈ F such that v¯(αn − {an }k≥1 ) < n1 . Let α = {an }n≥1 .

§ 4.2. Construction and properties of p-adic ﬁelds

79

We claim that α ∈ ℱ (so α + I0 ∈ F ) and αn → α. Indeed, for every ε (¯ v)

there exists M such that v¯(αn − αm ) < 3ε for all m, n > M . Further, there exists N ≥ M such that N1 < 3ε . So for n, m ≥ N we have v(an − am ) = v¯({an }k≥1 − αn + αn − αm + αm − {am }k≥1 ) ≤ v¯({an }k≥1 − αn ) + v¯(αn − αm ) + v¯(αm − {am }k≥1 ) < ε, so α is a fundamental sequence. Now, by construction, we have v¯(αn − α) ≤ v¯(αn − {an }k≥1 ) + v¯({an }k≥1 − α) ≤ 1 + lim v(an − am ). n m→∞ Since for every ε > 0 there exists N such that N1 < 2ε and for every n, m ≥ N , v(an − am ) < 2ε , we obtain that for every n ≥ N the inequality v¯(αn − α) < ε holds. Thus αn → α. This completes the proof of existence. (¯ v)

Now we prove the uniqueness. Let (F1 , v1 ) and (F2 , v2 ) be two replenishments of (F, v). We want to construct a topological isomorphism σ. Deﬁne σ in F as the identity map. By deﬁnition, F is dense in both F1 and F2 , so every element of F1 is a limit of a fundamental sequence {an }n≥1 , where each an lies in F . We extend σ to the map σ : F1 → F2 by assuming that for α ∈ F1 such that α = limn→∞ an that σ(α) = limn→∞ σ(an ). Clearly this deﬁnition is correct, since {σ(an )}n≥1 is a fundamental sequence in F2 and since for two fundamental sequences {an }n≥1 , {bn }n≥1 with limn→∞ an = α = limn→∞ bn the sequence {cn }n≥1 , where c2n−1 = an , c2n = bn , is fundamental and limn→∞ cn = α. It is technical to prove that σ is a bijection and that σ preserves the operation, and we leave the detailed proof to the reader. □ Exercise 4.13. Complete the technical details of the proof of Theorem 4.12. § 4.2. Construction and properties of p-adic ﬁelds Clearly, if valuations v1 and v2 of a ﬁeld F are equivalent, then the replenishments of (F, v1 ) and (F, v2 ) are topologically isomorphic. Recall that we denote the class of equivalent valuations vp,ρ of ℚ by vp . A replenishment of (ℚ, vp ) is denoted by ℚp and is called a ﬁeld of p-adic numbers or a p-adic ﬁeld. Theorem 4.12 implies that ℚp exists and is unique up to

§ 4.2. Construction and properties of p-adic ﬁelds

80

isomorphism. In this section we construct ℚp explicitly and derive some its properties. 1. Ring of p-adic integers and its properties. Throughout we ﬁx a prime p. Consider { } 𝒵p = {an }n≥0 ∣ an ∈ ℤ, an ≡ an−1 (mod pn ) for n ≥ 1 . Deﬁne addition and multiplication on 𝒵p as the addition and the multiplication of sequences, i.e. {an }n≥0 + {bn }n≥0 = {an + bn }n≥0 and {an }n≥0 · {bn }n≥0 = {an · bn }n≥0 . Clearly 𝒵p is a ring under these operations. Consider { } ℐp = {an }n≥0 ∈ 𝒵p ∣ an ≡ 0 (mod pn+1 ) . It is also evident that ℐp is an ideal of 𝒵p . Deﬁne ℤp := 𝒵p /ℐp , a ring of p-adic integers. The embedding ℤ → ℤp is deﬁned by m ⊦→ {an }n≥0 , where a0 = a1 = . . . = m. Clearly this embedding preserves the addition and multiplication. Moreover, the unit of ℤ coincides with the unit of ℤp . Now we choose a canonical representation for every element from ℤp . Namely, for every sequence {an }n≥0 ∈ 𝒵p there exists a unique sequence {xn }n≥0 ∈ 𝒵p such that for every n we have 0 ≤ xn < pn+1 and an ≡ xn (mod pn+1 ), in particular {an }n≥0 + ℐp = {xn }n≥0 + ℐp . The sequence {xn }n≥0 with 0 ≤ xn < pn+1 is a canonical representative of the coset {an }n≥0 + ℐp . Theorem 4.14. Let ℤp be the ring of p-adic integers. Then the following hold. (1) An element {xn }n≥0 ∈ ℤp is invertible in ℤp if and only if x0 ̸= 0. ∗ (2) For every 0 ̸= α ∈ ℤp there exist the unique n ∈ ℕ and ξ ∈ ℤp n such that α = p · ξ. (3) ℤp is an integral domain. Proof. Recall that if m ∈ ℤ, then we identify m with its image (m, m, . . .) ∈ ℤp . In particular, 1 = (1, 1, . . .). (1) Necessity. If x0 = 0, then for every n ∈ ℤ we have x0 · n = 0, ∗ therefore {xn }n≥0 ̸∈ ℤp . Suﬃciency. Assume that x0 ̸= 0. Since 0 < x0 < p, it follows that gcd(x0 , p) = 1. Now, by deﬁnition, x1 ≡ x0 (mod p), therefore gcd(x1 , p) = 1. Repeating the argument, by induction, we obtain that gcd(xn , p) = 1 for every n. Therefore, for every n we have gcd(xn , pn+1 ) = 1. Hence, for every n ≥ 0 there exists 0 < yn < pn+1 such that xn · yn ≡ 1 (mod pn+1 ). Since xn ≡ xn−1 (mod pn ) and both xn−1 · yn−1 and xn · yn are equivalent

§ 4.2. Construction and properties of p-adic ﬁelds

81

1 modulo pn , it follows that yn ≡ yn−1 (mod pn ), i.e. {yn }n≥0 ∈ 𝒵p . Since xn · yn ≡ 1 (mod pn+1 ) it follows that there exists zn such that xn · yn = 1 + zn · pn+1 . Therefore {xn }n≥0 · {yn }n≥0 = {1}n≥0 + {zn · pn+1 }n≥0 and the second summand in the right hand side lies in ℐp . (2) If α = {xn }n≥0 ∈ ℤp \ {0}, then there exists a minimal n such that xn ̸= 0. If n = 0 then α = p0 ξ and ξ = α. Suppose n > 0, then, by deﬁnition, xn ≡ xn−1 (mod pn ) and in view of our choice xn−1 = 0, so xn is divisible by pn and xn ̸≡ 0 (mod pn+1 ), in particular, xpnn ̸≡ 0 (mod p) and gcd(xn /pn , p) = 1. Now xn+1 ≡ xn (mod pn+1 ), so xn+1 /pn ̸≡ 0 (mod p). Arguing in the same way by induction for every k ≥ n we obtain that xk /pn ̸≡ 0 (mod p). Moreover, since 0 ≤ xk < pk+1 , we have 0 < xk /pn < pk−n+1 . Consider ) ( xn xn+1 xn+2 xn xn , , . . . , , , , . . . (ﬁrst n + 1 terms are equal). ξ= pn pn pn pn pn In view of item (1) of the theorem, ξ is invertible in ℤp and, by construction, α = pn · ξ. Now we show the uniqueness of n and ξ. If α = pn · ξ = pm · ζ, then the ﬁrst nonzero term of pn · ξ has number n, while the ﬁrst nonzero term of pm · ζ has number m, so n = m. Let ξ = {xn }n≥0 and ζ = {yn }n≥0 . Since pn · ξ = pn · ζ, we obtain that xm = ym for m ≥ n. Moreover xn−1 = (xn

mod pn ) = (yn

mod pn ) = yn−1 .

Arguing in the same way we obtain that for all i < n the equality xi = yi holds (indeed, if xi = yi , then xi−1 = (xi mod pi ) = (yi mod pi ) = yi−1 ). Thus ξ = ζ. (3) Assume that α, β ∈ ℤp \ {0} and α · β = 0. In view of item (2) of the theorem, α = pn ·ξ, β = pm ·ζ, hence α·β = pn+m (ξ ·ζ) = 0. Multiplying the left hand side of the identity by ξ −1 ·ζ −1 we obtain pn ·pm = 0. But (n+m)th term of pn+m equals pn+m ̸≡ 0 (mod pn+m+1 ), a contradiction. □ Corollary 4.15. An integer a ∈ ℤ is invertible in ℤp if and only if gcd(a, p) = 1. 2. The ﬁeld of p-adic rationals is the replenishment of rationals in p-adic metric. Corollary 4.15 implies that rational numbers of the form a b with gcd(b, p) = 1 are naturally embedded into ℤp . Since ℤp is an integral domain, it possesses a ﬁeld of fractions, denote it by ℚp . We introduce a valuation vp on ℚp and show that ℚp is a replenishment of ℚ under a p-adic valuation, i.e. we show that ℚp is a ﬁeld of p-adic numbers.

§ 4.2. Construction and properties of p-adic ﬁelds

82

In view of Theorem 4.14 all elements of ℤp have form pn · ξ, so we need to add the inverses for powers of p, i.e. ∗

ℚp = {pn · ξ ∣ n ∈ ℤ, ξ ∈ ℤp } ∪ {0}.

(4.4)

∗ ℚp

Given α = pn · ξ ∈ set νp (α) = n, and let νp (0) = ∞. For brevity we use ν instead of νp below. Choose 0 < ρ < 1 and set vp (α) = ρνp (α) . Every ab ∈ ℚ can be uniquely written as pn ab11 , where p does not divide ∗

a1 · b1 . Then both a1 and b1 belong to ℤp , and thus ab11 = a1 b−1 1 = ξ is an invertible p-adic integer. Therefore, ab corresponds to pn ξ ∈ ℚp and this correspondence gives the canonical embedding of ℚ into ℚp . Theorem 4.16. In the above notations for every α, β ∈ ℚp the following hold. (1) ν(α · β) = ν(α) · ν(β). (2) ν(α+β) ≥ min(ν(α), ν(β)), moreover for ν(α) ̸= ν(β) the equality holds. (3) vp is a valuation of ℚp and the restriction of vp on ℚ coincides with the valuation vp,ρ of ℚ. Proof. (1), (2). If α = pn · ξ, β = pm · ζ, then α · β = pn+m · (ξ · ζ), therefore ν(α · β) = ν(α) + ν(β). Without loss of generality assume that n ≥ m. Then we have α + β = pn · ξ + pm · ζ = pm (pn−m · ξ + ζ), and pn−m · ξ + ζ lies in ℤp , so ν(α + β) ≥ m = min(ν(α), ν(β)). If n > m, then the ﬁrst term of pn−m · ξ + ζ equals the ﬁrst term of ζ, so is not equal to 0, therefore pn−m · ξ + ζ is invertible in ℤp and ν(α + β) = m. (3) First we check that vp is a valuation. We have vp (α + β) ≤ ρmin(ν(α),ν(β)) = max(vp (α), vp (β)) ≤ vp (α) + vp (β). vp (α · β) = ρν(α)+ν(β) = vp (α) · vp (β). vp (α) ≥ 0 and vp (α) = 0 if and only if α = 0. So vp is a valuation. The restriction of vp on ℚ coincides with vp,ρ and we leave the details of the proof for the reader. □ Exercise 4.17. Prove that the restriction of vp on ℚ coincides with vp,ρ . Theorem 4.18. The valuation ﬁeld (ℚp , vp ) is a replenishment of (ℚ, vp,ρ ).

§ 4.2. Construction and properties of p-adic ﬁelds

83

Proof. We need to prove that ℚp is complete and that ℚ is dense in ℚp . Let {αn }n≥1 be a fundamental sequence in ℚp and every αn has the ∗ form pmn ·ξn , where mn ∈ ℤ and ξn ∈ ℤp ∪{0}. Since the sequence {αn }n≥1 is fundamental, it follows that vp (αn − αm ) → 0. Now vp (αn − αm ) = n,m→∞

ρν(αn −αm ) . If the sequence {mn }n≥ is not stabilizing, then for every n there exists k > n such that mn ̸= mk . In view of Theorem 4.16(2) we obtain ρν(αn −αm ) = ρmin(mn ,mk )

→

n,k→∞

0.

Therefore mn → ∞, i.e. αn → 0. Assume that the sequence {mn }n≥1 n→∞

(vp )

is stabilizing, i.e. there exist N ∈ ℕ, M ∈ ℤ such that for every n ≥ N we have mn = M . Then for every n, m ≥ N we have vp (αn − αm ) = vp (pM (ξn − ξm )) = vp (pM ) · vp (ξn − ξm ), i.e. the sequence {ξn }n≥1 is fundamental. Hence for every m ∈ ℕ there exists Nm ∈ ℕ such that for all n1 , n2 > Nm we have vp (ξn1 − ξn2 ) < ρm . (n) So ξn1 − ξn2 = pm+1 · ξ, where ξ ∈ ℤp . Let ξn = {xk }k≥0 , consider (N ) ξ = {xk }k≥0 , where xm := xm m . We claim that ξ ∈ ℤp and that ξn → ξ. n→∞

Show that xm−1 ≡ xm (mod pm ) (and so ξ ∈ ℤp ). Indeed, by deﬁnition, (Nm−1 ) xm−1 = xm−1 , i.e. for each k > Nm−1 we have v(ξNm−1 − ξk ) < ρm−1 , (N

)

m−1 therefore ξNm−1 − ξk = pm · ζ for some ζ ∈ ℤp . In particular, xm−1 − (k) m xm−1 ≡ 0 (mod p ). Choose k = Nm , then, since ξNm ∈ ℤp , we have

(N )

(N )

(N

)

(N )

m−1 m xm−1 ≡ xm m (mod pm ), whence xm−1 ≡ xm m (mod pm ). Now if k ≥ Nm , then vp (ξk − ξ) ≤ ρm , so ξn → ξ and αn → pM · ξ.

(vp )

(vp )

We remain to show that ℚ is dense in ℚp . Let α = pn · ξ, where n ∈ ℤ ∗ and ξ = (x0 , x1 , x2 , . . .) ∈ ℤp . Set αm = pn · xm ∈ ℚ. Then vp (αm − α) = ρn · vp (0, . . . , 0, xm − xm+1 , xm − xm+2 , . . .) ≤ ρn+m → 0, m→∞

and the theorem follows.

□

Exercise 4.19. Prove that for every α ∈ ℤp there exists a sequence {x(m) }m≥0 , x(m) ∈ ℤ, such that x(m) − α ∈ pm+1 ℤp (in particular, x(m) →

(vp )

α as m → ∞).

§ 4.2. Construction and properties of p-adic ﬁelds

84

3. Applications. The construction of p-adic numbers shows that they are closely related to residuals modulo powers of p. This connection becomes clear due to the following theorem. Theorem 4.20. Assume that F (x1 , . . . , xn ) ∈ ℤ[x1 , . . . , xn ] is a polynomial with integer coeﬃcients. The congruence F (x1 , . . . , xn ) ≡ 0 (mod pm )

(4.5)

has a solution in ℤ for every m ≥ 0 if and only if F (x1 , . . . , xn ) = 0

(4.6)

has a solution in ℤp . Proof. Assume that the equation (4.6) has a solution α1 , . . . , αn ∈ ℤp . Denote by Im the ideal pm · ℤp of ℤp . Then for every m there exist (m) (m) (m) x1 , . . . , xn ∈ ℤ such that αi + Im = xi + Im (this statement follows from Exercise 4.19). Therefore (m)

F (x1 , . . . , x(m) n ) + Im = F (α1 , . . . , αn ) + Im = 0. (m)

(m)

(m)

(m)

Now F (x1 , . . . , xn ) is an integer, so F (x1 , . . . , xn ) ∈ Im ∩ ℤ = (m) (m) pm · ℤ, i.e. x1 , . . . , xn is a solution for the congruence F (x1 , . . . , xn ) ≡ m 0 (mod p ). Now assume that the congruences (4.5) have solution for each m ≥ 0. (m) (m) Consider the sequence {(x1 , . . . , xn )}m≥0 of solutions for congruences (m ) (m ) (4.5). Notice that we may choose a subsequence {(x1 k , . . . , xn k )}k≥0 (mk ) such that xi is converging to αi ∈ ℤp for 1 ≤ i ≤ n. Indeed, con(m) (m) sider zero coordinates of {(x1 , . . . , xn )}m≥0 . We obtain the set of tuples of the form (a1 , . . . , an ), where 0 ≤ ai < p for each i = 1, . . . , n. Since (m) (m) {(x1 , . . . , xn )}m≥0 is inﬁnite, there exists an n-tuple (a1 , . . . , an ) such (m) that the zero coordinate of xi equals ai for inﬁnitely many m and they (m) (m) (m0 ) (m ) form a subsequence {(x1 , . . . , xn 0 )}m0 ≥0 of {(x1 , . . . , xn )}m≥0 . We (m ) (m ) repeat the arguments for the ﬁrst coordinates of {(x1 0 , . . . , xn 0 )}m0 ≥0 (m1 ) (m1 ) and derive an inﬁnite subsequence {(x1 , . . . , xn )}m1 ≥0 . We repeat the arguments for every k ≥ 0. Now we take any element from k-th subsequence (m) (m) (m ) (m ) and obtain the subsequence {(x1 k , . . . , xn k )}k≥0 of {(x1 , . . . , xn )}m≥0 satisfying the condition: for every i = 1, . . . , n the ﬁrst k coordinates of (m ) (m ) xi t are equal to the coordinates of xi k for every t ≥ k. Therefore (mt ) (ms ) (m ) vp (xi − xi ) < ρt → 0, i.e. for every i the sequence {xi k }k≥0 t,s→∞

§ 4.3. Problems

85 (m )

(m )

is fundamental. So the sequences {x1 k }k≥0 , . . . , {xn k }k≥0 have limits α1 , . . . , αn respectively and, by construction, each αi lies in ℤp . Again by construction F (α1 , . . . , αn ) lies in every ideal Im , so F (α1 , . . . , αn ) = 0. □ § 4.3. Problems 1. Prove that a valuation deﬁnes on a ﬁeld the structure of Hausdorﬀ space. Find a topological ﬁeld that is not homeomorphic to a valuation ﬁeld, i.e. there exist topological ﬁelds that do not possess a valuation inducing the same topology. 2. Prove that (ℤp , vp ) is a compact topological space. 3. How many distinct solutions in ℤ5 has the equation x2 + y 2 = 0?

Bibliography [1] Apostol T. V. Introduction to Analytic Number Theory. Springer, New York, 1976. [2] Bateman P. D., Diamond H. G. Analytic Number Theory – An Introductionary Course. World Scientiﬁc Publ., Singapore, 2002. [3] Bicadze A. V. Foundations of the Theory of Analytic Functions of a Complex Variable [Russian]. M.: Nauka, 1969. [4] Borevich Z. I., Shafarevich I. R. Number Theory [Russian]. M.: Nauka, 1985. [5] Buhshtab A. A. Number Theory [Russian]. M.: Prosveschenie, 1966. [6] Chandrasekharan K. Introduction to Analytic Number Theory. Springer, BerlinHeidelberg, 2012. [7] Galochkin A. I., Nesterenko Yu. F., Shidlovskii A. B. Introduction to Number Theory [Russian]. M.: MSU, 1984. [8] Gelfond A. O. Transcendental and Algebraic Numbers [Russian]. M.: Gostehizdat, 1952. [9] Ingam A. V. The Distribution of Prime Numbers [Russian translation]. M.: Librokom, 2009. [10] Karatsuba A. L. Foundations of Analytic Number Theory [Russian]. M.: URSS, 1983. [11] Kostrikin A. I. Introduction to Algebra (Part 3) [Russian]. M.: Fizmatlit, 2004. [12] Vinogradov I. M. Elements of Number Theory [Russian]. M.: Nauka, 1972.

86

Glossary

𝔸, 9 an → a, 72

ℚp , 81

ℂ, 4 f ◦ g, 37

∼, 30

ℝ, 4

(v)

vα , 71 vp,ρ , 71

deg, 4 g ∣ f, 4

ℤ, 4 ζ(z), 35 𝒵p , 80

e(n), 36 gcd, 4 hα (x), 7 I(n), 36 ι, 4 Λ(n), 34 li, 31 L(z, χ), 60 L(z, Gm ), 63 μ(n), 36 ℕ, 4 ∣g∣, 56 ℙ, 4 π(x), 30 ψ(x), 31 ˜ ψ(x), 31 ℚ, 4 87

Index

ﬁeld of fractions, 5

abelian group, 56 algebraic integer, 6 number, 6 arithmetic function, 36 asymptotically equivalent functions, 30

Hermite identity, 20 homomorphism of groups, 58 ideal, 5 identity function, 36 isomorphism of groups, 57

character modulo m, 60 character of a ﬁnite abelian group, 58 Chebyshev function, 31 integral, 31 complete ﬁeld, 76 conjugate numbers, 7 convolution product, 37 cyclic group, 56

leading coeﬃcient, 4 Lindemann theorem, 27 Liouville theorem, 18 L-series of character, 60 M¨ obius function, 36 Mangoldt function, 34 maximal ideal, 6 minimal polynomial, 7 monic polynomial, 4 multiplicative function, 36

degree of a Diophantine approximation, 14 of a polynomial, 4 of an algebraic number, 7 Diophantine approximation, 14 Dirichlet approximation theorem, 15 Dirichlet series, 35 Dirichlet theorem, 68 division algorithm, 4

p-adic valuation, 71 prime-counting function, 30 principal character, 58 principal ideal, 5 domain, 5

Euler function, 57 Euler identity, 38

Riemann hypothesis, 55 zeta-function, 35 ring of p-adic integers, 80

factor ring, 6 ﬁeld of p-adic numbers, 81 ﬁeld of p-adic numbers (p-adic ﬁeld), 79

symmetrized tuple, 25 88

Index transcendental number, 6 triangle inequality, 71 trivial valuation, 71 valuation, 71 valuation ﬁeld, 71

89