Introduction to Quantum Mechanics [2 ed.] 0128223103, 9780128223109

Introduction to Quantum Mechanics, 2nd Edition provides an accessible, fully updated introduction to the principles of q

349 72 5MB

English Pages 434 [419] Year 2020

Table of contents :
Contents
Preface to the 1st edition
Preface to the 2nd edition
About the author
1 Atoms and photons: origins of the quantum theory
1.1 Atomic and subatomic particles
1.2 Electromagnetic waves
1.3 Three failures of classical physics
1.4 Blackbody radiation
1.5 The photoelectric effect
1.6 Line spectra
Supplement 1A. Maxwell's equations
Supplement 1B. The Planck radiation law
2 Waves and particles
2.1 The double-slit experiment
2.2 Wave-particle duality
2.3 The Schrödinger equation
2.4 Operators and eigenvalues
2.5 The wavefunction
Chapter 2. Exercises
3 Quantum mechanics of some simple system
3.1 The free particle
3.2 Particle in a box
3.3 Free-electron model
3.4 Particle in a three-dimensional box
Supplement 3A. Finite square-well potential
Chapter 3. Exercises
4 Principles of quantum mechanics
4.1 Hermitian operators
4.2 Eigenvalues and eigenfunctions
4.3 Expectation values
4.4 More on operators
4.5 Postulates of quantum mechanics
4.6 Dirac bra-ket notation
4.7 The variational principle
4.8 Spectroscopic transitions
Supplement 4A. Perturbation theory
Supplement 4B. Time-dependent perturbation theory for radiative transitions
Chapter 4. Exercises
5 Special functions
5.1 Gaussian functions
5.2 The gamma function
5.3 The Dirac deltafunction
5.4 Leibniz's formula
5.5 Hermite polynomials
5.6 Spherical polar coordinates
5.7 Legendre polynomials
5.8 Spherical harmonics
5.9 Laguerre polynomials
5.10 Series solutions of differential equations
5.11 Bessel functions
5.12 Spherical Bessel functions
Supplement 5A. Particle in a disk
Supplement 5B. Particle in an inﬁnite spherical well
Supplement 5C. Particle in a deltafunction well
6 The harmonic oscillator
6.1 Classical oscillator
6.2 Quantum harmonic oscillator
6.3 Harmonic-oscillator eigenfunctions and eigenvalues
6.4 Operator formulation of harmonic oscillator
6.5 Quantum theory of radiation
Supplement 6A. Anharmonic oscillator
Chapter 6. Exercises
7 Angular momentum
7.1 Particle in a ring
7.2 Free electron model for aromatic molecules
7.3 Rotation in three dimensions
7.4 Theory of angular momentum
7.5 Operator derivation of angular momentum eigenvalues
7.6 Electron spin
7.7 Pauli spin algebra
7.8 Addition of angular momenta
8 The hydrogen atom and atomic orbitals
8.1 Atomic spectra
8.2 The Bohr atom
8.3 Quantum mechanics of hydrogenlike atoms
8.4 Hydrogen-atom ground state
8.5 Schrödinger equation for atomic orbitals
8.6 p- and d-orbitals
8.7 Summary on atomic orbitals
8.8 Reduced mass
Chapter 8. Exercises
9 The helium atom
9.1 Experimental energies
9.2 Schrödinger equation and variational calculations
9.3 Spinorbitals and the exclusion principle
9.4 Excited states of helium
Chapter 9. Exercises
10 Atomic structure and the periodic law
10.1 Slater determinants
10.2 Self-consistent ﬁeld theory
10.3 Aufbau principles
10.4 Atomic conﬁgurations and term symbols
10.5 Periodicity of atomic properties
10.6 Relativistic effects
10.7 Spiral form of the periodic table
Chapter 10. Exercises
11 The chemical bond
11.1 The hydrogen molecule
11.2 Valence bond theory
11.3 Hybrid orbitals and molecular geometry
11.4 Hypervalent compounds
11.5 Boron hydrides
11.6 Valence-shell model
11.7 Transition metal complexes
11.8 The hydrogen bond
11.9 Critique of valence-bond theory
Chapter 11. Exercises
12 Molecular orbital theory of diatomic molecules
12.1 The hydrogen molecule-ion
12.2 The LCAO approximation
12.3 MO theory of homonuclear diatomic molecules
12.4 Variational computation of molecular orbitals
12.5 Heteronuclear molecules
12.6 Electronegativity
Chapter 12. Exercises
13 Polyatomic molecules and solids
13.1 Hückel molecular orbital theory
13.2 Conservation of orbital symmetry; Woodward-Hoffmann rules
13.3 Band theory of metals and semiconductors
13.4 Computational chemistry
Chapter 13. Exercises
14 Density functional theory
14.1 Thomas-Fermi model
14.2 The Hohenberg-Kohn theorems
14.3 Density functional theory
14.4 Slater's X-alpha method
14.5 The Kohn-Sham equations
14.6 Chemical potential
Chapter 14. Exercises
15 Molecular symmetry
15.1 The ammonia molecule
15.2 Mathematical theory of groups
15.3 Group theory in quantum mechanics
15.4 Molecular orbitals for ammonia
15.5 Selection rules
15.6 The water molecule
15.7 Walsh diagrams
15.8 Molecular symmetry groups
Low-symmetry groups
Rotational groups
Dihedral groups
Groups of higher symmetry
15.9 Dipole moments and optical activity
15.10 Character tables
Chapter 15. Exercises
16 Molecular spectroscopy
16.1 Vibration of diatomic molecules
16.2 Vibration of polyatomic molecules
16.3 Rotation of diatomic molecules
16.4 Rotation-vibration spectra
16.5 Molecular parameters from spectroscopy
16.6 Rotation of polyatomic molecules
16.7 Electronic excitations
16.8 Lasers
16.9 Raman spectroscopy
Chapter 16. Exercises
17 Statistical thermodynamics
17.1 Quantum mechanics
17.2 Thermodynamic functions
17.3 The Boltzmann distribution
17.4 Molar partition function
17.5 Ideal monatomic gas
17.6 The Sakur-Tetrode equation
17.7 The Born-Oppenheimer approximation
17.8 Rotation of diatomic molecules
17.9 Rotation of polyatomic molecules
17.10 Molecular vibration
17.11 Electronic contributions
17.12 Summary
Supplement 17A. Low-temperature heat capacity of hydrogen molecules
Chapter 17. Exercises
18 Nuclear magnetic resonance
18.1 Magnetic properties of nuclei
18.2 Nuclear magnetic resonance
18.3 The chemical shift
18.4 Spin-spin coupling
18.5 Mechanism for spin-spin interactions
18.6 Magnetization and relaxation processes
18.7 Pulse techniques and Fourier transforms
18.8 Two-dimensional NMR
18.9 Magnetic resonance imaging
Chapter 18. Exercises
19 Wonders of the quantum world
19.1 The Copenhagen interpretation
19.2 Superposition
19.3 Schrödinger's Cat
19.4 Einstein-Podolsky-Rosen experiment
19.5 Bell's theorem
19.6 Aspect's experiment
19.7 Multiple photon entanglement
19.8 Philosophical problems of quantum theory
Chapter 19. Exercises
20 Quantum computers
20.1 Qubits
20.2 Quantum gates and circuits
20.3 Simulation of a Stern-Gerlach experiment
20.4 Quantum Fourier transform
20.5 Phase estimation algorithm
20.6 Many-electron systems
20.7 Atomic and molecular Hamiltonians
20.8 Time-evolution of a quantum system
20.9 Trotter expansions
20.10 Simulations of molecular structure
Answers to exercises
Chapter 2
Chapter 3
Chapter 4
Chapter 6
Chapter 8
Chapter 9
Chapter 10
Chapter 11
Chapter 12
Chapter 13
Chapter 14
Chapter 15
Chapter 16
Chapter 17
Chapter 18
Chapter 19
Suggested references
Index

Recommend Papers

Introduction to Quantum Mechanics 9811234728, 9789811234729

The author has published two texts on classical physics, Introduction to Classical Mechanics and Introduction to Electri

403 69 7MB Read more

Concise Introduction to Quantum Mechanics [2 ed.] 0750356618, 9780750356619

This extended and updated second edition course text presents a logical and concise introduction to the basic concepts,

121 116 7MB Read more

Introduction to Quantum Mechanics [3 ed.] 1107189632, 9781107189638

Changes and additions to the new edition of this classic textbook include a new chapter on symmetries, new problems and

1,002 97 25MB Read more

An Introduction to Covariant Quantum Mechanics 9783030895891, 9783030895884, 3030895890

This book deals with an original contribution to the hypothetical missing link unifying the two fundamental branches of

113 49 63MB Read more

Concise Introduction to Quantum Mechanics [2 ed.] 0750356618, 9780750356619

102 88 13MB Read more

Quantum Mechanics. A Concise Introduction 9789811976254, 9789811976261

183 47 3MB Read more

Introduction to Quantum Mechanics مقدمة في مكانيكا الكم

447 100 5KB Read more

Introduction to Quantum Mechanics [2nd Edition] 0131118927, 9780131118928

This book first teaches learners how to do quantum mechanics, and then provides them with a more insightful discussion o

573 76 6KB Read more

Quantum Mechanics. A Mathematical Introduction 9781009100502, 9781009118026

123 84 Read more

Quantum Mechanics : An Introduction 9781942270799, 1942270798

477 37 1MB Read more

Introduction to Quantum Mechanics [2 ed.]
0128223103, 9780128223109

Author / Uploaded
S.M. Blinder

0 0 0
Like this paper and download? You can publish your own PDF file online for free in a few minutes! Sign Up

File loading please wait...

Citation preview

Introduction to Quantum Mechanics

Introduction to Quantum Mechanics 2nd Edition, Revised and Expanded

S.M. Blinder Chemistry and Physics University of Michigan Ann Arbor, MI, United States

Academic Press is an imprint of Elsevier 125 London Wall, London EC2Y 5AS, United Kingdom 525 B Street, Suite 1650, San Diego, CA 92101, United States 50 Hampshire Street, 5th Floor, Cambridge, MA 02139, United States The Boulevard, Langford Lane, Kidlington, Oxford OX5 1GB, United Kingdom Copyright © 2021 Elsevier Inc. All rights reserved. No part of this publication may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopying, recording, or any information storage and retrieval system, without permission in writing from the publisher. Details on how to seek permission, further information about the Publisher’s permissions policies and our arrangements with organizations such as the Copyright Clearance Center and the Copyright Licensing Agency, can be found at our website: www.elsevier.com/permissions. This book and the individual contributions contained in it are protected under copyright by the Publisher (other than as may be noted herein). Notices Knowledge and best practice in this ﬁeld are constantly changing. As new research and experience broaden our understanding, changes in research methods, professional practices, or medical treatment may become necessary. Practitioners and researchers must always rely on their own experience and knowledge in evaluating and using any information, methods, compounds, or experiments described herein. In using such information or methods they should be mindful of their own safety and the safety of others, including parties for whom they have a professional responsibility. To the fullest extent of the law, neither the Publisher nor the authors, contributors, or editors, assume any liability for any injury and/or damage to persons or property as a matter of products liability, negligence or otherwise, or from any use or operation of any methods, products, instructions, or ideas contained in the material herein. Library of Congress Cataloging-in-Publication Data A catalog record for this book is available from the Library of Congress British Library Cataloguing-in-Publication Data A catalogue record for this book is available from the British Library ISBN: 978-0-12-822310-9 For information on all Academic Press publications visit our website at https://www.elsevier.com/books-and-journals Publisher: Susan Dennis Acquisitions Editor: Anneka Hess Editorial Project Manager: Andrea Dulberger Production Project Manager: Kumar Anbazhagan Designer: Mark Rogers Cover: The Wave Field, a landscape sculpture by Maya Lin on North Campus, the University of Michigan, Ann Arbor Typeset by VTeX

For Frances, Michael, Stephen, Matthew, Amy, Sarah, Lea, Michele, Kristen, Brooklyn, David and Clara.

Contents

Preface to the 1st edition Preface to the 2nd edition About the author

1.

Atoms and photons: origins of the quantum theory 1.1 Atomic and subatomic particles 1.2 Electromagnetic waves 1.3 Three failures of classical physics 1.4 Blackbody radiation 1.5 The photoelectric effect 1.6 Line spectra Supplement 1A. Maxwell’s equations Supplement 1B. The Planck radiation law

2.

21 24 28 31 32 33

Quantum mechanics of some simple system 3.1 The free particle 3.2 Particle in a box 3.3 Free-electron model 3.4 Particle in a three-dimensional box Supplement 3A. Finite square-well potential Chapter 3. Exercises

4.

1 4 5 7 9 11 14 17

Waves and particles 2.1 The double-slit experiment 2.2 Wave-particle duality 2.3 The Schrödinger equation 2.4 Operators and eigenvalues 2.5 The wavefunction Chapter 2. Exercises

3.

xiii xv xvii

35 36 41 43 46 49

Principles of quantum mechanics 4.1 Hermitian operators

51 vii

viii Contents

4.2 Eigenvalues and eigenfunctions 4.3 Expectation values 4.4 More on operators 4.5 Postulates of quantum mechanics 4.6 Dirac bra-ket notation 4.7 The variational principle 4.8 Spectroscopic transitions Supplement 4A. Perturbation theory Supplement 4B. Time-dependent perturbation theory for radiative transitions Chapter 4. Exercises

5.

71 73 75 79 79 81 84 86 87 89 91 95 96 98 99

The harmonic oscillator 6.1 Classical oscillator 6.2 Quantum harmonic oscillator 6.3 Harmonic-oscillator eigenfunctions and eigenvalues 6.4 Operator formulation of harmonic oscillator 6.5 Quantum theory of radiation Supplement 6A. Anharmonic oscillator Chapter 6. Exercises

7.

66 69

Special functions 5.1 Gaussian functions 5.2 The gamma function 5.3 The Dirac deltafunction 5.4 Leibniz’s formula 5.5 Hermite polynomials 5.6 Spherical polar coordinates 5.7 Legendre polynomials 5.8 Spherical harmonics 5.9 Laguerre polynomials 5.10 Series solutions of differential equations 5.11 Bessel functions 5.12 Spherical Bessel functions Supplement 5A. Particle in a disk Supplement 5B. Particle in an infinite spherical well Supplement 5C. Particle in a deltafunction well

6.

52 53 54 57 59 61 62 64

103 104 106 108 110 113 115

Angular momentum 7.1 7.2 7.3 7.4 7.5

Particle in a ring Free electron model for aromatic molecules Rotation in three dimensions Theory of angular momentum Operator derivation of angular momentum eigenvalues

117 119 119 121 122

Contents ix

7.6 Electron spin 7.7 Pauli spin algebra 7.8 Addition of angular momenta

8.

The hydrogen atom and atomic orbitals 8.1 Atomic spectra 8.2 The Bohr atom 8.3 Quantum mechanics of hydrogenlike atoms 8.4 Hydrogen-atom ground state 8.5 Schrödinger equation for atomic orbitals 8.6 p- and d-orbitals 8.7 Summary on atomic orbitals 8.8 Reduced mass Chapter 8. Exercises

9.

124 125 127

129 130 134 136 138 141 143 145 147

The helium atom 9.1 Experimental energies 9.2 Schrödinger equation and variational calculations 9.3 Spinorbitals and the exclusion principle 9.4 Excited states of helium Chapter 9. Exercises

151 151 154 155 156

10. Atomic structure and the periodic law 10.1 Slater determinants 10.2 Self-consistent field theory 10.3 Aufbau principles 10.4 Atomic configurations and term symbols 10.5 Periodicity of atomic properties 10.6 Relativistic effects 10.7 Spiral form of the periodic table Chapter 10. Exercises

159 163 165 166 170 172 174 174

11. The chemical bond 11.1 The hydrogen molecule 11.2 Valence bond theory 11.3 Hybrid orbitals and molecular geometry 11.4 Hypervalent compounds 11.5 Boron hydrides 11.6 Valence-shell model 11.7 Transition metal complexes 11.8 The hydrogen bond 11.9 Critique of valence-bond theory Chapter 11. Exercises

177 180 181 183 185 187 189 193 195 196

x Contents

12. Molecular orbital theory of diatomic molecules 12.1 The hydrogen molecule-ion 12.2 The LCAO approximation 12.3 MO theory of homonuclear diatomic molecules 12.4 Variational computation of molecular orbitals 12.5 Heteronuclear molecules 12.6 Electronegativity Chapter 12. Exercises

199 202 203 205 207 208 210

13. Polyatomic molecules and solids 13.1 Hückel molecular orbital theory 13.2 Conservation of orbital symmetry; Woodward-Hoffmann rules 13.3 Band theory of metals and semiconductors 13.4 Computational chemistry Chapter 13. Exercises

213 216 220 225 232

14. Density functional theory 14.1 Thomas-Fermi model 14.2 The Hohenberg-Kohn theorems 14.3 Density functional theory 14.4 Slater’s X-alpha method 14.5 The Kohn-Sham equations 14.6 Chemical potential Chapter 14. Exercises

235 238 240 241 243 244 244

15. Molecular symmetry 15.1 The ammonia molecule 15.2 Mathematical theory of groups 15.3 Group theory in quantum mechanics 15.4 Molecular orbitals for ammonia 15.5 Selection rules 15.6 The water molecule 15.7 Walsh diagrams 15.8 Molecular symmetry groups 15.9 Dipole moments and optical activity 15.10 Character tables Chapter 15. Exercises

245 247 248 249 252 254 255 255 261 262 262

16. Molecular spectroscopy 16.1 16.2 16.3 16.4

Vibration of diatomic molecules Vibration of polyatomic molecules Rotation of diatomic molecules Rotation-vibration spectra

267 270 272 275

Contents xi

16.5 Molecular parameters from spectroscopy 16.6 Rotation of polyatomic molecules 16.7 Electronic excitations 16.8 Lasers 16.9 Raman spectroscopy Chapter 16. Exercises

276 277 279 283 288 291

17. Statistical thermodynamics 17.1 Quantum mechanics 17.2 Thermodynamic functions 17.3 The Boltzmann distribution 17.4 Molar partition function 17.5 Ideal monatomic gas 17.6 The Sakur-Tetrode equation 17.7 The Born-Oppenheimer approximation 17.8 Rotation of diatomic molecules 17.9 Rotation of polyatomic molecules 17.10 Molecular vibration 17.11 Electronic contributions 17.12 Summary Supplement 17A. Low-temperature heat capacity of hydrogen molecules Chapter 17. Exercises

293 294 295 298 301 303 303 304 306 307 309 310 311 311

18. Nuclear magnetic resonance 18.1 Magnetic properties of nuclei 18.2 Nuclear magnetic resonance 18.3 The chemical shift 18.4 Spin-spin coupling 18.5 Mechanism for spin-spin interactions 18.6 Magnetization and relaxation processes 18.7 Pulse techniques and Fourier transforms 18.8 Two-dimensional NMR 18.9 Magnetic resonance imaging Chapter 18. Exercises

315 317 319 322 324 326 327 329 332 333

19. Wonders of the quantum world 19.1 19.2 19.3 19.4 19.5 19.6

The Copenhagen interpretation Superposition Schrödinger’s Cat Einstein-Podolsky-Rosen experiment Bell’s theorem Aspect’s experiment

337 339 340 342 345 348

xii Contents

19.7 Multiple photon entanglement 19.8 Philosophical problems of quantum theory Chapter 19. Exercises

349 352 354

20. Quantum computers 20.1 20.2 20.3 20.4 20.5 20.6 20.7 20.8 20.9 20.10

Qubits Quantum gates and circuits Simulation of a Stern-Gerlach experiment Quantum Fourier transform Phase estimation algorithm Many-electron systems Atomic and molecular Hamiltonians Time-evolution of a quantum system Trotter expansions Simulations of molecular structure

Answers to exercises Chapter 2 Chapter 3 Chapter 4 Chapter 6 Chapter 8 Chapter 9 Chapter 10 Chapter 11 Chapter 12 Chapter 13 Chapter 14 Chapter 15 Chapter 16 Chapter 17 Chapter 18 Chapter 19 Suggested references Index

356 358 364 364 369 371 373 376 377 378 381 381 383 384 384 386 388 390 390 392 393 395 396 396 399 401 402 403 405

Preface to the 1st edition

This book is the product of 40 years of distilled experience teaching quantum theory to juniors, seniors and graduate students. It is intended as a less weighty text for one semester of the physical chemistry sequence or for a stand-alone course in quantum mechanics for students of chemistry, material science, molecular biology or earth science—possibly even physics—if the student is willing to put up with chemical applications. The Supplements appended to several Chapters contain optional material for more adventurous students but can be guiltlessly omitted without loss of continuity. I have purposely limited the number of problems after each Chapter and geared them toward conceptual understanding rather than numerical drill. We refer to the excellent problem collections such as Shaum’s Physical Chemistry (McGraw-Hill, New York, 1989) for those desiring a larger selection of problems. I gratefully acknowledge the years of inspiration provided by my many teachers, students, colleagues and family members, too numerous and varied to be cited individually. Ann Arbor, Michigan May 2003

xiii

Preface to the 2nd edition

In the 15 years since publication of the First Edition, I have received an immense number of messages from readers, expressing mostly (but not all!) favorable comments and suggestions for enhancements and additions. In addition, my own experiences during this time have been very rich. I began my second career as a telecommuting computer scientist with Wolfram Research, the creators of Mathematica and other scientific software. In particular, I have been associated with the Wolfram Demonstrations Project, which has given me the opportunity of authoring several hundred interactive modules on quantum phenomena and related subjects and editing of several thousand other contributions by outside authors. I have made a large number of edits of material in the First Edition and have made some significant additions. Some of these are on a somewhat higher level than the original content, but all have been highly recommended by students and colleagues. As a result, this book can no longer be described as a compact, one-semester textbook. But it can still be used as such by judicious omission of some of the advanced topics. I have added Chapter 5 on Special Functions, to replace the separate fragmentary presentations on the quantum-mechanical applications of Hermite, Legendre and Laguerre functions and to place these in a more unified context. I have also added other mathematical topics significant to applications of quantum mechanics. Chapter 14 is an expanded account of density functional theory, which has now become the dominant approach in practical applications of computational chemistry. Chapter 17 covers the statistical thermodynamics of molecular gases, which very naturally follows the account of the parameters derived from molecular spectroscopy. Many undergraduate physical chemistry courses in which the main topic is quantum chemistry also cover statistical thermodynamics. Chapter 20 gives an introduction to quantum computers. These are, at present, largely in a stage which might best be described as unrealized potential, much like nuclear fusion power. Still, the consideration of quantum computation has proven to be very illuminating in enhancing our understanding of the fundamentals of quantum theory and quantum information. This final chapter demands a higher level of proficiency than the rest of the book and may, of course, be omitted. xv

xvi Preface to the 2nd edition

Finally, I would like to renew my thanks of my many colleagues, students, readers, critics and family members, who have made this endeavor not only possible, but highly rewarding. Ann Arbor, Michigan November 2019

About the author

S. M. Blinder is Professor Emeritus of Chemistry and Physics at the University of Michigan. Born in New York City, he got his undergraduate degree at Cornell University, where he did independent study with both Hans Bethe in the Physics Department and Peter Debye in the Chemistry Department. He completed his PhD in chemical physics from Harvard in 1958 under the direction of W. E. Moffitt and J. H. Van Vleck (Nobel Laureate in Physics, 1977). Professor Blinder has nearly 200 research publications in several areas of theoretical chemistry and mathematical physics. He was the first to derive the exact Coulomb (hydrogen atom) propagator in Feynman’s path-integral formulation of quantum mechanics. He is the author of several earlier books: Advanced Physical Chemistry (Macmillan, 1969); Foundations of Quantum Dynamics (Academic Press, 1974); Introduction to Quantum Mechanics (Elsevier, 2004); Guide to Essential Math (Elsevier, 2008, Second Edition 2013); TwentyFirst Century Quantum Mechanics, coauthored with Prof. G. Fano (Springer, 2017). He was the editor of the volume Mathematical Physics in Theoretical Chemistry (Elsevier, 2018). Professor Blinder has been associated with the University of Michigan since 1963. He taught a multitude of courses in chemistry, physics, mathematics and philosophy, mostly however on the subject of quantum theory. He is currently a telecommuting senior scientist with Wolfram Research (creators of Mathematica and other scientific software). In earlier incarnations, Blinder was a Junior Master in chess and an accomplished cellist. He currently lives with his wife in Ann Arbor.

xvii

Chapter 1

Atoms and photons: origins of the quantum theory 1.1 Atomic and subatomic particles

Leucippus

Democritus

The notion that the building blocks of matter are invisibly tiny particles called atoms is usually traced back to the Greek philosophers Leucippus of Miletus and Democritus of Abdera in the 5th Century BCE. The English chemist John Dalton developed the atomic philosophy of the Greeks into a true scientific theory in the early years of the 19th Century. His treatise New System of Chemical Philosophy gave cogent phenomenological evidence for the existence of atoms, suggested by the laws of definite and multiple proportions. The law of definite proportions is based on the observation that chemical compounds appear to contain their constituent elements in small whole number mass ratios. Dalton hypothesized that this was a consequence of the existence of discrete units of matter. This is further reenforced by the law of multiple proportions, whereby different compounds of the same elements combine in a series of related wholenumber ratios. The classic example is provided by the oxides of nitrogen, of which we would now identify the compounds NO2 , NO, N2 O, N2 O4 and N2 O5 . After over 2000 years of speculation and reasoning from indirect evidence, it is now possible in a sense to actually see individual atoms, shown, for example, by Fig. 1.1. Introduction to Quantum Mechanics. https://doi.org/10.1016/B978-0-12-822310-9.00009-4 Copyright © 2021 Elsevier Inc. All rights reserved.

1

2 Introduction to Quantum Mechanics

John Dalton

FIGURE 1.1 Image showing electron clouds of individual xenon atoms on a nickel(110) surface produced by a scanning tunneling microscope at (of course!) IBM Laboratories.

The word “atom” comes from the Greek atomos, meaning “indivisible.” It became evident in the late 19th Century, however, that the atom was not truly the ultimate particle of matter. Michael Faraday’s work had suggested the electrical nature of matter and the existence of subatomic particles. This became manifest with the discovery of radioactive decay by Henri Becquerel in 1896—the emission of alpha, beta and gamma particles from atoms. In 1897, J. J. Thomson identified the electron as a universal constituent of all atoms and showed that it carried a negative electrical charge. Its magnitude is now designated −e. To probe the interior of the atom, Ernest Rutherford in 1911 bombarded a thin sheet of gold with a stream of positively-charged alpha particles emitted by a radioactive source. Most of the high-energy alpha particles passed right through the gold foil, but a small number were strongly deflected in a way that indicated the presence of a small but massive positive charge in the center of

Atoms and photons: origins of the quantum theory Chapter | 1

J. J. Thomson

3

Ernest Rutherford

the atom (see Fig. 1.2). Rutherford proposed the nuclear model of the atom. As we now understand it, an electrically-neutral atom of atomic number Z consists of a nucleus of positive charge +Ze, containing almost the entire the mass of the atom, surrounded by Z electrons of very small mass, each carrying a charge −e. The simplest atom is hydrogen, with Z = 1, consisting of a single electron outside a single proton of charge +e.

FIGURE 1.2 Some representative trajectories in Rutherford scattering of alpha particles by a gold nucleus.

With the discovery of the neutron by Chadwick in 1932, the structure of the atomic nucleus was clarified. A nucleus of atomic number Z and mass number A was composed of Z protons and A − Z neutrons. Nuclei diameters are of the order of several times 10−15 m. From the perspective of an atom, which is 105 times larger, a nucleus behaves, for most purposes, like a point charge +Ze. During the 1960’s, compelling evidence began to emerge that protons and neutrons themselves had composite structures, mainly the idea of Murray GellMann. According to the currently accepted “Standard Model,” the protons and neutron are each made of three quarks, with compositions uud and udd, respec-

4 Introduction to Quantum Mechanics

Murray Gell-Mann

Richard Feynman

tively. The up quark u has a charge of + 23 e, while the down quark d has a charge of − 13 e. Despite heroic experimental efforts, free quarks have never been isolated. By contrast, the electron maintains its status as an observable elementary particle. We conclude this Section on atoms with a quote from Richard Feynman, in The Feynman Lectures on Physics, Vol I, p 1-2: If by some cataclysm, all of scientific knowledge were to be destroyed, and only one sentence passed on to the next generation. . . , what statement would contain the most information in the fewest words? I believe it is the atomic hypothesis. . . that all things are made of atoms—little particles that move around in perpetual motion, attracting one another when they are a little distance apart but repelling upon being squeezed into one another. In that one sentence, you will see, there is an enormous amount of information about the world.

1.2 Electromagnetic waves Perhaps the greatest achievement of physics in the 19th century was the unification in 1864 of the phenomena of electricity, magnetism and optics by James Clerk Maxwell. An (optional) summary of Maxwell’s equations is given in Supplement 1A. Heinrich Hertz in 1887 was the first to demonstrate experimentally the production and detection of the electromagnetic waves predicted by Maxwell—specifically radio waves—by acceleration of electrical charges. As shown in Fig. 1.3, electromagnetic waves consist of mutually perpendicular electric and magnetic fields, E and B respectively, oscillating in synchrony at high frequency and propagating in the direction of E × B. The wavelength λ is the distance between successive maxima of the electric (or magnetic) field. The frequency ν represents the number of oscillations per

Atoms and photons: origins of the quantum theory Chapter | 1

James Clerk Maxwell

5

Heinrich Hertz

FIGURE 1.3 Schematic representation of monochromatic linearly-polarized electromagnetic wave.

second observed at a fixed point in space. The reciprocal of frequency τ = 1/ν represents period of oscillation—the time it takes for one wavelength to pass a fixed point. The speed of propagation of the wave is therefore determined by λ = c τ or in more familiar form λν = c,

(1.1)

where c = 2.9979 × 108 m/sec, the speed of all electromagnetic waves in vacuum equals. Usually c is referred to as the speed of light. Frequencies are expressed in hertz (Hz), defined as the number of oscillations per second. Electromagnetic radiation is now known to exist in an immense range of wavelengths including gamma rays, X-rays, ultraviolet, visible light, infrared, microwaves and radio waves, as represented in Fig. 1.4.

1.3 Three failures of classical physics Isaac Newton’s masterwork, Principia, published in 1687, marks the beginning of modern physical science. Not only did Newton delineate the fundamental

6 Introduction to Quantum Mechanics

FIGURE 1.4 The electromagnetic spectrum, showing wavelengths of different types of radiation. Adapted from R. A. Freedman and W. J. Kaufmann III, Universe (Freeman, New York, 2001).

laws governing motion and gravitation but he established a general philosophical worldview which guided all scientific thinking for two centuries afterwards. This system of theories about the physical world is known as “Classical Physics.” Its most notable feature is the primacy of cause and effect relationships. Given sufficient information about the present state of the Universe, it should be possible, at least in principle, to predict its future behavior (as well as its complete history.) This capability is known as determinism. For example, solar and lunar eclipses can be predicted centuries ahead, within an accuracy of several seconds. (But interestingly, we can’t predict even a couple of days in advance if the weather will be clear enough to view the eclipse!)

Atoms and photons: origins of the quantum theory Chapter | 1

7

Isaac Newton The other great pillar of classical physics is Maxwell’s theory of electromagnetism. The origin of quantum theory is presaged by three anomalous phenomena involving electromagnetic radiation, which could not be adequately explained by the methods of classical physics. First among these was blackbody radiation, which led to the contribution of Max Planck in 1900. Next was the photoelectric effect, treated by Albert Einstein in 1905. Third was the origin of line spectra, the hero being Neils Bohr in 1913. A coherent formulation of quantum mechanics was eventually developed in 1925 and 1926, principally the work of Schrödinger, Heisenberg and Dirac. The remainder of this Chapter will describe the early contributions to the quantum theory by Planck, Einstein and Bohr.

1.4 Blackbody radiation It is a matter of experience that a hot object can emit radiation. A piece of metal stuck into a flame can become “red hot.” At higher temperatures, its glow can be described as “white hot.” Under even more extreme thermal excitation it can emit predominantly blue light (completing a very patriotic sequence of colors!). Josiah Wedgwood, the famous pottery designer, noted as far back as 1782 that different materials become red hot at the same temperature. The quantitative relation between color and temperature is described by the blackbody radiation law. A blackbody is an idealized perfect absorber and emitter of all possible wavelengths λ of the radiation. Fig. 1.5 shows experimental wavelength distributions of thermal radiation at several temperatures. Consistent with our experience, the maximum in the distribution, which determines the predominant color, increases with temperature. This relation is given by Wien’s displacement law, T λmax = 2.898 × 106 nm K

(1.2)

8 Introduction to Quantum Mechanics

where the wavelength is expressed in nanometers (nm). At room temperature (300 K), the maximum occurs around 10 µm, in the infrared region. In Fig. 1.5, the approximate values of λmax are 2900 nm at 1000 K, 1450 nm at 2000 K and 500 nm at 5800 K, the approximate surface temperature of the Sun. The Sun’s λmax is near the middle of the visible range (380-750 nm) and is perceived by our eyes as white light.

FIGURE 1.5 Intensity distributions of blackbody radiation at three different temperatures. The total radiation intensity varies as T 4 (Stefan-Boltzmann law) so the total radiation at 2000 K is actually 24 = 16 times that at 1000 K.

John William Strutt, Lord Rayleigh

Max Planck

The origin of blackbody radiation was a major challenge to 19th Century physics. Lord Rayleigh proposed that the electromagnetic field could be represented by a collection of oscillators of all possible frequencies. By simple geometry, the higher-frequency (lower wavelength) modes of oscillation are increasingly numerous since it is possible to fit their waves into an enclosure in a larger number of arrangements. In fact, the number of oscillators increases very rapidly as λ−4 . Rayleigh assumed that every oscillator contributed equally to the

Atoms and photons: origins of the quantum theory Chapter | 1

9

radiation (the equipartition principle). This agrees fairly well with experiment at low frequencies. But if ultraviolet rays and higher frequencies were really produced in increasing number, we would get roasted like marshmallows by sitting in front of a fireplace! Fortunately, this doesn’t happen, and the incorrect theory is said to suffer from an “ultraviolet catastrophe.” Max Planck in 1900 derived the correct form of the blackbody radiation law by introducing a bold postulate. He proposed that energies involved in absorption and emission of electromagnetic radiation did not belong to a continuum, as implied by Maxwell’s theory, but were actually made up of discrete bundles— which he called “quanta.” Planck’s idea is traditionally regarded as the birth of quantum theory. A quantum associated with radiation of frequency ν has the energy E = h ν,

(1.3)

where the proportionality factor h = 6.626 × 10−34 J sec is known as Planck’s constant. For our development of the quantum theory of atoms and molecules, we need only this simple result and do not have to follow the remainder of Planck’s derivation. If you insist, however, the details are given in Supplement 1B.

1.5 The photoelectric effect

FIGURE 1.6 Photoelectric effect.

A familiar device in modern technology is the photocell or “electric eye,” which runs a variety of useful gadgets, including automatic door openers. The principle involved in these devices is the photoelectric effect, which was first observed by Heinrich Hertz in the same laboratory in which he discovered electromagnetic waves. Visible or ultraviolet radiation impinging on clean metal surfaces can cause electrons to be ejected from the metal (see Fig. 1.6). Such an effect is not, in itself, inconsistent with classical theory since electromagnetic waves are known to carry energy and momentum. But the detailed behavior as a function of radiation frequency and intensity can not be explained classically. The energy required to eject an electron from a metal is determined by its work function Φ. For example, sodium has Φ = 1.82 eV. The electron-volt is

10 Introduction to Quantum Mechanics

a convenient unit of energy on the atomic scale: 1 eV = 1.602 × 10−19 J, corresponding to the energy which an electron picks up when accelerated across a potential difference of 1 volt. The classical expectation would be that radiation of sufficient intensity should cause ejection of electrons from a metal surface, with their kinetic energies increasing with the radiation intensity. Moreover, a time delay would be expected between the absorption of radiation and the ejection of electrons. The experimental facts are quite different. It is found that no electrons are ejected, no matter how high the radiation intensity, unless the radiation frequency exceeds some threshold value ν0 for each metal. For sodium ν0 = 4.39 × 1014 Hz (corresponding to a wavelength of 683 nm), as shown in Fig. 1.7. For frequencies ν above the threshhold, the ejected electrons acquire a kinetic energy given by 1 2

mv 2 = h(ν − ν0 ) = hν − Φ.

(1.4)

Evidently, the work function Φ can be identified with hν0 , equal to 3.65 × 10−19 J=1.82 eV for sodium. The kinetic energy increases linearly with frequency above the threshhold but is independent of the radiation intensity. Increased intensity does, however, increase the number of photoelectrons.

FIGURE 1.7 Photoelectric data for sodium (Millikan, 1916). The threshhold frequency ν0 , found by extrapolation, equals 4.39 × 1014 Hz.

In 1905, Albert Einstein proposed an explanation of the photoelectric effect (for which he received the Nobel Prize in 1921). Einstein’s explanation of the photoelectric effect appears trivially simple once stated. He accepted Planck’s hypothesis that a quantum of radiation carries an energy hν. Thus, if an electron is bound in a metal with an energy Φ, a quantum of energy hν0 = Φ will be sufficient to disloge it. And any excess energy h(ν − ν0 ) will appear as kinetic energy of the ejected electron. Einstein believed that the radiation field actually did consist of quantized particles, which were later named photons. Although Planck himself never believed that quanta were real, Einstein’s success with the photoelectric effect greatly advanced the concept of energy quantization.

Atoms and photons: origins of the quantum theory Chapter | 1

11

Albert Einstein

1.6

Line spectra

Most of what is known about atomic (and molecular) structure and mechanics has been deduced from spectroscopy. Fig. 1.8 shows two different types of spectra. A continuous spectrum can be produced by an incandescent solid or gas at high pressure. Blackbody radiation, for example, gives a continuum. An emission spectrum can be produced by a gas at low pressure excited by heat or by collisions with electrons. An absorption spectrum results when light from a continuous source passes through a cooler gas, consisting of a series of dark lines characteristic of the composition of the gas. Frauenhofer between 1814 and 1823 discovered nearly 600 dark lines in the solar spectrum viewed at high resolution. It is now understood that these lines are caused by absorption by the outer layers of the Sun.

FIGURE 1.8 Continuous spectrum and two types of line spectra.

12 Introduction to Quantum Mechanics

Gases heated to incandescence were found by Bunsen, Kirkhoff and others to emit light with a series of sharp wavelengths. The emitted light analyzed by a spectrometer (or even a simple prism) appears as a multitude of narrow bands of color. These so called line spectra are characteristic of the atomic composition of the gas. The line spectra of several elements are shown in Fig. 1.9. It is consistent with classical electromagnetic theory that motions of electri-

FIGURE 1.9 Emission spectra of several elements.

cal charges within atoms can be associated with the absorption and emission of radiation. What is completely mysterious is how such radiation can occur for discrete frequencies, rather than as a continuum. The breakthrough that explained line spectra is credited to Neils Bohr in 1913. Building on the ideas of Planck and Einstein, Bohr postulated that the energy levels of atoms belong to a discrete set of values En , rather than a continuum as in classical mechanics. When an atom makes a downward energy transition from a higher energy level Em to a lower energy level En , it caused the emission of a photon of energy hν = Em − En .

(1.5)

This is what accounts for the discrete values of frequency ν in emission spectra of atoms. Absorption spectra are correspondingly associated with the annihilation of a photon of the same energy and concomitant excitation of the atom from En to Em . Fig. 1.10 is a schematic representation of the processes of absorption and emission of photons by atoms. Absorption and emission processes occur at the same set frequencies, as is shown by the two types of line spectra in Fig. 1.8. Johannes Rydberg found in 1890 that all the lines of the atomic hydrogen spectrum could be fitted to a simple empirical formula 1 1 1 (1.6) =R 2 − 2 , n1 = 1, 2, 3 . . . , n2 > n1 , λ n1 n2

Atoms and photons: origins of the quantum theory Chapter | 1

Neils Bohr

13

Johannes Rydberg

FIGURE 1.10 Origin of line spectra. Absorption of the photon shown in blue causes atomic transition from E0 to E2 . Transition from E2 to E1 causes emission of the photon shown in red.

where R, known as the Rydberg constant, has the value 109,677 cm−1 . This formula was found to be valid for hydrogen spectral lines in the infrared and ultraviolet regions, in addition to the four lines in the visible region. No simple formula has been found for any atom other than hydrogen. Bohr proposed a model for the energy levels of a hydrogen atom which agreed with Rydberg’s formula for radiative transition frequencies. Inspired by Rutherford’s nuclear atom, Bohr suggested a planetary model for the hydrogen atom in which the electron goes around the proton in one of the allowed circular orbits, as shown in Fig. 1.11. A more fundamental understanding of the discrete nature of orbits and energy levels had to await the discoveries of 1925-26, but Bohr’s model provided an invaluable stepping-stone to the development of quantum mechanics. (See Fig. 1.12.) We will consider the hydrogen atom in greater detail in Chap. 8.

14 Introduction to Quantum Mechanics

FIGURE 1.11 Bohr model of the hydrogen atom showing three lowest-energy orbits and transition n = 3 to n = 2.

FIGURE 1.12 Stylized representation of the Bohr model for a multielectron atom. From the logo of the International Atomic Energy Agency.

Supplement 1A. Maxwell’s equations These four vector relations summarize the previously discovered experimental laws describing all known electrical and magnetic phenomena. In these expressions, ρ is the electric charge density, J, the current density, E, the electric field and B, the magnetic induction. Maxwell’s equations in free space (in the absence of dielectric or magnetic media) can be written ∇ · D = ρ,

(1.7)

∇ · B = 0, ∂B ∇ ×E+ = 0, ∂t ∂D . ∇ ×H=J+ ∂t

(1.8) (1.9) (1.10)

Atoms and photons: origins of the quantum theory Chapter | 1

15

The two auxiliary fields D, the electric displacement, and H, the magnetic field are defined by constitutive relations. In free space D = 0 E

B = μ0 H,

and

(1.11)

where 0 and μ0 , are the vacuum electric permittivity and magnetic permeability, respectively. Eq. (1.7) states that an electric field diverges from a distribution of electric charge. This leads to Coulomb’s law. Eq. (1.8) implies the nonexistence of isolated magnetic poles–the magnetic equivalent of electric charges. The most elementary magnetic objects are dipoles, connected pairs of north and south poles which can not be isolated from one another. Eq. (1.9) is an expression of Faraday’s law of electromagnetic induction, which shows how a circulating electric field can be produced by a time-varying magnetic field. Eq. (1.10) contains Ampère’s law showing how a magnetic field is produced by an electric current. The second term on the right, which was added by Maxwell himself, is, in a sense, reciprocal to Faraday’s law, since it implies that a circulating magnetic field can also be produced by a time-varying electric field. In the absence of charges and currents, Maxwell equations can be transformed into three-dimensional wave equations ∇2 −

1 ∂2 E = 0 and c2 ∂t 2

∇2 −

1 ∂2 B = 0, c2 ∂t 2

(1.12)

√ where c = 1/ 0 μ0 = 2.9979 × 108 m/sec, representing the speed of light in vacuum. Possible solutions to Eqs. (1.12) represent synchronized transverse electric and magnetic waves propagating at the speed c, as shown in Fig. 1.3. Even in the classical theory, electromagnetic fields can carry energy and momentum. The energy density of an electromagnetic field in free space is given by 2 ρE = 12 0 E 2 + μ−1 B . (1.13) 0 The energy flux or intensity (energy transported across unit area per unit time across unit area) is given by the Poynting vector S = E × H.

(1.14)

It is significant that the energy density and intensity depend of the square of field quantities. We will exploit an analogous relationship in the interpretation of the wavefunction in quantum mechanics. Maxwell’s first equation is equivalent to Coulomb’s law. In its simplest form, the force between two point charges q1 and q2 separated by a distance r is given by F=

1 q1 q2 . 4π0 r 2

(1.15)

16 Introduction to Quantum Mechanics

The algebraic signs of q1 and q2 determine whether the force is attractive or repulsive. If q1 and q2 are like charges, they repel (F > 0), whereas opposite charges attract (F < 0). In our applications to atomic and molecular structure, it is clumsy and unnecessary to carry the constant 4π0 . We will instead write Coulomb’s law in gaussian electromagnetic units, whereby F=

q 1 q2 . r2

(1.16)

The potential energy of interaction between two charges is related to the force by F = −dV /dr (more generally, F = −∇V ). Coulomb’s law therefore implies V (r) =

q1 q 2 . r

(1.17)

In applications to the quantum theory of atoms and molecules we will also need generalizations of Eq. (1.17) for continuous distributions of charge. For a point charge q located at r = r0 interacting with a charge density ρ(r), the interaction energy is given by an integral over volume ρ(r) 3 V =q (1.18) d r. |r − r0 | For r0 = 0 this simplifies to V =q

ρ(r) 3 d r. r

(1.19)

Analogously, the energy of interaction between two charge densities ρa (r) and ρb (r) is given by a double integral ρa (r1 ) ρb (r2 ) 3 V= d r1 d 3 r2 , (1.20) r12 where r12 = |r2 − r1 |. An electrostatic field can be represented as the gradient of a potential: E = −∇ .

(1.21)

The first of Maxwell’s equations, with D = 0 E, leads to Poisson’s equation for the electrostatic potential: ∇ 2 = −ρ/0 .

(1.22)

In later work, we will apply Poisson’s equation in gaussian units, which has the form ∇ 2 = −4πρ.

(1.23)

Atoms and photons: origins of the quantum theory Chapter | 1

17

In free space, with ρ = 0, this reduces to Laplace’s equation ∇ 2 = 0.

(1.24)

Supplement 1B. The Planck radiation law To apply Rayleigh’s idea that the radiation field can be represented as a collection of oscillators, we need to calculate the number of oscillators per unit volume for each wavelength λ. The reciprocal of the wavelength, k = 1/λ, is known as the wavenumber, and equals the number of wave oscillations per unit length. The wavenumber actually represents the magnitude of the wavevector k, which also determines the direction in which a wave is propagating. Now, all the vectors k of constant magnitude k in a 3-dimensional space can be considered to sweep out a spherical shell of radius k and infinitesimal thickness dk. The volume (in k-space) of this shell is equal to 4πk 2 dk and can be identified as the number of modes of oscillation per unit volume (in real space). Expressed in terms of λ, the number of modes per unit volume equals (4π/λ4 )dλ. Sir James Jeans recognized that this must be multiplied by 2 to take account of the two possible polarizations of each mode of the electromagnetic field. Assuming equipartition of energy implies that each oscillator has the energy kT , where k here is Boltzmann’s constant R/NA . Thus we obtain for the energy per unit volume per unit wavelength range ρ(λ) =

8πkT , λ4

(1.25)

which is known as the Rayleigh-Jeans law. This result gives a fairly accurate account of blackbody radiation for larger values of λ, in the infrared region and beyond. But it does suffer from the dreaded ultraviolet catastrophe, whereby ρ(λ) increases without limit as λ → 0. Planck realized that the fatal flaw was equipartition, which is based on the assumption that the possible energies of each oscillator belong to a continuum (0 ≤ E < ∞). If, instead, the energy of an oscillator of wavelength λ comes in discrete bundles hν = hc/λ, then the possible energies are given by Eλ,n = nhν = nhc/λ,

where n = 0, 1, 2 . . . .

(1.26)

By the Boltzmann distribution in statistical mechanics, the average energy of an oscillator at temperature T is given by −Eλ,n /kT n Eλ,n e Eλ av = . (1.27) −Eλ,n /kT n e Using the formula for the sum of a decreasing geometric progression ∞

n=0

e−nhc/λkT =

1 , 1 − e−hc/λkT

(1.28)

18 Introduction to Quantum Mechanics

FIGURE 1.13 Cosmic Microwave Background. From G. F. Smoot and D. Scott, http://pdg.lbl.gov/ 2001/microwaverpp.pdf.

we obtain Eλ av =

hc/λ . −1

ehc/λkT

(1.29)

This implies that the higher-energy modes are less populated than what is implied by the equipartition principle. Substituting this value, rather than kT , into the Rayleigh-Jeans formula (1.25), we obtain the Planck distribution law ρ(λ) =

1 8πhc . λ5 ehc/λkT − 1

(1.30)

Note that, for large values of λ and/or T , the average energy (1.29) is approximated by Eλ av ≈ kT and the Planck formula reduces to the Rayleigh-Jeans approximation. The Planck distribution law accurately accounts for the experimental data on thermal radiation shown in Fig. 1.5. Remarkably, measurements by the Cosmic Microwave Background Explorer satellite (COBE) give a perfect fit for a blackbody distribution at temperature 2.73 K, as shown in Fig. 1.13. The cosmic microwave background radiation, which was discovered by Penzias and Wilson in 1965, is a relic of the Big Bang 13.8 billion years ago. From the Planck distribution law one can calculate the wavelength at which ρ(λ) is a maximum at a given T . The result agrees with the Wien displacement law with λmax T =

ch . 4.965 k

(1.31)

Atoms and photons: origins of the quantum theory Chapter | 1

19

By integration of Eq. (1.26) over all wavelengths λ, we obtain the total radiation energy density per unit volume E=

∞

ρ(λ) dλ =

0

in accord with the Stefan-Boltzmann law.

8π 5 k 4 4 T , 15c3 h3

(1.32)

Chapter 2

Waves and particles Quantum mechanics is the theoretical framework which describes the behavior of matter on the atomic scale. It is the most successful quantitative theory in the history of science, having withstood thousands of experimental tests without a single verifiable exception. It has correctly predicted or explained phenomena in fields as diverse as chemistry, elementary-particle physics, solid-state electronics, molecular biology and cosmology. A host of modern technological marvels, including transistors, lasers, computers and nuclear reactors are offspring of the quantum theory. Possibly 30% of the US gross national product involves technology which is based on quantum mechanics. For all its relevance, the quantum world differs quite dramatically from the world of everyday experience. To understand the modern theory of matter, challenging conceptual hurdles of both psychological and mathematical variety must be overcome. A paradox which stimulated the early development of the quantum theory concerned the indeterminate nature of light. Light usually behaves as a wave phenomenon but occasionally it betrays a particle-like aspect, a schizoid tendency known as the wave-particle duality. We consider first the wave aspect of light.

2.1

The double-slit experiment

Fig. 2.1 shows a modernized version of the famous double-slit diffraction experiment first carried out by Thomas Young in 1801. Light from a monochromatic (single wavelength) source is passed through two narrow slits and projected onto a screen. Each slit by itself would allow just a narrow band of light to illuminate the screen. But with both slits open, a beautiful interference pattern of alternating light and dark bands appears, with maximum intensity in the center. To understand what is happening, we review some key results about electromagnetic waves. Maxwell’s theory of electromagnetism was an elegant unification of the diverse phenomena of electricity, magnetism and radiation, including light. Electromagnetic radiation is carried by transverse waves of electric and magnetic fields, propagating in vacuum at a speed c ≈ 3 × 108 m/sec, known as the “speed of light.” As shown in Fig. 1.11, the E and B fields oscillate sinusoidally, in synchrony with one another. The magnitudes of E and B are proportional (B = E/c in SI units). The distance between successive maxima (or minima) at a given instant of time is called the wavelength λ. At every point in space, the fields Introduction to Quantum Mechanics. https://doi.org/10.1016/B978-0-12-822310-9.00010-0 Copyright © 2021 Elsevier Inc. All rights reserved.

21

22 Introduction to Quantum Mechanics

Thomas Young

FIGURE 2.1 Modern version of Young’s interference experiment using a laser gun. Single slit (left) produces an intense band of light. Double slit (right) gives a diffraction pattern.

also oscillate sinusoidally as functions of time. The number of oscillations per unit time is called the frequency ν. Since the field moves one wavelength in the time λ/c, the wavelength, frequency and propagation velocity for any wave phenomenon are related by λ ν = c.

(2.1)

In electromagnetic theory, the intensity of radiation—the energy flux incident on a unit area per unit time—is determined by the Poynting vector S = μ−1 0 E × B.

(2.2)

The energy density contained in an electromagnetic field, even a static one, is given by 2 . (2.3) ρ = 12 0 E 2 + μ−1 0 B Note that both of the above energy quantities depends quadratically on the fields E and B. To discuss the diffraction experiments described above, it is useful to

Waves and particles Chapter | 2 23

define the amplitude of an electromagnetic wave at each point in space and time r, t by the function (r, t) =

B(r, t) √ , 0 E(r, t) = √ μ0

(2.4)

such that the intensity is given by ρ(r, t) = [(r, t)]2 .

(2.5)

The function (r, t) will, in some later applications, have complex values. In such cases we generalize the definition of intensity to ρ(r, t) = |(r, t)|2 = ∗ (r, t) (r, t),

(2.6)

where ∗ (r, t) represents the complex conjugate of (r, t). In quantummechanical applications, the function is known as the wavefunction.

FIGURE 2.2 Interference of two equal sinusoidal waves. Top: constructive interference. Bottom: destructive interference. Center: intermediate case. The resulting intensities ρ = 2 is shown on the right.

The electric and magnetic fields, hence the amplitude , can have either positive and negative values at different points in space. In fact constructive and destructive interference arises from the superposition of waves, as illustrated in Fig. 2.2. By Eq. (2.5), the intensity ρ ≥ 0 everywhere. The light and dark bands on the screen correspond to constructive and destructive interference, respectively. The wavelike nature of light is convincingly demonstrated by the fact

24 Introduction to Quantum Mechanics

that the intensity with both slits open is not the sum of the individual intensities, i.e., ρ = ρ1 + ρ2 . Rather it is the wave amplitudes which add: = 1 + 2 ,

(2.7)

with the intensity given by the square of the amplitude: ρ = 2 = 12 + 22 + 21 2 .

(2.8)

The cross term 21 2 is responsible for the constructive and destructive interference. Where 1 and 2 have the same sign, constructive interference makes the total intensity greater than the sum of ρ1 and ρ2 . Where 1 and 2 have opposite signs, there is destructive interference. If, in fact, 1 = −2 then the two waves cancel exactly, giving a dark fringe on the screen.

2.2 Wave-particle duality The interference phenomena demonstrated by the work of Young, Fresnel and others in the early 19th Century, apparently settled the matter that light was a wave phenomenon, contrary to the views of Newton a century earlier—case closed! But nearly a century later, phenomena were discovered which could not be satisfactorily accounted for by the wave theory, specifically blackbody radiation and the photoelectric effect.

FIGURE 2.3 Scintillations observed after dimming laser intensity by several orders of magnitude. These are evidently caused by individual photons!.

Deviating from the historical development, we will illustrate these effects by a modification of the double slit experiment. Let us equip the laser source with a dimmer switch capable of reducing the light intensity by several orders of magnitude, as shown in Fig. 2.3. With each successive filter the diffraction pattern becomes dimmer and dimmer. Eventually we will begin to see localized scintillations at random positions on an otherwise dark screen. It is an almost inescapable conclusion that these scintillations are caused by photons, the bundles of light postulated by Planck and Einstein to explain blackbody radiation and the

Waves and particles Chapter | 2 25

photoelectric effect. But wonders do not cease even here. Even though the individual scintillations appear at random positions on the screen, their statistical behavior reproduces the original high-intensity diffraction pattern. This is shown very dramatically in Fig. 2.4. Evidently the statistical behavior of the photons follows a predictable pattern, even though the behavior of individual photons is unpredictable. This implies that each individual photon, even though it behaves mostly like a particle, somehow carry with it a “knowledge” of the entire wavelike diffraction pattern. In some sense, a single photon must be able to go through both slits at the same time. This is what is known as the wave-particle duality for light: under appropriate circumstances light can behave either as a wave or as a particle.

FIGURE 2.4 Diffraction pattern built up from individual photon scintillations. Experiment by R. Austin and L. Page at Princeton University.

Planck’s resolution of the problem of blackbody radiation and Einstein’s explanation of the photoelectric effect can be summarized by a relation between the energy of a photon to its frequency: E = h ν,

(2.9)

where h = 6.626 × 10−34 J sec, known as Planck’s constant. Much later, the Compton effect was discovered, the phenomenon whereby an x-ray or gammaray photon ejects an electron from an atom, as shown in Fig. 2.5. Assuming conservation of momentum in a photon-electron collision, the photon is found to carry a momentum of magnitude p, given by p = h/λ.

(2.10)

26 Introduction to Quantum Mechanics

Eqs. (2.9) and (2.10) constitute quantitative realizations of the wave-particle duality, each relating a particle-like property—energy or momentum—to a wavelike property—frequency or wavelength.

FIGURE 2.5 Compton effect. The momentum and energy carried by the incident x-ray photon are transferred to the ejected electron and the scattered photon.

According to the special theory of relativity, the last two formulas are actually different facets of the same fundamental relationship. By Einstein’s famous formula, the equivalence of mass and energy is given by E = mc2 .

(2.11)

The photon’s rest mass is zero, but in traveling at speed c, it acquires a finite effective mass. Equating Eqs. (2.8) and (2.10) for the photon energy and taking the photon momentum to be p = mc, we obtain p = E/c = hν/c = h/λ.

(2.12)

Thus, the wavelength-frequency relation (2.1), implies the Compton-effect formula (2.10). The best we can do is to describe the phenomena constituting the wave-particle duality. There is no widely accepted explanation in terms of everyday experience and common sense. Feynman referred to the “experiment with two holes” as the “central mystery of quantum mechanics.” It should be mentioned that a number of models have been proposed over the years to rationalize these quantum mysteries. Bohm proposed that there might exist hidden variables which would make the behavior of each photon deterministic, i.e., particle-like. Everett and Wheeler proposed the “many worlds interpretation of quantum mechanics” in which each random event causes the splitting of the entire universe into disconnected parallel universes in which each possibility becomes the reality. Needless to say, not many people are willing to accept such a metaphysically unwieldy view of reality. Most scientists are content to apply the highly successful computational mechanisms of quantum theory to their work, without worrying unduly about its philosophical underpinnings. As Feynman

Waves and particles Chapter | 2 27

put it, “Shut up and calculate!” Much like most of us happily using our computers without acquiring a detailed knowledge of either semiconductor technology or operating-system programming. There was never any drawn-out controversy about whether electrons or any other constituents of matter were other than particle-like. Individual electrons produce scintillations on a phosphor screen—that is how the older TVs worked. But electrons also exhibit diffraction effects, which indicates that they too have wavelike attributes. An analog of the double-slit experiment using electrons instead of light is technically difficult but has been done. An electron gun, instead of a light source, produces a beam of electrons at a selected velocity. Then, everything that happens for photons has its analog for electrons, as shown by the diffraction pattern in Fig. 2.6.

FIGURE 2.6 Slit diffraction pattern for electrons, obtained by Claus Jönsson in Tübingen (1961). Selected as the “most beautiful” experiment in the history of physics by Physics World in 2002.

Diffraction experiments have been more recently carried out for particles as large as atoms and molecules, even for C60 fullerene molecules.

Louis de Broglie

28 Introduction to Quantum Mechanics

De Broglie in 1924 first conjectured that matter might also exhibit a waveparticle duality. A wavelike aspect of the electron might, for example, be responsible for the discrete nature of Bohr orbits in the hydrogen atom (cf. Chap. 8). According to de Broglie’s hypothesis, the “matter waves” associated with a particle have a wavelength given by λ = h/p,

(2.13)

identical in form to Compton’s result (2.10) (which, in fact, was discovered later). The correctness of de Broglie’s conjecture was most dramatically confirmed by the observations of Davisson and Germer in 1927 of diffraction of monoenergetic beams of electrons by metal crystals, much like the diffraction of x-rays. And measurements showed that de Broglie’s formula (2.13) did indeed give the correct wavelength (see Fig. 2.7).

FIGURE 2.7 Intensity of electron scattered at a fixed angle off a nickel crystal, as function of incident electron energy. From C. J. Davisson “Are Electrons Waves?” Franklin Institute Journal 205, 597 (1928).

2.3 The Schrödinger equation Schrödinger in 1926 first proposed an equation for de Broglie’s matter waves. This equation cannot be derived from some other principle since it constitutes a fundamental law of nature. Its correctness can be judged only by its subsequent agreement with observed phenomena (a posteriori proof). Nonetheless, we will attempt a heuristic argument to make the result at least plausible. In classical electromagnetic theory, it follows from Maxwell’s equations that each component of the electric and magnetic fields in vacuum is a solution of the wave equation 1 ∂ 2 = 0, c2 ∂t 2 where the Laplacian or “del-squared” operator is defined by ∇ 2 −

∇2 =

∂2 ∂2 ∂2 + + . ∂x 2 ∂y 2 ∂z2

(2.14)

(2.15)

Waves and particles Chapter | 2 29

Erwin Schrödinger We will attempt now to create an analogous equation for de Broglie’s matter waves. Accordingly, let us consider a very general instance of wave motion propagating in the x-direction. At a given instant of time, the form of a wave might be represented by a periodic function such as ψ(x) = f (2πx/λ),

(2.16)

where f (θ ) might be a sinusoidal function such as sin θ , cos θ, eiθ , e−iθ or some linear combination of these. The most suggestive form will turn out to be the complex exponential, which is related to the sine and cosine by Euler’s formula e±iθ = cos θ ± i sin θ.

(2.17)

Derivatives of exponentials are simpler than those of sines or cosines. Each of the above functions is periodic, with its value repeating as the argument increases by 2π. This happens whenever x increases by one wavelength λ. At a fixed point in space, the time-dependence of the wave has an analogous structure: T (t) = f (2πνt),

(2.18)

where ν gives the number of cycles of the wave per unit time. Taking into account both x- and t-dependence, we consider a wavefunction of the form x (x, t) = exp 2πi −νt , (2.19) λ representing waves traveling from left to right. Now we make use of the Planck and de Broglie formulas (2.8) and (2.12) to replace ν and λ by their particle analogs. This gives

30 Introduction to Quantum Mechanics

(x, t) = exp[i(px − Et)/],

(2.20)

where h . (2.21) 2π Since Planck’s constant occurs in most formulas with the denominator 2π, this symbol, pronounced “aitch-bar,” was invented by Dirac. Eq. (2.19) represents in some abstruse way the wavelike nature of a particle with energy E and momentum p. We will attempt to discover the underlying wave equation by “reverse engineering.” The time derivative of (2.19) gives ≡

∂ = −(iE/) × exp[i(px − Et)/]. ∂t

(2.22)

Thus i

∂ = E. ∂t

(2.23)

Analogously −i

∂ = p, ∂x

(2.24)

and ∂ 2 = p 2 . (2.25) ∂x 2 The energy and momentum for a nonrelativistic free particle are related by −2

p2 1 . E = mv 2 = 2 2m

(2.26)

This suggests that (x, t) satisfies the partial differential equation i

2 ∂ 2 ∂ . =− ∂t 2m ∂x 2

(2.27)

For a particle with a potential energy V (x), the analog of (2.25) is E=

p2 + V (x). 2m

(2.28)

We postulate that the equation for matter waves is then 2 ∂ 2 ∂ + V (x) . = − i ∂t 2m ∂x 2 For waves in three dimensions the obvious generalization is

(2.29)

Waves and particles Chapter | 2 31

i

∂ 2 2 (r, t) = − ∇ + V (r) (r, t). ∂t 2m

(2.30)

Here the potential energy and the wavefunction depend on the three space coordinates x, y, z, which we write for brevity as r. We have thus arrived at the time-dependent Schrödinger equation for the amplitude (r, t) of the matter waves associated with the particle. Its formulation in 1926 represents the starting point of modern quantum mechanics. (Heisenberg in 1925 proposed another version known as matrix mechanics.) For conservative systems, in which the energy is a constant, we can separate out the time-dependent factor from (2.19) and write (r, t) = ψ(r) e−iEt/ ,

(2.31)

where ψ(r) is a wavefunction dependent only on space coordinates. Putting (2.30) into (2.29) and canceling the exponential factors, we obtain the timeindependent Schrödinger equation: 2 2 (2.32) ∇ + V (r) ψ(r) = Eψ(r). − 2m Most of our applications of quantum mechanics to chemistry will be based on this equation.

2.4 Operators and eigenvalues The bracketed object in Eq. (2.31) is called an operator. An operator is a generalization of the concept of a function. Whereas a function is a rule for turning one number into another, an operator is a rule for turning one function in another. The Laplacian is an example of an operator. We provisionally indicate ˆ The action of an that an object is an operator by placing a “hat” over it, e.g., A. operator that turns the function f into the function g is represented by Aˆ f = g.

(2.33)

Eq. (2.23) implies that the operator for the x-component of momentum can be written ∂ pˆ x = −i , (2.34) ∂x and by analogy, we must have pˆ y = −i

∂ , ∂y

pˆ z = −i

∂ . ∂z

(2.35)

The energy, as in Eq. (2.27), expressed as a function of position and momentum is known in classical mechanics as the Hamiltonian. Generalizing to three dimensions,

32 Introduction to Quantum Mechanics

H=

p2 1 + V (r) = (p 2 + py2 + pz2 ) + V (x, y, z). 2m 2m x

(2.36)

We can construct from this the corresponding quantum-mechanical operator Hˆ = − 2m 2

∂2 ∂2 ∂2 + + ∂x 2 ∂y 2 ∂z2 =−

+ V (x, y, z) 2 2 ∇ + V (r). 2m

(2.37)

The time-independent Schrödinger equation (2.31) can then be written symbolically as Hˆ = E .

(2.38)

This form is applicable to any quantum-mechanical system, given the appropriate Hamiltonian and wavefunction. Most applications to chemistry involve systems containing several particles—the electrons and nuclei in atoms and molecules. An operator equation of the form Aˆ ψ = const ψ

(2.39)

is called an eigenvalue equation. Recall that, in general, an operator acting on a function gives another function (see Eq. (2.32)). The special case (2.38) occurs when the second function is a multiple of the first. In this case, ψ is known as an eigenfunction and the constant is called an eigenvalue. (These terms are hybrids with German, the purely English equivalents being “characteristic function” and “characteristic value.”) To every dynamical variable A in quantum mechanics, there corresponds an eigenvalue equation, usually written Aˆ ψ = a ψ.

(2.40)

The eigenvalues a represent the possible measured values of the variable A. The Schrödinger equation (2.37) is the best known instance of an eigenvalue equation, with its eigenvalues corresponding to the allowed energy levels of the quantum system.

2.5

The wavefunction

For a single-particle system, the wavefunction (r, t), or ψ(r) for the timeindependent case, represents the amplitude of the still vaguely defined matter waves. The relationship between amplitude and intensity of electromagnetic waves developed for Eq. (2.6) can be extended to matter waves. The most commonly accepted interpretation of the wavefunction is due to Max Born (1926), according to which ρ(r), the square of the absolute value of ψ(r) is proportional

Waves and particles Chapter | 2 33

Max Born to the probability density (probability per unit volume) that the particle will be found at the position r. Probability density is the three-dimensional analog of the diffraction pattern that appears on the two-dimensional screen in the doubleslit diffraction experiment for electrons, as in Fig. 2.6 for example. In the latter case we have the relative probability a scintillation will appear at a given point on the screen. The function ρ(r) becomes equal, rather than just proportional to, the probability density when the wavefunction is normalized, that is,

|ψ(r)|2 dτ = 1. (2.41) This simply accounts for the fact that the total probability of finding the particle somewhere adds up to unity. The integration in (2.40) extends over all space and the symbol dτ designates the appropriate volume element. For example, in cartesian coordinates, dτ = dx dy dz; in spherical polar coordinates, dτ = r 2 sin θ dr dθ dφ. The physical significance of the wavefunctions makes certain demands on its mathematical behavior. The wavefunction must be a single-valued function of all its coordinates, since the probability density ought to be uniquely determined at each point in space. Moreover, the wavefunction should be finite and continuous everywhere, since a physically-meaningful probability density must have the same attributes. The conditions that the wavefunction be single-valued, finite and continuous—in short, “well-behaved”—lead to restrictions on solutions of the Schrödinger equation such that only certain values of the energy and other dynamical variables are allowed. This is called quantization—the feature that gives quantum mechanics its name.

Chapter 2. Exercises 2.1. In the theory of relativity, space and time variables can be combined into a 4-dimensional vector with x1 = x, x2 = y, x3 = z, x4 = ict. The momentum

34 Introduction to Quantum Mechanics

and energy analogously combine to a 4-vector with p1 = px , p2 = py , p3 = pz , p4 = iE/c. By a suitable generalization of the quantization prescription for momentum components, deduce the time-dependent Schrödinger equation: 2 2 ∂(r, t) − ∇ + V (r) (r, t) = i . 2m ∂t 2.2. Estimate the number of photons emitted per second by a 100-watt lightbulb. Assume a wavelength of 550 nm (yellow light). 2.3. Electron diffraction makes use of 40 keV (40,000 eV) electrons. Calculate their de Broglie wavelength. 2.4. Show that the wavefunction (x, t) = ei(px−Et)/ is a solution of the onedimensional time-dependent Schrödinger equation. 2.5. Show that (r, t) = ei(p·r−Et)/ is a solution of the three-dimensional timedependent Schrödinger equation. 2.6. A certain one-dimensional quantum system in 0 ≤ x ≤ ∞ is described by the Hamiltonian: q2 2 d 2 , − Hˆ = − 2 2m dx x

(q = constant).

One of the eigenfunctions is known to be ψ(x) = A x e−αx ,

α ≡ mq 2 /2 ,

A = constant

(i) Write down the Schrödinger equation and carry out the differentiation. (ii) Find the corresponding energy eigenvalue (in terms of , m and q). (iii) Find the value of A which normalizes the wavefunction according to

∞ |ψ(x)|2 dx = 1. 0

You may require the definite integrals

∞ x n e−ax dx = n!/a n+1 . 0

Chapter 3

Quantum mechanics of some simple system 3.1

The free particle

The simplest system in quantum mechanics has the potential energy V equal to zero everywhere. This is called a free particle since it has no forces acting on it. We consider the one-dimensional case, with motion only in the x-direction, represented by the Schrödinger equation −

2 d 2 ψ(x) = Eψ(x). 2m dx 2

(3.1)

Total derivatives can be used since there is but one independent variable. The equation simplifies to ψ (x) + k 2 ψ(x) = 0,

(3.2)

k 2 ≡ 2mE/2 .

(3.3)

with the definition

Possible solutions of Eq. (3.2) are ⎧ ⎪ ⎨sin kx ψ(x) = const cos kx ⎪ ⎩ ±ikx e .

(3.4)

There is no restriction on the value of k. Thus a free particle, even in quantum mechanics, can have any non-negative value of the energy E=

2 k 2 ≥ 0. 2m

(3.5)

The energy levels in this case are not quantized and correspond to the same continuum of kinetic energy shown by a classical particle. Even for an atom or a molecule, which is most notable for its quantized energy levels, there will exist a continuum at sufficiently high energies, associated with the onset of ionization or dissociation. Introduction to Quantum Mechanics. https://doi.org/10.1016/B978-0-12-822310-9.00011-2 Copyright © 2021 Elsevier Inc. All rights reserved.

35

36 Introduction to Quantum Mechanics

It is of interest also to consider the x-component of linear momentum for the free-particle solutions (3.4). According to Eqs. (2.32) and (2.33), the eigenvalue equation for momentum should read pˆ x ψ(x) = −i

dψ(x) = p ψ(x), dx

(3.6)

where we have denoted the momentum eigenvalue as p. It is easily shown that neither of the functions sin kx or cos kx from (3.4) is an eigenfunction of pˆ x . But the e±ikx are both eigenfunctions with eigenvalues p = ±k, respectively. Evidently the momentum p can take on any real value between −∞ and +∞. The kinetic energy, equal to E = p 2 /2m, can correspondingly have any value between 0 and +∞. The functions sin kx and cos kx, while not eigenfunctions of pˆ x , are each superpositions of the two eigenfunctions e±ikx , by virtue of the trigonometric identities 1 1 cos kx = (eikx + e−ikx ) and sin kx = (eikx − e−ikx ). 2 2i

(3.7)

The eigenfunction eikx for k > 0 represents the particle moving from left to right on the x-axis, with momentum p > 0. Correspondingly, e−ikx represents motion from right to left with p < 0. The functions sin kx and cos kx represent standing waves, obtained by superposition of opposing wave motions. Although these latter two are not eigenfunctions of pˆ x they are eigenfunctions of pˆ x2 , hence of the Hamiltonian Hˆ . Orthogonality and normalization of the free-particle eigenfunctions involves the deltafunction. This will be discussed in Sect. 5.3, Eqs. (5.35) and (5.36).

3.2 Particle in a box Also known as a particle in an infinite square well, this is the simplest nontrivial application of the Schrödinger equation, but one which illustrates many of the fundamental concepts of quantum mechanics. For a particle moving in one dimension (again take the x-axis), the Schrödinger equation can be written −

2 ψ (x) + V (x)ψ(x) = E ψ(x). 2m

(3.8)

Assume that the particle can move freely between two endpoints x = 0 and x = a, but cannot penetrate past either end. This can be represented by a potential energy function 0 0≤x ≤a V (x) = (3.9) ∞ x < 0 and x > a.

Quantum mechanics of some simple system Chapter | 3

37

FIGURE 3.1 Potential well and lowest energy levels for particle in a box.

This potential is represented by the dark lines in Fig. 3.1. Infinite potential energy constitute an impenetrable barrier. The particle is thus bound to a potential well, sometimes called a square well. Since the particle cannot penetrate beyond the endpoints x = 0 or x = a we must have ψ(x) = 0 for

x < 0 and x > a.

(3.10)

By the requirement that the wavefunction be continuous, it must be true as well that ψ(0) = 0 and ψ(a) = 0,

(3.11)

which constitutes a pair of boundary conditions on the wavefunction within the box. Inside the box, V (x) = 0, so the Schrödinger equation reduces to the freeparticle form (3.1) −

2 ψ (x) = E ψ(x), 2m

0 ≤ x ≤ a.

(3.12)

We again have the differential equation ψ (x) + k 2 ψ(x) = 0

with

k 2 = 2mE/2 .

(3.13)

The general solution can be written ψ(x) = A sin kx + B cos kx,

(3.14)

38 Introduction to Quantum Mechanics

where A and B are constants to be determined by the boundary conditions (3.11). By the first condition, we find ψ(0) = A sin 0 + B cos 0 = B = 0.

(3.15)

The second boundary condition at x = a then implies ψ(a) = A sin ka = 0.

(3.16)

It is assumed that A = 0, for otherwise ψ(x) would be zero everywhere and the particle would disappear. The condition that sin kx = 0 implies that ka = nπ,

(3.17)

where n is a integer, positive, negative or zero. The case n = 0 must be excluded, for then k = 0 and again ψ(x) would vanish everywhere. Eliminating k between (3.13) and (3.17), we obtain En =

2 π 2 2 h2 2 n = n 2ma 2 8ma 2

n = 1, 2, 3 . . . .

(3.18)

These are the only values of the energy which allow solution of the Schrödinger equation (3.12) consistent with the boundary conditions (3.11). The integer n, called a quantum number, is appended as a subscript on E to label the allowed energy levels. Negative values of n add nothing new because sin(−kx) = − sin kx, which represents the same quantum state. Fig. 3.1 also shows part of the energy-level diagram for the particle in a box. Classical mechanics would predict a continuum for all values E ≥ 0. The occurrence of discrete or quantized energy levels is characteristic of a bound system in quantum mechanics, that is, one confined to a finite region in space. For the free particle, the absence of confinement allows an energy continuum. Note that, in both cases, the number of energy levels is infinite—denumerably infinite for the particle in a box but nondenumerably infinite for the free particle. The particle in a box assumes its lowest possible energy when n = 1, namely E1 =

h2 . 8ma 2

(3.19)

The state of lowest energy for a quantum system is termed its ground state. Higher energies are called excited states. An interesting point is that the groundstate energy E1 > 0, whereas the corresponding classical system would have a minimum energy of zero. This is a recurrent phenomenon in quantum mechanics. The residual energy of the ground state, that is, the energy in excess of the classical minimum, is known as zero point energy. In effect, the kinetic energy, hence the momentum, of a bound particle cannot be reduced to zero. The minimum value of momentum is found by equating E1 to p 2 /2m, giving

Quantum mechanics of some simple system Chapter | 3

39

pmin = ±h/2a. This can be expressed as an uncertainty in momentum approximated by p ≈ h/a. Coupling this with the uncertainty in position, x ≈ a, the size of the box, we can write x p ≈ h.

(3.20)

This is in accord with the Heisenberg uncertainty principle, which we will discuss in greater detail later. The particle-in-a-box eigenfunctions are given by Eq. (3.14), with B = 0 and k = nπ/a, in accordance with (3.17): ψn (x) = A sin

nπx , a

n = 1, 2, 3 . . . .

(3.21)

These, like the energies, can be labeled by the quantum number n. The constant A, thus far arbitrary, can be adjusted so that ψn (x) is normalized. The normalization condition (2.40) becomes, in this case, a [ψn (x)]2 dx = 1 (3.22) 0

the integration running over the domain of the particle, 0 ≤ x ≤ a. Substituting (3.21) into (3.22), nπ a nπx a a sin2 sin2 θ dθ = A2 = 1. dx = A2 (3.23) A2 a nπ 2 0 0 We have made the substitution θ = nπx/a and used the fact that the average value of sin2 θ over an integral number of half wavelengths equals 1/2. (Alternatively, one could refer to standard integral tables.) From (3.23) we can identify the normalization constant A = (2/a)1/2 for all values of n. Thus we obtain the normalized eigenfunctions: ψn (x) =

1/2 2 nπx sin , a a

n = 1, 2, 3 . . . .

(3.24)

The first few eigenfunctions and the corresponding probability distributions are plotted in Fig. 3.2. There is a close analogy between the states of this quantum system and the modes of vibration of a string. The patterns of standing waves on the string are, in fact, identical in form with the wavefunctions (3.24). A notable feature of the particle-in-a-box quantum states is the occurrence of nodes. These are points, other than the two end points (which are fixed by the boundary conditions), at which the wavefunction vanishes. At a node there is exactly zero probability of finding the particle. The nth quantum state has, in fact, n − 1 nodes. It is generally true that the number of nodes increases with the energy of a quantum state, which can be rationalized by the following qualitative argument. As the number of nodes increases, so does the number and steepness

40 Introduction to Quantum Mechanics

FIGURE 3.2 Eigenfunctions and probability densities for particle in a box.

of the ‘wiggles’ in the wavefunction. It’s like skiing down a slalom course. Accordingly, the average curvature, given by the second derivative, must increase. But the second derivative is proportional to the kinetic energy operator. Therefore, the more nodes, the higher the energy. This will prove to be an invaluable guide in more complex quantum systems. Another important property of the eigenfunctions (3.24) applies to the integral over a product of two different eigenfunctions. The following relationship is easy to see from Fig. 3.3:

a

ψ2 (x) ψ1 (x) dx = 0.

(3.25)

0

To prove this result in general, use the trigonometric identity 1 sin α sin β = [cos(α − β) − cos(α + β)] 2 to show that

a

ψm (x) ψn (x) dx = 0

if m = n.

(3.26)

(3.27)

0

This property is called orthogonality. We will show in the Chap. 4 that this is a general result for quantum-mechanical eigenfunctions. The normalization (3.25)

Quantum mechanics of some simple system Chapter | 3

41

FIGURE 3.3 Product of n=1 and n=2 eigenfunctions.

together with the orthogonality (3.27) can be combined into a single relationship a ψm (x) ψn (x) dx = δmn , (3.28) 0

in terms of the Kronecker delta

1 if m = n δmn ≡ 0 if m = n.

(3.29)

A set of functions {ψn } which obeys (3.28) is called orthonormal.

3.3 Free-electron model The simple quantum-mechanical problem we have just solved can provide an instructive application to chemistry: the free-electron model (FEM) for delocalized π-electrons. The simplest case is the 1,3-butadiene molecule

The four π-electrons are assumed to move freely over the four-carbon framework of single bonds. We neglect the zig-zagging of the C–C bonds and assume a one-dimensional box. We also overlook the reality that π-electrons actually have a node in the plane of the molecule. Since the electron wavefunction extends beyond the terminal carbons, we add approximately one-half bond length at each end. This conveniently gives a box of length equal to the number of carbon atoms times the C–C bond length, for butadiene, approximately 4 × 1.40 Å (recalling 1 Å=100 pm=10−10 m). Now, in the lowest energy state of butadiene, the 4 delocalized electrons will fill the two lowest FEM “molecular orbitals.” The total π-electron density will be given (as shown in Fig. 3.4) by ρ = 2ψ12 + 2ψ22 .

(3.30)

42 Introduction to Quantum Mechanics

FIGURE 3.4 Pi-electron density in butadiene.

For a chemical interpretation of this picture, note that the π-electron density is concentrated between carbon atoms 1 and 2, and between 3 and 4. Thus the predominant structure of butadiene has double bonds between these two pairs of atoms. Each double bond consists of a π-bond, in addition to the underlying σ -bond. However, this is not the complete story, because we must also take account of the residual π-electron density between carbons 2 and 3 and beyond the terminal carbons. In the terminology of valence-bond theory, butadiene would be described as a resonance hybrid with the predominant structure CH2 =CH-CH=CH2 but with a secondary contribution from ◦CH -CH=CH-CH ◦. The reality of the latter structure is suggested by the abil2 2 ity of butadiene to undergo 1,4-addition reactions. The free-electron model can also be applied to the electronic spectrum of butadiene and other linear polyenes. The lowest unoccupied molecular orbital (LUMO) in butadiene corresponds to the n = 3 particle-in-a-box state. Neglecting electron-electron interaction, the longest-wavelength (lowest-energy) electronic transition should occur from n = 2, the highest occupied molecular orbital (HOMO), as shown below:

The energy difference is given by E = E3 − E2 = (32 − 22 )

h2 . 8mL2

(3.31)

Here m represents the mass of an electron (not a butadiene molecule!), 9.1 × 10−31 kg, and L is the effective length of the box, 4 × 1.40 × 10−10 m. By the Bohr frequency condition hc . (3.32) λ The wavelength is predicted to be 207 nm. This compares well with the experimental maximum of the first electronic absorption band, λmax ≈ 210 nm, in the ultraviolet region. E = h ν =

Quantum mechanics of some simple system Chapter | 3

43

We might therefore be emboldened to apply the model to predict absorption spectra in higher polyenes CH2 =(CH–CH=)n−1 CH2 . For the molecule with 2n carbon atoms (n double bonds), the HOMO → LUMO transition corresponds to n → n + 1, thus h2 hc

. ≈ (n + 1)2 − n2 λ 8m(2nLCC )2

(3.33)

A useful constant in this computation is the Compton wavelength h/mc = 2.426 × 10−12 m. For n = 3, hexatriene, the predicted wavelength is 332 nm, while experiment gives λmax ≈ 250 nm. For n = 4, octatetraene, FEM predicts 460 nm, while λmax ≈ 300 nm. Clearly the model has been pushed beyond it range of quantitative validity, although the trend of increasing absorption band wavelength with increasing n is correctly predicted. A compound should be colored if its absorption includes any part of the visible range 400–700 nm. Retinol (vitamin A), which contains a polyene chain with n = 5, has a pale yellow color. This is its structure:

3.4 Particle in a three-dimensional box A real box has three dimensions. Consider a particle which can move freely with in rectangular box of dimensions a × b × c with impenetrable walls, as shown in Fig. 3.5.

FIGURE 3.5 Coordinate system for particle in a box.

In terms of potential energy, we can write 0 inside box V (x, y, z) = ∞ outside box.

(3.34)

44 Introduction to Quantum Mechanics

Again, the wavefunction vanishes everywhere outside the box. By the continuity requirement, the wavefunction must also vanish in the six surfaces of the box. Orienting the box so its edges are parallel to the cartesian axes, with one corner at (0, 0, 0), the following boundary conditions must be satisfied: ψ(x, y, z) = 0 when x = 0, x = a, y = 0, y = b, z = 0 or z = c.

(3.35)

Inside the box, where the potential energy is everywhere zero, the Hamiltonian is simply the three-dimensional kinetic energy operator and the Schrödinger equation reads 2 2 (3.36) ∇ ψ(x, y, z) = E ψ(x, y, z), 2m subject to the boundary conditions (3.35). This second-order partial differential equation is separable in cartesian coordinates, with a solution of the form −

ψ(x, y, z) = X(x) Y (y) Z(z),

(3.37)

subject to the boundary conditions X(0) = X(a) = 0,

Y (0) = Y (b) = 0,

Z(0) = Z(c) = 0.

(3.38)

Substitute (3.37) into (3.36) and note that ∂2 X(x) Y (y) Z(z) = X (x) Y (y) Z(z), ∂x 2

etc.

(3.39)

Dividing through by (3.37), we obtain X (x) Y (y) Z (z) 2mE + + + 2 = 0. X(x) Y (y) Z(z)

(3.40)

Each of the first three terms in (3.40) depends on one variable only, independent of the other two. This is possible only if each term separately equals a constant, say, −α 2 , −β 2 and −γ 2 , respectively. These constants must be negative in order that E > 0. Eq. (3.40) is thereby transformed into three ordinary differential equations X + α 2 X = 0,

Y + β 2 Y = 0,

Z + γ 2 Z = 0,

(3.41)

subject to the boundary conditions (3.39). The constants are related by 2mE = α2 + β 2 + γ 2. 2

(3.42)

Each of Eqs. (3.41), with its associated boundary conditions is equivalent to the one-dimensional problem (3.13) with boundary conditions (3.11). The normalized solutions X(x), Y (y), Z(z) can therefore be written down in complete

Quantum mechanics of some simple system Chapter | 3

45

analogy with (3.24): 1/2 2 n1 πx sin , a a 1/2 2 n2 πy sin , Yn2 (y) = b b 1/2 2 n3 πz sin , Zn3 (x) = c c

Xn1 (x) =

n1 = 1, 2 . . . n2 = 1, 2 . . . n3 = 1, 2 . . .

(3.43)

n3 π , c

(3.44)

n1 , n2 , n3 = 1, 2 . . . .

(3.45)

The constants in Eq. (3.41) are given by α=

n1 π , a

β=

n2 π , b

so that the allowed energy levels are

h2 n21 n22 n23 + + 2 , En1 ,n2 ,n3 = 8m a 2 b2 c

γ=

Three quantum numbers are required to specify the state of this threedimensional system. The corresponding eigenfunctions are ψn1 ,n2 ,n3 (x, y, z) =

8 V

1/2 sin

n1 πx n2 πy n3 πz sin sin , a b c

(3.46)

where V = abc, the volume of the box. These eigenfunctions form an orthonormal set [cf. Eq. (3.28)] such that

a b c 0

0

0

ψn1 ,n2 ,n3 (x, y, z) ψn1 ,n2 ,n3 (x, y, z) dx dy dz = δn1 ,n1 δn2 ,n2 δn3 ,n3 .

(3.47)

Note that two eigenfunctions will be orthogonal unless all three quantum numbers match. The three-dimensional matter waves represented by (3.46) are comparable with the modes of vibration of a solid block. The nodal surfaces are planes parallel to the sides, as shown in Fig. 3.6.

FIGURE 3.6 Nodal planes for particle in a box, for n1 = 4, n2 = 2, n3 = 3.

46 Introduction to Quantum Mechanics

When a box has the symmetry of a cube, with a = b = c, the energy formula (3.45) simplifies to En1 ,n2 ,n3 =

h2 (n2 + n22 + n23 ), 8ma 2 1

n1 , n2 , n3 = 1, 2 . . . .

(3.48)

Quantum systems with symmetry generally exhibit degeneracy in their energy levels. This means that there can exist distinct eigenfunctions which share the same eigenvalue. An eigenvalue which corresponds to a unique eigenfunction is termed nondegenerate while one which belongs to n different eigenfunctions is termed n-fold degenerate. As an example, we enumerate the first few levels for a cubic box, with En1 ,n2 ,n3 expressed in units of h2 /8ma 2 : E1,1,1 = 3 (nondegenerate), E1,1,2 = E1,2,1 = E2,1,1 = 6 (3-fold degenerate), E1,2,2 = E2,1,2 = E2,2,1 = 9 (3-fold degenerate), E1,1,3 = E1,3,1 = E3,1,1 = 11 (3-fold degenerate), E2,2,2 = 12 (nondegenerate), E1,2,3 = E1,3,2 = E2,1,3 = E2,3,1 = E3,1,2 = E3,2,1 = 14 (6-fold degenerate). The particle in a box is applied in statistical thermodynamics to model the perfect gas. Each molecule is assumed to move freely within the box without interacting with the other molecules. The total energy of N molecules, in any distribution among the energy levels (3.48), is proportional to 1/a 2 , thus E = const V −2/3 .

(3.49)

From the differential of work dw = −p dV , we can identify p=−

dE 2 E = . dV 3V

(3.50)

But the energy of a perfect monatomic gas is known to equal 32 nRT , which leads to the perfect gas law pV = nRT .

(3.51)

Supplement 3A. Finite square-well potential A more challenging variant of the particle-in-a-box problem is the case when the potential well is finite. Consider a finite square-well potential − a/2 ≤ x ≤ a/2 −V0 (3.52) V (x) = 0 x < −a/2 and x > a/2, shown in Fig. 3.7. We consider first the negative energy case −V0 < E < 0.

Quantum mechanics of some simple system Chapter | 3

47

FIGURE 3.7 Finite potential well of depth V0 .

There are three regions to consider: I (x < − a2 ), II (− a2 ≤ x ≤ a2 ) and III ( a2 < x). Since the potential is an even function of x, V (−x) = V (x), the wavefunction must be either an even or an odd function of x, designated as symmetric or antisymmetric. For the symmetric case ψI = Aeκx ,

ψII = B cos kx,

ψIII = Ae−κx ,

(3.53)

ψIII = −Ae−κx ,

(3.54)

while for odd parity, ψI = Aeκx , where

ψII = B sin kx,

κ=

−

2mE , 2

k=

2m(E + V0 ) . 2

(3.55)

We have taken into account the limits ψI → 0 as x → −∞ and ψIII → 0 as x → ∞. The energy eigenvalues are determined by the condition that the wavefunction and its first derivative must be continuous at the discontinuities of the potential. It is sufficient to consider just the matching conditions at x = −a/2, namely ψI (−a/2) = ψII (−a/2) and ψI (−a/2) = ψII (−a/2). For the even parity solutions, we find Ae−κa/2 = B cos(ka/2)

and

−

κ −κa/2 k = − B sin(ka/2). Ae 2 2

(3.56)

Dividing the second equation by the first, we obtain κ = k tan(ka/2)

(symmetric).

(3.57)

Analogously, for antisymmetric wavefunctions κ = −k cot(ka/2)

(antisymmetric).

(3.58)

48 Introduction to Quantum Mechanics

By virtue of (3.55), these constitute transcendental equations for the energies of a particle in a finite potential well. Let us define the dimensionless variables ma 2 (−E) ma 2 V0 ka and ξ = . (3.59) = ξ= 0 2 22 22 Eqs. (3.57) and (3.58) can then be written ξ02 − ξ 2 = ξ tan ξ and ξ02 − ξ 2 = −ξ cot ξ.

(3.60)

Graphical solutions for the energies can be obtained by plots of the functions f (ξ ) = ξ02 − ξ 2 and either ξ tan ξ or −ξ cot ξ , as shown in Fig. 3.8. The energies are determined by the values of ξ at the points of intersection. For example,

FIGURE 3.8 Graphical determination of energies for finite square well.

with ξ0 = 4, we obtain three solutions ξ = 1.252, 2.475 and 3.595. The corresponding energies are then determined using (3.59). For energies E > 0, the solutions of the Schrödinger equation in regions I, II and III have the form ψ(x) = a eikx + b e−ikx .

(3.61)

We will not work out the detailed values of a and b but note simply that solutions exist for all values of k ≥ 0, corresponding to an energy continuum for E ≥ 0, similar to the case of a free particle. The finite square well problem thus has a feature in common with atomic and molecular systems: a finite set of negativeenergy bound states plus a continuum for E ≥ 0, corresponding to ionization or dissociation of the system. (A fine point: the hydrogen atom and most positive ions have an infinite number of bound states.)

Quantum mechanics of some simple system Chapter | 3

49

Chapter 3. Exercises 3.1. Which of the following is not a solution y(x) of the differential equation y (x) + k 2 y(x) = 0 (k = constant): (i) sin(kx) (ii) cos(kx) (iii) eikx (iv) e−kx (v) sin(kx + α) (α = constant). 3.2. For a particle in a 1-dimensional box, calculate the probability that the particle will be found in the middle third of the box: L/3 ≤ x ≤ 2L/3. From the general formula for arbitrary n, find the limiting value as n → ∞. 3.3. Redo the solution of the 1-dimensional particle-in-a-box, when the particle is confined to the region −a/2 ≤ x ≤ a/2. 3.4. Predict the wavelength (in nm) of the lowest-energy electronic transition in the following polymethine ion: (CH3 )2 N+ = CH − CH = CH − CH = CH − N(CH3 )2 Assume that all the C–C and C–N bond lengths equal 1.40 Å. Note that N+ and N contribute 1 and 2 π-electrons, respectively. 3.5. In this calculation you will determine the order of magnitude of nuclear energies. Assume that a nucleus can be represented as a cubic box of side 10−14 m. The particles in this box are the nucleons (protons and neutrons). Calculate the lowest allowed energy of a nucleon. Express your result in MeV (1 MeV = 106 eV = 1.602 ×10−13 J). 3.5. Consider the hypothetical reaction of two “cube-atoms” to form a “molybox”:

Each cube-atom contains one electron. The interaction between electrons can be neglected. Determine the energy change in the above reaction. 3.7. Consider the two-dimensional particle-in-a-box—a particle free to move on a square plate of side a. Solve the Schrödinger equation to obtain the eigenvalues and eigenfunctions. You should be able to do this entirely by analogy with solutions we have already obtained. Discuss the degeneracies of the lowest few energy levels.

50 Introduction to Quantum Mechanics

3.8. As a variant on the free-electron model applied to benzene, assume that the six π electrons are delocalized within a square plate of side a. Calculate the value of a that would account for the 268 nm ultraviolet absorption in benzene.

Chapter 4

Principles of quantum mechanics In this Chapter we will continue to develop the mathematical formalism of quantum mechanics, using heuristic arguments as necessary. This will lead to a system of postulates which will be the basis of our subsequent applications of quantum mechanics.

4.1

Hermitian operators

An important property of operators is suggested by considering the Hamiltonian for the particle in a box: 2 d 2 . Hˆ = − 2m dx 2

(4.1)

Let f (x) and g(x) be arbitrary functions which obey the same boundary values as the eigenfunctions of Hˆ , namely that they vanish at x = 0 and x = a. Consider the integral

a

f (x) Hˆ g(x) dx = −

0

2 2m

a

f (x) g (x) dx.

(4.2)

0

Now, using integration by parts,

a

f (x) g (x) dx = −

0

a

a f (x) g (x) dx + f (x) g (x) .

0

(4.3)

0

The boundary terms vanish by the assumed conditions on f and g. A second integration by parts transforms (4.3) to +

a

a f (x) g(x) dx − f (x) g(x) .

0

0

It follows therefore that a ˆ f (x) H g(x) dx = 0

a

g(x) Hˆ f (x) dx.

(4.4)

0

Introduction to Quantum Mechanics. https://doi.org/10.1016/B978-0-12-822310-9.00012-4 Copyright © 2021 Elsevier Inc. All rights reserved.

51

52 Introduction to Quantum Mechanics

An obvious generalization for complex functions will read

a

f ∗ (x) Hˆ g(x) dx =

0

a

g ∗ (x) Hˆ f (x) dx

∗ .

(4.5)

0

In mathematical terminology, an operator Aˆ for which

f ∗ Aˆ g dτ =

g ∗ Aˆ f dτ

∗ (4.6)

for all functions f and g which obey specified boundary conditions is denoted as hermitian or self-adjoint. Evidently, the Hamiltonian is a hermitian operator. It is postulated that all quantum-mechanical operators that represent dynamical variables are hermitian.

4.2 Eigenvalues and eigenfunctions The sets of energies and wavefunctions obtained by solving any quantummechanical problem can be summarized symbolically as solutions of the eigenvalue equation Hˆ ψn = En ψn .

(4.7)

For another value of the quantum number, we can write Hˆ ψm = Em ψm .

(4.8)

∗ and the complex conjugate of (4.8) by ψ . Then we Let us multiply (4.7) by ψm n subtract the two expressions and integrate over dτ . The result is

∗ ˆ ψm H ψn dτ −

ψn∗ Hˆ ψm dτ

∗

∗ = (En − Em )

∗ ψm ψn dτ.

(4.9)

But by the hermitian property (4.6), the left-hand side of (4.9) equals zero. Thus ∗ ∗ ) ψm ψn dτ = 0. (4.10) (En − Em Consider first the case m = n. The second factor in (4.10) then becomes the normalization integral ψn∗ ψn dτ , which equals 1 (or at least a nonzero constant). Therefore the first factor in (4.10) must equal zero, meaning that En∗ = En .

(4.11)

This implies that the energy eigenvalues must be real numbers, which is quite reasonable from a physical point of view since eigenvalues represent possible

Principles of quantum mechanics Chapter | 4

53

results of measurement. Consider next the case when Em = En . Then it is the second factor in (4.10) that must vanish and ∗ ψn dτ = 0 when Em = En . (4.12) ψm Thus eigenfunctions belonging to different eigenvalues are orthogonal. In the case that ψm and ψn are degenerate eigenfunctions, so m = n but Em = En , the above proof of orthogonality does not apply. But it is always possible to construct degenerate functions that are mutually orthogonal. A general result is therefore the orthonormalization condition ∗ ψm ψn dτ = δmn . (4.13) It is easy to prove that a linear combination of degenerate eigenfunctions is itself an eigenfunction of the same energy. Let Hˆ ψnk = En ψnk ,

k = 1, 2 . . . d,

(4.14)

where the ψnk represent a d-fold degenerate set of eigenfunctions with the same eigenvalue En . Consider now the linear combination ψ = c1 ψn,1 + c2 ψn,2 + · · · + cd ψn,d .

(4.15)

Operating on ψ with the Hamiltonian and using (4.14), we find Hˆ ψ = c1 Hˆ ψn,1 + c2 Hˆ ψn,2 + · · · = En (c1 ψn,1 + c2 ψn,2 + . . . ) = En ψ,

(4.16)

which shows that the linear combination ψ is also an eigenfunction of the same energy. There is evidently a limitless number of possible eigenfunctions for a degenerate eigenvalue. However, only d of these will be linearly independent.

4.3 Expectation values One of the extraordinary features of quantum mechanics is the possibility for superposition of states. The state of a quantum system can sometimes exist as a linear combination of other states, such that ψ = c1 ψ 1 + c2 ψ 2 .

(4.17)

For example, the electronic ground state of the butadiene can (to an approximation) be considered a superposition of the valence-bond structures CH2 =CH-CH=CH2 and ◦CH2 -CH=CH-CH2 ◦. Assuming that all three functions in (4.17) are normalized and that ψ1 and ψ2 are orthogonal, we find ∗ ψ ψ dτ = (c1∗ ψ1∗ + c2∗ ψ2∗ )(c1 ψ1 + c2 ψ2 ) dτ =

54 Introduction to Quantum Mechanics

c1∗ c1

ψ1∗ ψ1 dτ

+ c2∗ c2

ψ2∗ ψ2 dτ

+ c1∗ c2

ψ2∗ ψ1 dτ

+ c2∗ c1

ψ2∗ ψ1 dτ =

c1∗ c1 × 1 + c2∗ c2 × 1 + c1∗ c2 × 0 + c2∗ c1 × 0 = |c1 |2 + |c2 |2 = 1. (4.18) We can interpret |c1 |2 and |c2 |2 as the probabilities that a system in a state described by ψ can have the attributes of the states ψ1 and ψ2 , respectively. (In the butadiene example above, |c1 |2 and |c2 |2 might approximate the fraction of molecules which undergo 1,3-addition and 1,4-addition, respectively, in the reaction with Cl2 .) Suppose ψ1 and ψ2 represent eigenstates of an observable A, satisfying the respective eigenvalue equations ˆ 1 = a1 ψ1 Aψ

and

ˆ 2 = a2 ψ2 . Aψ

(4.19)

Then a large number of measurements of the variable A in the state ψ will register the value a1 with a probability |c1 |2 and the value a2 with a probability |c2 |2 . The average value or expectation value of A will be given by A = |c1 |2 a1 + |c2 |2 a2 . This can be obtained directly from ψ by the “sandwich construction” ˆ dτ, A = ψ ∗ Aψ

(4.20)

(4.21)

or, if ψ is not normalized, ∗ ˆ dτ ψ Aψ . A = ∗ ψ ψ dτ

(4.22)

Note that the expectation value need not itself be a possible result of a single measurement (like the centroid of a donut, which is located in the hole!). When the operator Aˆ is a simple function, not containing differential operators or the like, then (4.21) reduces to the classical formula for an average value: A = A ρ dτ. (4.23)

4.4 More on operators An operator represents a prescription for turning one function into another—in ˆ = φ. From a physical point of view, the action of an operator on symbols, Aψ a wavefunction can be pictured as the process of measuring the observable A on the state ψ. The transformed wavefunction φ then represents the state of the system after the measurement is performed. In general φ is different from ψ,

Principles of quantum mechanics Chapter | 4 55

consistent with the fact that the process of measurement on a quantum system can produce an irreducible perturbation of its state. Only in the special case that ψ is an eigenstate of A, does a measurement preserve the original state. The function φ is then equal to an eigenvalue a times ψ. ˆ represents the successive action of The product of two operators, say Aˆ B, ˆ In general, the the operators, reading from right to left—i.e., first Bˆ then A. ˆ ˆ action of two operators in the reversed order, say B A, gives a different result, ˆ We say that the operators do not commute. which can be written Aˆ Bˆ = Bˆ A. This can be attributed to the perturbing effect one measurement on a quantum system can have on subsequent measurements. Here’s an example of noncommuting operators from everyday life: in our usual routine each morning, we shower and we get dressed. But the result of carrying out these operations in reversed order will be dramatically different! The commutator of two operators is defined by ˆ B] ˆ ≡ Aˆ Bˆ − Bˆ A. ˆ [A,

(4.24)

ˆ B] ˆ = 0, the two operators are said to commute. This means their When [A, combined effect will be the same whatever order they are applied (like brushing your teeth and showering).

Werner Heisenberg The uncertainty principle for simultaneous measurement of two observables A and B is determined by their commutator. The uncertainty a in the observable A is defined in terms of the mean square deviation from the average: (a)2 = (Aˆ − A)2 = A2 − A2 .

(4.25)

This corresponds to the standard deviation σ in statistics. The following inequality can be proven for the product of two uncertainties:

56 Introduction to Quantum Mechanics

1 ˆ ˆ a b ≥ |[A, B]|. 2

(4.26)

The best known application of (4.26) is to the position and momentum operators, say xˆ and pˆ x . Their commutator is given by [x, ˆ pˆ x ] = i,

(4.27)

x p ≥ /2,

(4.28)

so that

which is known as the Heisenberg uncertainty principle. This fundamental consequence of quantum theory implies that the position and momentum of a particle cannot be determined with arbitrary precision—the more accurately one is known, the more uncertain is the other. For example, if the momentum is known exactly, as in a momentum eigenstate, then the position is completely undetermined. The uncertainty relation for energy and time can be deduced by using the “energy operator” Eˆ = i∂/∂t in Eq. (4.26). The result is E t ≥ /2.

(4.29)

This aspect of the uncertainty principle, first proposed by Bohr, refers to a measurement of the energy carried out during a time interval t. It implies that a short-lived excited state of an atom or molecule, for which t is small, will be associated with a relatively large uncertainty in energy E. This is one factor contributing to the broadening of spectral lines. If two operators commute, there is no restriction on the accuracy of their simultaneous measurement. For example, the x and y coordinates of a particle can be known at the same time. An important theorem states that two commuting observables can have simultaneous eigenfunctions. To prove this, write the eigenvalue equation for an operator Aˆ Aˆ ψn = an ψn ,

(4.30)

then operate with Bˆ and use the commutativity of Aˆ and Bˆ to obtain Bˆ Aˆ ψn = Aˆ Bˆ ψn = an Bˆ ψn .

(4.31)

This shows that Bˆ ψn is also an eigenfunction of Aˆ with the same eigenvalue an . This implies that Bˆ ψn = const ψn ≡ bn ψn ,

(4.32)

so that ψn is a simultaneous eigenfunction of Aˆ and Bˆ with eigenvalues an and bn , respectively. The derivation becomes slightly more complicated in the case of degenerate eigenfunctions, but the same conclusion follows.

Principles of quantum mechanics Chapter | 4

57

After the Hamiltonian, the operators for angular momenta are probably the most important in quantum mechanics. The definition of angular momentum in classical mechanics is L = r × p. In terms of its cartesian components, Lx = ypz − zpy ,

Ly = zpx − xpz ,

Lz = xpy − ypx .

(4.33)

In future, we will write such sets of equation as “Lx = ypz − zpy , et cyc,” meaning that we add to one explicitly stated relation, the versions formed by successive cyclic permutation x → y → z → x. The general prescription for turning a classical dynamical variable into a quantum-mechanical operator was developed in Chap. 2. The key relations were the momentum components pˆ x = −i

∂ , ∂x

pˆ y = −i

∂ , ∂y

pˆ z = −i

∂ , ∂z

(4.34)

with the coordinates x, y, z simply carried over into multiplicative operators. Applying (4.34) to (4.33), we construct the three angular momentum operators ∂ ∂ ˆ et cyc, (4.35) Lx = −i y − z ∂z ∂y while the total angular momentum is given by Lˆ 2 = Lˆ 2x + Lˆ 2y + Lˆ 2z .

(4.36)

The angular momentum operators obey the following commutation relations: [Lˆ x , Lˆ y ] = iLˆ z

et cyc,

(4.37)

but [Lˆ 2 , Lˆ z ] = 0

(4.38)

and analogously for Lˆ x and Lˆ y . This is consistent with the existence of simultaneous eigenfunctions of Lˆ 2 and any one component, conventionally designated Lˆ z . But then these states cannot be eigenfunctions of either Lˆ x or Lˆ y . We will eventually drop the “hat” notation for operators and write simply A ˆ in place of A.

4.5 Postulates of quantum mechanics Our development of quantum mechanics is now sufficiently complete that we can reduce the theory to a set of postulates. Postulate 1. The state of a quantum-mechanical system is completely specified by a wavefunction that depends on the coordinates and time. The square of this function ∗ gives the probability density for finding the system with a specified set of coordinate values.

58 Introduction to Quantum Mechanics

The wavefunction must fulfill certain mathematical requirements because of its physical interpretation. It must be single-valued, finite and continuous. It must also satisfy a normalization condition

∗ dτ = 1. (4.39) Postulate 2. Every observable in quantum mechanics is represented by a linear, hermitian operator. The hermitian property was defined in Eq. (4.6). A linear operator is one which satisfies the identity ˆ 1 ψ1 + c2 ψ2 ) = c1 Aψ ˆ 1 + c2 Aψ ˆ 2, A(c

(4.40)

which is necessary for explaining superpositions of quantum states. The form of an operator which has an analog in classical mechanics is derived by the prescriptions rˆ = r,

pˆ = −i∇,

(4.41)

which we have previously expressed in terms of cartesian components (cf. Eq. (4.34)). Postulate 3. In any measurement of an observable A, associated with an opˆ the only possible results are the eigenvalues an , which satisfy an erator A, eigenvalue equation ˆ n = an ψn . Aψ

(4.42)

This postulate captures the essence of quantum mechanics—the quantization of measured dynamical variables. A continuum of eigenvalues is not excluded, however, as in the case of an unbound particle. Every measurement of A invariably gives one of its eigenvalues. For an arbitrary state (not an eigenstate of A), these measurements will be individually unpredictable—they can introduce “noise” into a system—but they follow a definite statistical law, according to the fourth postulate: Postulate 4. For a system in a state described by a normalized wavefunction

, the average or expectation value of the observable corresponding to A is given by A = ∗ Aˆ dτ. (4.43) Finally, we state Postulate 5. The wavefunction of a system evolves in time in accordance with the time-dependent Schrödinger equation

Principles of quantum mechanics Chapter | 4

59

∂

= Hˆ . (4.44) ∂t For time-independent problems this reduces to the time-independent Schrödinger equation i

Hˆ ψ = E ψ,

(4.45)

which is equivalent to the eigenvalue equation for the Hamiltonian operator.

4.6 Dirac bra-ket notation

P. A. M. Dirac The term orthogonal has been used to refer to both perpendicular vectors and to functions whose product integrates to zero. This actually connotes a deep connection between vectors and functions. Consider two orthogonal vectors a and b. Then, in terms of their x, y, z components, labeled by 1, 2, 3, respectively, the scalar product can be written a · b = a1 b1 + a2 b2 + a3 b3 = 0.

(4.46)

Suppose now that we consider an analogous relationship involving vectors in n-dimensional space (which you need not visualize!). We could then write a·b=

n

ak bk = 0.

(4.47)

k=1

Finally let the dimension of the space become nondenumerably infinite, turning into a continuum. The sum (4.47) would then be replaced by an integral such as

60 Introduction to Quantum Mechanics

a(x) b(x) dx = 0.

(4.48)

But this is just the relation for orthogonal functions. A function can therefore be regarded as an abstract vector in a higher-dimensional continuum, known as Hilbert space. This is true for eigenfunctions as well. Dirac denoted the vector in Hilbert space corresponding to the eigenfunction ψn by the symbol |n. Cor∗ is denoted by m|. The integral over respondingly, the complex conjugate ψm the product of the two functions is then analogous to a scalar product of the abstract vectors, written ∗ ψm ψn dτ = m| · |n ≡ m|n. (4.49) The last quantity is known as a bracket, which led Dirac to designate the vectors m| and |n as a “bra” and a “ket,” respectively. The orthonormality conditions (4.13) can be written m|n = δmn .

(4.50)

ˆ can be A matrix element, an integral of a “sandwich” containing an operator A, written very compactly in the form ∗ ˆ (4.51) A ψn dτ = m|A|n. ψm The hermitian condition on Aˆ [cf. Eq. (4.6)] is therefore expressed as m|A|n = n|A|m∗ .

(4.52)

For any operator Aˆ the adjoint operator Aˆ † is defined by m|A† |n = n|A|m∗ .

(4.53)

ˆ A hermitian operator is thus self-adjoint since Aˆ † = A. In matrix terminology, a ket |n is analogous to a column vector, a bra m| to a row vector and an operator A to a square matrix. Thus the expressions m|n and m|A|n represent compatible combinations for matrix products. A bra can be considered to be the adjoint of a ket n| = |n† ,

(4.54)

with the elements of a column vector being rearrayed as a row of corresponding complex conjugates. Note that n|A† = (A|n)† , so that

n|A A|n = †

ˆ n dτ ψn∗ Aˆ † Aψ

=

(4.55)

ˆ n |2 dτ ≥ 0. |Aψ

(4.56)

Principles of quantum mechanics Chapter | 4

61

4.7 The variational principle Except for a small number of intensively-studied examples, the Schrödinger equation for most problems of chemical interest cannot be solved exactly. The variational principle provides a guide for constructing the best possible approximate solutions of a specified functional form. Suppose that we seek an approximate solution for the ground state of a quantum system described by a Hamiltonian Hˆ . We presume that the Schrödinger equation Hˆ ψ0 = E0 ψ0

(4.57)

is too difficult to solve exactly. Suppose, however, that we have a function ψ˜ which we think is an approximation to the true ground-state wavefunction. According to the variational principle (or variational theorem), the following formula provides an upper bound to the exact ground-state energy E0 : ∗ ψ˜ Hˆ ψ˜ dτ E˜ ≡ (4.58) ≥ E0 . ψ˜ ∗ ψ˜ dτ Note that this ratio of integrals has the same form as the expectation value H ˜ the lower will be the comdefined by (4.22). The better the approximation ψ, ˜ though it will still be greater than the exact value. To prove puted energy E, Eq. (4.58), we suppose that the approximate function can, in concept, be represented as a superposition of the actual eigenstates of the Hamiltonian, analogous to (4.17), ψ˜ = c0 ψ0 + c1 ψ1 + . . . .

(4.59)

˜ the approximate ground state, might be close to the acThis means that ψ, tual ground state ψ0 but is “contaminated” by contributions from excited states ψ1 , ψ2 , . . . . Of course, none of the states or coefficients on the right-hand side is actually known, otherwise there would no need to worry about approximate computations. By Eq. (4.20), the expectation value of the Hamiltonian in the state (4.59) is given by E˜ = |c0 |2 E0 + |c1 |2 E1 + · · · .

(4.60)

Since all the excited states have higher energy than the ground state, E1 , E2 · · · ≥ E0 , we find E˜ ≥ (|c0 |2 + |c1 |2 + · · · ) E0 = E0 ,

(4.61)

assuming ψ˜ has been normalized. Thus E˜ must be greater than the true groundstate energy E0 , as implied by (4.58). As a very simple, although artificial, illustration of the variational principle, consider the ground state of the particle in a box. Suppose we had never studied trigonometry and knew nothing about sines or cosines. Then a reasonable

62 Introduction to Quantum Mechanics

approximation to the ground state might be an inverted parabola such as the normalized function 1/2 30 ˜ x (a − x). (4.62) ψ(x) = a5 Fig. 4.1 shows this function along with the exact ground-state eigenfunction 1/2 2 πx

sin ψ1 (x) = . (4.63) a a

FIGURE 4.1 Variational approximation for particle in a box.

A variational calculation gives a 2 ˜ ˜ E˜ = ψ (x) dx = ψ(x) − 2m 0 10 5 h2 = E1 = 1.01321E1 , 4π 2 ma 2 π 2

(4.64)

in terms of the exact ground state energy E1 = h2 /8ma 2 . In accord with the variational theorem, E˜ > E1 . The computation is in error by about 1%.

4.8 Spectroscopic transitions Interactions of atoms or molecules with electromagnetic radiation can induce transitions between quantum states. To deal with transitions, we need to consider the full time-dependent Schrödinger equation (TDSE) i

d

= Hˆ . ∂t

(4.65)

We consider a Hamiltonian of the form Hˆ (t) = Hˆ 0 + Vˆ (t),

(4.66)

Principles of quantum mechanics Chapter | 4

63

where Hˆ 0 , the unperturbed Hamiltonian, is independent of time and Vˆ (t), a perturbation, is much smaller than Hˆ 0 but contains dependence on time. The unperturbed Hamiltonian possesses a set of eigenstates satisfying Hˆ 0 ψn = En ψn .

(4.67)

The perturbation Vˆ (t) can induce transitions among eigenstates ψn and ψm of Hˆ 0 if the integral of Vˆ (t) connecting the two states, the matrix element ∗ ˆ Vmn (t) = ψm (4.68) V (t) ψn dτ is not equal to zero. This is consistent with our discussion of operators in Sect. 2.4. The action of Vˆ (t) on ψn transforms the system into a state Vˆ (t)ψn . Then the overlap (4.68) of this function with ψm gives the probability amplitude that ψm is the resulting state. We will show in Supplement 4A that the rate of transitions m ← n is proportional to |Vmn (t)|2 . The interaction of an atom or molecule with an radiation field is, to a first approximation, V (t) = −μ · E(t),

(4.69)

where μ is the instantaneous electric dipole moment of the atomic or molecular charge distribution and E, the electric field of the radiation. The contribution to the dipole for each electron is given by μ = −er, where r is measured from the centroid of positive charge. There is, in addition, a contribution to the interaction energy from the magnetic dipole, with the analogous form −μmag · B, but this is generally small compared to the electric dipole energy. The magnetic dipole becomes dominant however in magnetic resonance phenomena, to be considered in Chap. 18. The key quantities for electric dipole transitions between quantum states n and m are the matrix elements ∗ x ψn dτ (4.70) xmn = ψm and analogous ones for ymn and zmn . These determine the probability of a transition for radiation polarized in the corresponding directions x, y, z. If xmn , ymn and zmn are all equal to zero for two states m and n, then the transition between these states is electric dipole forbidden. The conditions on the quantum numbers m and n for allowed transitions to occur are called selection rules. Transitions among states of the particle in a box are determined by the integrals a ψn (x) x ψn (x) dx. (4.71) xn n = 0

It can be deduced that the selection rule for transitions is n = ±1, ±3, ±5 . . . , where n = n + n. Transitions with n = ±2, ±4 . . . are forbidden.

64 Introduction to Quantum Mechanics

Supplement 4A. Perturbation theory The number of problems in quantum mechanics which can be solved exactly is relatively small. The great majority of problems, including those involving atoms or molecules more complicated than the hydrogen atom, must be approached using approximate methods. The two most important approximation techniques are the variational method, discussed in Sect. 4.7, and perturbation theory, which we will now describe. Assume that the Hamiltonian for a problem of interest can be represented by a relatively small addition to the Hamiltonian for an exactly solvable problem. We write H = H0 + λV ,

(4.72)

where the Schrödinger equation for unperturbed Hamiltonian H0 is assumed to be solvable for a complete set of eigenvalues and eigenfunctions H0 ψn(0) = En(0) ψn(0) .

(4.73)

The perturbation operator V is scaled by an auxiliary parameter λ, which will serve to track the different orders of approximation; λ = 0 corresponds to the unperturbed problem. The equation we wish to solve is (H0 + λV )ψn = En ψn .

(4.74)

Assuming that the effect of the perturbation on the eigenfunctions ψn and eigenvalues En is relatively small, we can expand them in power series in the parameter λ: ψn = ψn(0) + λψn(1) + λ2 ψn(2) + . . . ,

(4.75)

En = En(0) + λEn(1) + λ2 En(2) + . . . .

(4.76)

Substituting these expansions into (4.74) we find H0 ψn(0) + λ(H0 ψn(1) + V ψn(0) ) + λ2 (H0 ψn(2) + V ψn(1) ) + · · · = En(0) ψn(0) + λ(En(0) ψn(1) + En(1) ψn(0) ) + λ

2

(En(0) ψn(2)

+ En(1) ψn(1)

(4.77)

+ En(2) ψn(0) ) + . . . .

Since the parameter λ is arbitrary, the coefficients of like powers of λ can be equated. This leads to a series of equations: H0 ψn(0) = En(0) ψn(0) ,

(4.78)

H0 ψn(1) + V ψn(0) = En(0) ψn(1) + En(1) ψn(0) , H0 ψn(2)

+ V ψn(1)

= En(0) ψn(2)

+ En(1) ψn(1)

+ En(2) ψn(0) .

(4.79) (4.80)

Principles of quantum mechanics Chapter | 4

65

The first of these, is just the unperturbed result (4.73). The second equation (1) (1) (2) (2) can be solved to give En and ψn , the third to give En and ψn and so on. For compactness, we will now omit the superscripts (0) pertaining to the unperturbed eigenvalues and eigenfunctions. (1) To find the solutions, it is assumed that the function ψn can be expanded in the complete orthonormal set of unperturbed eigenfunctions cnk ψk . (4.81) ψn(1) = cn0 ψ0 + cn1 ψ1 + cn2 ψ2 + · · · = k

If we multiply Eq. (4.79) by ψn∗ and integrate, we find ψn∗ H0 ψn(1) dτ + ψn∗ V ψn dτ = En ψn∗ ψn(1) dτ + En(1) ψn∗ ψn dτ.

(4.82)

Noting that ∗ (1) (1)∗ ψn H0 ψn dτ = ψn H0 ψn dτ = En ψn∗ ψn(1) dτ

(4.83)

and

ψn∗ ψn dτ = 1, we determine the first-order perturbation energy En(1) = ψn∗ V ψn dτ = Vnn .

(4.84)

Note that the first-order energy depends only on the unperturbed eigenfunction. We introduce the matrix notation ∗ V ψn dτ, (4.85) Vmn = ψm where the indices refer to the unperturbed eigenfunctions. ∗ , where m = n and integrate, we find If we multiply Eq. (4.79) instead by ψm ∗ ∗ ∗ (1) ψm H0 ψn(1) dτ + ψm V ψn dτ = En ψm ψn dτ, (4.86) ∗ noting that ψm ψn dτ = 0 by orthogonality of the unperturbed eigenfunctions. (1) Now, inserting the expansion (4.81) for ψn and using matrix notation, Em cnk δmk + Vmn = En cnk δkn , (4.87) k

k

where we have noted that ψk∗ ψn dτ = δkn and ψk∗ H0 ψn dτ = En δkn . Noting that the Kronecker deltas reduce each sum to a single term, we determine the

66 Introduction to Quantum Mechanics

expansion coefficients cnm =

Vmn . E n − Em

(4.88)

(1)

The first-order eigenfunction ψn can be assumed to be orthogonal to the unperturbed ψn , so that cnn = 0.

(4.89)

Thus the first order function is given by ψn(1) =

Vmn ψm , En − Em m

(4.90)

where the prime on the summation indicates that the term m = n is omitted. To find the second-order energy we multiply Eq. (4.80) by ψn∗ and integrate. Noting that ∗ (2) (2)∗ ψn H0 ψn dτ = ψn H0 ψn dτ = En ψn∗ ψn(2) dτ (4.91) and

ψn∗ ψn(1) dτ = 0.

The result is

(4.92)

En(2) =

ψn V ψn(1) dτ.

(4.93)

Using (4.90), this gives the well-known formula for the second-order energy En(2) =

Vnm Vmn |Vnm |2 = . En − Em E n − Em m m

(4.94)

Supplement 4B. Time-dependent perturbation theory for radiative transitions If a Hamiltonian H is independent of time, it will possess a set of eigenstates satisfying H ψn = E n ψn .

(4.95)

The functions n = ψn e−iEn t/ will be also solutions of the time-dependent Schrödinger equation i

d

= H . ∂t

(4.96)

Principles of quantum mechanics Chapter | 4

Consider now a superposition of functions n

= cn e−iEn t/ ψn = c0 e−iE0 t/ ψ0 + c1 e−iE1 t/ ψ1 + . . . ,

67

(4.97)

n

where c0 , c1 . . . are constants. It is readily shown that is a general solution of the time-dependent Schrödinger equation. This represents a nonstationary state, one which does not have a definite energy and is not a solution of the time-independent Schrödinger equation (4.72). For a Hamiltonian H (t) which has explicit dependence on time, solutions of the TISE do not generally exist, but it is still possible to solve the TDSE (4.73). Again consider a Hamiltonian of the form H (t) = H0 + V (t),

(4.98)

where H0 is independent of time and V (t) is much smaller than H0 but contains dependence on time. Assume that we can determine the eigenstates of the unperturbed Hamiltonian, such that H0 ψ n = E n ψ n .

(4.99)

Let the perturbation V (t) now be “turned on.” The wavefunction of the perturbed system will now be determined by the TDSE i

d

= (H0 + V ) . ∂t

(4.100)

The solution can be represented by a generalization of (4.74)

= c0 (t) e−iE0 t/ ψ0 + c1 (t) e−iE1 t/ ψ1 + . . . ,

(4.101)

in which the coefficients c0 , c1 . . . are now functions of t, to reflect the timedependence of the new Hamiltonian. We will suppose that the perturbation has been operative for only a short time, so that the system deviates only slightly from its unperturbed state 0 . With this approximation, |c0 (t)| ≈ 1 and |c1 (t)|, |c2 (t)| · · · > 2π . The dependence on the transition rate on the square of the perturbation matrix element, as assumed in Sect. 4.8, has thus been proven.

Principles of quantum mechanics Chapter | 4 69

Chapter 4. Exercises 4.1. If ψ happens to be an eigenfunction of an operator Aˆ with the eigenvalue a, evaluate the expectation value A. 4.2. Discuss why the noncommutativity of observables is not generally significant in everyday life. For example, why can we simultaneously measure the instantaneous position and momentum of a pitched baseball with confidence? 4.3. Evaluate the commutator [x, px ] used to derive the Heisenberg uncertainty principle. Hint: First compute the quantity x pˆ x f (x) − pˆ x xf (x), where f (x) is a arbitrary function. 4.4. Convince yourself of the correctness of the commutation relation [Lx , Ly ] = iLz . 4.5. Can you measure simultaneously a particle’s y-position coordinate and x-component of momentum? 4.6. Can you measure simultaneously a particle’s z-component of linear momentum and z-component of angular momentum? Give proof. 4.7. The one-dimensional harmonic oscillator involves a particle of mass m subject to a potential energy V (x) = 12 kx 2 . Write down the Lagrangian and Hamiltonian for this system and show that Newton’s second law, Lagrange’s equation and Hamilton’s equations all lead to the same solution.

Chapter 5

Special functions Solutions for the simple quantum systems considered thus far have involved only elementary functions, the term applied to algebraic, exponential, logarithmic and trigonometric functions. In subsequent Chapters, we will take up quantum mechanical problems including the harmonic oscillator, angular momentum and the hydrogen atom which require more complex functions. These are known collectively as special functions and are usually named in honor of mathematicians who studied them a couple of centuries ago, including Hermite, Legendre, Laguerre, Bessel and several others. Hopefully, previous familiarity with this mathematics will expedite our treatment of more complex quantum systems as we encounter them. A comprehensive reference on special functions is M. Abramowitz and I. A. Stegun, eds., Handbook of Mathematical Functions with Formulas, Graphs and Mathematical Tables (National Bureau of Standards Applied Mathematics Series, Vol 55, Washington, DC, 1964). The National Institute of Standards and Technology (NIST) has published a Digital Library of Mathematical Functions (DLMF) available online at http://dlmf.nist.gov. This features numerous hypertext links and extensive graphics. In the first few Sections, we will also introduce some useful mathematics, in addition to the above named special functions. This Chapter can be skipped by the reader, without loss of continuity. The relevant Sections can then be referred to as the need arises.

5.1 Gaussian functions A gaussian function (Fig. 5.1) is given by 1 2 2 g(x) = √ e−x /2σ . σ 2π The function is normalized, with ∞ −∞

Laplace (1778) proved that

∞

−∞

g(x) dx = 1.

e−x dx = 2

√

π.

Introduction to Quantum Mechanics. https://doi.org/10.1016/B978-0-12-822310-9.00013-6 Copyright © 2021 Elsevier Inc. All rights reserved.

(5.1)

(5.2)

(5.3) 71

72 Introduction to Quantum Mechanics

FIGURE 5.1 Normalized gaussian function.

This remarkable result can be obtained as follows. Denoting the integral by I , we can write I = 2

∞

−∞

e

−x 2

2

=

dx

∞

−∞

e−x dx 2

∞

−∞

e−y dy, 2

(5.4)

where the dummy variable y has been substituted for x in the last integral. The product of two integrals can be expressed as a double integral: I =

∞

2

−∞

∞

−∞

e−(x

2 +y 2 )

dx dy.

(5.5)

The differential dx dy represents an element of area in cartesian coordinates, with the domain of integration extending over the entire xy-plane. An alternative representation of the last integral can be expressed in plane polar coordinates r, θ . The two coordinate systems are related by x = r cos θ,

y = r sin θ,

(5.6)

so that r 2 = x2 + y2.

(5.7)

The element of area in polar coordinates is given by r dr dθ , so that the double integral becomes ∞ 2π 2 e−r r dr dθ. (5.8) I2 = 0

0

Integration over θ gives a factor 2π. The integral over r can be done after the substitution u = r 2 , du = 2r dr:

∞

e 0

−r 2

r dr =

1 2

0

∞

e−u du = 12 .

(5.9)

Special functions Chapter | 5

Therefore I 2 = 2π × general result is

1 2

73

and Laplace’s result (5.68) is proven. A slightly more

∞

−∞

e−αx dx = 2

π 1/2 α

(5.10)

,

√ obtained by scaling the variable x to α x. We also require definite integrals of the type ∞ 2 x n e−αx dx, n = 1, 2, 3 . . . , −∞

(5.11)

for computations involving harmonic oscillator wavefunctions. For odd n, these integrals all equal zero since the contributions from {−∞, 0} exactly cancel those from {0, ∞}. The following stratagem produces successive integrals for even n. Differentiate each side of (5.75) wrt the parameter α and cancel minus signs to obtain ∞ π 1/2 2 x 2 e−αx dx = . (5.12) 2 α 3/2 −∞ Differentiation under an integral sign is valid provided that the integrand is a continuous function. Differentiating again, we obtain ∞ 3 π 1/2 2 x 4 e−αx dx = . (5.13) 4 α 5/2 −∞ The general result is ∞ 1 · 3 · 5 · · · |n − 1| π 1/2 2 x n e−αx dx = , 2n/2 α (n+1)/2 −∞

n = 0, 2, 4 . . . .

(5.14)

Finite-interval integrations over a gaussian defines the error function x 2 2 e−t dt (5.15) erf(x) = √ π 0 and the related complementary error function 2 erfc(x) = 1 − erf(x) = √ π

∞

e−t dt. 2

(5.16)

x

5.2 The gamma function The gamma function is also most conveniently introduced by a definite integral. The following integral can be evaluated exactly, ∞ 1 e−αx dx = . (5.17) α 0

74 Introduction to Quantum Mechanics

The derivative of this integral with respect to α gives ∞ ∞ 1 d d 1 −αx −αx =− 2. e dx = (−x)e dx = dα 0 dα α α 0 We have therefore obtained a new definite integral: ∞ 1 xe−αx dx = 2 . α 0

(5.18)

(5.19)

Taking d/dα again we find

∞

x 2 e−αx dx =

0

2 . α3

Repeating the process n times ∞ 2 · 3 · 4···n x n e−αx dx = . α n+1 0

(5.20)

(5.21)

Setting α = 1, now that its job is done, we wind up a neat integral formula for n! ∞ x n e−x dx = n!. (5.22) 0

This is certainly not the most convenient way to evaluate n!, but suppose we replace n by a noninteger ν. In conventional notation, this defines the gamma function: ∞ (ν) ≡ x ν−1 e−x dx. (5.23) 0

When ν is an integer, this reduces to the factorial by the relation (n + 1) = n!

n = 0, 1, 2, 3 . . . .

FIGURE 5.2 The gamma function on the real axis.

(5.24)

Special functions Chapter | 5

75

For the case ν = 1/2,

∞ 1 x −1/2 e−x dx. = 2 0

(5.25)

The integral can be evaluated with a change of variables x = y 2 , dx = 2y dy giving ∞ √ 1 2 e−y dy = π, (5.26) =2 2 0 where we have recalled Laplace’s famous result from Eq. (5.3). Fig. 5.2 shows a plot of the gamma function. For x > 0, the function is a smooth interpolation between integer factorials. (x) becomes infinite for x = 0, −1, −2 · · · .

5.3

The Dirac deltafunction

The Dirac deltafunction can intuitively be understood as the limit of the density of a finite distribution of mass, say ρ(r), as the object is shrunken to a point mass at r = 0, in which case we write ρ(r) = δ(r). In one dimension, the deltafunction is the analog of the Kronecker delta 1 if m = n (5.27) δmn ≡ 0 if m = n, when the discrete variables n and m are replaced by continuous variables. A summation over Kronecker deltas reduces to a single term, such that ∞

fn δnm = fm .

(5.28)

n=0

The analog of this relation for continuous variables, with n → x and m → x0 , introduces the deltafunction δ(x − x0 ). We can take as the defining property ∞ f (x) δ(x − x0 ) dx = f (x0 ). (5.29) −∞

This also implies the normalization condition ∞ δ(x − x0 ) dx = 1. −∞

Evidently

δ(x − x0 ) ≡

0 if x = x0 ∞ if x = x0 .

(5.30)

(5.31)

76 Introduction to Quantum Mechanics

The approach to ∞ is sufficiently tame, however, that the integral has a finite value.

FIGURE 5.3 Dirac deltafunction with x0 = 0, as limit of normalized gaussian.

A simple representation for the deltafunction is the limit of a normalized gaussian as the standard deviation approaches zero: 1 2 2 δ(x − x0 ) = lim √ e−(x−x0 ) /2σ . σ →0 2π σ

(5.32)

This is shown pictorially in Fig. 5.3. The deltafunction is the limit of a function which becomes larger and larger in an interval which becomes narrower and narrower. An integral which reduces to a deltafunction is the following: ∞ ei(k−k )x dx = 2πδ(k − k ). (5.33) −∞

To prove this, consider the related integral

∞

−∞

e

−σ x 2 /2 i(k−k )x

e

√ dx =

2π − (k−k2 )2 e 2σ . σ

(5.34)

The right hand side can be written 2πg(k − k ), in terms of the normalized gaussian (5.1). By (5.32) the limit as σ → 0 gives the deltafunction, thus verifying (5.33). The free-particle eigenfunctions, considered in Sect. 3.1, can now be deltafunction orthonormalized. Defining 1 ψk (x) = √ eikx , 2π Eq. (5.33) determines the continuum orthonormalization relation ∞ ψk∗ (x)ψk (x) dx = δ(k − k ). −∞

(5.35)

(5.36)

Special functions Chapter | 5

77

Differentiation of a function at a finite discontinuity produces a deltafunction. Consider, for example, the Heaviside unit step function: H (x − x0 ) ≡

0 if x < x0 1 if x ≥ x0 .

(5.37)

Sometimes H (0) (for x = x0 ) is defined as 12 . The derivative of the Heaviside function H (x − x0 ) is clearly equal to zero when x = x0 . In addition ∞ H (x − x0 ) dx = H (∞) − H (−∞) = 1 − 0. (5.38) −∞

Thus, we find H (x − x0 ) = δ(x − x0 ).

(5.39)

There is related result for the sign function sgn(x) = ±1, for x > 0 and x < 0, respectively. We find sgn (x − x0 ) = 2δ(x − x0 ).

(5.40)

The deltafunction can be generalized to multiple dimensions. In three dimensions, the defining relation for a deltafunction can be expressed f (r) δ(r − r0 ) d 3 r = f (r0 ). (5.41) For example, the limit of a continuous distribution of electrical charge ρ(r) shrunken to a point charge q at r0 can be represented by ρ(r) = q δ(r − r0 ).

(5.42)

The potential energy of interaction between two continuous charge distributions is given by ρ1 (r1 )ρ2 (r2 ) 3 d r1 d 3 r2 . V= (5.43) |r2 − r1 | If the distribution ρ2 (r2 ) is reduced to a point charge q2 at r2 , this reduces to ρ1 (r1 ) 3 d r1 . (5.44) V = q2 |r2 − r1 | If the analogous thing then happens to ρ1 (r1 ), the formula reduces to the Coulomb potential energy between two point charges V=

q1 q2 , r12

where

r12 = |r2 − r1 |.

(5.45)

78 Introduction to Quantum Mechanics

A note on the deltafunction involving spherically symmetrical functions in spherical polar coordinates. Applying (5.41) to the case when r0 = 0 and f (r) = f (r), we can write ∞ 3 f (r) δ(r) d r = f (r)δ(r) 4πr 2 dr = f (0). (5.46) 0

But a 1-dimensional deltafunction should also produce the result ∞ 1 f (r)δ(r) dr = f (0), 2 0

(5.47)

with the factor 12 reflecting the fact that the deltafunction is located at one of the limits, so that, in effect, only half of it is within the range of integration. Comparing the last two integrals we can surmise the relationship between 1and 3-dimensional deltafunctions in spherical coordinates: δ(r) =

δ(r) . 2πr 2

(5.48)

An interesting identity involving the deltafunction in spherical polar coordinates is 1 (5.49) ∇ 2 = −4πδ(r). r To derive this result, we note that the variable r in spherical coordinates can have only non-negative values. Thus r might harmlessly be replaced by |r|. Accordingly, we can write 1 d 21 2 d 1 r . (5.50) ∇ = 2 r dr |r| r dr Working through the steps in the above formula, we find r2

d 1 d = −r 2 r −2 |r| = −sgn(r). dr |r| dr

(5.51)

Finally, 1 d 2 (5.52) sgn(r) = − 2 δ(r) = −4πδ(r). r 2 dr r Poisson’s equation (1.23) applied to the potential of a point charge gives −

∇ 2 (r) = ∇ 2

q = −4πqδ(r), r

showing the density of a point charge ρ(r) = qδ(r).

(5.53)

Special functions Chapter | 5

79

5.4 Leibniz’s formula In some of the following work, we will make use of a formula for the nth derivative of the product of two functions. This can be derived stepwise as follows:

d f (x)g(x) = f (x)g(x) + f (x)g (x), dx

d2 f (x)g(x) = f (x)g(x) + 2f (x)g (x) + f (x)g (x), dx 2

d3 f (x)g(x) = f (x)g(x) + 3f (x)g (x) + 3f (x)g (x) + f (x)g (x), dx 3 .... (5.54) Clearly, we are generating a series containing the binomial coefficients n m

=

n! m!(n − m)!

m = 0, 1 . . . n

(5.55)

and the general result is Leibniz’s formula

n dn n f (n−m) (x)g (m) (x) = f (x)g(x) = n dx m m=0

f (n) (x)g(x) + nf (n−1) (x)g (x) +

n(n − 1) (n−2) f (x)g (x) + . . . . 2

(5.56)

5.5 Hermite polynomials The Schrödinger equation for the quantum-mechanical harmonic oscillator can be reduced to a second-order differential equation with nonconstant coefficients: ψ (x) + (λ − x 2 )ψ(x) = 0.

(5.57)

A useful first step is to determine the asymptotic solution to this equation, giving the form of ψ(x) as x → ±∞. For sufficiently large values of |x|, x 2 >> λ, so that the differential equation can be approximated by ψ (x) − x 2 ψ(x) ≈ 0.

(5.58)

This suggests the following manipulation:

d d d2 2 −x + x ψ(x) ≈ 0. − x ψ(x) ≈ dx dx dx 2

(5.59)

80 Introduction to Quantum Mechanics

Now, the first-order differential equation ψ (x) + xψ(x) ≈ 0

(5.60)

can be solved exactly to give ψ(x) ≈ const e−x

2 /2

for

|x| → ∞.

(5.61)

To build in this asymptotic behavior, let ψ(x) = H (x) e−x

2 /2

.

(5.62)

This reduces Eq. (5.57) to a differential equation for H (x): H (x) − 2xH (x) + (λ − 1)H (x) = 0.

(5.63)

To construct a solution to Eq. (5.63), we begin with the function u(x) = e−x , 2

(5.64)

which is clearly the solution of the first-order differential equation u (x) + 2x u(x) = 0.

(5.65)

Differentiating this equation (n + 1) times using Leibniz’s formula (5.56), we obtain w (x) + 2x w (x) + 2(n − 1)w(x) = 0,

(5.66)

d nu dn 2 2 = n e−x = H (x)e−x . n dx dx

(5.67)

where w(x) = We find that H (x) satisfies H (x) − 2x H (x) + 2n H (x) = 0,

(5.68)

which is known as Hermite’s differential equation. The solutions in the form Hn (x) = (−)n ex

2

d n −x 2 e dx n

(5.69)

are known as Hermite polynomials, the first few of which are enumerated below: H0 (x) = 1, H1 (x) = 2x, H2 (x) = 4x 2 − 2, 3 H4 (x) = 16x 4 − 48x 2 + 12. H3 (x) = 8x − 12x,

(5.70)

Comparing Eq. (5.68) with Eq. (5.63), we can relate the parameters λ − 1 = 2n,

(5.71)

Special functions Chapter | 5

81

which, as we will see in the following Chapter, leads to a general formula for the energy eigenvalues. A generating function for Hermite polynomials is given by ex

2 −(t−x)2

2

= e2tx−t =

∞ Hn (x) n t . n!

(5.72)

n=0

Using the generating function, we can evaluate integrals over products of Hermite polynomials, such as ∞ √ 2 Hn (x)Hn (x)e−x dx = 2n n! π δn,n . (5.73) −∞

Thus, the functions √ 2 ψn (x) = (2n n! π)−1/2 Hn (x) e−x /2 , form an orthonormal set with ∞ −∞

n = 0, 1, 2 . . .

ψn (x) ψn (x) dx = δn,n .

(5.74)

(5.75)

5.6 Spherical polar coordinates Most three-dimensional applications of quantum mechanics, particularly to atomic and molecular systems, make use of the spherical polar coordinate system. This is the most natural coordinate system for treating problems of spherical symmetry and for consideration of angular momentum. Following is a brief review of spherical polar coordinates, sufficient for our purposes.

FIGURE 5.4 Spherical polar coordinates.

The position of an arbitrary point r in three-dimensional space is described by the coordinates r, θ, φ, as shown in Fig. 5.4. As can readily be deduced from

82 Introduction to Quantum Mechanics

the figure, these are connected to cartesian coordinates by the relations x = r sin θ cos φ, y = r sin θ sin φ, z = r cos θ.

(5.76)

Spherical coordinates are closely analogous to the geographical coordinate system, which locates points by latitude, longitude and altitude. Referring to the familiar globe of the world, r represents the distance from the center of the globe, with the range 0 ≤ r ≤ ∞. The azimuthal angle θ is the angle between the vector r and the z-axis or North Pole, with the range 0 ≤ θ ≤ π. Thus θ = 0 points to the North Pole, θ = π, to the South Pole and θ = π/2 runs around the Equator. The circles of constant θ on the surface of a sphere are analogous to the parallels of latitude on the globe (although the geographic conventions are different, with the equator at 0◦ latitude, while the poles are at 90◦ N and S latitude). The polar angle φ measures the rotation of the vector r around the z-axis, with 0 ≤ φ < 2π, counterclockwise from the x-axis. The loci of constant φ on the surface of a sphere are great circles through both poles. These clearly correspond to meridians in the geographic specification of longitude (measured in degrees, 0◦ to 180◦ E and W of the Greenwich Meridian). The radial variable r represents the distance from r to the origin, the length of the vector r: (5.77) r = x 2 + y 2 + z2 . The volume element in spherical polar coordinates is given by dτ = r 2 sin θ dr dθ dφ,

(0 ≤ r ≤ ∞, 0 ≤ θ ≤ π, 0 ≤ φ ≤ 2π ) ,

(5.78)

and represented graphically by the distorted cube in Fig. 5.5.

FIGURE 5.5 Volume element in spherical polar coordinates.

Integration of a function over all space in spherical coordinates takes the form: ∞ π 2π f (r, θ, φ) r 2 sin θ dr dθ dφ. (5.79) 0

0

0

Special functions Chapter | 5

83

For a spherically-symmetric function f (r), independent of θ or φ, this integral simplifies to ∞ f (r) 4πr 2 dr, (5.80) 0

as implied by the division of space into spherical shells of area 4πr 2 and thickness dr. Laplace’s equation in spherical polar coordinates can be written ∇ 2 (r, θ, φ) =

∂ 1 ∂ 2 ∂ 1 ∂ 1 ∂ 2 = 0. r + 2 sin θ + 2 2 ∂θ r ∂r ∂r r sin θ ∂θ r 2 sin θ ∂φ 2 (5.81)

We consider separable solutions (r, θ, φ) = R(r)Y (θ, φ),

(5.82)

where the functions Y (θ, φ) are known as a spherical harmonics. Substitution of (5.82) into (5.81), followed by division by R(r)Y (θ, φ) and multiplication by r 2 , separates the radial and angular variables:

1 ∂Y 1 ∂ 2Y 1 ∂ 1 2 R (r) + 2rR (r) = 0. sin θ + 2 r + Y (θ, φ) sin θ ∂Y ∂θ R(r) sin θ ∂φ 2 (5.83) This can hold true for all values of r, θ, φ only if each of the two parts of this partial differential equation equals a constant. Let the first part equal −λ and the second equal λ. The ordinary differential equation for r becomes r 2 R (r) + 2rR(r) − λR(r) = 0.

(5.84)

It is easily verified that the general solution is R(r) = c1 r + c2 r −−1

where

λ = ( + 1).

(5.85)

The spherical harmonics evidently satisfy a partial differential equation in two variables: ∂Y 1 ∂ 2Y 1 ∂ + λY = 0. sin θ + 2 sin θ ∂θ ∂θ sin θ ∂φ 2

(5.86)

Again we assume a separable solution with Y (θ, φ) = (θ)(φ).

(5.87)

84 Introduction to Quantum Mechanics

Substituting (5.87) into (5.86), dividing by (θ)(φ) and multiplying by sin2 θ , we achieve separation of variables: d 1 d 2 sin θ d + λ sin2 θ = 0. sin θ + (θ ) dθ dθ (φ) dφ 2

(5.88)

Setting the term containing φ to a constant −m2 , we obtain the familiar differential equation (φ) + m2 (φ) = 0. Solutions periodic in 2π can be chosen as 1 imφ m (φ) = with m = 0, ±1, ±2 . . . . e 2π

(5.89)

(5.90)

Alternative solutions are sin mφ and cos mφ.

5.7

Legendre polynomials

Separation of variables in Laplace’s equation leads to an ordinary differential equation for the function (θ ):

d m2 1 d sin θ − 2 + λ (θ ) = 0. sin θ dθ dθ sin θ

(5.91)

Let us first consider the case m = 0 and define a new independent variable x = cos θ

with P (x) = (θ ).

(5.92)

This transforms Eq. (5.91) to (1 − x 2 )P (x) − 2x P (x) + λP (x) = 0,

(5.93)

which is known as Legendre’s differential equation. We can construct a solution to this linear second-order equation by exploiting Leibniz’s formula (5.56). Begin with the function u = (1 − x 2 ) ,

(5.94)

which is a solution of the first-order equation (1 − x 2 )u (x) + 2x u(x) = 0.

(5.95)

Differentiating ( + 1) times, we obtain (1 − x 2 )P (x) − 2x P (x) + ( + 1)P (x) = 0,

(5.96)

Special functions Chapter | 5

85

where d d u = (1 − x 2 ) . dx dx This is a solution of Eq. (5.93) for P (x) =

λ = ( + 1)

(5.97)

= 0, 1, 2, . . . .

(5.98)

With a choice of constant such that P (1) = 1, the Legendre polynomials can be defined by Rodrigues’ formula: P (x) =

1 2 !

d (1 − x 2 ) . dx

(5.99)

Reverting to the original variable θ , the first few Legendre polynomials are P0 (cos θ ) = 1,

1 P2 (cos θ ) = (3 cos2 θ − 1), 2 1 P4 (cos θ ) = (35 cos4 θ − 30 cos2 θ + 3). 8 (5.100)

P1 (cos θ ) = cos θ,

1 P3 (cos θ ) = (5 cos3 θ − 3 cos θ ), 2

The Legendre polynomials obey the orthonormalization relations

+1 −1

P (x)P (x) dx =

π

P (cos θ )P (cos θ ) sin θ dθ =

0

2 δ, . 2 + 1 (5.101)

A generating function for Legendre polynomials is given by

1 − 2tx + t 2

−1

=

∞

P (x) t .

(5.102)

=0

Returning to Eq. (5.91) for arbitrary values of m, the analog of Eq. (5.93) can be written

m2 (1 − x 2 )P (x) − 2x P (x) + ( + 1) − P (x) = 0. (5.103) 1 − x2 The solutions are readily found to be Pm (x) = (1 − x 2 )|m|/2

d |m| P (x), dx |m|

(5.104)

known as associated Legendre functions. These reduce to the Legendre polynomials (5.99) when m = 0. Since P (x) is a polynomial of degree , |m| is

86 Introduction to Quantum Mechanics

limited to the values 0, 1, 2 . . . . The associated Legendre functions obey the orthonormalization relations +1 π Pm (x)Pm (x) dx = Pm (cos θ )Pm (cos θ ) sin θ dθ −1

0

=

2 ( + |m|)! δ, . 2 + 1 ( − |m|)!

The orthonormalized solutions to Eq. (5.91) are thus given by

2 + 1 ( − |m|)! 1/2 m m (θ ) = P (cos θ ). 2 ( + |m|)!

(5.105)

(5.106)

5.8 Spherical harmonics Combining the above functions of θ and φ, we obtain the spherical harmonics (which have been used in applied mathematics long before quantum mechanics).

2 + 1 ( − |m|)! 1/2 m Ym (θ, φ) = m m (θ ) m (φ) = m P (cos θ )eimφ , 4π ( + |m|)! (5.107) where

m =

1 (−1)m

for m ≤ 0 for m > 0.

(5.108)

The factor m is appended in applications to quantum mechanics, in accordance with the Condon and Shortley phase convention. Two important special cases are 2 + 1 (5.109) P (cos θ ) Y0 (θ, φ) = 4π and (−) (2 + 1) iφ Y (θ, φ) = (5.110) sin θ e . 2 ! 4π Following is a listing of the spherical harmonics Ym (θ, φ) through = 2, which will be sufficient for our applications: 1 3 , Y10 = cos θ, Y00 = 4π 4π 3 5 (5.111) sin θ e±iφ , Y20 = (3 cos2 θ − 1), Y1±1 = ∓ 4π 16π 15 15 cos θ sin θ e±iφ , Y2±2 = sin2 θ e±2iφ . Y2±1 = ∓ 8π 32π

Special functions Chapter | 5

87

A graphical representation of these functions is given in Fig. 5.6. Surfaces of constant absolute value are drawn, with intermediate shadings representing differing complex values of the functions.

FIGURE 5.6 Contours of spherical harmonics as three-dimensional polar plots.

The spherical harmonics are an orthonormal set wrt integration over solid angle: 0

π

0

2π

Y∗ m (θ, φ)Ym (θ, φ) sin θ dθ dφ = δ δmm .

(5.112)

Linear combinations of Ym and Y−m contain the real functions: Pm (cos θ )

sin mφ cos mφ.

(5.113)

These are called tesseral harmonics since they divide the surface of a sphere into tesserae—four-sided figures bounded by nodal lines of latitude and longitude.

5.9 Laguerre polynomials In solving the quantum-mechanical problem of the hydrogen atom, we will encounter Laguerre’s differential equation: xy (x) + (1 − x)y (x) + ny(x) = 0.

(5.114)

Following the strategy used to solve the Hermite and Legendre differential equations, we begin with a function u(x) = x n e−x ,

(5.115)

88 Introduction to Quantum Mechanics

where n is an integer. This satisfies the first-order differential equation xu (x) + (x − n)u(x) = 0.

(5.116)

Differentiating this equation n + 1 times using Leibniz’s formula (5.56), we obtain xw (x) + (1 − x)w (x) + nw(x) = 0,

(5.117)

where w(x) =

d n n −x (x e ). dx n

(5.118)

We express this as w(x) = e−x L(x),

(5.119)

where L(x) is a Laguerre polynomial. Multiplying by a constant such that Ln (0) = 0 for all n, the Laguerre polynomials can be defined by Rodrigues’ formula: Ln (x) =

ex d n n −x (x e ). n! dx n

(5.120)

A generalization of Eq. (5.114) xy (x) + (α + 1 − x)y (x) + ny(x) = 0

(5.121)

has solutions known as associated Laguerre polynomials, given by the Rodrigues’ formula Lαn (x) =

x −α ex d n (x n+α e−x ). n! dx n

(5.122)

Laguerre and associated Laguerre polynomials can be determined using the following generating functions: ∞ xt 1 Ln (x) t n , exp − = (1 − t) 1−t n=0 ∞ 1 xt exp − Lαn (x) t n . = 1−t (1 − t)α+1

(5.123)

(5.124)

n=k

From these, one can derive the orthonormalization relations ∞ (n + α + 1) e−x x α Lαn (x)Lαm (x) dx = δnm . n! 0

(5.125)

Special functions Chapter | 5

5.10

89

Series solutions of differential equations

A very general method for obtaining solutions to second-order differential equations is to expand y(x) in a power series and then evaluate the coefficients term by term. In the majority of quantum mechanics textbooks, this is the method by which Hermite, Legendre and Laguerre functions are derived. Although we have made use of more direct ad hoc methods of creating the differential equations for these special functions, it is worthwhile to introduce the series solution method. We will, in fact, use it to as our approach to Bessel functions. First we will illustrate the method with a trivial example which we have already solved, namely the equation with constant coefficients: y (x) + k 2 y(x) = 0.

(5.126)

Assume that y(x) can be expanded in a power series about x = 0: y(x) =

∞

an x n .

(5.127)

n=0

The first derivative is given by y (x) =

∞

nan x n−1

n→n+1

−→

n=1

∞

(n + 1)an+1 x n .

(5.128)

n=0

We have redefined the summation index in order to retain the dependence on x n . Analogously, y (x) =

∞

n→n+2

n(n − 1)an x n−2 −→

n=2

∞

(n + 2)(n + 1)an+2 x n .

(5.129)

n=0

Eq. (5.126) then implies ∞

(n + 2)(n + 1)an+2 + k 2 an x n = 0.

(5.130)

n=0

Since this is true for all values of x, every quantity in square brackets must equal zero. This leads to the recursion relation an+2 = −

k2 an . (n + 2)(n + 1)

(5.131)

Let a0 be treated as one constant of integration. We then find a2 = −

k2 a0 , 2·1

a4 = −

k2 k4 a2 = + a0 , 4·3 4!

··· .

(5.132)

90 Introduction to Quantum Mechanics

It is convenient to rewrite the coefficient a1 as ka1 . We find thereby a3 = −

k3 a1 , 3·2

a5 = +

k5 a1 , 5!

··· .

(5.133)

The general power-series solution of the differential equation is thus given by

k2x 2 k4x 4 k3x 3 k5x 5 y(x) = 1 − + − · · · a0 + x − + − · · · a1 , 2! 4! 3! 5! (5.134) which is recognized as the expansion for y(x) = a0 cos kx + a1 sin kx.

(5.135)

We consider next the more general case of a linear homogeneous secondorder differential equation with nonconstant coefficients: y (x) + p(x)y (x) + q(x)y(x) = 0.

(5.136)

If the functions p(x) and q(x) are both finite at x = x0 , then x0 is called a regular point of the differential equation. If either p(x) or q(x) diverges as x → x0 , then x0 is called a singular point. If both (x − x0 )p(x) and (x − x0 )2 q(x) have finite limits as x → x0 , then x0 is called a regular singular point or nonessential singularity. If either of these limits continues to diverge, the point is an essential singularity. For regular singular points, a series solution of the differential equation can be obtained by the method of Frobenius. This is based on the following generalization of the power series expansion: y(x) = x α

∞ k=0

ak x k =

∞

ak x k+α .

(5.137)

k=0

The derivatives are then given by

y (x) =

∞

(α + k)ak x k+α−1

(5.138)

(α + k)(α + k − 1)ak x k+α−2 .

(5.139)

k=0

and y (x) =

∞ k=0

The possible values of α are obtained from the indicial equation, which is based on the presumption that a0 is the first nonzero coefficient in the series (5.137).

Special functions Chapter | 5

5.11

91

Bessel functions

The solutions of Bessel’s differential equation: x 2 y (x) + xy (x) + (x 2 − n2 )y(x) = 0

(5.140)

are one of the classics of mathematical physics. Bessel’s equation occurs, in particular, in a number of applications involving cylindrical coordinates. Dividing the standard form (5.140) by x 2 shows that x = 0 is a regular singular point of Bessel’s equation. The method of Frobenius is thus applicable. Substituting the power series expansion (5.137) into (5.140), we obtain ∞

(α + k)(α + k − 1)ak + (α + k)ak + ak−2 − n2 ak x k+α = 0.

(5.141)

k=0

This leads to the recursion relation ak−2 = −[(α + k)(α + k − 1) + (α + k) − n2 ]ak .

(5.142)

Setting k = 0 in the recursion relation and noting that a−2 = 0 (a0 is the first nonvanishing coefficient), we obtain the indicial equation α(α − 1) + α − n2 = 0.

(5.143)

The roots are α = ±n. With the choice α = n, the recursion relation simplifies to ak−2 ak = − . (5.144) k(2n + k) Since a−1 = 0, a1 = 0 (assuming n = − 12 ). Likewise a3 , a5 , · · · = 0, as do all odd ak . For even k, we have a2 = −

a0 , 2(2n + 2)

a4 = −

a2 a0 = . 4(2 + 4n) 2 · 4(2n + 2)(2n + 4)

(5.145)

For n = 0, 1, 2 . . . , the coefficients can be represented by a2k = (−)k

n!a0 . 22k k!(n + k)!

(5.146)

From now on, we will use the compact notation (−)k ≡ (−1)k .

(5.147)

Setting a0 = 1/2n n!, we obtain the conventional definition of a Bessel function of the first kind: x n

(x/2)4 (x/2)2 Jn (x) = + − ··· . (5.148) 1− 2 2!(n + 2)! 4!(n + 4)!

92 Introduction to Quantum Mechanics

The first three Bessel functions are plotted in Fig. 5.7. Their general behavior can be characterized as damped oscillation. The Bessel functions are normalized in the sense that ∞ Jn (x) dx = 1. (5.149) 0

FIGURE 5.7 Bessel functions of the first kind Jn (x).

Bessel functions can be generalized for noninteger index ν, as follows: Jν (x) =

x ν

(x/2)4 (x/2)2 + − ··· . 1− 2 2!(ν + 3) 4!(ν + 5)

(5.150)

The general solution of Bessel’s differential equation for noninteger ν is given by y(x) = c1 Jν (x) + c2 J−ν (x).

(5.151)

J−n (x) = (−)n Jn (x),

(5.152)

For integer n, however,

so that J−n (x) is not a linearly independent solution. We can construct a second solution by defining Yν (x) =

cos νπJν (x) − J−ν (x) . sin νπ

In the limit as ν approaches an integer n, we obtain

1 ∂ n ∂ Yn (x) = Jn (x) − (−1) J−n (x) . π ∂n ∂n

(5.153)

(5.154)

This defines a Bessel function of the second kind, also known as a Neumann function. The computational details are horrible, but fortunately, mathematicians have worked them all out for us and these functions have been extensively tabulated. Fig. 5.8 shows the first three functions Yn (x).

Special functions Chapter | 5

93

The limiting behavior of the Yn (x) as x → 0 is apparent from the leading term Jn (x) ≈ (x/2)n . Using the definition (5.154) we find 2 x (5.155) Jn (x) + complicated series. Yn (x) = ln π 2 Fig. 5.8 shows the logarithmic singularities as x → 0.

FIGURE 5.8 Bessel functions of the second kind Yn (x).

Bessel functions of integer order can be determined from expansion of a generating function:

∞ 1 x Jn (x)t n . t− = exp 2 t n=−∞

(5.156)

To see how this works, expand the product of the two exponentials:

x x 1 x 2 −2 1 x 2 2 t + · · · 1 − t −1 + t − ··· . ext/2 e−x/2t = 1 + t + 2 2! 2 2 2! 2 (5.157) The terms which contribute to t 0 are x 2 1 x 4 1− + − ··· , 2 (2!)2 2

(5.158)

which gives the expansion for J0 (x). The expansion for J1 (x) is found from the terms proportional to t 1 and so forth. Asymptotic forms for the Bessel function for small and large values of the argument are given by 1 x n for x → 0 (5.159) Jn (x) ≈ n! 2 and

Jn (x) ≈

2 π cos x − (2n + 1) πx 4

for

x >> n.

(5.160)

94 Introduction to Quantum Mechanics

Bessel functions of the second kind show the following limiting behavior: Y0 (x) ≈

2 ln x, π

Yn (x) ≈ −

2n (n − 1)! −n x (n > 0) π

for

x→0 (5.161)

and

Yn (x) ≈

π 2 sin x − (2n + 1) πx 4

for x >> n.

(5.162)

The two kinds of Bessel functions thus have the asymptotic dependence of slowly damped cosines and sines. In analogy with Euler’s formula e±ix = cos x ± i sin x, we define Hankel functions of the first and second kind: Hn(1) (x) = Jn (x) + iYn (x)

and Hn(2) (x) = Jn (x) − iYn (x).

(5.163)

The asymptotic forms of the Hankel functions is given by Hn(1,2) (x) ≈

2 ±i x−(2n+1)π/4 e πx

for x >> n,

(5.164)

having the character of waves propagating to the right and left, respectively. Trigonometric functions with imaginary arguments suggested the introduction of hyperbolic functions: sinh x = −i sin ix and cosh x = cos ix. Analogously, we can define modified Bessel functions (or hyperbolic Bessel functions) of the first and second kind as follows: In (x) = i −n Jn (ix)

(5.165)

and Kn (x) =

π n+1 π Jn (ix) + iYn (ix) = i n+1 Hn(1) (ix). i 2 2

(5.166)

Their asymptotic forms for large x are ex In (x) ≈ √ 2πx

and Kn (x) ≈

π −x e 2x

for x >> n.

(5.167)

We have given just a meager sampling of formulas involving Bessel functions. Many more can be found in Abramowitz & Stegun and other references. G. N. Whittaker’s classic A Treatise on the Theory of Bessel Functions (Cambridge University Press, 1952) is a ponderous volume devoted entirely to the subject.

Special functions Chapter | 5

5.12

95

Spherical Bessel functions

A number of problems in spherical polar coordinates lead to a differential equation of the form

(5.168) x 2 f (x) + 2xf (x) + x 2 − n(n + 1) f (x) = 0. With the substitution f (x) = x −1/2 F (x), the equation reduces to

x 2 F (x) + xF (x) + x 2 − (n + 12 )2 F (x) = 0,

(5.169)

which, for integer n, is recognized as Bessel’s equation of odd-half order 1 3 5 2 , 2 , 2 . . . . The linearly-independent solutions are Jn+1/2 (x) and Yn+1/2 (x), the latter being proportional to J−n−1/2 (x) since the order is not an integer. The solutions to Eq. (5.168) are known as spherical Bessel functions and conventionally defined by π (5.170) Jn+1/2 (x) jn (x) = 2x and

yn (x) =

π Yn+1/2 (x) = (−)n+1 2x

π J−n−1/2 (x). 2x

(5.171)

You can verify that Eq. (5.168) with n = 0 has the simple solutions sin x/x and cos x/x. In fact, spherical Bessel functions have closed-form expressions in terms of trigonometric functions and powers of x, given by jn (x) = (−x)n

1 d x dx

n

sin x , x

yn (x) = −(−x)n

1 d x dx

n

cos x . (5.172) x

Explicit formulas for the first few spherical Bessel functions follow: sin x sin x cos x 3 sin x 3 cos x , , j1 (x)= 2 − , j2 (x)= 2 −1 − x x x x x x2 cos x cos x sin x 3 cos x 3 sin x , y1 (x)=− 2 − , y2 (x)= − 2 +1 − 2 . y0 (x)=− x x x x x x (5.173) j0 (x)=

There are also spherical analogs of the Hankel functions: hn(1,2) (x) =

π (1,2) (x) = jn (x) ± iyn (x). H 2x n+1/2

(5.174)

96 Introduction to Quantum Mechanics

The first few are (1,2)

h0

(x) = ∓

e±ix x ±i x 2 ± 3ix −3 ±ix (1,2) (1,2) e . , h1 (x) = − 2 e±ix , h2 (x) = ±i x x x3 (5.175)

Supplement 5A. Particle in a disk With knowledge of Bessel functions, we can derive an exact solution for the two-dimensional problem of a particle confined within a disk. The Schrödinger equation, most naturally expressed in polar coordinates, has the form −

2 (∇ 2 ψ(r, θ ) + V (r)ψ(r, θ ) = Eψ(r, θ ), 2M

(5.176)

with the potential function V (r) =

r ≤ r0 r > r0 .

0 ∞

(5.177)

With the definition k 2 = 2ME/2

(5.178)

this takes the form of the Helmholtz equation for r ≤ r0 (∇ 2 + k 2 )ψ(r, θ ) = 0. Using the form of the Laplacian in polar coordinates, this becomes 1 ∂ ∂ψ 1 ∂ 2ψ r + 2 2 + k 2 ψ(r, θ ) = 0. r ∂r ∂r r ∂θ

(5.179)

(5.180)

This is subject to the boundary condition ψ(r0 , θ ) = 0. Assuming a separable solution ψ(r, θ ) = R(r)(θ ), Helmholtz’s equation reduces to R (r)(θ ) + r −1 R (r)(θ ) + r −2 R(r) (θ ) + k 2 R(r)(θ ) = 0 or R (r) + r −1 R (r) + r −2 R(r)

(θ ) + k 2 R(r) = 0. (θ )

(5.181)

(5.182)

We complete a separation of variables by dividing by r −2 R(r). The result implies that the function of θ in square brackets equals a constant, which we can write (θ ) = −m2 ⇒ (θ ) + m2 (θ ) = 0. (5.183) (θ )

Special functions Chapter | 5

97

This is a familiar equation, with linearly-independent solutions (θ ) =

sin mθ cos mθ.

(5.184)

Since θ is an angular variable, the periodicity (θ ± 2nπ) = (θ ) is required. This restricts the parameter m to integer values. Thus we obtain an ordinary differential equation for R(r): R (r) + r −1 R (r) − m2 r −2 R(r) + k 2 R(r) = 0.

(5.185)

Changing the variable to x ≡ kr, with J (x) ≡ R(r), we can write x 2 J (x) + xJ (x) + (x 2 − m2 )J (x) = 0.

(5.186)

This is recognized as Bessel’s equation (5.140). Only the Bessel functions Jm (kr) are finite at r = 0, the Neumann functions Ym (kr) being singular there. Thus the solutions to Helmholtz’s equation are sin mθ ψ(r, θ ) = Jm (kr) cos mθ

m = 1, 2, 3 . . . m = 0, 1, 2 . . . .

(5.187)

The boundary condition at r = r0 requires that Jm (kr0 ) = 0.

(5.188)

The zeros of Bessel functions are extensively tabulated. Let xmn represent the nth root of Jm (x) = 0. A tabulation of the first few zeros follows: n 1 2 3

x0n 2.4048 5.5201 8.6537

x1n 3.8317 7.0156 10.1735

x2n 5.1356 8.4172 11.6198

The corresponding values of k are given by kmn = xmn /r0 .

(5.189)

Contour plots for the first few wavefunctions ψnm (r, θ ), labeled by the quantum numbers m and n, are sketched in Fig. 5.9. The states with m > 0 are two-fold degenerate, corresponding to the factors cos mθ and sin mθ . Only one of the degenerate pair is shown. The second would be obtained by a rotation of π/2m. Incidentally, an identical diagram would pertain to modes of vibration (m, n) of a circular membrane, rigidly fixed on a circular boundary of radius r0 .

98 Introduction to Quantum Mechanics

FIGURE 5.9 Several eigenfunctions ψmn (r, θ) for the particle in a disk. The functions are positive in blue areas, negative in white. Rows show m = 0, 1, 2, columns n = 1, 2, 3.

Supplement 5B. Particle in an infinite spherical well A particle of mass in an infinite spherical potential well of radius is described by the Schrödinger equation −

2 2 ∇ ψ(r, θ, φ) + V (r)ψ(r, θ, φ) = Eψ(r, θ, φ), 2M

where

V (r) =

0 ∞

r ≤ r0 r > r0 .

(5.190)

(5.191)

The wavefunction is separable in spherical polar coordinates, such that ψ(r, θ, φ) = R(r)Ym (θ, φ),

(5.192)

where Y is a spherical harmonic. With the definition k 2 = 2mE/2

(5.193)

Special functions Chapter | 5

99

the radial equation reduces to 2 ( + 1) R(r) = 0, R (r) + R (r) + k 2 − r r2

(5.194)

subject to the boundary condition R(r0 ) = 0. With the scaled radial variable x = kr, this reduces to the equation for a spherical Bessel function (5.168)

(5.195) x 2 R (x) + 2xR (x) + x 2 − ( + 1) R(x) = 0. Thus R(r) = const j (kr). The boundary condition is fulfilled when kr0 is the zero of the spherical Bessel function j (x). Let xn represent the nth root of j (x) = 0. A tabulation of the first few zeros follows: n 1 2 3

x0n 3.142 6.283 9.425

x1n 4.493 7.725 10.904

x2n 5.763 10.904 12.323

The corresponding values of k are kn = xn /r0 . The quantized energy levels are then given by En =

2 2 kn

2Mr02

.

(5.196)

Since these are independent of the quantum number m, they are (2 + 1)-fold degenerate. Fig. 5.10 shows contour plots on a cross section of the sphere for the lowerenergy eigenfunctions with n = 1, 2, 3 and = 0, 1, 2, with m = 0. The wavefunction is positive in the blue regions and negative in the white regions.

Supplement 5C. Particle in a deltafunction well A simple one-dimensional problem is a particle in a deltafunction potential well, described by the Schrödinger equation (in atomic units): 1 − ψ (x) − Zδ(x)ψ(x) = Eψ(x). 2

(5.197)

Since the deltafunction has the same dimensions as the function 1/|x|, this problem might be considered a one-dimensional analog of a hydrogenlike atom. For a bound state, E < 0, and it is convenient to write E = −κ 2 /2. For x = 0, the Schrödinger equation reduces to ψ0 (x) − κ 2 ψ0 (x) = 0,

(5.198)

which has the solutions ψ0 (x) = e±κx . To get the correct behavior as x → ±∞, we choose, for x > 0, e−κx , while for x < 0, eκx . These can be combined as

100 Introduction to Quantum Mechanics

FIGURE 5.10 Spherical-well eigenfunctions ψn 0 (r, θ, φ) in plane θ = π/2. Rows show = 0, 1, 2, columns n = 1, 2, 3.

a single function ψ0 (x) = e−κ|x| . We next substitute this form into the original Schrödinger equation. We need to calculate the second derivative of this function. Note first that d|x| d −κ|x| = −κ e−κ|x| e . dx dx Clearly

d|x| dx

(5.199)

= sgn(x), equal to ±1 for x > 0 and x < 0, respectively. Thus d −κ|x| = −κ sgn(x)e−κ|x| e dx

(5.200)

d 2 −κ|x| e = κ 2 sgn(x)2 e−κ|x| − κ e−κ|x| sgn (x). dx 2

(5.201)

and

Using the relation sgn (x) = 2δ(x), the Schrödinger equation (5.197) is satisfied when 1 − κ 2 + κ δ(x) − Zδ(x) = E. 2

(5.202)

Special functions Chapter | 5

101

Thus we identify κ = Z and E = − 12 κ 2 = −Z 2 /2. This is the only bound solution. Remarkably, the ground-state eigenfunction e−Zx is identical to a crosssection of a hydrogenic 1s orbital e−Zr , with the same energy E = −Z 2 /2.

Chapter 6

The harmonic oscillator The harmonic oscillator is a model which has several important applications in both classical and quantum mechanics. It serves as a prototype in the mathematical treatment of such diverse phenomena as elasticity, acoustics, AC circuits, molecular and crystal vibrations, electromagnetic fields and optical properties of matter.

6.1 Classical oscillator A simple realization of the harmonic oscillator in classical mechanics is a particle which is acted upon by a restoring force proportional to its displacement from its equilibrium position. Considering motion in one dimension, this means F = −k x.

(6.1)

FIGURE 6.1 Spring obeying Hooke’s law.

Such a force might originate from a spring which obeys Hooke’s law, as shown in Fig. 6.1. According to Hooke’s law, which applies to real springs for sufficiently small displacements, the restoring force is proportional to the displacement—either stretching or compression—from the equilibrium position. The force constant k is a measure of the stiffness of the spring. The variable x is chosen equal to zero at the equilibrium position, positive for stretching, negative for compression. The negative sign in (6.1) reflects the fact that F is a restoring force, always in the opposite sense to the displacement x. Applying Newton’s second law to the force from Eq. (6.1), we find F =m

d 2x = −kx, dx 2

Introduction to Quantum Mechanics. https://doi.org/10.1016/B978-0-12-822310-9.00014-8 Copyright © 2021 Elsevier Inc. All rights reserved.

(6.2) 103

104 Introduction to Quantum Mechanics

where m is the mass of the body attached to the spring, which is itself assumed massless. This leads to a differential equation of familiar form, although with different variables: x(t) ¨ + ω2 x(t) = 0,

ω2 ≡ k/m.

(6.3)

The dot notation (introduced by Newton himself) is used in place of primes when the independent variable is time. The general solution to (6.3) is x(t) = A sin ωt + B cos ωt,

(6.4)

which represents periodic motion with a sinusoidal time dependence. This is known as simple harmonic motion and the corresponding system is known as a harmonic oscillator. The oscillation occurs with a constant angular frequency k ω= radians per second. (6.5) m This is called the natural frequency of the oscillator. The corresponding circular frequency in hertz (cycles per second) is k 1 ω = Hz. (6.6) ν= 2π 2π m The general relation between force and potential energy in a conservative system in one dimension is dV F =− . (6.7) dx Thus the potential energy of a harmonic oscillator is given by V (x) =

1 2 kx , 2

(6.8)

which has the shape of a parabola, as drawn in Fig. 6.2. A simple computation shows that the oscillator moves between positive and negative turning points 2 while the ±xmax where the total energy E equals the potential energy 12 k xmax kinetic energy is momentarily zero. In contrast, when the oscillator moves past x = 0, the kinetic energy reaches its maximum value while the potential energy equals zero.

6.2 Quantum harmonic oscillator Given the potential energy (6.8), we can write down the Schrödinger equation for the one-dimensional harmonic oscillator: −

1 2 ψ (x) + kx 2 ψ(x) = E ψ(x). 2m 2

(6.9)

The harmonic oscillator Chapter | 6

105

FIGURE 6.2 Potential energy function and first few energy levels for harmonic oscillator.

For the first time we encounter a differential equation with non-constant coefficients, which is a much greater challenge to solve. We can combine the constants in (6.9) to two parameters α2 =

mk 2

and λ =

2mE 2 α

(6.10)

and redefine the independent variable as ξ = α 1/2 x.

(6.11)

This reduces the Schrödinger equation to ψ (ξ ) + (λ − ξ 2 )ψ(ξ ) = 0.

(6.12)

The range of the variable x (also ξ ) must be taken from −∞ to +∞, there being no finite cutoff as in the case of the particle in a box. A useful first step is to determine the asymptotic solution to (6.12), that is, the form of ψ(ξ ) as ξ → ±∞. For sufficiently large values of |ξ |, ξ 2 >> λ and the differential equation is approximated by ψ (ξ ) − ξ 2 ψ(ξ ) ≈ 0. This suggests the following manipulation: 2 d d d 2 − ξ ψ(ξ ) ≈ −ξ + ξ ψ(ξ ) ≈ 0. dξ dξ dξ 2

(6.13)

(6.14)

The first-order differential equation ψ (ξ ) + ξ ψ(ξ ) = 0

(6.15)

106 Introduction to Quantum Mechanics

can be solved exactly to give ψ(ξ ) = const e−ξ

2 /2

(6.16)

.

Remarkably, this turns out to be an exact solution of the Schrödinger equation (6.12) with λ = 1. Using (6.10), this corresponds to an energy k 2 α 1 (6.17) = 2 = 1 ω, E= 2m m 2 where ω is the natural frequency of the oscillator according to classical mechanics. The function (6.16) has the form of a gaussian, or bell-shaped curve. The function has no nodes, which leads us to conclude that this represents the ground state of the system. The ground state is usually designated with the quantum number n = 0 (the particle in a box is a exception, with n = 1 labeling the ground state). Reverting to the original variable x, we write ψ0 (x) = const e−αx

2 /2

,

α = (mk/2 )1/2 .

With help of the well-known definite integral ∞ π 1/2 2 e−αx dx = α −∞

(6.18)

(6.19)

we find the normalized eigenfunction α 1/4 2 e−αx /2 , ψ0 (x) = π

(6.20)

with the corresponding eigenvalue E0 = 12 ω.

(6.21)

6.3 Harmonic-oscillator eigenfunctions and eigenvalues Drawing from our experience with the particle in a box, we might surmise that the first excited state of the harmonic oscillator would be a function similar to (6.20), but with a node at x = 0, say, ψ1 (x) = const x e−αx

2 /2

(6.22)

.

This is orthogonal to ψ0 (x) by symmetry and is indeed an eigenfunction with the eigenvalue E1 = 32 ω.

(6.23)

Continuing the process, we try a function with two nodes ψ2 (x) = const (x 2 − a) e−αx

2 /2

.

(6.24)

The harmonic oscillator Chapter | 6

107

Using integrals calculated in Sect. 5.1, we determine that a = 1/2 makes ψ2 (x) orthogonal to both ψ0 (x) and ψ1 (x). We verify that this is another eigenfunction, corresponding to E2 = 52 ω.

(6.25)

The general result involves Hermite polynomials, which are discussed in Sect. 5.4. We can thereby write the normalized eigenfunctions: ψn (x) =

√

α √ 2n n! π

1/2

√ 2 Hn ( α x) e−αx /2 ,

(6.26)

where Hn (ξ ) represents the Hermite polynomial of degree n. The first few Hermite polynomials are H0 (ξ ) = 1,

H1 (ξ ) = 2ξ,

H2 (ξ ) = 4ξ 2 − 2,

H4 (ξ ) = 16ξ − 48ξ + 12, 4

2

H3 (ξ ) = 8ξ 3 − 12ξ,

H5 (ξ ) = 32ξ − 160ξ 3 + 120ξ. 5

(6.27)

The four lowest harmonic-oscillator eigenfunctions are plotted in Fig. 6.3. Note the topological resemblance to the corresponding particle-in-a-box eigenfunctions.

FIGURE 6.3 Harmonic oscillator eigenfunctions for n=0, 1, 2, 3.

108 Introduction to Quantum Mechanics

The eigenvalues are given by the simple formula En = (n + 12 )ω.

(6.28)

These are drawn in Fig. 6.2, on the same scale as the potential energy. The ground-state energy E0 = 12 ω is greater than the classical value of zero, again a consequence of the uncertainty principle. It is remarkable that the difference between successive energy eigenvalues has a constant value E = En+1 − En = ω = hν.

(6.29)

This is reminiscent of Planck’s formula for the energy of a photon. It comes as no surprise then that the quantum theory of radiation has the structure of an assembly of oscillators, with each oscillator representing a mode of electromagnetic waves of a specified frequency.

6.4 Operator formulation of harmonic oscillator To develop the quantum theory of electromagnetic radiation, it is useful to reformulate the harmonic-oscillator problem in terms of creation and annihilation operators, following a derivation due to Dirac. The Schrödinger equation (6.9) can be written 1 (6.30) (p 2 + mω2 x 2 )|n = En |n = (n + 12 )ω|n, 2m √ where p is the momentum operator −i d/dx and ω = k/m. Now define the mutually adjoint operators mω mω i i † p and a = p. (6.31) a= x+√ x−√ 2 2 2mω 2mω H |n =

These are not hermitian operators, but in terms of them the Hamiltonian can be expressed H = (a † a + 12 )ω.

(6.32)

Comparing (6.30), it is clear that a † a represents a number operator, such that a † a|n = n|n.

(6.33)

From the fundamental commutator [x, p] = i

(6.34)

we can derive the following commutation relations: [a, a † ] = 1,

(6.35)

The harmonic oscillator Chapter | 6

109

[a, H ] = ωa

(6.36)

[a † , H ] = −ωa † .

(6.37)

and Applying (6.36) to an eigenfunction |n, we find aH |n − H a|n = ωa|n.

(6.38)

Using H |n = En |n, this can be rearranged to H (a|n) = (En − ω)(a|n).

(6.39)

Since En − ω = En−1 for harmonic-oscillator eigenvalues, it follows that a|n = const |n − 1,

(6.40)

meaning that a|n represents an eigenfunction for quantum number n − 1. The value of the constant in (6.40) follows from (6.33) since n − 1|n − 1 = n|a † a|n = n,

(6.41)

noting that n|a † = (a|n)† according to a general property of bras and kets. Assuming that |n √ and |n − 1 are both normalized, it follows that the constant in (6.40) equals n. Thus √ (6.42) a|n = n |n − 1 Proceeding analogously from Eq. (6.37), we find √ a † |n = n + 1 |n + 1.

(6.43)

For obvious reasons, a † and a are known as step-up and step-down operators, respectively. They are also called ladder operators since they take us up and down the ladder of harmonic-oscillator eigenvalues. In the context of radiation theory, a † and a are called creation and annihilation operators, respectively, since their action is to create or annihilate a quantum of energy. For the harmonic-oscillator ground state |0, Eq. (6.42) implies that a|0 = 0, consistent with E0 being the lowest possible eigenvalue. Using the definition of a in (6.31), this leads to a first-order differential equation for the ground-state wavefunction: d mω (6.44) + x ψ0 (x) = 0, dx with solution ψ0 (x) = const e−mωx

2 /2

.

(6.45)

110 Introduction to Quantum Mechanics

This agrees with what we found in Eq. (6.18) as an asymptotic solution, which also proved to be exact. In fact, Eq. (6.44) is identical to Eq. (6.15). Eqs. (6.42) and (6.43) for the ladder operators can be applied to derive matrix elements for the harmonic oscillator. Solving (6.31) for x and p we find mω † x= p = −i (6.46) (a + a ) (a − a † ). 2mω 2 Thus

n|x|n =

n|(a + a † )|n . 2mω

(6.47)

By (6.42) and (6.43), the only nonvanishing matrix elements will be for n = n ± 1, with n (n + 1) , n|x|n + 1 = . (6.48) n|x|n − 1 = 2mω 2mω It follows that the selection rule for electric-dipole transitions [cf. Eq. (4.71)] in a harmonic oscillator is n = ±1. Matrix elements of operators x 2 and p 2 can be evaluated by taking squares of Eqs. (6.46). Some examples are given in the Exercises.

6.5 Quantum theory of radiation Sir James Jeans’ suggestion that the radiation field could be treated as an ensemble of different modes of vibration confined to an enclosure was applied to the blackbody problem in Chap. 1. The quantum theory of radiation develops this correspondence more explicitly, identifying each mode of the electromagnetic field with an abstract harmonic oscillator of frequency ωλ . The Hamiltonian for the entire radiation field can be written

H = 12 (Pλ2 + ωλ2 Q2λ ), (6.49) λ

where λ labels the frequencies, propagation directions and polarizations of the constituent modes. We can then define, in analogy with Eq. (6.31), ωλ ωλ i i aλ = Pλ and aλ† = Pλ . (6.50) Qλ + √ Qλ − √ 2 2 2ωλ 2ωλ These can be very explicitly identified as annihilation and creation operators for photons, the quanta of the electromagnetic field, with frequency ωλ , propagation vector kλ and polarization eˆ λ . The Hamiltonian (6.49) becomes

† (aλ aλ + 12 )ωλ (6.51) H= λ

The harmonic oscillator Chapter | 6

111

with the energy of the radiation field equal to E=

(nλ + 12 )ωλ .

(6.52)

λ

The nλ represent the number of λ photons contained in a cubic box of volume L3 , as illustrated in Fig. 6.4.

FIGURE 6.4 Schematic representation of an atom and an ensemble of photons enclosed in a cubic box.

The state of the radiation field is determined by a set of photon numbers nλ . The vacuum state, designated |0, contains no photons. The state |1λ contains one photon of energy ωλ , propagation vector kλ and polarization eˆ λ . The state |2λ contains two such photons, while |1λ , 1λ contains two different photons, λ and λ . The most general state of the radiation field would be designated |nλ , nλ , nλ . . . . If the enclosure also contains an atom in quantum state ψn , the composite state is designated |n; nλ , nλ , nλ . . . . The electric dipole interaction between the atom and the radiation is generally the dominant contribution, represented by a perturbation V = −μ · E.

(6.53)

The energy of the radiation field within the box has the form H=

1 2

2 3 B

0 Eλ2 + μ−1 λ ×L , 0

(6.54)

λ

where Eλ and Bλ are the electric and magnetic fields associated with the oscillator mode λ. Comparing the sums of quadratic forms (6.49) and (6.54) it is evident that the quantum-mechanical operator representing the electric field contains contributions linear in both aλ and aλ† . This is all we need to know.

112 Introduction to Quantum Mechanics

A more detailed derivation would give the complete expression i ωλ E(r) = 3/2 aλ eˆ λ ei kλ ·r − aλ† eˆ ∗λ e−i kλ ·r . 2 0 L

(6.55)

λ

A sufficient approximation for our purposes is to take Vabs ≈ const × μ aλ

and

Vem ≈ const × μ aλ†

(6.56)

for absorption and emission, respectively, of a photon by an atom making a transition between energy levels En and Em . Conservation of energy requires that the photon frequency satisfies ωλ =

E m − En .

(6.57)

The transition takes place from one of the composite states |n; . . . nλ . . . or |m; . . . nλ . . . , where the photon numbers for the other (nonresonant) modes are irrelevant. As shown in Sect. 4.8, the transition probability is proportional to the square of the perturbation matrix element 2 W = const final|V |initial .

(6.58)

Thus for the upward transition of the atom (m ← n) 2 Wabs = const m; nλ − 1|Vabs |n; nλ = const |μmn |2 nλ , while for the downward transition 2 Wem = const n; nλ + 1|Vem |m; nλ = const |μmn |2 (nλ + 1).

(6.59)

(6.60)

The photon numbers come from the action of the creation and annihilation operators:

√ (6.61) aλ |nλ = nλ |nλ − 1 and aλ† |nλ = nλ + 1|nλ + 1 analogous to (6.42) and (6.43). The occurrence of different factors nλ and nλ + 1 is quite significant. The absorption probability is linear in nλ , which is proportional to the radiation density in the enclosure. By contrast, the emission probability, varying as nλ + 1, is made up of two distinct contributions. The part linear in nλ is called stimulated emission while the part independent of nλ accounts for spontaneous emission. Remarkably, the probability for absorption is

The harmonic oscillator Chapter | 6

113

exactly equal to that for stimulated emission. A detailed calculation gives the following transition rates: Wabs = Wstim em =

4π 2 ρ(ω)|μmn |2 , 32

(6.62)

while Wspont em =

4ω3 |μ |2 . 3c3 mn

(6.63)

Note that all three radiative processes depend on |μmn |2 and thus obey the same selection rules. The dependence on ω3 makes spontaneous emission significant only for higher-energy radiation, in practice, for optical frequencies and higher. Einstein in 1917 showed the relation between the three radiative processes and Planck’s blackbody radiation law. Suppose the radiation in a cavity is in equilibrium at a temperature T . This means that the rate of upward and downward transitions between every pair of energy levels En and Em in the walls of the enclosure must exactly balance. Thus Nn Wabs = Nm (Wstim em + Wspont em ).

(6.64)

But if the populations Nn and Nm obey a Boltzmann distribution Nm = e−(Em −En )/kT = e−ω/kT . Nn

(6.65)

Putting in the transitions rates from (6.62) and (6.63) and solving for the radiation density, we obtain ω3 /π 2 c3 , (6.66) eω/kT − 1 independent of any atomic parameters such as e or me . This is Planck’s radiation law expressed in terms of the radian frequency ω. ρ(ω) =

Supplement 6A. Anharmonic oscillator An anharmonic oscillator is one which deviates from the exact form of the harmonic oscillator. We will consider the case of an oscillator with a quartic anharmonicity. We write the Hamiltonian, with conveniently scaled variables, as 1 1 1 d2 + ω2 x 2 + λx 4 . (6.67) 2 2 dx 2 4 Clearly, this problem can be treated by the perturbation theory of Supplement 4A with 1 V (x) = λx 4 . (6.68) 4 H =−

114 Introduction to Quantum Mechanics

The unperturbed eigenfunctions are given by ψn (x) =

√ 1/2 √ ω 2 Hn ( ω x) e−ωx /2 , √ n 2 n! π

n = 0, 1, 2, . . . ,

(6.69)

with the corresponding eigenvalues En = (n + 12 )ω. We will consider this perturbation problem for the n = 0 and n = 1 states of the oscillator. The unperturbed energies are E0 = 12 ω and E1 = 32 ω. Referring to Supplement 4A, the first-order energies are given by λ 4

(1)

E0 = and (1)

E1 =

λ 4

∞

−∞

∞

−∞

ψ0 (x)x 4 ψ0 (x) dx =

3λ 16ω2

(6.70)

ψ1 (x)x 4 ψ1 (x) dx =

15λ . 16ω2

(6.71)

The general result for arbitrary n is En(1) =

3λ (2n2 + 2n + 1). 16ω2

(6.72)

To compute the second-order perturbation energies, we require the matrix elements (x 4 )n,m . The only nonvanishing elements turn out to be those of the form (x 4 )n,n , (x 4 )n,n±2 and (x 4 )n,n±4 . We require only the following: 1 (n + 1)(n + 2)(2n + 3), 2ω2 1 (n + 1)(n + 2)(n + 3)(n + 4). (x 4 )n,n+4 = 4ω2 (x 4 )n,n+2 =

(6.73) (6.74)

The sum representing the second-order energy for n = 0 and n = 1 thus reduces to two terms: (2)

E0 =

|V0,2 |2 |V0,4 |2 21λ2 + =− E0 − E2 E0 − E4 128ω5

(6.75)

|V1,3 |2 |V1,5 |2 165λ2 + =− . E1 − E3 E1 − E5 128ω5

(6.76)

and (2)

E1 =

The total energies to second order then work out to 1 21λ2 3λ E0 = ω + − , 2 16ω2 128ω5

3 165λ2 15λ E1 = ω + − . 2 16ω2 128ω5

(6.77)

The harmonic oscillator Chapter | 6

115

Chapter 6. Exercises 6.1. For a classical harmonic oscillator, the particle can not go beyond the points where the total energy equals the potential energy. Identify these points for a quantum-mechanical harmonic oscillator in its ground state. Write an integral giving the probability that the particle will go beyond these classically-allowed points. (You need not evaluate the integral.) 6.2. Evaluate the average (expectation) values of potential energy and kinetic energy for the ground state of the harmonic oscillator. Comment on the relative magnitude of these two quantities. 6.3. Using ladder operators to evaluate matrix elements, calculate the average potential and kinetic energies for a harmonic oscillator in its nth quantum state. 6.4. Apply the Heisenberg uncertainty principle to the ground state of the harmonic oscillator. Applying the formula for expectation values, calculate x = x 2 − x2 and p = p 2 − p2 and find the product x p. 6.5. Using ladder operators to evaluate matrix elements, evaluate the uncertainty product x p for the nth quantum state. 6.6. The probability of a radiative dipole transition between levels n and n of a harmonic oscillator depend on the square of the matrix element ∞ xn,n = ψn (x) x ψn (x) dx. −∞

When xn,n = 0, the transition is forbidden. Determine whether each of the following harmonic-oscillator transitions is allowed or forbidden: 1 ← 0, 2 ← 0, 2 ← 1. Derive the general form of the selection rule. 6.7. (optional) Verify the given results for the first- and second-order perturbation energies of a harmonic oscillator perturbed by V = 14 λx 4 for the n = 0 and n = 1 states.

Chapter 7

Angular momentum 7.1 Particle in a ring Consider a variant of the one-dimensional particle in a box problem in which the x-axis is bent into a ring of radius R. We can write the same Schrödinger equation 2 d 2 ψ(x) = Eψ(x). (7.1) 2M dx 2 There are no boundary conditions in this case since the x-axis closes upon itself. A more appropriate independent variable for this problem is the angular position on the ring given by φ = x/R. The Schrödinger equation then reads −

−

2 d 2 ψ(φ) = Eψ(φ). 2MR 2 dφ 2

(7.2)

The kinetic energy of a body rotating in the xy-plane can be expressed as E=

L2z , 2I

(7.3)

where I = MR 2 is the moment of inertia and Lz , the z-component of angular momentum. (Since L = r × p, if r and p lie in the xy-plane, L points in the z-direction.) The structure of Eq. (7.2) suggests that this angular-momentum operator is given by ∂ Lˆ z = −i . ∂φ

(7.4)

This result will follow from a more general derivation in the following Section. The Schrödinger equation (7.2) can now be written more compactly as ψ (φ) + m2 ψ(φ) = 0,

(7.5)

m2 ≡ 2I E/2 .

(7.6)

where

(We have used the symbol M for mass to avoid confusion with m.) Possible solutions to (7.5) are ψ(φ) = const e±imφ . Introduction to Quantum Mechanics. https://doi.org/10.1016/B978-0-12-822310-9.00015-X Copyright © 2021 Elsevier Inc. All rights reserved.

(7.7) 117

118 Introduction to Quantum Mechanics

In order for this wavefunction to be physically acceptable, it must be singlevalued. Since φ increased by any multiple of 2π represents the same point on the ring, we must have ψ(φ + 2π) = ψ(φ)

(7.8)

eim(φ+2π) = eimφ .

(7.9)

e2πim = 1,

(7.10)

and therefore

This requires that

which is true only is m is an integer: m = 0, ±1, ±2 . . . .

(7.11)

Using (7.6), this gives the quantized energy values Em =

2 2 m . 2I

(7.12)

In contrast to the particle in a box, the eigenfunctions corresponding to +m and −m in Eq. (7.7) are linearly independent, so both must be included. Therefore all eigenvalues, except E0 , are two-fold (doubly) degenerate. The eigenfunctions can all be written in the form const eimφ , with m running over all integer values, as in Eq. (6.11). The normalized eigenfunctions are 1 ψm (φ) = √ eimφ 2π

(7.13)

and can be verified to satisfy the complex generalization of the normalization condition 2π ∗ ψm (φ) ψm (φ) dφ = 1, (7.14) 0

∗ (φ) = (2π)−1/2 e−imφ . The mutual orthogonality where we have noted that ψm of the functions (7.13) also follows easily, for 2π 2π 1 ∗ ψm ψm (φ) dφ = ei(m−m )φ dφ = 2π 0 0 2π 1 [cos(m − m )φ + i sin(m − m )φ] dφ = 0, for m = m. (7.15) 2π 0

The solutions (7.13) are also eigenfunctions of the angular momentum operator (7.4), with Lˆ z ψm (φ) = m ψm (φ),

m = 0, ±1, ±2 . . . .

(7.16)

Angular momentum Chapter | 7 119

This is a instance of a fundamental result in quantum mechanics, that any measured component of orbital angular momentum is restricted to integral multiples of . The Bohr theory of the hydrogen atom, to be discussed in the next Chapter, can be derived from this assumption alone.

7.2 Free electron model for aromatic molecules The benzene molecule contains a ring of six carbon atoms around which six delocalized π-electrons can circulate. A variant of the FEM for rings predicts the ground-state electron configuration which we can write as 1π 2 2π 4 , as shown in Fig. 7.1:

FIGURE 7.1 Free electron model for benzene. Dotted arrow shows the lowest-energy excitation.

The enhanced stability the benzene molecule can be attributed to the complete shells of π-electron orbitals, analogous to the way that noble gas electron configurations achieve their stability. Naphthalene, apart from the central C–C bond, can be modeled as a ring containing 10 electrons in the next closed-shell configuration 1π 2 2π 4 3π 4 . These molecules fulfill Hückel’s “4N + 2 rule” for aromatic stability. The molecules cyclobutadiene (1π 2 2π 2 ) and cyclooctatetraene (1π 2 2π 4 3π 2 ), even though they consist of rings with alternating single and double bonds, do not exhibit aromatic stability since they contain partiallyfilled orbitals. The longest wavelength absorption in the benzene spectrum can be estimated according to this model as 2 hc (22 − 12 ). = E 2 − E1 = λ 2mR 2

(7.17)

The ring radius R can be approximated by the C–C distance in benzene, 1.39 Å. We predict λ ≈ 210 nm, whereas the experimental absorption has λmax ≈ 268 nm.

7.3 Rotation in three dimensions A particle of mass M, free to move on the surface of a sphere of radius R, can be located by the two angular variables θ, φ. The Schrödinger equation therefore has the form −

2 2 ∇ Y (θ, φ) = E Y (θ, φ), 2M

(7.18)

120 Introduction to Quantum Mechanics

with the wavefunction conventionally written as Y (θ, φ). These functions are the spherical harmonics, discussed in Sect. 5.7. Since r = R, a constant, the first term in the Laplacian does not contribute. The Schrödinger equation reduces to ∂ 1 ∂2 1 ∂ + λ Y (θ, φ) = 0, (7.19) sin θ + 2 sin θ ∂θ ∂θ sin θ ∂φ 2 where λ=

2MR 2 E 2I E = 2 , 2

(7.20)

again introducing the moment of inertia I = MR 2 . The variables θ and φ can be separated in Eq. (7.19) after multiplying through by sin2 θ. If we write Y (θ, φ) = (θ) (φ)

(7.21)

and follow the procedure used for the three-dimensional box, we find that dependence on φ alone occurs in the term (φ) = const. (φ)

(7.22)

This is identical in form to Eq. (7.5), with the constant equal to −m2 , and we can write down the analogous solutions 1 imφ m (φ) = m = 0, ±1, ±2 . . . (7.23) e , 2π Substituting (6.27) into (6.23) and canceling the functions m (φ), we obtain an ordinary differential equation for (θ ) d m2 1 d sin θ − 2 + λ (θ ) = 0. (7.24) sin θ dθ dθ sin θ As shown in Sect. 5.6, the single-valued, finite solutions to (7.24) are known as associated Legendre functions. The parameters λ and m are restricted to the values λ = l(l + 1),

l = 0, 1, 2 . . . ,

(7.25)

while m = 0, ±1, ±2 . . . ± l,

(2l +1 values).

(7.26)

Putting (7.25) into (7.20), the allowed energy levels for a particle on a sphere are found to be E =

2 l(l + 1). 2I

(7.27)

Angular momentum Chapter | 7 121

Since the energy is independent of the second quantum number m, these levels are (2l +1)-fold degenerate. The explicit forms for the spherical harmonics we will need is given in Eq. (5.112). The spherical harmonics constitute an orthonormal set satisfying the integral relations

π 0

7.4

2π 0

Yl∗ m (θ, φ)Ylm (θ, φ) sin θ dθ dφ = δll δmm .

(7.28)

Theory of angular momentum

Generalization of the energy-angular momentum relation (7.3) to three dimensions gives L2 . (7.29) 2I Thus from (7.18) and (7.19) we can identify the operator for the square of total angular momentum ∂ 1 ∂2 1 ∂ sin θ + 2 . (7.30) Lˆ 2 = −2 sin θ ∂θ ∂θ sin θ ∂φ 2 E=

The functions Y (θ, φ) are simultaneous eigenfunctions of Lˆ 2 and Lˆ z such that Lˆ 2 Ylm (θ, φ) = l(l + 1) 2 Ylm (θ, φ)

(7.31)

Lˆ z Ylm (θ, φ) = m Ylm (θ, φ).

(7.32)

and and Ly (unless = 0). Note But Ylm (θ, φ) is not an eigenfunction of either Lx √ that the magnitude of the total angular momentum l(l + 1) is greater than its maximum observable component in any direction, namely l . The quantummechanical behavior of the angular momentum and its components can be represented by a vector model, illustrated in Fig. 7.2. The angular momentum √ vector L, with magnitude l(l + 1) , can be pictured as diffused around a cone about the z-axis, with only the z-component Lz having a definite value. The components Lx and Ly are indeterminate, corresponding to the fact that the system is not in an eigenstate of either. There are 2l + 1 different allowed values for Lz , with eigenvalues m (m = 0, ±1, ±2 . . . ± l) equally spaced between +l and −l . √ = l(l + 1) is greater The magnitude of the total angular momentum |L| than its maximum observable component in any direction, namely l. The quantum-mechanical behavior of the angular momentum and its components can be represented by a vector model, as shown in Fig. 7.2. The angular can be pictured as precessing about the z-axis, with its momentum vector L

122 Introduction to Quantum Mechanics

FIGURE 7.2 Space quantization of angular momentum, showing the case l = 2.

z-component Lz constant. The components Lx and Ly fluctuate in the course of precession, mirroring the fact that the system is not in an eigenstate of either, as implied by the commutation relations. There are 2l + 1 possible values for Lz , with eigenvalues m (m = 0, ±1, ±2, . . . , ±l), equally spaced between +l and −l. This discreteness in the allowed directions of the angular momentum vector is called space quantization. The existence of simultaneous eigenstates of Lˆ 2 and any one component, conventionally Lˆ z , is consistent with the commutation relations derived in Chap. 4: [Lˆ 2 , Lˆ z ] = 0

(7.33)

and [Lˆ x , Lˆ y ] = iLˆ z ,

7.5

et cyc.

(7.34)

Operator derivation of angular momentum eigenvalues

The eigenvalues of angular momentum can alternatively be determined by making use of ladder operators, analogous to the treatment of the harmonic oscillator in Sect. 6.4. As a consequence of (7.33), there must exist simultaneous eigenstates of L2 and one component, generally chosen as Lz . Let us denote the corresponding eigenvectors by |l, m , and write the eigenvalue equations Lz |l, m = m|l, m

L2 |l, m = λ2 |l, m ,

and

(7.35)

with λ remaining to be determined. At this point we introduce the operators L+ = Lx + iLy ,

L− = Lx − iLy ,

(7.36)

which, we will see in a moment, are raising and lowering operators for the eigenvalues of Lz . It is easy to see that L2 commutes with L+ , since it commutes

Angular momentum Chapter | 7 123

with both Lx and Ly . Therefore L2 L+ |l, m = L+ L2 |l, m = λ2 L+ |l, m ,

(7.37)

using the eigenvalue equation for L2 , Eq. (7.35). We can then write: L2 (L+ |l, m ) = λ2 (L+ |l, m ), which shows that L+ |l, m is an eigenvector of L2 with the same eigenvalue λ2 as |l, m . Thus, the operator L+ applied to |l, m does not change the magnitude of the angular momentum. Consider now the commutator [Lz , L+ ] [Lz , L+ ] = [Lz , Lx ] + i[Lz , Ly ] = iLy + i(−i)Lx = (iLy + Lx ) = L+ . (7.38) Operating on the eigenvector |l, m : [Lz , L+ ]|l, m = Lz L+ |l, m − L+ Lz |l, m = L+ |l, m , Lz (L+ |l, m ) − m(L+ |l, m ) = (L+ |l, m ), Lz (L+ |l, m ) = (m + 1)(L+ |l, m ).

(7.39)

Evidently, L+ |l, m is an eigenvector of Lz with the eigenvalue (m + 1), thus the designation of L+ as a raising operator, meaning that L+ |l, m = const|l, m + 1 .

(7.40)

Analogously, L− is a lowering operator, with L− |l, m = const|l, m − 1 .

(7.41)

Suppose that the quantum number l is now defined as the maximum allowed value of m for a given eigenvalue λ2 of L2 . (Correspondingly, −l would be the minimum value of m.) Since there is no higher possible value of m, the raising operator on |l, l should annihilate the vector: L+ |l, l = 0.

(7.42)

Now here is an identity you would never ordinarily think of: L− L+ = L2x + L2y + i(Lx Ly − Ly Lx ) = L2 − L2z + i(i)Lz .

(7.43)

Thus L2 = L− L+ + L2z + Lz .

(7.44)

L2 |l, l = L− L+ |l, l + L2z |l, l + Lz |l, l ,

(7.45)

Applying this to |l, l

124 Introduction to Quantum Mechanics

FIGURE 7.3 Schematic representation of Stern-Gerlach experiment.

giving λ2 |l, l = 0 + l 2 2 |l, l + l2 |l, l .

(7.46)

Finally, we can identify λ = l 2 + l = l(l + 1).

(7.47)

Thus we arrive at the eigenvalues and eigenvectors for orbital angular momentum: l = 0, 1, 2, . . . , L2 |l, m = l(l + 1)2 |l, m , m = 0, ±1, ±2, . . . , ±l. Lz |l, m = m|l, m ,

(7.48)

7.6 Electron spin Many atomic spectral lines appear, under sufficiently high resolution, to be closely-spaced doublets, for example the 17.2 cm−1 splitting of the yellow sodium D lines. Uhlenbeck and Goudsmit proposed in 1925 that such doublets were due to an intrinsic angular momentum possessed by the electron (in addition to its orbital angular momentum) that could be oriented in just two possible ways. This property, known as spin, occurs as well in other elementary particles. Spin and orbital angular momenta are roughly analogous to the daily and annual motions, respectively, of the Earth around the Sun. In a classic experiment by Stern and Gerlach in 1922, shown in Fig. 7.3, a beam of silver atoms passed through an inhomogeneous magnetic field splits into two beams, corresponding to the two possible orientations of the magnetic moment of the single unpaired electron. To distinguish the spin angular momentum from the orbital, we designate the quantum numbers as s and ms , in place of l and m. For the electron, the quantum number s always has the value 12 , while ms can have one of two values, ± 12 , as shown in Fig. 7.4. The electron is said to be an elementary particle of spin 12 . The proton and neutron also have spin 12 and belong to the classification

Angular momentum Chapter | 7 125

of particles called fermions, which are governed by the Pauli exclusion principle. Other particles, including the photon, have integer values of spin and are classified as bosons. These do not obey the Pauli principle, so that an arbitrary number can occupy the same quantum state. A complete theory of spin requires relativistic quantum mechanics. For our purposes, it is sufficient to recognize the two possible internal states of the electron, which can be called spin up and spin down. These are designated, respectively, by α and β as factors in the electron wavefunction. Spins play an essential role in determining the possible electronic states of atoms and molecules.

FIGURE 7.4 Electron spin.

Wolfgang Pauli

7.7 Pauli spin algebra A more sophisticated way of representing a quantum system with an internal degree of freedom (such as electron spin) is to introduce a spinor wavefunction: ψ1 (r) (r) = (7.49) ψ2 (r) The spinorbital written ψ(r)α would then be given by

126 Introduction to Quantum Mechanics

ψ(r) 0

(r) = while ψ(r)β is

= ψ(r)

0 ψ(r)

(r) = The matrix operator

,

(7.50)

.

(7.51)

0 1

= ψ(r)

Sz = 2

1 0

1 0

0 −1

(7.52)

represents the z-component of electron spin. There are two eigenstates, 1 0 |α = and |β = , (7.53) 0 1 such that

Sz

and

Sz

1 0

0 1

= 2

=− 2

1 0

(7.54)

0 1

.

(7.55)

Operators for the x- and y-components of spin angular momentum can be represented by 0 1 0 −i Sx = and Sy = , (7.56) 2 1 0 2 i 0 such that the commutation relations [Sx , Sy ] = i Sz ,

et cyc,

(7.57)

analogous to (7.34) are satisfied. The magnitude of the spin angular momentum is given by 3 2 1 0 2 2 2 2 S = S x + Sy + S z = , (7.58) 4 0 1 consistent with the value s(s + 1)2 for angular-momentum quantum number s = 1/2.

Angular momentum Chapter | 7 127

The three unit Hermitian matrices 0 1 0 −i σ1 = , σ2 = , 1 0 i 0

σ3 =

1 0

0 −1

(7.59)

called the Pauli spin matrices are important in relativistic quantum mechanics. For future reference, we summarize the action of the spin operators on the two spin eigenstates: Sz |α = |α , 2 Sx |β = |α , 2

Sz |β = − |β , Sx |α = |β , 2 2 i i Sy |α = |β , Sy |β = − |α , 2 2 3 2 3 2 2 2 S |β = |β . S |α = |α , 4 4

(7.60)

7.8 Addition of angular momenta The total angular momentum of an atom or molecule is the vector sum of the angular momenta of its constituent parts, e.g., electrons. For example, the total orbital angular momentum of several electrons is given by L = l1 + l2 + . . .,

(7.61)

while the total spin angular momentum is analogously S = s1 + s2 + . . ..

(7.62)

These can combine to give a total electronic angular momentum J = L + S.

(7.63)

Later, we will also encounter angular momentum from molecular rotation and from nuclear spins. Consider the general case of vector addition of two angular momenta, which we will denote as J1 and J2 : J = J 1 + J2 .

(7.64)

We can picture J1 and J2 as cones around their resultant J, which is itself represented by a conical surface about some axis in space, as shown in Fig. 7.5. According to quantum theory, each component √ of angular momentum, as well as their resultant, has a magnitude given by J (J + 1) with J having possible values 0, 12 , 1, 32 , 2 . . . , now including the possibility of spins contributing multiples of 12 . The observable components of J are again given by M, with M running from −J to +J in integer steps. If J1 and J2 are described by quantum numbers J1 and J2 , respectively, then the total angular momentum quantum

128 Introduction to Quantum Mechanics

number J has the possible values J = |J2 − J1 |, |J2 − J1 | + 1, . . . , J2 + J1 ,

(7.65)

again in integer steps. The value of J depends on the relative orientation of the components J1 and J2 . For example, angular momenta 1 and 12 can combine to give either J = 12 or J = 32 .

FIGURE 7.5 Vector addition of angular momenta.

Chapter 8

The hydrogen atom and atomic orbitals The hydrogen atom has provided the most fundamental prototype at several levels in the advancement of quantum theory, beginning with the old quantum theory, through nonrelativistic, then relativistic, quantum mechanics and quantum field theory (the Lamb shift, etc.). It is the only real physical system that can be solved exactly, giving analytical solutions in closed-form (although this might also be said for the radiation field, as an assembly of harmonic oscillators).

8.1

Atomic spectra

When gaseous hydrogen in a glass tube is excited by a 5000-volt electrical discharge, four lines are observed in the visible part of the emission spectrum: red at 656.3 nm, blue-green at 486.1 nm, blue violet at 434.1 nm and violet at 410.2 nm:

FIGURE 8.1 Visible spectrum of atomic hydrogen.

Other series of lines have been observed in the ultraviolet and infrared regions. Rydberg (1890) found that all the lines of the atomic hydrogen spectrum could be fitted to a single formula 1 1 1 (8.1) =R 2 − 2 , n1 , n2 = 1, 2, 3 . . . , n2 > n1 , λ n1 n2 where R, known as the Rydberg constant, has the value 109,677 cm−1 for hydrogen. The reciprocal of wavelength, in units of cm−1 , is in general use by spectroscopists. This unit is also designated wavenumbers, since it represents the number of wavelengths per cm. The Balmer series of spectral lines in the visible region, shown in Fig. 8.1, correspond to the values n1 = 2, n2 = 3, 4, 5 and 6. The lines with n1 = 1 in the ultraviolet make up the Lyman series. The Introduction to Quantum Mechanics. https://doi.org/10.1016/B978-0-12-822310-9.00016-1 Copyright © 2021 Elsevier Inc. All rights reserved.

129

130 Introduction to Quantum Mechanics

line with n2 = 2, designated Lyman alpha, has the longest wavelength (lowest wavenumber) in this series, with 1/λ = 82.258 cm−1 or λ = 121.57 nm. In the words of Arthur Schawlow, “The spectrum of the hydrogen atom has proved to be the Rosetta stone of modern physics: once this pattern of lines had been deciphered much else could also be understood.” Other atomic species have line spectra, which can be used as a “fingerprint” to identify the element. However, no atom other than hydrogen has a simple relation analogous to (8.1) for its spectral frequencies. Bohr in 1913 proposed that all atomic spectral lines arise from transitions between discrete energy levels, giving a photon such that hc . (8.2) λ This is called the Bohr frequency condition. We now understand that the atomic transition energy E is equal to the energy of a photon, as proposed earlier by Planck and Einstein. E = hν =

8.2 The Bohr atom The nuclear model proposed by Rutherford in 1911 pictures the atom as a heavy, positively-charged nucleus, around which much lighter, negativelycharged electrons circulate, much like planets in the Solar system. This model is however completely untenable from the standpoint of classical electromagnetic theory, for an accelerating electron (circular motion represents an acceleration) should radiate away its energy. In fact, a hydrogen atom should exist for no longer than 5×10−11 sec, time enough for the electron’s death spiral into the nucleus. This is one of the worst quantitative predictions in the history of physics. It has been called the Hindenberg disaster on an atomic scale. (The Hindenberg, a hydrogen-filled dirigible, crashed and burned in a famous disaster in 1937.) Bohr sought to avoid an atomic catastrophe by proposing that certain orbits of the electron around the nucleus could be exempted from classical electrodynamics and remain stable. The Bohr model was quantitatively successful for the hydrogen atom, as we shall now show. Recall that the attraction between two opposite charges, such as the electron and proton, is given by Coulomb’s law ⎧ ⎪ ⎪ e2 ⎪ ⎨− (gaussian units) r2 F= (8.3) ⎪ e2 ⎪ ⎪ (SI units). ⎩− 4π0 r 2 We prefer to use the gaussian system in applications to atomic phenomena. (In any event, it won’t matter once we change to atomic units.) Since the Coulomb attraction is a central force (dependent only on r), the potential energy is related

The hydrogen atom and atomic orbitals Chapter | 8

131

by dV (r) . (8.4) dr We find therefore, for the mutual potential energy of a proton and electron, F =−

V (r) = −

e2 . r

(8.5)

Bohr considered an electron in a circular orbit of radius r around the proton. To remain in this orbit, the electron must be experiencing a centripetal acceleration a=−

v2 , r

(8.6)

where v is the speed of the electron. By Newton’s second law F = ma, we can write e2 mv 2 = , r r2

(8.7)

where m is the mass of the electron. For simplicity, we assume that the proton mass is infinite (actually mp ≈ 1836me ) so that the proton’s position remains fixed. We will later correct for this approximation by introducing reduced mass. The energy of the hydrogen atom is the sum of the kinetic and potential energies: e2 . r

(8.8)

E = 12 V = −T .

(8.9)

E = T + V = 12 mv 2 − Using Eq. (8.7), we see that T = − 12 V

and

This is the form of the virial theorem for a force law with r −2 dependence. Note that the energy of a bound atom is negative, since it is lower than the energy of the separated electron and proton, which is taken to be zero. For further progress, we need some restriction on the possible values of r or v. This is where we can introduce the quantization of angular momentum L = r × p. Since p is perpendicular to r, we can write simply L = rp = mvr.

(8.10)

Using (8.7), we find that L2 . me2 Assuming angular momentum quantization, r=

L = n,

n = 1, 2 . . . ,

(8.11)

(8.12)

132 Introduction to Quantum Mechanics

excluding n = 0, which would not give a circular orbit. The allowed orbital radii are then given by rn = n 2 a0 ,

(8.13)

where 2 = 5.29 × 10−11 m = 0.529 Å, me2 which is known as the Bohr radius. The corresponding energy is a0 ≡

En = −

e2 me4 = − , 2a0 n2 2 2 n2

n = 1, 2 . . . .

(8.14)

(8.15)

Rydberg’s formula (8.1) can now be deduced from the Bohr model. We have hc 2π 2 me4 1 1 − = En2 − En1 = (8.16) λ h2 n21 n22 and the Rydbeg constant can be identified as R=

2π 2 me4 ≈ 109,737 cm−1 . h3 c

(8.17)

The slight discrepancy with the experimental value for hydrogen (109,677) is due to the finite proton mass. This will be corrected by introducing reduced mass in Sect 8.8. The Bohr model can be readily extended to hydrogenlike ions, systems in which a single electron orbits a nucleus of arbitrary atomic number Z. Thus Z = 1 for hydrogen, Z = 2 for He+ , Z = 3 for Li++ , and so on. The Coulomb potential (8.5) generalizes to V (r) = −

Ze2 , r

(8.18)

while the radius of the orbit (8.13) becomes rn =

n 2 a0 Z

(8.19)

Z 2 e2 . 2a0 n2

(8.20)

giving the energy (8.15) En = −

De Broglie’s proposal that electrons can have wavelike properties was actually inspired by the Bohr atomic model. Since L = rp = n =

nh , 2π

(8.21)

The hydrogen atom and atomic orbitals Chapter | 8

133

we find 2πr =

nh = nλ. p

(8.22)

Therefore, each allowed orbit traces out an integral number of de Broglie wavelengths. Wilson (1915) and Sommerfeld (1916) generalized Bohr’s formula for the allowed orbits to p dr = nh, n = 1, 2 . . . . (8.23) The Sommerfeld-Wilson quantum conditions reduce to Bohr’s results for circular orbits, but allow, in addition, elliptical orbits along which the momentum p is variable. According to Kepler’s first law of planetary motion, the orbits of planets are ellipses with the Sun at one focus. Fig. 8.2 shows the generalization of the Bohr theory for hydrogen, including the elliptical orbits. The lowest energy state n = 1 is still a circular orbit. But n = 2 allows an elliptical orbit in addition to the circular one; n = 3 has three possible orbits, and so on. The energy still depends on n alone, so that the elliptical orbits represent degenerate states. Atomic spectroscopy shows in fact that energy levels with n > 1 consist of multiple states, as implied by the splitting of atomic lines by an electric field (Stark effect) or a magnetic field (Zeeman effect). The shapes of some of these orbits are drawn in Fig. 8.2.

FIGURE 8.2 Bohr-Sommerfeld orbits for n = 1, 2, 3.

The Bohr model was an important first step in the historical development of quantum mechanics. It introduced the quantization of atomic energy levels and gave quantitative agreement with the atomic hydrogen spectrum. With the Sommerfeld-Wilson generalization, it accounted as well for the degeneracy

134 Introduction to Quantum Mechanics

of hydrogen energy levels. Although the Bohr model was able to sidestep the atomic “Hindenberg disaster,” it cannot avoid what we might call the “Heisenberg disaster.” By this we mean that the assumption of well-defined electronic orbits around a nucleus is completely contrary to the basic premises of quantum mechanics. Another flaw in the Bohr picture is that the angular momenta are all too large by one unit, for example, the ground state actually has zero orbital angular momentum (rather than ).

8.3 Quantum mechanics of hydrogenlike atoms In contrast to the particle in a box and the harmonic oscillator, the hydrogen atom is a real physical system that can be treated exactly by quantum mechanics. In addition to their inherent significance, these solutions suggest prototypes for atomic orbitals used in approximate computations on complex atoms and molecules. For an electron in the field of a nucleus of charge +Ze, the Schrödinger equation can be written

2 2 Ze2 − ∇ − ψ(r) = E ψ(r). 2m r

(8.24)

It is convenient to introduce atomic units in which length is measured in bohrs: a0 =

2 = 5.29 × 10−11 m ≡ 1 bohr, me2

and energy in hartrees: e2 = 4.358 × 10−18 J = 27.211 eV ≡ 1 hartree. a0 Electron volts (eV) are a convenient unit for atomic energies, one eV being defined as the energy an electron gains when accelerated across a potential difference of 1 volt. The ground state of the hydrogen atom has an energy of −1/2 hartree or −13.6 eV. Conversion to atomic units is equivalent to setting =e=m=1 in all formulas containing these constants. Rewriting the Schrödinger equation in atomic units, we have

1 2 Z − ∇ − ψ(r) = E ψ(r). (8.25) 2 r Since the potential energy is spherically symmetrical (a function of r alone), it is obviously advantageous to treat this problem in spherical polar coordinates

The hydrogen atom and atomic orbitals Chapter | 8

135

r, θ, φ. Expressing the Laplacian operator in these coordinates (cf. Eq. (5.81)), −

1 ∂ 1 ∂2 1 1 ∂ 2 ∂ ∂ r + sin θ + ψ(r, θ, φ) 2 r 2 ∂r ∂r r 2 sin θ ∂θ ∂θ r 2 sin2 θ ∂φ 2 Z − ψ(r, θ, φ) = E ψ(r, θ, φ). r

(8.26)

Eq. (7.30) shows that the second and third terms in the Laplacian represent the angular momentum operator Lˆ 2 . Clearly, Eq. (8.26) will then have separable solutions of the form ψ(r, θ, φ) = R(r) Ylm (θ, φ).

(8.27)

Substituting this into (8.26) and using the angular momentum eigenvalue equation (7.31), we obtain an ordinary differential equation for the radial function R(r):

1 d 2 d l(l + 1) Z − 2 r − + R(r) = E R(r). (8.28) r 2r dr dr 2r 2 Note that in the domain of the variable r, the angular momentum contribution l(l + 1)/2r 2 acts as an effective addition to the potential energy. It can be identified with centrifugal force, which pulls the electron outward, in opposition to the Coulomb attraction. Carrying out the successive differentiations and simplifying, we obtain 1 Z l(l + 1) 1 + E R(r) = 0, (8.29) R (r) + R (r) + − 2 r r 2r 2 another second-order linear differential equation with non-constant coefficients. It is again useful to explore the asymptotic solutions as r → ∞. In the asymptotic approximation, the equation simplifies to R (r) − 2|E|R(r) ≈ 0,

(8.30)

having noted that the energy E is negative for bound states. Solutions to this equation are √

R(r) ≈ const e±

2|E| r

.

(8.31)

We reject the positive exponential as physically unacceptable, since R(r) would approach infinity as r → ∞, in violation of the requirement that the wavefunction must be finite everywhere. Choosing the negative exponential and setting E = −Z 2 /2, the ground state energy in the Bohr theory (in atomic units), we obtain R(r) ≈ const e−Zr .

(8.32)

136 Introduction to Quantum Mechanics

It turns out that this asymptotic approximation is also an exact solution of the radial equation (8.29) with l = 0, just what happened for the harmonic-oscillator problem in Chap 6. The solutions are designated Rnl (r), where the label n is known as the principal quantum number, as well as by the angular momentum l, which is a parameter in the radial equation. This solution corresponds to R10 (r). It should be normalized according to the condition

∞

[R10 (r)]2 r 2 dr = 1.

(8.33)

0

Recalling the definite integral

∞

r n e−αr dr =

0

n! α n+1

,

(8.34)

we find the normalized radial function R10 (r) = 2Z 3/2 e−Zr .

(8.35)

Since this function is nodeless, we identify it with the ground state √ of the hydrogenlike atom. Multiplying by the spherical harmonic Y00 = 1/ 4π , we obtain the complete wavefunction ψ100 (r) =

Z 3 −Zr . e π

(8.36)

This is conventionally designated as the 1s function ψ1s (r). Making use of the simplified integration for a spherically-symmetrical function (cf. Eq. (5.80)), the normalization condition for the 1s wavefunction can be written

∞ [ψ1s (r)]2 4πr 2 dr = 1. (8.37) 0

8.4 Hydrogen-atom ground state There are a number of different ways of representing hydrogen-atom wavefunctions graphically. We will illustrate some of these with the 1s ground state. In atomic units, 1 ψ1s (r) = √ e−r π

(8.38)

is a decreasing exponential function of a single variable r, and is simply plotted in Fig. 8.3. Fig. 8.4 gives a somewhat more pictorial representation, a threedimensional contour plot of ψ1s (r) as a function of x and y in the x, y-plane.

The hydrogen atom and atomic orbitals Chapter | 8

137

FIGURE 8.3 Plot of 1s orbital.

FIGURE 8.4 Contour map of 1s orbital in the xy-plane.

According to Born’s interpretation of the wavefunction, the probability per unit volume of finding the electron at the point (r, θ, φ) is equal to the square of the normalized wavefunction ρ1s (r) = [ψ1s (r)]2 =

1 −2r e . π

(8.39)

This is represented in Fig. 8.5 by a scatter plot describing a possible sequence of observations of the electron position. Although results of individual measurements are not predictable, a statistical pattern does emerge after a sufficiently large number of measurements. The probability density is normalized such that

∞ ρ1s (r) 4πr 2 dr = 1. (8.40) 0

138 Introduction to Quantum Mechanics

FIGURE 8.5 Scatter plot of 500 random electron position measurements on hydrogen 1s orbital.

In some ways ρ(r) does not provide the best description of the electron distribution, since the region around r = 0, where the wavefunction has its largest values, is a relatively small fraction of the volume accessible to the electron. Larger radii r have more impact since, in spherical polar coordinates, a value of r is associated with a shell of volume 4πr 2 dr. A more significant measure is therefore the radial distribution function D1s (r) = 4πr 2 [ψ1s (r)]2 ,

(8.41)

which represents the probability density within the entire shell of radius r, normalized such that

∞ D1s (r) dr = 1. (8.42) 0

The functions ρ1s (r) and D1s (r) are both shown in Fig. 8.6. Remarkably, the 1s RDF has its maximum at r = a0 = 1 bohr, which coincides with the radius of the first Bohr orbit.

8.5 Schrödinger equation for atomic orbitals To construct a general solution for the radial function Rnl (r) satisfying Eq. (8.29), we build in its limiting behavior both as r → ∞ and as r → 0. As we √ have seen (Eq. (8.29)), R ≈ e− 2|E|r as r → ∞. As r → 0, the Coulomb potential −Z/r becomes negligible compared to the centrifugal term l(l + 1/2r 2 . The limiting form as r → 0 can then be determined from the solution of Laplace’s equation in spherical coordinates (see Eq. (5.81)). Then term finite as r → 0 is then given by R ≈ r l (cf. Eq. (5.85)). The radial function can be written in the form √

R(r) = r l e−

2|E|r

F (r).

(8.43)

The hydrogen atom and atomic orbitals Chapter | 8

139

FIGURE 8.6 Density ρ(r) and radial distribution function D(r) for hydrogen 1s orbital.

At this point, we can make a very opportunistic move, since we already know, from the Bohr theory, that the energy eigenvalues in atomic units are given by Z2 . 2n2

(8.44)

ρ = 2Zr/n,

(8.45)

En = − It is expedient to define a new variable

so that the radial function can now be written R(ρ) = ρ l e−ρ/2 L(ρ).

(8.46)

The radial equation then reduces to the following equation for L(ρ): ρL (ρ) + (β + 1 − ρ)L (ρ) + αL(ρ) = 0,

(8.47)

where α = n + l, β = 2l + 1. This is the known equation for associated Laguerre β polynomials, which were introduced in Sect 5.7, with solutions Lα (ρ). Thus we arrive finally at the radial function for the hydrogenlike problem: Rnl (r) = Nnl ρ l e−ρ/2 L2l+1 n+l (ρ),

(8.48)

where Nnl is a normalization constant

(n − l − 1)! Nnl = − 2n[(n + l)!]3

1/2

2Z n

3/2 ,

(8.49)

140 Introduction to Quantum Mechanics

such that Rnl (r) fulfills the normalization condition

∞ [Rnl (r)]2 r 2 dr = 1.

(8.50)

0

The angular momentum quantum number l is by convention designated by the following code: l = 0 1 2 3 4 ... s p d f g ... The first four letters come from an old classification scheme for atomic spectral lines: sharp, principal, diffuse and fundamental. Although these designations have long since outlived their original significance, they remain in general use. The solutions of the hydrogenic Schrödinger equation in spherical polar coordinates can now be written in full: ψnlm (r, θ, φ) = Rnl (r)Ylm (θ, φ), n = 1, 2, 3 . . . ,

l = 0, 1 . . . n − 1,

m = 0, ±1, ±2 · · · ± l,

(8.51)

where Ylm are the spherical harmonics tabulated in Eq. (5.112). In many applications, particularly in chemistry, real forms of the atomic orbitals are used. The m = 0 orbital functions are already real. For m = 0, the linear combinations of degenerate functions √1 (ψnlm ± ψnl−m ) gives real atomic orbitals. 2 Table 8.1 on the next page enumerates all the hydrogenlike functions we will actually need. These are called atomic orbitals, in anticipation of their later applications to the structure of atoms and molecules. The energy levels for a hydrogenic system are given by En = −

Z2 2n2

hartrees.

(8.52)

The energy depends on the principal quantum number n alone. Considering all the allowed values of l and m, the level En has a degeneracy of n2 . Fig. 8.7 shows an energy level diagram for hydrogen (Z = 1). For E ≥ 0, the energy is a continuum. The continuum represents states of an electron and proton in interaction, but not bound into a stable atom. Also shown are some of the transitions which make up the Lyman series in the ultraviolet and the Balmer series in the visible region. The ns orbitals are all spherically symmetrical, being √ associated with a constant angular factor, the spherical harmonic Y00 = 1/ 4π . They have n − 1 radial nodes—spherical shells on which the wavefunction equals zero. The 1s ground state is nodeless and the number of nodes increases with energy, in a pattern now familiar from our study of the particle-in-a-box and harmonic oscillator. For example, the 2s orbital has a radial node at r = 2 bohrs. The 3s orbital

The hydrogen atom and atomic orbitals Chapter | 8

141

TABLE 8.1 Real hydrogenic orbitals in atomic units. Z 3/2 ψ1s = √ π Z 3/2 ψ2s = √ 1− 2 2π Z 5/2 ψ2pz = √ z e−Zr/2 4 2π ψ3s = √ ψ3pz =

e−Zr Zr 2

ψ2px , ψ2py

analogous

Z 3/2 √ (27 − 18Zr + 2Z 2 r 2 ) e−Zr/3 81 3π

2 Z 5/2 √ (6 − Zr) z e−Zr/3 81 π ψ3d

z2

=

√ ψ3dzx =

e−Zr/2

x 2 −y 2

analogous

Z 7/2 √ (3z2 − r 2 ) e−Zr/3 81 6π

2 Z 7/2 √ zx e−Zr/3 81 π ψ3d

ψ3px , ψ3py

=

ψ3dyz , ψ3dxy

analogous

Z 7/2 √ (x 2 − y 2 ) e−Zr/3 81 π

FIGURE 8.7 Energy levels and allowed transitions for atomic hydrogen.

has two radial nodes at r = Fig. 8.8.

9 2

−

√ 3 3 2

and

9 2

+

√ 3 3 2 .

These orbitals are plotted in

8.6 p- and d-orbitals The lowest-energy solutions deviating from spherical symmetry are the 2p-orbitals. With R21 (r) and the three l = 1 spherical harmonics, we find three degenerate eigenfunctions:

142 Introduction to Quantum Mechanics

FIGURE 8.8 Plots of 2s and 3s orbitals. Scale units in bohrs.

Z 5/2 ψ210 (r, θ, φ) = √ r e−Zr/2 cos θ 4 2π

(8.53)

and Z 5/2 (8.54) ψ21±1 (r, θ, φ) = ∓ √ r e−Zr/2 sin θ e±iφ . 4 2π The function ψ210 is real and contains the factor r cos θ, which is equal to the cartesian variable z. In chemical applications, this is designated as a 2pz orbital: Z 5/2 ψ2pz = √ z e−Zr/2 . 4 2π

(8.55)

A contour plot is shown in Fig. 8.9. Note that this function is cylindricallysymmetrical about the z-axis with a node in the x, y-plane. The eigenfunctions ψ21±1 are complex and not as easy to represent graphically, their angular dependence being that of the spherical harmonics Y1±1 . It has been shown that any linear combination of degenerate eigenfunctions is an equally-valid alternative eigenfunction. Making use of the Euler formulas for sine and cosine cos φ =

eiφ + e−iφ 2

and

sin φ =

eiφ − e−iφ 2i

(8.56)

and noting that the combinations sin θ cos φ and sin θ sin φ correspond to the cartesian variables x and y, respectively, we can define the alternative 2p orbitals 1 Z 5/2 ψ2px = √ (ψ21−1 − ψ211 ) = √ x e−Zr/2 4 2π 2

(8.57)

i Z 5/2 ψ2py = − √ (ψ21−1 + ψ211 ) = √ y e−Zr/2 . 4 2π 2

(8.58)

and

The hydrogen atom and atomic orbitals Chapter | 8

143

Clearly, these have the same shape as the 2pz-orbital, but are oriented along the x- and y-axes, respectively. The threefold degeneracy of the p-orbitals is very clearly shown by the geometric equivalence the functions 2px, 2py and 2pz, which is not obvious from the spherical harmonics. The functions listed in Table 8.1 are, in fact, the real forms for atomic orbitals. All higher p-orbitals have analogous functional forms xf (r), yf (r) and zf (r) and are likewise 3-fold degenerate.

FIGURE 8.9 Contour plot of 2pz orbital. Negative values are shown in red. Scale units in bohrs.

The orbital ψ320 is, like ψ210 , a real function. It is known in chemistry as the dz2 -orbital and can be expressed as a cartesian factor times a function of r: ψ3dz2 = ψ320 = (3z2 − r 2 )f (r).

(8.59)

A contour plot is shown in Fig. 8.10. This function is also cylindrically symmetric about the z-axis with two angular nodes—conical surfaces with 3z2 −r 2 = 0. The remaining four 3d orbitals are complex functions containing the spherical harmonics Y2±1 and Y2±2 pictured in Fig. 6.4. We can again construct real functions from linear combinations, the result being four geometrically equivalent “four-leaf clover” functions with two perpendicular planar nodes. These orbitals are designated dx 2 −y 2 , dxy , dzx and dyz . Two of these are also shown in Fig. 8.10. The dz2 orbital has a different shape. However, it can be expressed in terms of two non-standard d-orbitals, dz2 −x 2 and dy 2 −z2 . The latter functions, along with dx 2 −y 2 add to zero and thus constitute a linearly dependent set. Only two combinations of these three functions can be chosen as independent eigenfunctions.

8.7 Summary on atomic orbitals The atomic orbitals listed in Table 8.1 are illustrated in Fig. 8.11. Blue and yellow indicate, respectively, positive and negative regions of the wavefunctions (the radial nodes of the 2s and 3s orbitals are obscured). These pictures are intended as stylized representations of atomic orbitals and should not be interpreted as quantitatively accurate.

144 Introduction to Quantum Mechanics

FIGURE 8.10 Contour plots of 3d orbitals.

FIGURE 8.11 Hydrogenic atomic orbitals.

The electron charge distribution in an orbital ψnlm (r) is given by ρ(r) = |ψnlm (r)|2 ,

(8.60)

which for the s-orbitals is a function of r alone. The radial distribution function can be defined, even for orbitals containing angular dependence, by Dnl (r) = 4πr 2 [Rnl (r)]2 .

(8.61)

This represents the electron density in a shell of radius r, including all values of the angular variables θ, φ. Fig. 8.12 shows plots of the RDF for the first few hydrogen orbitals.

The hydrogen atom and atomic orbitals Chapter | 8

145

FIGURE 8.12 Some radial distribution functions.

8.8 Reduced mass Consider a system of two particles of masses m1 and m2 interacting with a potential energy which depends only on the separation of the particles. The classical energy is given by 1 1 E = m1 r˙ 21 + m2 r˙ 22 + V (|r2 − r1 |), 2 2

(8.62)

the dots signifying derivative wrt time. Introduce two new variables, the particle separation r and the position of the center of mass R: r = r2 − r1 ,

R=

m1 r1 + m2 r2 , m

(8.63)

where m = m1 + m2 . In terms of the new coordinates r1 = R +

m2 r, m

r2 = R −

m1 r m

(8.64)

146 Introduction to Quantum Mechanics

and 1 ˙2 1 2 + μ r˙ + V (r), E = mR 2 2 where r = |r| and μ is called the reduced mass μ≡

m1 m2 . m1 + m2

(8.65)

(8.66)

An alternative relation for reduced mass is 1 1 1 + , = μ m1 m2

(8.67)

reminiscent of the formula for resistance of a parallel circuit. Note that, if m2 → ˙ represents the kinetic energy of a ∞, then μ → m1 . The term containing R single hypothetical particle of mass m located at the center of mass R. The remaining terms represent the relative motion of the two particles. It has the appearance of a single particle of effective mass μ moving in the potential field V (r). 1 p2 Erel = μ r˙ 2 + V (r) = + V (r). 2 2μ

(8.68)

We can thus write the Schrödinger equation for the relative motion

2 − ∇ 2 + V (r) ψ(r) = Eψ(r). 2μ

(8.69)

When we treated the hydrogen atom earlier, it was assumed that the nuclear mass was infinite. In that case we can set μ = m, the mass of an electron. The Rydberg constant for infinite nuclear mass was calculated to be R∞ =

2π 2 me4 = 109,737 cm−1 . h3 c

(8.70)

If instead, we use the reduced mass of the electron-proton system μ=

mM 1836 ≈ m ≈ 0.999456 m. m+M 1837

(8.71)

This changes the Rydberg constant for hydrogen to RH ≈ 109,677 cm−1

(8.72)

in perfect agreement with experiment. In 1931, H. C. Urey evaporated four liters of hydrogen down to one milliliter and measured the spectrum of the residue. The result was a set of lines displaced

The hydrogen atom and atomic orbitals Chapter | 8

147

slightly from the hydrogen spectrum. This amounted to the discovery of deuterium, or heavy hydrogen, for which Urey was awarded in 1934 Nobel Prize in Chemistry. Estimating the mass of the deuteron, 2 H1 , as twice that of the proton, gives RD ≈ 109,707 cm−1 .

(8.73)

Another interesting example involves positronium, a short-lived combination of an electron and a positron—the electron’s antiparticle. The electron and positron mutually annihilate with a half-life of approximately 10−7 sec and positronium decays into gamma rays. The reduced mass of positronium is μ=

m×m m = m+m 2

(8.74)

half the mass of the electron. Thus the ionization energy equals 6.80 eV, half that of hydrogen atom. Positron emission tomography (PET) provides a sensitive scanning technique for functioning living tissue, notably the brain. A compound containing a positron-emitting radionuclide, for example, 11 C, 13 N, 15 O or 18 F, is injected into the body. The emitted positrons attach to electrons to form shortlived positronium (see Fig. 8.13), and the annihilation radiation is monitored.

FIGURE 8.13 Jens Zorn sculpture depicting positronium annihilation. Outside University of Michigan Physics Building.

Chapter 8. Exercises 8.1. Assume that each circular Bohr orbit for an electron in a hydrogen atom contains an integer number of de Broglie wavelengths, n = 1, 2, 3, etc. Show

148 Introduction to Quantum Mechanics

that the orbital angular momentum must then be quantized. Bohr’s formula for the hydrogen energy levels follows from this. 8.2. Based on your knowledge of the first few hydrogenic eigenfunctions, deduce general formulas, in terms of n and l, for: (i) the number of radial nodes in an atomic orbital; (ii) the number of angular nodes; (iii) the total number of nodes. 8.3. Calculate the wavelength of the Lyman alpha transition (1s ← 2p) in atomic hydrogen and in He+ . Express the results in both nm and cm−1 . 8.4. Determine the maximum of the radial distribution function for the ground state of hydrogen atom. Compare this value with the corresponding radius in the Bohr theory. 8.5. The nodeless 1s ground state hydrogenlike eigenfunction is given by 3/2 ψ1s = Z√π e−Zr . Assuming only orthogonality and normalization, construct the 2s and 3s eigenfunctions, with 1 and 2 nodes, respectively, with the forms ψ2s = const(1 + αr)e−Zr/2 and ψ2s = const(1 + βr + γ r 2 )e−Zr/3 , respectively. 8.6. The following reaction might occur in the interior of a star: He++ + H → He+ + H+ . Calculate the electronic energy change (in eV). Assume all species in their ground states. 8.7. Which of the following operators is not equal to the other four: (i) ∂ 2 /∂r 2 (ii) r −2 ∂/∂r r 2 ∂/∂r (iii) r −1 ∂ 2 /∂r 2 r (iv) (r −1 ∂/∂r r)2 (v) ∂ 2 /∂r 2 + 2r −1 ∂/∂r. 8.8. Calculate the expectation values of r, r 2 and of r −1 in the ground state of the hydrogen atom. Give results in atomic units. 8.9. Calculate the expectation values of potential and kinetic energies for the 1s state of a hydrogenlike atom. 8.10. Verify that the 3dxy orbital given in the table is a normalized eigenfunction of the hydrogenlike Schrödinger equation. 8.11. Show that the function

ψ(r, θ, φ) = const 1 − r sin2 (θ/2) e−r/2

is a solution of the Schrödinger equation for the hydrogen atom and find the corresponding eigenvalue (in atomic units).

The hydrogen atom and atomic orbitals Chapter | 8

149

8.12. For the ground state of a hydrogenlike atom, calculate the radius of the sphere enclosing 90% of the electron probability in the 1s state of hydrogen atom. (This involves a numerical computation with successive approximations.) 8.13. Consider as a variational approximation to the ground state of the hydrogen atom the wavefunction ψ(r) = e−αr . Calculate the corresponding energy E(α) then optimize with respect to the parameter α. Compare with the exact solution. 8.14. The electron-spin resonance hyperfine splitting for atomic hydrogen is given by 8π 3 cos2 θ − 1 2 MHz. |ψ(0)| + ν = 532.65 3 r3 Calculate ν for the 1s and for the 2p0 states. The result is in MHz when the bracketed terms are expressed in atomic units. (Hint: In evaluating the expectation value, do the integral over angles first.)

Chapter 9

The helium atom The second element in the periodic table provides our first example of a quantum-mechanical problem which cannot be solved exactly. Nevertheless, as we will show, approximation methods applied to helium can give accurate solutions in essentially perfect agreement with experimental results. In this sense, it can be concluded that quantum mechanics is correct for atoms more complicated than hydrogen. By contrast, the Bohr theory failed miserably in attempts to apply it beyond the hydrogen atom.

9.1 Experimental energies The helium atom has two electrons bound to a nucleus with charge Z = 2. The successive removal of the two electrons can be diagrammed as I1 I2 He −→ He+ + e− −→ He++ + 2e− .

(9.1)

The first ionization energy I1 , the minimum energy required to remove the first electron from helium, is experimentally 24.5874 eV. The second ionization energy, I2 , is 54.4228 eV, which can be calculated exactly since He+ is a hydrogenlike ion. We have I2 = −E1s (He+ ) = −

Z2 = −2 hartrees, 2n2

(9.2)

which corresponds to 54.4228 eV. The energy of the three separated particles on the right side of Eq. (9.1) is, by definition, zero. Therefore the ground-state energy of helium atom is given by E0 = −(I1 + I2 ) = −79.0102 eV = −2.9037 hartrees. We will attempt to reproduce this value, as closely as possible, by theoretical analysis.

9.2 Schrödinger equation and variational calculations The Schrödinger equation for He atom, again using atomic units and assuming infinite nuclear mass, can be written 1 1 Z Z 1 − ∇12 − ∇22 − − + (9.3) ψ(r1 , r2 ) = E ψ(r1 , r2 ). 2 2 r1 r2 r12 Introduction to Quantum Mechanics. https://doi.org/10.1016/B978-0-12-822310-9.00017-3 Copyright © 2021 Elsevier Inc. All rights reserved.

151

152 Introduction to Quantum Mechanics

FIGURE 9.1 Coordinates for helium atom Schrödinger equation.

The five terms in the Hamiltonian represent, respectively, the kinetic energies of electrons 1 and 2, the nuclear attractions of electrons 1 and 2, and the repulsive interaction between the two electrons (see Fig. 9.1). It is this last contribution which prevents an exact solution of the Schrödinger equation and accounts for much of the complication in the theory. In seeking an approximation to the ground state, we might first work out the solution in the absence of the 1/r12 -term. In the Schrödinger equation thus simplified, we can separate the variables r1 and r2 to reduce the equation to two independent hydrogenlike problems. The ground state wavefunction (not normalized) for this hypothetical helium atom would be ψ(r1 , r2 ) = ψ1s (r1 )ψ1s (r2 ) = e−Z(r1 +r2 )

(9.4)

and the energy would equal 2 × (−Z 2 /2) = −4 hartrees, compared to the experimental value of −2.90 hartrees. Neglect of electron repulsion evidently introduces a very large error. A significantly improved result can be obtained by keeping the functional form (9.4), but replacing Z by an adjustable parameter α. Using the function ˜ 1 , r2 ) = e−α(r1 +r2 ) ψ(r in the variational principle [cf. Eq. (4.58)], we have ˜ 1 , r2 ) dτ1 dτ2 ˜ 1 , r2 ) H ψ(r ψ(r ˜ , E= ˜ 1 , r2 ) dτ1 dτ2 ˜ 1 , r2 ) ψ(r ψ(r

(9.5)

(9.6)

where H is the full Hamiltonian as in Eq. (9.3), including the 1/r12 -term. The expectation values of the five parts of the Hamiltonian work out as follows: 1 2 α2 1 2 − ∇1 = − ∇2 = , 2 2 2 Z 1 5 Z (9.7) = − = −Zα, = α. − r1 r2 r12 8 The sum of the integrals in (9.7) gives the variational energy 5 ˜ E(α) = α 2 − 2Zα + α. 8

(9.8)

The helium atom Chapter | 9

153

This will always be an upper bound for the true ground-state energy. We can optimize our result by finding the value of α which minimizes the energy (9.8). We find 5 d E˜ = 2α − 2Z + = 0, dα 8

(9.9)

giving the optimal value α=Z−

5 . 16

(9.10)

This can be given a physical interpretation, noting that the parameter α in the approximate wavefunction (9.5) represents an effective nuclear charge. Each electron partially shields the other electron from the positively-charged nucleus by an amount equivalent to 5/16 of an electron charge. Substituting (9.10) into (9.8), we obtain the optimized approximation to the energy 5 2 . E˜ = − Z − 16

(9.11)

For helium (Z = 2), this gives −2.84765 hartrees, an error of about 2% (exact E0 = −2.90372). Note that the inequality E˜ > E0 applies in an algebraic sense. In the late 1920’s, it was considered important to do the best possible helium computation, as a test of the validity of quantum mechanics for many electron atoms. The table below gives the results for a selection of variational computations on helium. wavefunction

parameters

e−Z(r1 +r2 )

Z=2

energy

e−α(r1 +r2 )

α = 1.6875

−2.84765

ψ(r1 )ψ(r2 )

best ψ(r)

−2.86168

e−α(r1 +r2 ) (1 + c r12 )

best α, c

−2.89112

Hylleraas (1929)

10 parameters

−2.90363

Pekeris (1959)

1078 parameters

−2.90372

−2.75

The third entry refers to the self-consistent field method, developed by Hartree. Even for the best possible choice of one-electron functions ψ(r), there remains a considerable error. This is due to failure to include the variable r12 in the wavefunction. The effect is known as electron correlation. The fourth entry, containing a simple correction for correlation, gives a considerable improvement. Hylleraas (1929) extended this approach with a variational function of the form ψ(r1 , r2 , r12 ) = e−α(r1 +r2 ) × polynomial in r1 , r2 , r12

(9.12)

154 Introduction to Quantum Mechanics

and obtained a result in agreement with experiment using a function with 10 optimized parameters. More recently, using modern computers, results in essentially perfect agreement with experiment have been obtained.

9.3 Spinorbitals and the exclusion principle The simpler wavefunctions for helium atom, for example (9.5), can be interpreted as representing two electrons in hydrogenlike 1s orbitals, designated as a 1s 2 configuration. Pauli’s exclusion principle, which states that no two electrons in an atom can have the same set of four quantum numbers, requires the two 1s electrons to have different spins: one spin-up or α, the other spin-down or β. A product of an orbital with a spin function is called a spinorbital. For example, electron 1 might occupy a spinorbital which we designate φ(1) = ψ1s (1)α(1)

or ψ1s (1)β(1).

(9.13)

Spinorbitals can be designated by a single subscript, for example, φa or φb , where the subscript stands for a set of four quantum numbers. In a two-electron system the occupied spinorbitals φa and φb must be different, meaning that at least one of their four quantum numbers must be unequal. A two-electron spinorbital function of the form 1 (1, 2) = √ φa (1)φb (2) − φb (1)φa (2) (9.14) 2 automatically fulfills the Pauli principle since it vanishes if a = b. Moreover, this function associates each electron equally with each orbital, which is consistent with the √ indistinguishability of identical particles in quantum mechanics. The factor 1/ 2 normalizes the two-particle wavefunction, assuming that φa and φb are normalized and mutually orthogonal. The function (9.13) is antisymmetric with respect to interchange of electron labels, meaning that (2, 1) = −(1, 2).

(9.15)

This antisymmetry property is an elegant way of expressing the Pauli principle. We note, for future reference, that the function (9.13) can be expressed as a 2 × 2 determinant: 1 φa (1) φb (1) (1, 2) = √ (9.16) . 2 φa (2) φb (2) For the 1s 2 configuration of helium, the two orbital functions are the same and the total wavefunction including spin can be written 1 (1, 2) = ψ1s (1)ψ1s (2) × √ α(1)β(2) − β(1)α(2) . (9.17) 2

The helium atom Chapter | 9

155

For two-electron systems (but not for three or more electrons), the wavefunction can be factored into an orbital function times a spin function. The two-electron spin function 1 (9.18) σ0,0 (1, 2) = √ α(1)β(2) − β(1)α(2) 2 represents the two electron spins in opposing directions (antiparallel) with a total spin angular momentum of zero. The two subscripts are the quantum numbers S and MS for the total electron spin. Eq. (9.17) is called the singlet spin state since there is only a single orientation for a total spin quantum number of zero. It is also possible to have both spins in the same state, provided the orbitals are different. There are three possible states for two parallel spins: σ1,1 (1, 2) = α(1)α(2), 1 σ1,0 (1, 2) = √ α(1)β(2) + β(1)α(2) , 2 σ1,−1 (1, 2) = β(1)β(2).

(9.19)

These make up the triplet spin states, which have the three possible orientations of an angular momentum of 1.

9.4

Excited states of helium

The lowest excitated states of helium have the electron configuration 1s 2s. The 1s 2p configuration has higher energy, even though the 2s and 2p orbitals in hydrogen are degenerate. The repulsive interaction between 1s and 2s has a lower energy than that between 1s and 2p because 2s overlaps with the 1s less than 2p does, as is evident from the radial distribution functions plotted in Fig. 8.12. This is enhanced by the fact that 2s has a radial node in a region of space where the 1s density is significant. When electrons are in different orbitals, their spins can be either parallel or antiparallel. In order that the wavefunction satisfy the antisymmetry condition (9.14), the two-electron orbital and spin functions must have opposite behavior under exchange of electron labels. There are four possible states from the 1s 2s configuration: a singlet state 1 + (1, 2) = √ ψ1s (1)ψ2s (2) + ψ2s (1)ψ1s (2) σ0,0 (1, 2) 2

(9.20)

and three triplet states ⎧ ⎪ ⎨σ1,1 (1, 2) 1 − (1, 2) = √ ψ1s (1)ψ2s (2) − ψ2s (1)ψ1s (2) σ1,0 (1, 2) ⎪ 2 ⎩ σ1,−1 (1, 2).

(9.21)

156 Introduction to Quantum Mechanics

Using the Hamiltonian (9.3), we can compute the approximate energies ± (1, 2) H ± (1, 2) dτ1 dτ2 . (9.22) E± = After evaluating some fierce-looking integrals, this reduces to the form E ± = I1s + I2s + J1s,2s ± K1s,2s , in terms of one electron integrals 1 2 Z Ia = ψa (r) − ∇ − ψa (r) dτ, 2 r

(9.23)

(9.24)

Coulomb integrals Ja,b =

ψa (r1 )2

1 ψb (r2 )2 dτ1 dτ2 r12

and exchange integrals 1 Ka,b = ψa (r1 )ψb (r1 ) ψa (r2 )ψb (r2 ) dτ1 dτ2 . r12

(9.25)

(9.26)

The Coulomb integral represents the repulsive potential energy for two interacting charge distributions [ψa (r1 )]2 and [ψb (r2 )]2 . The exchange integral, which has no classical analog, arises because of the exchange symmetry (or antisymmetry) requirement of the wavefunction. Both J and K can be shown to be positive quantities. Therefore the negative sign in (9.22) is associated with the state of lower energy. Thus the triplet state of the configuration 1s 2s is lower in energy than the singlet state. This is an almost universal generalization and contributes to Hund’s rule, to be discussed in the next Chapter. The excitation energies of the 1s2s and 1s2p states of helium are shown in Fig. 9.2. On this scale the 1s 2 ground state is the zero of energy. If the energies are approximated as in Eq. (9.22) by E(1,3 S) ≈ I1s + I2s + J1s,2s ± K1s,2s , E(1,3 P ) ≈ I1s + I2p + J1s,2p ± K1s,2p ,

(9.27)

then the results are consistent with the following values of the Coulomb and exchange integrals: J1s,2p ≈ 10.0 eV, K1s,2p ≈ 0.1 eV, J1s,2s ≈ 9.1 eV and K1s,2s ≈ 0.4 eV.

Chapter 9. Exercises 9.1. For the optimized helium variational wavefunction ψ(r1 , r2 ) = e−α(r1 +r2 )

The helium atom Chapter | 9

157

FIGURE 9.2 Lower excited states of helium atom.

calculate the expectation values of total kinetic and potential energies. Do these satisfy the virial theorem? 9.2. Using the same form of an optimized variational wavefunction ψ(r1 , r2 ) = e−α(r1 +r2 ) estimate the ground-state energy of Li+ . 9.3. Calculate the energy of the hypothetical 1s 3 state of the Li atom using the optimized variational wavefunction ψ(1, 2, 3) = e−α(r1 +r2 +r3 ) . Neglect electron spin, of course. Compare with the experimental ground-state energy, E0 = −7.478 hartrees. Comment on the applicability of the variational theorem.

Chapter 10

Atomic structure and the periodic law The discovery of the periodic structure of the elements by Mendeleev, shown in Fig. 10.1, must be ranked as one the greatest achievements in the history of science. And perhaps the most impressive conceptual accomplishment of quantum mechanics has been its rational account of the origin of the periodic table. Although accurate computations become increasingly more difficult as the number of electrons increases, the general patterns of atomic behavior can be predicted with remarkable accuracy. A modern version of the periodic table is shown in Fig. 10.2.

Dmitri Ivanovich Mendeleev

10.1

Slater determinants

According to the orbital approximation, which was introduced in the last Chapter, an N -electron atom contains N occupied spinorbitals, which can be designated φa , φb . . . φn . In accordance with the Pauli exclusion principle, no two of these spinorbitals can be identical. Also, every electron should be equally associated with every spinorbital. A very neat mathematical representation for these properties is a generalization of the two-electron wavefunction (9.15) called a Slater determinant Introduction to Quantum Mechanics. https://doi.org/10.1016/B978-0-12-822310-9.00018-5 Copyright © 2021 Elsevier Inc. All rights reserved.

159

160 Introduction to Quantum Mechanics

FIGURE 10.1 Mendeleev’s original periodic table (1869). The elements in the gaps marked with red squares—now known to be Sc, Ga and Ge—were predicted by Mendeleev before they were actually discovered.

FIGURE 10.2 Modern version of the periodic table extending through element 118.

φa (1) 1 φa (2) (1, 2 . . . N) = √ N! φa (N )

φb (1) φb (2)

φb (N)

... ... .. . ...

φn (1) φn (2). . φn (N )

(10.1)

Since interchanging any two rows (or columns) of a determinant multiplies it by −1, the antisymmetry property (9.14) is fulfilled, for every pair of electrons.

Atomic structure and the periodic law Chapter | 10 161

John C. Slater The total number of electrons in an atom, the cardinality, can be known while the numbering of individual electrons, their ordinality, has no physical significance. This is something like making a deposit in your bank account. You know how many dollars you put in. But, when you make a withdrawal, you have no way of matching the individual dollars you take out with the dollars you put in. The Hamiltonian for an atom with N electrons around a nucleus of charge Z can be written N N 1 Z 1 H= . (10.2) − ∇i2 − + 2 ri rij i E–X > X–X. The 4-coordinate molecule CH4 will be a perfect tetrahedron. Ammonia, which is NH3 E, will be tetrahedral to a first approximation. But the lone pair E will repel the N–H bonds more than they repel one another. Thus the E–N–H angle will increase from the tetrahedral value of 109.5◦ , causing the H–N–H angles to decrease slightly. The observed value of 106.6◦ is quite well accounted for. For water, OH2 E2 , the opening of the E–O–E angle will cause an additional closing of H–O–H, to 104.5◦ . The electron-deficient molecule BF3 has just 3 electron pairs in the valence shell. It will therefore have a trigonal planar configuration. Likewise, the carbene radical CH2 , a rare case of divalent carbon, will be a bent molecule since the lone pair occupies one of the vertices. The 6-coordinated compound SF6 has a regular octahedral configuration. A lone pair, as in BrF5 , occupies one octahedral vertex, leaving a square pyramidal molecular framework. In xenon tetrafluoride, the 8 valence electrons from Xe form four Xe–F bonds, leaving two lone pairs. Since the lone pairs are maximilly repulsive, they will be as far apart as possible—on opposite sides of the octahedron. This leaves the XeF4 molecule in a square planar configuration.

188 Introduction to Quantum Mechanics

The 5-coordination compounds show some more exotic possibilities. PF5 has a trigonal bipyramidal shape with inequivalent axial and equatorial positions. The lone pair in SF4 chooses an equatorial position since it can do less “damage” there—it make a 90◦ angle with two F atoms, whereas an axial position would make 90◦ angles with three F atoms. This leaves SF4 with a shape resembling a distorted see-saw. The two lone pairs in ClF3 are most stable when located in two equatorial positions, separated by 120◦ . This leaves ClF3 in a distorted tee shape. The complex that forms between I− and I2 in aqueous solution is a linear ion. Consider I− as the central atom, with 8 valence electrons. Two electrons form bonds to other I atoms, leaving three lone pairs, which prefer to occupy all three equatorial positions, leaving the bonding pairs in the axial positions. Following are illustrations of the molecules we have described:

Multiple bonds behave very similarly to single-bond pairs, apart from being slightly more repulsive. CO2 is a linear molecule with the two double bonds

The chemical bond Chapter | 11 189

accounting for all the valence electrons. Double-bonded carbon =C forms

a trigonal structure, as expected for sp 2 hybrids. Triple-bonded carbon (≡C–) forms a linear structure, as expected for sp hybrids. Sulfate anion SO2− 4 has two single bonds and two double bonds (individual bonds resonating between these). Consistent with the valence-shell model, the four bonds are in a tetrahedral configuration. Unpaired electrons, as in free radicals, behave much like lone pairs, but are slightly less repulsive. The bonding in NO2 can be described by contributing ◦ ◦ resonance structures including O=N=O and O=N +−O− . The valence shell model predicts a bent molecule with the structure:

For more than six electron pairs, the geometries of valence-shell theory are no longer as simple to determine. For n = 7, two reasonable alternatives are a pentagonal bipyramid and a capped or distorted octahedron. Iodine heptafluoride, IF7 , has a pentagonal bipyramidal geometry. Xenon hexafluoride, XeF6 , also has seven electron pairs—six bonding pairs plus one lone pair. The structure of is believed to be a distorted octahedron, complicated by time-dependence of the fluorine positions—what is known as a fluxional molecule. For n = 8, a possible geometry is a square antiprism, obtained by twisting one face of a cube by 45◦ with respect to the opposite face. This is most likely the shape of the ion XeF−2 8 .

11.7

Transition metal complexes

Transition metal ions, with their unoccupied d-orbitals, are excellent electron acceptors or Lewis acids. Molecules or ions known as ligands can donate electron pairs to the metal ion, thus acting as Lewis bases. These are said to coordinate to the Lewis acid via coordinate covalent bond to form a coordination compound. Most transition metal ions have valence-shell d-electrons in excess of the number needed for bonding to the ligands. The geometry of the complex ion is determined by the most stable arrangement of its ligands—an octahedron for 6 ligands, a trigonal bipyramid for 5 and a tetrahedron for 4 (or, in certain cases, a square planar configuration). Crystal field theory was proposed by Hans Bethe in 1929 (with later elaborations by J. H. Van Vleck) to describe the nonbonding d-electrons on the central atom. This is a simple electrostatic model in which the ligands are idealized as negative point charges. In the absence of

190 Introduction to Quantum Mechanics

the crystal field, the five d-orbitals are degenerate. When ligands coordinate to the metal, however, the individual d-orbitals are shifted in energy. In the case of an octahedral crystal field, the metal atom is surrounded by six identical ligands in a regular octahedron, oriented as shown in Fig. 11.9. The dz2 and the dx 2 −y 2 orbitals point directly at ligands along cartesian axes, and are thus raised in energy by the repulsive interactions. By contrast the dxy , dy 2 −z2 and dz2 −x 2 orbitals are more successful at avoiding the ligands by pointing at 45◦ angles to the cartesian axes. As a result, the five-fold degeneracy of the d-orbitals is partially resolved by the octahedral field into two levels, as shown in Fig. 11.10. The doubly-degenerate upper level and the triply-degenerate lower level are designated eg and t2g , respectively, in group-theoretical notation (which will be explained in Chap. 15). The crystal field splitting is denoted by (or, in older notation 10Dq).

FIGURE 11.9 The five 3d orbitals in an octahedral crystal field.

There can be anywhere from 12 to 22 valence electrons in an octahedral complex. The first twelve of these are used for bonding to the six ligands. The remaining ones occupy atomic d-orbitals which interact with the crystal field. A simple example is the hexaquatitanium(III) octahedral complex [Ti(H2 O)6 ]3+ , which has a single 3d electron after bonding to the ligands has been accounted for. In the ground state, this electron occupies one of the degenerate t2g orbitals. The violet color of this complex is the result of absorption of light in excitation to the eg level. The transition eg ← t2g occurs at a maximum at 20,300 cm−1 , which is evidently the approximate value of . Crystal-field splittings are typically in the range of 7000–30,000 cm−1 . (The visible region is approximately 14,000–24,000 cm−1 .) Extensive chemical experience has led to a spectrochemical series, an ordering of ligands by the strength of their crystalfield splittings. Following is a partial listing of some common ligands:

The chemical bond Chapter | 11 191 − − − CO > CN− > NO− 2 > en > NH3 > H2 0 > OH > F > Cl

Here “en” stands for ethylenediamine NH2 CH2 CH2 NH2 , a bidentate ligand. The dividing line between strong- and weak-field ligands is often considered to be between NH3 and H2 0. The variability of color depending on the crystalfield environment for some octahedral cobalt (III) complexes is illustrated by the following list: Complex Ion [Co(CN)6 ]3− [Co(en)3 ]3+ [Co(NH3 )6 ]3+ [Co(H2 O)6 ]3+ [CoF6 ]3−

λmax (nm) 310 340, 470 437 400, 600 700

Color pale yellow yellow-orange yellow-orange deep blue green

FIGURE 11.10 Splitting of d-orbitals in octahedral crystal field.

For d-shells containing between 4 and 7 electrons, the magnitude of influences the magnetic properties of the complex. Consider for example two different octahedral complexes of ferric ion Fe(III) with five d-electrons. For [Fe(CN)6 ]3− , ≈ 30, 000 cm−1 while for [FeF6 ]3− , ≈ 10, 000 cm−1 . The first of these is found to be a high-spin complex with spin S = 5/2, while the second is a low spin complex with S = 1/2. This behavior can be rationalized on the basis of Aufbau principles and Hund’s rules, as shown in the schematic energylevel diagrams below. For [FeF6 ]3− , the eg and t2g energies are close enough together that the increased stability due to exchange integrals among 5 parallel electron spins more than compensates for the small excitation energy. This is analogous to what was encountered for atomic chromium (see Sect. 10.4), where the 4s3d 5 ground-state electron configuration was preferred over the 4s 2 3d 4 .

192 Introduction to Quantum Mechanics

High-spin and low-spin states are readily distinguished by the magnetic properties of a complex. Since the environment of the central atom is not spherically symmetrical, the orbital angular momentum is not conserved and makes no contribution to magnetic behavior. The orbital angular momentum is said to be quenched by the crystal field.

FIGURE 11.11 Octahedral and tetrahedral arrangements of ligands inscribed in a cube.

We will briefly consider the crystal field splitting of the d-orbitals in fourcoordinate, tetrahedral complexes. The cube, octahedron and tetrahedron are related geometrically. Octahedral coordination results when ligands are placed in the centers of cube faces while tetrahedral coordination results when ligands are placed on alternate corners of a cube, as shown in Fig. 11.11. Fourcoordinate complexes are most likely to occur when the total number of valence electrons—from metal plus ligands—is less than 18. In a tetrahedral complex, none of the five d-orbitals point directly at ligands, but the dxy , dxz and dyz orbitals do come close. Thus these three orbitals will be raised in energy. The dz2 and the dx 2 −y 2 orbitals point toward the sides of the cube, farther away from the attached ligands, so these two are lower in energy. This pattern is inverted compared to octahedral complexes. The two-fold degenerate lower level is designated e, while the three-fold degenerate upper level is t2 . The subscript g (gerade or even) is not relevant because the tetrahedron does not have a center of symmetry. Several complexes, principally of Ni, Pd and Pt, with d 8 configurations form square planar complexes. To rationalize this geometry using crystal field theory, imagine an octahedral complex tetragonally distorted such that the two ligands on the z-axis are removed. The dx 2 −y 2 then becomes a uniquely “bad” orbital in that it points directly at the four remaining ligands. A good “strategy” for the d 8 -complex is to doubly fill the four other d-orbitals. Some well-know square planar complexes include [Ni(CN)4 ]2− , [Pt(NH3 )4 ]2+ , [Cu(NH3 )4 ]2+ and [PdCl4 ]2− . A class of square planar complexes of major biological significance are the porphyrins.

The chemical bond Chapter | 11 193

Porphyrins have a planar structure consisting of four pyrrole units linked by four methine bridges. The outer periphery is an aromatic system containing 18 delocalized π-electrons, fulfilling Hückel’s 4N + 2 rule. The four nitrogen atoms are in a perfect configuration to act as ligands. Heme proteins are iron-porphyrin complexes. They are the prosthetic groups in hemoglobins and myoglobins, responsible for oxygen transport and storage. The heme group is also found in cytochromes, which are vital electron-transporting molecules. Chlorophylls, central to photosynthesis in green plants, are magnesium-porphyrin complexes. Vitamin B-12, essential to the metabolism of proteins, carbohydrates and fats, is a cobalt-porphyrin structure. The geometry of coordination compounds can not always be determined systematically. Ni(II), for example, can form octahedral complexes such as [Ni(H2 O)6 ]2+ , tetrahedral complexes such as [Ni(CO)4 ]2+ , square planar complexes such as [Ni(CN)4 ]2− , and even two different isomeric forms of [Ni(CN)5 ]3− , one a trigonal bipyramid, the other a square pyramid. The chemistry and spectroscopy of transition-metal ions is a vast subject, and we have only scratched the surface. For a more extended treatment, refer, for example, to Ballhausen and Gray, Molecular Orbital Theory, (Benjamin, New York, 1965).

11.8

The hydrogen bond

Hydrogen usually forms a single electron-pair covalent bond. If, however, H is bonded to one highly electronegative atom (F, O or N) and is in close proximity to another one, it can form a three-atom bridge known as a hydrogen bond. This can be represented as a resonance hybrid between the two structures comprising one covalent bond (length ≈ 0.97 Å) and one electrostatic attraction (separation ≈ 1.79 Å): X—H- - - -Y and X- - - -H—Y with the three atoms in a linear configuration. As quantum-mechanical model for the hydrogen bond, one can consider a proton moving in a double-well potential, as shown in Fig. 11.12. Hydrogen bonds are typically about 5% as strong as covalent bonds, but their collective effect can be quite significant. The fact that water is a liquid a room temperature (while the heavier compound H2 S remains a gas) is attributed to hydrogen bonding involving the two unshared electron pairs on each oxygen

194 Introduction to Quantum Mechanics

FIGURE 11.12 Double-well potential V representing hydrogen bond. Atom X is shown as more electronegative than atom Y. The proton wavefunction ψ is show in red.

atom. Both liquid and solid water contain networks of covalent and hydrogen bonds, as shown in Fig. 11.13:

FIGURE 11.13 Tetrahedral network of covalent (black) and hydrogen (red) bonds in water. Each tetrahedron has O at center, H at each vertex.

Each oxygen atom is associated on average with two hydrogen atoms. An oxygen instantaneously connected with one more or one less hydrogen produces an H3 O+ or an OH− ion, respectively. In ice, the optimal arrangement is a hexagonal structure which less dense than the liquid. Without exaggeration, life as we know it could not exist without hydrogen bonding. Recall that proteins are built up of amino-acid units connected by peptide bonds, as shown here:

The chemical bond Chapter | 11 195

While the sequence of amino acids determines the primary structure of a protein, hydrogen bonding determines the secondary structure. Linus Pauling deduced that the α-helix structure results when each peptide carbonyl is hydrogen bonded to the amino group of the fourth peptide along the chain, as shown in Fig. 11.14. An alternative protein structure, the β-sheet, is based on hydrogen bonding between neighboring peptide chains.

FIGURE 11.14 Schematic representation of alpha helix. Hydrogen bonds (dotted) connect carbonyl oxygens (red) to amino nitrogens (blue) four amino-acid units down the chain.

The genetic code carried by the DNA double-helix depends on the specificity of the base pairings thymine (T) to adenine (A) and cytosine (C) to guanine (G), which is determined by optimal configurations of hydrogen bonds, as shown in Fig. 11.15.

FIGURE 11.15 Base pairing in DNA via hydrogen bonds.

11.9

Critique of valence-bond theory

Valence-bond theory is over 90% successful in explaining much of the descriptive chemistry of ground states. VB theory is therefore particularly popular among chemists since it makes use of familiar concepts such as chemical bonds between atoms, resonance hybrids and the like. It can perhaps be characterized as a theory which “explains but does not predict.” VB theory fails to account for the triplet ground state of O2 or for the bonding in electron-deficient molecules

196 Introduction to Quantum Mechanics

such as diborane, B2 H6 . It is not very useful in consideration of excited states, hence for spectroscopy. Many of these deficiencies are remedied by molecular orbital theory, which we take up in the next two Chapters.

Chapter 11. Exercises 11.1. The electronic energy of a diatomic molecule can be approximated by the Morse function:

2 E(R) = D 1 − e−β(R−Re ) . Re is the equilibrium internuclear separation while D and β are constants. (i) Identify the dissociation energy De . (ii) Sketch the Morse function, labeling De and Re . (iii) Expand the Morse function up to terms quadratic in (R − Re ). Show that this approximates a harmonic oscillator potential and identify the force constant k. 11.2. The allene molecule CH2 =C=CH2 is known to have a linear geometry for the three carbon atoms. Rationalize this on the basis of hybridization of carbon AO’s. 11.3. Show that the total electron distribution in the acetylene molecule H–C≡C–H is cylindrically symmetrical. 11.4. Applying the valence-shell model, predict the shapes of each of the following molecules: H2 S, SF4 , XeF4 , SF6 , BrF5 , IF7 . 11.5. Predict the ground-state geometry of each of the following species: CH2 , SO2 , SO3 , NO− 3 , XeO3 . 11.6. Consider the structure of hydrogen peroxide H2 O2 based on (i) unhybridized atomic orbitals, (ii) the valence shell model. The experimental H–O–O angle is 94.8◦ . 11.7. As an alternative to the VSEPR approach, the following model for hypervalent molecules has been proposed [Pimentel, Rundle, Coulson, ca 1951]: (i) Consider the ground-state electron configuration of the central atom, e.g., S(s 2 px2 py pz ), Br(s 2 px2 py2 pz ), Xe(s 2 px2 py2 pz2 ). (No promotion, hybridization or d-orbitals!) (ii) Singly-occupied p-orbitals form electron-pair bonds with ligand atoms in the usual way. (iii). Doubly-occupied p-orbitals form electron-pair bonds with two ligand atoms, located on opposite sides of the central atom.

The chemical bond Chapter | 11 197

On the basis of the above model, predict the geometry of SF4 , BrF3 , BrF5 , XeF2 , XeF4 and XeF6 . Compare with the corresponding valence-shell predictions. 11.8. Determine the d-electron configurations of the following octahedral complexes: [Fe(H2 O)6 ]3+ , [Cr(CN)6 ]4− , [Co(NH3 )6 ]3+ , [Cu(H2 O)6 ]2+ . Note whether each is a high-spin or low-spin configuration.

Chapter 12

Molecular orbital theory of diatomic molecules Molecular orbital theory is a conceptual extension of the orbital model, which was so successfully applied to atomic structure. As has been playfully remarked, “a molecule is nothing more than an atom with more nuclei.” This may be overly simplistic, but we do attempt, as far as possible, to exploit analogies with atomic structure. Our understanding of atomic orbitals began with the exact solutions of a prototype problem—the hydrogen atom. We will begin our study of homonuclear diatomic molecules beginning with another exactly solvable prototype, the hydrogen molecule-ion H+ 2.

12.1

The hydrogen molecule-ion

The simplest conceivable molecule would be made of two protons and one electron, namely H+ 2 . This species actually has a transient existence in electrical discharges through hydrogen gas and has been detected by mass spectrometry. It also has been detected in outer space. The Schrödinger equation for H+ 2 can be solved exactly within the Born-Oppenheimer approximation. For fixed internuclear distance R, this reduces to a problem of one electron in the field of two protons, designated A and B. We can write 1 2 1 1 1 − ∇ − − + ψ(r) = E ψ(r), (12.1) 2 rA rB R where rA and rB are the distances from the electron to protons A and B, respectively. This equation was first solved by Burrau (1927), after separating the variables in prolate spheroidal coordinates. We will write down these coordinates but give only a pictorial account of the solutions. The three prolate spheroidal coordinates are designated ξ , η, φ. The first two are defined by ξ=

rA + rB , R

η=

rA − rB , R

(12.2)

while φ is the angle of rotation about the internuclear axis. The surfaces of constant ξ and η are, respectively, confocal ellipsoids and hyperboloids of revolution with foci at A and B. The two-dimensional analog should be familiar from analytic geometry, an ellipse being the locus of points such that the sum of the distances to two foci is a constant. Analogously, a hyperbola is the locus whose Introduction to Quantum Mechanics. https://doi.org/10.1016/B978-0-12-822310-9.00020-3 Copyright © 2021 Elsevier Inc. All rights reserved.

199

200 Introduction to Quantum Mechanics

difference is a constant. Fig. 12.1 shows several surfaces of constant ξ , η and φ. The ranges of the three coordinates are: 1 ≤ ξ ≤ ∞, −1 ≤ η ≤ 1, 0 ≤ φ ≤ 2π . The prolate-spheroidal coordinate system conforms to the natural symmetry of the H+ 2 problem in the same way that spherical polar coordinates were the appropriate choice for the hydrogen atom.

FIGURE 12.1 Prolate spheroidal coordinates.

The first few solutions of the H+ 2 Schrödinger equation are sketched in Fig. 12.3, roughly in order of increasing energy. The φ-dependence of the wavefunction is contained in a factor (φ) = eiλφ ,

λ = 0, ±1, ±2 . . . ,

(12.3)

which is identical to the φ-dependence in atomic orbitals. In fact, the quantum number λ represents the component of orbital angular momentum along the internuclear axis, the only component which has a definite value in systems with axial (cylindrical) symmetry. The quantum number λ determines the basic shape of a diatomic molecular orbital, in the same way that did for an atomic orbital. An analogous code is used: σ for λ = 0, π for λ = ±1, δ for λ = ±2, and so on. We are already familiar with σ - and π-orbitals from valence-bond theory. A second classification of the H+ 2 eigenfunctions pertains to their symmetry with respect to inversion through the center of the molecule, also known as parity (see Fig. 12.2). If ψ(−r) = +ψ(r), the function is classified gerade or even parity, and the orbital designation is given a subscript g, as in σg or πg . If ψ(−r) = −ψ(r), the function is classified as ungerade or odd parity, and we write instead σu or πu . Atomic orbitals can also be classified by inversion symmetry. However, all s and d AO’s are g, while all p and f orbitals are u, so no further designation is necessary. The MO’s of a given symmetry are numbered in order of increasing energy, for example, 1σg , 2σg , 3σg . The lowest-energy orbital, as we have come to expect, is nodeless. It must obviously have cylindrical symmetry (λ = 0) and inversion symmetry (g). It is designated 1σg since it is the first orbital of this classification. The next higher orbital has a nodal plane, with η = 0, perpendicular to the axis. This function still has cylindrical symmetry (σ ) but now changes sign upon inversion (u). It is designated 1σu , as the first orbital of this type. The

Molecular orbital theory of diatomic molecules Chapter | 12 201

FIGURE 12.2 Inversion symmetry classification.

next higher orbital has an inner ellipsoidal node. It has the same symmetry as the lowest orbital and is designated 2σg . Next comes the 2σu orbital, with both planar and ellipsoidal nodes. Two degenerate π-orbitals come next, each with a nodal plane containing the internuclear axis, with φ = const. Their classification is 1πu . The second 1πu -orbital, not shown in Fig. 12.3, has the same shape rotated by 90◦ . The 3σg orbital has two hyperboloidal nodal surfaces, where η = ±const. The 1πg , again doubly-degenerate, has two nodal planes, η = 0 and φ = const. Finally, the 3σu , the last orbital we consider, has three nodal surfaces where η = const.

FIGURE 12.3 H+ 2 molecular orbitals. Wavefunctions are positive in blue regions, negative in yellow regions.

An MO is classified as a bonding orbital if it promotes the bonding of the two atoms. Generally a bonding MO, as a result of constructive interference between AO’s, has a significant accumulation of electron charge in the region between the nuclei and thus reduces their mutual repulsion. The 1σg , 2σg , 1πu and 3σg are evidently bonding orbitals. An MO which, because of destructive interference, does not significantly contribute to nuclear shielding is classified

202 Introduction to Quantum Mechanics

as an antibonding orbital. The 1σu , 2σu , 1πg and 3σu belong in this category. Often an antibonding MO is designated by σ ∗ or π ∗ . The actual ground state of H+ 2 has the 1σg orbital occupied. The equilibrium internuclear distance Re is 2.00 bohr and the binding energy De is 2.79 eV, which represents a quite respectable chemical bond. The 1σu is a repulsive state and a transition from the ground state results in subsequent dissociation of the molecule.

12.2 The LCAO approximation In Fig. 12.4, the 1σg and 1σu orbitals are plotted as functions of z, along the internuclear axis. Both functions have cusps, discontinuities in slope, at the positions of the two nuclei A and B. The 1s orbitals of hydrogen atoms have these same cusps. The shape of the 1σg and 1σu suggests that they can be approximated by a sum and difference, respectively, of hydrogen 1s orbitals, such that ψ(1σg,u ) ≈ ψ(1sA ) ± ψ(1sB ).

(12.4)

FIGURE 12.4 H+ 2 orbitals plotted along internuclear axis.

This linear combination of atomic orbitals is the basis of the so-called LCAO approximation. The other orbitals pictured in Fig. 12.3 can likewise be approximated as follows: ψ(2σg,u ) ≈ ψ(2sA ) ± ψ(2sB ), ψ(3σg,u ) ≈ ψ(2pσA ) ± ψ(2pσB ), ψ(1πu,g ) ≈ ψ(2pπA ) ± ψ(2pπB ).

(12.5)

The 2pσ atomic orbital refers to 2pz , which has the axial symmetry of a σ -bond. Likewise 2pπ refers to 2px or 2py , which are positioned to form π-bonds. An alternative notation for diatomic molecular orbitals which specifies their atomic origin and bonding/antibonding character is the following: 1σg

1σu

2σg

2σu

3σg

3σu

1πu

1πg

σ 1s

σ ∗ 1s

σ 2s

σ ∗ 2s

σ 2p

σ ∗ 2p

π2p

π ∗ 2p

Molecular orbital theory of diatomic molecules Chapter | 12 203

Almost all applications of molecular-orbital theory are based on the LCAO approach, since the exact H+ 2 functions are too complicated to work with. The relationship between MO’s and their constituent AO’s can be represented in a correlation diagram, show in Fig. 12.5. LCAO involves addition of orbital functions. By contrast, in Chap. 9 we multiplied orbitals to construct two-electron wavefunctions. Confusion can be avoided by keeping in mind this simple rule: Orbital functions are added when they belong to the same electron, multiplied when they belong to different electrons. Composite many-electron wavefunctions, such as Slater determinants, contain sums of products, as required for antisymmetry.

FIGURE 12.5 Molecular-orbital correlation diagram. The 1s → 1σg , 1σu is similar to the 2s correlations.

12.3

MO theory of homonuclear diatomic molecules

A sufficient number of orbitals is available for Aufbau of the ground states of all homonuclear diatomic species from H2 to Ne2 . Table 12.1 summarizes the results. The most likely order in which the MO’s are filled is given by 1σg < 1σu < 2σg < 2σu < 3σg ∼ 1πu < 1πg < 3σu The relative order of 3σg and 1πu depends on which other MO’s are occupied, much like the situation involving 4s and 3d atomic orbitals. The results of photoelectron spectroscopy indicate that 1πu is lower up to and including N2 , but 3σg is lower thereafter.

204 Introduction to Quantum Mechanics

TABLE 12.1 Homonuclear diatomic molecules. Molecule H+ 2 H2 He2 He+ 2 Li2 Be2 B2 C2 N2 N+ 2 O2 O+ 2 F2 Ne2

Electron configuration 2 + g 2 1 1σg g+ 2 2 1σg 1σu 1 g+ 1σg2 1σu 2σg 3 u+ b 1σg2 1σu 2 u+ 1σg2 1σu2 2σg2 1 g+ 1σg2 1σu2 2σg2 2σu2 1 g+ . . . 1πu2 3 g− c . . . 1πu4 1 g+ . . . 1πu4 3σg2 1 g+ . . . 1πu4 3σg 2 g+ . . . 3σg2 1πu4 1πg2 3 g− c,e . . . 3σg2 1πu4 1πg 2 g . . . 1πu4 3σg2 1πg4 1 g+ . . . 1πu4 3σg2 1πg4 3σu2 1 g+

1σg

Bond order

De /eV

Re /Å

0.5

2.79

1.06

1

4.75

0.741

0

0.0009a

3.0

1

2.6

1.05

0.5

2.5

1.08

1

1.07

2.67

0

0.1

2.5

1

3.0

1.59

2

6.3

1.24

3

9.91

1.10

2.5

8.85d

1.12

2

5.21

1.21

2.5

6.78d

1.12

1

1.66

1.41

0

0.0036a

3.1

a Van der Waals bonding. b Lifetime ≈ 10−4 sec. c Note application of Hund’s rules. d Compare effect of ionization on binding energy. e Paramagnetism of O predicted by MO theory. 2

The term symbol , , . . . , analogous to the atomic S, P, D. . . symbolizes the axial component of the total orbital angular momentum. When a π-shell is filled (4 electrons) or half-filled (2 electrons), the orbital angular momentum cancels to zero and we find a term. The spin multiplicity is completely analogous to the atomic case. The total parity is again designated by a subscript g or u. Since the many electron wavefunctions are made up of products of individual MO’s, the total parity is odd only if the molecule contains an odd number of u orbitals. Thus a σu2 or a πu2 subshell transforms like g. For terms, the superscript ± denotes the sign change of the wavefunction under a reflection in a plane containing the internuclear axis. This is equivalent to a sign change in the variable φ → −φ. This symmetry is needed when we deal with spectroscopic selection rules. In a spin-paired πu2 subshell the triplet spin function is symmetric so that the orbital factor must be antisymmetric, of the form 1 (12.6) √ πx (1)πy (2) − πy (1)πx (2) . 2

Molecular orbital theory of diatomic molecules Chapter | 12 205

This will change sign under the reflection, since x → x but y → −y. We need only remember that a πu2 subshell will give the term symbol 3 g− . The net bonding effect of the occupied MO’s is determined by the bond order, half the excess of the number bonding minus the number antibonding. This definition brings the MO results into correspondence with the Lewis (valencebond) concept of single, double and triple bonds. It is also possible in MO theory to have a bond order of 1/2, for example, in H+ 2 which is held together by a single bonding orbital. A bond order of zero generally indicates no stable chemical bond, although helium and neon atoms can still form clusters held together by much weaker van der Waals forces. Molecular-orbital theory successfully accounts for the transient stability of a 3 u+ excited state of He2 , in which one of the antibonding electrons is promoted to an excited bonding orbital. This species has a lifetime of about 10−4 sec, until it emits a photon and falls back into the unstable ground state. Another successful prediction of MO theory concerns the + relative binding energy of the positive ions N+ 2 and O2 , compared to the neutral molecules. Ionization weakens the N–N bond since a bonding electron is lost, but it strengthens the O–O bond since an antibonding electron is lost. One of the earliest triumphs of molecular orbital theory was the prediction that the oxygen molecule is paramagnetic. Fig. 12.6 shows that liquid O2 is a magnetic substance, attracted to the region between the poles of a permanent magnet. The paramagnetism arises from the half-filled 1πg2 subshell. According to Hund’s rules the two electrons singly occupy the two degenerate 1πg orbitals with their spins aligned parallel. The term symbol is 3 g− and the molecule thus has nonzero spin angular momentum and a net magnetic moment, which interacts with an external magnetic field. Linus Pauling invented the paramagnetic oxygen analyzer, which was for a time extensively used in medical technology.

FIGURE 12.6 Demonstration showing blue liquid O2 attracted to the poles of a permanent magnet.

12.4

Variational computation of molecular orbitals

Thus far we have approached MO theory from a mainly descriptive point of view. To begin a more quantitative treatment, recall the LCAO approximation to

206 Introduction to Quantum Mechanics

the H+ 2 ground state, Eq. (12.4), which can be written ψ = c A ψ A + cB ψ B .

(12.7)

Using this as a trial function in the variational principle, we have ψ H ψ dτ , E(cA , cB ) = 2 ψ dτ

(12.8)

where H is the Hamiltonian from Eq. (12.1). In fact, these equations can be applied more generally to construct any molecular orbital, not just solutions for H+ 2 . In the general case, H will represent an effective one-electron Hamiltonian determined by the molecular environment of a given orbital. The energy expression involves some complicated integrals, but can be simplified somewhat by expressing it in a standard form. Hamiltonian matrix elements are defined by: HAA = ψA H ψA dτ, HBB = ψB H ψB dτ, (12.9) HAB = HBA = ψA H ψB dτ, while the overlap integral is given by SAB = ψA ψB dτ.

(12.10)

Presuming the functions ψA and ψB to be normalized, the variational energy (12.8) reduces to E(cA , cB ) =

2 H 2 cA AA + 2cA cB HAB + cB HBB 2 + 2c c S 2 cA A B AB + cB

.

(12.11)

To optimize the MO, we find the minimum of E wrt variation in cA and cB , as determined by the two conditions: ∂E = 0, ∂cA

∂E = 0. ∂cB

(12.12)

The result is a secular equation determining two values of the energy: HAA − E HAB − ESAB = 0. (12.13) H − ES HBB − E AB AB For the case of a homonuclear diatomic molecule, for example H+ 2 , the two Hamiltonian matrix elements HAA and HBB are equal, say to α. Setting HAB = β and SAB = S, the secular equation reduces to α − E β − ES 2 2 (12.14) β − ES α − E = (α − E) − (β − ES) = 0,

Molecular orbital theory of diatomic molecules Chapter | 12 207

with the two roots E± =

α±β . 1±S

(12.15)

The calculated integrals α and β are usually negative, thus for the bonding orbital α+β (bonding), (12.16) E+ = 1+S while for the antibonding orbital E− =

α−β 1−S

(antibonding).

(12.17)

Note that (E − − α) > (α − E + ), thus the energy increase associated with antibonding is slightly greater than the energy decrease for bonding. For historical reasons, α is called a Coulomb integral and β, a resonance integral.

12.5

Heteronuclear molecules

The variational computation leading to Eq. (12.13) can be applied as well to the heteronuclear case in which the orbitals ψA and ψB are not equivalent. The matrix elements HAA and HBB are approximately equal to the energies of the atomic orbitals ψA and ψB , respectively, say EA and EB with EA > EB . It is generally true that |EA |, |EB | |HAB |. With these simplifications, secular equation can be written EA − E HAB − ESAB = H − ES EB − E AB AB (EA − E)(EB − E) − (HAB − ESAB )2 = 0.

(12.18)

This can be rearranged to E − EA =

(HAB − ESAB )2 . E − EB

(12.19)

To estimate the root closest to EA , we can replace E by EA on the right hand side of the equation. This leads to E − ≈ EA +

(HAB − EA SAB )2 , E A − EB

(12.20)

(HAB − EB SAB )2 . EA − EB

(12.21)

and analogously for the other root, E + ≈ EB −

208 Introduction to Quantum Mechanics

The relative energies of these AO’s and MO’s are represented by a correlation diagram, as shown in Fig. 12.7.

FIGURE 12.7 Correlation diagram for bonding and antibonding molecular orbitals in heteronuclear case.

An analysis of Eqs. (12.20)-(12.21) implies that, in order for two atomic orbitals ψA and ψB to form effective molecular orbitals the following conditions must be met: 1. The AO’s must have compatible symmetry. For example, ψA and ψB can be either s or pσ orbitals to form a σ -bond or both can be pπ (with the same orientation) to form a π-bond. 2. The charge clouds of ψA and ψB should overlap as much as possible. This was the rationale for hybridizing the s and p orbitals in carbon. A larger value of SAB implies a larger value for HAB . 3. The energies EA and EB must be of comparable magnitude. Otherwise, the denominator in (12.20) and (12.21) will be too large and the MO’s will not differ significantly from the original AO’s. A rough criterion is that EA and EB should be within about 0.2 hartree or 5 eV. For example, the chlorine 3p orbital has an energy of −13.0 eV, comfortably within range of the hydrogen 1s, with energy −13.6 eV. Thus these can interact to form a strong bonding (plus an antibonding) MO in HCl. The chlorine 3s with an energy of −24.5 eV could not form an effective bond with hydrogen, even if it were available. Generally, the greater the AO-energy difference, the more polar will be the bond. The limiting case is an ionic bond, in which no effective MO is formed but two electrons occupy the lower AO.

12.6 Electronegativity The character of a chemical bond is dependent on the electronegativity difference between the bonded atoms. Electronegativity was first introduced by Pauling as a measure of an atom’s power to attract electron charge. He proposed the following formula for the difference in electronegativity χ (chi) between

Molecular orbital theory of diatomic molecules Chapter | 12 209

two atoms X and Y:

1/2 1 , χ X − χ Y = 0.102 DXY − (DXX − DYY ) 2

(12.22)

where the D’s are the dissociation energies of the diatomic molecules expressed in kJ/mol (1 eV=96.485 kJ/mol). Following are conventional electronegativity values for a few common elements: Na χ

Si

B

P

H

C

S

N

Cl

O

F

0.93 1.90 2.04 2.19 2.20 2.55 2.58 3.04 3.16 3.44 3.98

An empirical relation between the electronegativity difference χ and the fractional ionic character of a covalent bond was suggested by Hannay and Smythe (1946): %ionic ≈ 16 χ + 3.5(χ)2 .

(12.23)

For a diatomic molecule, the fractional ionic character can be defined by μ μ(XY ) , = μ(X + Y − ) eRe

fraction ionic =

(12.24)

where μ is the observed dipole moment and Re , the equilibrium internuclear distance. For a 100% ionic bond, the positive and negative charges would be separated by a distance Re . As an example, HF has a dipole moment of 1.82 D. One debye unit (D) equals 3.336 × 10−30 C m. For charges ±e separated by R Å, a simple numerical relation is μ = 4.80R debye. For HF, Re = 0.916 Å, so the bond is 41% ionic. Eq. (12.23) with χ = 1.78 predicts 40% ionic character. Mulliken gave an alternative definition of electronegativity in terms of the ionization energy I and electron affinity A of an atom. The ionization energy equals E for the reaction X → X+ + e, while the electron affinity equals E for X− → X + e. To a rough approximation, I ≈ −EHOMO

and

A ≈ −ELUMO

(12.25)

Strictly speaking, these energies should pertain to the appropriate valence states of the atoms. Consider now the hypothetical alternative atomic ionization reactions: X + + Y− X + Y −→ X− + Y+ For the first reaction, E1 = IX − AY , while for the second E2 = IY − AX . If the electronegativities of X and Y were equal then we would have E1 = E2 and so IX + AX = IY + AY . This suggests Mulliken’s definition of the electronegativity of an element as the average of I and A (in eV): χ=

I +A . 2

(12.26)

210 Introduction to Quantum Mechanics

The Mulliken and Pauling scales are related approximately by χ ≈ 1.35 (χ )1/2 − 1.37. P

M

(12.27)

A parameter akin to electronegativity is hardness, introduced by Pearson and Parr (1983): I −A 1 ≈ (ELUMO − EHOMO ). 2 2 This is equal to E for the disproportionation reaction η=

X→

(12.28)

1 + 1 − X + X . 2 2

Such reactions become increasingly difficult as the hardness of X increases. Hardness is thus a measure of resistance to separation of charge; more generally, resistance to chemical reactivity, consistent with its relation to the HOMOLUMO energy gap.

Chapter 12. Exercises 12.1. After separation of variables in the H+ 2 problem, the function (ξ ) is found to obey the differential equation

d λ2 d 2 2 1 2 (ξ − 1) + A + 2R ξ + 4 R E ξ − 2 (ξ ) = 0 dξ dξ ξ −1 where A is a constant, R is the internuclear distance, λ is the angular momentum quantum number, an integer, and E is the energy, a negative number for bound states. Find the asymptotic solution of the above equation as ξ → ∞. 12.2. Consider the LCAO functions ψ = N (ψA ± ψB ) with ψA and ψB each normalized and their overlap integral equal to S. Show that ψ is normalized when N = [2(1 ± S)]−1/2 . 12.3. Write out the two-electron orbital function for the H2 molecule ψ(1, 2) = ψ1σ g (1) ψ1σ g (2) assuming the LCAO approximation for each MO. Expand this result and show how it relates to the corresponding valence-bond wavefunction. What is the meaning of the left-over terms? 12.4. Give the electron configuration, term symbol and bond order for the − ground state of each of the following species: N+ 2 , N2 and N2 .

Molecular orbital theory of diatomic molecules Chapter | 12 211

12.5. Predict the electronic configuration and term symbol for the ground state 2− of the superoxide ion O− 2 and of the peroxide ion O2 . 12.6. Propose electron configurations and term symbols for the two lowest singlet excited states of O2 . 12.7. Rationalize why the Be2 molecule, unlike He2 , is weakly bound (D2 ≈ 0.1 eV). Hint: 2s-2p hybridization is involved. 12.8. The overlap integral between a 1s and a 2pσ orbital on nuclei separated by a distance R (in bohr) is given by

R3 S = R + R2 + e−R 3 Determine the value of R which gives the maximum overlap. (It may be of interest that the internuclear distance in HF equals 0.916 Å.)

Chapter 13

Polyatomic molecules and solids 13.1

Hückel molecular orbital theory

Molecular orbital theory has been very successfully applied to large conjugated systems, especially those containing chains of carbon atoms with alternating single and double bonds. An approximation introduced by Hückel in 1931 considers only the delocalized p-electrons moving in a framework of σ -bonds. This is, in fact, a more sophisticated version of the free-electron model introduced in Chap. 3. We again illustrate the model using butadiene CH2 =CH–CH=CH2 . From four p atomic orbitals, designated p1 , p2 , p3 , p4 , with nodes in the plane of the carbon skeleton, one can construct four π molecular orbitals by an extension of the LCAO approach: ψ = c 1 p1 + c 2 p2 + c 3 p3 + c4 p4 .

(13.1)

Applying the linear variational method, the energies of the MO’s are the roots of the 4 × 4 secular equation H11 − H − S 12 12 ...

H12 − S12 H22 − ...

... ... ...

= 0.

(13.2)

Four simplifying assumptions are now made: (1) All overlap integrals Sij are sufficiently small that they can be set equal to zero. This is quite reasonable since the p-orbitals are directed perpendicular to the direction of their bonds. (2) All resonance integrals Hij between non-neighboring atoms are set equal to zero. (3) All resonance integrals Hij between neighboring atoms are set equal to β. (4) All coulomb integrals Hii are set equal to α. Introduction to Quantum Mechanics. https://doi.org/10.1016/B978-0-12-822310-9.00021-5 Copyright © 2021 Elsevier Inc. All rights reserved.

213

214 Introduction to Quantum Mechanics

The secular equation thus reduces to α − β 0 α− β β 0 β α− 0 0 β

0 0 = 0. β α −

(13.3)

Dividing by β 4 and defining

the equation simplifies further to x 1 0 0

x=

α− , β

1 x 1 0

0 1 x 1

(13.4)

0 0 = 0. 1 x

(13.5)

This is essentially the connection matrix for the molecule. Each pair of connected atoms is represented by 1, each non-connected pair by 0 and each diagonal element by x. Expansion of the determinant gives the 4th order polynomial equation x 4 − 3x 2 + 1 = 0.

(13.6)

Noting that this is a quadratic equation in x 2 , the roots are found to be x 2 = √ 3 ± 5 /2, so that x = ±0.618, ±1.618. This corresponds to the four MO energy levels = α ± 1.618β,

α ± 0.618β.

(13.7)

Since α and β are negative, the lowest MO’s have 1π = α + 1.618β

and

2π = α + 0.618β,

and the total π-electron energy of the 1π 2 2π 2 configuration equals Eπ = 2(α + 1.618β) + 2(α + 0.618β) = 4α + 4.472β.

(13.8)

The coefficients ci in (13.1) can now be found by solving four simultaneous equations. For the lowest energy orbital 1π : 4 i=1

(Hij − 1π δij )cj1π = 0

j = 1, . . . , 4,

(13.9)

Polyatomic molecules and solids Chapter | 13 215

and analogously for each of the higher MO’s. The normalized Hückel MO’s are given by ψ1π = 0.372p1 + 0.602p2 + 0.602p3 + 0.372p4 , ψ2π = 0.602p1 + 0.372p2 − 0.372p3 − 0.602p4 , ψ3π = 0.602p1 − 0.372p2 − 0.372p3 + 0.602p4 , ψ4π = 0.372p1 − 0.602p2 + 0.602p3 − 0.372p4 .

(13.10)

A schematic representation of these four orbitals is given in Fig. 13.1, with the scale of each p-orbitals proportional to its coefficient in (13.10). Note the topological resemblance to free-electron model wavefunctions in Sect. 3.3.

FIGURE 13.1 Hückel molecular orbitals for butadiene. Positive regions are shown as blue, negative as yellow. The corresponding free-electron model wavefunctions are drawn in the background.

The simplest application of Hückel theory, the ethylene molecule CH2 =CH2 , gives the secular equation x 1 (13.11) = 0. 1 x This is easily solved for the energies = α ± β. [Compare to Eqs. (12.16) and (12.17) with S = 0.] The lowest orbital has 1π = α + β and the 1π 2 ground state has Eπ = 2(α + β). If butadiene had two localized double bonds, as in its dominant valence-bond structure, its π-electron energy would be given by Eπ = 4(α + β). Comparing this with the Hückel result (13.8), we see that the energy lies lower than the that of two double bonds by 0.48β. The thermochemical value is approximately −17 kJmol−1 . This stabilization of a conjugated system is known as the delocalization energy. It corresponds to the resonancestabilization energy in valence-bond theory.

216 Introduction to Quantum Mechanics

Aromatic systems provide the most significant applications of Hückel theory. For benzene, we find the secular equation x 1 0 0 0 1

1 x 1 0 0 0

0 1 x 1 0 0

0 0 1 x 1 0

0 0 0 1 x 1

1 0 0 = 0, 0 1 x

(13.12)

with the six roots x = ±2, ±1, ±1. The energy levels are = α ± 2β and twofold degenerate = α ± β. With the three lowest MO’s occupied, we have Eπ = 2(α + 2β) + 4(α + β) = 6α + 8β.

(13.13)

Since the energy of three localized double bonds is 6α + 6β, the delocalization energy equals 2β. The thermochemical value is −152 kJmol−1 . A least-squares fit of a series of benzenoid hydrocarbons suggests the value |β| ≈ 2.72 eV.

FIGURE 13.2 Hückel MO’s for benzene. The coefficients of the carbon p atomic orbitals are proportional to the sizes of the circles (blue for positive, yellow for negative). Since these are p orbitals, the plane of the molecule is also a node. The e2u and b2g orbitals are unoccupied in the ground state.

The six π-electron molecular orbitals for benzene are diagrammed in Fig. 13.2. The group-theoretical notation for the orbitals will be explained in Chap. 15. It should be evident from the figure that the number of nodes increases 2 e4 . with energy. The ground-state electron configuration is a2u 1g

13.2 Conservation of orbital symmetry; Woodward-Hoffmann rules In the course of their synthesis of Vitamin B12 , R. B. Woodward and coworkers were puzzled by the failure of certain cyclic products to form from apparently appropriate starting materials—in particular, the stereochemistry of interconversions of cyclohexadienes with conjugated trienes in thermal and photochemical reactions. Woodward, in collaboration with Roald Hoffmann (ca 1965), discovered that the course of such reactions depended on identifiable symmetries of the participating molecular orbitals. The principle of conservation of orbital symmetry can be stated thus:

Polyatomic molecules and solids Chapter | 13 217

R. B. Woodward In the course of a concerted reaction, the MO’s of the reactant molecules are transformed into the MO’s of the products by a continuous pathway. A concerted reaction is one which takes place in a single step, through a transition state, but without the formation of reactive intermediates. Breaking of bonds in the reactants and formation of new bonds in the products takes place in one continuous process. Often this involves interconversion of σ - and π-bonds. Such reactions are generally insensitive to such factors as solvent polarity and catalysis but are characterized by a high degree of stereospecificity. As emphasized by Fukui, the mechanism of chemical reactions can often be understood in terms of frontier orbitals—the HOMO’s and LUMO’s of reacting molecules. Ideally, the frontier orbitals of the reactants interact to form the MO’s of the products. And it is in such transformations that orbital symmetry is conserved. We will consider two relevant examples from organic chemistry: electrocyclic reactions and cycloadditions. Simple examples of electrocyclic reactions are the formation of cyclobutene from butadiene and cyclohexadiene from hexatriene:

To appreciate the conformational implications of orbital-symmetry conservation we consider these two reactions with groups R1 and R2 replacing two of the terminal hydrogens. (As per convention, the remaining hydrogens are not drawn.) If the reactions are carried out under thermal conditions, they proceed as follows:

218 Introduction to Quantum Mechanics

The butadiene reaction gives a trans-configuration of substituents R1 and R2 while the hexatriene gives a cis-configuration. If, on the other hand, the reactions are photochemically induced, the opposite configurations are produced:

The stereoselectivity of the above reactions can be explained by the geometry of the highest-occupied molecular orbitals (HOMO’s). For butadiene, the HOMO for the ground electronic state is the orbital ψ2 in Fig. 13.1. Ring closure occurs when the two terminal p-orbitals reorient themselves to create a σ -bond, as shown in Fig. 13.3. Since the p1 and p4 lobes are out of phase, the orbitals rotate in the same direction—conrotatory—to give a positive overlap necessary for bonding. This is accompanied by the substituents R1 and R2 moving to opposite sides of the ring, into a trans configuration. This accounts for the geometry of the thermal electrocyclic reaction of butadiene. The reaction can be alternatively induced photochemically by irradiation with ultraviolet. What happens then is electrons in the ψ2 orbital are excited to ψ3 . As is evident from Fig. 13.1, the p1 and p4 lobes for this new HOMO are now in phase. They must now rotate in opposite directions—disrotatory—to give a bonding overlap. Thus R1 and R2 wind up on the same side of the ring, the cis configuration.

FIGURE 13.3 Woodward-Hoffmann rules for conservation of orbital symmetry in electrocyclic ring closure, as determined by relative phases in HOMO.

Polyatomic molecules and solids Chapter | 13 219

For ring closure in hexatriene, you can easily show that the HOMO for the ground state has its terminal lobes in phase, while the photochemically induced state has its terminal lobes out of phase. Thus the cis-trans stereospecificity is exactly the opposite of that for butadiene. The general result can be formalized by the Woodward-Hoffmann rule for concerted electrocyclic reactions: If the total number of electrons in the transition state equals 4n [4n + 2], the thermal reaction will produce the conrotarory [disrotatory] configuration while the photochemical reaction will produce the disrotatory [conrotarory] configuration. The prototype of a cycloaddition is the Diels-Alder reaction between a diene and a dienophile. Two examples:

Fig. 13.4 shows two possible ways for this to happen: the HOMO of the diene can combine with the LUMO of the dienophile or the LUMO of the diene with the HOMO of the dienophile. The thermal reaction with this 6-electron transition state is allowed but the corresponding photochemical mechanism is forbidden. More generally, the Woodward-Hoffmann rule for concerted cycloaddition reactions can be stated: If the number of electrons in the transition state equals 4n [4n + 2], then the photochemical [thermal] reaction will be allowed but the thermal [photochemical] reaction will be forbidden.

FIGURE 13.4 Two possible orbital-symmetry combinations in Diels-Alder reaction.

The hydrogen-iodine reaction H2 + I2 → 2HI was one of the first whose kinetics were studied in detail. It had long been assumed that the reaction proceeded through a square intermediate, followed by breaking of H–H and I–I bonds and simultaneous formation of H–I bonds. Application of Woodward-Hoffmann orbital-symmetry concepts shows, however, that such a mechanism could not possibly explain the course of the reaction. According to frontier-orbital picture, the formation and dissociation of this intermediate must involve electron flow either from the hydrogen HOMO to the iodine LUMO or from the iodine HOMO to the hydrogen LUMO. The symmetries of these valence-shell MO’s are the same as those illustrated in Fig. 11.3.

220 Introduction to Quantum Mechanics

The hydrogen HOMO is a σg bonding orbital while the LUMO is a σu antibonding orbital. For iodine, the HOMO is a πg antibonding orbital while the LUMO is a σu antibonding orbital from pσ -pσ overlap. The hydrogen HOMO and iodine LUMO are symmetry-incompatible. The hydrogen LUMO-iodine HOMO interaction would be symmetry-compatible but further analysis indicates that this intermediate would not lead to the desired products. As electrons flow into the hydrogen LUMO the H–H bond would indeed weaken, but the I–I bond would strengthen as electrons vacated the π antibonding orbital. Thus no H–I bonds are likely to be formed. Subsequent calculations and experiments confirmed that this was not a concerted reaction but rather proceeded through a sequence of steps beginning with the dissociation of I2 into iodine atoms.

13.3

Band theory of metals and semiconductors

The importance of metals and semiconductors to modern technology is difficult to overestimate. We consider in this Section the band theory of solids, which can account for many of the characteristic properties of these materials. The LCAO approximation, including the Hückel model, exhibits a “conservation law” for orbitals in which the number of molecular orbitals is equal to the number of constituent atomic orbitals. Consider, for example, a 3-dimensional array of n sodium atoms, each contributing one 3s valence electron. Two overlapping AOs will interact to form one bonding plus one antibonding orbital. Three AOs will give, in addition, an orbital of intermediate energy, essentially nonbonding. Continuing the process, as sketched in Fig. 13.5, n interacting AOs will produce a stack of n MOs, with the lower energy orbitals being of predominantly bonding character, the upper ones, of antibonding character. The n electrons will fill half of the available energy levels. As n increases, the spacing between successive levels decreases, until for n ∼ 1023 the discrete levels merge into a continuous energy band. The valence electrons become delocalized over the entire crystal lattice, consistent with the Drude-Sommerfeld model of a metal as an electron gas surrounding cores of positive ions. This simple model accounts for many of the familiar attributes of metals. High electrical and thermal conductivity are obvious consequences of the large number of mobile electrons. Metals are usually malleable and ductile because metallic bonding, although strong, is nondirectional and tolerant of lattice deformation. Metals can be usually recognized by their shiny appearance or “metallic luster,” their ability of reflect light. The high-frequency electromagnetic fields of light induces oscillations of the loosely-bound electrons near the metal surface. These vibrating charges, in turn, reemit radiation, equivalent to a reflection of the incident light. The closely-spaced energy levels in the conduction bands allow metals (with notable exceptions of copper and gold) to absorb all wavelengths across the visible range. Elements with a valence-shell configuration ns 2 , such as beryllium and magnesium, might be expected to have completely filled bands and thus behave as

Polyatomic molecules and solids Chapter | 13 221

FIGURE 13.5 Approach to continuous energy bands in a crystal with increasing number of interacting atoms n.

FIGURE 13.6 Energy bands in sodium as function of atomic spacing. After J. C. Slater, Phys. Rev. 45 794 (1934). Dotted line represents equilibrium spacing.

nonmetals. However the nearby p-orbitals likewise form a band which overlaps the upper part of the s-band to give a continuous conduction band with an abundance of unoccupied orbitals. Transition metals can also contribute their d-orbitals to the conduction bands. Fig. 13.6 is a detailed plot of the band structure of metallic sodium, which shows how combinations of s, p and d energy bands can overlap. Only the outermost atomic orbitals are involved in band formation. Inner atomic orbitals remain localized and are not involved in bonding or electrical conduction. The band theory of solids very succinctly describes the essential differences between conductors, insulators and semiconductors, as shown in Fig. 13.7. A metallic conductor possesses either a partially filled valence band or overlapping valence and conduction bands so that electrons can be excited into the empty levels by an external electric field. Energy bands in crystalline solids can be separated by forbidden zones or bandgaps. When the orbitals below a sufficiently-large bandgap are completely filled, the element or compound be-

222 Introduction to Quantum Mechanics

comes an insulator. In sodium chloride, for example, the Cl 3s, 3p band is completely filled by the valence electrons and separated by a large gap from the empty Na 3s band. Thus the NaCl is an ionic crystal made up of Na+ and Cl− units but no delocalized electrons at moderate temperatures.

FIGURE 13.7 General features of band structure for insulators, semiconductors and conductors. Blue regions show levels filled by electrons at room temperature.

Good insulators have bandgaps Eg of at least 5 eV. Semiconductors are materials with smaller bandgaps, of the order of 1 eV. For example Eg = 1.12 eV for Si, 0.66 eV for Ge. Electrons can be excited into the conduction band if they absorb sufficient energy. In an intrinsic semiconductor, weak conductivity can be achieved by thermal excitation, as determined by the magnitude of the Boltzmann factor e−Eg /kT . At 300 K, kT corresponds to 0.026 eV. The conductivity of a semiconductor therefore increases with temperature (opposite to the behavior of a metal). Excitation energy can also be provided by absorption of light, so many semiconductors are also photoconductors. The spectacular success of the semiconductor industry is based on the production of materials selectively designed for specialized applications in electronic and optical devices. By carefully-controlled doping of semiconductors with selected impurities—electron donors or electron acceptors—the conductivity and other properties can be modulated with great precision. Fig. 13.8 shows schematically how doped semiconductors work. In an intrinsic semiconductor (a), conducting electron-hole pairs can only by produced by thermal or photoexcitation across the band gap. In (b), addition of a small concentration of an electron donor creates an impurity band just below the conduction band. Electrons can then jump across a much-reduced gap to the conduction band and act as negatively-charged current carriers. This produces a n-type semiconductor. In (c), an electron acceptor creates an empty impurity band just above the valence band. In this case electrons can jump from the valence band to leave positive holes. These can also conduct electricity since electrons falling into positive holes create new holes, a sequence which can propagate across the crystal, in the direction opposite to the electron flow. The result is a p-type semiconductor.

Polyatomic molecules and solids Chapter | 13 223

FIGURE 13.8 Band structures for (A) intrinsic semiconductor, (B) n-type, (C), p-type. Filled bands are shown in blue. Arrows show excitations creating electrons and holes.

The most popular semiconductor material is silicon (hence Silicon Valley). Fig. 13.9A is a schematic representation of a pure Si crystal. Each Si atom has 4 valence electrons and bonds to 4 other atoms to form Lewis octets. The crystal can become a conductor if some of the valence electrons are shaken loose. This produces both negative and positive charge carriers—electrons and holes. Much more important are extrinsic semiconductors in which the Si crystal is doped with impurity atoms, usually at concentrations of several parts per million (ppm). For example, Si can be doped with P (or As or Sb) atoms, which has 5 valence electrons. As shown in Fig. 13.9B, a P atom can replace a Si atom in the lattice. The fifth electron on the P is not needed for bonding and becomes available as a current carrier. Thus Si doped with P is a n-type semiconductor, as electrons can be excited from the donor band to the conduction band. If Si is instead doped with B (or Ga or Al), which has only 3 valence electrons, as shown in Fig. 13.9C, a B atom replacing a Si atom leaves an electron vacancy in one of its 4 bonds. Such positive holes can likewise become current carriers, making Si doped with B an p-type semiconductor, with electrons excited from the valence band to leave positive holes.

FIGURE 13.9 Lewis structures for pure and doped silicon crystals. (A) Pure silicon showing excitation of two electron-hole pairs. (B) Si doped with P, an electron donor. (C) Si doped with B, an electron acceptor.

Besides the semiconducting elements Si and Ge from Group IV of the periodic table, there exist III-V semiconducting compounds including GaAs, InP and GaN and II-VI compounds such as ZnS and CdTe. GaAs (Gallium arsenide), with a band gap of 1.43 eV, has been especially useful for solar cells, lightemitting diodes, lasers, and other optoelectronic devices.

224 Introduction to Quantum Mechanics

FIGURE 13.10 P-N junction showing distribution of electrons and holes. Current can flow (bottom left) only if the diode is forward biased.

FIGURE 13.11 Principle of metal-oxide-semiconductor field-effect transistor (MOSFET). A narrow conducting channel is created when the gate is at positive voltage, allowing electrons to pass from the source to the drain.

A key element in solid state electronics is the P-N junction, formed when p-type and n-type semiconductors are placed in contact, shown in Fig. 13.10. Electric current can flow through the junction in one direction (forward biased) but not in the opposite direction (reverse biased). The junction can act as a semiconductor diode. When electrons combine with holes in a forward biased P-N junction, the bandgap energy is released either as heat—which is usual for Si or Ge—or as radiation, in which case we have a light-emitting diode (LED). Diodes constructed using aluminum gallium arsenide (AlGaAs) can emit in the red and infrared regions. The device which has revolutionized modern electronic technology is the transistor, invented by Bardeen, Brattain and Shockley at Bell Laboratories in 1947. In digital circuits, particularly computers, transistors function as highspeed electronic switches. Transistors are building blocks for logic gates, RAM memory and other components in integrated circuits. In analog circuits, transistors are used as amplifiers and oscillators, having replaced vacuum tubes since the 1960s. A transistor is based on a three-layer assembly of doped semiconductors, either NPN or PNP. Most of the transistors manufactured today are metal-oxide-semiconductor field-effect transistors (MOSFETs). These have largely supplanted the original bipolar transistors. The leading developers of semiconductor technology have been Bell Laboratories, Fairchild Semiconduc-

Polyatomic molecules and solids Chapter | 13 225

tor, Texas Instruments and many other companies based in Silicon Valley and Japan. A simplified representation of the operation of a MOSFET is shown in Fig. 13.11. The gate is a metal electrode with a very thin insulating coating of its oxide. When the gate is uncharged, very little current flows between the source and the drain through the NPN sequence of semiconductor layers. When the gate becomes positively charged, its electric field causes a concentration of electrons to build up in a narrow channel opposite the gate. This closes the circuit between the source and the drain. A small change in the gate voltage can produce a large and rapid variation in the current. The transister can thereby act as a signal amplifier or as a high-speed electronic switch which can open and close several million times a second.

FIGURE 13.12 MOSFET and diode in an integrated circuit. The substrate here is p-type silicon.

Today, most microelectronic technology is based on integrated circuits (ICs). These are assemblies containing thousands or millions of microscopic-size electronic components—resistors, capacitors, diodes, transistors—and their connections built into a chip of semiconductor (in most cases, silicon) called the substrate. Fig. 13.12 shows a simplified representation of a MOSFET and a diode as components of an integrated circuit.

13.4

Computational chemistry

There is no question that quantum mechanics provides the correct mathematical framework for description of all chemical phenomena. However its fundamental equation can be solved exactly for only a single problem—the hydrogenlike system. Every other application of quantum mechanics to atoms or molecules involves approximations of varying levels of sophistication. Some of these, although rather rudimentary, can provide useful models for some aspect of chemical behavior. For many purposes, however, it is necessary to seek more accurate solutions to the Schrödinger equation, the goal being “chemical accuracy,” which is usually understood to mean an error less than about 1 kJ/mol

226 Introduction to Quantum Mechanics

(approximately .001 hartree). The application of electronic computers to chemical problems during the last third of the Twentieth Century is now considered a specialty in its own right—computational chemistry—taking its place alongside the traditional modes of experimental and theoretical research. Computational chemistry now routinely contributes to the design and synthesis of materials with novel properties, the understanding of complex biological processes and the rational design of therapeutic drugs.

C. C. J. Roothaan

John Pople

Roald Hoffmann Computational chemistry had its roots in the early attempts to solve the Schrödinger equation for two-electron systems, notably the work of Hylleraas in 1929 and of Coolidge and James in 1933. At its most advanced level, this work was performed using hand-cranked calculating machines. In the 1950s, with the advent of digital computers, accurate computations became possible for molecular systems with up to 5 atoms. By the 1970s, these could be extended to as many as 20 atoms. Spectacular advances in computing technology and algorithms in subsequent years has made it possible now to do computations, to varying levels of rigor, even on biological molecules containing thousands of atoms.

Polyatomic molecules and solids Chapter | 13 227

Formulation of the methods of computational chemistry is reasonably straightforward. The hard work comes in its implementation. We begin with the electronic Hamiltonian for a molecule, a generalization of that for a manyelectron atom given in Eq. (10.2): ZA ZB 1 ZA 1 + , − ∇i2 + − + H= 2 riA rij RAB i

iA

i kT .

(17.98)

Thus, in most cases, Boltzmann factors for excited electronic states are entirely negligible. Notable exceptions are a few molecules with multiplet ground states, including NO, O2 , NO2 and ClO2 , which have low-lying excited electronic states. The electronic partition function qelec = g0 e−0 /kT + g1 e−1 /kT + . . . ,

(17.99)

where g0 , g1 , etc. are the degeneracies of the electronic levels. Unlike qrot and qvib , this partition function cannot be explicitly evaluated in any general way. However, in view of the fact that the second (and following) terms are negligible, one can usually include just the first term. For atoms, it is simplest to choose the energy origin such that 0 = 0. For diatomic molecules, recall that the v = 0 vibrational energy level lies at an energy De − D0 , approximately 12 hν, above the minimum of the potential-energy curve. It is convenient to define the origin as the energy of the separated atoms at rest, which shifts the ground state to 0 = −D0 . In view of the preceding discussion, we can usually write, for a

310 Introduction to Quantum Mechanics

diatomic molecule, qelec = g0 eD0 /kT .

(17.100)

Electronic contributions to the thermodynamic functions can then be found from ln Qelec = N ln g0 +

N D0 , kT

(17.101)

where D0 is interpreted as zero for atoms. The energy and heat capacity are given by Uelec = −N D0 ,

CVelec = 0,

(17.102)

excluding the exceptional cases of low-lying excited states. The electronic contribution to entropy is given by Selec = N k ln g0 .

(17.103)

= R ln 3 per mole. For example, the entropy of O2 is Selec

17.12

Summary

FIGURE 17.3 Thermodynamic functions from molecular parameters.

Fig. 17.3 shows a schematic flowchart for computation of thermodynamic functions from molecular parameters. Table 17.1 summarizes our key results for the statistical thermodynamics of monatomic and polyatomic ideal gases. TABLE 17.1 Statistical thermodynamics formulas Translation

Rotation (linear)

q

(2π mkT )3/2 V / h3

kT σ hcB

Um

1 2 RT

RT

3 2 RT

CV,m

1 2R

R

3 2R

Sm

R ln

(2π mkT )3/2 V h3 N A

m

e5/2

R ln

ekT σ hcB

Rotation (nonlinear) 3 1/2 1 σ

R ln

π ABC

1 σ

kT hc

π ABC

Vibration

Electronic

(1 − e−u )−1

g0 eD0 /kT

RT R

1/2 ekT 3/2 hc

R

u eu −1

Vibrational contributions per normal mode i: u = hcν˜ i /kT .

−NA D0

u eu −1

u2 e u (eu −1)2

0

−ln(1−e−u )

R ln g0

Statistical thermodynamics Chapter | 17 311

Supplement 17A. Low-temperature heat capacity of hydrogen molecules H2 is the lowest boiling point molecular species, remaining in the gas phase down to 20 K. The rotational and vibrational characteristic temperatures are rot = 87.5 K and vib = 5986 K, respectively. At and above room temperature, T ≈ 300 K, vibration is unexcited while rotation is almost fully excited. The rotational contribution to heat capacity accordingly approaches its equipartition value, R per mole. Owing to its exceptionally small moment of inertia, rotation becomes inactive at temperatures below about 50 K. Below room temperature, the heat capacity as a function of temperature behaves anomalously, as was first explained by Dennison in 1927. Since H2 is a homonuclear molecule, only half of its rotational states are accessible. In the singlet nuclear-spin state, known as parahydrogen (p-H2 ), only even-J rotational states are accessible, while in the triplet nuclear-spin state, known as orthohydrogen (o-H2 ), only odd-J rotational states are accessible. The molecular partition functions representing the rotational and nuclear spin degrees of freedom are thereby given by para qrot = (2J + 1)e−J (J +1)rot /T (17.104) J even

or ortho =3 qrot

(2J + 1)e−J (J +1)rot /T ,

(17.105)

J odd

where the factors 1 and 3 represent the degeneracies of the para and ortho nuclear-spin states, respectively. The rotational contribution to heat capacity per mole can be calculated using ∂ ln qrot ∂ RT 2 . (17.106) Crot (T ) = ∂T ∂T Fig. 17.4 shows plots for o-H2 and p-H2 . At room temperature, hydrogen gas consists of a 3:1 mixture. The two forms do not interconvert unless a catalyst, such as activated charcoal or platinum is present, so that ratio will persist as the temperature is lowered. The results for the isotopomer HD, a heteronuclear diatomic molecule with σ = 1 and rot = 65.7 K, is also shown in the figure.

Chapter 17. Exercises 17.1. Boltzmann’s famous formula for the entropy of a system at equilibrium is S = k ln ,

(17.107)

where represents the number of microstates consistent with a macroscopic state. The molar partition function can be written

312 Introduction to Quantum Mechanics

FIGURE 17.4 Heat capacities of ortho and para H2 and HD.

Q=

gn e−En /kT ,

(17.108)

n

where the summation is over energy levels, rather than individual states, and gn represents the degeneracy of the energy level En . Assume that the most probable energy is so overwhelmingly dominant that all other states can be neglected. Setting gn = and En = U , show that the statistical mechanical formula for entropy S = U T + k ln Q (Eq. (17.44)) leads to Boltzmann’s formula. 17.2. The molecules in the atmosphere, mostly N2 and O2 experience a gravitational potential energy E(h) = mgh,

(17.109)

where m is the molecular mass (assume, for simplicity, an value consistent with the statistical average molecular weight of N2 and O2 , M ≈ .029), g ≈ 9.8 m/sec2 is the acceleration of gravity and h is the altitude above sea level, in meters. Assuming that the atmosphere is at a constant temperature, with a pressure p(0) at sea level, and neglecting kinetic energy and other contributions to the gas molecules, use the Boltzmann distribution to derive the barometer formula for the pressure at an elevation of h meters: p(h) = p(0)e−Mgh/RT .

(17.110)

Estimate the average atmospheric pressure in Denver, with an elevation h ≈ 1600 m (1 mile). 17.3. The classical kinetic energy of a molecule moving in the x-direction with a speed vx is given by 1 (vx ) = mvx2 . 2

(17.111)

Statistical thermodynamics Chapter | 17 313

Derive an expression for the distribution of molecular speeds, taking −∞ < vx < ∞. Normalizing the distribution to a total probability of 1, show that the probability density is given by ρ(vx ) =

m 1/2 2 e−mvx /2kT . 2πkT

(17.112)

Next, derive the distribution of molecular speeds v = vx2 + vy2 + vz2 . Write the composite Boltzmann distribution for vx , vy , vz and normalize, using polar coordinates in velocity space. The result is the famous Maxwell-Boltzmann distribution: m 3/2 2 ρ(v) = 4πv 2 e−mv /2kT . (17.113) 2πkT The distribution is sketched at three different temperatures:

Maxwell-Boltzmann distribution of molecular speeds at three temperatures: T3 > T2 > T1 .

17.4. For iodine vapor I2 at 298 K, calculate the fraction of the molecules in the lowest three vibrational states, v = 0, 1, 2. Assume a vibrational constant ν˜ = 214.5 cm−1 . 17.5 The J th rotational level of a diatomic molecule has a degeneracy of 2J + 1, taking account of the possible values of M. At a given temperature T , which level is the most populated? 17.6 Apply the preceding result to the rotational levels of N2 at 298 K. The rotational constant is B = 1.9987 cm−1 . 17.7 Iodine vapor 127 I2 has the spectroscopic constants ν˜ =214.5 cm−1 and B=0.03737 cm−1 . Assuming the harmonic vibration and rigid rotor models, at 298 K. The experimental value is 36.90 calculate the molar value of Cp,m −1 −1 JK mol . of iodine vapor at 298 K. The experimental 17.8 Calculate the molar entropy Sm −1 −1 value is Sm = 260.69 JK mol .

314 Introduction to Quantum Mechanics of ammonia NH at 298 K. The 17.9 Calculate the molar heat capacity Cp,m 3 fundamental vibrational frequencies are ν1 = 3337, ν2 = 950, ν3 = 3414 is (two-fold), ν4 = 1627.5 (two-fold) cm−1 . The experimental value of Cp,m −1 −1 35.06 JK mol .

Chapter 18

Nuclear magnetic resonance Nuclear magnetic resonance (NMR) is a versatile and highly-sophisticated spectroscopic technique which has been applied to a growing number of diverse applications in science, technology and medicine. We will consider, for the most part, magnetic resonance involving 1 H and 13 C nuclei.

18.1

Magnetic properties of nuclei

In all our previous work, it has been sufficient to treat nuclei as structureless point particles characterized fully by their mass and electric charge. On a more fundamental level, as was discussed in Chap. 1, nuclei are actually composite particles made of nucleons (protons and neutrons), which are themselves made of quarks. The additional properties of nuclei which will now become relevant are their spin angular momenta and magnetic moments. Recall that electrons possess an intrinsic or spin angular momentum s which can have just two possible projections along an arbitrary direction in space, namely ± 12 . Since is the fundamental quantum unit of angular momentum, the electron is classified as a particle of spin one-half. The electron’s spin state is described by the quantum numbers s = 12 and ms = ± 12 . A circulating electric charge produces a magnetic moment μ proportional to the angular momentum J. This is written μ = γ J,

(18.1)

where the constant of proportionality γ is known as the magnetogyric ratio. The z-component of μ then has the possible values μz = γ mJ where mJ = −J, −J + 1, . . . , +J,

(18.2)

determined by space quantization of the angular momentum J. The energy of a magnetic dipole in a magnetic field B is given by E = −μ·B = −μz B,

(18.3)

where magnetic field defines the z-axis. The SI unit of magnetic field (more correctly, magnetic induction) is the tesla, designated T. Electromagnets used in NMR produce fields in excess of 10 T. Small iron magnets have fields around .01 T, while some magnets containing rare-earth elements such as NIB (niobiumiron-boron) reach 1 T. The Earth’s magnetic field is approximately 5 × 10−5 T Introduction to Quantum Mechanics. https://doi.org/10.1016/B978-0-12-822310-9.00026-4 Copyright © 2021 Elsevier Inc. All rights reserved.

315

316 Introduction to Quantum Mechanics

(0.5 gauss in alternative units), depending on geographic location. At the other extreme, a neutron star, which is really a giant nucleus, has a field predicted to be of the order of 108 T. The energy relation (18.3) determines the most conveniently units for magnetic moment, namely joules per tesla, J T−1 . For orbital motion of an electron, where the angular momentum is , the magnetic moment is given by μz = −

e m = −μB m , 2m

(18.4)

where the minus sign reflects the negative electric charge. The Bohr magneton is defined by e = 9.274 × 10−24 J T−1 . 2m The magnetic moment produced by electron spin is written μB =

μz = −g μB ms ,

(18.5)

(18.6)

with introduction of the g-factor. Eq. (18.4) implies g =1 for orbital motion. For electron spin, however, g =2 (more exactly, 2.0023). The factor 2 compensates for ms = 12 such that spin and = 1 orbital magnetic moments are both equal to one Bohr magneton. Many nuclei possess spin angular momentum, analogous to that of the electron. The nuclear spin, designated I , has an integral or half-integral value: 0, 12 , 1, 32 , and so on. Table 18.1 lists some nuclei of importance in chemical applications of NMR. The proton and the neutron both are spin 12 particles, like the electron. Complex nuclei have angular momenta which are resultants of the spins of their component nucleons. The deuteron 2 H, with I = 1, evidently has parallel proton and neutron spins. The 4 He nucleus has I = 0, as do 12 C, 16 O, 20 Ne, 28 Si and 32 S. These nuclei contain filled shells of protons and neutrons with the vector sum of the component angular momenta equal to zero, analogous to closed shells of electrons in atoms and molecules. In fact, all even-even nuclei have spins of zero. Nuclear magnetic moments are of the order of a nuclear magneton e (18.7) = 5.051 × 10−27 J T−1 , 2M where M is the mass of the proton. The nuclear magneton is smaller than the Bohr magneton by a factor m/M ≈ 1836. In analogy with Eqs. (18.2) and (18.6), nuclear moments are represented by μN =

μz = gI μN mI = γI mI ,

(18.8)

where gI is the nuclear g-factor and γI , the magnetogyric ratio. Most nuclei have positive g-factors, as would be expected for a rotating positive electric

Nuclear magnetic resonance Chapter | 18 317

TABLE 18.1 Some common nuclei in NMR spectroscopy. nuclide

I

gI

μ/μN

γI /107 T −1 s −1

1n 0

1 2 1 2

−3.8260

−1.9130

−18.324

5.5857

2.7928

26.752

99.98

1

0.8574

0.8574

4.1067

0.0156

1.7923

2.6886

8.5841

80.4

13 C 6

3 2 1 2

1.4046

0.7023

6.7272

1.1

14 N 7

1

0.4038

0.4038

1.9338

99.634

15 N 7

1 2 5 2 1 2 1 2

−0.5664

−0.2832

−2.7126

0.366

−0.7572

−1.894

−3.627

0.037

5.2567

2.628

25.177

100

2.2634

1.2317

10.840

100

1H 1 2H 1 11 B 5

17 O 8 19 F 9 31 P 15

abundance %

charge. It was long puzzling that the neutron, although lacking electric charge, has a magnetic moment. It is now understood that the neutron is a composite of three charged quarks, udd. The negatively-charged d-quarks are predominantly in the outermost regions of the neutron, thereby producing a negative magnetic moment, like that of the electron. The g-factor for 15 N, 17 O, and other nuclei dominated by unpaired neutron spins, is consequently also negative.

18.2

Nuclear magnetic resonance

The energy of a nuclear moment in a magnetic field, according to Eq. (18.3), is given by EmI = − γI mI B.

(18.9)

For a nucleus of spin I , the energy of a nucleus in a magnetic field is split into 2I + 1 Zeeman levels. A proton, and other nuclei with spin 12 , have just two possible levels: 1 E±1/2 = ∓ γ B, 2

(18.10)

with the α-spin state (mI = −1/2) lower in energy than the β-spin state (mI = +1/2) by E = γ B.

(18.11)

Fig. 18.1 shows the energy of a proton as a function of magnetic field. In zero field (B = 0), the two spin states are degenerate. In a field B, the energy splitting

318 Introduction to Quantum Mechanics

corresponds to a photon of energy E = ω = hν where ωL = γ B

or

νL =

γB , 2π

(18.12)

known as the Larmor frequency of the nucleus. For the proton in a field of 1 T, νL = 42.576 MHz, as the proton spin orientation flips from + 12 to − 12 . This transition is in the radiofrequency region of the electromagnetic spectrum. NMR spectroscopy consequently exploits the technology of radiowave engineering.

FIGURE 18.1 Energies of spin- 12 nucleus in magnetic field showing NMR transition at Larmor frequency νL .

A transition cannot occur unless the values of the radiofrequency and the magnetic field accurately satisfy Eq. (18.12). This is why the technique is categorized as a resonance phenomenon. No radiation can be absorbed or emitted by the nuclear spins unless some resonance condition is satisfied. In the earlier techniques of NMR spectroscopy, it was found more convenient keep the radiofrequency fixed and sweep over values of the magnetic field B to detect resonances. These have been largely supplanted by Fourier-transform techniques, to be described later. The transition probability for the upward transition (absorption) is equal to that for the downward transition (stimulated emission). The contribution of spontaneous emission is negligible at radiofrequencies. Thus if there were equal populations of nuclei in the α and β spin states, there would be zero net absorption by a macroscopic sample. The possibility of observable NMR absorption depends on the lower state having at least a slight excess in population. At thermal equilibrium, the ratio of populations follows a Boltzmann distribution Nβ e−Eβ /kT = −E /kT = e−γ B/kT . Nα e α

(18.13)

Thus the relative population difference is approximated by N α − Nβ γ B N ≈ = . N N α + Nβ 2kT

(18.14)

Nuclear magnetic resonance Chapter | 18 319

Since nuclear Zeeman energies are so small, the populations of the α and β spin states differ very slightly. For protons in a 1 T field, N/N ≈ 3 × 10−6 . Although the population excess in the lower level is only of the order of parts per million, NMR spectroscopy is capable of detecting these weak signals. Higher magnetic fields and lower temperatures enhanced NMR sensitivity.

18.3

The chemical shift

NMR has become such an invaluable technique for studying the structure of atoms and molecules because nuclei represent ideal noninvasive probes of their electronic environment. If all nuclei of a given species responded at their characteristic Larmor frequencies, NMR might then be useful for chemical analysis, but little else. The real value of NMR to chemistry comes from minute differences in resonance frequencies dependent on details of the electronic structure around a nucleus. The magnetic field induces orbital angular momentum in the electron cloud around a nucleus, thus, in effect, partially shielding the nucleus from the external field B. The actual or local value of the magnetic field at the position of a nucleus is expressed Bloc = (1 − σ )B,

(18.15)

where the fractional reduction of the field is denoted by σ , the shielding constant, typically of the order of parts per million. The actual resonance frequency of the nucleus in its local environment is then equal to ν = (1 − σ )

γB . 2π

(18.16)

A classic example of this effect is the proton NMR spectrum of ethanol CH3 CH2 OH, shown in Fig. 18.2. The three peaks, with intensity ratios 3:2:1 can be identified with the three chemically-distinct environments in which the protons find themselves: three methyl protons (CH3 ), two methylene protons (CH2 ) and one hydroxyl proton (OH). The variation in resonance frequency due to the electronic environment of a nucleus is called the chemical shift. Chemical shifts on the delta scale are defined by ν − νo × 106 , (18.17) νo where ν o represents the resonance frequency of a reference compound, usually tetramethylsilane Si(CH3 )4 , which is rich in highly-shielded chemicallyequivalent protons, as well as being unreactive and soluble in many liquids. By definition δ = 0 for TMS and almost everything else is “downfield” with positive values of δ. Most compounds have delta values in the range of 0 to 12 (hydrogen halides have negative values, up to δ ≈ −13 for HI). The hydrogen atom has δ ≈ 13 while the bare proton would have δ ≈ 31. Conventionally, the δ=

320 Introduction to Quantum Mechanics

FIGURE 18.2 Oscilloscope trace showing the first NMR spectrum of ethanol, taken at Stanford University in 1951. Courtesy Varian Associates, Inc.

δ-scale is plotted as increasing from right to left, in the opposite sense to the magnitude of the magnetic field. Nuclei with larger values of δ are said to be more deshielded, with the bare proton being the ultimate limit. Fig. 18.3 shows some representative values for proton and 13 C chemical shifts in organic compounds.

FIGURE 18.3 Ranges of 1 H and 13 C chemical shifts for common organic functional groups. From P. Atkins, Physical Chemistry, (Freeman, New York, 2002).

Nuclear magnetic resonance Chapter | 18 321

The δ scale for 13 C is based on the 13 C resonance in isotopically-enriched TMS. The natural abundance of 13 C is about 1.1%. Since carbon nuclei are surrounded by a larger number of electrons, 13 C chemical shifts are about an order of magnitude larger than those for protons. Fig. 18.2 shows a high-resolution NMR spectrum of ethanol, including a δ-scale. The “fine structure” splittings of the three chemically-shifted components will be explained in the next Section. The chemical shift of a nucleus is very difficult to treat theoretically. However, certain empirical regularities, for example those represented in Fig. 18.2 provide clues about the chemical environment of the nucleus. We will not consider these in any detail except to remark that often increased deshielding of a nucleus (larger δ) can often be attributed to a more electronegative neighboring atom. For example the proton in the ethanol spectrum (Fig. 18.4) with δ ≈ 5 can be identified as the hydroxyl proton, since the oxygen atom can draw significant electron density from around the proton.

FIGURE 18.4 High-resolution NMR spectrum of ethanol showing δ scale of chemical shifts. The line at δ = 0 corresponds to the TMS trace added as a reference.

Neighboring groups can also contribute to the chemical shift of a given atom, particularly those with mobile π-electrons. For example, the ring current in a benzene ring acts as a secondary source of magnetic field. Depending on the location of a nucleus, this can contribute either shielding or deshielding of the external magnetic field, as shown in Fig. 18.5. Where the arrows are parallel to the external field B, including protons directly attached to the ring, the effect is deshielding. However, any nuclei located within the return loops will experience a shielding effect. The interaction of neighboring groups can be exploited to obtain structural information by using lanthanide shift reagents. Lanthanides (elements 58 through 71) contain 4f -electrons, which are not usually involved in chemical bonding and can give large paramagnetic contributions. Lanthanide complexes which bind to organic molecules can thereby spread out proton resonances to simplify their analysis. A popular chelating complex is Eu(dpm)3 , tris(dipivaloylmethanato)europium, where dpm is the group (CH3 )3 C–CO=CH–CO–C(CH3 )3 .

322 Introduction to Quantum Mechanics

FIGURE 18.5 Magnetic fields, shown as red loops, produced by ring current in benzene.

18.4 Spin-spin coupling Two of the resonances in the ethanol spectrum shown in Fig. 18.4 are split into closely-spaced multiplets—one triplet and one quartet. These are the result of spin-spin coupling between magnetic nuclei which are relatively close to one another, usually separated by no more than three covalent bonds. Identical nuclei in identical chemical environments are said to be equivalent. They have equal chemical shifts and do not exhibit spin-spin splitting. Nonequivalent magnetic nuclei, on the other hand, can interact and thereby affect one another’s NMR frequencies. A simple example is the HD molecule, in which the spin- 12 proton can interact with the spin-1 deuteron, even though the atoms are chemically equivalent. The proton’s energy is split into two levels by the external magnetic field, as shown in Fig. 18.1. The neighboring deuteron, itself a magnet, will also contribute to the local field at the proton. The deuteron’s three possible orientations in the external field, with MI = −1, 0, +1, with different contributions to the magnetic field at the proton, as shown in Fig. 18.6. The proton’s resonance is split into three evenly spaced, equally intense lines (a triplet), with a separation of 42.9 Hz. Correspondingly the deuteron’s resonance is split into a 42.9 Hz doublet by its interaction with the proton. These splittings are independent of the external field B, whereas chemical shifts are proportional to B. Fig. 18.6 represents the energy levels and NMR transitions for the proton in HD. Nuclear-spin phenomena in the HD molecule can be compactly represented by a spin Hamiltonian Hˆ = −γH MH (1 − σH )B − γD MD (1 − σD )B + hJH D IH · ID .

(18.18)

The shielding constants σH and σD are, in this case, equal since the two nuclei are chemically identical. For sufficiently large magnetic fields B, the last term is effectively equal to hJH D MH MD . The spin-coupling constant J can be directly equated to the splitting expressed in Hz. We consider next the case of two equivalent protons, for example, the CH2 group of ethanol. Each proton can have two possible spin states with MI = ± 12 , giving a total of four composite spin states. Just as in the case of electron

Nuclear magnetic resonance Chapter | 18 323

FIGURE 18.6 Nuclear energy levels for proton in HD molecule. The two Zeeman levels of the proton when B > 0 are further split by interaction with the three possible spin orientations of the deuteron Md = −1, 0, +1. The proton NMR transition, represented by blue arrows, is split into a triplet with separation 42.9 Hz.

spins, these combine to give singlet and triplet nuclear-spin states with M = 0 and 1, respectively. Also, just as for electron spins, transitions between singlet and triplet states are forbidden. The triplet state allows NMR transitions with M = ±1 to give a single resonance frequency, while the singlet state is inactive. As a consequence, spin-spin splittings do not occur among identical nuclei. For example, the H2 molecule shows just a single NMR frequency. And the CH2 protons in ethanol do not show spin-spin interactions with one another. They can however cause a splitting of the neighboring CH3 protons. Fig. 18.7 (left side) shows the four possible spin states of two equivalent protons, such as those in the methylene group CH2 , and the triplet with intensity ratios 1:2:1 which these produce in nearby protons. Also shown (right side) are the eight possible spin states for three equivalent protons, say those in a methyl group CH3 , and the quartet with intensity ratios 1:3:3:1 which these produce. In general, n equivalent protons will give a splitting pattern of n + 1 lines in the ratio of binomial coefficients 1 : n : n(n − 1)/2 . . . (These are also rows in Pascal’s triangle.) The tertiary hydrogen in isobutane (CH3 )3 CH∗ , marked with an asterisk, should be split into 10 lines by the 9 equivalent methyl protons. The NMR spectrum of ethanol CH3 CH2 OH (Fig. 18.4) can now be interpreted. The CH3 protons are split into a 1:2:1 triplet by spin-spin interaction with the neighboring CH2 . Conversely, the CH2 protons are split into a 1:3:3:1 quartet by interaction with the CH3 . The OH (hydroxyl) proton evidently does not either cause or undergo spin-spin splitting. The explanation for this is hydrogen bonding, which involves rapid exchange of hydroxyl protons among neighboring molecules. If this rate of exchange is greater than or comparable to the NMR radiofrequency, then the splittings will be “washed out.” Only one line with a motion-averaged value of the chemical shift will be observed. NMR has consequently become a useful tool to study intramolecular motions.

324 Introduction to Quantum Mechanics

FIGURE 18.7 Splitting patterns from methylene and methyl protons.

18.5 Mechanism for spin-spin interactions Two magnetic nuclei separated by a distance r have a mutual potential energy given by μ1 · μ2 3(μ1 · r) (μ2 · r) − . (18.19) r3 r5 These interactions are responsible for the broad NMR spectra observed in solids. In liquids and gases, these contributions are averaged to zero by rapid tumbling of molecules through all possible orientations. Nuclei in liquids and gases can however still interact through covalent bonds by a second-order mechanism involving electron magnetic moments. The dominant contribution is the Fermi contact interaction, which comes from the instantaneous juxtaposition of electron and nuclear spins. The interaction energy is given by V=

E = −

8π |ψ(0)|2 μS · μI , 3

(18.20)

where ψ(0) represents the electron wavefunction at the position of the nucleus. The energy is lowest when the electron and nuclear spins are antiparallel. For an isolated hydrogen atom, the transition between the parallel and antiparallel spin configurations corresponds to 1420 MHz or 21 cm. This is the source of the faint microwave radiation from hydrogen atoms in interstellar space observed in radio astronomy. Molecular orbitals must have some s character to order to contribute to the Fermi contact interaction since only s-atomic orbitals have nonzero density at the nucleus. When a nuclear spin interacts with a spin-singlet electron pair in a doubly-occupied molecular orbital, it induces a very small spin polarization in the electron distribution. In the immediate vicinity of the magnetic nucleus, the density of electron spin antiparallel to the nuclear spin is slightly greater, as represented in Fig. 18.8. This is compensated by a larger density of parallel spin elsewhere, such that the net electron-spin angular momentum remains zero.

Nuclear magnetic resonance Chapter | 18 325

FIGURE 18.8 Spin polarization induced by Fermi contact interaction. Regions shown in green [red] have slight excess of α [β] electron spin density.

FIGURE 18.9 Mechanisms for electron-coupled nuclear spin-spin interactions for nuclei separated by one, two and three bonds. Only the lowest-energy nuclear-spin states are shown.

Spin polarization produced by Fermi contact interactions makes possible electron-coupled nuclear spin-spin interactions. For spin- 12 nuclei in atoms directly bonded to one another, the antiparallel nuclear spin state evidently has the lower energy, as shown in Fig. 18.9. The state with parallel nuclear spins (not shown) has the higher energy of the spin-spin doublet. The coupling constant for adjacent nuclei is designated 1 J . For nuclei separated by two or three bonds, the constants are designated 2 J for geminal coupling and 3 J for vicinal coupling. When the coupling goes through more than one bonding orbital, the adjacent electron spin densities around an intermediate atom tend to be parallel, thus maximizing contributions to exchange energy. This is the same mechanism which leads to Hund’s rule for maximum electron-spin multiplicity. It is clear from Fig. 18.9 that the parallel nuclear spin configuration is more stable in 2 J coupling but the antiparallel configuration is again favored in 3 J coupling. Remember that alternative nuclear-spin states are also allowed, but with higher energy. Since the spin-spin couplings correspond to terms hJ IX · IY in the spin Hamiltonian (18.18), the 1 J and 3 J coupling constants tend to be negative while the 2 J coupling constant tends to be positive. These generalizations apply for most couplings involving protons and 13 C nuclei in organic compounds. Coupling between nuclei separated by more than three bonds is generally weak but can sometimes be detected. For nuclei heavier than hydrogen, the situation can become much more complex, with additional dipolar interactions between nuclear spins and electron spin and orbital angular momenta.

326 Introduction to Quantum Mechanics

Vicinal coupling constants 3 JH H for HCCH and HCNH group have been approximated by the Karplus equation, one form of which is 3

JH H (φ) ≈ A cos2 φ + B cos φ + C,

(18.21)

where φ is the dihedral angle between the two protons. Fig. 18.10 shows a series of HCα NH 3 JH H couplings determined in the study of a protein structure.

FIGURE 18.10 3 JH H couplings in the enzyme staphylococcal nuclease (SNase) showing leastsquares fit to a Karplus equation.

18.6 Magnetization and relaxation processes A magnetic field B0 acting on a nuclear moment μ produces a torque τ , where τ = μ × B0 .

(18.22)

From Newton’s second law, it follows that the torque equals the rate of change of angular momentum I, so that τ=

dI = ω × I, dt

(18.23)

representing a precessional motion sweeping out a cone around the axis of the field with a frequency ω. Since μ = γI I, the precession is seen to occur at the Larmor frequency (18.12) ω = ωL = γI B0 .

(18.24)

The individual nuclear magnetic moments in a macroscopic sample add vectorially to give a net magnetization M. This will be proportional to the population difference Nα − Nβ , according to Eq. (18.14), and thus be larger at lower

Nuclear magnetic resonance Chapter | 18 327

temperatures. A magnetic field B0 in the Z-direction will produce a net longitudinal magnetization MZ . The transverse components MX and MY will average to zero when the individual nuclei precess with random phases. It is possible to change the net magnetization by exposing the nuclear spin system to radiation at the resonance frequency. If enough energy is put in, the spin system can become saturated, with the α and β populations becoming equal, so that MZ = 0. The magnetization subsequently returns to its equilibrium value M0 with a time constant T1 called the spin-lattice relaxation time, according to MZ = M0 (1 − e−t/T1 ).

(18.25)

Spin-lattice relaxation is induced by time-varying magnetic fields in the environment of the spin system, principally from molecular rotations. If the net magnetization were suddenly rotated into the XY plane (we will learn how shortly), M would precess about the Z-axis at the Larmor frequency ωL , with all the spins initially rotating in phase with one another. In time, however, the spins will begin to dephase, as well as recovering their longitudinal magnetization. The time constant T2 which describes the return to equilibrium of the transverse magnetization, MXY , is called the spin-spin relaxation time, so that MXY = M0 e−t/T2 .

(18.26)

The net magnetization in the XY plane then goes to back to zero while the longitudinal magnetization along Z returns to M0 . Both processes occur simultaneously with T1 always greater than or equal to T2 . Thus far, we have considered the motion of spins with respect to the laboratory frame of reference. It is convenient to define a rotating frame of reference which follows the precession about the Z-axis at the Larmor frequency. The rotating coordinates are designated as X and Y . A magnetization vector rotating at the Larmor frequency in the laboratory frame appears to be stationary in the rotating frame, while the relaxation of MZ magnetization to its equilibrium value looks the same in both frames. A transverse magnetization vector rotating faster or slower than the Larmor frequency appears to be rotating clockwise or counterclockwise, respectively, in the X Y -frame. Fig. 18.11 shows how the transverse magnetization simultaneously dephases and relaxes toward its equilibrium longitudinal magnetization. When viewed in the rotating frame, the transverse component fans out over a sector of a circle as the phases of the individual spins lose coherence. Concurrently, the magnetization vector becomes concentrated in narrower and narrower cones of precession. The time constants T2 and T1 govern the respective processes.

18.7

Pulse techniques and Fourier transforms

Radiation which induces NMR transitions is usually produced by radiofrequency (RF) coils wound around the sample. These are energized by alternating

328 Introduction to Quantum Mechanics

FIGURE 18.11 Relaxation of transverse magnetization as viewed in the rotating frame.

currents approximately matching the Larmor frequency. The coils are arranged so that the magnetic field of the radiation B1 is perpendicular to the static field B0 . In contrast to other types of spectroscopy, the electric field of the radiation plays no role here. The simplest case is a circularly polarized RF field, such that, in the laboratory frame B1X = B1 cos ωt,

B1Y = B1 sin ωt.

(18.27)

Transformed to a frame rotating at frequency ω, the radiation field simplifies to B1X = B1 = const,

B1Y = 0,

(18.28)

equivalent a constant magnetic field in the rotating frame. A magnetization initially in the Z-direction can then precesses around the Y axis with an angular frequency ω1 = γI B1 ,

(18.29)

known as the Rabi frequency. If the radiation consists of a finite pulse of duration t, Rabi precession turns the magnetization through an angle θ = ω1 t. Rotation of M into the XY -plane, imagined in the preceding Section, can be thus accomplished with a 90◦ pulse, for which θ = π/2. A 180◦ pulse, with θ = π, causes a reversal of the magnetization vector from M to −M. The earliest NMR spectra were obtained by varying the radiofrequency in a constant magnetic field or by varying the field with a fixed frequency and recording the individual resonances. The slow and laborious continuous-wave (CW) approach has been largely supplanted by Fourier-transform techniques (FT-NMR) in which all the nuclei are excited at the same time by a radiofrequency pulse. This can produce the entire spectrum in a single step, often in less than a second. Andrew Derome suggested [in Modern NMR Techniques for Chemistry Research (Pergamon, New York, 1987)] the following analogy for FT-NMR, comparing it to tuning a bell. In principle, you could measure each resonance frequency by exciting the bell with a series of tuning forks. But this would be extremely tedious. A better way is to hit the bell with a hammer, exciting all of its resonances at once, and Fourier-analyzing the BOINNNGGG to identify the spectrum of frequencies. How does pulsed Fourier-transform NMR spectroscopy work? An RF coil generally produces radiation at a single frequency ω0 . However, if the radiation

Nuclear magnetic resonance Chapter | 18 329

is emitted as a very short pulse, of the order of microseconds, this is equivalent to a superposition of many frequencies in a broad band around ω0 and it can excite the resonances of all the spins in a sample at the same time. The frequency spread and pulse length are related by ωt ≈ 1, which has the same form as the energy-time uncertainty principle. After a high-intensity 90◦ pulse, as described above, the nuclear magnetizations evolve much like that shown in Fig. 18.11. Back in the laboratory frame, we would see an oscillating transverse magnetization. This causes reemission of radiofrequency radiation, which can be picked up by detecting coils positioned around the sample. The resulting signal, during the time the magnetization is returning to its equilibrium state, is called free-induction decay (FID). It is a superposition of all the resonance frequencies of the sample. Fourier transformation, carried out by a computer built into the NMR spectrometer, translates the signal from the time domain t to the frequency domain ω. The resulting FT-NMR spectrum is similar in appearance to earlier CW spectra but with resolution and sensitivity improved by several orders of magnitude. Fig. 18.12 shows free-induction decays and Fourier transforms for model systems with just one and two resonance frequencies. Fig. 18.13 shows a more realistic example, the FID signal for ethanol. A frequency ω0 gives an exponentially-decaying sinusoidal function f (t) = f (0) cos ω0 t e−t/T2 ,

(18.30)

where T2 is the transverse relaxation time. The complete FID signal is a superposition of all contributing frequency components. The spectral distribution is obtained from the real Fourier transform of the FID signal ∞ S(ω) = f (t) cos ωt dt. (18.31) 0

18.8

Two-dimensional NMR

For a complex molecule such as a protein, a high-resolution NMR spectrum might contain hundreds of lines and its analysis would be essentially hopelessly. In the 1970s Richard Ernst and coworkers developed sophisticated techniques involving sequences of RF pulses, followed by Fourier analysis in more than one dimension (Ernst received the Nobel Prize in 1991). This has made it possible to determine the structure of proteins and other biomolecules with a facility rivaling X-ray crystallography. We will consider only the simplest such techniques, a method for two-dimensional NMR known as correlation spectroscopy (COSY). Conventional (one-dimensional) NMR spectra are plots of intensity vs. frequency. In two-dimensional spectroscopy, intensity is plotted as a function of two frequencies, in what resembles a topographical map. In onedimensional FT-NMR, the signal is recorded as a function of time and then Fourier transformed to the frequency domain. In two-dimensional NMR, the

330 Introduction to Quantum Mechanics

FIGURE 18.12 Free-induction decay signals and their Fourier transforms for simple systems with one and two resonance frequencies.

FIGURE 18.13 Free-induction decay (FID) signal for the proton resonances in ethanol. Fourier transformation gives the NMR spectrum shown in Fig. 18.4.

signal is recorded as a function of two time variables, t1 and t2 , and the result Fourier transformed twice to yield a function of two frequency variables. The general scheme for two-dimensional spectroscopy is a Preparation-EvolutionMixing-Detection (PEMD) sequence of operations, as shown in Fig. 18.14. In the preparation phase, the sample allowed to come to equilibrium and is then excited by a 90◦ pulse. The resulting magnetization is allowed to evolve for time t1 . During the mixing phase, the system is subjected to another 90◦ pulse. The signal is then recorded as a function of the second time variable t2 . The cycle is repeated for incrementally increased time intervals t1 , beginning with t1 = 0, and the resulting function of t1 and t2 is Fourier transformed. The two-dimensional COSY spectrum for ethanol is shown in Fig. 18.15, with the two frequency coordinates expressed in terms of chemical shifts δ1 and δ2 . In Sects. 18.3–18.4, we had considered in detail the NMR spectrum for this molecule. The one-dimensional spectrum, drawn at the top, consists of three chemical-shifted components for the CH3 , OH and CH2 protons, with two of

Nuclear magnetic resonance Chapter | 18 331

FIGURE 18.14 Sequence of operations in correlation spectroscopy (COSY): Preparation, Evolution, Mixing, Detection. The signal is analyzed by a two-dimensional Fourier transform.

FIGURE 18.15 COSY spectrum for ethanol. Frequencies are expressed in terms of chemical shifts δ. The corresponding 1D-NMR spectrum is shown at the top.

them further split into multiplets. The COSY spectrum is symmetrical about the diagonal, with the diagonal peaks iterating the one-dimensional spectrum. In the two-dimensional spectrum, multiplets appear as square or rectangular arrays

332 Introduction to Quantum Mechanics

of peaks. These two-dimensional multiplets are of two distinct types: diagonalpeak multiplets which belong to the same value of δ and cross-peak multiplets which are centered around unequal δ1 and δ2 coordinates. The appearance in a COSY spectrum of a cross-peak multiplet indicates that the protons at shifts δ1 and δ2 must be J -coupled. The absence of a cross-peak connecting the OH and CH3 protons indicates that there is no detectable coupling between them. These simple generalizations are key to interpretation of two-dimensional NMR spectra. Thus, from a single COSY spectrum it is possible to trace out the whole network of couplings in a molecule, no matter how complex. What causes the cross-peaks in two-dimensional spectra? If a peak occurs at some δ1 , δ2 , this means that an excitation of frequency δ1 was present during the evolution time t1 . And during the mixing time some magnetization was transferred to another excitation of frequency δ2 , which was detected during time t2 . Only nuclei which are connected by spin-spin coupling can experience such transfers of magnetization. Many other pulse-sequence techniques besides COSY can be used to produce multidimensional NMR spectra. It will suffice here to simply list the acronyms of some of the better known methods: EXSY (exchange spectroscopy), NOESY (nuclear Overhauser effect spectroscopy), TOCSY (total correlation spectroscopy), ROESY (rotational nuclear Overhauser effect spectroscopy). The nuclear Overhauser effect (NOE) refers to a change in intensity of one NMR peak when another peak is irradiated.

18.9 Magnetic resonance imaging Magnetic resonance imaging (MRI) is a noninvasive technique for viewing the inside of the human body. The level of detail it provides is extraordinary compared with the alternative imaging methods for medical diagnosis and anatomical studies. In contrast to X-rays, MRI can image the body’s soft tissue. Images of the brain (Fig. 18.17) can enable early detection of cerebrovascular or neurodegenerative disease.

FIGURE 18.16 Principle of MRI. A flask of water in a magnetic field with a linear gradient gives the NMR spectrum shown at right.

Nuclear magnetic resonance Chapter | 18 333

The basic idea of MRI is that a system of protons in an inhomogeneous magnetic field will exhibit a variation in resonance frequency proportional to the strength of the field. Thus the geometry of an object can be translated into a frequency map. The principle is illustrated in Fig. 18.16, applied to a flask of water. With field gradients applied successively along the x, y and z axes, computer analysis can reconstruct a three-dimensional image. Image contrast in a biological system can be enhanced by using pulse sequences which exploit differences in T1 and T2 for protons in various types of tissue. With a time dimension added, functional MRI can detect such things as changes in different parts of the brain during perceptive or cognitive activity.

FIGURE 18.17 MRI of axial cross-section of human brain. Extensive cortical folding is correlated with high intelligence.

Chapter 18. Exercises 18.1. Analyze the proton NMR spectrum of diethylketone, shown below.

18.2. The NMR spectrum of methane CH4 shows just a single peak. Explain why. Now explain the proton NMR spectrum of the isotopically-substituted

334 Introduction to Quantum Mechanics

dideuteromethane, shown below.

18.3. The proton magnetic resonance spectrum of toluene (methylbenzene) shows two peaks with relative intensities 5:3. Explain this spectrum.

18.4. Analyze the proton magnetic resonance spectrum of 1,1-dibromomethane. The bromine nuclei do not cause any detectable splittings.

18.5. Assume that the magnetic moments μ1 and μ2 in Eq. (18.19) are both directed along the z-axis. Show that the angular dependence of the dipole-

Nuclear magnetic resonance Chapter | 18 335

interaction potential energy then contains the factor (3 cos2 θ − 1). Show that this quantity vanishes when averaged over all orientations. 18.6. After a 90◦ pulse, the spins in a sample will begin to dephase by spin-spin relaxation. Show that this process can be reversed for a time by application of a 180◦ pulse. This is exploited in a number of spin-echo techniques.

Chapter 19

Wonders of the quantum world After going through Chapters 1–18, it is hoped that the reader is convinced of the essential validity of the quantum theory and of its success in providing a conceptual framework for the fundamental phenomena of physics, chemistry and biology. It is by any measure a highly successful physical theory, capable of making correct predictions for innumerable physical phenomena. This includes statistical predictions involving large numbers of observations. But honestly, you still don’t really “understand” quantum mechanics (at least I don’t). But we’re in good company. Richard Feynman wrote “I think it is safe to say that no one understands quantum mechanics.” Niels Bohr said that “Anyone who can contemplate quantum mechanics without getting dizzy hasn’t properly understood it.” According to Roger Penrose, “while the theory agrees incredibly well with experiment and while it is of profound mathematical beauty, it makes absolutely no sense.” Feynman very succinctly summarized the situation this way: “We cannot make the mystery go away . . . we will just tell you how it works.” The instinctive disbelief often experienced when first confronted with the novelties of quantum mechanics is aptly described by the biologist Peter Medawar: “The human mind treats a new idea the way the body treats a strange protein—it rejects it.” No less an eminence than Albert Einstein could never buy into the worldview of quantum mechanics, notwithstanding his own role as one of its creators. Maxwell believed that “. . . the effectual studies of the sciences must be ones of simplification and reduction of the results of previous investigations to a form in which the mind can grasp them.” Thus “Clockwork Universe” is a metaphor which succinctly captures the essence of classical physics. But quantum mechanics lacks a simple metaphor accessible to everyday experience and common sense. In thinking about quantum mechanics, it is well to keep in mind the schema relating appearance and reality central to the metaphysics of Immanuel Kant. Appearance, what he called phenomena, represents our observations and experiences, both external and internal. The reality beyond phenomena, which he called noumena, represents ultimate causation, which is forever hidden from our perception. Theories are models we create in attempts to make connections between appearance and reality.

19.1

The Copenhagen interpretation

Fig. 19.1 is a portrait of the participants in the Fifth Solvay International Conference in 1927, including all of the founding fathers of quantum mechanics. Introduction to Quantum Mechanics. https://doi.org/10.1016/B978-0-12-822310-9.00027-6 Copyright © 2021 Elsevier Inc. All rights reserved.

337

338 Introduction to Quantum Mechanics

Immanuel Kant They came together to contemplate the foundations of the newly formulated theory. Here the long-running dialog between Niels Bohr and Albert Einstein first began.

FIGURE 19.1 The 1927 Solvay Congress on the quantum theory. Colorized version from the American Physical Society historical collection.

The Copenhagen interpretation of quantum mechanics was the working philosophy developed largely by Neils Bohr, Werner Heisenberg and Max Born in the late 1920s. According to the Copenhagen interpretation, a quantum system exists in some sort of nebulous existential fog until a measurement is carried out. It is pragmatically asserted that only after a measurement can a value be assigned to a dynamical variable. It is considered meaningless to assume some value before it is measured. This is of course much at odds with notions of objective reality, which contends that there exist attributes which exist independently of whether or when they are observed. This outlook is central to the classical picture of Nature (“The Moon is there whether or not we look at it”). For Ein-

Wonders of the quantum world Chapter | 19 339

stein, among many other thinkers, “. . . belief in an external world independent of the perceiving subject is the basis of all natural science.” Another contentious issue associated with the Copenhagen interpretation is the so called measurement problem. Everyone agrees that a quantum system will evolve deterministically in accordance with the time-dependent Schrödinger equation until an observation is made. According to the orthodox Copenhagen interpretation, the observation forces a discontinuous change in which the wavefunction settles into one of the eigenstates of the measured dynamical variable. This somewhat mystical transition is known as collapse of the wavefunction. It is not described by any conventional Schrödinger equation, although attempts have been made to treat the system plus the measuring apparatus as parts of an entangled quantum system. Phenomena in the submicroscopic quantum world inevitably create apparent paradoxes from the viewpoint of classical macroscopic experience. We will focus in this Chapter on two of the most counterintuitive aspects of quantum theory: superposition (Schrödinger’s Cat) and entanglement (EPR and Bell’s theorem).

19.2

Superposition

The inescapable implications of the double-slit and similar experiments is the capability of a quantum system to exist in a state which simultaneously partakes of alternative realities. Such a state can moreover exhibit the effects of interference between its component realities. Chemists generally agree that a benzene molecule can be represented as a resonance hybrid of at least its two Kekulé structures and that a hydrogen bond is an intermediate between two ordinary covalent bonds. The basic idea of superposition is that, if a quantum system is capable of existing in the individual states 1 and 2 , then it can also exist in a linear combination of these two states, which can be written | = c1 |1 + c2 |2 .

(19.1)

This can be generalized for any number of contributing states. According to the Copenhagen interpretation, the wavefunction for state can under certain circumstances collapse to one of the component states 1 or 2 , with a probability |c1 |2 or |c2 |2 , respectively. When this happens, all other components of the superposition essentially disappear. It is relatively easy to produce simple superposition phenomena with photons. This will also serve to introduce some devices which we will encounter later in some key experiments. Fig. 19.2A shows the operation of a beamsplitter. This is a glass plate, partially-silvered on one surface, so as to reflect half of an incident light beam and transmit the other half. A wave reflected directly off the silvered side has its phase shifted by 180◦ with respect to the transmitted wave. However, a wave which passes through the glass layer before being reflected, has its phase unchanged. After passing through the beamsplitter, an

340 Introduction to Quantum Mechanics

FIGURE 19.2 Mach-Zehnder interferometer demonstrating photon superposition. M denotes a mirror. B and B are beamsplitters, with reflecting surface on the gray side. D1 and D2 are detectors.

incident photon has a 50–50 chance of entering detectors D1 and D2 . The setup in Fig. 19.2B, with two oppositely-oriented beamsplitters and two mirrors, is known as a Mach-Zehnder interferometer. If the path lengths are precisely the same, all photons will go to detector D1 and none will go to detector D2. To explain this result, a photon must be treated as a wave which travels along both paths at each beamsplitter. The photon’s wavefunction is thus represented by a superposition similar to (19.1). Taking into account the phase shifts in the successive reflections, the two waves arriving at detector D1 are in phase and will reinforce, while the two waves arriving at detector D2 are 180◦ out of phase and will cancel. Now suppose that we block the light beam after the mirror in the upper path, as shown in Fig. 19.2C. This will ensure that all of the light arriving at the second beamsplitter had traversed only the lower path. In this case, there is no interference, and the second beamsplitter sends equal components of the incident wave into the two detectors. This result is counterintuitive, in that blocking out some of the light seems to increase its detection. But the explanation based on photon superposition is very clear.

19.3

Schrödinger’s Cat

In 1935 Erwin Schrödinger published an essay questioning whether strict adherence to the Copenhagen interpretation can cause the “weirdness” of the quantum world to creep into everyday reality. He speculated on how the principle of superposition, which is so fundamental for the quantum-mechanical behavior of microscopic systems, might possibly affect the behavior of a large scale object. Schrödinger proposed a rather diabolical Gedankenexperiment (thought experiment) known as Schrödinger’s Cat. A more humane version of the experiment makes use of the apparatus sketched in Fig. 19.3. A cat is confined to a opaque box while a weak radioactive source is monitored by a Geiger counter. Detection of a decaying atom during a specified time period triggers a spray of catnip (hydrogen cyanide in Schrödinger’s original atrocity) into the box occupied by our cat. If the spray is activated, the cat will relax into a blissful quantum state. Otherwise the cat, annoyed to be cooped up in a dark box, will become excited into a perturbed state. Assume a 50% probability for each outcome.

Wonders of the quantum world Chapter | 19 341

Then—if you believe the Copenhagen interpretation—until the box is opened, the cat’s quantum state must be described by a superposition 1 | = √ | 2

, + |/

.

(19.2)

Only after the box is opened and the cat observed will the wavefunction collapse to a recognizable state of bliss or annoyance. In an extension of this experiment, an observer known as Wigner’s friend is invited into a closed room with the apparatus. And until the door is opened, Wigner’s friend himself will also become part of a quantum superposition, which is starting to become ridiculous! We might be able to accept the concept of an atomic system being described as a superposition of quantum states. But a cat?

FIGURE 19.3 Schrödinger Cat experiment. A spray of catnip is released when a decay from a small radioactive sample is detected. Version suggested by Sarah Blinder and Amy Blinder, Am. J. Phys. 69, 633 (2001).

It might perfectly acceptable for an atom to be in a superposition of radioactively decayed and undecayed states. The difficulty arises when you start to wonder what is happening inside the box after the radioactive process has run its course. Can superposition on a microscopic level can be amplified to apply to a macroscopic object? Can our cat be temporarily suspended in a superposed state of contentment and annoyance? Gell-Mann and others have suggested that the coherence of superposed quantum states can be compromised by the irreducible interaction of a system with the surroundings. Our cat is not a small element of the microscopic quantum world but a large complex system made up of trillions of atoms. These can occupy an immense number of possible quantum states which are indistinguishable macroscopically. Moreover, the very strong interactions with the environment soon washes out the cat’s quantum behavior. The modern consensus resolves the Schrödingers Cat paradox by invoking decoherence. Decoherence can be viewed as a continual process of “self-measurement”

342 Introduction to Quantum Mechanics

brought about by interactions within a quantum system and with the surroundings. Thus anything as complicated as a cat will certainly be well described as a classical object. This implies that the quantum superposition collapses the instant a nucleus decays, and everything thereafter follows classical determinism. Decoherence in quantum systems is somewhat akin to transverse relaxation in NMR (T2 processes), in which the nuclear spins lose their phase coherence. The proposal that consciousness is somehow connected with collapse of the wavefunction, although intriguing, does not appear to be relevant. Quantum superposition does remain alive and well on the atomic scale. Christopher Monroe and coworkers in 1996 were able to prepare a single beryllium ion as a superposition of wavepackets representing two different electronic states spatially separated by as much as 80 nm. By an appropriate sequence of laser pulses, they were able to detect interference between the two wavepackets. Inevitably, this experiment has been referred to as “Schrödinger’s cation.” A superconducting “Schrödinger’s Cat” has been demonstrated by David Wineland and coworkers in 2000. Supercurrents, containing billions of electron pairs all residing in a single quantum state, can move around a macroscopically sized superconductor, such as a superconducting quantum interference device (SQUID) circuit. These workers were able to create a superposition of states consisting of supercurrents flowing in opposite directions at the same time.

19.4 Einstein-Podolsky-Rosen experiment In 1935 (the same year as Schrödinger’s Cat) Albert Einstein, in collaboration with Boris Podolsky and Nathan Rosen, proposed a Gedankenexperiment to demonstrate the incompleteness of quantum mechanics. The “EPR experiment” stimulated one of the major scientific controversies of the 20th Century and continues to be a subject of intense contemplation and analysis. EPR focuses on the quantum-mechanical pronouncement (the Heisenberg uncertainty principle) that the position and momentum of a subatomic particle cannot be exactly known simultaneously. The particle can be in a state of definite momentum, but then we cannot know where it is located. Conversely, we can put the particle at a definite position, but then its momentum is completely indeterminate. It is also possible to create states in which we have limited knowledge of both observables, consistent with xp ≥ /2. This state of affairs is not due to any inadequacy of measurement techniques but is rather an inescapable feature of the quantum-mechanical description of Nature. In the original form of the EPR experiment, shown schematically in Fig. 19.4, two particles A and B from a common source fly apart in opposite directions, perhaps as a result of radioactive disintegration. At some subsequent time, the position x of particle A is measured, which can in principle be done exactly. At the same instant, the momentum px of particle B is measured, also exactly. By conservation of momentum, which is also valid in quantum mechanics, the momentum of A ought to be the negative of B’s. Thus the position and momentum of particle A are apparently determined simultaneously!

Wonders of the quantum world Chapter | 19 343

FIGURE 19.4 Einstein-Podolski-Rosen (EPR) Gedankenexperiment.

The Copenhagen response to EPR was that the two measurements cannot be regarded as independent since particles A and B remain correlated as parts of a single indivisible quantum system. Thus measuring the position of A will perturb the momentum of B just as surely as it would perturb the momentum of A. The states of the two particles are said to be entangled. Entanglement persists no matter how great the separation of the two particles. Einstein argued that this was tantamount to faster-than-light communication between A and B (“spooky action at a distance”), which is contrary to locality (or local realism) implied by the theory of relativity. He thought that subatomic particles must possess yet undiscovered hidden variables which gives their quantum states objective reality even after they separate. This conflicts of course with the Copenhagen viewpoint that the value of an observable does not even exist until a measurement is made. Einstein, among others, couldn’t swallow such violations of objective reality and locality implied by quantum mechanics. Einstein agreed that quantum mechanics was a correct theory phenomenologically. But he objected that it gave an incomplete account of physical reality, which is well summarized in the title of the EPR paper: “Can quantum-mechanical description of physical reality be considered complete?” The concept of hidden variables can be illustrated by a simple example. The result of a coin toss—heads or tails—can, on the simplest level, be regarded as a random occurrence. Yet, if the coin’s complicated trajectory were analyzed in detail, the result would become completely determinate. The “hidden variables” are the coordinates and momenta describing the motion of the penny. This is certainly a highly challenging problem in mechanics. But it is, in principle, solvable so that the result of the coin toss is, despite appearances, far from random. Moreover, the apparently statistical nature of coin tosses—approximately 50% heads, 50% tails—is actually a consequence of a distribution among the large number of possible initial configurations when the coin is released. David Bohm in 1951 proposed a modified version of the EPR experiment which is conceptually equivalent but easier to analyze mathematically. This makes use of spin correlations rather than momentum correlations between particles. Bohm pictured a pair of spin-1/2 particles in a singlet state blown apart in opposite directions. The spin components are measured by two Stern-Gerlach detectors, as shown in Fig. 19.5. The spins of the two particles are antiparallel and remain so even after they are separated. Thus if particle A is spin-up, particle

344 Introduction to Quantum Mechanics

FIGURE 19.5 Bohm’s modification of the EPR experiment based on a pair of correlated spin-1/2 particles. Left: interpretation according to local realistic models. Right: quantum-mechanical interpretation for two different orientations of Stern-Gerlach detectors.

B must be spin-down, and vice versa—the two spins are thus correlated. This is true if the two detectors are initially oriented in the z-direction. Remarkably, the perfect correlation persists even when the detectors are both rotated by an arbitrary angle from the z-axis. The result registered by each individual detector is completely random. But in every case the other detector will give the opposite reading. The states of the two particles are random but correlated. Measuring the spin state of one particle will, in a sense, instantaneously force the other particle into the other spin state. Such entanglement takes place no matter how far apart the particles have moved. Rejection of the possibility for such instantaneous “communication” or “telepathy” is the core of the EPR argument that quantum mechanics is incomplete. It is a common misconception that the EPR experiment was intended to disprove quantum mechanics by means of a paradox. In fact, nothing in the EPR argument suggests doubt about the correctness of quantum mechanics. Their intent was to show that the complete description of a microscopic state is richer than that provided by quantum mechanics. They hoped to stimulate discovery of a more fundamental theory able to predict more than just statistical behavior in atomic phenomena. EPR were perceptive enough to have zeroed in on the one feature of quantum mechanics that is most counterintuitive for the classical worldview—entanglement. According to the viewpoint of local realism, the recurring correlations in the Bohm experiment can be attributed to the existence of hidden variables which determine the spin state in every possible direction. It is as if each particle carried a little code book containing all this detailed information, a situation something like the left-hand side in Fig. 19.5. It must be concluded—so far— that both local realism and the quantum mechanical picture of the world are separately capable of giving consistent accounts of the EPR and Bohm experiments. In what follows, we will refer to the two competing worldviews as local realism (LR) and quantum mechanics (QM). By QM we will understand the conventional formulation of the theory, complete as it stands, without hidden variables or other auxiliary constructs.

Wonders of the quantum world Chapter | 19 345

19.5

Bell’s theorem

In 1964, J. S. Bell derived a remarkable relation capable of experimentally deciding between local realism and quantum mechanics. His idea involves measurements for different orientations of the Stern-Gerlach detectors. Thus far we have considered only the case of two detectors oriented in opposite directions. Let P (θ ) represent the probability of recording antiparallel spins when the angle between the detectors is θ. Only for θ = 180◦ (oppositely aligned detectors) would we have P (π) = 1, meaning certainty (unit probability). For θ = 0 (aligned detectors) the probability drops to zero, so that P (0) = 0. For detectors oriented 90◦ apart, there would be equal probabilities for parallel or antiparallel spins, so that P (π/2) = 1/2. To find a general relation for P (θ ), we make use of the quantum-mechanical operators Sx , Sy and Sz for spin-1/2. It is convenient to express these in terms of dimensionless Pauli spin operators σx , σy and σz such that S=

σ. 2

(19.3)

Eqs. (7.60) imply the operator relations σy |α = i|β, σx |α = |β, σx |β = |α, σy |β = −i|α,

σz |α = |α, σz |β = −|β.

(19.4)

The Pauli spin operator for the component at an angle θ with respect to the z-axis is given by σθ = cos θ σz − sin θ σy .

(19.5)

Suppose one detector is rotated by θ , while the other detector remains vertical at θ = 0. The vertical detector will select one of the particles to be in the α-spin state (spin-up) with respect to the z-axis. Therefore the other particle must be in the β state (spin-down). The expectation value for σθ for a β-spin is given by β|σθ |β = cos θ β|σz |β − sin θβ|σy |β = − cos θ.

(19.6)

This expectation value can be related to the spin-up and spin-down probabilities P↑(θ ) and P↓(θ ) by σθ = P↑ −P↓= 2P↑ −1,

(19.7)

noting that P ↑ +P ↓= 1. From (19.6) and (19.7), the probability for detecting spin-up in the θ direction is given by P↑(θ ) =

1 − cos θ = sin2 (θ/2) . 2

(19.8)

Let us, following Bell, provisionally assume the point of view of local realism. Referring to Fig. 19.6, we suppose our particles are initially observed

346 Introduction to Quantum Mechanics

with both detectors parallel to the z-axis. We would then find zero probability for correlated spins since P (0) = 0. Now, if one of the detectors were rotated clockwise by an angle θ1 , the detection probability would increase from zero to P (θ1 ) = sin2 (θ1 /2). Likewise, the other detector rotated counterclockwise by θ2 , would register an average value P (θ2 ) = sin2 (θ2 /2). Suppose now that both detectors are rotated. From the perspective of local realism, we can reason as follows. The probability P (θ1 ) is increased from zero because a number of spins becomes parallel. Rotation of the other detector gives an analogous increase to P (θ2 ). Assuming that these two results are independent of one another, a first guess might be that the coincidence probability after rotating both detectors to a relative angle θ = θ1 + θ2 is given by P (θ1 ) + P (θ2 ). But this would be an overcount, since those instances in which both detectors register spins antiparallel to their paired detectors, they must be parallel to one another, and thus not be appropriately correlated. This must reduce P (θ1 + θ2 ) to a value less than the sum, which can be expressed as an inequality P (θ1 + θ2 ) ≤ P (θ1 ) + P (θ2 ).

(19.9)

FIGURE 19.6 Series of experiments to demonstrate Bell’s inequalities. P (θ ) represents the fraction of observations which register antiparallel spins.

Eq. (19.9) is an instance of Bell’s inequality. A more general form can be expressed P (a, b) ≤ P (b, c) + P (c, a),

(19.10)

representing correlations involving three different axes in space, designated a, b and c. Bell’s inequality can be demonstrated by assuming that definite spin components along all three directions exist for every pair of particles, in accordance with objective reality. This can be described by a set of 23 = 8 triply-composite probability functions P (a↑, b↑, c↑), P (a↑, b↑, c↓), etc. Clearly, each correlation probability in (19.10) is equal to a sum of four triple composites, for example, P (a, b) = P (a↑, b↓, c↑) + P (a↓, b↑, c↓) + P (a↓, b↑, c↑) + P (a↓, b↑, c↓), (19.11)

Wonders of the quantum world Chapter | 19 347

and analogously for the other two. Fig. 19.7 provides a graphical proof of Bell’s inequality, which is actually a very general result in probability theory, not limited to spin variables.

FIGURE 19.7 Graphical demonstration of Bell’s inequality (19.10) using a three-dimensional Venn diagram. Each of 8 regions represents one combination of the variables a, b and c. It is clear that P (a, b) is a subset of P (b, c) + P (c, a).

It is readily apparent that Bell’s inequality in the form (19.7) is in disagreement with quantum mechanics. With the coincidence probabilities (19.6), the sense of the inequality is, in fact, reversed since sin2 [(θ1 + θ2 )/2] ≥ sin2 (θ1 /2) + sin2 (θ2 /2) for 0 ≤ θ1 , θ2 ≤ π.

(19.12)

For example, when θ1 = 20◦ , θ2 = 30◦ , Eq. (19.12) gives 0.1786 > 0.0971. This means that quantum mechanics predicts greater than expected correlations between events that are out of range by classical causality. On a practical note, even if a small fraction of coincidences fails to resister because of imperfections in the apparatus, the validity of inequality (19.12) is not likely to be compromised. The essential inconsistency of quantum mechanics with any local realistic picture is also evident from the form of the two-spin singlet state wavefunction 1 | = √ |α1 |β2 − |β1 |α2 . (19.13) 2 There is no way in which such a function can be written as a simple product of the form | = |ψ1 |φ2 , representing two noninteracting particles. Thus there is no way in which two particles acting independently of one another can simulate the entanglement of the singlet state. Bell’s inequality provides a clear-cut test of local reality vs. quantum mechanics. The unambiguous answer, from a variety of experiments which we describe in the next Section, is that quantum mechanics wins! Thus we can conclude that we live in Universe which does not respect local reality. Quantum entanglement—the term used by Schrödinger—really happens! In drawing this conclusion we are actually glossing over a number of still-unresolved hairsplitting metaphysical arguments. This remarkable result is often summarized as Bell’s theorem: The local realistic model is violated by quantum mechanics. Henry Stapp regards Bell’s theorem as “the most revolutionary scientific discovery of the Twentieth Century.” The inherent nonlocality of quantum mechanics

348 Introduction to Quantum Mechanics

means that two particles once having been together might continue to assert instantaneous influence on one another—even if they are in different galaxies. It is even possible that all the matter in the Universe, having originated in the Big Bang, is in some way mutually entangled.

19.6 Aspect’s experiment Bell’s theorem is by now a well-established experimental fact. The most accurate experiments have been based on analogs of the EPR-Bohm experiment measuring photon polarizations rather than spins of massive particles. Instead of spin-up and spin-down states, photons can have right and left circular polarizations. In certain processes, two photons with correlated polarizations—one left, one right—can be emitted in opposite directions. Wheeler had proposed in 1946 that the pair of photons emitted in the annihilation of positronium were entangled with opposite polarizations. This was experimentally confirmed by Wu and Shaknov in 1949.

FIGURE 19.8 Emission of two polarization-correlated photons by calcium atom after laser excitation.

Circular polarization is equivalent to a superposition of linear polarizations in two perpendicular directions, say “horizontal” (H) and “vertical” (V). Whereas two quantized states of spin-1/2 particles differ in orientation by 180◦ , for photons, the two orthogonal states of polarization differ by 90◦ . Formulas derived for spin-1/2 particles can generally be applied to photons with θ/2 replaced by θ . It is useful to define the expectation value for correlation between two photons E(a, b) such that E = 1 (perfect correlation) if one is H and the other V, E = −1 (perfect anticorrelation) if they are both H or both V. Expressed as a function of θ , the angle between the polarization vectors of photons a and b, we would have E(90◦ ) = 1 and E(0◦ ) = −1. For θ = 45◦ , equal probabilities of correlation and anticorrelation would give the statistical result E(45◦ ) = 0. For arbitrary θ , the quantum-mechanical result is E(θ ) = sin2 θ − cos2 θ.

(19.14)

For angles θ = 0◦ , 45◦ and 90◦ the predictions of local realism and quantum mechanics agree. For the general case, a modification of Bell’s inequality appropriate for photon polarizations was derived by Clauser, Horne, Shimony and Holt. The CHSH relation predicts |S| ≤ 2, where

Wonders of the quantum world Chapter | 19 349

S ≡ E(a, b) − E(a, b ) + E(a , b) + E(a , b ).

(19.15)

FIGURE 19.9 Schematic diagram of Aspect’s experiment: Phys. Rev. D 14, 1944 (1976). CA and CB are two-channel polarizers which direct linearly polarized photons to the photomultipliers PM.

Violations of Bell’s inequality were successfully demonstrated by Clauser, Horne, Shimony and coworkers in the 1970s. The definitive experiments are those of Alain Aspect and coworkers in 1982. Correlated photons were produced by emission from doubly-excited calcium atoms, as shown in Fig. 19.8. An atomic beam irradiated by two lasers excites calcium atoms to their [Ar]4p 2 1 S0 level by two-photon absorption. Spontaneous emission of a 551.3 nm (green) photon accompanies a transition to [Ar]4s4p 1 P1 . This is followed very rapidly by emission of a second 422.7 nm (blue) photon, as the calcium atom returns to its [Ar]4s 2 1 S0 ground state. Each photon carries one unit of angular momentum, with their vector sum equal to zero. The two photons are accordingly entangled with opposite circular polarizations. In Aspect’s experiment, schematically represented in Fig. 19.9, polarization detectors for orientations a1 , a2 and b1 , b2 measure the linear polarizations of the two photons and coincidences are electronically monitored. Aspect’s experimental results, plotted in Fig. 19.10, show correlation statistics as a function of angle between polarimeters. These agree perfectly with the quantum-mechanical result (19.12). The maximum deviation from local realism occurs for a series of measurements with detectors separated by 22.5◦ increments. Aspect’s result gives S = 2.697 ± 0.015, in contrast to the CHSH condition S ≤ 2. Thus violation of Bell’s inequality is verified with a confidence level greater than 40 standard deviations. A further modification addresses a remote loophole for local realism in which the polarization detectors might somehow be able to signal one another at subluminal speed. Aspect designed an experiment in which mirrors switched at high speed could direct the two photons to different detectors, after they were already in flight. It was verified in this “delayed choice” experiment that Bell’s inequality is still violated, in what is currently considered the most conclusive test of non-locality.

19.7

Multiple photon entanglement

The general principles of entanglement are valid for all quantum-mechanical observables, including momentum, spin and so on. In practice however, the most

350 Introduction to Quantum Mechanics

FIGURE 19.10 Results of Aspect experiment. Dotted curve shows prediction of quantum mechanics (multiplied by 0.955 to correct for detection efficiency). Shaded regions show where Bell’s inequality is violated. From A. Aspect, P. Grangier, and G. Roger, Phys. Rev. Lett. 49, 91 (1982).

accessible entanglement phenomena involve photon polarizations. This is true because of the availability of highly efficient instrumentation for controlling and detecting optical photons. A technique for producing polarization-entangled photons pairs—more efficient than atomic cascade sources—has been used since the 1980s. Known as parametric down conversion, this is based on the nonlinear optical properties of certain crystals, most notably β-barium borate (BBO). As shown in Fig. 19.11, photons of frequency ν0 from a high-intensity argon-ion UV laser are incident on a specially-prepared BBO crystal, a small fraction of the photons are down converted to a pair of photons of frequency ν0 /2, with linear polarizations H and V. The H and V photons exiting the crystal in two particular directions can form entangled pairs, which show up at the circle intersections in Fig. 19.11. By appropriate manipulation by optical devices, four possible EPR-Bell states can be obtained, namely 1 | ± = √ |H 1 |V 2 ± |V 1 |H 2 , 2 1 (19.16) | ± = √ |H 1 |H 2 ± |V 1 |V 2 . 2 For the case of two correlated particles, the decision between local realism and quantum mechanics hinged on statistical violations of (Bell’s) inequalities. It was subsequently suggested (around 1989) by Greenberger, Horne and Zeilinger (GHZ) that for three or more correlated particles inequalities are no longer necessary. They showed that certain measurements give unambiguous results which can decide between LR and QM. Zeilinger and coworkers were able to produce GHZ states of three entangled photons in 1999 and four entangled photons in 2001. We will describe the four-photon experiments. As shown in Fig. 19.12, two pairs of entangled photons are produced by sources A and B, which are actually realized by passing a UV pulse twice through the same BBO crystal. Source A emits photons number 1 and 2, while

Wonders of the quantum world Chapter | 19 351

FIGURE 19.11 Experimental setup for producing entangled photons by type-II parametric down conversion. Conical streams of H and V polarized photons are emitted from BBO crystal. A colorized image of the output infrared radiation is shown at right. Entangled beams of photons are created at the cone intersections. Based on P. G. Kwiat, et al., Phys. Rev. Lett. 75 4337 (1995).

FIGURE 19.12 Experiment to detect four-photon entanglement. General principle sketched on left, more detail on right. Sources A and B each deliver one entangled photon pair. This is actually produced in two passes of a UV pulse through the BBO crystal. PBS is polarizing beam splitter which transmits H photons but reflects V photons. D1–D4 are photon detectors. J.-W. Pan, M. Daniell, S. Gasparoni, G. Weihs and A. Zeilinger, Phys. Rev. Lett. 86 4435 (2001).

source B emits photons number 3 and 4. The state of the four-photon system at this point can be represented as 1 1 | = √ |H 1 |V 2 − |V 1 |H 2 ⊗ √ |H 3 |V 4 − |V 3 |H 4 . (19.17) 2 2 One photon from each pair (numbers 2 and 3) are directed to the inputs of a polarizing beam splitter (PBS) which always transmits H but reflects V polarization. Detectors D2 and D3 then record coincidences in which one and only one photon enters each. (Events in which both photons enter the same detector are rejected.) Consequently, photons 2 and 3 must be both H or both V. A GHZ state of four particles results with

352 Introduction to Quantum Mechanics

1 | = √ |H 1 |V 2 |V 3 |H 4 + |V 1 |H 2 |H 3 |V 4 . 2

(19.18)

This state was confirmed by observations of fourfold coincidences that were either HVVH or VHHV, but none of the 14 other possible combinations—with a signal-to-noise ratio of about 200:1. These observations provide a necessary but not rigorously sufficient condition for the coherent state (19.18). To eliminate the remote possibility that the four photons were just a statistical superposition of HVVH and VHHV, further measurements were performed with the polarizers rotated by ±45◦ . The basis functions for ±45◦ diagonal linear polarization are given by 1 |± 45◦ = √ |H ± |L . 2

(19.19)

Transforming (19.18) to the diagonal basis results in a superposition of 8 terms containing an even number of |+ 45◦ components, but none with an odd number of |+ 45◦ s. This can be tested by comparing the coincidence counts with the four polarization filters set to (+45◦ / + 45◦ / + 45◦ / + 45◦ ) and to (+45◦ / + 45◦ / + 45◦ / − 45◦ ). The number of counts in the first configuration is found to be about 10 times that in the second. Under ideal circumstances, we would expect zero counts for the second configuration. Since local realism would predict equal numbers of counts, support for quantum mechanics is again highly convincing. The remarkable feature of this multiple entanglement is that, for any arrangement of the polarizers, the readings of any three detectors will determine the fourth with 100% certainty. Unlike the experiments testing Bell’s inequality, the predictions for a GHZ state specify a definite outcome rather than a statistical distribution. A very active field of current research is the exploration of possible applications of entangled photons to teleportation, communication, cryptography and quantum computation.

19.8 Philosophical problems of quantum theory The contents of this book, in common with most texts on quantum mechanics, has been focused on recipes for computing the properties of systems on the atomic scale. Without doubt, this approach has been highly successful, leading to some of the most extraordinarily accurate numerical predictions in any field of science–some in agreement with experiment to as many as 10 significant figures. In addition, quantum-mechanical models have stimulated the development of the technology which dominates life in the 21st Century: lasers, computers and other semiconductor devices, how stars shine, nuclear power, telecommunications (including smartphones), biotechnology, medical instrumentation (such as MRI), and many many other examples.

Wonders of the quantum world Chapter | 19 353

Still, quantum mechanics, augmented, say, by the Copenhagen interpretation–still the “party line” for most physicists–remains simply a collection of recipes, not a fully-structured physical theory. One might speculate on the possible similarity of the present state of the quantum theory with the laws of thermodynamics, as they were understood in the early 19th Century, at the time of Carnot, Joule and Clausius. Thermodynamics was a model of a successful physical theory which explained the observed behavior of heat and work, for example in steam engines, without invoking any extraneous ideas about the intrinsic nature of matter or energy. No deeper understanding of the fundamental underlying physical reality–molecules and their motions–was necessary for the success of thermodynamics. It can be argued, a proper scientific theory should tell us what are the actual things it describes and how do these things behave. Two persistent theories which have been around for many years but both of which carry heavy metaphysical baggage are the pilot wave theory of de Broglie and Bohm, and the many-worlds interpretation, first proposed by Hugh Everett. The de Broglie-Bohm theory, often called Bohmian mechanics, is a causal interpretation of quantum mechanics, proposed by Louis de Broglie in 1927 and newly advocated by David Bohm in 1952. This is the simplest example of what is often called a hidden variables interpretation of quantum mechanics. A system of particles is described by its wave function, evolving in accord with the Schrödinger equation. The wavefunction plays the part of a pilot wave, which guides the motion of the particles, with trajectories dependent on initial conditions. The particle velocities are determined by the gradient of the wavefunction. Thereby, this formulation of quantum mechanics can be considered a deterministic theory, much like Newtonian mechanics. According to Everett’s many-worlds interpretation, proposed in 1957, the evolution of a quantum system is completely governed by the Schrödinger equation, with no need to augment it by wavefunction collapse. It is proposed that every time a quantum experiment is performed, all possible outcomes are obtained, with each occurring in a different world, even if we are only aware of the one world with the outcome we actually experience. This is accordingly known as the many-worlds interpretation of quantum mechanics, which posits that there must exist a myriad of worlds in the Universe in addition to the world we are aware of. One supporter of this view is the British quantum theorist David Deutsch, who believes that the operation of a quantum computer actually involves simultaneous computations in different worlds. It has also been speculated by some that the many-worlds interpretation might be related to the cosmological idea that we actually inhabit a Multiverse. Several currently popular formulations of quantum mechanics make use of decoherence, the gradual loss of the coherence of quantum superpositions caused by interactions with the macroscopic environment. This idea was first proposed in 1970 by the H. Dieter Zeh, with major contributions by Wojtek Zurek. Decoherence might, in fact, contribute to the mechanism responsible for

354 Introduction to Quantum Mechanics

collapse of the wavefunction. When it has run its course, a quantum superpo c ψ can be reduced to a classical ensemble of states, sition such as = n n n such as n |cn |2 ρn . Decoherence is the principal obstacle to practical realization of quantum computers, since these rely on the unitary evolution of coherent quantum states. Decoherence is clearly relevant in considerations of the long-controversial measurement problem. It might also be evoked to help explain the arrow of time–how it is that time flows in one direction despite the fact that the fundamental equations are symmetric wrt time reversal. Decoherence is possibly relevant in consideration of the persistent question of how the “classical world” can emerge from quantum mechanics. It should also be recognized that quantum mechanics in its present form cannot be the ultimate theory, until it can coherently incorporate gravity. The well-known incompatibility of quantum field theory and general relativity is an indication of the work that remains to be done. Perhaps the future will eventually see some fuller understanding of quantum mechanics. To quote Viola in Twelfth Night: “O time, thou must untangle this, not I. It is too hard a knot for me t’untie.”

Chapter 19. Exercises 19.1. A singlet state for two spin-1/2 particles is described by the wavefunction 1 | = √ |α1 |β2 − |β1 |α2 , 2 where |α and |β are referred to the z-axis. For an axis perpendicular to the z-axis, the corresponding spin-up and spin-down eigenfunctions are given by 1 |↑ = √ |α − |β 2

1 |↓ = √ |α + |β . 2

Express | in terms of |↑ and |↓. You should find the form of the wavefunction to be invariant, a consequence of the spherical symmetry of the singlet spin state. 19.2. Consider an unpolarized light beam incident on an arrangement of polarizers as shown below:

The beam is totally blocked by successive horizontal and vertical polarizers, as on the left. If, however, a 45◦ polarizer is interposed, as on the right, a fraction of the incident light gets through. Explain this.

Chapter 20

Quantum computers During the past 50 years, top-of-the-line semiconductor-based computers have doubled in capacity and speed approximately every 18 months, an observation known as “Moore’s law.” As computer components get smaller, quantum effects become more significant, usually to the detriment of a computer’s reliability. But turning adversity to advantage, the two great wonders of the quantum world— superposition and entanglement—suggest powerful methods for encoding and manipulating information, far beyond the capabilities of classical computers. Richard Feynman in 1982 suggested the possibility of constructing a new type of computer taking advantage of quantum principles. As we will show, quantum computers, if they could be constructed, might vastly outperform classical computers. The potential power of quantum computation was first anticipated in a paper by David Deutsch at Oxford in 1985, which described a universal quantum computer as a generalization of a classical Turing machine. The first “killer application” for quantum computers was devised in 1994 by Peter Shor at AT&T Laboratories, an algorithm to perform factorization of large numbers much more efficiently than any classical computer. Encryption techniques such as that of Rivest, Shamir and Adleman (RSA), used for secure electronic transactions, depend on the difficulty of factoring large numbers. Another algorithm, invented in 1996 by Lov Grover, also at AT&T, was a method to speed up searches on large databases. To be sure, no quantum computer yet exists to carry out any such programs. But the motivation is certainly there. The prospect of applying quantum computers to atomic and molecular problems is highly attractive in view of the relative computational complexities of classical and quantum algorithms. Complexity is a general measure of how computer time and memory resources scale with increasing size. For classical computers, applications to atoms and molecules scale exponentially with system size, roughly of the order of 2n with increasing number of electrons n. However, using quantum-computer algorithms, in principle, the increase is only polynomial, meaning something like n3 , n4 , etc. Problems which can be solved using polynomially increasing computational resources are commonly called tractable, while those that require exponentially growing resources remain intractable. The basic unit of information in computer science is the binary digit or bit, whose physical realization is a component capable of two stable states, say |0 and |1. In a conventional computer, these usually correspond to tiny charged and uncharged capacitors within a silicon microchip. On an atomic level, bits Introduction to Quantum Mechanics. https://doi.org/10.1016/B978-0-12-822310-9.00028-8 Copyright © 2021 Elsevier Inc. All rights reserved.

355

356 Introduction to Quantum Mechanics

might in concept be represented by the two orientations of an electron spin or the two polarization states of a photon. Apart from economy of scale, an atomic two-level system has a capability beyond that of a classical component, namely the possibility of being in a coherent superposition such as | = a0 |0 + a1 |1

(20.1)

Such an entity represents the basic unit of information in a quantum computer— a quantum bit or qubit. Unlike a classical bit, which can store only a single value—a 0 or 1—a qubit can store both 0 and 1 at the same time. The state of a two-qubit register could be written | = a0 |00 + a1 |01 + a2 |10 + a3 |11

(20.2)

and contains the equivalent of four classical bits. A quantum register of 64 qubits can store 264 ≈ 1019 values at once. More remarkably, the mutual entanglement of qubits makes it possible to perform computations on all these values at the same time. A quantum computer thus has the capability of operating in a massively-parallel mode. A 300-qubit quantum computer could theoretically store 2300 ≈ 1090 bits of information, more than the estimated number of atoms in the known Universe, and also be capable of doing 2300 simultaneous calculations. A classical computer can be likened to a solo musical instrument, a quantum computer to a full orchestra. If the music is well played, a symphony is much richer in content than the sum of its parts. A major technical problem in constructing quantum computers is to minimize interactions within the machine and with the environment, which would cause decoherence–a breakdown in quantum entanglement.

20.1 Qubits The fundamental idea of quantum computing is the replacement of the elementary unit of information in digital computers, the bit, with the qubit. The qubit can be defined as a normalized quantum state vector belonging to a complex two-dimensional Hilbert space C2 . The basis vectors are conventionally designated, using Dirac notation, 1 0 |0 = , |1 = . (20.3) 0 1 A generic element of C2 can be written as a complex linear combination of two classical states: | = α|0 + β|1,

(20.4)

where α and β are complex numbers. But a qubit is an entirely different type of linear combination, which does not correspond to any well-defined classi-

Quantum computers Chapter | 20 357

FIGURE 20.1 Bloch sphere. The qubit | and the six cardinal points are marked with red dots.

cal entity. The norm || is assumed equal to 1; therefore α, β must obey the normalization condition: |α|2 + |β|2 = 1.

(20.5)

The Bloch sphere, shown in Fig. 20.1, provides a useful geometrical representation of a qubit. The angles θ and φ, analogous to latitude and longitude, describe the state of a qubit. Antipodal points represent orthogonal states, rather than just negatives of the state vector. On the Bloch sphere, |0 is mapped onto the north pole (θ = 0) and |1 onto the south pole (θ = π). The equator, with θ = π/2, contains √ √ with φ =0,√π/2,π, 3π/2, corre four additional √ cardinal points, sponding to |0 + |1 / 2, |0 + i|1 / 2, |0 − |1 / 2, |0 − i|1 / 2, respectively. There is a one-to-one correspondence between qubit states and points on the Bloch sphere. A convenient parametrization of a qubit, consistent with the above, is θ θ (20.6) | = cos |0 + sin eiφ |1, 2 2 noting the characteristic spinor half-angles. Different physical systems can serve as realizations of a qubit: |0 and |1 can represent the two polarization states of a photon, the ± 12 spin states of an electron or nuclear spin, the direction of a vortex in a superconductor or two different energy levels of an atom or molecule. Since θ and φ are real numbers over a continuous range, a qubit can contain a vast amount of information. Quoting Nielsen and Chuang, “Paradoxically, there are an infinite number of points on the unit sphere, so that, in principle, one could store an entire text of Shakespeare in the infinite binary expansion of θ.” However, an actual measurement

358 Introduction to Quantum Mechanics

of the qubit will give only one of two discrete results, ±1, which collapses the qubit to either |0 or |1.

20.2 Quantum gates and circuits The model of quantum computing we will consider is a generalization of the classical circuit model, based on classical bits and logic gates. A quantum circuit carries out transformations on qubits, using gates which correspond to unitary operations. It will suffice for our purposes to limit the unitary gates to those involving just one or two qubits. Quantum gates are the analogs, for qubits, of classical logic gates. There is only one single-bit logic gate, namely, the NOT gate, but several other single-qubit quantum gates are possible. A single α α into the qubit . To satisfy qubit quantum gate transforms the qubit β β Eq. (20.5), we must have: |α |2 + |β |2 = |α|2 + |β|2 = 1, (20.7) α so that the norm of the state is invariant. Since we want to preserve linβ earity, there is only one class of operators that can serve as quantum gates, namely unitary operators. Unitary operators U leave the norm invariant, so that |U | = || for all . The most important single-qubit gates are shown in Fig. 20.2, along with the 2 × 2 unitary transformations they produce. For example, application of the Hadamard gate to each of the basis states gives 1 H |0 = √ |0 + |1 , 2

1 H |1 = √ |0 − |1 . 2

(20.8)

The Pauli-X, Y and Z gates are named after the Pauli spin operators, σx , σy and σz , which have the same matrix representations. The Pauli-X gate is equivalent to the NOT gate, which interchanges basis states X|0 = |1,

X|1 = |0,

(20.9)

just as the classical NOT gate ◦. Operators representing rotation of a qubit on the Bloch sphere, about the x, y, and z axes, can be constructed by exponentiation of the Pauli matrices, as the following: cos α2 −i sin α2 α α −iαX/2 Rx (α) = e = cos 2 I − i sin 2 X = , −i sin α2 cos α2 β β cos − sin β β 2 2 , Ry (β) = e−iβY/2 = cos 2 I − i sin 2 Y = sin β2 cos β2

Quantum computers Chapter | 20 359

FIGURE 20.2 Single-qubit gates.

Rz

(γ ) = e−iγ Z/2

= cos

γ 2I

− i sin

γ 2Z

=

e−iγ /2

0

0

eiγ /2

. (20.10)

A generalization of (20.10), a rotation by θ about a unit vector to the Bloch sphere, n = (n1 , n3 , n3 ) is represented by θ θ Rn (θ ) = e−iθ n·/2 = cos I − i sin n · , 2 2

(20.11)

where = (X, Y, Z). An arbitrary rotation on the Bloch sphere can be described by sequential rotations about three Euler angles. From a multitude of possible conventions, we will choose the sequence: (1) rotation about the z-axis by an angle α; (2) rotation about the new x-axis by an angle β; (3) rotation about the new z-axis by an angle γ . This sequence can be represented by the unitary operator U = eiδ Rz (γ )Rx (β)Rz (α),

(20.12)

where the global phase factor eiδ enables some resulting matrices to be expressed in standard form. Any unitary operation on a single qubit can be written as a combination of rotations in the form (20.12), often in alternative ways. For example, you can show that U becomes the Hadamard gate H with the angles α = β = γ = δ = π/2. Another combination gives H = iRx (π)Ry (π/2). As we shall see later, rotation gates are if central importance in the simulation of quantum systems. Multi-qubit states and operations can be represented as tensor products and constructed asKronecker products. For example, let A = their matrices a11 a12 b11 b12 and B = represent matrices in C2 . Then the matrix a21 a22 b21 b22 for A ⊗ B in C2 ⊗ C2 is given by

360 Introduction to Quantum Mechanics

FIGURE 20.3 Controlled-NOT (CNOT) gate.

⎛

A⊗B =

a11 B a21 B

a b ⎜ 11 11 a12 B ⎜a b = ⎜ 11 21 ⎝a21 b11 a22 B a21 b21

The two-qubit state with 1 =

a11 b12 a11 b22 a21 b12 a21 b22

a12 b11 a12 b21 a22 b11 a22 b21

⎞ a12 b12 ⎟ a12 b22 ⎟ ⎟ . (20.13) a22 b12 ⎠ a22 b22

α1 α2 , 2 = is given by β1 β2 ⎛

⎞ α1 α2 ⎜α1 β2 ⎟ ⎟ = 1 ⊗ 2 = ⎜ ⎝β1 α2 ⎠ .

(20.14)

β1 β2 Next we consider two-qubit gates. Two qubits gates operate in the C2 ⊗ C2 space. Thus they can be represented by 4 × 4 unitary matrices. Since the inverse of the unitary matrix U is the matrix U † , all quantum gates are reversible, in contrast to classical logic gates. For a quantum gate, the input can be deduced from the output. But for a classical XOR gate x ⊕ y, for the output 1, we do not know whether the input was |10 or |01. The output does not identify the input. The simplest two-qubit gate is the CNOT (controlled-NOT) gate. We denote the two input qubits by x, y. The first qubit x is called the control qubit, the second y the target qubit. This target qubit flips if and only if x = 1. If x = 0 the second qubit remains unchanged. A representation of the CNOT gate is shown in Fig. 20.3. The black dot represents the action of the control qubit. The unitary matrix corresponding to the CNOT gate is given by ⎛ ⎞ 1 0 0 0 ⎜ ⎟ ⎜0 1 0 0⎟ UCNOT = ⎜ (20.15) ⎟. ⎝0 0 0 1⎠ 0 0 1 0 In effect, UCNOT leaves the states |00, |01 invariant, but swaps the states |10 and |11. The CNOT gate is the fundamental creator of entangled qubits, performing, in a sense, the “quantization” of a classical gate to a quantum gate.

Quantum computers Chapter | 20 361

FIGURE 20.4 Controlled-V gate.

FIGURE 20.5 Inverted controlled-V gate.

A generalization of the CNOT gate is a controlled gates acting on 2 qubits, with the control qubit determining some unitary operation V on the target qubit. If the single qubit gate V is represented by the 2 × 2 matrix: V V = 11 V21 The corresponding 2-qubit matrix is ⎛ 1 ⎜ ⎜0 UCV = ⎜ ⎝0 0

0 1 0 0

V12 V22

0 0 V11 V21

(20.16)

⎞ 0 ⎟ 0 ⎟ ⎟. V12 ⎠ V22

(20.17)

The symbol for the controlled-V gate is shown in Fig. 20.4. Exchanging the control and target registers gives an inverted controlled gate. Fig. 20.5 shows an inverted controlled-V gate, which is represented by the matrix: ⎞ ⎛ 1 0 0 0 ⎟ ⎜ ⎜0 V11 0 V12 ⎟ (20.18) UICV = ⎜ ⎟. ⎝0 0 1 0 ⎠ 0 V21 0 V22 1 0 If V is a phase gate, such as S or T , in general Rn = n−1 , then the 0 eiπ/2 corresponding controlled and inverted-controlled gates are identical:

362 Introduction to Quantum Mechanics

FIGURE 20.6 SWAP gate.

FIGURE 20.7 Sequence of CNOT operations equivalent to SWAP gate.

⎛

⎞

1 ⎜0 ⎜ UCR = UICR = ⎜ ⎝0

0 1 0

0 0 1

0 0 0

0

0

0

eiπ/2

⎟ ⎟ ⎟. ⎠

(20.19)

n−1

Another important 2-qubit gate is the SWAP gate, which swaps the states of two input qubits, such that ⎛

1 ⎜ ⎜0 USWAP = ⎜ ⎝0 0

0 0 1 0

0 1 0 0

⎞ 0 ⎟ 0⎟ ⎟. 0⎠ 1

(20.20)

This gate is shown in Fig. 20.6. It is easy to verify the equivalence of the SWAP gate with the sequence of CNOT operations shown in Fig. 20.7 In a quantum computer circuit, the path of each qubit is represented by a single horizontal line —— which usually connects a sequence of gates. The qubit is understood to move along the “quantum wire” from left to right. A wire carrying a classical bit, 0 or 1, usually after a measurement, is indicated by a double line ==. As an example, let us show a quantumcircuit which can produce Bell states, which are the superpositions |± = √1 |00 ± |11 and | ± = 2 √1 |01 ± |10 . These are maximally entangled states of two qubits. Bell states 2 can be produced by the circuit shown in Fig. 20.8. The first qubit is passed through a Hadamard gate and then the two qubits are entangled by a CNOT gate. If the input to the system is |0 ⊗ |0 = |00, then the Hadamard gate changes the state to

Quantum computers Chapter | 20 363

FIGURE 20.8 Circuit producing a Bell state.

FIGURE 20.9 Circuit producing GHZ state √1 |000 + |111 . 2

1 1 √ |0 + |1 ⊗ (|0 = √ |00 + |10 , 2 2 and the CNOT gate converts this to √1 |00 + |11 = |+ . For the four pos2 sible inputs, we find 1 1 |00 → √ |00 + |11 = |+ , |01 → √ |01 + |10 = | + , 2 2 1 1 − |10 → √ |00 − |11 = | , |11 → √ |01 − |10 = | − . 2 2 (20.21) Bell states can simulate the correlation of measurements in Bohm’s version of an Einstein-Podolski-Rosen experiment, described in Sect. 19.5. In the states |+ or |− , a measurement collapses the two-particle state to either |00 or |11, with equal probability. Thus, if a spin measurement on one particle gives +1 or −1, the other particle must necessarily give the same result. Similarly, the states | + or | − collapse in a measurement to either |01 or |10. The spin measurements on the two particle will then, of necessity, give opposite results. Another interesting example is a quantum circuit, shown in Fig. 20.9 which can produce a GHZ state, which is an entangled superposition of three qubits: 1 |GHZ = √ |000 + |111 . 2

(20.22)

GHZ refers to Greenberger, Horne and Zeilinger, who first produced entangled sets of three (and four) photons. Such experiments enabled demonstration of the

364 Introduction to Quantum Mechanics

validity of Bell’s theorem in a single run, rather than from statistical analysis of a multitude of runs. Bell’s theorem states that the results of quantum mechanics are incompatible with any picture based on local realism. The quantum algorithms we are considering in this Chapter are based on the quantum circuit model. This uses one- and two-qubit gates sequentially connected by wires. We begin with an initial register of qubits to be converted by the action of the circuit into a resultant register of classical bits, on which measurements can be made.

20.3 Simulation of a Stern-Gerlach experiment In the Stern-Gerlach experiment (see Section 7.6), an unpolarized beam of neutral particles of spin-1/2 is directed through an inhomogeneous magnetic field (blue and red magnets, which produces separated beams of spin-up and spindown particles). For simplicity, only the outgoing spin-up beam is shown in Fig. 20.10. This beam is then directed through a second magnet, for which the polarization can be rotated by an angle θ from the original. This further splits the beam (except when θ = 0 or π) into spin-up and spin-down beams with respect to the new polarization direction. Again, only the spin-up component is shown. The probability for a particle to emerge with spin-up (↑) or spin-down (↓) is given by cos2 (θ/2) and sin2 (θ/2) respectively. The resulting probabilities of ↑ and ↓ are shown for five selected angles. The results of the Stern-Gerlach experiment can be simulated by a quantum computer. The qubits |0 and |1 correspond to the spin states ↑ and ↓, respectively. The initial state |0 corresponds to the polarized beam leaving the first magnet. By an appropriate sequence of quantum gates, the results of the beam passing through the second magnet, with polarization angle θ, can be simulated. The statistical results are verified after a large number of runs on the quantum computer, as shown after the measurement symbol. For example, the θ = π/4 rotation is produced by the sequence

1 1 1 H T H |0 = 1 −1 0

1 1 eiπ/4 0

1 −1

1 1 iπ/4 1 + e . = 2 2 1 iπ/4 1 0 2 − 2e

The probability of a result |0 in the subsequent measurement is then given by 1 1 iπ/4 2 2 + 2e = 0.8536, or about 85% spin-up.

20.4 Quantum Fourier transform The quantum Fourier transform is closely analogous to the well-known discrete Fourier transform (DFT), whereby set of N complex numbers xj (j = 0, 1, . . . , N − 1) can be transformed into another set of N complex numbers yk (k = 0, 1, . . . , N − 1) by the relations:

Quantum computers Chapter | 20 365

FIGURE 20.10 Stern-Gerlach experiment and quantum-computer simulation. N −1 1 yk = √ xj (ωN )kj , N j =0

ωN = e2πi/N .

(20.23)

366 Introduction to Quantum Mechanics

FIGURE 20.11 Circuit with Hadamard gate; k = 0 or 1.

N−1 ∗ kj The inverse DFT is then given by xj = √1 k=0 yk (ωN ) . A quantum N Fourier transform QFT, which we denote by F , is equivalent to a DFT on the amplitudes of a quantum state, =

N−1

αj |j ,

with

j =0

N−1

|αj |2 = 1,

(20.24)

j =0

such that F=

N−1

βk |k,

with

k=0

N−1 1 βk = √ αj (ωN )kj . N j =0

(20.25)

The unitary operator corresponding to a quantum Fourier transform is given by (with ω written for ωN ): ⎞ ⎛ 1 1 1 ... 1 ⎜1 . . . ωN−1 ⎟ ω ω2 ⎟ ⎜ ⎟ ⎜ 1 ⎜1 2 4 2(N−1) ⎟ ω ω ... ω F=√ ⎜ (20.26) ⎟. ⎟ N⎜ .. .. .. ⎟ ⎜ .. . . . ⎠ ⎝. 1 ωN−1

ω2(N−1)

...

ω(N−1)

2

Most often, the QFT is applied to individual basis functions |k, such that N−1 1 (ωN )kj |j . F |k = √ N j =0

(20.27)

The action of a Hadamard gate is the simplest example of a QFT, with N = 2, ω2 = −1, as shown in Eq. (20.8) and Fig. 20.11. A quantum state represented by n qubits has N = 2n basis functions. For example, a 2-qubit state is spanned by 4 basis functions, which can be designated as follows |0 = |0 ⊗ |0 = |00, |2 = |1 ⊗ |0 = |10,

|1 = |0 ⊗ |1 = |01, |3 = |1 ⊗ |1 = |11.

(20.28)

To summarize: |k = |k1 ⊗ |k2 = |k1 k2 ,

(20.29)

Quantum computers Chapter | 20 367

FIGURE 20.12 Circuit for 2-qubit quantum Fourier transform.

FIGURE 20.13 Details for 2-qubit quantum Fourier transform.

where k is expressed as a binary number. With N = 4, ω = ω4 = eiπ/2 = i, the QFT of a basis function is given by 1 kj ω |j . 2 3

F |k =

(20.30)

j =0

Noting that |j = |j1 ⊗ |j2 , F |k =

1 1 1 1 2kj1 kj2 ω ω |j1 ⊗ |j2 = |0 + ω2k |1 ⊗ |0 + ωk |1 = 2 2 j1 =0 j2 =0 k2 1 1 √ |0 + eiπk2 |1 ⊗ √ |0 + eiπ(k1 + 2 ) |1 . (20.31) 2 2

It has also been noted that e2πik1 = 1. Fig. 20.12 shows a circuit which implements the 2-qubit QFT. The details of its operation are represented in Fig. 20.13. We begin with the state |0 = |k1 ⊗ |k2 . The Hadamard gate on the first register produces the state 1 |1 = √ |0 + eiπk1 |1 ⊗ |k2 . 2

(20.32)

The subsequent action of the inverted controlled-S gate uses the unitary operator ⎞ ⎛ 1 0 0 0 ⎜0 1 0 0 ⎟ ⎟ ⎜ (20.33) UICS = ⎜ ⎟, 0 ⎠ ⎝0 0 1 0 0 0 eiπ/2

368 Introduction to Quantum Mechanics

FIGURE 20.14 Circuit for 3-qubit quantum Fourier transform.

which gives k2 1 |2 = UICS |1 = √ |0 + eiπk1 eiπ 2 |1 ⊗ |k2 . 2 A Hadamard gate in the second register then results in k 1 1 iπ k1 + 22 |3 = √ |0 + e |1 ⊗ √ |0 + eiπk2 |1 . 2 2

(20.34)

(20.35)

The final result is produced by a SWAP gate: k 1 1 iπ k1 + 22 |4 = SWAP|3 = √ |0 + eiπk2 |1 ⊗ √ |0 + e |1 . 2 2 (20.36)

By analogy with the above procedure, we can derive the 3-qubit QFT. Eq. (20.31) generalizes to k3 1 1 F |k = √ |0 + eiπk3 |1 ⊗ √ |0 + eiπ(k2 + 2 ) |1 2 2 k2 k3 1 ⊗ √ |0 + eiπ(k1 + 2 + 4 ) |1 = 2 1 1 √ |0 + e2πi[0.k3 ] |1 ⊗ √ |0 + e2πi[0.k2 k3 ] |1 2 2 1 ⊗ √ |0 + e2πi[0.k1 k2 k3 ] |1 . (20.37) 2 In the last line, we have expressed k in binary fractional notation, such that k = k1 +

k 2 k3 + + · · · = 2[0.k1 k2 k3 . . . ]. 2 4

(20.38)

The corresponding circuit is shown in Fig. 20.14. In all of the above examples, a QFT applied to a single basis qubit transforms it into a equally-weighted linear combination of all N basis qubits, with complex phase factors of unit magnitude.

Quantum computers Chapter | 20 369

FIGURE 20.15 Circuit for n-qubit quantum Fourier transform.

The inverted controlled-T gate used in the circuit is represented by the unitary matrix ⎞ ⎛ 1 0 0 0 ⎜0 1 0 0 ⎟ ⎟ ⎜ UICT = ⎜ (20.39) ⎟. 0 ⎠ ⎝0 0 1 0 0 0 eiπ/4 We can define a generalized controlled-Rn gate such that ⎛

⎞

1 ⎜0 ⎜ Rn = ⎜ ⎝0

0 1 0

0 0 1

0 0 0

0

0

0

eiπ/2

⎟ ⎟ ⎟. ⎠

(20.40)

n−1

The general n-qubit QFT circuit is shown in Fig. 20.15.

20.5

Phase estimation algorithm

An important application of the quantum Fourier transform is phase estimation. In applications to quantum chemistry, this enables energy eigenvalues to be calculated after the action of the evolution operator e−iH t . Let U be a known n 2n × 2n unitary matrix and | ∈ C2 , one of its eigenvectors. We can then write U | = eiϕ |,

(20.41)

since the eigenvalues of a unitary matrix are complex numbers of unit magnitude (sometimes called eigenphases). The action of the operator U is performed by some otherwise unspecified “black box,” also known as a oracle. This can be symbolically represented by | —– U —– eiϕ |. The basic maneuver leading to the phase-estimation algorithm is known as phase kickback. Consider the

370 Introduction to Quantum Mechanics

FIGURE 20.16 Phase kickback.

little circuit in Fig. 20.16, involving a single-qubit function |ψ. The initial input to the two registers can be written |0 ⊗ |ψ. The Hadamard gate transforms this to √1 (|0 ⊗ |ψ + |1 ⊗ |ψ). The controlled-U gate then acts only for the |1 2 component of the control register. The result is 1 1 √ |0 ⊗ |ψ + |1 ⊗ eiϕ |ψ = √ |0 + eiϕ |1 ⊗ |ψ, 2 2

(20.42)

with the target register |ψ emerging unchanged. After action of another Hadamard gate, the control register becomes 1 1 1 H √ |0 + eiϕ |1 = |0 + |1 + eiϕ |0 − |1 2 2 2 iϕ 1 − eiϕ 1+e [0 + [1. = 2 2

(20.43)

Finally, a measurement on the control register results in a bit 0 or 1, with probabilities: 1 − eiϕ 2 1 + eiϕ 2 2 ϕ = sin2 ϕ . = cos , p(1) = (20.44) p(0) = 2 2 2 2 A statistical analysis then enables the phase ϕ to be determined. For a multiqubit function |, suppose it is desired to estimate the phase ϕ to an accuracy of n bits. We then need to subject the first n qubits of the eigenvector | to a series of controlled operators, involving powers of U , followed by an inverse quantum Fourier transform. The circuit implementing the phase estimation shown in Fig. 20.17. The input of the first register consists of m qubits, all prepared in the state |0. This is an example of an ancilla, which is an entangled component of a quantum computer intended to assist in carrying out an algorithm. The second register contains the input of n qubits, which represent the state |. For concreteness, we consider the particular case m = 3, n = 3. 2n An H gate, by followed a controlled U (for n = 0, 1, 2), maps the qubit |0 nϕ 1 i2 √ into |0 + e |1 . In this way the output of the top register is the product 2 state: 1 1 1 |OUT = √ |0 + e4iϕ |1 ⊗ √ |0 + e2iϕ |1 ⊗ √ |0 + eiϕ |1 . (20.45) 2 2 2

Quantum computers Chapter | 20 371

FIGURE 20.17 Circuit implementing phase estimation.

Comparing this with the last line of Eq. (20.37), |OUT is seen to represent the Fourier transform of a vector |k = |k1 ⊗ |k2 ⊗ |k3 . Accordingly, |k can be determined by the inverse Fourier transform: |k = F −1 |OUT.

(20.46)

The magnitude of k (0 ≤ k ≤ 1) is given by k = [0.k1 k2 k3 ]. This represents an approximation to the phase ϕ, to 3-bit accuracy: ϕ = 2π [0.k1 k2 k3 . . . ].

(20.47)

Therefore, the circuit of Fig. 20.17 can determine the phase ϕ to this level of accuracy. The error in this value is bounded by the first neglected bit, numbered n + 1. Thus the error in the measured phase, say δϕ, is of the order of 2π/2n+1 . A plot showing a series of repeated determinations of ϕ might resemble a Gaussian centered at 2πk with σ ≈ δϕ. Clearly, increasing the number of qubits in |k, improves the accuracy of the computed phase.

20.6

Many-electron systems

Their representation requires systems of multiple qubits. The Hilbert space of D qubits, C2D , is the tensor product of D copies of C2 , namely, C2 ⊗ C2 . . . ⊗ C2 , D times. An orthonormal basis in each space C2 . The extended basis then consists of 2D product states: |δ1 , δ2 , . . . , δD = |δ1 ⊗ |δ2 ⊗ .... ⊗ |δD .

(20.48)

We use the symbols δk , rather than nk , for the occupation numbers to emphasize that they can equal only 0 or 1 (inspired by the Kronecker delta δij ). The multi-dimensional systems of particular interest are many-electron atoms or molecules, as represented by Fock states with D spinorbitals, occupied by N electrons in all combinations of occupation numbers. The D-dimensional Fock states will include N occupied spinorbitals and D − N unoccupied (or virtual) spinorbitals. Most often, the virtual spinorbitals have positive energies

372 Introduction to Quantum Mechanics

k > 0. On a classical computer, enumeration of all the possible Fock states for a D-dimensional system would require 2D memory locations. But in a quantum computer, each qubit can be in a superposition of states, containing the same quantity of information. Moreover, computations can, in principle, be run simultaneously on the entire assemblage, exhibiting what is known as quantum parallelism. The classical algorithm has exponential complexity wrt increasing system size, while the quantum algorithm is approximately polynomial. As an idea of the scales involved, a 30-qubit quantum computer could simultaneously process 1030 ≈ 1 billion states. Fermion creation operators ak† and annihilation operators ak can add or remove electrons from the Fock state. These obey the anticommutation relations {aj , ak } = {aj† , ak† } = 0.

{aj , ak† } = δj k ,

(20.49)

The vacuum state |vac, the formal starting point for manipulations in Fock space, is defined by |vac = |01 , 02 , . . . , 0D .

(20.50)

An electron in a spinorbital k is added to the system using the creation operator ak† : ak† |vac = |01 , 02 , . . . , 1k , . . ..

(20.51)

Adding a second electron in the spinorbital j gives aj† ak† |vac = |01 , 02 , . . . , 1j , . . . , 1k , . . .,

(20.52)

if j comes before k in the Fock space ordering, but aj† ak† |vac = −|01 , 02 , . . . , 1k , . . . , 1j , . . .,

(20.53)

if k comes before j . The sign change is a consequence of the anticommutation aj† ak† = −ak† aj† . This accords with the antisymmetry principle for manyelectron systems. Clearly, also, ak† ak† = 0, which is a statement of the Pauli exclusion principle. For the action of a creation operator on an arbitrary Fock state k−1

ak† |δ1 , δ2 , . . . , 0k , . . . = (−1)

j =1 δj

|δ1 , δ2 , . . . , 1k , . . .,

(20.54)

where the sum counts the number of occupied spinorbitals j < k. Of course, ak† |δ1 , δ2 , . . . , 1k , . . . = 0, if the spinorbital k is already occupied. Analogous relations hold for the annihilation operators ak . In particular, k−1

ak |δ1 , δ2 , . . . , 1k , . . . = (−1) and ak |δ1 , δ2 , . . . , 0k , . . . = 0.

j =1 δj

|δ1 , δ2 , . . . , 0k , . . .,

(20.55)

Quantum computers Chapter | 20 373

20.7

Atomic and molecular Hamiltonians

An atomic or molecular N -electrons system can be represented by the Hamiltonian N N ZA 1 1 2 , (20.56) H= − ∇i − + 2 |ri − RA | rij i>j =1

A

i=1

in atomic units and with the usual assumption of the Born-Oppenheimer approximation and the neglect of relativistic effects. The Hamiltonian contains only one-body and two-body interactions, which makes it feasible to simulate them with quantum-computer circuits. In second-quantized notation, the Hamiltonian can be expressed as H=

hj,k aj† ak +

j,k

1 2

vj,k,p,q aj† ak† ap aq

(20.57)

j =k,p=q

The coefficients hj,k and vj,k,p,q need to be evaluated in advance on a classical computer. In terms of a selected set of basis functions {φk (x)}, we have 1 2 ZA hj,k = φj (x) − ∇ − (20.58) φk (x) , 2 |r − RA | A

and

1 φp (x)φq (x ) . = φj (x)φk (x ) |r − r |

vj,k,p,q

(20.59)

The next step is to assemble a set of quantum gates which will simulate the actions of the creation and annihilation operators in this Hamiltonian. Qubits are often physically realized with the spin- 12 states of electrons or nuclei. The spin-up state is written |1 (or sometimes |1/2), the spin-down state, |−1 (or |−1/2). The analogous qubits are designated |0 and |1, respectively. In Pauli spin algebra, raising and lowering operators are defined by σ± =

1 σx ± iσy , 2

(20.60)

such that σ + | − 1 = |1,

σ + |1 = 0,

σ − |1 = | − 1,

σ − | − 1 = 0. (20.61)

These relations suggest the analogous quantum gates Q†k =

1 Xk + iYk , 2

Qk =

1 Xk − iYk , 2

(20.62)

374 Introduction to Quantum Mechanics

such that Q†k |1k = |0k ,

Q†k |0k = 0,

Qk |0k = |1k ,

Qk |1k = 0. (20.63)

It is useful to review the multiplicative relations involving Pauli gates on the same qubit: X 2 = Y 2 = Z 2 = I, XY = −Y X = iZ, Y Z = −ZY = iX, ZX = −XZ = iY, QZ = −ZQ = Q,

Q† Z = −ZQ† = −Q† . (20.64)

The correspondence between creation/annihilation operators and Pauli gates works very well for actions on a single qubit. However, a problem arises for multiple qubits: operators on different spins commute, but operators on multifermion states need to obey the appropriate anticommutation relations. This can be fixed by means of a Jordan-Wigner transformation. Essentially, this determines the proper ± signs, just like the (−1)δ factors in Eqs. (20.54) and (20.55). Recall the action of the Pauli-Z gate Zk |0k = +|0k ,

Zk |1k = −|1k .

(20.65)

This suggests a form for the quantum gate corresponding to the creation operator ak† , which we designate Ak† : † D Ak† = ⊗k−1 j =1 Zj ⊗ Qk ⊗j =k+1 Ij .

(20.66)

D Ak = ⊗k−1 j =1 Zj ⊗ Qk ⊗j =k+1 Ij .

(20.67)

Analogously,

Note that the gates Ak† and Ak are now non-local, since their action depends on the other qubits in the Fock state. For example, for D = 4, k = 3, we have A3† = Z1 ⊗ Z2 ⊗ Q†3 ⊗ I4 ,

A3 = Z1 ⊗ Z2 ⊗ Q3 ⊗ I4 .

(20.68)

Next, we show how to construct quantum-computer circuits to represent parts of the molecular Hamiltonian. For the diagonal terms in the one-particle sum in (20.57), the operator products ak† ak = Nk , which represent number operators. The corresponding Jordan-Wigner operators are given by † D k−1 D Ak† Ak = ⊗k−1 Z ⊗ Q ⊗ I Z ⊗ Q ⊗ I ⊗ = j j j k j j =k+1 k j =k+1 j =1 j =1 1 1 1 Q†k Qk = Xk + iYk Xk − iYk = 2Ik − i Xk , Yk = 2 2 4 1 (Ik + Zk ) , (20.69) 2

Quantum computers Chapter | 20 375

noting that Zj2 = Ij , Xk2 = Yk2 = Ik and Xk , Yk = 2iZk . For off-diagonal contributions to the one-particle sum, such as aj† ak , consider the case j, k = 2, 3: A3† A2 = Z1 ⊗ Z2 ⊗ Q†3 ⊗ I4 Z1 ⊗ Q2 ⊗ I3 ⊗ I4 = Z12 ⊗ Z2 Q2 ⊗ Q†3 I3 ⊗ I4 = −I1 ⊗ Q2 ⊗ Q†3 ⊗ I4 , (20.70) while

A2† A3 = Z1 ⊗ Q†2 ⊗ I3 ⊗ I4 Z1 ⊗ Z2 ⊗ Q3 ⊗ I4 = Z12 ⊗ Q†2 Z2 ⊗ I3 Q3 ⊗ I4 = −I1 ⊗ Q†2 ⊗ Q3 ⊗ I4 . (20.71)

Thus

A2† A3 + A3† A2 = −I1 ⊗ Q†2 ⊗ Q3 + Q2 ⊗ Q†3 ⊗ I4 = 1 − I1 ⊗ X2 ⊗ X3 + Y2 ⊗ Y3 ⊗ I4 . (20.72) 2

If the indices j, k are not consecutive, then the intervening indices m will contribute factors of Zm interposed between the j and k factors. Thus 1 A1† A3 + A3† A1 = − X1 ⊗ Z2 ⊗ X3 + Y1 ⊗ Z2 ⊗ Y3 ⊗ I4 , 2 1 † † A1 A4 + A4 A1 = − X1 ⊗ Z2 ⊗ Z3 ⊗ X4 + Y1 ⊗ Z2 ⊗ Z3 ⊗ Y4 , 2 (20.73) and so forth. Proceeding analogously for the two-particle terms in the Hamiltonian (20.57), we need to evaluate products of the form Aj† Ak† Ap Aq using (20.66) and (20.67). For the sum of two-electron Coulomb plus exchange terms, we find Aj† Ak† Aj Ak − Aj† Ak† Ak Aj =

1 Ij + Zj ⊗ (Ik + Zk ) . 2

(20.74)

For the case j = k = p = q, Aj† Ak† Ap Aq + A†q Ap† Ak Aj =

1 Xj ⊗ Xk ⊗ Xp ⊗ Xq − Xp ⊗ Xq ⊗ Yj ⊗ Yk 2 +Xk ⊗ Xq ⊗ Yj ⊗ Yp + Xj ⊗ Xq ⊗ Yk ⊗ Yp + Xk ⊗ Xp ⊗ Yj ⊗ Yq +Xj ⊗ Xp ⊗ Yk ⊗ Yq − Xj ⊗ Xk ⊗ Yp ⊗ Yq + Yj ⊗ Yk ⊗ Yp ⊗ Yq . −

(20.75)

376 Introduction to Quantum Mechanics

20.8 Time-evolution of a quantum system The principal goal of the quantum-computer program we are outlining is to find accurate values of energy eigenvalues, most often just the ground-state energy. The method exploits the fact that the time-evolution of a quantum system exhibits its eigenvalues, in the form of a Fourier superposition of its eigenstates. Let us begin with the time-dependent Schrödinger equation for an arbitrary quantum state (x1 , x2 , . . . , xN , t), abbreviated (x, t): i

∂(x, t) = H (x, t). ∂t

(20.76)

The formal solution, with an initial state (x, 0), can be written (x, t) = e−iH t (x, 0),

(20.77)

where the exponential, known as the evolution operator or propagator, is explicitly a representation of the power series e−iH t =

∞ (−iH t)n n=0

n!

= 1+(−it)H +

(−it)2 2 (−it)3 3 H + H +. . . . (20.78) 2! 3!

The initial state (x, 0) is formally represented by a superposition of eigenstates of the Hamiltonian H : (x, 0) = cm ψm (x) = c0 ψ0 (x) + c1 ψ1 (x) + c2 ψ2 (x) + · · · + continuum, m

(20.79) where H ψm (x) = Em ψm (x) = ωm ψm (x). It is useful to express the energies in frequency units Em = ωm = ωm . We label the ground state as m = 0. Applying the evolution operator, we find (x, t) = e−iH t (x, 0) = cm e−i ωm t ψm (x) = m

c0 e

−i ω0 t

ψ0 (x) + c1 e−i ω1 t ψ1 (x) + · · · + continuum.

(20.80)

The scalar product of (x, t) with (x, 0) is an instance of an autocorrelation function: F (t) = (x, 0)|(x, t) = (x, 0)|e−iH t |(x, 0) = |cm |2 e−i ωm t = |c0 |2 e−i ω0 t + . . . .

(20.81)

m

The Fourier transform of the autocorrelation function exhibits the spectrum of eigenvalues

Quantum computers Chapter | 20 377

FIGURE 20.18 Simulation of computed spectrum in quantum computation.

G(ω) =

1 |cm |2 δ(ω − ωm ) + continuum. 2π m

(20.82)

In practice, the peaks of G(ω) will be finite, because of approximations and inaccuracies, something like the spectrum shown in Fig. 20.18. The objective pursued in this Chapter is computation of the quantized energy levels of a molecular system. This will be realized by simulation of the dynamics of the system, generated by the time-evolution operator e−iH t . To do this on a quantum computer, we must construct a set of one- and two-qubit gates that implement this exponential transformation on a set of qubits. Approximations to the energy eigenvalues then appear in the phases of qubit states, which are determined using the phase-estimation algorithm.

20.9

Trotter expansions

The time-evolution operator e−iH t is constructed by exponentiation of the second-quantized representation of the molecular Hamiltonian. However, the terms the Hamiltonian, H = h1 + h2 + . . . , do not generally commute among themselves. Therefore approximate expansions involving noncommuting exponential operators are necessary. Consider an exponential operator of the form e(A+B)t where A and B do not, in general, commute.1 If the time interval t is broken up into n subintervals t = t/n, the Lie-Trotter product formula states that2 e(A+B)t = lim

t→0

eAt eBt

n .

(20.83)

1

1 Recall the Baker-Hausdorff formula e(A+B)t = eAt eBt e− 2 [A,B]t 2 + O(t 3 ). 2 A simple proof:

e(A+B)t = lim

n→∞

1+

n n (A + B)t n At n Bt n eBt/n . = lim 1 + = lim eAt/n 1+ n→∞ n→∞ n n n

378 Introduction to Quantum Mechanics

For a finite value of n, this is approximated by n e(A+B)t ≈ eAt eBt + O(t n+1 )

(20.84)

The right hand side can be rearranged to n n eAt/2 eBt/2 eAt/2 eBt/2 = eAt/2 eBt eAt/2 or n eBt/2 eAt eBt/2 .

(20.85)

This is the form generally used in applications of Trotter approximations for exponential operators, with values of n anywhere from 1 to 10, as appropriate. The propagator e−iH t should act over sufficiently long time to resolve the leading Fourier frequencies to a desired level of precision. As shown in Eqs. (20.10), exponential operators involving Pauli matrices can be evaluated in closed form.

20.10

Simulations of molecular structure

By applying the Jordan-Wigner transformation and Trotter decomposition, it becomes possible to approximate the energy levels of a fermionic many-body Hamiltonian system on a quantum computer. The energy levels of a quantum system can, in principle, be determined by simulating its time evolution, by action of the operator U = e−iH t . This might be done on a quantum computer using a set of one- and two-qubit gates that implement the time evolution on a set of qubits. Recent work by Aspuru-Guzik and coworkers has been able to create a quantum circuit to compute the energy of a molecular system with fixed nuclear geometry using a recursive quantum phase estimation algorithm to simulate a full configuration interaction (FCI) computation. Let us develop the method beginning with a simple example. A singleparticle term in the Hamiltonian, under a Jordan-Wigner transformation, Eq. (20.69) has the form: H = ha † a → hA† A =

h (I + Z). 2

(20.86)

If this were the only term in the Hamiltonian, the time-evolution operator would be given by h

U = e−i 2 (I +Z)t = e−iht/2 e−ihtZ/2 .

(20.87)

The exponential operator reduces to separate factors here since I and Z commute. The second factor has the form of a “rotated” Pauli operator, Eq. (20.10). Therefore ht ht e−iht/2 0 −ihZt/2 = cos I − i sin Z= e (20.88) 2 2 0 eiht/2

Quantum computers Chapter | 20 379

FIGURE 20.19 Gate for two-particle terms.

and

e−iht U= 0

0 . 1

(20.89)

1 Taking (0) = , the autocorrelation function is given by 0 F (t) = † (0)U (t)(0) = e−iht ,

(20.90)

which implies that the system has a single eigenvalue: E0 = h.3 Consider next a two-particle contribution to the Hamiltonian, representing the Coulomb plus exchange integrals for spinorbitals 1 and 2. Using (20.74) we have v(a1† a2† a2 a1 − a1† a2† a1 a2 ) → v(A1† A2† A2 A1 − A1† A2† A1 A2 ) = v (I + Z) ⊗ (I + Z) . (20.91) 2 The propagator corresponding to this Hamiltonian is given by vt

U12 = e−i 2 (I +Z)⊗(I +Z) .

(20.92) The unitary gate that can carry out operations of the form exp −i θ2 Z ⊗ Z is shown in Fig. 20.19. The CNOT gate first entangles the two qubits. Then the θ rotation Rz (θ ) = e−i 2 Z is applied, followed by a second CNOT gate. Fig. 20.20 is a highly schematic diagram of a circuit which might be applicable to a simplified two-electron system, such as a helium atom or hydrogen molecule, in which there are two single-particle terms and one interaction term in the Hamiltonian. A minimum basis set of qubits is designated by |φ1 , |φ2 . The input parameters are h11 =h22 = h and v1221 = v. The gate T (θ) is rep1 0 . PEA represents the gates for the resented by the unitary matrix 0 e−iθ phase-estimation algorithm, which are shown in full in Fig. 20.17. 3 If this were applied to the harmonic oscillator, with H = a † a + 1 , the result would give E = 1 . 0 2 2

380 Introduction to Quantum Mechanics

FIGURE 20.20 Simplified circuit for two-particle Hamiltonian.

In summary, we outline the general procedure for a simulation of an atomic or molecular system by quantum computer circuit: (1) The Hamiltonian is expressed as a sum of products of Pauli operators, using the Jordan-Wigner transformation, in conjunction with a set of molecular integrals calculated by conventional computation. (2) Each of these operators is converted into unitary gates such that their sequential execution on a quantum computer can approximate the action of the propagator e−iH t , with application of the Trotter decomposition. (3) The phase estimation algorithm is applied to approximate the eigenvalue of an eigenstate produced by the quantum Fourier transform of the time domain propagation. The output results can serve as an input for an improved approximation, and the procedure can be further iterated, as desired. The rather intricate and extensive details of the circuits which carry out these operations are described more fully in some current references.4

4 A. Aspuru-Guzik, A. Dutoi, P. Love and M. Head-Gordon, Simulated Quantum Computation of Molecular Energies, Science, 309, 1704 (2005); J. D. Whitfield, J. Biamonte and A. Aspuru-Guzik, Simulation of electronic structure Hamiltonians using quantum computers, Molecular Physics, 109, 735-750 (2011); I. Kassal, J. D. Whitfield, A. Perdomo-Ortiz, M.-H. Yung and A. Aspuru-Guzik, Simulating Chemistry Using Quantum Computers, Ann. Rev. Phys. Chem., 62, 185-207 (2011); P. J. J. O’Malley, et al., Scalable Quantum Simulation of Molecular Energies, Physical Review X, 6, 031007 (2016).

Answers to exercises

NOTE TO STUDENTS: Don’t even think of looking here before you attempt to solve the problems yourself! Chapter 2 2.1. The components of the momentum operator can be expressed pˆ k = −i

∂ , ∂xk

k = 1, 2, 3.

Now extend this relation for k = 4 using p4 = iE/c and x4 = ict: ∂ Hˆ = +i , ∂t where the energy operator is the Hamiltonian Hˆ . Applying the quantization prescription to the classical energy-momentum relation E=

p2 + V (x, y, z), 2m

p 2 = p12 + p22 + p32

then leads to the 3-dimensional time-dependent Schrödinger equation (2.30). 2.2. 100 watts = 100 J/sec. The energy of a 550 nm photon is given by E = hν =

hc (6.626 × 10−34 )(2.998 × 108 ) = 3.61 × 10−19 J. = λ 550 × 10−9

Thus 100/E = 2.77 × 1020 photons/sec. 2.3. Since 1 eV=1.602 × 10−19 J, each electron has a kinetic energy of (40 × 103 )(1.602 × 10−19 ) J. This is equal to E=

1 p2 mv 2 = . 2 2m 381

382 Answers to exercises

The de Broglie relation λ = h/p, therefore gives h 6.626 × 10−34 λ= √ = 2mE 2(9.109 × 10−31 )(40 × 103 )(1.602 × 10−19 ) = 6.13 × 10−12 m. This gives sufficient resolution to study the geometric structure of molecules. (Since 40 keV electrons travel at a significant fraction of the speed of light, the relativistic energy-momentum relation must be used. The corrected de Broglie wavelength is actually 6.016 × 10−12 m.) 2.4. Evaluate the partial derivatives ∂2 p2 (x, t) = − 2 ei(px−Et)/ 2 ∂x

∂ ip (x, t) = ei(px−Et)/ , ∂x and

∂ iE (x, t) = − ei(px−Et)/ . ∂t Eq. (2.27) then follows from the relation E = p 2 /2m. 2.5. Note that p · r = px x + py y + pz z. Then ∂ ipx i(p·r−Et)/ , (r, t) = e ∂x

etc.

and Eq. (2.30), with V (r)=0, follows from E = (px2 + py2 + pz2 )/2m. 2.6. Evaluate the derivatives (suppressing A for now): ψ (x) = e−αx − αxe−αx

and

ψ (x) = −2αe−αx + α 2 xe−αx .

Then the Schrödinger equation Hˆ ψ(x) = Eψ(x) becomes −

2 q 2 −αx = Exe−αx . (−2αe−αx + α 2 xe−αx ) − xe 2m x

Now, cancel out the e−αx and find two independent relations for the terms independent of x and linear in x. The results give α = mq 2 /2 , which agrees with the definition and E=−

2 α 2 mq 4 =− 2 . 2m 2

To normalize the function ∞ |ψ(x)|2 dx = 1 = A2 0

giving A = 2α 3/2 .

0

∞

x 2 e−2αx dx = A2 × 2!/(2α)3 ,

Answers to exercises

383

Chapter 3 3.1. y = e−kx is a solution of the differential equation y (x) − k 2 y(x) = 0. Note the minus sign. 3.2. P (L/3 ≤ x ≤ 2L/3) = 2 = L

2L/3 L/3

2L/3

sin

2

nπx L

L/3

2 L dx = L nπ

|ψn (x)|2 dx

θ sin 2θ − 2 4

2nπ/3 . nπ/3

Note sin(4nπ/3) = sin(4nπ/3 − 2nπ) = sin(−2nπ/3) = − sin(2nπ/3). Thus P=

1 2nπ 1 + sin . 3 nπ 3

As n → ∞, this approaches 1/3. 3.3. Polymethine ion: N+ =C–C=C–C=C–N, 8 electrons (1 from each C, 1 from N+ , 2 from N), L ≈ 7 × 1.40 Å. hc h2 (52 − 42 ), = λ 8mL2 giving λ = 352 nm. 3.4. For particle of mass M = 1.67 × 10−27 kg in cubic box with a = 10−14 m, ground-state energy is E111 =

h2 2 1 + 12 + 12 ≈ 6.15 MeV. 2 8Ma

3.5. Energy of 2 electrons in molybox minus that of 2 electrons in cube-atoms: 2 12 12 1 3 h2 h2 2 h2 2 2 + + + 1 + 1 . 1 = − − 2 ×

E = 2 × 8m (2a)2 a 2 a 2 16 ma 2 8ma 2 Note that the molybox is more stable (has lower energy). One of the factors promoting formation of molecules from atoms is the increased volume available to valence electrons. 3.6. By analogy with 3-dimensional particle-in-a-box n πx n πy 2 1 2 ψn1 n2 = sin sin , a a a

384 Answers to exercises

En1 n2 =

h2 2 2 + n n 2 , 8ma 2 1

n1 , n2 = 1, 2 . . .

Ground state E11 = h2 /4ma 2 is nondegenerate. First excited level, with E21 = E21 = 5h2 /8ma 2 , is 2-fold degenerate. 3.7. Six π-electrons occupy E11 , E12 and E21 . Lowest-energy transition is from E12 or E21 to E22 : hc h2 (8 − 5), = E22 − E21 = λ 8ma 2 λ = 268 nm when a = 4.94 Å.

Chapter 4 4.1.

A =

ψ Aˆ ψ dτ = a ∗

ψ ∗ ψ dτ = a.

4.2. Compare the magnitude of /2 with reasonable values for x and p for a baseball. 4.5. Yes, since [y, px ] = 0. 4.6. Yes, since [pz , Lz ] = [pz , xpy − ypx ] = 0.

Chapter 6 6.1. The turning points for quantum number occur where the kinetic energy equals 0, so that the potential energy equals the total energy. For quantum number n, this is determined by 1 1 2 = n+ kx ω. 2 max 2 Recalling that ω =

√

k/m and α =

√

mk/, we find

(2n + 1) 2 xmax = (2n + 1) √ . = α km Therefore P (xmax ≤ x ≤ ∞) = P (−∞ ≤ x ≤ −xmax ) =

∞

xmax

|ψn (x)|2 dx.

Answers to exercises

385

[Optional: For n = 0, Poutside = 2

∞ √

1/ α

α 1/2 π

e

−αx 2

2 dx = √ π

∞

e−ξ dξ = erfc(1) ≈ 0.158, 2

1

where erfc is the complementary error function. This result means that in the ground state, there is a 16% chance that the oscillator will “tunnel” outside its classical allowed region.] 6.2. The ground state wavefunction is ψ0 (x) = (α/π)1/4 e−αx

2 /2

,

α = (mk/2 )1/2 .

Using integrals in Supplement 5A, ∞ 1 2 k 1 1 ψ0 (x) kx ψ0 (x) dx = = ω = E0 , V = 2 4α 4 2 −∞ ∞ 2 1 T = ψ0 (x) − ψ0 (x) dx = E0 . 2m 2 −∞ Thus the average values of potential and kinetic energies for the harmonic oscillator are equal. This is an instance of the virial theorem, which states that for a potential energy of the form V (x) = const x N , the average kinetic and potential energies are related by T =

N V . 2

6.3. Taking the square of Eq. (5.46) x2 =

(aa + aa † + a † a + a † a † ). 2mω

Using (5.42) and (5.43) we find n|aa|n = 0,

n|a † a † |n = 0,

n|a † a|n = n,

n|aa † |n = n + 1.

Thus mω2 k (2n + 1) = 12 (n + 12 )ω. V = x 2 = 2 2 2mω The kinetic energy T works out to the same value. 6.4. The expectation values x and p are both equal to zero since they are integrals of odd functions, such that f (−x) = −f (x), over a symmetric range of integration. You have already calculated the expectation values x 2 and p 2

386 Answers to exercises

in Exercise 5.2, namely 1 2α

x 2 =

p2 =

and

2 α . 2

Therefore x p = /2, which is its minimum possible value. 6.5. Since n|x|n = 0 and n|p|n = 0,

x =

n|x 2 |n =

and

p =

n|p 2 |n =

1/2

(2n + 1) 2mω

1/2

mω (2n + 1) 2

,

we find x p = (n + 12 ). 6.6. Using the integrals in Supplement 5A, it is found that x02 = 0 while x01 and x12 are nonzero. The general result n = ±1 follows from (5.48).

Chapter 8 8.1. De Broglie wavelength λ = h/p with L = r p. Circumference of orbit 2πr = nλ, an integer number of wavelengths. This implies L = nh/2π = n. 8.2. You can count n − − 1 radial nodes, angular nodes, n − 1 total nodes. 8.3. The best formula to use is 1 1 1 2 − =Z R , λ n21 n22 where R is the Rydberg constant, 109,678 cm−1 . For hydrogen, 1/λ = R(1/12 − 1/22 ) = 82258.5 cm−1 , λ = 121.6 nm. For helium, 1/λ = 4 R(1/12 1/22 ) = 329, 034 cm−1 , λ = 30.39 nm.

2 8.4. Find the maximum of D1s (r) = 4πr 2 ψ1s (r) = const r 2 e−2Zr . Set dD/dr = 0, giving rmax = 1/Z (= a0 /Z), same as Bohr radius for 1s orbit. 8.5. Define the functions f1 (r) = e−Zr, f2 (r) = (1 + αr)eZr/2 , f3 (r) = (1 + ∞ βr + γ r 2 )eZr/3 . Evaluate the integral 0 f1 (r)f2 (r)4πr 2 , dr. The result contains a factor (Z + 2α). Set α = −Z/2 so that this vanishes. To find the 2s ∞ normalization constant, evaluate 0 f2 (r)2 4πr 2 , dr. This equals 8π/Z 3 . Thus the normalized 2s eigenfunction is given by ψ2s (r) =

Z√3/2 (1 − Zr/2)e−Zr/2 . 2 2π

Answers to exercises

387

Proceed analogously for the 3s eigenfunction, ∞ ∞determining β and γ from the conditions 0 f1 (r)f3 (r)4πr 2 , dr = 0 and 0 f2 (r)f3 (r)4πr 2 , dr = 0. 8.6. He++ and H+ are bare nuclei so their electronic energies equal zero. He+ and H are hydrogenlike so their 1s energies equal −Z 2 /2. Thus E = −4/2 + 1/2 = −3/2 hartrees = −40.8 eV. 8.7. (i); the other four operators are equal. 8.8.

3 = a0 , r = 2 0 ∞ r 2 = ψ1s (r) r 2 ψ1s (r) 4πr 2 dr = 3 = 3a02 , 0 ∞ 1 −1 −1 2 . r = ψ1s (r) r ψ1s (r) 4πr dr = 1 = a0 0

∞

3 ψ1s (r) r ψ1s (r) 4πr dr = 2

2

8.9. Average potential energy: ∞ Z < V >= ψ1s (r) 4πr 2 dr = −Z 2 . ψ1s (r) − r 0 Average kinetic energy: ∞ 1 2 < T >= ψ1s (r) − ∇ ψ1s (r) 4πr 2 dr = Z 2 /2. 2 0 More simply, since total energy E1s = −Z 2 /2, T = E1s − V . Note that V = −2T , consistent with the virial theorem. 8.10. For an easier exercise, do the 2pz orbital instead. 8.11. You should find that this function solves the Schrödinger equation with E = −Z 2 /8, i.e., n = 2. For normalization Z 3/2 const = √ . 4 π Noting that sin2 (θ/2) = (1 − cos θ )/2, the function is found to be an s-p hybrid orbital: 1 ψ = √ ψ2s + ψ2pz . 2 8.12. Solve for R: R |ψ1s (r)|2 4πr 2 dr = 0.9, 0

388 Answers to exercises

or easier

∞

|ψ1s (r)|2 4πr 2 dr = 0.1.

R

We find, using integral table, ∞ r 2 e−2r dr = e2R (1 + 2R + 2R 2 ) = 0.1. 4 R

Solving numerically, R = 2.6612a0 = 1.41 Å. 8.13. Let ψ(r) = e−αr . Then ∞ −αr 1 2 e ∇ − Z/r e−αr 4πr 2 dr − 0 2 1 ∞ = α 2 − Zα. E(α) = −2αr 2 2 4πr dr 0 e E (α) = 0 for minimum, giving α = Z. Thus ψ(r) = e−Zr and E = −Z 2 /2, which, in this exceptional case, gives the exact eigenfunction and eigenvalue. √ 8.14. In atomic units, ψ1s = π −1/2 e−r , ψ2p0 = (4 2π)−1 r cos θ e−r/2 . Therefore |ψ1s (0)|2 = π −1 , Integrating over θ π

|ψ2p0 (0)|2 = 0.

(3 cos2 θ − 1) sin θ dθ =

−1

0

and

π

1

(3 cos2 θ − 1) cos2 θ sin θ dθ =

0

so that 3 cos2 θ − 1 = 0, r3 1s

3 cos2 θ − 1 r3

(3u2 − 1)du = 0

1 −1

(3u2 − 1)u2 du =

= 2π × 2p0

8 , 15

8 1 × r −3 2p0 = . 15 30

Finally

ν1s = 1420.4 MHz,

ν2p0 = 17.8 MHz.

Chapter 9 9.1. ∞∞ T =

0

0

ψ(r1 , r2 ) − 12 ∇12 − 12 ∇22 ψ(r1 , r2 ) 4πr12 dr1 4πr22 dr2 ∞∞ = α2 2 4πr 2 dr 4πr 2 dr |ψ(r , r )| 1 2 1 2 1 2 0 0

Answers to exercises

and

389

5 Z Z 1 = −2Zα + α. V = − − + r1 r2 r12 8

For the optimized variational function, α = Z − 5/16, so 5 2 T = Z − 16

and

5 2 V = −2 Z − . 16

Thus V = −2T , in agreement with the virial theorem. 9.2. Li+ is He-like with Z = 3. Just as for He, 5 E(α) = α 2 − 2Zα + α, 8 with optimal α = Z −

= 2.6875 and

5 16

5 2 = −7.223 hartrees. E=− Z− 16 A more accurate value is −7.280 hartrees. 9.3. For the Li atom with 3 electrons, Hˆ =

3 Z 1 1 1 1 + + . − ∇i2 − + 2 ri r12 r23 r31 i=1

Assuming ψ(1, 2, 3) = e−α(r1 +r2 +r3 ) , we find in analogy with helium results,

1 1 − ∇i2 = α 2 , 2 2

−

Z = −Zα, ri

1 5 = α. rij 8

The total energy is given by 3 15 E(α) = α 2 − 3Zα + α, 2 8 with Z = 3. To optimize, E (α) = 3α − 9 +

15 = 0, 8

α = 2.375,

E = −8.4609 hartrees.

This is less than the exact ground state energy −7.478, in apparent violation of the variational principle. But ψ is an “illegal” wavefunction.

390 Answers to exercises

Chapter 10 10.1. Spherically symmetrical (S) state whenever valence shell contains only (i) all s-electrons, (ii) half filled shells, (iii) filled shells. Group IA, configuration ns: H, Li, Na, K, Rb. Group IIA, ns 2 : Be, Mg, Ca, Sr. Group VB, ns 2 np 3 : N, P, As, Sb. Group 0: He, Ne, Ar, Kr, Xe. Transition elements: Cr 4s3d 5 , Mn 4s 2 3d 5 , Mo 5s4d 5 , Tc 5s 2 4d 5 . Also Cu, Zn, Pd, Ag, Cd, all with d 10 . 10.2. First excited states: H 2s 2 S, He 1s2s 3 S, Li 1s 2 2p 2 P, Be 1s 2 2s2p 3 P, B 1s 2 2s2p 2 2 S. For C, N and O, the electron configuration is the same as for the ground state but the occupation of degenerate p-orbitals is different: C 2s 2 2p 2 1 D, N 2s 2 2p 3 2 D, O 2s 2 2p 4 1 D. Finally, F 2s2p 6 2 S, Ar 2s 2 2p 5 3s 3 P. 10.3. Promote one of the 2s electrons to the empty 2p orbital. If the four valence electrons have parallel spins, this is a 5 S state, which can form four bonds. 10.4. According to n + rule, the ordering of atomic orbitals should be: 1s < 2s < 2p < 3s < 3p < 4s < 3d < 4p < 5s < 4d < 5p < 6s < 4f < 5d < 6p < 7s < 5f < 6d < 7p. The ordering is sometimes reversed for the entries shown in red (gray in print version). Both rules give the same ground state configuration for Rn (Z = 86): 1s 2 2s 2 2p 6 3s 2 3p 6 4s 2 3d 10 4p 6 5s 2 4d 10 5p 6 6s 2 4f 14 5d 10 6p 6 . 10.5. The angular momentum in the first Bohr orbit is given by L = mvr = . For atomic number Z, the radius of the lowest orbit is r = a0 /Z, where a0 = 2 /me2 . Therefore v=

Z e2 = Z ma0

and v/c = Zα. The nonrelativistic energy of this orbit is E = −Z 2 me4 /22 = −13.6Z 2 ≈ −87, 000 eV. With the relativistic mass increased by a factor (1 − v 2 /c2 )−1/2 , the energy decreases to approximately −107, 000 eV.

Chapter 11 11.1. (i) Minimum value of E(R) can be found by setting E(R) = 0. It is easy to see from the formula itself that E(R) will have a minimum value of 0 when R = Re . As R → ∞, E(R) approaches D. Thus De = D, the dissociation energy.

Answers to exercises

391

(ii)

(iii) Remember the expansion for the exponential (In fact, don’t ever forget this!) ex =

∞ x2 x3 xn =1+x + + + ... n! 2 6 n=0

Expanding the Morse function up to terms quadratic in R − Re gives E(R) = 0 + Dβ 2 (R − Re )2 + . . . This has the form of a harmonic oscillator potential V (x) = 12 kx 2 with x = R − Re

and k = 2Dβ 2 .

11.2. The central carbon forms two sp-hybrids and two unhybridized p-orbitals, just like acetylene. The sp-hybrids bond to the terminal carbons in a linear arrangement of σ -bonds. Each p orbitals then bonds to a terminal carbon to form a π-bond, as shown below:

Note that the two CH2 groups are in perpendicular planes. 11.3. For atomic p-orbitals, the density of a px py configuration has the form ρ = |ψpx |2 + |ψpy |2 = (x 2 + y 2 )f (r), which is cylindrically-symmetrical about the z-axis. This remains true for the two π-orbitals formed by linear combination of AO’s on the two carbon atoms. 11.4. H2 S: S has 6 valence electrons, 2 form bonds to H leaving 4 electrons or 2 unshared pairs. SH2 E2 approximately tetrahedral configuration giving two S–H bonds for bent H–S–H molecule. Just like H2 O. SF6 : 6 S–F bonds, octahedral molecule.

392 Answers to exercises

XeF4 : Xe has 8 valence electrons, 4 bonds to F, leaving 2 pairs. XeF4 E2 octahedral with the two E’s on opposite sides to minimize repulsion, so XeF4 molecule is square planar. SF4 : 4 S–F bonds, leaving 2 electrons or 1 lone pair. SF4 E trigonal bipyramid with E in one equatorial position. The 4 S–F bonds bend away from the E giving a see-saw shaped molecule. BrF5 : Br 5 Br–F bonds plus 1 lone pair. BrF5 E octahedral configuration gives geometry of square pyramid. IF7 : I has 7 valence electrons, 7 IF bonds give geometry of a pentagonal bipyramid. 11.5. CH2 and SO2 are bent triatomics. SO3 and NO− 3 are equilateral triangles. XeO3 is a triangular pyramid like NH3 . 11.6. Unhybridized oxygen 2p-orbitals would form two OH bonds 90◦ apart. According to the valence-shell model, the two bonds would belong to distorted tetrahedra around each oxygen atom. 3 e2 (high spin). [Cr(CN) ]4− d 4 → t 4 (low-spin). 11.8. [Fe(H2 O)6 ]3+ d 5 → t2g 6 g 2g 6 6 e3 . 3+ 6 2+ 9 [Co(NH3 )6 ] d → t2g (low-spin). [Cu(H2 O)6 ] d → t2g g

Chapter 12 12.1. As ξ → ∞, the equation reduces approximately to ξ2

d 2 R 2 |E|ξ 2 − ≈ 0. 2 4 dξ

Canceling the ξ 2 and noting that E is negative for bound states, R |E| ξ . (ξ ) ≈ exp − 2 12.2.

ψ dτ = N 2

(ψA2 + ψB2 ± 2ψA ψB )dτ = N 2 (1 + 1 ± 2S),

2

thus ψ is normalized when N = (2 ± 2S)−1/2 . 12.3. ψ1σ g = 1sA + 1sB , thus

ψ(1, 2) = 1sA (1) + 1sB (1) 1sA (2) + 1sB (2) =

Answers to exercises

393

1sA (1)1sB (2) + 1sB (1)1sA (2) + 1sA (1)1sA (2) + 1sB (1)1sB (2). Term in brackets is the valence bond function for the bond. The remaining terms represent ionic structures H+ H− and H− H+ with both electrons on the same hydrogen atom. 12.4. N2 N+ 2 N− 2

1+ . . . 1πu4 3σg2 BO=3 g 4 + 2 . . . 1πu 3σg g BO=2.5 2 . . . 1πu4 3σg2 1πg BO= 2.5 g

12.5. O2 O− 2 O2− 2

. . . 3σg2 1πu4 1πg2 . . . 3σg2 1πu4 1πg3 . . . 3σg2 1πu4 1πg4

3− g 2 g 1+ g

12.6. Both excited states have same configuration as ground state, . . . 3σg2 1πu4 1πg2 , but with the following occupancy of 1πg orbitals: ↑↓

1

g+

and

↑↓

1

g .

The plus superscript in the first term symbol is rather tricky. Don’t worry about it. But if you insist . . . two-electron singlet spin state has antisymmetric spin function, thus must have symmetric orbital function like πx (1)πy (2) + πy (1)πx (2) which doesn’t change sign upon transformation φ → −φ. Singlet oxygen and other active oxygen species are involved in lipid metabolism. 12.7. Be2 has configuration . . . 2σg2 2σu2 1 g+ . The 2σ orbitals are LCAO’s made from hybrids of 2s and 2pσ . The 2σg MO probably becomes more strongly bonding while the 2σu becomes more weakly antibonding, with the net effect being weak bonding. 12.8. Setting dS/dR = 0 find maximum at R = 2.1038 bohrs or 2.1038 × .593 = 1.115 Å.

Chapter 13 13.1. Secular determinant: x 1 0

1 x 1

0 1 = x 3 − 2x = 0, x

√ √ √ where x = (α − E)/β. Roots x = 0, ± 2, thus E = α − 2β, α, α + 2β. Remember both α and β are negative. Ground state energy (3 electrons) = 2(α +

394 Answers to exercises

√ √ 2β) + α = 3α + 2 2β. One localized π-orbital plus one unpaired electron would have √ energy = 2(α + β) + α = 3α + 2β. Resonance stabilization energy = (2 − 2 2)β = −.828 β = .828|β|. Lowest energy electronic transition given by hc √ = 2 |β|. λ 13.2. For linear H3 , the secular equation is x 1 0

0 1 = x 3 − 2x = 0, x

1 x 1

√ √ √ with roots x = 0, ± 2. Thus the three MO energies are √α − 2β, α, α + 2β. The energy of the three-electron ground state is 3α + 2 2β ≈ 3α + 2.828β. For triangular H3 , x 1 1 1 x 1 = x 3 − 3x + 2 = 0. 1 1 x One obvious root is x = 1. Division of x 3 − 3x + 2 by x − 1 gives x 2 + x − 2, with roots x = 1 and −2. The three MO’s are α +2β, α −β, α −β. The energy of the ground state is 3α + 3β. Apparently the triangular form of H3 has a slightly lower energy. 13.3. The secular equation is x 1 0 1

1 x 1 0

0 1 x 1

1 0 = x 4 − 4x 2 = 0, 1 x

with roots x = 0, 0, ±2. The ground state π-electron energy equals 2 × (α + 2β) + 2α = 4α + 4β. This the same as that for two ethylene molecules, so there is no delocalization energy. This system is in fact antiaromatic, with 4N π-electrons. 13.4. The five occupied π-MO’s are α + 2.303β, α + 1.618β, α + 1.303β, α + 1.000β, α + 0.618β. The total energy is Eπ = 10α + 13.684β. Since five localized double bonds would have energy 10α + 10β, the resonance energy is approximately 3.684β ≈ 10 eV.

Answers to exercises

395

13.5. For azulene, the HOMO is α + 0.477β, the LUMO is α − 0.400β. Thus

E = 0.877|β|. Using |β| ≈ 2.72 eV, |β|/ hc ≈ 21, 900 cm−1 , we obtain 1 |β| ≈ 0.877 ≈ 19, 200 cm−1 . λ hc This corresponds to an absorption of green light at about 520 nm. If the Hückel computation were quantitatively accurate, the compound should appear red, the complementary color. Actually, azulene is blue, hence its name. 13.6. For hexatriene the HOMO is the 3π, which has its terminal p-orbitals in phase. The thermal reaction should be disrotatory and the photochemical reaction conrotatory.

Chapter 14 14.1. The Hartree-Fock-Slater equation for a 1s orbital φ(r) is given by 1 Z ρ(r ) 3 3α 3 1/3 2 1/3 r − ρ(r) d φ(r) = φ(r). − ∇ − + 2 r |r − r | 2 π The normalized orbital is taken as ζ 3/2 φ(r) = √ e−ζ r . π The total electron density is then given by ρ(r) = 2φ(r)2 = 2

ζ 3 −2 ζ r . e π

Multiplying the HFS equation by φ(r) and integrating, we obtain T + V + J + X = , with 1 1 ζ2 φ(r) − φ (r) − φ (r) 4πr 2 dr = , 2 r 2 0 ∞ Z φ(r) − φ(r) 4πr 2 dr = −Zζ, V= r 0 ∞ ∞ ρ(r )ψ(r)2 5 4πr 2 dr 4πr 2 dr = ζ J= | |r − r 4 0 0

T=

∞

and 3α X=− 2

1/3 ∞ 81 × 31/3 3 ρ(r)1/3 ψ(r)2 4πr 2 dr = − α ζ. π 64(2π)2/3 0

396 Answers to exercises

Given the values Z = 2, ζ = 1.6875, = −0.9037, the HFS equation is satisfied when α = 1.17386.

Chapter 15 15.1. From the C2v character table we find the following direct products: A1 ⊗ X = X,

X ⊗ X = A1 ,

A2 ⊗ B2 = B1 ,

A2 ⊗ B1 = B2 ,

B2 ⊗ B2 = A2 ,

where X is any of the four representations. The cartesian coordinates x, y, z transform as B1 , B2 and A1 , respectively. For a transition to be allowed, the direct product in the dipole integral must contain the totally symmetric representation A1 . Thus x-polarized A2 ↔ B2 transitions and y-polarized A2 ↔ B1 transitions are allowed. z-polarized transitions are allowed between states belonging to the same representation. All B1 ↔ B2 transitions are dipole-forbidden. 15.2. Orbitals of a1 symmetry are linear combinations of O2s, O2pz and H1s(1)+H1s(2). Orbitals of b2 symmetry are linear combinations of O2py and H1s(1)-H1s(2). The O2px orbital already has b1 symmetry. 15.3.

2− + 15.4. NH+ 4 Td . H3 O C3v . SO4 Td . PCl5 D3h . POCl3 C3v . XeO3 F2 D3h . PF3 Cl2 , SF4 C2v . XeF4 D4h . SOF4 , ClF3 C2v . IOF5 C4v .

15.5. (a) C2v (b) D2h (c) C2h (d) C2v (f) D3d (g) C2v (h) Cs (i) D2h (j) D3h (k) C2v 15.6. Chromium oxylate C3 . Adamantane Td . Bicyclooctane D3h . Ferrocene D5d . Diborane D2h . Sulfur D4d .

Chapter 16 16.1. The minimum of V (R) occurs where V (R) = 4 (−12σ 12 R −13 + 6σ 6 R −7 ) = 0.

Answers to exercises

397

This gives Re = 21/6 σ . At R = Re , V (Re ) = −. Since V (∞) = 0, De = . When R = σ , V (R) = 0. This is the point where the potential-energy curve crosses the axis. 16.3. (i) hcEJ = BJ (J + 1), (ii)

hc(E1 − E0 ) = 2B = 3.410 cm−1 .

R = 410.6

14 × 16 B = 115 pm. 14 + 16

(iii) k = 58.9 × 10−6 ν˜ 2

14 × 16 = 1590 N/m. 14 + 16

(iv) hcEv = (v + 12 )˜ν − (v + 12 )2 xe ν˜ . 2 2 3 3 1 1 xe ν˜ = 1876 cm−1 . ν˜ − − hc(E1 − E0 ) = − 2 2 2 2 16.4. The number of bound vibrational levels is equal to ωe 1 , + nbound = vmax + 1 ≈ 2 ω e xe 2 where the square bracket means the closest integer. This gives for n ≈ 19, 21, 24 for H2 , HD and D2 , respectively. 16.5. For 1 H35 Cl, ωe = 2989.7 cm−1 and Be = 10.59 cm−1 . The dependence of these parameters on reduced mass can be expressed ωe (1) Be (1) μ(2) μ(2) = , = . ωe (2) μ(1) Be (2) μ(1) √ √ 35 Cl) ≈ In amu, μ(1 H√ 1 × 35/(1 + 35), μ(1√ H37 Cl) ≈ 1 × 37/(1 + 37), μ(2 H35 Cl) ≈ 2 × 35/(2 + 35), μ(2 H37 Cl) ≈ 2 × 37/(1 + 37). The results follow: Molecule

ωe /cm−1

Be /cm−1

1 H35 Cl

2989.7

10.59

1 H37 Cl

2987.5

10.57

2 H35 Cl

2143.2

5.442

2 H37 Cl

2140.1

5.426

398 Answers to exercises

16.6. For N2 , the maximum population occurs for the rotational level 1/2 kT 1 Jmax = − =7 2hcBe 2 with Be = 2.010 cm−1 , T = 300 K, hc/k = 1.4388 cm K. 16.7. For a spherical rotor, the degeneracy increases to gJ = (2J + 1)2 , the Boltzmann distribution for rotational levels is then NJ = const (2J + 1)2 e−BJ (J +1)hc/kT , with a maximum at the level Jmax =

kT hcB

1/2

1 − . 2

For CH4 at 300 K, Jmax = 6. 16.8. The rotational energy of the centrifugally-distorted molecule is given by EJ =

2 2 2 R J (J + 1) ≈ J (J + 1) − J (J + 1). 2μ(R + R)2 2μR 2 μR 3

Equating the Hooke’s law force to the centrifugal force k R = m1 ω2 r1 + m2 ω2 r2 = 2μ ω2 R. Use the relations

J (J + 1) = μR 2 ω, k 1 ωe = 2πc μ

and hcBe =

2 2μR 2

to obtain EJ 4B 3 = Be J (J + 1) − 2e J 2 (J + 1)2 , hc ωe thus identifying DJ = 4Be3 /ωe2 .

Answers to exercises

399

16.9, 16.10. Symmetry species which transform like x, y or z are IR-active. Those which transform like x 2 , xy, etc. are Raman-active. These cartesian combinations can be found in the right-hand columns of character tables.

Chapter 17 17.1. Assuming the molar partition function can be approximated by Q = e−U/kT , we have ln Q = ln −

U kT

. Therefore S=

U − k ln Q = k ln . T

17.2. Approximating p(0) = 760 mmHg, and T = 293 K, we find p(1600) = 760 exp(−.029 × 9.8 × 1600/8.314 × 293) ≈ 630 mmHg. 17.3. Using the classical Boltzmann distribution, with (vx ) = mvx2 /2, ∞ 2 −mvx2 /2kT e−mvx /2kT dvx . ρ(vx ) = e −∞

The integral is of the form

∞

−∞

e

−αx 2

dx =

π , α

which leads to

m 1/2 2 e−mvx /2kT . 2πkT For the composite 3-dimensional velocity distribution, we can write ρ(vx ) =

ρ(v) = ρ(vx , vy , vz ) = ρ(vx )ρ(vy )ρ(vz ) =

m 3/2 2 e−mv /2kT . 2πkT

This is normalized in 3-dimensional velocity space according to ∞ ∞ ∞ ρ(v) dvx dvy dvz = 1. −∞ −∞ −∞

Transforming to spherical polar coordinates in velocity space, noting that ρ(v) is spherically symmetrical, we can write ∞ ρ(v)4πv 2 dv = 1. 0

400 Answers to exercises

Seeking a normalized function of speed v, independent of direction of velocity, such that ∞ ρ(v) dv = 1. 0

We thus arrive at the Maxwell-Boltzmann-distribution function m 3/2 2 ρ(v) = 4πv 2 e−mv /2kT . 2πkT 17.4. The probability for a molecule to be in the vibrational state v is given by pv =

e−v /kT ˜ ˜ = e−hcνv/kT (1 − e−hcν/kT ). qvib

Using hc/k = 1.438775, ν˜ = 214.4, T = 298, we find p0 = 0.645,

p1 = 0.229,

p2 = 0.0813.

17.5. The probability for a molecule to be in the rotational level J is proportional to the Boltzmann factor (2J + 1)e−J (J +1)hcB/kT . As shown in Eq. (16.19), the maximum of this quantity occurs at 1/2 kT 1 Jmax = − , 2hcB 2 where the square bracket means the nearest integer. 17.6.

Jmax =

298 2 × 1.438775 × 1.9987

1/2

1 − = [6.698] = 7. 2

17.7. Taking account of translational, rotational and vibrational contributions, 3 u2 eu CV,m = R + R + R u , 2 (e − 1)2 where u = hcν˜ /kT = 1.43877 × 214.5/298 = 1.03562. We find that =C −1 mol−1 . Cp,m V,m + R = 4.4152R = 36.71 JK 17.8. Given the parameters M = 253.8, ν˜ = 214.5, u = hcν˜ /kT = 1.03562, B = 0.03737, σ = 2, we have trans Sm = (1.5 ln M + 2.5 ln T − 1.1517)R = 21.3959R,

Answers to exercises

401

ekT = R ln = 8.92704R, σ hcB u vib −u − ln(1 − e ) = 1.00852R. Sm = R u e −1 rot Sm

These add up to = 31.3314R = 260.51 JK−1 mol−1 . Sm 17.9. Calculate the ui = hcν˜i /kT : u1 = 16.1113, u2 = 4.5867, u3 = 16.4831(two-fold), u4 = 7.8577(two-fold). The vibrational contribution to heat capacity is given by R

i

u2i eui = (0 + 0.2187 + 2 × 0 + 2 × 0.0238)R = 0.2666R. (eui − 1)2

The sum of translational, rotational and vibrational contributions gives 3 3 CV,m = R + R + 0.2666R. 2 2 The result is =C −1 mol−1 . Cp,m V,m + R = 4.2666R = 35.47 JK

Chapter 18 18.1. Each CH2 group is split by the neighboring CH3 group into a 1:3:3:1 quartet. Correspondingly, each CH3 group is split by the neighboring CH2 into a 1:2:1 triplet. Protons in different ethyl groups are too far apart to interact. 18.2. The protons in methane are equivalent and do not exhibit spin-spin splittings. In CD2 H2 each deuteron has a spin of 1, which by itself would cause splitting into a 1:1:1 triplet. Two deuterons will give a splitting pattern of 1:2:3:2:1, which is what we see for the proton resonances. 18.3. The 3 protons in the methyl group are equivalent with a chemical shift δ ≈ 2. The 5 protons on the phenyl group are not strictly equivalent but, evidently, their chemical shifts are nearly equal. Note that the ring protons are significantly deshielded, as shown in Fig. 18.5. 18.4. The methyl protons are split into a doublet by the lone proton on the other carbon atom. The latter proton is itself split into a 1:3:3:1 quartet.

402 Answers to exercises

18.5. The integral over solid angle is given by 0

2π

π

(3 cos2 θ − 1) sin θ dθ dφ = 2π ×

0

1 −1

(3u2 − 1) du = 0.

18.6. A 180◦ pulse reverses the directions of the precessing nuclei, thus undoing the dephasing after the 90◦ pulse.

Chapter 19 19.1. The singlet state has the same form 1 | = √ |↑ 1 |↓ 2 − |↓ 1 |↑ 2 , 2 with respect to any axis. 19.2. The H-polarized beam is equivalent to an equal mixture of | + 45◦ and | − 45◦ . Thus half the photons pass through the +45◦ polarizer. Analogously, the +45◦ -polarized beam is equivalent to an equal mixture of H and V. Thus half of the remaining photons pass through the V polarizer.

Suggested references

Atkins, P.W., de Paula, J., 2014. Physical Chemistry, 12th edition. Oxford University Press. The best-selling physical chemistry text worldwide. A solid alternative reference for all topics covered in this book. House, J.E., 2003. Fundamentals of Quantum Chemistry. Elsevier Science & Technology, Amsterdam/New York. A companion volume in the Complementary Science Series. Greater emphasis on numerical problems and worked-out examples than this book. Claf, A.A., 1957. Calculus Refresher. Dover, New York. A quick tune-up for students whose calculus has become rusty. Blinder, S.M., 2013. Guide to Essential Math, 2nd edition. Elsevier, Amsterdam. The author’s review of the more advanced mathematical requisites for students in chemistry, physics and engineering. Orear, J., 1979. Physics. Macmillan, New York. Contains all the background in general and modern physics needed for quantum mechanics on the level of this book. Feynman, R.P., Leighton, R.B., Sands, M.L., 1963. Feynman Lectures on Physics, vols. I, II, III. Addison-Wesley, Reading, MA. Especially vol. III on quantum mechanics. Feynman’s unique masterly presentation of the fundamentals of physics. Great reading whenever you are so inclined. Limited overlap with our subject matter but worth skimming for Feynman’s quotable pearls of wisdom on physics. Atkins, P.W., 1989. General Chemistry. Freeman (Scientific American Books), San Francisco. Chapters 7-9 deal with the structure of atoms and molecules. Porile, N.T., 1993. Modern University Chemistry, 2nd ed. McGraw-Hill, New York. A high-level general chemistry text. Chapter 5 contains a very accessible introduction to quantum theory. Herzberg, G., 1944. Atomic Spectra and Atomic Structure. Dover, New York. A concise introduction to quantum theory and atomic structure. Atkins, P.W., 1997. The Periodic Kingdom: A Journey Into the Land of the Chemical Elements. Basic Books, New York. A metaphorical description of the periodic table of chemical elements. Appropriate for any level of scientific background. Companion, A.L., 1979. Chemical Bonding. McGraw-Hill, New York. A charming little volume covering chemical bonds on an elementary level. Pauling, L., 1960. The Nature of the Chemical Bond. Cornell University Press, Ithaca, NY. Another science classic by one of the original heroes of quantum chemistry. Heavily slanted towards Pauling’s views on resonance and valence-bond theory. Coulson, C.A., 1961. Valence. Oxford University Press, London. Somewhat dated but a classic introduction to quantum theory of chemical bonding. McWeeny, R., 1979. Coulson’s Valence, 3rd ed. Oxford University Press, London. An update of Coulson’s classic, seeking to preserve the flavour of the original. Pauling, L., Wilson, E.B., 1935. Introduction to Quantum Mechanics. McGraw-Hill, New York. Has long been the definitive exposition of quantum mechanics for chemists. Solutions of Schoödinger equation for harmonic oscillator and hydrogen atom are worked out with stepby-step, giving all mathematical details. Now available as a Dover reprint.

403

404 Suggested references

Dirac, P.A.M., 1948. The Principles of Quantum Mechanics, 3rd ed. Clarendon Press, Oxford. A scientific classic by one of the creators of quantum mechanics. Advanced level, but even beginners can enjoy the elegance of Dirac’s presentation. Blinder, S.M., 1974. Foundations of Quantum Dynamics. Academic Press, London. The author’s earlier monograph on the fundamental principles of time-dependent quantum mechanics, including transitions and time-dependent perturbation theory. Levine, I.N., 2014. Quantum Chemistry, 7th edition. Pearson Education, London. A longtime favorite text for undergraduate and graduate courses in quantum chemistry. Levine, I.N., 1975. Molecular Spectroscopy. Wiley, New York. Originally a companion volume to the above. Detailed coverage of quantum-mechanical foundations of spectroscopy. Ballhausen, C.J., Gray, H.B., 1965. Molecular Orbital Theory. Benjamin, New York. Contains a detailed account of crystal and ligand field theories for transition metal ions. Koch, W., Holthausen, M.C., 2001. A Chemist’s Guide to Density-Functional Theory, 2nd ed. Wiley-VCH, Weinheim. An up-to-date account of density-functional theory and applications. Parr, R.G., Yang, W., 1989. Density-Functional Theory of Atoms and Molecules. Oxford University Press, New York and Oxford. Cotton, F.A., 1990. Chemical Applications of Group Theory. Wiley, New York. The standard reference on group theory for chemists. Derome, A.E., 1987. Modern NMR Techniques for Chemistry Research. Pergamon, New York. A practical introduction for chemists using NMR. Stryer, L., 1989. Molecular Design of Life. Freeman, New York. A lucid introduction to the chemical and physical basis of molecular biology. Herbert, N., 1985. Quantum Reality: Beyond the New Physics. Doubleday, Garden City, NY. A patient exposition dealing with the “weirdness” of quantum mechanics. Gribbin, J., 1996. Schrödinger’s Kittens and the Search for Reality. Little, Brown & Co., Boston, MA. A stimulating nontechnical account of the perplexing paradoxes of quantum theory. Aczel, A.D., 2001. Entanglement: The Greatest Mystery in Physics. Four Walls Eight Windows, New York. A popular account of recent developments on the wonders of the quantum world. Johnson, G., 2003. A Shortcut Through Time: The Path to the Quantum Computer. Knopf, New York. An engaging popular-level introduction to the possibilities of quantum computation. Karplus, M., Porter, R.N., 1970. Atoms and Molecules: An Introduction for Students of Physical Chemistry. Benjamin, New York. Valuable for computational details on atomic and molecular orbitals and several key topics in quantum chemistry. McQuarrie, D.A., 1983. Quantum Chemistry. University Science Books, Mill Valley, CA. A studentfriendly introduction to the subject. Ratner, M.A., Schatz, G.C., 2001. Introduction to Quantum Mechanics in Chemistry. Prentice-Hall, Upper Saddle River, NJ. A recent quantum chemistry text for both undergraduate and graduate students. Keyed to approximately the same level as this book. Fitts, D.D., 1999. Principles of Quantum Mechanics as Applied to Chemistry and Chemical Physics. Cambridge University Press, Cambridge, UK. Another alternative, as above. Nielsen, M.A., Chuang, I.L., 2010. Quantum Computation and Quantum Information, 10th anniversary edition. Cambridge University Press, Cambridge, UK. The definitive reference on this subject. Fano, G., Blinder, S.M., 2017. Twenty-First Century Quantum Mechanics: Hilbert Space to Quantum Computers. Springer International Publishing, New York. Mathematical methods and conceptual foundations of quantum mechanics from a modern point of view. Blinder, S.M., House, J.E. (Eds.), 2018. Mathematical Physics in Theoretical Chemistry. Elsevier Science Publishing, Amsterdam. A collection of fairly advanced articles on various mathematical methods applied to theoretical chemistry. McQuarrie, D.A., 1973. Statistical Thermodynamics. Harper and Row, New York. The standard reference on the subject.

Index

A Absorption, 9, 112, 267, 271, 318 light, 190, 222 lines, 275 net, 284, 318 photon, 12, 283, 288, 290 probability, 112 process, 12, 281 radiation, 10, 12, 282, 284 spectroscopy, 289 spectrum, 11, 12, 281 Achiral molecule, 262 Allene molecule, 196 Ammonia, 183, 187, 245, 249, 254 electronic ground state, 251 molecules, 245, 262 states, 253 symmetry groups, 245 Anharmonic oscillator, 113, 269 Antibonding, 184, 205, 207, 208 character, 202, 220 electrons, 205 MO, 202 orbital, 202, 207, 220 Antiparallel electron spin, 324 nuclear spin state, 325 spin, 345, 346 spin configurations, 324 Antisymmetric for nuclei, 305 function, 305 orbital, 180 wavefunctions, 47 Antisymmetry principle, 372

Aromatic molecules, 119 Asymmetric rotor, 279, 307 Atomic behavior, 159 Bohr model, 132 chromium, 191 composition, 12 configurations, 166 energy, 134, 238 energy levels, 133, 267 ground states, 174 hydrogen, 148, 149 hydrogen spectrum, 12, 129, 133 ionization energies, 231 ionization reactions, 209 level, 355 lines, 133 mass unit, 276 motions, 308 nucleus, 3 number, 3, 165–167, 169, 170, 173–175, 236 neutral atom, 238 nucleus, 3, 172 orbitals, 134, 140, 143, 164–166, 181, 182, 184, 199, 200, 202, 203, 213, 220, 221, 228, 241, 251 origin, 202 parameters, 113 particles, 1 phenomena, 130 principles, 293 properties, 170 radius, 170, 171 scale, 10, 21, 130, 177 species, 130 405

406 Index

spectra, 129, 267 spectral lines, 124, 130, 140 spectroscopy, 133 structure, 11, 16, 163, 199, 267, 293 system, 373, 380 transition energy, 130 units, 99, 130, 134–136, 148, 151, 177, 237 weight, 172, 275, 276 Atoms carbon, 41–43, 119, 167, 175, 181, 182, 196, 213, 262 electron, 153 electronic states, 125, 309 emission spectra, 12 energy levels, 12 in quantum state, 111 ligand, 187 monatomic, 308 neighboring, 181, 213 quantum theory, 9, 16 Aufbau principles, 165

Bonding, 180, 184, 185, 189, 190, 201, 218, 220, 221, 223, 255 carbon atoms, 181 character, 202 chemical, 177, 179 electron, 205 hydrogen, 193–195, 323 metallic, 220 MO, 201, 252 orbital, 184, 201, 205, 207, 220, 325 overlap, 218 pairs, 188 potential, 180 Bound electrons, 164 states, 48, 135, 210 vibrational states, 291 Butadiene, 41, 42, 53, 54, 215, 217–219 electronic spectrum, 42 energy state, 41 molecule, 42 reaction, 218

C B Balmer series, 129, 140 Bandgap energy, 224 Bell states, 362, 363 Bent molecule, 187, 189, 255 Benzene molecule, 119 Binding energy, 178 Biochemistry, 267, 281 Biomolecules, 281, 329 Blackbody radiation, 7–9, 11, 17, 24, 25, 113, 308 Bohr atom, 130 atomic model, 132 frequency, 68, 130 model, 130, 132–134 orbit, 138 radius, 132 theory, 119, 133, 135, 139, 148, 151 Boltzmann distribution, 17, 113, 284, 295–300, 312, 318 for rotational levels, 273 Bond order, 205, 210

Carbon atoms, 41–43, 119, 167, 175, 181, 182, 196, 213, 262 atoms bonding, 181 ring cyclobutadiene, 232 Central atom, 183, 184, 186–189, 192, 255 Chemical bond, 177, 180, 195, 202, 205, 208, 268, 282 bonding, 177, 179 energy, 288 Chemical oxygen-iodine laser (COIL), 288 Chiral molecule, 262 Circular Bohr orbit, 147 Clauser, Horne, Shimony and Holt (CHSH) relation, 348 Collisional energy exchange, 286 Complete wavefunction, 136 Computational chemistry, 225–227, 230, 235, 251 Configuration interaction (CI), 231 Constituent atoms, 231

Index 407

Continuous wave (CW), 285 Controlled-NOT (CNOT) gate, 360–363, 379 Correlation spectroscopy (COSY), 329–331 Cosmic microwave background radiation, 18 Crystal vibrations, 103 Cylindrical symmetry, 200, 258

D Delocalization energy, 215, 216 Delocalized electrons, 41, 222 Deltafunction, 36, 75–78, 99 potential well, 99 well, 99 Density functional theory (DFT), 235, 238, 240, 241, 243, 244 Diatomic molecular orbitals, 200, 202 molecules, 178, 196, 209, 267, 269, 273, 275–277, 280, 282, 290, 294, 304–307, 309, 310, 313 rotation, 272, 304 vibration, 267 vibrational transitions, 269 nitrogen molecule, 181 Dipole transitions, 253 Dirac deltafunction, 75 Discrete Fourier transform (DFT), 364 Dissociation energy, 178, 196, 209, 268 Double bonds, 42, 43, 119, 188, 189, 213, 215, 216 Double integral, 16, 72 Downward energy transition, 12 Downward transition, 112, 284, 318

E Eigenfunctions, 32, 36, 39, 40, 45, 51–53, 56, 57, 60, 106, 107, 118, 249 hydrogenic, 148 independent, 143 unperturbed, 65, 114 Eigenstates, 55, 61, 63, 339, 376 superposition, 376

Eigenvalue, 32, 46, 106, 109, 123, 164 equation, 32, 36, 52, 54, 56, 58, 59, 122, 123, 242 momentum, 36 Electromagnetic radiation, 5, 7, 21, 62, 267, 288 emission, 9 quantum theory, 108 Electromagnetic waves, 4, 5, 9, 21, 108 intensity, 32 Electron accelerating, 130 acceptors, 189, 222 affinity, 209 atoms, 153 bonding, 205 bonds, 185, 186 charge, 153, 201, 208, 242 charge distribution, 144 cloud, 162, 319 configuration, 155, 164, 210, 211, 251 correlation, 153, 165, 230, 241 deficient, 186 density, 144, 168, 183, 236, 240, 242, 321 diffraction, 34 distribution, 138, 196, 324 donor, 222 gun, 27 hole, 169 kinetic energy, 152, 237 lone pair, 187 mobile, 220 pairs, 187, 189 position, 137 repulsions, 152, 161 self interactions, 238 shells, 167 spin, 124–126, 155, 172, 180, 191, 316, 323, 325, 356 antiparallel, 324 coordinates, 235 densities, 325 vacancy, 223 velocities, 172 wavefunction, 41, 125, 204, 324 wavelength, 235 Electron volts (eV), 134

408 Index

Electronegative neighboring atom, 321 Electronegativity, 170, 208–210, 244 Electronegativity difference, 208, 209 Electronic absorption band, 42 charge, 261 charge density, 235 computers, 226 configurations, 174, 211 density, 235 energy, 178, 179, 196, 227, 240, 307 energy change, 148 environment, 319 ground state, 53 Hamiltonian, 227 Schrödinger equation, 178 states, 169, 255, 267, 281, 287, 342 structure, 169, 235, 319 transition, 42, 267, 279, 280 transition frequency, 290 wavefunction, 179 Emission, 112, 267, 349 net, 284 photon, 12, 284, 290, 349 probability, 112 process, 12, 281 radiation, 12 spectroscopy, 289 spectrum, 11 Energy bands, 220, 221, 235 chemical, 288 continuum, 38, 48 contributions, 279 curvature, 244 decrease for bonding, 207 density, 15, 19, 22 difference, 42 eigenvalues, 47, 52, 81, 108, 139, 369, 376, 377 electronic, 178, 179, 196, 227, 240, 307 excess, 10 exchange, 288 Fermi, 236 flux, 15 gap, 173 ground state, 62, 135 increase, 207

intensity, 15 levels, 13, 35, 38, 45, 46, 112, 113, 120, 130, 133, 140, 173, 214, 216, 220, 267, 269, 279, 284, 286, 293–295, 300, 304 operator, 56 photon, 12, 26, 111, 289, 318 quantities, 22 quantization, 10 resonance, 232 rotational, 179, 267, 273, 274, 278, 291, 294 shift, 289 splitting, 317 state, 41 states, 288, 297 variational, 152, 206 vibrational, 179 Entangled photons, 350, 352 quantum system, 339 superposition, 363 Equilibrium internuclear distance, 178, 202, 209, 276, 291 Equilibrium value, 327 Equipartition principle, 9, 18, 306, 308 Equipartition value, 311 Equivalent methyl protons, 323 orientations, 245 protons, 323 Exchange spectroscopy (EXSY), 332 Exclusion principle, 164 Pauli, 125, 159, 164, 172, 241, 372 Expectation values, 53, 54, 58, 61, 148, 152, 157

F Fermi energy, 236 hole, 242 momentum, 235 Fluorescence spectroscopy, 281 Fluorescent emission, 285 Fluorine atoms, 185 Fluxional molecule, 189 Fock states, 371, 372, 374 Fourier superposition, 376 Free electron gas, 237, 238

Index 409

Free particle, 35, 38, 48 Free-electron model (FEM), 41–43, 50, 119, 215 Frontier orbitals, 217 Full configuration interaction (FCI), 378 Fundamental vibrational frequency, 269, 294 Fundamental vibrational transition, 275

G Gas molecules, 293, 312 Gate, 225, 358, 379 classical, 360 Hadamard, 358, 359, 362, 366–368, 370 non-local, 374 NOT, 358 qubit, 362 sequence, 362 SWAP, 362, 368 voltage, 225 General Atomic and Molecular Electronic Structure System (GAMESS), 231, 233 Gibbs free energy, 294 Gravitational potential energy, 312 Ground electronic states, 167, 218 state, 38, 61, 106, 115, 134, 136, 152, 166, 169, 202, 203, 205, 215, 219, 281, 282, 284, 286 descriptive chemistry, 195 electron configuration, 173 energy, 62, 135 hydrogenlike eigenfunction, 148 vibrational energy, 268 wavefunction, 152 vibrational state, 275

H Hadamard gate, 358, 359, 362, 366–368, 370 Hamiltonian electronic, 227 matrix, 229 matrix elements, 206 molecular, 373 operator, 59

rotational, 306 spin, 322, 325 unperturbed, 63, 67 vibrational, 287 Harmonic oscillator, 69, 71, 79, 103, 104, 106, 108, 110, 113, 122, 129, 134, 140, 196, 268–270 wavefunctions, 73 Harmonic vibration, 313 Hartree, 153, 163 equations, 164, 165 method extension, 164 HD molecule, 322, 323 Heisenberg uncertainty principle, 342 Helium atom, 151, 152, 154, 161, 164, 244, 379 states, 156 Helmholtz free energy, 294, 295, 300 Heteronuclear diatomic molecules, 258, 269, 311 Heteronuclear molecules, 207, 305 Highest occupied molecular orbital (HOMO), 42, 43, 210, 217–219 Homonuclear diatomic molecules, 199, 203, 206, 255, 259, 269 diatomic species, 203 molecule, 305, 311 Hückel molecular orbital theory, 213 Hybrid orbitals, 181–183 Hydrogen AOs, 254 atom, 13, 28, 48, 64, 71, 87, 119, 129–131, 134, 146, 147, 179, 199, 200, 245, 246, 319, 324 atom ground state, 148 atom orbitals, 202 atomic, 12, 129, 133, 148, 149 bonding, 193–195, 323 bonds, 193–195 energy levels, 134, 148 gas, 199 halides, 319 molecule, 177, 179, 180, 311, 379 orbitals, 144 spectrum, 147 Hydrogen fluoride (HF), 288

410 Index

Hydrogenic atomic orbitals, 229 eigenfunctions, 148 Schrödinger equation, 140 system, 140 Hydrogenlike atom, 99, 136, 148, 149, 240 atom quantum mechanics, 134 functions, 140 ions, 132, 151 orbitals, 154 problem, 139 Schrödinger equation, 148 Hydroxyl proton, 319, 321, 323 Hypervalent molecules, 183, 196

I Infrared radiation, 287, 288 Infrared transitions, 286 Instantaneous interelectronic interactions, 165 Integer values spin, 125 Integral Coulomb, 156 exchange, 156 formula, 74 multiples, 119, 161 number, 39, 133 one electron, 156 phase, 302, 306 relations, 121 Integrated circuits (IC), 225 Intensity laser transition, 284 radiation, 10, 22 Rayleigh scattering, 288 transition, 280 Interaction energy, 16, 63, 324 Interatomic attraction, 174 Interelectronic interaction, 238 repulsion energy, 237 Internuclear axis, 180, 199–202, 204 Internuclear distance, 178, 199, 210, 267, 272, 280 Intrinsic semiconductor, 222

Inversion symmetry, 200 Iodine atoms, 220, 288 Ionization energy, 147, 151, 162, 164, 170, 209, 229 Isotopomer, 275

K Kinetic energy, 10, 35, 36, 38, 40, 44, 104, 117, 146, 177, 235, 237, 243, 301, 308, 312 in quantum mechanics, 238

L Larmor frequency, 318 Laser emission, 287, 288 Laser transition, 284, 285, 287, 288 Lewis acids, 189 bases, 189 diagrams, 183 Linear combination of atomic orbitals (LCAO), 202, 203 approach, 203 approximation, 202, 205, 210, 220, 228 Linear molecule, 182, 270, 279, 290, 308 Linear momentum, 36, 69 Local density approximation (LDA), 237, 238 Local realism (LR), 343–346, 349, 350, 352, 364 predictions, 348 Localized orbitals, 252 Logic gates, 224, 358 Lone pair, 187–189, 252 electrons, 187 orbitals, 183 Lowering operator, 123 Lowest unoccupied molecular orbital (LUMO), 42, 43, 210, 217, 219 Lyman series, 129, 140

M Macroscopic state, 311 Magnetic resonance imaging (MRI), 332, 333 functional, 333

Index 411

Mass number, 3 particle, 98, 119, 293 proton, 131, 132 ratios, 1 Microstates, 296, 297, 311 Microwave spectroscopy, 273 Minimum value momentum, 38 Molecular orbital (MO), 200, 201, 203–206, 208, 213, 214, 217, 219 Molecule butadiene, 42 cyclobutadiene, 119 diatomic, 178, 196, 209, 267, 269, 273, 275–277, 280, 282, 294, 304–307, 309, 310, 313 energy levels, 289 fluxional, 189 hydrogen, 177, 180, 311, 379 number state, 298 quantum states, 279 quantum theory, 293 spectroscopy, 267 vibrational energies, 289 Moment of inertia, 117, 120, 273, 274, 277–279, 294, 306, 311 Momentum components, 34, 57 eigenstate, 56 eigenvalue, 36 minimum value, 38 operators, 56 particle, 342 photon, 26 space, 235 Monatomic atoms, 308 formulas, 304 gas molecule, 302 gases, 301, 302 ideal gases, 304, 310 Multiple photon entanglement, 349 states, 133 Multiplet ground states, 309 Multitude, 12, 227, 286, 359, 364

N Neighboring atoms, 181, 213 molecules, 323 orbitals, 183 Net bonding effect, 205 emission, 284 spin, 173 Neutral atom, 236, 238 molecules, 177, 205 particles, 364 Neutron spins, 316 unpaired, 317 Nitrogen atom, 168, 181, 193, 245 Noble gas electron, 119 Nonbonding, 184, 187, 255 orbital, 184 Nonradiative transitions, 284 Nonrelativistic free particle, 30 Nonstationary state, 67 Nonsymmetrical linear molecules, 258 Normal modes of vibration, 308 Nuclear atom, 13 attractive energy, 237 energies, 49 masses, 146, 267, 280 spin, 127, 305, 316, 318, 324, 325, 327, 342 spin configuration, 325 Nuclear magnetic resonance (NMR), 315, 342 absorption, 318 chemical applications, 316 spectroscopy, 318, 319, 328 transition, 322, 327 Nuclear Overhauser effect (NOE), 332 Nuclear Overhauser effect spectroscopy (NOESY), 332 Nucleus atomic, 3 atomic number, 3, 172 spin, 317

412 Index

O Oblate symmetric rotors, 279 Operator angular momentum, 57 lowering, 123 raising, 123 Optimized helium variational wavefunction, 156 wavefunction, 228 Optoelectronic devices, 223 Orbital angular momenta, 119, 124, 127, 134, 148, 166, 167, 169, 173, 192, 200, 204, 273, 282, 319, 325 antibonding, 202, 207, 220 antisymmetric, 180 approximation, 159 bonding, 184, 201, 205, 207, 220, 325 classification, 255 designation, 167, 200 distributions, 165 energies, 165, 170, 229 factor, 204 functions, 180, 203 Hartree product, 164 model, 199 motion, 316 radii, 132 spins, 162 symmetry, 216, 217 Organic chemistry, 217 dye molecules, 287 molecules, 321 Oscillator anharmonic, 113, 269 harmonic, 69, 71, 79, 103, 104, 106, 108, 110, 113, 122, 129, 134, 140, 196, 268–270 Overtone transitions, 269 Oxidation states, 170 Oxygen atom, 181, 194, 321 molecules, 205, 288

P Parallel spins, 155, 169, 170, 324 Partial differential equation, 83

Particle mass, 98, 119 momentum, 342 separation, 145 single, 146, 302 velocities, 353 Pauli exclusion principle, 125, 159, 164, 172, 241, 372 gates, 374 spin, 125 spin matrices, 127 Perturbed state, 340 Phase gate, 361 Phosphorescence, 282 Photoelectric effect, 7, 9, 10, 24, 25 Photoelectrons, 10 spectroscopy, 203 Photon, 10, 24, 110 absorption, 12 annihilation, 12 emission, 12, 205, 282 energy, 12, 25, 26, 108, 111, 130, 289 frequency, 112 incoming, 283 momentum, 26 numbers, 111, 112 pair, 350 polarizations, 348, 350 rest mass, 26 spontaneous emission, 288 superposition, 340 Planck radiation law, 17 Polar angle φ, 82 Polarization states, 356, 357 Polarizing beam splitter (PBS), 351 Polyatomic gases, 301 ideal gas, 304 ideal gases, 304, 310 molecules, 275, 277, 293, 303, 305–308 molecules rotation, 277, 306 molecules vibration, 270 Positron emission tomography (PET), 147 Potential energy, 16, 30, 31, 35–37, 43, 44, 104, 130, 131, 134, 135, 178, 235–238, 267, 280

Index 413

Power series, 64, 89, 376 expansion, 90, 91 Principal axis, 245, 249, 255, 256, 259, 279 in ammonia, 245 Principal quantum number, 136, 140 Principle variational, 240 Probability absorption, 112 amplitude, 63 density, 33, 57, 137, 138, 313 distributions, 39 electron, 149 emission, 112 for absorption, 112 theory, 347 transition, 68, 112, 318 Proton mass, 131, 132 NMR spectrum, 319, 333 resonances, 321 spin orientation flips, 318

Q Quantized energy, 118 energy levels, 35, 38, 377 particles, 10 Quantum computer, 353–356, 362, 364, 370, 372, 377, 378 computer circuit, 380 gates, 358, 360, 364, 373, 374 numbers, 38, 39, 45, 52, 106, 121, 123, 124, 154, 155, 200, 268, 269, 273, 294, 298, 301 principles, 355 state, 38, 39, 58, 62, 63, 115, 125, 245, 299, 341, 343, 366, 376 state superposition, 341 state vector, 356 superposition, 341, 342, 353, 354 system, 32, 38–40, 46, 53, 55, 61, 125, 239, 338, 339, 342, 353, 359, 376, 378 theory, 7, 9, 21, 26, 56, 127, 129, 172, 308, 337, 339, 352, 353

Quantum mechanics (QM), 21, 26, 31–33, 35, 36, 38, 51, 53, 57, 81, 86, 133, 134, 153, 154, 177, 225, 248, 293, 337, 338, 342–345, 347, 348 Qubit gate, 362 quantum computer, 356, 372 single, 359, 374 states, 357, 366, 377

R Radiation absorption, 10, 12, 282, 284 density, 112, 113 emission, 12 field, 10, 17, 63, 110, 111, 129, 290, 328 frequency, 9, 10 intensity, 10 law, 113 output, 285 quantum theory, 108, 110 radiofrequency, 329 theory, 109 Radiationless relaxation, 282 transition, 282 Radiative dipole transition, 115 transition, 66, 252 transition frequencies, 13 Raising operator, 123 Raman spectroscopy, 288–290, 308 resonance, 290 Raman spectroscopy resonance, 290 Rayleigh scattering, 288 Relativity, 172–174 Repulsive potential energy, 156 Resonance energy, 232 integrals, 213 Raman spectroscopy, 290 Resonant photons, 284 Rotation diatomic molecules, 272, 304 gates, 359 symmetry operation, 257

414 Index

Rotational axes, 258, 260, 261 characteristic temperature, 306, 311 constant, 273, 274, 276, 278, 304, 307, 313 contributions, 306 degrees of freedom, 267, 270, 306 energy, 179, 267, 273, 274, 278, 291, 294 energy eigenvalues, 279 energy levels, 273, 290 factor, 304 groups, 258 Hamiltonian, 306 levels, 273–275, 291, 313 motion, 270 parameters, 289 partition function, 305–307 quantum numbers, 274, 279, 305 spectrum, 277 states, 280, 311 structure, 279 transitions, 267, 271, 273, 275, 280 Rotational nuclear Overhauser effect spectroscopy (ROESY), 332

S Scattered radiation, 289 Schrödinger equation, 28, 33, 35–37, 44, 61, 96, 105, 119, 151, 152, 178, 179, 225–227 electronic, 178, 267 for atomic orbitals, 138 for one-dimensional harmonic oscillator, 104 for quantum-mechanical harmonic oscillator, 79 for relative motion, 146 for unperturbed Hamiltonian, 64 for vibration, 267 hydrogenlike, 148 in atomic units, 134 Secular equation, 206, 207, 213–216, 229 Secure electronic transactions, 355 Self-consistent field method, 153 Simple harmonic motion, 104

Single bonding orbital, 205 bonds, 41 electron orbits, 132 frequency, 328 particle, 146, 302 peak, 333 quantum state, 342 qubit, 359, 374 qubit gate, 361 Slater determinant, 243 symmetry operation, 246 unpaired electron, 124 wavelength, 21 Singlet, 167, 323 excited states, 211 spin function, 180 spin state, 155, 354 state, 155, 156, 323, 343, 347, 354 Slater determinant, 159, 161, 162, 165, 227, 241, 251 single, 243 spinorbitals, 164 wavefunctions, 164 Spectroscopic transitions, 62 Spectroscopy, 11, 193, 196, 267, 276, 280, 283, 293, 304, 307, 328 atomic, 133 emission, 289 molecule, 267 Spherical harmonics, 83, 86, 87, 120, 121, 140–143, 260 polar coordinates, 33, 78, 81–83, 95, 98, 135, 138, 200 rotor, 278, 279, 291, 307 Spin antiparallel, 346 coordinates, 161, 162 electron, 124–126, 155, 172, 180, 191, 316, 323, 325, 356 function, 154, 155 Hamiltonian, 322, 325 integer values, 125 multiplicity, 167, 204 net, 173 nucleus, 317 operators, 127

Index 415

orbit splitting, 173 orthogonality, 162 parallel, 205 particles, 316 Pauli, 125 polarization, 324, 325 quantum number, 167 states, 315, 317–319, 322, 323, 344, 357, 364 system, 327 Spinor wavefunction, 125 Spinorbitals, 154, 159, 162, 164, 242, 243, 251, 371, 372, 379 Slater determinant, 164 Spontaneous emission, 112, 113, 283, 284, 318, 349 photon, 288 Stable molecule, 177 states, 355 State ground, 38, 61, 106, 115, 134, 136, 152, 166, 169, 202, 203, 205, 215, 219, 281, 282, 284, 286 quantum, 38, 39, 62, 63, 115, 125, 245, 299, 341, 343, 366 superposition, 53, 342, 372 vector, 357 Stereochemistry, 185, 216 Stimulated emission, 112, 113, 283, 284, 288, 318 Subatomic particles, 1, 2, 342, 343 Superposed quantum states, 341 state, 341 Superposition, 23, 36, 53, 61, 67, 230, 308, 329, 339–342, 348, 352, 362 entangled, 363 photon, 340 principle, 340 quantum, 341, 342, 353, 354 state, 53, 342, 372 SWAP gate, 362, 368 Symmetric rotor, 279, 307 Symmetry axis, 279 elements, 257, 259

group, 245, 246, 248, 254, 255, 257–260, 262, 264, 270 number, 305 operations, 245, 246, 250, 255–258 operators, 246 orbital, 216, 217, 262 states, 262

T Tertiary hydrogen, 323 Tetrahedral molecules, 290 orbitals, 183 Theory of relativity, 26, 33, 172, 343 Thermal energy, 273, 281 Thermal radiation, 7, 18 Thermodynamic energy, 299 Thermodynamic functions, 294, 300, 301, 304, 306, 308–310 Total correlation spectroscopy (TOCSY), 332 Transition elements, 169, 170 metals, 170, 171 probability, 68, 112, 318 rates, 68, 113 Translational energy, 303, 306 Triatomic molecules, 255, 278 Triplet, 322, 323 ground state, 195 spin function, 180, 204 spin states, 155 state, 323 states, 155, 156, 282

U Unadorned symmetry groups, 260 Unbound particle, 58 Undecayed states, 341 Unhybridized atomic orbitals, 196 Unoccupied orbitals, 167, 169, 221 Unpaired electrons, 181, 189 neutron spins, 317 orbitals, 180 spins, 169 Unperturbed eigenfunctions, 65, 114 eigenvalues, 65

416 Index

energies, 114 Hamiltonian, 63, 67 state, 67 Unshared electron pairs, 193 Upward transition, 112, 318

V Vacuum state, 111, 372 Valence electrons, 171, 187–190, 220, 222, 223 shell, 170, 171, 180, 181, 187, 255 shell model, 189, 196 states, 209 Variational energy, 152, 206 principle, 240 wavefunction, 157 Vertical transitions, 280, 281 Vibration amplitude, 272 diatomic molecules, 267 modes, 39, 45, 97, 110, 270 Vibrational characteristic temperatures, 307, 311 constant, 273, 276, 307, 313 contributions, 308, 310 degrees of freedom, 267 energy, 179 energy levels, 269, 290, 294, 309 frequency, 308, 314 ground state, 273 Hamiltonian, 287 levels, 269, 281, 282 modes, 270

motion, 280 parameters, 289 partition function, 307, 309 quantum numbers, 274, 279 relaxation, 281 states, 279, 280, 291, 313 structure, 279 transitions, 267, 269, 272, 275, 280 wavefunctions, 281 Virtual spinorbitals, 371

W Water molecule, 181, 254, 270 Wavefunction, 23, 29, 31, 32, 37, 39, 40, 58, 138, 140, 153, 155, 164, 180, 200 collapse, 339, 342, 354 electron, 41, 125, 204, 324 electronic, 179 for electron, 163 functional, 240 gradient, 353 ground state, 152 in quantum mechanics, 15 optimized, 228 separable, 98 sign change, 204 variational, 157 Wavenumber units, 268, 269, 273, 304, 307 Woodward-Hoffmann rule, 219

X XOR gate, 360