Quantum Mechanics: For Scientists and Engineers [1 ed.] 9781032117645, 9781003221432

This book covers the entire span of quantum mechanics whose developments have taken place during the early part of the t

189 75 14MB

English Pages 226 [227] Year 2021

Table of contents :
Cover
Half Title
Title Page
Copyright Page
Preface
Table of Contents
1. Wave and Matrix Mechanics of Schrodinger, Heisenberg and Dirac
2. Exactly Solvable Schrodinger Problems, Perturbation Theory, Quantum Gate Design
3. Open Quantum Systems, Tensor Products, Fock Space Calculus, Feynman’s Path Integral
4. Quantum Field Theory, Many Particle Systems
5. Quantum Gates, Scattering Theory, Noise in Schrodinger and Dirac’s Equations
6. Quantum Stochastics, Filtering, Control and Coding, Hypothesis Testing and Non-Abelian Gauge Fields
7. Master Equations, Non-Abelian Gauge Fields
Appendix

Recommend Papers

Advanced Fluid Mechanics and Heat Transfer for Engineers and Scientists: For Engineers and Scientists 3030729249, 9783030729240

The current book, Advanced Fluid Mechanics and Heat Transfer is based on author's four decades of industrial and ac

234 75 20MB Read more

C Programming for Scientists and Engineers (Manufacturing Engineering for Scientists and Engineers) 1857180305, 9781857180305, 9781417526758

In a computing world that is increasingly full of C++ and object-oriented methods, C still has an important role to play

844 22 7MB Read more

Scientific Computing: For Scientists and Engineers 9783110359428

Scientific Computing for Scientists and Engineers is designed to teach undergraduate students relevant numerical methods

185 90 3MB Read more

A First Course in Continuum Mechanics: for Physical and Biological Engineers and Scientists 0130615323, 9780130615329

Revision of a classic text by a distinguished author. Emphasis is on problem formulation and derivation of governing equ

380 74 4MB Read more

Scientific Computing: For Scientists and Engineers 9783110359428

Scientific Computing for Scientists and Engineers is designed to teach undergraduate students relevant numerical methods

207 33 1MB Read more

Essential Matlab For Engineers And Scientists

612 109 2MB Read more

Water wave mechanics for engineers and scientists 9810204213, 9810204205, 9789812385512, 9789810204204

This book is intended as an introduction to classical water wave theory for the college senior or first year graduate st

356 22 14MB Read more

Quantum Mechanics: An Introduction for Device Physicists and Electrical Engineers [3 ed.] 9780367469153, 9780367467272, 9781003031949

366 41 30MB Read more

Quantum mechanics for science and engineering

563 115 1MB Read more

Quantum Mechanics for Chemistry 9783031302176

This textbook forms the basis for an advanced undergraduate or graduate level quantum chemistry course, and can also ser

177 40 8MB Read more

Quantum Mechanics: For Scientists and Engineers [1 ed.]
9781032117645, 9781003221432

Author / Uploaded
Harish Parthasarathy

Similar Topics
Physics
Quantum Mechanics

0 0 0
Like this paper and download? You can publish your own PDF file online for free in a few minutes! Sign Up

File loading please wait...

Citation preview

QUANTUM MECHANICS FOR SCIENTISTS AND ENGINEERS

QUANTUM MECHANICS FOR SCIENTISTS AND ENGINEERS

Harish Parthasarathy Professor Electronics & Communication Engineering Netaji Subhas Institute of Technology (NSIT) New Delhi, Delhi-110078

First published 2022 by CRC Press 2 Park Square, Milton Park, Abingdon, Oxon, OX14 4RN and by CRC Press 6000 Broken Sound Parkway NW, Suite 300, Boca Raton, FL 33487-2742 © 2022 Manakin Press Pvt. Ltd. CRC Press is an imprint of Informa UK Limited The right of Harish Parthasarathy to be identiﬁed as author of this work has been asserted in accordance with sections 77 and 78 of the Copyright, Designs and Patents Act 1988. Reasonable eﬀorts have been made to publish reliable data and information, but the author and publisher cannot assume responsibility for the validity of all materials or the consequences of their use. The authors and publishers have attempted to trace the copyright holders of all material reproduced in this publication and apologize to copyright holders if permission to publish in this form has not been obtained. If any copyright material has not been acknowledged please write and let us know so we may rectify in any future reprint. All rights reserved. No part of this book may be reprinted or reproduced or utilised in any form or by any electronic, mechanical, or other means, now known or hereafter invented, including photocopying and recording, or in any information storage or retrieval system, without permission in writing from the publishers. For permission to photocopy or use material electronically from this work, access www.copyright.com or contact the Copyright Clearance Center, Inc. (CCC), 222 Rosewood Drive, Danvers, MA 01923, 978-750-8400. For works that are not available on CCC please contact [email protected] Trademark notice: Product or corporate names may be trademarks or registered trademarks, and are used only for identiﬁcation and explanation without intent to infringe. Print edition not for sale in South Asia (India, Sri Lanka, Nepal, Bangladesh, Pakistan or Bhutan). British Library Cataloguing-in-Publication Data A catalogue record for this book is available from the British Library Library of Congress Cataloging-in-Publication Data A catalog record has been requested ISBN: 978-1-032-11764-5 (hbk) ISBN: 978-1-003-22143-2 (ebk) DOI: 10.1201/9781003221432

Preface

This book covers the entire span of modern quantum mechanics including quantum field theory and more recent developments in quantum stochastics. The book starts with the Bohr-Sommerfeld correspondence principle which was the first model developed by Bohr to explain Rutherford's picture of the atom with electrons revolving around the nucleus without losing energy in the form of electromagnetic radiation (produced by accelerating particles according to the classical Ma.xwell theory) which would otherwise cause the energy of the electron to continuously decrease and collapse to the nucleus leading to the instability of all matter. Bohr explained this by postulating that in a stationary state the electron does not lose or gain energy and further that the angular momentum of the electron which is the line integral of the momentum around one closed orbit is quantized in integral multiples of h/21f where h is Planck's constant. Sommerfeld's generalization of this principle of Bohr to arbitrary action-angle varables is mentioned here so that the line integral of the action variable with respect to the angle variable over one complete circuit is an integral multiple of h/21r. The book then continues with the evolution of the Schrodinger equation from the Planck and De-Broglie hypothesis then proceeds to the matrix mechanics of Heisenberg and its equivalence with the Schrodinger wave mechanics along the lines outlined by Dirac in his celebrated book "The principles of quantum mechanics". In the process, we also discuss the interaction picture of Dirac which is something midway between the Schrodinger and the Heisenberg pictures. MIIJ{ Born's intepretation of the wave function in terms of probabilities is also discussed. The superposition principle which is at the heart of quantum mechanics is cliBcussed with illustrations from the Young double slit experiments for interference of photons and likewise, the Stern-Gerlach experiments that demonstrate the fact that electrons can sometimes behave like waves thus confirming De-Broglie's wave-particle duality. We then discuss the effect of an electromagnetic field on an atom using time dependent perturbation theory and also the Zeeman effect which predicts the splitting of the degenerate levels of an atom in a. constant magnetic field caused by the interaction of spin with the magnetic field. This leads us to the Pauli equation which is the Schrodinger equation for a two component wave function with spin-magnetic field interaction accounted for. A thorough discussion of a time independent quantum system perturbed by time varying potential using the Dyson series is presented which is later on used in the book to calculate scattering amplitudes for electrons, positrons and photons upto any order in perturbation theory. Time independent perturbation theory is also introduced as a tool for calculating approximately the shift in the energy levels and the energy eigenstates of a quantum system when perturbed by a potential. The problem of perturbation of degenerate states is also taken up by showing that if the unperturbed energy E is degenerate with a degeneracy of d, then on perturbation, it will split into d energy levels and whose eigensta.tes and energy eigenvalues are determined by the eigenvalues and eigenvectors of the "secular matrix" obtained by considering the matrix elements of the perturbing potential with respect to an orthonormal basis of the d dimensional vector space of all eigenfunctions of the unperturbed Hamiltonian having energy E. Using the Ca.uchySchwarz inequality, we give Weyl's proof of the Heisenberg uncertainty principle in its most general form. Three exactly solvable problems in quantum mechanics are presented in full detail, namely derivation the the energy eigenfunctions and energy eigenvalues of the Hamiltonian corresponding to (a.) a particle in a box, (b) the harmonic oscillator and (3) the non-relativistic Hydrdogen atom. The last two are based on solving ordinary differential equations using the power series method and for the Hydrogen atom case, the full derivation of the spherical harmonics as eigenfunctions of the angular momentum square operator is presented in full detail. Later on, we show how the spherical harmonics determined all the irreducible reprsenta.tions of the rotation group 80(3). This derivation gives an alternate method to obtain all the irreducible representations of the rotation group in comparison to the method of Wigner and Weyl who constructed the irreducible representations of SU(2) using polynomials in two complex variables. The book contains some major results in group representation theory like the Frobenius- Mackey's theory for constructing irreducible representations of the semidirect product of a normal Abelian group N with another subgroup H. This theory was applied by Wigner to construct the irreducible representations of the Poincare group which is generated by Lorentz transformations and spa.c&time translations. It is also known as the little group method where one starts with a. character of the Abelian group N, constructs its orbit under the dual action of H, then constructs the stability subgroup of H for which the chosen character is a fixed point, and then using an irreducible representation of this little group Ho and the chosen character xo, one constructs an irreducible representation of the sernidirect product Go = N ® 8 Ho of Nand H 0 and by the induced representation theory, the representation of G = N x •H is constructed using the Mackey method. Irreduciblity of such induced representations is proved and easier methods of describing such induced representations using functions on the character orbit with values in the vector space on which the irreducible representation of the little group acts are provided. The reader is referred to the excellent book by K.R.Parthasa.rathy on this subject (KRP:Mathematical Foundations of

Quantum Mechanics for Scientists and Engineers Preface

vi ii

Quantum Mechanics, Hindustan Book Agency). After the discussion of non-relativistic quantum mechanics using the Schrodinger method, we describe Feynman’s path integral approach to non-relativisti quantum mechanics. Feynman in his Ph.D thesis showed that solution to the Schrodinger equation for the evolving wave function under a Hamiltonian can equivalently be represented as a path integral of exp(2πiS[q]/h) over all paths q joining (t1 , q1 ) and (t2 , q2 ) where S[q] is t the action of the path t → q(t) with the prescribed end points. This action is obtained as the integral t12 L(t, q(t), q � (t))dt of the Lagrangian L of the particle or system of particles with the Lagrangian related to the Hamiltonian used in the Schrodinger equation via the Legendre transformation. Thus, Feynman’s approach is that along each path q there is a complex amplitude exp(2πiS[q]/h) for the particle/system of particles to go from (t1 , q1 ) to (t2 , q2 ) and the total probability amplitude to go from the initial point to the second in the given time is obtained by summing these amplitudes over all paths. This is an intuitively beautiful picture of presenting quantum mechanics because it displays the quantum interference along diﬀerent paths and in addition, it clearly shows that in the limit when Planck’s constant converges to zero, the rapid oscillations of the phase 2πS[q]/h for small variations the path q cause cancellations of the probability amplitudes except for paths in the vicinity of the classical path q∗ where the variation of the action vanishes, ie where δS[q∗ ] = 0. A path q∗ satisﬁng this condition satisﬁed the classical Euler-Lagrange equation of classical mechanics which is equivalent to Newtonian classical mechanics. The mathematical problems with Feynman’s approach have till today deﬁed mathematicians since the constant of proportionality in√the FPI is inﬁnite and in addition, the path integral is 2 something like √ the integral of exp(−ix /2) over R where i = −1 and Feynman’s idea is equivalent to replacing this integral by 2πi. It gives the correct results predicted by Schrodinger’s equation but why it gives is not clear since the integral of exp(−ix2 /2) is oscillatory and hence does not converge. The Feynman path integral is the same as Kac t formula for u(t, x) = E[exp( 0 V (x + B(s))ds)f (x + B(t))] where B(.) is Brownian motion but with t replaced by it, ie ”complex time”. More general forms of Kac formula can be derived when B(.) is an arbitrary diﬀusion process or even any Markov process with a generator K. These ideas have been discussed in this book. After all these discussions, we come to relativistic quantum mechanics whose wave equation was ﬁrst correctly proposed by Dirac. Before Dirac, we had the Klein-Gordon equation based on replacing the kinetic energy K by i∂/∂t + eV (t, r) and the mometum P by −i∇ + eA(t, r) in the Einstein energy-momentum relation K 2 − c2 P 2 − m2 c4 = 0 and treating this as an operator acting on a wave function ψ(t, r) set to zero. Earlier, Sommerfeld had solved the corresponding stationary state equation for the Hydrogen atom obtaining thereby relativistic corrections to the −1/n2 spectrum of Schrodinger. However, Dirac observed two major ﬂaws with this equation. One, an evolution equation of quantum mechanics has to be ﬁrst order in time, since if the evolution operator is T (t), then T (t+s) = T (t).T (s), t, s ≥ 0 and hence T (t) = exp(−itH) for some selfadjoint operator H. Which is the same as saying that T � (t) = −iHT (t). Of course, we also have T �� (t) = −H 2 T (t) but that would lead to a superpostion of two solutions exp(±itH) and unitarity of the evolution would be broken. One way to rectify this is to consider the wave operator as K − c P 2 + m2 c2

where K = i∂/∂t + eV (t, r) and P replaced by −i∇ + eA(t, r) but the resulting equation could not be solved for any potential due to ∇2 occuring inside a square root. The other major problem with such an equation is that it is highly asymmetric in the space and time indices whereas special relativity requires that space and time should be treated on the same footing. Thus, Dirac proposed the factorization of the quadratic form E 2 −c2 (Px2 +Py2 +Pz2 )−m2 c4 into linear factors by the introducing of non-commuting 4 × 4 matrices known today as the famous Dirac γ matrices. The Dirac equation was solved by Dirac for the Coulomb potential thus obtaining negative as well as positive energy states, the positive energy states appear in a continuum and Dirac interpreted this as corresponding to a new kind of particle the positron having the same mass but opposite charge as that of an electron. Dirac proved that his relativistic wave equation for the electron is invariant under a representation of the Lorentz group, namely the ”Dirac representation” and he also proved that under conjugation followed by application of an appropriate unitary matrix, the resulting wave function also satisﬁes the Dirac wave equation but with the charge −e replaced by +e and no change in mass. This phenomenon is called charge conjugation and led Dirac to immediately conclude that this charge conjugated wave equation describes the motion of a positron. The discovery of the Positron by Dirac and its subsequent detection in an accelerator by Anderson came as a total surprise to the physics community since it led to the concept of antimatter, ie every elementary particle should have a corresponding antiparticle and that when a particle interacts with its antiparticle then the two may get annihilated to produce just a γ ray photon. The probabilities of such processes and other scattering processes where later on calculated systematically by Feynman in the process of developing quantum electrodynamics. This discovery of antimatter by Dirac must deﬁnitely rank as one of the greatest achievements in physics since the time of Newton. The discovery of the Dirac equation also led Dirac to conclude that the spin of the electron is a relativistic eﬀect and he demonstrated that by replacing the momentum operator P by P + eA and the energy E by E + eV where A is the magnetic vector potential

Quantum Mechanics for Scientists and Engineers Preface

vii iii

and V is the electric potential, and premultiplying the resulting wave equation by its ”reversed version” , one obtains the original Klein-Gordon equation of the particle in an electromagnetic ﬁeld plus terms that describe the interaction of the electric and magnetic ﬁeld with the spin of the electron. Earlier, Pauli had artiﬁcially introduced spin terms into the non-relativistic Schrodinger equation to demonstrate the Zeeman splitting of the energy levels but Dirac’s approach led naturally to terms involving the interaction of the electronic spin with the external electromagnetic ﬁeld. Moreover, for the development of relativistic quantum ﬁeld theory, ie a quantum ﬁeld theory that respects Lorentz invariance, it is the Dirac and Maxwell equations that must be used rather than the Schrodinger and Maxwell equations. In quantum ﬁeld theory, wave functions become operator ﬁelds satisfying either commutation or anticommutation relations depending on whether the wave functions describe Bosons or Fermions. The process of replacing wave functions by operator valued ﬁelds is called second quantization and from a mathematical viewpoint, this can be achieved by considering multiple tensor products of Hilbert spaces and operators acting on such spaces. If the tensor product is the usual one, the operator ﬁelds describe distinguishable particles satisfying the Maxwell-Boltzmann statistics, if the tensor products are symmetrized, then they describe Bosons which are indistinguishable particles having integer spins and that any state can be ﬁlled by any number of such particles while if the tensor products are antisymmetrized, then they describe Fermions which are indistinguishable particles having half integral spins and any state can have only either no particle or one particle also known as the Pauli exlcusion principle. In the quantum ﬁeld theory, for any kind of particle, we can have operators that create, annihilate or conserve the number of particles at any point in space-time or at any value of the four momentum. This leads us to a calculus of such operators and using the Pauli exclusion principle for Fermions, we can show that the Fermionic creation and annihilation operator ﬁelds satisfy canonical anticommutation relations while using the analogy of Bosons with the Harmonic oscillator algebra of creation and annihilation operators, we can show that the Bosonic creation and annihilation operator ﬁelds satisfy canonical commutation relations. The relation between spin and statistics is also one of the most striking features of quantum mechanics, namely that if we maximize the entropy of a system of particles subject to distinguishability of the particles and no restriction on the number of particles in a state, we get the Maxwell-Boltzmann distribution while if we maximize the entropy subject to indistinguishability and Pauli exclusion, we get the Fermi-Dirac statistics for Fermions and ﬁnally if we maximize the entropy subject to indistinguishability no Pauli exclusion, we get the Bose-Einstein statistics. The last of these, was ﬁrst discovered by the great Indian physicist Satyendranath Bose who derived this statistics and showed that Planck’s law of black-body radiation for photons can be obtained from such statistics. Einstein communicated Bose’s paper to the most prestigious physics journal at that time ”Annalen-der-physik” after translating it into German and adding footnotes explaining the signiﬁcance of Bose’s work at that time when only Planck’s law of black-body radiation was avaiable in the quantum theory and further that no rigorous derivation of this law from physical and mathematical principles was available. Our next theme in this book is the development of quantum electrodynamics. This involves ﬁrst writing down the energy of the electromagnetic ﬁeld in terms of the electric potential and magnetic vector potential in the frequency domain and then observing that for such a free electromagnetic ﬁeld, the vector potential satisﬁes the wave equation which in the spatial frequency domain, is simply a sequence of second order in time ordinary diﬀerential equations for harmonic oscillators whose characteristic frequency is simply c|k| where k is the wave vector. The Lagrangian of the ﬁeld is simply a quadratic functional of A(t, k) and ∂A(t, k)/∂t and and their complex conjugates and by applying the Euler-Lagrange equations to this Lagrangian, we get the same equation as mentioned above, namely a sequence of harmonic oscillator ode’s. We are thus led to conclude that the electromagnetic ﬁeld is simply a continuous ensemble of harmonic oscillators. Application of the Legendre transformation to this Lagrangian gives us the Hamiltonian density of the ﬁeld as a sum of position ﬁeld square and momentum ﬁeld square where the position ﬁeld is the magnetic vector potential A(t, r). By postulating the canonical commutation relations between the position and momentum ﬁelds in the form [Aμ (t, r), πν (t, r� )] = iδνμ δ 3 (r − 0r� ) we are led into trouble since the form of the Lagrangian density of the electromagnetic ﬁeld Fμν F μν implies that certain constraints on the position and momentum ﬁelds like for example since the Lagrangian does not depend on A0,0 ,the corresponding momentum π0 = 0 which is inconsistent with the canonical commutation relations. Further, by writing down the Euler-Lagrange equation of motion with a matter-ﬁeld interaction Lagragian denstiy Jμ Aμ , we are lead to a relationship between the spatial momenta πr , r = 1, 2, 3 and and J μ involving only spatial derivatives of πr . It should be noted that the current ﬁeld J μ is a matter ﬁeld described in terms of the Dirac operator wave function: J μ = −eψ ∗ (t, r)γ 0 γ μ ψ(t, r) These constraints are therefore either primary ie, arising from the the structure of the Lagrangian or secondary, ie, arising from the Euler-Lagrange equations of motion. These constraints are incompatible with the canonical commutation relations. To rectify this situation, Dirac proposed a new type of bracket expressed in terms of the Poisson/Lie bracket and taking the constraints into account. He showed that this new bracket, known today as the Dirac bracket is conststent with the constraints and also yields the correct equations of motion for the unconstrained observables. Using

Quantum Mechanics for Scientists and Engineers Preface

viii iv

this he developed the ﬁrst operator theoretic version of quantum electrodynamics. Feynman then applied his path integral formalism to ﬁelds like the electromagnetic four potential and the Dirac four component wave function and showed how one can derive from the path integral formalism, perturbative corrections to the electron and photon propagators arising from interaction between the matter ﬁeld in the form of Dirac currents and the electromagnetic four potential. He gave a beautiful diagrammatic method for calculating the scattering matrix elements to any order in perturbation theory. The resulting formulae can directly be derived by considering an initial state |i > of photons, electrons and positrons having deﬁnite four momenta and spin/polarizations and likewise a ﬁnal state |f >. These initial and ﬁnal states can be obtained by acting on the vacuum appropriate creation operators, then writing the interaction terms in the total Hamiltonian between photon ﬁeld and electron current as integrals expanding the time ∞ ordered exponential U (∞, −i∞) = T {exp(−i −∞ H(t� )dt� )} in the interaction picture as a power series in the interaction energy − J μ (x)Aμ (x)d3 x where J μ (x) = −eψ ∗ (x)γ 0 γ μ ψ(x), then expressing ψ(x) and Aμ (x) in terms of creation and annihilation operators of the electron-positron and photon ﬁeld and then using the standard commutation relations, evaluate the scattering matrix elements < f |U (∞, − inf ty)|i > in the interaction picture. This operator formalism was developed by Schwinger-Tomonaga and Dyson while Feynman developed the path integral approach to such calculations with ﬁnally Dyson proving the equivalence of the theories of Feynman and Schwigner-Tomonaga. The importance of the propagator computation in quantum ﬁeld theory has been highlighted in this book. Namely, if φ(x) is a collection of ﬁelds and the action for such a collection of ﬁelds is expressed as S[φ] = SQ [φ] + �SN Q [φ] where SQ is a quadratic functional and SN Q is a cubic and higher degree functional, the latter coming from interactions, then the scattering matrix using FPI can be expressed as S[φ∞ , φ−∞ ] = exp(iS[φ])Dφ =

ex(iSQ [φ])(1 + i�SN Q [φ] − �2 SN Q [φ]2 /2 + ...)Dφ

It is clear that each term in this inﬁnite series is equivalent to calculating the moments of an inﬁnite dimensional Gaussian distribution with complex variance and we know from basic probability theory, that even higher order moments of a Gaussian vector can be expressed as sums over products of the second order moments. The second order moment appearing here is the propagator of the ﬁeld: Dφ [x, y] = exp(iSQ [φ])φ(x)φ(y)Dφ These aspects of quantum electrodynamics have been covered in this book. We give explicit computations of the photon propagator in diﬀerent gauges (Feynman, Landau and Coulomb gauges) and also for the electron propagator. These expressions are derived using both the methods, ﬁrst the operator theoretic method and second, using the Feynman path integral for quadratic Lagrangians combined with the standard formulas for the second order moments of a multivariate Gaussian distribution. We also present modern developments in quantum ﬁeld theory, starting with the standard YangMills group theoretic generalization of the Dirac or Klein-Gordon equations for matter ﬁelds interacting with the photon ﬁeld. In this generalization, we assume that the wave function takes values in C4 ⊗ CN and a subgroup G of the unitary group U (N ) acts on this wave function. We introduce a Lie algebra theoretic covariant derivative which is an ordinary four gradient plus a connection ﬁeld which takes values in the Lie algebra of the group G. This gauge covariant derivative acts on the matter ﬁeld wave function and if the matter ﬁeld wave function undergoes a local (ie space-time dependent) transformation g(x) ∈ G, then accordingly the gauge connection ﬁeld in the covariant derivative has to undergo a certain transformation so that the covariant derivative of the matter ﬁeld transforms simply by multiplication by g(x). This ensures that the Lagrangian density constructed out of functions of the matter ﬁeld and its covariant derivative will be invariant under local G-transformations of the matter ﬁeld and the gauge ﬁeld provided that the Lagrangian is a G-invariant function of its agruments. Owing to the non-commutativity of the gauge ﬁelds, the gauge ﬁeld tensor deﬁned as the commutator of the covariant derivatives contains an extra quadratic nonlinear term in the gauge potentials apart from its four dimensional curl. This is a characteristic feature of non-Abelian gauge theories in contrast to the case of the electromagnetic ﬁeld where the gauge group is U (1). In the electromagnetic ﬁeld case, the matter ﬁeld is the Dirac ﬁeld since the gauge group is U (1), the local group transformation element g(x) is simply a modulus one complex number depending on the space-time variables. The corresponding gauge transformation of the gauge ﬁeld potentials, namely, the electromagnetic four potential simpliﬁes to the standard Lorentz gauge transformation Aμ → Aμ + ∂μ φ. Based on the Yang-Mills non-Abelian generalization of electromagnetism, Salam,Weinberg and Glashow were able to unify the electromagnetic forces and the weak forces, calling it the electro-weak theory. Their idea was to start with a

Quantum Mechanics for Scientists and Engineers Preface

ix v

non-massive vector gauge ﬁeld Lagrangian corresponding to the weak forces in the nucleus, coupled to the matter ﬁeld of Leptons (which include electrons) and then add symmetry breaking terms to this Lagrangian by coupling a scalar ﬁeld to the Leptons. Symmetry breaking occurs when the scalar ﬁeld is in its ground state which is a constant and this leads to mass terms involving quadratic non-derivative terms of the electronic ﬁeld as well as to the gauge Bosonic ﬁelds which are components of the vector gauge ﬁeld. In this context, we give a brief review of symmetry breaking both spontaneous and local versions. Symmetry breaking can occur when Hamiltonian/Lagrangian is invariant under a group G and the ground state is degenerate. If the Hamiltonian is invariant under G, then the subspace of ground states is invariant under G but any given ground state may not be invariant under G. If we take such a ground state and look upon the quantum state as this ground state plus a quantum perturbation, then the resulting Lagrangian will not be invariant under G since the ground state is not, but will be invariant under a subgroup H of G also called a broken subgroup. This sort of symmetry breaking produces massless particles called Goldstone Bosons. We can demonstrate this fact that symmetry breaking produces massless particles much better and in a more generalized framework using groupt theory. To do so, we ﬁrst express the wave function of the system in terms of a unbroken H part and a broken G/H part. The unbroken part transforms according to H and the broken part can be viewed as a ﬁeld with values in the coset space G/H. When the Lagrangian is expressed in terms of these components, it turns out that it does not contain any non-derivative quadratic components of the broken part while it contains non-derivative quadratic components of the unbroken part. This demonstrates that the broken part describes massless Goldstone Bosons while the unbroken part of the wave function describes massive particles. We also show that the G-symmetry of a Lagrangian can be broken by adding perturbative terms that are not G-invariant. Sometimes symmetry breaking can lead to massless particles being massive as in the electroweak theory of Salam, Weinberg and Glashow. This happens because of the coupling of the gauge ﬁelds to a scalar ﬁeld. The gauge ﬁelds initially are massless but the coupling to the scalar ﬁeld followed by evaluation of the Lagrangian in the ground state of the scalar ﬁeld causes terms in the Lagrangian involving quadratic non-derivative terms of the gauge ﬁeld to be present. These terms cause the gauge ﬁelds to become massive. The electroweak theory is an example of such a situation which gives masses to the gauge ﬁelds other than the electromagnetic ﬁeld. All this is quantum ﬁeld theory from the physical standpoint. We next explain how certain aspects of quantum ﬁeld theory including the theory of quantum noise can be developed using rigorous mathematics, ie, functional analysis. In particular, we show how certain major stochastic processes in classical probability theory like the Brownian motion and Poisson processes are special cases of quantum stochastic processes, ie, a family of non-commuting observables in a special kind of Hilbert space, namely the Boson Fock space when viewed in speciﬁc states. The notion of a quantum probability space as a triplet (H, P (H), ρ) where H is a Hilbert space, P (H) is the lattice of orthogonal projection operators in H and ρ is a state in H, ie, a positive semideﬁnite unit trace operator in H is introduced as compared to a classical probability space (Ω, F, P ). After doing so, we construct the Boson Fock space which can describe an arbitrary number of Bosons using the symmetric tensor product of a particular Hilbert space. The Boson Fock space is the substratum for constructing basic quantum noise processes, ie, noncommuting family of operators which specialize to Brownian motion and Poisson processes when the state is appropriately chosen. The Boson Fock space or noise Bath space is coupled to the system Hilbert space via a tensor product. Then we follow the marvellous approach of R.L.Hudson and K.R.Parthasarathy of construting the creation, annihilation and conservation operator ﬁelds in the Boson Fock space. We show via physical arguments that the creation and annihilation operator ﬁelds can also be viewed fromt the standpoint of an inﬁnite sequence of Harmonic oscillators, by constructing the creation and annihilation operator for each oscillator and deﬁning the coherent vectors in terms of a superposion of the energy eigenstates of these oscillators and proving that the coherent states are eigenstates of the annihilation operator ﬁeld now constructed as superpostions of the annihilation operators for the diﬀerent oscillators. The creation operator is the adjoint of the annihilation operator and turns out to have the same action on coherent vectors as the complex derivative of the latter with respect to the complex numbers used to construct the coherent vector from the oscillator energy eigenstates. In the work of Hudson and Parthasarathy, coherent vectors in the Boson Fock space were constructed as a weighted direct sum of multiple tensor products of a ﬁxed vector in the Hilbert space with itself and the creation, conservation and annihilation operators were deﬁned using the generators of one parameter unitary groups derived from the Weyl operator by restricting to translation and unitary rotation. The Weyl operator in the Hudson-Parthasrathy (HP) theory is itself described by its action on coherent vectors. Coherent vectors without normalization in the HP theory were called exponential vectors. This approach to the construction of the basic noise ﬁelds is highly mathematical and we provide in this book some physical insight into this correspondence by making an isomorphism between the coherent vectors of the HP theory and coherent vectors constructed using eignstates of harmonic oscillators. Further, in our book, we construct the unitary rotation operators needed for constructing the conservation process by resorting to quadratic forms of the creation and annihilation operators of the harmonic oscillators. The theory of quantum stochastic calculus developed by Hudson and Parthasarathy introduces time dependent creation, annihilation and conservation processes which statisfy a quantum Ito formula for products of time diﬀerentials of these processes. The classical Ito formula for Brownian motion and the Poisson process is shown to arise as a special commutative version of the quantum Ito formula of HP. The HP theory says

x vi

Quantum Mechanics for Scientists and Engineers Preface

more, namely that in the general noncommutative case, the products of creation and annihilation operator diﬀerentials with conservation diﬀerentials need not be zero. This unlike the classical probability case where if B is Brownian motion and N is Poisson process, then dB.dN = 0. Our analogy of the HP theory with the quantum harmonic oscillator theory provides us with a direct route to quantum optics as described in the celebrated book by Mandel and Wolf on optical coherence and quantum optics. The idea is to ﬁrst set up the famous Glauber-Sudarshan non-orthogonal resolution of the identity operator in the Boson Fock space as a complex integral of the coherent states |e(u) >< e(u)|, then to express the Hamiltonian of the system interacting with a photon bath as the sum of the system Hamiltonian which consists of spin matrices interacting with a constant classical magnetic ﬁeld, the bath photon ﬁeld Hamiltonian expressed as a quadratic form in the creation and annihilation operators and an interaction Hamiltonian expressed as a time varying linear combination of the products(tensor) between the atomic spin observables and the bath annihilation and creation variables, then assume that the density of ρ(t) of the system and bath can be expanded as a Glauber-Sudarshan integral: ρ(t) = ρA (t, u) ⊗ |e(u) >< e(u)|du where ρA (t, u) is a ﬁnite dimensional matrix (of the same order as the spin observables of the atomic system). Finally, we substitute this expansion into the quantum equation of motion iρ� (t) = [HA + HF + HAF (t), ρ(t)] where HA is the atomic Hamiltonian, HF is the bath ﬁeld Hamiltonian and HAF (t) is the atomic-bath ﬁeld interaction Hamiltonian. Using properties of creation and annihilation operators acting on the exponential/coherent vectors and integration by parts, we then derive a partial diﬀerential equation for ρA (t, u) which may be called the fundamental equation of quantum optics. After these discussions, we proceed to one of the most modern techniques in time dependent quantum measurement theory. The time dependent HP theory, ie quantum stochastic calculus also has a nice physical interpretation. When the average values of the creation, and annihilation processes in a coherent state are calculated, we get time integrals of the product of the coherent state deﬁning vector with the creation/annihilation process deﬁning vector upto time t. This result can be interpreted physically by saying that the annihilation (creation) process acts on a coherent state and yields the total amplitude and phase of photons annihilated (created) upto time t taking into account the relative polarization of the photons in the coherent state with respect to that of the annihilation (creation) process deﬁning vector. On the other hand, the conservation process average in a coherent state gives a time integral of a quadratic product of the coherent state deﬁning vector upto time t which has the interpretation as being the number of photons in the state present upto time t. The conservation process in the HP calculus has the Poisson process interpretation in classical probability theory just as the creation and annihilation process are linked to Brownian motion. The whole subject of quantum probability and quantum stochastic processes can be viewed as an linear-algebrafunctional analytic approach to probability theory with generalizations to the non-commutative case. As an important application of the HP stochastic calculus, we present the celebrated work of V.P.Belavkin on quantum ﬁltering. To do so, we ﬁrst note that in quantum mechanics two or more observables may not be simultaneously measurable when they do not commute but when they commute, they can be simultaneously diagonalized and hence simultaneously measured. Measurement on a quantum system causes the state of the system to collapse to a state dictated by the outcome of the measurement or of the measurement is made by a set of projection valued operators (pvm) or more generallyn by a set of positive operators (povm), then the state of the system collapses to a state formed by superposing the collapsed states corresponding to each measurement outcome. Belavkin proposed a scheme of constructing a ﬁltration on an Abelian Von-Neumann algebra of observables that satisﬁes the non-demolition property, ie, the algebra generated by the elements of the ﬁltration upto time t is Abelian and also commutes with the states of the HP noisy Schrodinger equation at times s ≥ t. Such measurements, he called non-demolition measurements. The HP noisy Schrodinger equation determines a unitary evolution in the joint system and bath space h ⊗ Γs (L2 (R+ ) ⊗ Cd ). The dynamics of the unitary evolution is dictated by the system Hamiltonian, the fundamental creation, annihilation and conservation processes of the HP quantum stochastic calculus which are operators in the bath space and connect to the system dynamics via system operators. There is also a quantum Ito correction term to the system Hamiltonian in the form of an additive skew Hermitian operator that ensures unitarity of the evolution. It is known that if one computes the Heisenberg dynamics of a system observable using these unitary evolution operators, then the standard Heisenberg equations of motion are obtained along with noise correction terms which and the system observables after ﬁnite time t evolves to an observable in the tensor product of the system Hilbert space and the noise bath space. Speciﬁcally, if U (t) denotes the evolution operator of the HP noisy Schrodinger equation and X is a system observable, then after time t it evolves to jt (X) = U (t)∗ XU (t) which satisﬁes the noisy Heisenberg equations of motio. Belavkin proposed that if we take an input noise process Yin (t) which is a commuting family of operators in the bath space and deﬁne the output noise process Yout (t) = U (t)∗ Yin (t)U (t), then by virtue of the fact that the unitarity of U (t) depends only on system operators which commute with bath operators, it follows that Yout (t) = U (T )∗ Yin (t)U (T ) for all T ≥ t from

Quantum Mechanics for Scientists and Engineers Preface

xi vii

which it is easy to see that Yout (t) commutes with Yout (s) for all t, s ≥ 0 and further that Yout (t) commutes with js (X) for all s ≥ t. This commutativity enables us to jointly measure jt (X), Yout (s), s ≤ t and hence deﬁne the conditional expectation πt (X) = E(jt (X)|Yout (s), s ≤ t). The family πt (X), t ≥ 0 of operators forms an Abelian family since πt (X) is a function of Yout (s), s ≤ t and the latter form a commuting family of operators. Belavkin derived a stochstic diﬀerential equation for πt (X) driven by the process Yout (.) and its derivation was greatly simpliﬁed using quantum versions of the Kallianpur-Striebel formula and other methods based on orthogonality properties of the conditional expectation estimate by John Gough and Kostler. The resulting equation for πt (X) is the fundamental quantum ﬁltering equation and is the non-commutative generalization of Kushner’s equation for classical non-linear ﬁltering. These aspects have been discussed in this book. It should be noted that although this version of the ﬁltering equations takes place in the observable domain, it could be directly transformed to the density domain. This is analogous to the situation of classical ﬁltering where we can describe the evolution of the conditional moments of the state or more generally of the conditional expectation of any function of the state at time t given measurements upto time t or equivalently of the conditional probability density of the state at time t given measurements upto time t. To obtain the Belavkin stochastic diﬀerential equation for the conditoinal density of the state at time t given output measurements upto time t, we must simply note that we can write πt (X) = T r(ρt X) where ρt can be viewed as a density matrix in the system Hilbert space that is a function of the output measurement process upto time t, or equivalently, simply as a classical random process with values in the space of system space density operators. Classical random because the measurement operators commute. We substitute πt (X) = T r(ρt X) into the Belavkin observable version of the ﬁltering equation and using the arbitrariness of X, we derive a classical stochastic diﬀerential equation for the system state valued classical random process ρt driven by Yout (t). Such equations are called ”Stochastic Schrodinger equations” (Luc Bouten, Ph.D thesis on quantum optics ﬁltering and control). We present a generalization of Belavkin’s work by ﬁrst constructing a family of p commuting inputmeasurement processes which are expressible as linear combinations of the creation, annihilation and conservation processes. Such processes have independent increments in coherent states and the quantum Ito’s formula leads in general to the fact that any integer power of the diﬀerentials of such processes is non zero just as in the classical case (dN )k = dN, k = 1, 2, ... where N (.) is a Poisson process. The Belavkin ﬁlter is constructed by assuming it to have the form Gmkt (X)(dYmout (t))k dπt (X) = Ft (X)dt + k≥1,m=1,2,...,p

with Ft (X)andGmkt (X) being measurable with respect to the algebra generated by the output measurments upto time t. The coeﬃcients Ft (X), Gmkt (X) are determined by applying the quantum Ito formula to the orthogonality equation E((jt (X) − πt (X))Ct ) = 0

where

dCt =

m,k

fmk (t)Ct (dYmout (t))k , t ≥ 0, C0 = 1

and using arbitrariness of the complex valued functions fmk (t). It should be noted that in the absence of any measurement, ie, when the system evolves according to the HP noisy Schrodinger equation, we can take a system observable X, obtain its noisy evolution jt (X) and then choose a pure states of the form |fk ⊗ φ(u) >, k = 1, 2 of the system and the bath where |fk > is a system state and |φ(u) > is a bath coherent state, then compute the matrix elements < f1 ⊗ φ(u)|jt (X)|f2 ⊗ φ(u) > and write down the evolution of this quantity. More generally, we can compute the system state at time t deﬁned via the duality equation T r(ρs (t)X) = T r((ρs (0) ⊗ |φ(u) < φ(u)|)jt (X)) or equivalently, as or equivalently, as

ρs (t) = T rB (U (t)(ρs (0) ⊗ |φ(u) >< φ(u)|)U (t)∗ ) < f1 |ρs (t)|f2 >= T r(ρs (t)|f2 >< f1 |) =

T r((ρs (0) ⊗ |φ(u) >< φ(u)|)jt (|f2 >< f1 |))

where |fk >, k = 1, 2 are system states. The resulting diﬀerential equation for the system state ρs (t) is the generalized GKSL equation (Gorini, Kossakowski, Sudarshan, Lindblad). If u = 0, ie, the bath is in vaccuum, then the ordinary GKSL equation is obtained. Using the dual of the GKSL equation, we get a description of the evolution of system observables when corrupted by bath noise in such a way that the system observable always remains a system observable, ie, averaging out over the bath noise variables is being performed at each time. The dual GKSL can be used to describe non-Hamiltonian quantum dissipative systems, for example, damped quantum harmonic oscillators or lossy

Quantum Mechanics for Scientists andPreface Engineers

xii viii

quantum transmission lines. We then introduce the notion of quantum control of the Belavkin stochastic Schrodinger equation in the sense of Luc Bouten. This involves taking non-demolition Belavkin measurements on a quantum system evolving according to the HP noisy Schrodinger equation, then considering at each time point t, the Belavkin ﬁltered and controlled state ρc (t), applying the Belavkin ﬁlter evolution by making a non-demolition measurement dY (t) in time [t, t + dt] to obtain the Belavkin ﬁltered state at time t + dt as ρ(t + dt) = ρc (t) + Ft (ρc (t))dt + Gt (ρc (t))dY (t) c = exp(iZdY (t)) in the time interval [t, t + dt] to the Belavkin ﬁlter output and then applying a control unitary Ut,t+dt ρ(t + dt) to get the controlled state at time t + dt as c c∗ ρ(t + dt)Ut,t+dt ρc (t + dt) = Ut,t+dt

Here, Z is a system observable that commutes with dY (t). More precisely, Z should be replaced by U (t + dt)∗ ZU (t + dt) to make it commute with dY (t). We can then show by application of the quantum Ito formula that the system operator Z can be selected so that the evolution from ρc (t) to ρc (t + dt) is such that a major portion of the noise in the Belavkin ﬁlter is removed and the evolution of ρc (t) nearly follows that of the noiseless Belavkin equation, ie, the GKSL equation. The next topic discussed in this book is that of designing quantum gates using scattering theory experiments. The basic idea is to realize a unitary quantum gate using the quantum scattering matrix. The system consists of a free projectile having Hamiltonian H0 = P 2 /2m and the interaction potential energy between the projectile and the scattering centre is V (Q) so that the total Hamiltonian of the projectile interacting with the scattering centre is H = H0 + V . The projectile arrives from the inﬁnite past (t → −∞) from the input free particle state |φin >. This free state evolves according to H0 , ie, at time t this free particle state is exp(−itH0 )|φin >. After interacting with the scatterer, it goes to the input scattered state |ψin > which evolves according to H, ie, at time t, this input scattered state is exp(−itH)|ψin >. It follows that these two states coincide in the remote past, ie, limt→−∞ (exp(−itH)|ψin > −exp(−itH0 )|φin >) = 0 from which, we easily deduce that where

|ψin >= Ω− |φin > Ω− = slimexp(itH).exp(−itH0 )

After interacting with the scatterer for a suﬃcient long time, we ask the question, what is the probability amplitude of the projectile being in the free particle state |φout >?. To evaluate this, we must ﬁrst determine the scattered state, ie, the output scattered state |ψout > that develops as t → ∞ to |φout >. It is clear that the condition required for this is that limt→∞ (exp(−itH)|ψout > −exp(−itH0 )|φout >) = 0 or equivalently,

where

|ψout >= Ω+ |φout > Ω+ = slimt→∞ exp(itH).exp(−itH0 )

The domains of Ω− and Ω+ will not in general be the entire Hilbert space L2 (R3 ). In fact, determining the domains of deﬁnition of Ω± is a hard problem in operator theory and nice treatments of this delicate problem have been given in the masterful monographs of T.Kato (Perturbation theory for linear operators) and W.O.Amrein (Hilbert space methods in quantum mechanics). The scattering matrix S is an operator that maps free input particle states to free output particle states so that the scattering amplitude for the process |φin >→ |φout > is given by < φout |S|φin >=< ψout |ψin >= or equivalently,

< φout |Ω∗+ Ω− |φin > S = Ω∗+ Ω−

We deﬁne and derive a formula for the operator R in the from

R=S−I

< λ, ω � |R|λ, ω >=< λ, ω � |V − V (H − λ)−1 V |λ, ω >

Quantum Mechanics for Scientists and Engineers Preface

xiii ix

where λ is the energy of the total system or equivalently of the free projectile coming from t = −∞. The energy is conserved during the scattering process. The state |λ, ω > represents the incident free projectile having energy P 2 /2m = λ and with incoming momentum P being directed along the direction ω ∈ S 2 and likewise |λ, ω � > represents the free projectile state having energy λ and outgoing momentum P being directed along ω � ∈ S 2 . It should be noted that the scattering operator S conserves the energy since it commutes with H0 . This follows from the relations Ω− exp(−itH0 ) = exp(itH)Ω− , Ω+ exp(−itH0 ) = exp(itH)Ω+ for all t ∈ R, from which follows

exp(itH0 )Ω∗+ Ω− exp(−itH0 ) = Ω∗+ Ω−

which gives formally, on diﬀerentiating w.r.t t at t = 0,

[H0 , S] = 0 The design of the quantum gate is based on choosing a potential V so that for a given energy λ, the matrix ((< λ, ω � |R|λ, ω >))ω,ω is as close as possilble to Ug − I where Ug is a given N × N unitary matrix and N is the number of discretized directions ω chosen on the unit sphere. Another topic of importance discussed in this book is a rough idea about how a uniﬁed quantum ﬁeld theory can be developed encompassing gravitation, electromagnetism, the electronpositron ﬁeld of matter and if possible other Yang-Mills ﬁelds. The theory developed relies heavily on identifying the correct connection for the covariant derivative of spinor ﬁelds in the presence of a curved space-time metric. The construction of this gravitational connection for Dirac spinors can be achieved in terms of the tetrad components of the gravitational metric and the Lorentz generators in the Dirac spinor representation. This construction can be found in Weinberg’s book (Gravitation and Cosmology: Principles and Applications of the General Theory of Relativity). This connection can also be used for Yang Mills ﬁeld. Once this has been done, it is an easy matter to construct the Lagrangian density of the Dirac or Yang-Mills ﬁeld in curved space-time. This Lagrangian density is added to the Einstein-Hilbert Lagrangian density of the gravitational ﬁeld and to the Lagrangian density of the electromagentic and Yang-Mills gauge ﬁelds, and to the Lagrangian density of scalar Higgs ﬁeld taking once again into account the gravitational connection using the Tetrad. This total Lagrangian density is a function of the Dirac wave function, the Yang-Mills wave function, the tetrad components of the gravitational metric and the gauge ﬁelds, namely the electromagnetic four potential, ie, the U (1) gauge ﬁelds for the Dirac equation and the non-Abelian SU (N ) gauge ﬁelds for the Yang-Mills equations. The corresponding four dimensional action integral is then constructed and Feynman diagrammatic rules are formulated using the path integral approach for determining the probability amplitude for scattering/absorption-emission processes involving gravitons, photons, electrons, positrons, and particles associated with the Yang-Mills ﬁelds. The basic idea for calculating these amplitudes is to use Feynman’s trick: Identify the parts Sq of the action S that are quadratic in the ﬁelds retain them in the exponent exp(iS). The cubic and higher degree terms of the ﬁelds appearing in S denoted by Sc are considered as perturbations thus writing down exp(iS) = exp(iSq )(1 + iSc − Sc2 /2 − iSc3 /6 + ...) and the path integral is evaluated using the basic Wick theorem which roughly states that higher moments of a Gaussian distribution can be decomposed into a sum over products of second order moments, ie, propagators. Other schemes of quantization of all the ﬁelds can be found in the textbook by Thiemann (Modern canonical quantum general relativity). Finally, we discuss the modern theory of Supersymmetry which is one of the mathematically successful attempts to unify the various ﬁeld theories like electromagnetism, Dirac’s relativistic quantum mechanics, the scalar ﬁeld theories of Klein-Gordon and Higgs, the Yang-Mills gauge theories and even general relativity. The idea here is to introduce four anticommuting Majorana Fermionic variables θ and to deﬁne a superﬁeld S[x, θ] as a function of both the bosonic space-time coordinates x = (xμ ) and the four Fermionic variables θ = (θa ). When the superﬁeld is expanded in powers of θ, all terms involving ﬁve or more θ� s vanish owing to the anticommutativity of the four θ� s. Hence, the superﬁeld S is a fourth degree polynomial in θ and the coeﬃcients of θa , θa θb , θa θb θc , θ0 θ1 θ2 θ3 , 0 ≤ a < b < c ≤ 4 are functions ¯ a a = 0, 1, 2, 3 that are of x only. Then following Salam and Stratadhee, we introduce supersymmetric generators La , L vector ﬁeld vector ﬁeld in super-space (x, θ). These generators satisfy the standard anticommutation relations required for supersymmetry generators, ie, their anticommutators are bosonic generators: ¯ b } = γ μ ∂/∂xμ {La , L ab The general form of these generators can be obtained using standard group theoretic arguments: Deﬁne the composition of superspace variables as (xμ , θa ).(xμ , θa ) = (xμ + xμ� + θT Γμ θ, θa + θa )

Quantum Mechanics for Scientists and Engineers Preface

xiv x

Then, the supersymmetry generators (which are supervector ﬁelds) are calculated by expressing ∂ f (x, θ).(x� , θ� )) ∂xμ and

∂ f ((x, θ).(x� , θ� )) ∂θa evaluated at x� = 0, θ� = 0 in terms of ∂x∂ μ and ∂θ∂a . The supersymmetry generators when acting on a superﬁeld induce transformations on the component superﬁeld and it is noted that only the coeﬃcient of the fourth degree term in θ suﬀers a change that is a total diﬀerential. In fact, if the coeﬃcient of the fourth degree in θ is split into a sum of two components, one we call the D component and the other we write as a constant times the D’Alembertian acting on the scalar superﬁeld component, then the D component suﬀers a change that is a perfect divergence. Hence, the four dimensional space-time integral of the D component remains invariant under a supersymmetry transformation and hence this integral can be used as a candidate action for a supersymmetric ﬁeld theory. It is also noted that if we impose conditions that one part of the θ3 component, namely the λ component (called the gaugino) and the D component is zero, and further if the gauge component Vμ (x) which appear as coeﬃcients of the θ2 portions is a perfect gauge, ie, a perfect divergence, then after a supersymmetry transformation again λ and D vanish. Hence we obtain a subclass of the class of all superﬁelds, namely the Chiral superﬁelds which can be expressed as the sum of a left Chiral and a right Chiral superﬁeld. Chiral ﬁelds are characterized by the property that their λ and D components vanish and the Vμ component is a perfect gauge. We further note following the exposition of Wienberg that a left Chiral superﬁeld can be expressed as a function of xμ+ and θL only and likewise, a right Chiral superﬁeld can be expressed T as a function of xμ− and θR where xμ± = xμ ± θL �γ μ θR and θL = (1 + γ5 )θ/2, θR = (1 − γ5 )θ/2 are respectively the left chiral and right chiral projections of the Fermionic parameter θ. There are just two linearly independent left chiral Fermionic parameters and likewise two linearly independent right chiral Fermionic parameters. This means that cubic and higher degree terms in θL vanish and likewise cubic and higher terms in θR vanish. Further, left chiral superﬁelds are characterized by the property that certain ”right” superdiﬀerential operators DR acting on these superﬁelds give zero and right chiral superﬁelds are characterized by the property that certain ”left” superdiﬀerential operators DL acting on these superﬁelds give zero. Here DR and DL are deﬁned as respectively the right chiral and left chiral projections (mutliplication by (1 + γ5 )/2 and (1 − γ5 )/2) of a super vector ﬁeld D obtained by changing a sign in the expression of the supersymmetry generators. The proofs of these facts follow by showing ﬁrst that DR xμ+ = 0, DL xμ− = 0 and then noting that DR (anylef tsuperf ield) = 0, DL (anyrightsuperf ield) = 0 since left superﬁelds are functions of x+ , θL 2 only and right superﬁelds are functions of x− , θR only. Further it is readily shown that the coeﬃcient of θL (called the F -term) in a left chiral superﬁeld ΦL (x, θ) suﬀers change by a perfect divergence under a supersymmetry transformation and hence its space-time integral is a candiadate supersymmetric action. After this, we come to the crucial point: A kinematic Lagrangian density should canonically by a quadratic function of component superﬁelds just as kinetic energies are quadratic functions of momenta/velocities and potential energies of harmonic oscillators are quadratic functions of positions, or equivalently, the Klein-Gordon Lagrangian density is a quadratic function of the space-time derivatives of the ﬁeld minus a mass term which is a quadratic function of the ﬁeld itself. Now, if we start with a supeﬁeld S[x, θ] and consider the D-component of the superﬁeld S[x, θ]∗ S[x, θ], then we get terms that are bilienar in the D and C terms but no term that is quadratic or higher in the D term. Likewise, we get terms that are bilinear in the λ and ω terms (λ is a cubic component and ω is a ﬁrst degree component) but no terms that are quadratic or higher in the λ term. Now if we build our action from these terms, then standard Gaussian path integral considerations show that we must evaluate our path integrals by setting the variational derivatives of the action with respect to the various component ﬁelds to zero. But the variational derivative of the above mentioned action w.r.t. the D term of S ∗ S is the C term and likewise, the variational derivative of S ∗ S w.r.t the λ term is the ω terms. So we are led to the disastrous consequence that C = 0 and ω = 0, ie, we cannot have scalar ﬁelds like the Klein-Gordon ﬁeld or Fermionic ﬁelds like the Dirac ﬁeld. To rectify this problem, we assume that D = 0, λ = 0 in S while constructing the action [S ∗ S]D . In other, words, to construct matter ﬁeld Lagrangians, we take our basic matter superﬁeld S as a Chiral ﬁeld. For a given superﬁeld S[x, θ], the component superﬁelds C(x) (the scalar ﬁeld) (zeroth power of θ coeﬃcient), the Ferminoic ﬁeld ω(x) (ﬁrst power of θ coeﬃcient) and the other ﬁelds M (x), N (x) (which are coeﬃcients of the second power of θ) constitute the matter ﬁelds and the other components Vμ (x) (one set of components of the second power of θ) which is also called the gauge ﬁeld, λ(x) (one part of the Fermion ﬁeld appearing as a coeﬃcient of the third power of θ) also called the gaugino ﬁeld (and interpreted as the superpartner of the gauge ﬁeld) and the auxiliary ﬁeld D(x) appearing as a coeﬃcient of the fourth power of θ constitute the gauge part of the superﬁeld. To get a gauge invariant theory of supeﬁelds, ﬁrst we must form out of a left Chiral superﬁeld Φ[x, θ] (built out of the matter ﬁelds C, ω, M, N ) and a matrix Γ[x, θ] built out of the gauge superﬁelds Vμ , λ, D the D component [Φ∗ ΓΦ]D of Φ∗ ΓΦ (which will of course be supersymmetry invariant since it is the D component of a superﬁeld) and the transformation law of the matter part Φ and the gauge part Γ under a gauge transformation is deﬁned in such a way so that [Φ∗ ΓΦ]D remains invariant. This forms the part of the

Quantum Mc:dianiai for Scientists and Engineers

XV

total Lagrangian density that describes matter like the scalar field, the Dirac Fermionic field of electrons and positrons etc., and the interaction of the matter fields with the gauge fields. Finally, another superfi.eld W[x, 9] iB constructed out of the gauge and auxiliary components v,., >.., D such that W[x, 9] is a left Chiral field (and hence its F-component is supersymmetry invariant) and its F component has the fonn

c1F,..,F"" + c1XT"f" D,.>.. + c3D2 which guarantees gauge invariance of the F-pa.rt of W. Here, iF,.., [=a,.+ iV,., a.,+ iV.,] where ir1 an Abelian gauge theory, Yl' is simply a function of x so that F,.., = Vv,l" - \'s.,v iB the electromagne!ic field of photons and :FD,.>.. is the sum of>..T-y~' >..,,. which iB like a kinematic part of the Dirac Lagrangian density and >..Ti-y~'>..VI' which is like the interaction Lagrangian between the Dirac field of Fermions and the photon electromagnetic four potential field VI'. In a non-Abelian gauge theory, V,. is a Lie algebra valued function of x and then [VI', V.,] =F 0. It follows then that the structure constants of this Lie algebra will enter into the picture while defining and F~'v'D~'>... In this case, the above gauge Lagrangian density will describe particles arising in the Yang-Mills theories like Electrowea.k theories etc. When the matter and matter-gauge interaction Lagrangian [4?*f4?]D iB added to the above gauge Lagrangian, we obtain a Lagrangian density that is both supersymmetry and gauge invariant. In such a theory, if path integrals are evaluated with respect to certain auxiliary fields, we obtain equations like the Dirac relativistic wave equation with mass dependent on the scalar field, the Maxwell photon equations, the Yang-Mills equations and even gravitational fields can be included by more additions to the superfield. In short, supersymmetry provides the ideal ground for unifying all the known theories into a single framework, studying irlteractions between the particles associated with each theory and finally quantizing such a unified field theory using the Feynman path irltegral. We do not give all the details here for they can be found in the masterpiece ofWienberg (Supersymmetry, Cambridge University Press).

Table of Contents 1. Wave and Matrix Mechanics of Schrodinger, Heisenberg and Dirac

1-6

2. Exactly Solvable Schrodinger Problems, Perturbation THeory, Quantum Gate Design 7-16 3. Open Quantum Systems, Tensor Products, Fock Space Calculus, Feynman’s Path Integral 17-22 4. Quantum Field THeory, Many Particle Systems 23-48 5. Quantum Gates, Scattering THeory, Noise in Schrodinger and Dirac’s Equations 49-58 6. Quantum Stochastics, Filtering, Control and Coding, Hypothesis Testing and Non-Abelian Gauge Fields 59-74 7. Master Equations, Non-Abelian Gauge Fields 75-88 Appendix 89-208

Chapter

1

Wave and Matrix Mechanics of Schrodinger, Heisenberg and Dirac [1] The De-Broglie Duality of particle and wave properties of matter. A plane wave in one dimension is expressed by the following complex amplitude: ψ(t, x) = A.exp(i(kx − ωt))

where k = 2π/λ with λ as the wavelength and ν = ω/2π is the frequency. According to De-Broglie, we can associate a particle having momentum p = h/λ = hk/2π with such a wave where h is Planck,s constant. According to Planck, we can associate a quantum of energy E = hν = hω/2π with such a wave. Thus, we have (ih/2π)

∂ψ = (hω/2π)ψ, = Eψ ∂t

(−ih∇/2π)ψ = (hk/2π)ψ = pψ In the presence of an external potential V (x), E should be taken as p2 /2m + V (x) and hence, Eψ = (p2 /2m + V )ψ = (−h2 ∇2 /8π 2 m + V (x))ψ Although the above plane wave ψ cannot satisfy this equation for general V (x), we assume that the actual De-Broglie matter wave ψ associated with the particle is a solution to the above equation.. This is called the one dimensional Schrodinger equation. In short, by putting together De-Broglie’s matter-wave duality principle and Planck’s quantum hypothesis, we have heuristically derived the Schrodinger wave equation. [2] Bohr’s correspondence principle. Bohr’s correspondence principle is that associated with each observable in classical mechanics is an observable in quantum mechanics that follows certain quantization axioms which can be used to explain the energy spectra of atoms and quantum oscillators. For example, consider an electron of charge −e and mass m moving in a circular orbit of radius r around the nucleus. Its mometum is p = mv and the correspondence principle implies that Γ pdq = nh, n ∈ Z where q is the position coordinate and the lhs is the line integral around a complete orbit of the electron. Thus, we derive 2πpr = nh, p = nh/2πr, n ∈ Z Thi is the same as

mvr = nh/2π

According to the centripetal force law that keeps a particle in an orbit, mv 2 /r = KZe2 /r2 , K = 1/4π0 Thus, we get or and

n2 h2 /4π 2 mr3 = KZe2 /r2 r = n2 h2 /4π 2 mKZe2 v 2 = KZe2 /mr = 4π 2 K 2 Z 2 e4 /n2 h2

and finally, we get the energy spectrum of the particle: E == En = p2 /2m − KZe2 /r = mv 2 /2 − Ze2 /r = KZe2 /2r − KZe2 /r = −KZe2 /2r = −2π 2 mK 2 Z 2 e4 /n2 h2 , n = 1, 2, ...

This spectrum first derived by Bohr, agreed with experiments.

[3] Bohr-Sommerfeld’s quantization rules. If q is a canonical position coordinate vector and p the canonical momentum vector, then for cyclic motion, the Bohr-Sommerfeld’s quantization rules state that p.dq = nh, n ∈ Z Γ

Quantum Mechanics for Quantum Scientists and Engineers Mechanics

2 2

where Γ is a closed loop. A special case of this rule was applied earlier by Bohr to derive the spectrum of the Hydrogen atom. For example, if the system is described by action-angle variables I, θ, then we get 2π I(θ)dθ = nh, n ∈ Z 0

Sommerfeld applied this to relativistic quantum mechanics according to which the Hamiltonian of the particle is given by H(q, p) = c p2 + m2 c2 − eV (q) Writing H = E and solving for p, we get

p = ±[(E + eV (q))2 + c2 p2 + m2 c4 ]1/2 = ±p(q, E) If p given by this expression vanishes at q = q1 , q2 , then the Bohr-Sommerfeld quantization rule states that the energy spectrum of this relativistic particle is given by q2 2 p(q, E)dq = nh, n ∈ Z q1

Sommerfeld used this to derive the relativistic spectrum of the Hydrogen atom which improves upon the non-relativistic spectrum given by Bohr. [4] The principle of superposition of wave functions and its application to the Young double slit diﬀraction experiment. In quantum mechanics, a pure state in the position representation is described by a complex valued wave function ψ(x), x ∈ R3 . Given two such wave functions ψ1 (x), ψ2 (x), we can construct another wave function ψ(x) = c1 ψ1 (x) + c2 ψ2 (x), where c1 , c2 are two complex numbers. |ψk (x)|2 is proportional to the intensity of the wave ψk at x, k = 1, 2 and |ψ(x)|2 = |c1 |2 |ψ1 (x)|2 + |c2 |2 |ψ2 (x)|2 + 2Re(¯ c1 c2 ψ¯1 (x)ψ2 (x)) is proportional to the intensity of the wave ψ at x. In particular, we can take c1 = c2 = 1, then the intensity of ψ is the intensity of ψ1 plus the intensity of ψ2 plus an interference term 2Re(ψ¯1 (x)ψ2 (x)). This last term is a purely quantum mechanical eﬀect. This fact has been marvellously illustrated by Feynman using the Young double slit experiment: We may two slits in a cardboard sheet and place an electron gun behind this sheet. On the other side of the sheet is a screen than can record the impact of electrons. If the ﬁrst slit is open and the second closed, then the electron intensity pattern on the screen shows a maximum value at the portion of the screen directly in front of the ﬁrst slit and decays down as the distance from the ﬁrst slit increases on both the sides. Likewise, if slit one is closed and slit two is open, then the electron intensity on the screen in front of slit two shows a maximum. We denote the former intensity pattern by I1 (x) and the latter intensity pattern by I2 (x). Now, when both the slits are open, if the electron were to behave as classical particles, we should expect the intensity pattern on the screen to be I1 (x) + I2 (x), ie, maxima at both the points on the screen in front of the ﬁrst and second slit respectively. But this is not what we observe. Instead we observe a maximum on the screen at a point in front of the middle between the two slits, ie at a point on the screen that is equidistant from the two slits. Further, as we move away from this intensity maximum point, the intensity shows a sinusoidal variation. This can be explained only by the presence of the above interference term. For example, if ψ1 (x) = A1 .exp(ik1 x), ψ2 (x) = A2 .exp(ik2 x) where x is the distance on the screen from the central point P that is equidistant from both the slits, then the intensity pattern on the screen when both the slits are open is given by I(x) = |ψ1 (x) + ψ2 (x)|2 = |A1 |2 + |A2 |2 + 2|A1 ||A2 |cos((k1 − k2 )x + φ) where

φ = arg(A1 ) − arg(A2 )

It follows that if φ = 0 (which will be the case when the two slits are equidistant from the electron gun, then I(x) will show a maximum at x = 0 and its spatial variation will be sinusoidal with the distance between two successive maxima or between two successive minima being given by 2π/|k1 − k2 |. Thus we are forced to conclude that electrons exhibit wave behaviour at some time which is completely in accord with the De-Broglie matter-wave duality principle. The Heisenberg uncertainty principle can also be explained using this setup. The diﬀerence between the two electron momenta parallel to the screen is Δp = h|k1 − k2 |/2π according to De-Broglie and the position measurement uncertainty is the distance between an intensity maximum and an intensity miniumum on the screen, ie, Δx = π/|k1 − k2 |. Thus, Δp.Δx ≈ h/2 This means that if we attempt to measure the momentum diﬀerence accurately then the position measurement will become less accurate and vice versa.

Quantum Quantum Mechanics for Scientists and Engineers

3 3

[5] Schrodinger’s wave mechanics and Heisenberg’s matrix mechanics. In wave mechanics, the state of a quantum system at time t is deﬁned by a normalized vector |ψ(t) > in a Hilbert space H and this vector satisﬁes a ”Schrodinger wave equation”: i

d|ψ(t) > = H(t)|ψ(t) > dt

where H(t) is a possibly time varying self-adjoint operator in H. If H(t) = H is time independent, then the solution can be expressed as |ψ(t) >= exp(−itH)|ψ(0) > where exp(−itH) is a unitary operator in H and may be deﬁned via resolvents: exp(−itH) = limn→∞ (1 + itH/n)−n We note that even if H is an unbounded operator, this deﬁnition of the exponential may make sense using the theory of resolvents and spectra. (A complex number z is said to be in the resolvent set ρ(H) of H if (z − H)−1 exists and is a bounded operator. The complement of ρ(H) denoted by σ(H) is called the spectrum of the operator H. Since here H is Hermitian, we have the spectral representation H = d xE(dx) R

so that exp(−itH) = and

(1 + itH/n)−n = so for any |f >∈ H, we have � (1 + itH/n)−n f − exp(−itH)f �2 =

R

R

exp(−itx)E(dx)

(1 + itx/n)−n E(dx)

|(1 + itx/n)−n − exp(−itx)|2 < f |E(dx)|f >

which converges by the dominated convergence theorem to zero as n → ∞. We note that if T is any densely deﬁned operator in H, bounded or unbounded, and if for z ∈ ρ(T ), we have an inequality of the form � (z − T )−1 �≤ f (z) then

� (1 + itT /n)−n �≤� (1 + itT /n)−1 �n ≤ nf (in/t)/|t|

and this may remain bounded as n → ∞. On the other hand, (1 + itT /n)n is unbounded if T is unbounded and hence we cannot deﬁned exp(itT ) as the limit of this as n → ∞ (Reference: T.Kato, Perturbation theory for linear operators, Springer). In wave mechanics, the observables like position, momentum, angular momentum, energy etc. are represented by selfadjoint operators in the Hilbert space H and these do not vary with time while the state |ψ(t) > varies with time. Hence, if X is an observable, then its average at time t is given in the Schrodinger wave mechanics picture by < ψ(t)|X|ψ(t) >. On the other hand, in the Heisenberg matrix mechanics picture, observables change with time while the state remains the same and hence, the average of X at time t in the Heisenberg picture is < ψ(0)|X(t)|ψ(0) >. Since the physics must be the same no matter which model we use, we must have < ψ(t)|X|ψ(t) >=< ψ(0)|X(t)|ψ(0) > − − −(1) and diﬀerentiating this equation w.r.t. time gives i < ψ(t)|[H(t), X]|ψ(t) >=< ψ(0)|X � (t)|ψ(0) > Writing then gives us

|ψ(t) >= U (t)|ψ(0) > X � (t) = iU (t)∗ [H(t), X]U (t) = [iU (t)∗ H(t)U (t), X(t)]

Quantum Mechanics for Scientists Engineers Quantumand Mechanics

4 4

We could also directly write (1) as < ψ(0)|U (t)∗ XU (t)|ψ(0) >=< ψ(0)|X(t)|ψ(0) > and hence

X(t) = U (t)∗ XU (t)

from which we obtain on diﬀerentiation, X � (t) = iU (t)∗ [H(t), X]U (t) = i[U (t)∗ H(t)U (t), X(t)] This is Heisenberg’s equation of matrix mechanics. In particular, if H(t) = H does not vary with time, we get U (t)∗ HU (t)∗ = H and Heisenberg’s equation of matrix mechanics becomes X � (t) = i[H, X(t)] with solution,

X(t) = exp(itH).X.exp(−itH)

[6] Dirac’s replacement of the Poisson bracket by the quantum Lie bracket. The Poisson bracket between two observables u(q, p) and v(q, p) satisﬁes {u, vw} = {u, v}w + v{u, w} and likewise for the ﬁrst argument. If we agree that an analogous bracket [.] exists for non-commutative quantum observables with the same ordering preserved, then we must have [uv, wz] = [uv, w]z + w[uv, z] = u[v, w]z + [u, w]vz + wu[v, z] + w[u, z]v on the one hand and on the other hand, [uv, wz] = [u, wz]v + u[v, wz] = [u, w]zv + w[u, z]v + u[v, w]z + uw[v, z] Equating these two expressions gives [u, w]vz + wu[v, z] = [u, w]zv + uw[v, z] Hence

[u, w](vz − zv) = (uw − wu)[v, z]

It follows from the arbitrariness of the four observables u, v, w, z that the quantum bracket [.] should have the form [u, w] = c(uw − wu) where c is a complex constant. In other words, the quantum bracket that replaces the classical Poisson bracket must be proportional to the Lie bracket. [7] Duality between the Schrodinger and Heisenberg mechanics based on Dirac’s idea. In Schrodinger’s wave mechanics, the state of the system at any time t is described by a density operator ρ(t) in a Hilbert space H. In other words, ρ(t) ≥ 0, T r(ρ(t)) = 1. Further, it satisﬁes the Schrodinger-Liouville- Von-Neumann equation of motion: iρ� (t) = [H(t), ρ(t)] This follows by writing the spectral decomposition of ρ(t) as |ψk (t) > pk < ψk (t)| ρ(t) = k

in diagonal form with

p�k s

constant and

k

pk = 1, pk ≥ 0. The ψk (t)� s satisfy the Schrodinger wave equation i|ψk� (t) >= H(t)|ψk (t) >

Quantum Quantum Mechanics for Scientists and Engineers

and hence

5 5

−i < ψk� (t)| =< ψk (t)|H(t)

so that

iρ� (t) = =

k

We can write its solution as

(i|ψk� (t) > pk < ψk (t)| + i|ψk (t > pk < ψk� (t)|)

k

(H(t)|ψk (t) > pk < ψk (t)| − |ψk (t)pk < ψk (t)|H(t)) = [H(t), ρ(t)] ρ(t) = U (t)ρ(0)U (t)∗

where U (t) is a unitary operator satisfying the Schrodinger equation U � (t) = −iH(t)U (t) In this picture, observables do not vary with time. Thus, the average value of an observable X at time t is given by pk < ψk (t)|X|ψk (t) > < X > (t) = T r(ρ(t)X) = k

In the Heisenberg picture, the observable X changes at time t to X(t) while the state ρ(0) does not change. Hence, in this picture the average of the observable at time t is < X > (t) = T r(ρ(0)X(t)) and the two pictures must give the same average. Thus, T r(ρ(0)X(t)) = T r(ρ(t)X) = T r(U (t)ρ(0)U (t)∗ X) = T r(ρ(0)U (t)∗ XU (t)) and hence from the arbitrariness of ρ(0), it follows that dynamics of observables in the Heisenberg picture must be given by X(t) = U (t)∗ XU (t) and hence where

˜ X(t)] X � (t) = iU (t)∗ (H(t)X − XH(t))U (t) = i[H(t), ˜ H(t) = U (t)∗ H(t)U (t)

In particular, if H(t) = H is time independent, then U (t) = exp(−itH) and we get X � (t) = i[H, X(t)] for the Heisenberg dynamics and

iρ� (t) = [H, ρ(t)]

for the Schrodinger dynamics. [8] Quantum dynamics in Dirac’s interaction picture. Suppose H(t) = H0 + V (t) is the system Hamiltonian. The Schrodinger evolution operator U (t) satisﬁes iU � (t) = H(t)U (t) We write

U (t) = U0 (t)W (t), U0 (t) = exp(−itH0 )

Then, substituting this into the above Schrodinger evolution equation gives iW � (t) = V˜ (t)W (t), V˜ (t) = U0 (t)∗ V (t)U0 (t)

Quantum Mechanics for Scientists Engineers Quantum and Mechanics

6 6

This leads us to the Dirac interaction picture where a state |ψ > evolves according to the Hamiltonian V˜ (t) while an observable X evolves according to the Hamiltonian H0 . This keeps the physics invariant since if X is an observable and |ψ > is the state at time 0, then in the Dirac interaction picture, the average of the observable in the state at time t is given by (The subscript i stands for evolution of states or observables in the interaction picture. Thus |ψi (t) >= W (t)|ψ =< ψ|W (t)∗ (U0 (t)∗ XU0 (t))W (t)|ψ > where W (t) = T {exp(−i Now, as a check

t

V˜ (τ )dτ )}

0

d (U0 (t)W (t)) = U0� (t)W (t) + U0 (t)W � (t) = dt −iH0 U0 (t)W (t) + U0 (t)(−iV˜ (t))W (t)

= −iH0 U0 (t)W (t) − iU0 (t)U0 (t)∗ V (t)U0 (t)W (t)

= −iH0 U0 (t)W (t) − iV (t)U0 (t)W (t) = −iH(t)U0 (t)W (t)

˜ (t) satisﬁes the same equation as U (t) and hence must equal U (t). Thus, U0 (t)W (t) = U (t) as In other words, U0 (t)U expected and this shows that < ψi (t)|Xi (t)|ψi (t) >=< ψ|U (t)∗ XU (t)|ψ > which is in agreement with the Schrodinger and Heisenberg pictures. [9] The Pauli equation: Incorporating spin in the Schrodinger wave equation in the presence of a magnetic ﬁeld. We describe how the Pauli equation can be used to calculate the average electric dipole moment and average magnetic dipole moment of an atom in an external electromagnetic ﬁeld. This calculation enables us to calculate the electric polarization ﬁeld and the magnetization ﬁeld of a material. It will be a consequence of this calculation that the average electric dipole moment and hence polarization depends on both the external electric and magnetic ﬁelds and likewise for the average magnetic dipole moment and magnetization. The electric dipole moment observable is −er, r = (x, y, z) being the electron position relative to the nucleus. The magnetic dipole moment observable is μ = (e/2m)(L+gσ/2) where L = (Lx , Ly , Lz ) is the angular momentum observable vector and σ = (σx , σy , σz ) is the vector of Pauli spin matrices. A heuristic justiﬁcation of this fact [10] The Zeeman eﬀect: Let H0 be the Hamiltonian of an atom that commutes with the orbital and spin angular momentum operators (Lx , Ly , Lz ) = L (ie, H0 is rotation invariant as happens when H0 = p2 /2m0 + V (r) where V depends only on the radial coordinate) and (σx , σy , σz ) = σ. Then the eigenstates of the Hamiltonian operator H = H0 + e(L + gσ, B)/2m0 where B is a constant magnetic can be calculated as follows. We may assume without loss of generality that B is along the z axis. Then, H = H0 + (eB/2m)(Lz + gσz ) and since L2 , Lz , σz , H0 mutually commute, the energy eigenstates of H are labelled by four quantum numbers |n, l, m, s > where l(l+1) is an eigenvalue of L2 , m is an eigenvalue of Lz and s is an eigenvalue of σz . If E(n, l) is an energy eigenvalue of H0 ie of H without the magnetic ﬁeld, then this eigenvalue has a degeneracy of 2(2l + 1) corresponding to the 2l + 1 eigenvalues of Lz for a given eigenvalue l(l + 1) (ie,+ m = −l, −l + 1, ..., l − 1, l)of L2 and the two eigenvalues ±1/2 of σz . When the magnetic ﬁeld is turned on, these 2(2l + 1) degenerate eigenstates split into the the same number of nondegenerate eigenstates with eigenvalues E(n, l, m, s) = E(n, l) + (eB/2m0 )(m + gs), m = −l, −l + 1, ..., l − 1, l, s = ±1/2. The eigenstate of H corresponding to the eigenvalue E(n, l, m, s) is denoted by |n, l, m, s >. [11a] The spectrum of the Hydrogen atom. This is described by the stationary Schrodinger equation ∇2 ψ(r, θ, φ) + 2m(E + e2 /r)ψ(r, θ, φ) = 0 Writing

ψ(r, θ, φ) = R(r)Ylm (θ, φ)

2

Chapter |n, l, m, s >

Exactly Solvable Schrodinger Problems, Perturbation THeory, Quantum Gate Design l+1

=− −

2

l E(n, l, m, s) = E(n, l) + (eB/2m0 )(m + gs), m = −l, −l + 1, ..., l − 1, l, s = ±1/2. .

[11a] The spectrum of the Hydrogen atom. This is described by the stationary Schrodinger equation ∇2 ψ(r, θ, φ) + 2m(E + e2 /r)ψ(r, θ, φ) = 0 Writing

ψ(r, θ, φ) = R(r)Ylm (θ, φ)

where Ylm are the spherical harmonics, we get using ∂ 2 ∂ r − L2 /r2 ∂r partialr where L2 is the squared angular momentum operator: ∂ ∂ 1 ∂2 1 L2 = − sin(θ) − sin(θ) ∂θ ∂θ sin2 (θ) ∂φ2 ∇2 = r−2

Ylm is an eigenfunction of both the commuting operators L2 and Lz : L = r × p = −ir, L2 = −(r × ∇)2 Lz Ylm = mYlm , L2 Ylm = l(l + 1)Ylm Exercise: Verify that L given by the above diﬀerential expression coincides with 2

−(r × ∇)2 = (ypz − zpy )2 + (zpx − xpz )2 + (xpy − ypx )2 , The radial equation for R(r) thus becomes

px = −i∂x , py = −i∂y , pz = −i∂z

R�� (r) + 2R� (r)/r − l(l + 1)R(r)/r2 + 2m(E + e2 /r)R(r) = 0

or equivalently,

r2 R�� (r) + 2rR� (r) + 2m(Er2 + e2 r − l(l + 1)/2m)R(r) = 0

As r → ∞, this equation approximates to

r2 R�� (r) + 2mEr2 R(r) ≈ 0

or equivalently,

R�� (r) = −2mER(r)

Since for bound states, E must be negative, it follows that as r → ∞, we have R(r) ≈ C.exp(−αr), α = writing the exact solution as √ R(r) = f (r)exp(−αr), α = −2mE we get

R� = (f � − αf )exp(−αr), R�� = (f �� − 2αf � + α2 f )exp(−αr)

The Schrodinger equation now becomes

r2 (f �� − 2αf � + α2 f ) + 2r(f � − αf ) + (−α2 r2 + 2me2 r − l(l + 1))f = 0

or

r2 f �� + 2r(1 − αr)f � + (2me2 r − l(l + 1))f = 0

We solve this equation by the power series method. Substitute f (r) = c(n)rn+s n≥0

Then

n≥0

(n + s)(n + s − 1)c(n)rn+s + 2 +2me

2

or equivalently,

n≥0

n≥0

c(n)r

n≥0

(n + s)c(n)rn+s − 2α

n+s+1

− l(l + 1)

c(n)r

(n + s)c(n)rn+s+1

n≥0

n+s

n≥0

((n + s)(n + s + 1) − l(l + 1))c(n)rn+s

=0

√ −2mE. So,

Quantum Mechanics for Scientists and Engineers

8 8

+

(2me2 − 2(n + s)α)c(n)rn+s+1 = 0

Equating coeﬃcients of equal powers of r gives (s(s + 1) − l(l + 1))c(0) = 0, and for all n ≥ 1,

((n + s)(n + s + 1) − l(l + 1))c(n) + (2me2 − 2(n + s − 1)α)c(n − 1) = 0

We must assume c(0) �= 0 for otherwise, this recursion would imply that c(n) = 0∀n ≥ 0 which would mean that the wave function vanishes. Then, we get s(s + 1) = l(l + 1) so s = l since s = −l − 1 would give a singularity at r = 0 and the wave function would fail to be square integrable when l > 0. Further, for n large, ie, n → ∞, the above diﬀerence equation asymptotically is equivalent to c(n) ≈ 2αc(n − 1)/n or equivalently,

c(n) ≈ (2α)n /n!

and hence

f (r) =

n

so

c(n)rn+l ≈ rl exp(2αr)

f (r)exp(−αr) ≈ rl exp(αr)

which is not square integrable, in fact, it diverges exponentially as r → ∞. The only way out is that c(n) = 0 for all n ≥ n0 for some n0 ≥ 1. Let n0 be the smallest such integer. Then we get from the above recursion by putting n = n0 , and c(n0 − 1) �= 0, me2 = (n0 + s − 1)α = (n0 + l − 1)α Thus,

E = −me4 /2(n0 + l − 1)2

Thus, we get the result that the possible energy levels of the hydrogen atom are En = −me4 /2n2 , n = max(1, l), l + 1, l + 2, ..., This result was ﬁrst derived rigorously by Schrodinger in this way although it was earlier obtained by Niels Bohr using adhoc arguments like the correspondence principle. [11b] The spectrum of particle in a 3 − D box. This is described by the stationary state Schrodinger equation −∇2 ψ(r)/2m = Eψ(r), r = (x, y, z) ∈ [0, a] × [0, b] × [0, c] with boundary conditions that ψ(r) vanishes on the boundary, ie when either x = 0, a or y = 0, b or z = 0, c. The solution to this boundary valued problem is obtained by separation of variables and the result is the set of orthonormal wave functions ψnmk (r) = ((2/a)(2/b)(2/c))1/2 sin(nπx/a)sin(mπy/b)sin(lπz/c), n, m, k = 1, 2, 3, ... with the corresponding energy eigenvalues E = π(n2 /a2 + m2 /b2 + k 2 /c2 )1/2

[11c] The spectrum of a quantum harmonic oscillator. A quantum Harmonic oscillator has the Hamiltonian H = p2 /2m + mω 2 q 2 /2

Quantum Mechanics for Scientists and Engineers

9

where

9

[q, p] = i

We deﬁne the annihilation and creation operators by √ √ a = (p − imωq)/ 2m, a∗ = (p + imωq)/ 2m Then,

a∗ a = p2 /2m + mω 2 q 2 /2 − ω/2 = H − ω/2, aa∗ = H + ω/2

Thus,

[a, a∗ ] = ω

Now let |E > be any normalized eigenstate of H with eigenvalue E. Then we have 0 ≤< E|a∗ a|E >=< E|H − ω/2|E >= E − ω/2 Hence E ≥ ω/2

ie, the minimum energy level of the oscillator is ω/2. This is attained when and only when a|E >= 0. In other words, we have a|ω/2 >= 0 and hence

(d/dq + mωq) < q|ω/2 >= 0

or equivalently, < q|ω/2 >= C.exp(−mω 2 q 2 /2) with C being the normalizing constant chosen so that | < q|ω/2 > |2 = |C|2 exp(−mω 2 q 2 )dq 1 =< ω/2|ω/2 >= R

R

= |C| = (π/m) 2

so we may take

1/2

ω

−1

C = (π/mω 2 )1/4

[12] Time independent perturbation theory: Calculation of the energy levels and eigenfunctions of non-degenerate and degenerate systems using perturbation theory. The perturbed Hamiltonian is ∞ �k Vk H = H0 + k=1

The stationary state wave functions for H are expanded as

|ψ >= |ψ0 > +

k≥1

�k |ψk >

and the corresponding perturbed energy level is also expanded as a power series: E = E0 + �k Ek k≥1

Substituting these expressions into the eigenvalue problem H|ψ >= E|ψ > and equating coeﬃcients of � , k = 0, 1, 2, ... gives us the series k

H0 |ψ0 >= E0 |ψ0 >,

Quantum Mechanics for Scientists Engineers Quantum and Mechanics

10 10

(H0 − E0 )|ψk > +

k

Vr |ψk−r > +

r=1

k r=1

Er |ψk−r >, k ≥ 1

For each eigenvalue E0 = E0 (m) of H0 , let |m, r >, r = 1, 2, ..., d(m) denote an orthonormal basis of eigenstates of N (H0 − E0 (m)). Thus, the eigenvalue E0 (m) has a degeneracy of d(m). So we can write d(m)

|ψ0 >=

c(m, r)|m, r >

r=1

for E0 = E0 (m). The O(�) equation (H0 − E0 (m))|ψ1 > +V1 |ψ0 >= E1 |ψ0 > − − −(1) then gives on premultiplying by < m, r|, c(m, r) < m, k|V1 |m, r >= E1 c(m, r)δ[k − r] r

and hence, it follows that the for the unperturbed energy level E0 (m), the possible ﬁrst order perturbations E1 = E1 (m, s), s = 1, 2, ..., d(m) to the energy levels are given by the eigenvalues of the d(m) × d(m) secular matrix ((< d(m) m, k|V1 |m, r >))1≤k,r≤d(m) . Further, if cs (m) = ((cs (m, r)))r=1 is the eigenvector of this secular matrix corresponding d(m) to the eigenvalue E1 (m, s), then then the corresponding unperturbed state is given by r=1 cs (m, r)|m, r >. We note d(m) that the constants cs (m, r) can be chosen so that the states r=1 cs (m, r)|m, r >, s = 1, 2, ..., d(m) form an orthonormal basis for the d(m) dimensional vector space N (H0 − E0 (m)). Further, we get from (1) by premultiplying by < l, r| for l �= m, (E0 (l) − E0 (m)) < l, r|ψ1 > + < l, r|V1 |ψ0 >= E1 < l, r|ψ0 >= 0 so that

< l, r|ψ1 >= which implies that the unperturbed state |ψ0 >=

< l, r|V1 |ψ0 > E0 (m) − E0 (l)

r cs (m, r)|m, r

> gets perturbed to

|ψ0 > +�|ψ1 > +O(�2 ) where |ψ1 >= and

l�=m,r=1,2,...,d(m)

|l, r >< l, r|V1 |ψ0 > /(E0 (m) − E0 (l)) d(m)

< l, r|V1 |ψ0 >=

k=1

cs (m, k) < l, r|V1 |m, k >

We now calculate the second order pertrubation to the energy levels and the the corresponding eigenfunctions. [13] Time dependent perturbation theory: Calculation of the transition probabilities of a quantum system from one stationary state of the unperturbed system to another stationary state in the presence of a time varying interaction potential. The perturbed Hamiltonian has the form H(t) = H0 +

∞

�m Vm (t)

m=1

The Schrodinger evolution operator U (t) of this system is expanded as a perturbation series: �m Um (t) U (t) = U0 (t) + m≥1

Substituting this into the evolution equation

iU � (t) = H(t)U (t)

Quantum Mechanics for Scientists and Engineers

11

and equating equal powers of the perturbation parameter � gives us the sequence of diﬀerential equations: iU0� (t) = H0 U0 (t), � (t) − H0 Um (t) = iUm

Thus

m

k=1

Vk (t)Um−k (t), m ≥ 1

U0 (t) = exp(−itH0 ), Um (t) = −i

m

k=1

0

t

U0 (t − s)Vk (s)Um−k (s)ds, m ≥ 1 − − − (1)

Let now |n > be an eigenstate of the unperturbed system with energy E(n): H0 |n >= E(n)|n > Then when the perturbation is switched on, the transition probability amplitude from state |n > to state |m > in time [0, T ] with m �= n is given by < m|U (T )|n >= −i

r

�k < m|Uk (T )|n > +O(�r+1 )

k=1

where U1 (T ), ...., Ur (T ) are successively determined from (1). For example, t U0 (t − s)V1 (s)U0 (s)ds, U1 (t) = −i 0

U2 (t) = −i =−

0 to |m > for m �= n in time T is given by T exp(−i(E(m)(t − s) − E(n)s)) < m|V1 (s)|n > ds|2 + O(�3 ) PT (n → m) = �2 | < m|U1 (T )|n > |2 + O(�3 ) = �2 | 0

= �2 |

0

T

exp(i(E(m) − E(n))s) < m|V1 (s)|n > ds|2 + O(�3 )

In the limit as T → ∞, this probability converges to where Vˆ1 (ω) =

∞ 0

�2 |Vˆ1 (E(m) − E(n))|2 + O(�3 )

V1 (t)exp(iωt)dt is the one sided Fourier transform of V1 .

[14] The full Dyson series for the evolution operator of a quantum system in the presence of a time varying potential. [15] The transition probabilities in the presence of a stochastically time varying potential. If the potentials Vk (t), k = 1, 2, ... appearing in section (13) are operator valued stochastic processes, then the various perturbation terms Uk (t), k = 1, 2, ... in the expansion of the evolution operator will also be operator valued stochastic of terms, each term of which is a multilinear functional of Vr (.), r = 1, 2, ... processes and in fact, Uk (t) will be a sum with Vrj (.) appearing mj times such that j mj rj = k. The transition probability from state |n > to state |m > in time T will then be given by PT (n → m) = E[< m|U (T )|n > |2 ] = E[(|sumk≥1 �k < m|Uk (T )|n > |2 ] where the expectation is taken w.r.t the probability distribution of the random potentials Vk (.), k = 1, 2, .... [16] Basics of quantum gates and their realization using perturbed quantum systems.

Quantum Mechanics for Scientists and Engineers

12

If H0 is the unperturbed Hamiltonian and we apply a perturbation time varying potential p

V (t) =

fk (t)Vk

k=1

where the fk (t)� s are control functions, then the solution to the Schrodinger evolution unitary operator U (t) deﬁned by U � (t) = −iH(t)U (t), H(t) = H0 + δV (t) upto O(δ m ) is given by W (T ) = W (T, f ) = I +

U (t) = U0 (t)W (t), U0 (t) = exp(−itH0 ), (−i)r δ r fk1 (t1 )...fkr (tr )V˜k1 (t1 )...V˜kr (tr )dt1 ...dtr + O(δ m ) 0 converges weakly to zero. However since � en − em �= √ 2, n �= m, the sequence |en > does not converge strongly and in fact, no subsequence of this sequence can converge strongly. Now deﬁne the operators Tn = |en >< en |, n = 1, 2, .... Then < f |Tn |g >=< f |en >< en |g >→ 0, hence wlimTn = 0. Further � Tn f �2 = | < en |f > |2 → 0

hence slimTn = 0. Further, � Tn �= 1 and hence Tn does not converge in the operator norm. Assuming the spectral theorem stated above, we can deﬁne for any Borel set B in R the orthogonal projection E(B) = B dE(x). This means that for all f, g ∈ H, < f |E(B)|g >= B d < f |E(x)|g > where d < f |E(x)|g > deﬁnes a signed measure for each f, g. Note that the function x →< f |E(x)|g > has bounded variation since |d < f |E(x)|g > | = | < f |dE(x)|g > | ≤� f �< g|dE(x)|g >1/2 =� f �� dE(x)g � =� f � .(d(� E(x)g �)2 )1/2

where we use the fact that (dE(x))2 = dE(x) = dE(x)∗ and that for a < b, � (E(b) − E(a))g �2 =� E(b)g �2 − � E(a)g �2 We also have for < f |f >= 1, |d < f |E(x)|g > |2 = | < f |dE(x)|g > |2 ≤< dE(x)g|dE(x)g >=� dE(x)g �2 so that |d < f |E(x)|g > | ≤� dE(x)g �

Quantum Mechanics for Scientists and Engineers

14

Note that x →� E(x)g � is a non-decreasing function bounded above by � g � and hence is a function of bounded variation or more precisely, R d � E(x)g �=� g �< ∞. We have further, for < f |f >= 1 that | < f |E(x)|g > | ≤� E(x)g �

and also

| < f |E(y)|g > − < f |E(x)|g > | = | < f |(E(y) − E(x))|g > | ≤� (E(y) − E(x))g �

Now x →� E(x)g � is a function of bounded variation and x →< g|E(x)|g >=� E(x)g �2 is also a function of bounded variation since it is a non-decreasing function bounded above by � g �2 . Now, 4Re(< f |E(x)|g >) =< f + g|E(x)|f + g > − < f − g|E(x)|f − g > implying that x → Re(< f |E(x)|g >) is of bounded varation an likewise x → Im(< f |E(x)|g >) is also of bounded variation obtained by replacing f with i.f . Hence x →< f |E(x)|g > has bounded variation which enables us to deﬁne spectral integrals weakly using Riemann-Stieltjes integrals. Further, if xdE(x) H= R

then for c ∈ R, the projection Pc onto R((H − c)+ ) is given using the formula (H − c)+ = (x − c)dE(x) x≥c

as

Pc = E(c)

This formula enables us to constructively describe the spectral measure associated with a Hermitian operator. In fact, in the proof of the spectral theorem, we deﬁne Pc = f (H) = (H −c)+ by taking f (x) = max(x − c, 0) = (|x − c| + (x − c))/2 and then proving that c → Pc is a spectral measure such that cdPc = H. See T.Kato, ”Perturbation theory for linear operators”, Springer, for details. Remark: If z is any complex number with Im(z) �= 0, then for Hermitian H, (H − z)−1 = (x − z)−1 dE(x) so that the spectral norm of (H − z)−1 is given by � (H − z)−1 �= max(|x − z)−1 : x ∈ R} = |Im(z)|−1 < ∞ In paricular, (H − z)−1 is a bounded operator, ie z ∈ ρ(H) where ρ(H) is the resolvent set of H. More generally, it can be shown that if H is a closed symmetric operator, then ±i belong to ρ(H) iﬀ H is Hermitian. Remark: If T is an operator in a Hilbert space, its graph is deﬁned by Gr(T ) = {(x, T x) : x ∈ D(T )} T is said to be closed if Gr(T ) is closed, ie if xn ∈ D(T ), xn → x, T xn → y imply x ∈ D(T ) and y = T x. An operator T is said to be closable if the closure Cl(Gr(T )) of Gr(T ) is also the graph of some operator T˜. It is easy to see that this happens iﬀ xn ∈ D(T ), xn → x, T xn → 0 imply x = 0. Then we say that T is closable with T˜ being the closure of T . A operator T in H is said to be essentially selfadjoint if T is closable with its closure T˜ being selfadjoint. If E is a subset of D(T ) such that {(x, T x) : x ∈ D(T )} is dense in Gr(T ), then E is called a core for T . It is easy to see that this happens iﬀ for any x ∈ D(T ) there exists a sequence xn ∈ E such that � xn − x �2 + � T xn − T x �2 → 0, or equivalently, iﬀ for any x ∈ D(T ), < (x, T (x)), (z, T (z)) >= 0 for all z ∈ E implies x = 0. [19] The general theory of Events, states and observables in the quantum theory. The basic axiomatic set up that deﬁnes a quantum probability space is the triplet (H, P (H), ρ) where H is a Hilbert space, P (H) is the lattice of orthogonal projections in H and ρ is a state in H, ie, a positive deﬁnite matrix of unit trace. This is to be compared with a classical probability space deﬁned by the triplet (Ω, F, P ) where Ω is the sample space, F is a σ − algebra of subsets of Ω and P is a probability measure on F. The elements of F are called events in classical probability and likewise the elements of P (H), namely the orthogonal projections are the events in quantum probability. The elements of Ω in classical probability are called the elementary outcomes and likewise the elements of

Quantum Mechanics for Scientists and Engineers

15

H after normalizing are the elementary outcomes in quantum probability. Just as an event E in classical probability is a set of elementary outcomes, likewise in quantum probability, an event P which is an orthogonal projection can be viewed as an aggregated of elements of H using the spectral diagonalization method: P = |ψa >< ψa ||, |ψa >∈ H, < ψa |ψb >= δab a∈I

P is viewed as the aggregate of |ψa >, a ∈ I. Finally if μ is any measure on P (H), ie, μ : P (H) → [0, 1] is a map such that if Pn , n = 1, 2, ... are pairwise orthogonal orthogonal projections in H, then μ( n Pn ) = n μ(Pn ), μ(0) = 0, μ(I) = 1, then Gleason’s theorem asserts that there exists a positive deﬁnite operator ρ of unit trace such that μ(P ) = T r(ρP ), P ∈ P (H) This proves the generality of the setup of a quantum probability space. An observable in the quantum probability space (Ω, F, P ) is a Hermitian operator X in H. It has spectral representation X = xEX (dx) and the probability that when X is measured, the outcome will fall in the range (a, b] is T r(ρEX ((a, b])). In other words, FX (x) = T r(ρE((−∞, x])) is the probability distribution of X in the state ρ. This is to be compared with the classical setup where an observable X is a random variable in the the probability space (Ω, F, P ), ie a measurable map X : (Ω, F) → (R, B(R)). The probability distribution of X in the ”state” P , is FX (x) = P X −1 ((−∞, x])) = P ({ω ∈ Ω : X(ω) ≤ x}) To show that classical random variables are special cases of quantum observables, we deﬁne the spectral measure EX in L2 (R) by EX (B) = χB , B ∈ B(R)

ie EX (B) is multiplication by the indicator of the set B. Then we obviously have xEX (dx) X= R

Then, for any functionf : R → R, we have

f (X) =

and hence (f (X)g)(y) =

f (x)EX (dx)

f (x)χ(x,x+dx] (y)g(y) = f (y)g(y)

ie f (X) is the operator of multilication by f (x). The crucial diﬀerence between classical and quantum probability is that in the former, all observables commute while in the latter, they need not commute. Thus, if X, Y are two classical random variables deﬁned over the same probability space (Ω, F, P ), then both have same spectral measure:

X=

EX (B) = EY (B) = χB , xEX (dx), Y = yEY (dy), EX = EY

Hence, we can deﬁne the joint probability of X taking values in a Borel set B1 and Y taking values in a Borel set B2 as P (X ∈ B1 , Y ∈ B2 ) = P (X −1 (B1 ) ∩ Y −1 (B2 )) and in particular the joint probability distribution of the pair (X, Y ) is FXY (x, y) = P (X −1 ((−∞, x]) ∩ Y −1 ((−∞, y])) On the other hand, in quantum probability two observables X, Y need not commute, ie XY − Y X �= 0. Then the spectal measures of X, Y will diﬀer: X=

xEX (dx), Y =

yEY (dy), EX �= EY

Quantum Mechanics for Scientists and Engineers

16

and hence, we cannot speak of the event that (X, Y ) takes values in B1 × B2 in a given state. In fact, we have the basic Heisenberg uncertainty inequality T r(ρX 2 )T r(ρY 2 ) ≥ T r(ρXY )2 = (Re(T r(ρXY ))2 + (Im(T r(ρXY ))2 = (T r(ρ(XY + Y X)))2 /4 + (T r(ρ(XY − Y X)/i))2 /4 ≥ (T r(ρ[X, Y ]/i))2 /4

In particular, if [X, Y ] = ic, ie,

T r(ρX 2 ).T r(ρY 2 ) ≥ c2 /4

Thus, if X is replaced by X − T r(ρX) and Y by Y − T r(ρY ), we get that the product of the variances of X and Y in any state ρ is always ≥ c2 /4 and hence we cannot measure them simultaneously. We can measure any single Hermitian function of X, Y by not jointly two or more functions unless they commute. If X, Y commute, then they have the same spectral measure (jointly diagonable) and hence we can represent X = xEX (dx), Y = f (x)EX (dx) for some real valued function f and hence T r(ρEX (B1 Y in B2 . If X, Y do not commute, then the quantity

f −1 (B2 ))) is the joint probability of X taking values in B1 and

ψX (t1 , t2 ) = T r(ρ.exp(i(t1 X + t2 Y ))) need not be positive deﬁnite and hence it is not a characteristic function of any probability measure in R2 . However, for any a, b ∈ R, ψ(t) = T r(ρ.exp(it(aX + bY )) is positive deﬁnite in t and hence the characteristic function of the observable aX + bY . Thus, any single real linear combination or more generally, any single Hermitian matrix valued function of (X, Y ) is measurable with a deﬁnite characteristic function but two diﬀerent linear combinations may not have a joint characteristic function that is positive deﬁnite and hence the 2-D inverse Fourier transform of such a function need not be a probability measure on R2 . Exercise: Choose a general 2 × 2 state p1 z ρ= z¯ p2 so that

p1 , p2 ≥ 0, p1 + p2 = 1, p1 p2 − |z|2 ≥ 0

These conditions ensure that ρ is a state in C2 , ie, positive deﬁnite of unit trace. Take the three Pauli matrices 0 1 0 −i , σ = , σ1 = 2 10 i 0 1 0 σ3 = 0 −1 Calculate the joint characteristic function of these three observables in the state ρ, ie, ψ(t1 , t2 , t3 ) = T r(ρ.exp(i(t1 σ1 + t2 σ2 + t3 σ3 ))) and show that there exists a ρ such that ψ is not positive deﬁnite. [20] [21] [22] tics. [23] [24]

The evolution of the density operator in the absence of noise. The Gorini-Kossakowski-Sudarshan-Lindblad (GKSL) equation for noisy quantum systems. Distinguishable and indistinguishable particles: The Maxwell-Boltzmann, Bose-Einstein and Fermi-Dirac statisThe relationship between spin and statistics. Tensor products of Hilbert spaces: Let H1 , H2 be two Hilbert spaces. For (x1 , x2 ), (y1 , y2 ) ∈ H1 × H2 , deﬁne K((x1 , x2 ), (y1 , y2 )) =< x1 , y1 >1 < x2 , y2 >2

By Schur’s theorem, K is a positive deﬁnite kernel on H1 × H2 . Hence, by Kolmogorov’s consistency theorem, there exists a Gaussian process {λ(x1 , x2 ), (x1 , x2 ) ∈ H1 × H2 } on H1 × H2 such that E(λ(x1 , x2 )λ(y1 , y2 )) = K((x1 , x2 ), (y1 , y2 ))

Chapter

3

Open Quantum Systems, Tensor Products, Fock Space Calculus, Feynman’s Path Integral [20] [21] [22] tics. [23] [24]

The evolution of the density operator in the absence of noise. The Gorini-Kossakowski-Sudarshan-Lindblad (GKSL) equation for noisy quantum systems. Distinguishable and indistinguishable particles: The Maxwell-Boltzmann, Bose-Einstein and Fermi-Dirac statisThe relationship between spin and statistics. Tensor products of Hilbert spaces: Let H1 , H2 be two Hilbert spaces. For (x1 , x2 ), (y1 , y2 ) ∈ H1 × H2 , deﬁne K((x1 , x2 ), (y1 , y2 )) =< x1 , y1 >1 < x2 , y2 >2

By Schur’s theorem, K is a positive deﬁnite kernel on H1 × H2 . Hence, by Kolmogorov’s consistency theorem, there exists a Gaussian process {λ(x1 , x2 ), (x1 , x2 ) ∈ H1 × H2 } on H1 × H2 such that Quantum Mechanics E(λ(x1 , x2 )λ(y1 , y2 )) = K((x1 , x2 ), (y1 , y2 )) or equivalently, if < ., . > denotes the standard inner product in the underlying probability space, we can write < λ(x1 , x2 ), λ(y1 , y2 ) >= K((x1 , x2 ), (y1 , y2 )) =< x1 , y1 >1 < x2 , y2 >2 λ is called the tensor product of H1 with H2 . It is easy to prove that the maps xk → λ(x1 , x2 ), k = 1, 2 are linear. This is proved by expanding � λ(c1 x + c2 y, z) − c1 λ(x, z) − c2 λ(y, z) �2

and likewise for the second argument. [25] Symmetric and antisymmetric tensor products of Hilbert spaces, the Fock spaces. Let H be a ﬁnite dimensional Hilbert space of dimension N . Choose an orthonormal basis {|ei >: i = 1, 2, ..., N } for this space. Let ⊗ be a tensor product of this Hilbert space with itself as many times as needed. Using the deﬁnition of the inner product < u1 ⊗ ... ⊗ un , v1 ⊗ ... ⊗ vn >= Πni=1 < ui , vi > we easily see that

{ei1 ⊗ ... ⊗ ein : 1 ≤ i1 , ..., in ≤ N }

is an orthonormal basis of the Hilbert space H⊗n and hence dimH⊗n = N n . The state |ei1 ⊗ ... ⊗ ein > corresponds to a state of n distinguishable particles in which the k th particle is in the state |eik >. Since there may be repetitions of an index in i1 , ..., ik , it follows that more than one particle can occupy a given state. Now suppose that the n particles are indistinguishable but are bosons, ie, more than one particle can occupy a given state. Then a given state of the n-particle system is speciﬁed by saying how many particles occupy each distinct state |ei >. Such a state should be invariant under a permutation of the particles, ie, it must be superpositions of vectors in H⊗n of the form σ∈Sn |uσ1 ⊗ ... ⊗ uσn > where uk ∈ H. The set of all such superpositions forms a Hilbert space H⊗n , called the n-fold symmetric tensor product of H with itself. An orthonormal basis for this space is |er11 ...erNN >, r1 , ..., rN ≥ 1 N ⊗ ... ⊗ e⊗r > multiplied by the normalizing 0, r1 + ... + rN = n. where |er11 ...erNN > equals the symmetrization of |e⊗r 1 N factor (r1 !...rN !)−1 (n!/(r1 !...rN !))−1/2 = (n!r1 !...rN !)−1/2 . It is easy to see from this that dimH⊗sn is the number of ways in which n indistinguishable balls can be placed in N boxes and this number is the coeﬃcient of xn in the expansion (1 + x + x2 + ...)N = (1 − x)−N . We thus get −N N +n−1 (−1)n = dimH⊗s n = n n Finally, we consider the situation of Fermions which are also indistinguishable particles butno state can have more than one particle. Such a state consisting of n particles is a superposition of states of the form σ∈Sn sgn(σ)uσ1 ⊗ ... ⊗ uσn , ie, it is antisymmetric with respect to interchange of particles. The set of all such superpositions forms a subspace of H⊗n and is denoted by H⊗an and an orthonormal basis for this space is |ei1 ∧ ei2 ∧ ... ∧ ein >, 1 ≤ i1 < ... < in ≤ N , where u1 ∧ ... ∧ un = (n!)−1/2 sgn(σ)uσ1 ⊗ ... ⊗ uσn σ∈Sn

uantum Mechanics for Scientists and Engineers

18 where

It is easily seen that if any two of the u�i s are identical, then this n-fold wedge product is zero and this corresponds to the fact that no state have more than one particle, also called the Pauli exclusion principle for Fermions. The above results are sometimes expressed by saying that bosonic wave functions are symmetric under particle interchange while Fermionic wave functions are antisymmetric under particle interchange. We are now in a position to derive the MaxwellBoltzmann, Bose-Einstein and Fermi-Dirac statistics respectively for distinguishable particles, Bosons and Fermions. Suppose our quantum system has energy levels E1 , E2 , ... with respective degeneracies of g1 , g2 , .... The number of ways of placing nk distinguishable articles in the k th energy level Ek is gknk . Hence, the total number of ways of placing n distinguishable particles in these states such that nk particles occupy the energy level Ek is Ω(n1 , n2 , ...) = (

n! (g n1 ...gknk ...) n1 !n2 !... 1

The most {nk } is obtained by maximizing Ω(n1 , n2 , ...) subject to the number and energy conprobable distribution straints k nk = n, k gk nk = E. Using Stirling’s formula for the factorial and incorporating the above constraints using Lagrange multipliers yields the fact that the most probable distribution is obtained by maximizing 18 Quantum Mechanics F [n1 , n2 , ..., a, b] = (nk log(gk ) − nk log(nk )) − a( nk − n) − b( Ek nk − E) k

Setting

∂F ∂nk

k

k

= 0 gives log(gk ) − log(nk ) − 1 = a + bEk

or

nk = Cgk .exp(−bEk ), C = exp(−a − 1)

This is called the Maxwell-Boltzmann distribution C, b are determined by the conditions k nk = n, k Ek nk = E. Suppose now that we are concerned with bosons, ie, indistinguishable particles with no constraint on the number of particles in each state. The number of ways in which nk bosons can be distributed in the energy level Ek having gk k −1 boxes is nk +g and hence if no constraint is placed on the number of particles, the most probable distribution {nk } nk with a constraint on the total energy E is obtained by maximizing nk + gk − 1 F = log Ek nk − E) − b( nk k

k

or using Stirling’ formula and neglecting constants, ((nk + gk )log(nk + gk ) − nk log(nk )) − b( Ek nk − E) F = k

Setting

∂F ∂nk

k

= 0 gives log(nk + gk ) − log(nk ) − bEk = 0

or

1 + gk /nk = bEk , nk = gk /(exp(bEk ) − 1)

If we put a constraint on the total number of particles n, then the function to be maximized is ((nk + gk )log(nk + gk ) − nk log(nk )) − a( nk − n) − b( Ek nk − E) F = k

Setting

∂F ∂nk

or

k

k

= 0 then gives log(nk + gk ) − log(nk ) − a − bEk = 0 nk =

gk exp(a + bEk ) − 1

This is the Bose-Einstein distribution. The constants a, b are determined from the constraints nk = n, E k nk = E k

k

[26] Coherent/exponential vectors in the Fock spaces. Let H0 = L2 (R) and consider the standard position and momentum operators, q, p = −id/dq acting in this space. Construct the annihilation and creation operators a, a∗ in this space as √ √ a = (q + ip)/ 2, a∗ = (q − ip)/ 2 Show that

[a, a∗ ] = 1

a Quantum Mechanics for Scientists and Engineers

19

Now consider copies of H0 and in the k th copy, let the annihilation and creation operators be denoted by ak , a∗k . Thus, in the tensor product of p such copies H0⊗p , we have deﬁned operators a1 , ..., ap , a∗1 , ..., a∗p satisfying [ai , a∗j ] = δij , [ai , aj ] = 0, [a∗i , a∗j ] = 0. Let |n1 , ..., np > be the tensor product of the state |n1 >, ..., |np > where |nk > is the state in the k th √ √ copy of H0 such that ak |nk >= nk |nk − 1 >, a∗k |nk >= nk + 1|nk + 1 > and a∗k ak |nk >= nk |nk >, ak a∗k |nk >= p (nk + 1)|nk >. Deﬁne for z ∈ C , the state |e(z) >= z1n1 ...zpnp |n1 , ..., np > / n1 !...np ! n1 ,...,np ≥0

=

n1 ,...,np ≥0

∗np 1 z1n1 ...zpnp a∗n 1 ...ap |0 > /n1 !...np !

= exp(z.a∗ )|0 >, z.a∗ = z1 a∗1 + ... + zp a∗p Show using

< n1 , ..., np |m1 , ..., mp >= Πpk=1 δ[nk − mk ] = δ[n − m]

that for z, u ∈ Cp ,

< e(z)|e(u) >= exp(< z, u >)

Now, deﬁne for z ∈ Cp , n ≥ 0,

ψ(z ⊗n ) =

√

n! z1n1 ...zpnp |n1 , ..., np > n1 !...np !

n1 +...+np =n

Then show using the multinomial theorem that for z, u ∈ Cp , < ψ(z ⊗n ), ψ(u⊗n ) >=< z, u >n Now extend the map ψ linearly to the Boson Fock space Γs (Cp ) =

(Cp )⊗s n

n≥0

by deﬁning for |f (z) >=

z ⊗n √ ∈ Γs (Cp ) n! n≥0

ψ(|f (z) >) = |e(z) >

or equivalently,

√ ψ(z ⊗n / n!) = n

z n1 ...zp p 1 |n1 , ..., np >, n ≥ 0 n1 !...np ! n1 +...+np =n

Show that ψ deﬁnes a Hilbert space isomorphism between Γs (Cp ) and L2 (Rp ). This gives a physical interpretation of the coherent states in Boson Fock space in terms of the p-dimensional Harmonic oscillator algebra. [25] Creation, conservation and annihilation operators in the Boson Fock space. [26] The general theory of quantum stochastic processes in the sense of Hudson and Parthasarathy. [27] The quantum Ito formula of Hudson and Parthasarathy. [28] The general theory of quantum stochastic diﬀerential equations. [29] The Hudson-Parthasarathy noisy Schrodinger equation and the derivation of the GKSL equation from its partial trace. [30] The Feynman path integral for solving the Schrodinger equation. Let X(t), t ≥ 0 be a Markov process with values in R having generator K, ie E[φ(X(t + dt))|X(t) = x] = φ(x) + Kφ(x)dt + o(dt) Note that by including generalized functions such as the Dirac Delta function and its derivatives, we may represent K as an integral kernel: Kφ(x) = K(x, y)phi(y)dy We deﬁne

u(s, t, x) = E[f (X(t))exp(

s

t

V (X(s))ds)|X(s) = x], 0 ≤ s ≤ t

Quantum Mechanics for Scientists and Engineers

20

, s

Then an easy application of the Markov property shows that

and hence,

u(s, t, x) = (1 + V (x)ds)E[(u(s + ds, t, X(s + ds))|X(s) = x] + o(ds) = (1 + V (x)ds)(u(s, t, x)ds + u,s (s, t, x) + ds K(x, y)u(s, t, y)dy) + o(ds) ∂s u(s, t, x) + V (x)u(s, t, x) +

Further,

K(x, y)u(s, t, y)dy = 0

u(t, t, x) = f (x)

In particular, if X(t) is Brownian motion, we have K(x, y) = (1/2)δ �� (x − y) and the above becomes

∂s u(s, t, x) + V (x)u(s, t, x) + (1/2)∂x2 u(s, t, x) = 0

which is the Feynman-Kac formula. Since we are assuming that the Markov process is time homogeneous, ie its generator K is time independent, it follows that u(s, t, x) = u(0, t − s, x). We denote this by v(t − s, x) and then we get ∂t v(t, x) = V (x)v(t, x) + K(x, y)v(t, y)dy, t ≥ 0, v(0, x) = f (x) √ −1, ie deﬁne w(t, x) = v(it, x) in the above formula to get i∂t w(t, x) = −V (x)w(t, x) − K(x, y)w(t, y)dy

Feynman formally replaced t by it, i =

Now replace V by −V to get the generalized Schrodinger equation i∂t w(t, x) = V (x)w(t, x) − K(x, y)w(t, y)dy, w(0, x) = f (x) where w(t, x) = E[exp(− = E[exp(−i

t

it

V (X(s))ds)f (X(it))|X(0) = x]

0

V (X(is)ds)f (X(it))|X(0) = x]

0

Formally, this formula can be interpreted as follows: v(t, x) is approximated as t v(t, x) = E[exp(− V (X(s))ds)f (X(t))|X(0) = x] ≈ 0

where

exp(

n

(−V (xk )) + (log(K))(xk , xk+1 ))δsk )f (sn )Π0≤k≤n dxk

k=0

0 = s0 < s1 < ... < sn = t

and log(K) is the operator logarithm of K (not log(K(x, y))). [31] Comparison between the Hamiltonian (Schrodinger-Heisenberg) and Lagrangian (path integral) approaches to quantum mechanics. The Schrodinger-Heisenberg approaches to quantum mechanics are Hamiltonian approaches. Feynman proposed an alternative approach to non-relativistic quantum mechanics that is based on the Lagrangian. To see how this proceeds, assume that the Hamiltonian of the system is H(t, q, p) . The Lagrangian is then L(t, q, q � ) = (p, q � ) − H(t, q, p) with p=

∂L ∂q �

Quantum Quantum Mechanics Mechanics for Scientists and Engineers

Thus,

2121

< t + dt, q �� |t, q � >=< t, q �� |exp(−iH(t, q, p)dt)|t, q � >

We can applying the commutation relations between q and p, assume that in the Hamiltonian H(t, q, p), all the p� s appear to the left of all the q � s. Then we get < t + dt, q �� |t, q � >= < t, q �� |t, p� > dp� < t, p� |exp(−iH(t, q, p)dt)|t, q � > =C =C

exp(i(q”, p� ))exp(−iH(t, q � , p� )dt)|t, q � > dp�

exp(i((q �� , p� ) − H(t, q � , p� )dt)).exp(−i(q � , p� ))dp�

=C

exp(i[(q �� − q � , p� )/dt − H(t, q � , p� )]dt))dp�

By composing these inﬁnitesimal amplitudes, we get the transition amplitude for ﬁnite time as

C

K(t2 , q2� |t1 , q1� ) =< t2 , q2� |t1 , q1� >=

exp(i

q(t1 )=q1 ,q(t2 )=q2

t2

t1

(p(t), q � (t)) − H(t, q(t), p(t)))dt)Πt1 ≤t≤t2 dq(t)dp(t)

This is indeed the formula for the time evolution operator kernel in the position representation, ie t2 H(t)dt)}|t1 , q1� > K(t2 , q2� |t1 , q1� ) =< t2 , q2� |T {exp(−i t1

where T {} is the time ordering operator. In the special case, when the Hamiltonian has the form H(t, q, p) = p2 /2m + V (q) the integral over p becomes a Gaussian integral and therefore it can be replaced by evaluating the action integral at the stationary point, ie at p(t) given by d ((p(t), q � (t)) + p2 (t)/2m) = 0 dp(t) ie

q � (t) = p(t)/m, p(t) = mq � (t)

Thus, in this special case, K(t2 , q2� |t1 , q1� ) = C

q(t1 )=q1 ,q(t2 )=q2

exp(i

t2

t1

(mq � (t)2 /2 − V (q(t)))dt)πt1 at time t2 in which the ﬁeld is exactly another given function φf (r), r ∈ R3 of the spatial variables. The corresponding Hamiltonian will then have the form H(t) = H0 (φ(x), ∇φ(x), π(x))d3 x − f (x)V (φ(x))d3 x (Note: x = (t, r), d3 x = d3 r, d4 x = d3 xdt = d3 rdt) where H0 is the Hamiltonian density corresponding to the Lagrangian density L0 (ie obtained by applying the Legendre transform to L0 ): H0 = (1/2) (π 2 + (∇φ)2 + m2 φ2 )d3 x The transition probability amplitude from |φi >→ |φf > in the time duration [t1 , t2 ] can be calculated using the Feynman path integral formula: < φf |S[t2 , t1 ]|φi >= exp(−( Ldtd3 x))Πr∈R3 ,t∈(t1 ,t2 ) dφ(x) C =C where

φ(t1 ,.)=φi ,φ(t2 ,.)=φf

exp(iS0 )(1 + i

[t1 ,t2 ]×R3

f (x)V (φ(x))dtd3 x + (i2 /2!) S0 =

L0 dtd3 x = (1/2)

f (x)f (x)� φ(x)φ(x� )dtdt� d3 xd3 x� + ..)Πdφ(x)

(∂μ φ∂ μ φ − m2 φ2 )dt d3x

By expanding V (φ(x) as a power series in φ(x), the computation of the above path integral reduces to computing the moments of a complex inﬁnite dimensional zero mean Gaussian distribution sinc S0 is a quadratic functional of φ. In particular, we note that the odd moments of a symmetric Gaussian distribution are zero and the even moments can be computed by summing the products of the second moments taken over all partitions of the product ﬁelds into pairs. Thus, computation of the second moments of such a Gaussian distribution beomes signiﬁcant, ie, D(x, y) = C exp(iS0 )φ(x)φ(y)Πz∈R4 dφ(z) if we are interested in transitions from t = −∞ to t = +∞. From standard methods in quantum mechanics, it is easily seen that D(x, y) =< 0|T {φ(x)φ(y)}|0 >

provided that we use the interaction representation which removes the eﬀect of the unperturbed Hamiltonian H0 . If we use the Schrodinger representation, then we would have to compute D as D(x, y) =< 0|T {U (∞, −∞)φ(x)φ(y)}|0 >= < 0|U (∞, tx )φ(x)U (tx , ty )φ(y)U (ty , −∞)|0 >

Quantum )}|0 >= Mechanics for Scientists and Engineers

26

assuming tx ≥ ty and where U is the unperturbed Schrodinger evolution operator. Here |0 > is the vacuum state of the ﬁeld. The function D(x, y) is called the propagator. The complete propagator taking into account interactions is deﬁned as Dc (x, y) = C exp(iS)φ(x)φ(y)Πz dφ(z) where

S = S0 +

f (x)V (φ(x))d4 x =

Ld4 x

We can write a perturbative expansion for Dc as Dc (x, y) = exp(iS0 )(1 + iS1 + i2 S12 /2! + ..)φ(x)φ(y)Πz dφ(z) where S1 =

f (x)V (φ(x))d4 x

is the perturbation to the action caused by external ﬁeld coupling. Even if there is no external ﬁeld, but there is a small perturbation to the Lagrangian density/Hamiltonian density, the above series expansion can be used to determine the complete propagator It was Feynman’s genius to recognize that the various perturbation terms in Dc can be calculated easily using a digrammatic method which could be applied to more complex situations like quantum electrodynamics wherein the quantum ﬁelds are the electromgnetic four potential Aμ (x) and the Dirac four component spinor wave function ψ(x). Let us now formally compute the propagator of the unperturbed KG ﬁeld: S0 = φ(x)[(1/2)∂μ ∂ μ − m2 /2)δ 4 (x − y)]φ(y)d4 xd4 y =

φ(x)K(x, y)φ(y)d4 xd4 y

and hence a simple Gaussian second moment evaluation gives D(x, y) = exp(iS0 [φ])φ(x)φ(y)Πz dφ(z) = C1 (det(iK))−1/2 .K −1 (x, y) In other words D(x, y) is proportional to K −1 (x, y) where K −1 is the inverse Kernel of K: K −1 (x, y)K(y, z)d4 y = δ 4 (x − z) We can write K(x, y) = K(x − y) and then deﬁning its four dimensional Fourier transform: ˆ K(p) = K(x)exp(−ip.x)d4 x, p.x = pμ xμ = p0 x0 − p1 x1 − p2 x2 − p3 x3 we get Clearly, and hence Thus,

K −1 (p) = 1/K(p) K(x) = (1/2)∂μ ∂ μ − m2 /2)δ 4 (x) ˆ K(p) = (pμ pμ − m2 )/2 ˆ D(p) =

where Finally,

C0 p2 − m2

p2 = pμ pμ = p02 − p12 − p22 − p32 D(x, y) = D(x − y) = C0 /(2π)4

exp(ip.x) 4 d p p2 − m2

The corrected (complete) propagator: Dc (x, y) = exp(iS0 [φ])φ(x)φ(y)(1 + i f (x)V (φ(z))d4 z + ...)Πu dφ(u)

Quantum Mechanics for Scientists and Engineers

27

Clearly, we can write this in operator kernel notation as Dc = D + DΣD + DΣDΣD + ... using the property of moments of a Gaussian distribution. For example, if V (φ) = φ4 and f = c0 , then in the Gaussian average of the product φ(x)φ(y) f (z)φ(z)4 d4 z, we get the coupling terms 4 < φ(x)φ(z) >< φ(z)2 >< φ(z)φ(y) > so if we deﬁne Σ(z) = 4f (z)0 < φ(z)2 >, we can write < φ(x)φ(y)( f (z)φ(z)4 d4 z) >= D(x − z)Σ(z)D(z − y)d4 z Likewise, for the next perturbation term

f (z)φ(z)4 d4 z)2 >=

< φ(x)φ(y)(

f (z1 )f (z2 ) < φ(x)φ(y)φ4 (z1 )φ4 (z2 ) > d4 z1 d4 z2

Again, this can be expressed using the Gaussian moments formula as a sum of terms of the form f (z1 )f (z2 ) < φ(x)φ(z1 ) >< φ(z1 )3 φ(z2 )3 >< φ(z2 )φ(y) > d4 z1 d4 z2 and

f (z1 )2 < φ(x)φ(z1 ) >< φ(z1 )2 φ(z2 )4 >< φ(z1 )φ(y) > d4 z1 d4 z2

etc. Now, each term < φ(z1 )m φ(z2 )m > is a product of propagators D(z1 − z2 ) and D(0) so the above general form is valid. [b] Quantization of the electromagnetic ﬁeld. The Lagrangian density is 1 LF = − Fμν F μν , Fμν = Aν,μ − Aμ,ν 4 Thus, We get

F0r = Er , F12 = −B3 , F23 = −B1 , F31 = −B2 LF =

1 2 (E − B 2 ) 2

as required. We compute the canonical momenta: πr =

∂LF = −F0r ∂Ar,0

and π0 =

∂LF =0 ∂A0,0

This is inconsistent with the canonical commutation relations [Aμ (t, r), πν (t, r )] = iδνμ δ 3 (r − r ) Hence we must use the Dirac brackets which are modiﬁcations of the Poisson/Lie bracket when constraints are taken into account. Thie ﬁrst constraint is π0 = 0 and the second constraint is obtained from the equations of motion: ∂r

∂LF = −J 0 ∂A0,r

where J μ the four current density is a matter ﬁeld unlike the Aμ s which are the Maxwell ﬁelds. These equations can be expressed as χ1 = ∂r πr + J 0 = 0 and since time derivatives do not appear here, this equation should be regarded as a constraint, ie, a relationship between the matter ﬁeld and the electromagnetic ﬁeld. The above equation is obtained by adding to the ﬁeld Lagrangian density, the matter-ﬁeld interaction Lagrangian Lint = −J μ Aμ

Now, we work in the Coulomb gauge (we are free to impose a gauge condition on the potentials that leaves the actual electric and mangetic ﬁeld invariant). In this gauge, divA = 0, ie, χ2 = Ar,r = 0

27 Quantum Mechanics for Scientists and Engineers

28

The Maxwell equations in this gauge imply that which has solution

∇2 A0 = −J 0

A0 (t, r) =

J 0 (t, r� ) 3 � d r |r − r� |

and since both sides of the above equation are taken at the same time, we can regard A0 as a matter ﬁeld. Hence, the quantized electromagnetic ﬁeld is described by only three position ﬁelds Ar , r = 1, 2, 3 and once we impose the Coulomb gauge condition, there are only two degrees of freedom for the position ﬁelds. We now calculate the ﬁeld Hamiltonian when it interacts with matter. The Hamiltonian density is with L = LF + Lint , H = πr Ar,0 − L = −F0r Ar,0 + (1/4)Fμν F μν + J μ Aμ 1 F0r (Ar,0 + A0,r ) + (1/4)Frs Frs + J μ Aμ 2 Thus, making use of the matter equation A0,rr = −J 0 , the constraint χ1 and neglecting a 3-dimensional divergence (which will not aﬀect the total Hamiltonian), we get H=

1 F0r F0r + F0r A0,r + (1/4)Frs Frs + J μ Aμ 2

= π 2 /2 + (∇ × A)2 /2 − F0r,r A0 + J μ Aμ

= (1/2)(π 2 + (∇ × A)2 ) + πr,r A0 + J μ Aμ = (1/2)(π 2 + (∇ × A)2 ) − J 0 A0 + J μ Aμ

The term J 0 A0 is a pure matter ﬁeld while the two terms within the bracket are pure ﬁeld terms. This simpliﬁes to H = (1/2)(π 2 + (∇ × A)2 ) − J.A where

J.A = J r Ar

There is no pure matter term in this Hamiltonian. We deﬁne π⊥ = π − ∇A0 ie

π⊥r = πr − A0,r

Then,

divπ⊥ = πr,r − A0,rr = −J 0 + J 0 = 0

Thus π⊥ is a solenoidal ﬁeld. Then, we can express H=

1 2 (π + (∇ × A)2 ) − J.A + (∇A0 )2 /2 − − − (1) 2 ⊥

since the term (∇A0 , π⊥ ) on perfoming a 3 − D integration is zero because it is a perfect 3-D divegence: (∇A0 , π⊥ ) = div(A0 π⊥ ) (1) is our ﬁnal form of the Hamiltonian of the electromagnetic ﬁeld interacting with an external current source. We note that the last term (∇A0 )2 /2 is a pure matter ﬁeld. Hence, if we are bothered only about the electromgnetic ﬁeld and its interaction with matter, the Hamiltonian density is HF = (1/2)(π 2 + (∇ × A)2 ) where we have renamed π⊥ as π for convenience of notation. Our constraints are divπ = 0, divA = 0.

Quantum Mechanics for Scientists and Engineers 28

29

Dirac brackets for constraints: Suppose Q1 , ..., Qn , P1 , ..., Pn are the unconstrained positions and momenta of a system. The constraints are Qj = Pj = 0, j = n+1, ..., n+p. Without loss of generality, we are choosing our constrained variables as new positions and momenta. The Poisson bracket relations are {f, g} =

n+p i=1

f,Qi g,Pi − f,Pi g,Qi )

In particular, we get the contradiction f,Qi = −f,Pi , {f, Pi } = f,Qi , i > n since Qi = Pi = 0, i > n. In order to rectify this problem, Dirac introduced a new kind of bracket deﬁned as follows: Let χij = {ηi , ηj } = Jij J is the standard symplectic matrix of size 2p × 2p. where

η = [Qn+1 , ..., Qn+p , Pn+1 , ..., Pn+p ]T Qn+i , Pn+i , i = 1, 2, ..., p, ie ηi are functions of Qi , Pi , i = 1, 2, ..., n and the bracket {., .}ef f is calculated using Qi , Pi , i = 1, 2, ..., n and regarding Qn+i , Pn+i as functions of Qj , Pj , j ≤ n. The bracket {f, g}P is computed using Qi , Pi , i ≤ n and taking Qn+i = 0, Pn+i = 0: n (f,Qi g,Pi − f,Pi g,Qi ) {f, g}P = i=1

We have

C = χ−1 = −J

as 2p × 2p matrices. Then, the Dirac bracket is deﬁned as

{f, g}D = {f, g} + We see that for k ≤ n,

{f, ηi }Jij {ηj , g} i,j

{f, Qk }D = {f, Qk } = −f,Pk

since

{ηj , Qk } = 0, k ≤ n

Note that {., .} is the unconstrained Poisson bracket. Again, we note that {f, Pk }D = {f, Pk } = f,Qk since {Pk , ηj } = 0, k ≤ n. Further, for i, j ≥ 1, we have {f, ηi }D = {f, ηi } + = {f, ηi } − since

k,l

k,l

{f, ηk }Jkl {ηl , ηi }

{f, ηk }Ckl χli = 0

Ckl χli = δki

l

We note that

{f, g}D = {f, g} −

{f, ηi }Jij {ηj , g} i,j

(f,Qi + (f,ηj ηj,Qi ))(g,Pi + g,ηj ηj,Pi ) = i≤n

j

−interchangeof f andg

j

Quantum Mechanics for Scientists and Engineers

30

+ =

f,ηi Jij g,ηj +

i

(f,Qi +

{f, ηi }Jij {ηj , g}

(f,ηj ηj,Qi ))(g,Pi +

j

i≤n

g,ηj ηj,Pi )

j

This formula tells us that the Dirac bracket between two observables is calculated using the Poisson bracket w.r.t. the unconstrained variables and by regarding the constrained variables as functions of the unconstrained variables. More generally, suppose Q = (Q1 , ..., Qn ), P = (P1 , ..., Pn ) are arbitrary canonical coordinates and the constraints are of the form χi (Q, P ) = 0, i = 1, 2, ..., r Deﬁne the Poisson bracket {., .} as usual w.r.t Q, P , and then deﬁne the Dirac bracket {f, g}D = {f, g} − where

r

i,j=1

{f, χi }Cij {χj , g}

((Cij )) = (({χi , χj }))−1

Then,

{f, χk }D = {f, χk } − = {f, χk } − as required. Further,

i,j

i

{f, χi }Cij {χj , χk }

{f, χi }δik = 0

{f, Qk }D = −f,Pk −

f,Pi Cij χj,Pk

i,j

It is then not hard to show that the rhs is the same as −dPk f , ie, the partial derivative of f w.r.t. Pk where we regard f as an independent function of Q, P and the χ�i s and then deﬁned dPk f = f,Pk + f,χj χj,Pk j

Now, we evaluate the Dirac bracket between πi and A taking into account the constraints: j

χ1 = πi,i = 0, χ2 = Ai,i = 0 We get

{χ1 (t, r), χ2 (t, r� )} = i∇2 δ 3 (r − r� )

The inverse kernel of the rhs is K(r − r� ) = i/4π|r − r� |. Further,

3 3 {Am (t, r), χ1 (t, r� )} = −iδkm δ,k (r − r� ) = iδ,m (r − r� )

{Am (t, r), χ2 (t, r� )} = 0, {χ1 (t, r), πm (t, r� )} = 0,

Hence,

k 3 3 δ,k (r − r� ) = iδ,m (r − r� ) {χ2 (t, r), πm (t, r� )} = iδm

{Am (t, r), πk (t, r� )}D = iδkm δ 3 (r − r� ) − = iδkm δ 3 (r − r� ) + = iδkm δ 3 (r − r� ) −

(

d3 r�� d3 r�� {Am (t, r), χ1 (t, r�� )}K(r�� − r�� ){χ2 (t, r�� ), πk (t, r� )}

3 3 d3 r�� d3 r�� δ,m (r − r�� )K(r�� − r�� )δ,k (r�� − r� )

∂2 K(r�� − r�� ))δ 3 (r − r�� )δ 3 (r�� − r� )d3 r�� d3 r�� ∂xm ∂xk

= iδkm δ 3 (r − r� ) −

∂2 K(r − r� ) ∂xm ∂xk

Quantum Mechanics for Scientists and Engineers 30

31

= iδkm δ 3 (r − r� ) + K,mk (r − r� )

where

K(r) = i/4π|r|

Quantum electrodynamics using creation and annihilation operators for photons, electrons and positrons: We work in the Coulomb gauge so that divA = 0 and this implies ∇2 A0 = −J 0 , ie, A0 is a matter ﬁeld. The Maxwell wave equation for A in the absence of matter, ie charge and current densities is given by ∇2 A − A,0 = 0 and the general solution to this is Ak (t, r) = er (K, σ)[a(K, σ)exp(−i(|K|t − K.r)) + e¯r (K, σ)a(K, sigma)∗ exp(i(|K|t − K.r))]d3 K Here, the summation 3 is over σ = 1, 2 corresponding to only two linearly independent polarizations of the photon, ie, divA = 0 implies r=1 K r er (K, σ) = 0. The energy of the electromagnetic ﬁeld in the Coulomb gauge is HF = (1/2) (E 2 + B 2 )d3 x = (1/2) [(A2,t + (∇ × A)2 ]d3 x =

2|K|2 |e(K, σ)|2 a(K, σ)∗ a(K, σ)d3 K

once we make use of the fact that |K × e(K, σ)| = |K||e(K, σ)|. For this to be interpretable as the sum of energies of harmonic oscillators, each oscillator in the spatial frequency domain having energy |K|, ie, the frequency of the wave. This means that we must have |e(K, σ)| = (2|K|)−1/2 in order to ensure that

HF =

|K|a(K, σ)∗ a(K, σ)d3 K

We can cross check this result as follows. Assuming that the a(K, σ)� s satisfy the canonical commutation relations: [a(K, σ), a(K � , σ � )∗ ] = δ 3 (K − K � )δσ,σ it follows from the Heisenberg equations of motion that a(t, K, σ),t = i[HF , a(t, K, σ)] = −i|K|a(t, K, σ), a∗ (t, K, σ),t = i[HF , a(t, K, σ)∗ ] = i|K|a∗ (t, K, σ) These equations imply

a(t, K, σ),t = −|K|2 a(t, K, σ)

a∗ (t, K, σ),tt = −|K|2 a∗ (t, K, σ)

which are the correct equations for the spatial Fourier transform of the vector potential arrived from the wave equation. Another way to check these commutation relations which we leave as an exercise, is to start with the Lagrangian density LF = (1/2)(A,t )2 − (1/2)(∇ × A)2 so that the momentum density is πk (t, r) =

∂LF = Ak,t ∂Ak,t

then apply the canonical commutation relations k 3 [Ak (t, r), πm (t, r� )] = iδm δ (r − r� )

and veriﬁy that these relations are satisﬁed by the above Fourier integral representation of A assuming the canonical commutation relations between a(K, σ) and a(K � , σ � ). We leave this veriﬁcation as an exercise to the reader.

Quantum Mechanics for Scientists and Engineers 31

32

Now consider the second quantized Dirac ﬁeld described by the four component ield operators ψ(x), ψ(x)∗ where x = (t, r), t ∈ R, r ∈ R3 . In the absence of any classical or quantum electromagnetic ﬁeld ,ψ satisﬁes the Dirac equation [iγ μ ∂μ − m]ψ(x) = 0 or equivalently,

[γ μ pμ − m]ψ = 0, pμ = i∂μ

The solutions to ψ are plane waves: ψ(x) = (u(P, σ)a(P, σ)exp(−ip.x) + v(P, σ)b(P, σ)∗ exp(ip.x))d3 P where

p.x = pμ xμ = E(P )t − P.r, E(P ) =

m2 + P 2 , p = (π μ ) = (E, P ), u(P, σ), v(P, σ) ∈ C4

Here, the summation is over σ = ±1/2 corresponding to the fact that Dirac’s equation can be expressed as [i∂0 − (α, P ) − βm]ψ(x) = 0, P = −i∇ and hence if P denotes an ordinary 3-vector (not an operator), then u(P )exp(−ip.x) satisﬁes the Dirac equation iﬀ [p0 − (α, P ) − βm]u(P ) = 0 and likewise, v(P )exp(ip.x) satisﬁes the Dirac equation iﬀ (−p0 + (α, P ) − βm)v(P ) = 0 Thus, u(P ) is an eigenvector of the matrix HD (P ) = (α, P ) + βm with eigenvalue p0 and v(−P ) is an eigenvector of HD (P ) with eigenvalue p0 . matrix, it has four real eigenvalues taking all multiplicities into account. These Now since HD (P ) is a 4 × 4 Hermitian √ eigenvalues are ±E(P ), E(P ) = m2 + P 2 with each one have a multiplicity of two. We denote the corresponding mutually orthogonal eigenvectors by u(P, σ), v(−P, σ), σ = ±1/2. On applying second quantization, the free Dirac Hamiltonian becomes HDQ = ψ(x)∗ ((α, −i∇) + βm)ψ(x)d3 x and it is easy to verify that the normalizations of u(P, σ) and v(P, σ) are chosen so that HDQ = E(P )(a(P, σ)∗ a(P, σ) + b(P, σ)b(P, σ)∗ )d3 P

and if we postulate the anticommutation relations {a(P, σ), a(P � , σ � )∗ } = {b(P, σ), b(P � , σ � )∗ } = δσ,σ δ 3 (P − P � ) then and only then we can ensure the canonical anticommutation relations (CAR) {ψl (t, r), πm (t, r� )} = iδlm δ 3 (r − r� ) where πm is the canonical momentum associated with the canonical position ﬁeld ψm . From the free Dirac Lagrangian density LD = ψ(x)∗ (i∂0 − (α, −i∇) − βm)ψ(x) we infer that

= ψ(x)∗ γ 0 (iγ μ ∂μ − m)ψ(x) πm (x) =

so that the CAR gives

∂LD = iψl (x)∗ ∂ψl,0

{ψl (t, r), ψm (t, r� )∗ } = δlm δ 3 (r − r� )

Thus in particular, we can subtract an inﬁnite constant from the second quantized Dirac Hamiltonian to get HDQ = E(P )(a(P, σ)∗ a(P, σ) − b(P, σ)∗ b(P, σ)) − − − (1) This equation has the following nice interpretation: a(P, σ)∗ creates an electron with momentum P and spin σ, a(P, σ) annihilates an electron with momentum P and spin σ. b(P, σ)∗ creates positron with momentum P and spin σ while

Quantum Mechanics for Scientists and Engineers 32

33

b(P, σ) annihilates a positron with momentum P and spin σ. a(P, σ)∗ a(P, σ) is the number operator density for electrons and b(P, σ)∗ b(P, σ) is the number operator for positrons. Since the presence of an additional electron increases the energy of the Dirac sea of electrons by E(P ) while the presence of an additional positron decreases the energy of the Dirac sea by E(P ), equn (1) has the correct physical interpretation for the energy of the second quantized Dirac ﬁeld. Now suppose we have a collection of photons, electrons and positrons. The total Lagrangian density is then L = LEM + LD + Lint

so that

= (−1/4)Fμν F μν + ψ ∗ γ 0 (γ μ (i∂μ + eAμ ) − m)ψ LEM = (−1/4)Fμν F μν , LD = ψ ∗ γ 0 (iγ μ ∂μ − m)ψ, Lint = −J μ Aμ ,

J μ = −eψ ∗ γ 0 γ μ ψ

J μ is the Dirac four current density. It is easily veriﬁed to be conserved even when an electromagnetic ﬁeld is present. In other words, we can verify using the Dirac equation [γ μ (i∂μ + eAμ ) − m]ψ = 0 that

∂μ J μ = 0

ie, the current is conserved. We can further show that the matrices K μν = (−1/4)[γ μ , γ ν ] satisfy the same commutation relations as do the standard skew-symmetric generators of the Lorentz group do. Hence these matrices furnish a representation of the Lie algebra of the Lorentz group. Let D denote the corresponding representation of the Lorentz group. D is called the Dirac spinor representation of the Lorentz group and if Λ is any Lorentz transformation, we write D(Λ) = exp(ωμν K μν ) where

Λ = exp(ωμν Lμν )

with ω a skew symmetric matrix and Lμν the standard generators of the Lorentz group: (Lμν )αβ = η μα η νβ − η μβ η να Further, we note the following:

D(Λ)γ μ D(Λ)−1 = Λμν γ ν

and hence, the Dirac equation is invariant under Lorentz transformations ie if xμ → Λμν xν and ψ(x) → D(Λ)ψ(x), Aμ → Λμν Aν , then the Dirac equation remains invariant. Further, the existence of the positron follows from the fact that if we start with the Dirac equation, conjugate it and multiply by the unitary matrix iγ 2 , then we get γ μ )(iγ 2−1 )(−i∂μ + eAμ ) − m]iγ 2 ψ¯ = 0 [(iγ 2 )(¯ It is easily veriﬁed that this equation is the same as [γ μ (i∂μ − eAμ ) − m]ψ˜ = 0 where

ψ˜ = iγ 2 ]ψ¯

In other words ψ˜ satisﬁes the Dirac equation in an electromagnetic ﬁeld but with the charge e replaced by −e or equivalently, −e replaced by e. This observation led Dirac to conclude the existence of the positron, namely the antiparticle of the electron, having the same mass but opposite charge as that of the electron. The positron was discovered in an accelerator later by Anderson. Another property of the Dirac equation in an external electromagnetic ﬁeld is obtained by considering (γ μ (i∂μ + eAμ ) + m)(γ ν (i∂ν + eAν ) − m)ψ = 0

Quantum Mechanics for Scientists and Engineers 33

34

If Aμ = 0, this reduces to the free KG equation [∂μ ∂ μ + m2 ]ψ = 0 since In case however Aμ �= 0, we get or equivalently,

{γ μ , γ |nu } = 2η μν [γ μ γ ν (i∂μ + eAμ )(i∂ν + eAν ) − m2 ]ψ = 0

((1/2){γ μ , γ ν } + (1/2)[γ μ , γ ν ])[(i∂μ + eAμ )(i∂ν + eAν ) − m2 ]ψ = 0

or since {γ μ , γ ν } = 2η μν , we get

[(∂μ − eAμ )(∂ μ − ieAμ ) + m2 + (1/4)[γ μ , γ ν ][∂μ − ieAμ , ∂ν − ieAν ]]ψ = 0 or since

we can write

[∂μ − ieAμ , ∂ν − ieAν ] =

−i(Aν,μ − Aμ,ν ) = −iFμν [(∂μ − eAμ )(∂ μ − ieAμ ) + m2 − (i/4)[γ μ , γ ν ]Fμν ]ψ = 0

This equation is the same as the KG equation in an external electromagnetic ﬁeld obatined by replacing ∂μ by ∂mu −ieAμ except for the last term which displays explicitly the interaction of the electromagnetic ﬁeld Fμν (whose non-zero components are the electric and magnetic ﬁelds) with the spin of the electron described by the antisymmetric ”spin tensor” (−i/4)[γ μ , γ ν ]. Exercise: Write down explicitly in terms of the components of the electric and magnetic ﬁelds, the spin-ﬁeld interaction component and display in particular, the spin magnetic dipole moment and the spin electric dipole moment. The photon and electron propagator: We make this calculation using ﬁrst the Feynman path integral for ﬁelds and then leave as an exercise to demonstrate the same result using the operator expansion of the ﬁelds. First, note that the photon propagator is deﬁned in space-time as Dμν (x, y) =< 0|T {Aμ (x)Aν (y)}|0 > and the electron propagator as

Slm (x, y) =< 0|T {ψl (x)ψm (y)∗ }|0 >

Here we are using the Lorentz gauge in which case even A0 is a component of the electromagnetic ﬁeld potential, not a matter ﬁeld. In the absence of matter, Aμ (x) satisﬁes the wave equation ∂μ ∂ μ Aα = 0 and this equation has solutions Aμ (x) = [a(K, σ)eμ (K, σ)exp(−ik.x)/ 2|K| + a(K, σ)∗ e¯μ (K, σ)exp(ik.x)/ 2|K|]d3 K

where

The Lorentz gauge condition implies

k.x = kμ xμ = |K|t − K.r ∂μ Aμ = 0 kμ eμ (K, σ) = 0

which means that eμ has three degrees of freedom. Formally, we do not take the canonical momentum π0 as zero, even though the Lagrangian density LEM of the electromagnetic ﬁeld does not depend on A0,0 and hence implies π0 = 0. The way out is to introduce a small perturbing Lagrangian density to the Lagrangian density of the electromagnetic ﬁeld involving A0,0 replace LEM by this perturbed Lagrangian density and deﬁne the canonical momenta as πμ =

∂LEM ∂Aμ,0

Quantum Mechanics for Scientists and Engineers 34

35

Then, we introduce the commutation relations [Aμ (t, r), Aν (t, r� )] = δνμ δ 3 (r − r� ) This is satisﬁed provided

[a(K, σ), a(K � , σ � )∗ ] = δσ,σ δ 3 (K − K � )

Then, we ﬁnd that since a(K, σ)|0 >= 0, < 0|a(K, σ)∗ = 0, we have eν (K � , σ � )(exp(−i(k.x − k � .x� ))/2|K|)d3 Kd3 K � Dμν (x, x� ) = θ(t − t� ) < 0|a(K, σ)a(K � , σ � )∗ |0 > eμ (K, σ)¯ +

=

θ(t� − t) < 0|a(K � , σ � )a(K, σ)∗ |0 > eν (K � , σ � )¯ eμ (K, σ)(exp(i(k.x − k � .x� ))/2|K|)d3 Kd3 K � = θ(t − t� ) eμ (K, σ)¯ eν (K � , σ � )δσ,σ δ 3 (K − K � )/2|K|)(exp(−i(k.x − k � .x� ))/2|K|)d3 Kd3 K � +θ(t� − t) eν (K � , σ � )¯ eμ (K, σ)δσ,σ δ 3 (K − K � )(exp(i(k.x − k � .x� ))/2|K|)d3 Kd3 K �

[eμ (K, σ)¯ eν (K, σ)θ(t−t� )exp(−i|K|(t−t� )+iK.(r−r� ))+eν (K, σ)¯ eμ (K, σ)θ(t� −t)exp(i|K|(t−t� )−iK.(r−r� ))]d3 K/2|K|

The normalization condition

es (K, σ) = δrs , r, s = 1, 2, 3 er (K, σ)¯

(summation over σ = 1, 2, 3) had to be imposed (look at the treatment in the Coulomb gauge) in order to guarantee the correct commutation relations for the creation and annihilation operators following from the canonical commutation relations (CCR) for the position and momentum ﬁelds. From this we get Drs (x, x� ) = δrs (2|K|)−1 [θ(t − t� )exp(−ik.(x − x� )) + θ(t� − t)exp(ik.(x − x� ))]d3 K where k 0 = |K|. This is a function of only x − x� and so, we can denote it by Drs (x − x� ). Now, consider k 0 to be a variable and consider the identity dk 0 /(k 02 − |K|2 ) = πi/|K| Γ

where Γ is a contour along the real axis from −∞ to +∞ making a small encirclement of the pole at |K| above the real axis but excluding the pole at −|K| and then completed into a big inﬁnite semicircle below the real axis (so that on this semicircle, k 0 has a negative imaginary part). Then it is clear that for t > t� , the contour integral exp(−ik.(x − x� ))dk 0 /(k 02 − |K|2 ) = iπexp(−i|K|(t − t� ) + iK.(r − r� ))/|K| Γ

and likewise for the other term t� > t. Thus, it follows from the above formula that −δrs Drs (x).exp(−ik.x)d4 x = 02 k − |K|2 + i� and to preserve Lorentz invariance, we may assume ˆ μν (k) = Dμν (x).exp(−ik.x)d4 x = D where

ημν ημν = 2 k 02 − |K|2 + i� k + i�

k 2 = k 02 − |K|2 = kμ k μ

We can repeat this calculation for the electron propagator and show that Sˆlm (p) = Slm (x)exp(−ip.x)d4 x = (γ 0 γ μ pμ − m)−1

These results can be derived directly from the FPI: Dαβ (y, z) = exp((−i/4) Fμν (x)F μν (x)d4 x)Aα (y)Aβ (z)DAμ

Quantum Mechanics for Scientists and Engineers 35

36

Fμν (x)F μν (x) = (Aν,μ − Aμ,ν )(Aν,μ − Aμ,ν ) = 2Aν,μ Aν,μ − 2Aν mu Aμ,ν

= −2Aν ∂μ ∂ μ Aν + 2Aν ∂ ν ∂μ Aμ

on ignoring perfect divergences which do not contribute to the action integral. Thus, S[A] = (−1/4) Fμν (x)F μν (x)d4 x =

or equivalently in the Fourier domain

K μν (x − y)Aμ (x)Aν (y)d4 xd4 y

S[A] = where

ˆ μν (k)Aˆμ (k)Aˆν (k)∗ d4 k K

ˆ μν (k) = k 2 η μν − k μ k ν K

ˆ The matrix K(k) is singular and hence its inverse cannot be evaluated. However, we evaluate its pseudo-inverse and use ˆ μν (k)kν = 0, we use as the photon propagator, the solution to the equation it as the propagator, or since K ˆ D(k) ˆ K(k) = P (k) where P (k) is the orthogonal projecton onto the spatial variable subspace, ie, Pμν (k) = I − kμ kν /k 2 or equivalently,

P μν (k) = I − k μ k ν /k 2

We are using the fact that the second moment matrix of a Gaussian distribution is its covariance matrix, ie, in this case, ˆ the inverse/pseudo-inverse of the matrix K(k). For the electron propagator, we ﬁnd via the operator formalism that taking ψl (x) = (ul (p, σ)a(p, σ)exp(−ip.x) + vl (p, σ)b(p, σ)∗ exp(ip.x))d3 P

with

p0 = E(P ) =

P 2 + m2

and u(P, .), v(−P, .) eigenfunctions of the free Dirac Hamiltonian (α, P ) + βm, αr = γ 0 γ r , β = γ 0 , normalized in such a way that the CAR for ψl , πl = iψl imply {a(P, σ), a(P � , σ � )∗ } = δ 3 (P − P � )δσ,σ , {b(P, σ), b(P � , σ � )∗ } = δ 3 (P − P � )δσ,σ , {a(P, σ), b(P � , σ � )} = 0,

{a(P, σ), b(P � , σ � )∗ } = 0

the electron propagator is given by

Slm (x, y) =< 0|T (ψl (x)ψm (y)∗ )|0 >= [ (ul (P, σ)um (P, σ)∗ exp(−ip.(x − y)) + vl (P, σ)vm (P, σ)∗ )exp(ip.(x − y))]d3 P l

= where

Sˆlm (p)exp(ip.(x − y))d4 p

ˆ S(p) = (γ 0 γ μ pμ − m + i�)−1

We leave this derivation as an exercise to the reader. Alternatively using the FPI, Slm (x, y) = exp(i ψ(x)∗ (iγ 0 γ μ ∂μ − m)ψ(x)d4 x)ψl (y)ψm (z)∗ DψDψ ∗

Quantum Mechanics for Scientists and Engineers 36

37

gives directly the answer using the standard formula for the second moment of a Gaussian distribution. Now we discuss interactions. Suppose the initial state of the ﬁeld is |i >= |pim , σim , p�il , σil� , kin , sin , m = 1, 2, ..., Ni1 , l = 1, 2, ..., Ni2 , n = 1, 2, ..., Ni3 > where pim , σim are the four momenta and spins of the mth electron, p�il , σil� are the four momenta and spins of the lth positron and kin , sin are the four momenta and helicities of the nth photon. Then, we can write |i >= Πm,l,n a(pim , σim )∗ b(p�il , σil� )∗ c(kin , sin )∗ |0 > where we are using the notation c(k, s) for the photon annihilation operators. Likewise for the ﬁnal state. Since transitions in the state occur only because of interactions, we work in the interaction representation in which the Hamiltonian of the electron-positron-photon ﬁeld is given by HI (t) = −e Aμ (x)ψ(x)∗ (iγ 0 γ μ ∂μ ψ(x)d3 x

Substituting the operator expressions for the quantum ﬁelds Aμ , ψ, ψ ∗ , it follows that HI can be expressed as a trilinear functional of (c(k, s), c(k, s)∗ ), (a(P, σ), b(P, σ)∗ ), (a(P, σ)∗ , b(P, σ)). The transition probability amplitude from |i > at time t → −∞ to |f > at time t → +∞ is given by ∞ < f |T {exp(−i HI (t)dt)}|i > = δ(f − i) +

∞

(−i)n

n=1

−∞

−∞ as Fourier integrals of the position space ﬁeld operators. To proceed further, we observe that (LEM + LD )d4 x is a quadratic functional of the position ﬁelds and Lint is small. So perturbation theory gives for the above FPI, exp(i (LEM + LD )d4 x)(1 + i( Lint d4 x) + (i2 /2!)( Lint d4 x)2 + ...)F (ψ, ψ ∗ , Aμ )DψDψ ∗ DAμ where

Lint = e

ψ ∗ (iγ 0 γ μ ∂μ − m)ψAμ d4 x =−

J μ Aμ d4 x

Quantum Mechanics for Scientists and Engineers 37

38

Then the various terms in the intergral are evaluated using the standard formulae for the moments of a Gaussian distribution on noting that exp(i (LEM + LD )d4 x) is a Gaussian density functional, it being a quadratic functional of ψ, ψ ∗ , Aμ . The propagators then enter naturally into the picture when we express the higher moments of even order as products of second moments, ie propagators. Renormalization, an example: Consider the Hamiltonian H = H0 − g|e >< e| Let |k >, k = 1, 2, ... denote the energy eigenstates of H0 with H0 |k >= Ek |k > Let |ψ > denote an energy eigenstate of H with energy eigenvalue E. Then, H|ψ >= E|ψ > gives H0 |ψ > −g|e >< e|ψ >= E|ψ >

or

|ψ >= g(H0 − E)−1 |e >< e|ψ > |k > (Ek − E)−1 < k|e >< e|ψ > =g k

The normalization condition < ψ|ψ >= 1 then implies (Ek − E)−2 | < k|e > |2 | < e|ψ > |2 = 1 g2 k

This implies that | < e|ψ > |2 = (g 2

k

(Ek − E)2 | < k|e > |2 )−1

If the above sum is divergent, then we would get < e|ψ >= 0 which may nor be the case. To avoid such divergences, we make an ultraviolet cutoﬀ meaning thereby that the sum over k is truncated to k such that Ek ≤ Λ where Λ is a ﬁnite positive constant. For a given ultraviolet cutoﬀ Λ, we may thus deﬁne a renormalized coupling constant g = g(Λ) so that (Ek − E)2 < k|e > |2 )−1 g 2 (Λ) = ( and get with this cutoﬀ imposed that

k:Ek ≤Λ

| < e|ψ > |2 = 1

Thus, divergence problems are avoided by redeﬁning the coupling constant. In quantum ﬁeld theory, when we calculate the transition probability amplitudes like vacuum polarization, self energy of the electron or anomalous magnetic moment of the electron, the integrals obtained using the Feynman diagrams for the various Dyson series terms diverge. So to ge meaningful answers, we renormalize the ﬁelds and coupling constants like charge and mass, by scaling these with constant factors and then split the resulting Lagrangian density into a term not involving the coupling constants Z and an interaction term involving the Lagrangian terms scaled by Z − 1. The latter terms are regarded as perturbations and we expand the resulting exponential in powers of Z − 1 and calculate the modiﬁed matrix elements or propagators. Finally, Z may be made to tend to inﬁnity in such a way so as to cancel out the inﬁnities arising in the matrix elements or propagators computed without the Z. This method was ﬁrst demonstrated by Dyson to lead to the experimentally correct values for the above phenomena. [33] Dirac’s wave equation in a gravitational ﬁeld. [34] Canonical quantization of the gravitational ﬁeld. Let Λ(x) be a local Lorentz transformation and let Λ → D(Λ) be the Dirac spinor representation of the Lorentz group. Let gμν (x) be the metric of curved space-time and ηab the Minkowski metric of ﬂat space-time. Let Vμa (x) be the associated tetrad, ie, ηab Vμa (x)Vνb (x) = gμν (x)

Quantum Mechanics for Scientists and Engineers 38

39

((Vμa (x))) can be regarded as a locally inertial frame, ie, ξ μ → Vμa ξ μ = ξ a transforms a vector ﬁeld ξ μ (x) in curved space-times to a Minkowski vector, ie each component ξ a is a scalar. Let ((γ a )) be the Dirac Gamma matrices. They determine the Dirac equation in ﬂat space-time: (iγ μ ∂μ − m)ψ = 0 orin the presence of an electromagnetic ﬁeld, (γ μ (i∂μ + eAμ (x)) − m)ψ = 0 This equation is invariant under global Lorentz transformations ,ie, if Λ is a constant 4 × 4 Lorentz transformation matrix, then D(Λ)(γ μ (i∂μ + eAμ ) − m)ψ = 0 implies

which implies or equivalently, where

[D(Λ)γ μ D(Λ)−1 (i∂μ + eAμ ) − m]D(Λ)ψ = 0 [Λμν γ ν (i∂μ + eAμ ) − m]D(Λ)ψ = 0 [γ ν (i∂ν� + eA�ν (x� )) − m]ψ � (x� ) = 0

x μ = Λμν xν ,

so that

i∂ν� = Λμν ∂μ

and

A�ν (x� ) = Λμν Aμ (x), ψ � (x� ) = D(Λ)ψ(x)

ie, ψ (x ) = D(Λ)ψ(x) satisﬁes Dirac’s equation in the Lorentz transformed system (x μ ) with the electromagnetic ﬁeld also transformed in accordance with the Lorentz transformation Λ. Note that the Lorentz generators of the Dirac representation are given by J μν = (i/4)[γ μ , γ ν ]. We wish to deﬁne a Dirac equation in curved space-time that is invariant under local Lorentz transformations in accordance with the equivalence principle of general relativity. To do this, we must use the tetrad which will transform the non-inertial metric to the inertial metric. So we assume our curved space-time Dirac equation to be given by �

�

[γ a Vaμ (i∂μ + iΓμ (x) + eAμ (x)) − m]ψ(x) = 0 where Γμ (x) is a 4×4 matrix interpreted as the gravitational connection of space-time in the Dirac spinor representation. Applying a local Lorentz transformation D(Λ(x)) gives [D(Λ(x))γ a D(Λ(x))−1 Vaμ (x)(iD(Λ(x))(∂μ + Γμ (x))D(Λ(x))−1 or

If we deﬁne

+eAμ (x) − m]D(Λ(x))ψ(x) = 0 [Λab (x)γ b Vaμ (x)(i(∂μ + D(Λ(x))Γμ (x)D(Λ(x))−1 ) + iD(Λ(x))(∂μ D(Λ(x))−1 ) +eAμ (x)) − m]D(Λ(x))ψ(x) = 0

Vaμ (x) → Λba (x)Vbμ (x) = Vbμ (x)

as the transformation of the tetrad under the local Lorentz transformation Λ(x), then we can express the above equation as [γ b Vbμ (x)(i(∂μ + Γμ (x)) + eAμ (x)) − m]D(Λ(x))ψ(x) = 0

where Γμ (x) is the transformed gravitational connection under the local Lorentz transformation Λ(x): Γ�μ (x) = D(Λ(x))Γμ (x)D(Λ(x))−1 + D(Λ(x))(∂μ D(Λ(x))−1 ) Equivalently, if ω(x) = ((ωab (x))) is an inﬁnitesimal local Lorentz transformation so that Λ(x) = I + dD(ω(x)) = I + ωab (x)J ab

Quantum Mechanics for Scientists and Engineers 39

40

and with neglect of O(� ω(x) �2 ) terms,

D(Λ(x))−1 = I − ωab (x)J ab

then under the inﬁnitesimal local Lorentz transformation Λ(x), the gravitational connection Γμ (x) transforms to Γ�μ (x) = (I + ωab (x)J ab )Γμ (x)(I − ωcd (x)J cd ) −(I + ωab (x)J ab )ωcd,μ (x)J cd

= Γμ (x) + ωab (x)[J ab , Γμ (x)] − ωab,μ (x)J ab

We need to look for a Dirac gravitational connection Γμ (x) that satisﬁes such a transormation law under an inﬁnitesimal local Lorentz transformation I + ω(x). It is easily seen that Γμ (x) = J ab Vaν (x)Vνb,μ (x) does the job. Indeed, under I + ω(x), this transforms to J ab (Vaν + ωac V νc )(Vνb + ωbd Vνd ),μ d = J ab Vaν Vνb,μ + J ab (Vaν Vν,μ ωbd + V νc Vνb,μ ωac ) d ωbd + V νc Vνb,μ ωac ) = Γμ + J db ωbd,μ + J ab (Vaν Vν,μ

This is seen to coincide with

Γμ − ωab,μ J ab + ωab [J ab , Γμ ]

on noting that

[J ab , Γμ ] = [J ab , J cd ]Vcν Vνd,μ

and using the Lie algebra commutation rules for the Lorentz group generators ((J ab )). Finally, we should replace ordinary partial derivatives of the tetrad ﬁeld by covariant derivatives, ie, Γμ (x) = J ab Vaν (x)Vνb:μ (x)

[35] The scattering matrix for the interaction between photons, electrons, positrons and gravitons. Calculating the scattering matrix using the Feynman path integral and also using the operator formalism with the Feynman diagrammatic rules. [36] Atom interacting with a Laser; The general theory based on quantum electrodynamics. Representing the quantum electromagnetic ﬁeld using ﬁnite sets of creation and annihilation operators. Representing any density operator for the quantum electromagnetic ﬁeld via the diagonal Glauber-Sudarshan representation. Representing any state of the laser interacting with the spin of an atom using the Galuber-Sudarshan representation having matrix coeﬃcients. Expressing the evolution equation for the density operator of the laser-atom system using the Glauber-Sudarshan representation. The Hamiltonian of the ﬁeld is given by HF =

p

ωk a∗k ak , a = (a1 , ..., ap ), a∗ = (a∗1 , ..., a∗p ),

k=1

[ak , a∗m ] = δkm and all the other commutators vanish. The Hamiltonian of the atom is HA an N × N Hermitian matrix and ﬁnally, the interaction Hamiltonian between the atom and the ﬁeld is given by HI (t) =

p

(Ak (t)Fk (a, a∗ ) + Ak (t)∗ Fk (a, a∗ )∗ )

k=1

where Ak (t) is a time varying N ×N matrix and Fk� s are ordinary complex valued functions which become ﬁeld operators when their complex arguments are replaced by the ﬁeld operators a, a∗ . The evolution of the joint state ρ(t) of the atom and ﬁeld follows the Schrodinger equation iρ� (t) = [H(t), ρ(t)], H(t) = HF + HA + HI (t)

Quantum Mechanics for Scientists and Engineers 40

41

Note that HA and HF commute. For obtaining the interaction representation, we deﬁne ρ˜(t) = U0 (t)∗ ρ(t)U (t) where

U0 (t) = U0A (t)U0F (t), U0A (t) = exp(−itHA ), U0F (t) = exp(−itHF )

We note that

U0F (t)∗ ak U0F (t) = exp(−iωk t)ak = ak (t), U0F (t)∗ a∗k U0F (t) = exp(iωk t)a∗k = ak (t)∗

Thus, U0F (t)∗ HI (t)U0F (t) =

(Ak (t)Fk (a(t), a(t)∗ ) + Ak (t)∗ Fk (a(t), a(t)∗ ))

k

and deﬁning the N × N complex matrices

Bk (t) = U0A (t)∗ Ak (t)U0A (t) we get the interaction picture Schrodinger equation for the atom and ﬁeld ˜ I (t), ρ˜(t)] i˜ ρ� (t) = [H where

˜ I (t) = U0 (t)∗ HI (t)U0 (t) = H

Bk (t)Fk (a(t), a(t)∗ ) + Bk (t)∗ Fk (a(t), a(t)∗ )∗

k

We write so

Fk (a(t), a(t)∗ ) = F ({ak exp(−iωk t)}, {a∗k exp(iωk t)}) = Fk (t, a, a∗ ) ˜ I (t) = H

(Bk (t)Fk (t, a, a∗ ) + Bk (t)∗ Fk (t, a, a∗ )∗ )

k

and the Schrodinger equation in the interaction picture becomes ˜ I (t), ρ˜(t)] i˜ ρ� (t) = [H To solve this diﬀerential equation, we adopt the Galuber-Sudarshan diagonal represention: Let a(z) = ∗ p k zk ak where z = (zk ) ∈ C . Then write z n a∗n |0 > /n! = exp(a(z)∗ )|0 > |e(z) >= n

with the obvious p-tuple notation. The normalized energy eigenstates of the ﬁeld are √ |n >= a∗n |0 > / n! and hence |e(z) >= Thus,

n

√ z n |n > / n!

< e(u)|e(z) >= exp(< u|z >) We can normalize |e(z) > by multiplying it by a function φ(z) of z so that I = |e(z) > φ(z) < e(z)|d2n z We have Also,

a(u)|e(z) >=< u|z > |e(z) >, ak |e(z) >= zk |e(z) > a∗k |e(z) >=

n

z n a∗k a∗n|0 > /n! =

∂ |e(z) > ∂z

¯k ak , a(z)∗ kz

=

41 Quantum Mechanics for Scientists and Engineers

42

We can evaluate

ak |e(z) >< e(z)| = z¯k |e(z) >< e(z)|, a∗k |e(z) >< e(z)| =

∂ |e(z) >< e(z)| ∂zk

Thus, F (t, a, a∗ )|e(z) >< e(z)| = F (t, z,

∂ )|e(z) >< e(z)| ∂z

assuming that in the expression F (t, a, a∗ ), all the a� s appear to the left of all the a∗ s. Such a representation is possible in view of the commutation relations between the a� s and the a∗ s. Likewise, we have |e(z) >< e(z)|F (t, a, a∗ ) and

= |e(z) > (F (t, a, a∗ )∗ |e(z) >)∗ F (t, a, a∗ )∗ |e(z) >= F¯ (t, a∗ , a)|e(z) >

where now in the expression for F¯ (t, a∗ , a) all the a∗ s will appear to the left of all the a� s. We thus get |e(z) >< e(z)|F (t, a, a∗ ) = F¯ (t,

∂ , z¯)|e(z) >< e(z)| ∂ z¯

Here, F¯ (t, u, v) is the function obtained by conjugating all the coeﬃcients in the Taylor series expansion of F (t, .). Now, with this understanding, we note that ˜ I (t) = (Bk (t)Fk (t, a, a∗ ) + Bk (t)∗ Fk (t, a, a∗ )∗ ) H k

can be written as

˜ I (t) = H

Ck (t)Gk (t, a, a∗ )

k

where in the G�k s all the a� s appear to the left of all the a∗ s. Then, by the above observation, [Gk (t, a, a∗ ), |e(z) >< e(z)|] = [Gk (t, z,

∂ ¯ k (t, z¯, ∂ )]|e(z) >< e(z)| )−G ∂z ∂ z¯

∂ acts only on the factor |e(z) > while ∂∂z¯ acts only on the factor < e(z)| in the term It should be noted that ∂z |e(z) >< e(z)|. This is because z → |e(z) > is an analytic Hilbert space valued function of the complex variable z and z¯ →< e(z)| is therefore an analytic function of the complex variable z¯. Using

< e(u), e(z) >= exp(< u, z >) it is easy to show that

φ(z) = π −p exp(� z �2 )

Indeed, then a simple Gaussian integral evaluation gives < e(u)| φ(z)|e(z) >< e(z)|d2p z|e(v) >= φ(z)exp(< u|z > + < z|v >)d2p z = 1 We can express the joint state of the atom and the ﬁeld in the interaction representation as a Glauber-Sudarshan integral: ρ˜(t) = ψ(t, z, z¯) ⊗ |e(z) >< e(z)|d2p z where

˜ z, z¯) ∈ CN ×N ψ(t,

We then get ρ˜� (t) =

∂ψ(t, z, z¯) ⊗ |e(z) >< e(z)|d2p z ∂t

Quantum Mechanics for Scientists and Engineers 42

43

and further, k

k

˜ I (t), ρ˜(t)] = [H [Ck (t)ψ(t, z, z¯) ⊗ Gk (t, a, a∗ )|e(z) >< e(z)| − ψ(t, z, z¯)Ck (t) ⊗ |e(z) >< e(z)|Gk (t, z, z ∗ )]d2p z

¯ k (t, z¯.∂/∂ z¯)|e(z) >< e(z)|]d2p z [Ck (t)ψ(t, z, z¯) ⊗ Gk (t, z, ∂/∂z)|e(z) >< e(z)| − ψ(t, z, z¯)Ck (t) ⊗ G =

k

¯ k (t, z¯, ∂/∂ z¯)T )ψ(t, z, z¯) ⊗ |e(z) >< e(z)|d2p z Ck (t)(Gk (t, z, ∂/∂z)T − G

where we have used integration by parts. Here for example, (z1m1 z2m2 ...zpmp = (−1)n1 +...+np

∂ n1 +...+np T n ) ∂z1n1 ...∂zp p

∂ n1 +...+np m1 mp ...zp n z ∂z1n1 ...∂zp p 1

It follows that ψ(t, z, z¯) satisﬁes the pde

∂ψ(t, z, z¯) = ∂t ¯ k (t, z¯, ∂/∂ z¯)T )ψ(t, z, z¯) Ck (t)(Gk (t, z, ∂/∂z)T − G i

k

=

k

¯ k (t, z¯, ∂/∂ z¯)T )Ck (t)ψ(t, z, z¯) (Gk (t, z, ∂/∂z)T − G

Remark: The following notation has been used here: If G(t, z, u) is a polynomial in variables z, u which may even be ¯ z, u), we mean the polnomial obatined from G(t, z, u) by replacing its complex noncommuting operators, then by G(t, coeﬃcients by their respective conjugates without making any change in the variables z, u. [37] The classical and quantum Boltzmann equations. Quantum Boltzmann equation: Let Hi , i = 1, 2, ..., N be Hilbert spaces and let H=

N k=1

Hk

Let ρ(t) be a density operator in H. Let the Hamiltonian according to which ρ evolves have the form H=

N

Hk +

k=1

Vkj

1≤k 0, the second marginal state ρ12 (t)will be a small perturbation of ρ1 (t) ⊗ ρ2 (t) (Note that, we are assuming that ρ2 (t) is an identical copy of ρ1 (t)). Thus, if the interaction potential V12 is small, then in (3), we can replace [V12 , ρ12 (t)] by [V12 , ρ1 (t) ⊗ ρ2 (t)]. Then, the approximate solution to (3) is given by t ρ12 (t) = exp(−itad(H1 + H2 ))(ρ12 (0)) + exp(−i(t − s)ad(H1 + H2 ))ad(V12 )(ρ1 (s) ⊗ ρ1 (s))ds − − − (4) 0

and this can be substituted into (2) to get the following nonlinear integro-diﬀerential equation for the ﬁrst marginal ρ1 (t): t iρ�1 (t) = [H1 , ρ1 (t)]+(N −1)T r2 [ad(V12 )T12 (t)(ρ12 (0))]+(N −1)T r2 [ ad(V12 )T12 (t−s)ad(V12 )(ρ1 (s)⊗ρ1 (s))ds]−−−(5) 0

where T12 (t) = exp(−itad(H1 + H2 )) = exp(−itad(H1 )).exp(−itad(H2 )) = exp(−itad(H2 )).exp(−itad(H1 )) since H1 and H2 commute. (5) may be termed as the quantum Boltzmann equation. Other versions of this equation exist like we can solve (3) to get ρ12 (t) = exp(−itad(H12 ))(ρ12 (0)) where (2) then gives

H12 = H1 + H2 + V12 iρ�1 (t) = ad(H1 )(ρ1 (t)) + (N − 1)T r2 ad(V12 )exp(−itad(H12 )(ρ12 (0)) − − − (6)

The third version is to note that the last term in (4) already contains a multiplicative factor ad(V12 ) and hence since we are interested in terms only upto linear orders in V12 , we can replace ρ1 (s) in (4) by exp(−is.ad(H1 ))(ρ1 (0)) (4)then becomes ρ12 (t) = exp(−itad(H1 + H2 ))(ρ12 (0)) +

t

0

exp(−i(t − s)ad(H1 + H2 ))ad(V12 )exp(−isad(H1 + H2 ))(ρ1 (0) ⊗ ρ1 (0))ds

Thus, (2) can be approximated by iρ�1 (t) = [H1 , ρ1 (t)]+(N −1)T r2 [ad(V12 exp(−it.ad(H1 +H2 ))(ρ12 (0))]+(N −1)T r2 [ad(V12 )

t

exp(−i(t−s)ad(H1 +H2 ))ad(V12 )exp

0

[38] Bands in a semiconductor: Derivation using the Bloch wave functions in a 3-D periodic lattice. V : R3 → R is the potential of the periodic lattice produced by nuclei located at diﬀerent sites of the crystal. The Lattice vectors are a1 , a2 , a3 and the periodicity gives V (r + n1 a1 + n2 a2 + n3 a3 ) = V (r + n.a) = V (r), n1 , n2 , n3 ∈ Z V (r) can be expressed as a Fourier series V (r) =

c(m)exp(2πimT M r)

m∈Z3

where the reciprocal lattice matrix M is calculated as M [a1 , a2 , a3 ] = I ie

M = A−1 , A = [a1 , a2 , a3 ]

Quantum Mechanics for Scientists and Engineers 44

It follows that

45

mT M (n1 a1 + n2 a2 + n3 a3 ) = mT M An = mT n ∈ Z, n ∈ Z3

This ensures that the above Fourier series for V (r) is periodic along the a1 , a2 , a3 directions. The wave function ψ(r) satisﬁes [−∇2 /2m + V (r)]ψ(r) = Eψ(r) Changing r to r + An (An = n1 a1 + n2 a2 + n3 a3 ) and using V (r + An) = V (r) gives Hψ(r + An) = Eψ(r + An) so by uniqueness (assuming non-degeneracy), ψ(r + An) = C(n)ψ(r), C(n) ∈ C, |C(n)| = 1 We write

Ck = C(ak ), k = 1, 2, 3

Then if Nk nuclei are along the ak direction, k = 1, 2, 3, then by imposing periodicity relations for the wave function at the crystal boundaries, we get CkNk = 1, k = 1, 2, 3 and hence

Ck = exp(2πilk /Nk ), k = 1, 2, 3

for some lk = 0, 1, ..., Nk − 1. We deﬁne the Bloch wave function corresponding to l = (l1 , l2 , l3 ) by ψ(r) = ul (r)exp(2πilT Kr) = where K is some 3 × 3 matrix. Then, for ul to be periodic with periods a1 , a2 , a3 , we require that ψ(r + ak )exp(−2πilT Kak ) = ψ(r), k = 1, 2, 3 or equivalently, Thus we require that and so we can take We note that

Ck = exp(2πilT Kak ) exp(2πilk /Nk ) = ex[(2πilT Kak ) K = N −1 M = N −1 A−1 , N = diag[N1 , N2 , N3 ] lT N −1 A−1 ak = lT N −1 ek = lk /Nk , k = 1, 2, 3

We write Then,

b = b(l) = 2πK T l ψ(r) = u(r)exp(ibT r), u = ul

We substitute this into the Schrodinger equation and derive the pde satisﬁed by u(r). After that since u is periodic with period A, we can expand it in a Fourier series as we did for V (r): u(r) = ul (r) = d(m)exp(2πimT M r) m∈Z3

and then derive a diﬀerence equation for the Fourier coeﬃcients d(m) of u(r). For each l = (l1 , l2 , l3 ) ∈ ×3k=1 {0, 1, ..., Nk − 1} we thus obtain a sequence of solutions ulk (r), k = 1, 2, ... with energy eigenvalues E = E(l, k), k = 1, 2, .... We say that each l deﬁnes an energy band. [39] The Hartree-Fock apporoximate method for computing the wave functions of a many electron atom. The Hamiltonian of the system comprising N particles has the form H=

N

k=1

Hk +

1≤k= ⊗N k=1 |ψk >

where |ψk >∈ Hk . We substitute this into the expression < ψ|H|ψ > for the average energy and extremize this w.r.t. the component wave functions |ψk >, k = 1, 2, ..., N subject to the constraints < ψk |ψj >= δkj . Incorporating these constraints using Lagrange multiplier λ(k, j) gives us the functional to be extremized as λ(k, j)(< ψk |ψj > −δkj ) S[{ψk , λ(k, j)}] =< ψ|H|ψ > − 1≤k≤j≤N

We observe that < ψ|H|ψ >=

k

< ψk |Hk |psik > +

1≤k≤j≤N

The variational equations gives Hk |ψk > +

and

1≤k

λ(k, j)(< ψk |ψj > −δkj ) δS/δψk∗ = 0

j:j>k

< Ik ⊗ ψj |Vkj |ψk ⊗ ψj > + −λ(k, k)|ψk > −

For example, if Hk = L2 (R3 ) and

j:j

λ(k, j)|ψj >= 0

j:k ⊗...|psiσn > σ∈Sn

Carry out for this trial wave function the above extremization of < ψ|H|ψ > subject to the constraints < ψk |psij >= δkj and specialize to the position representation. Next, take into account the spin of each electron so that the component wave function |ψk > depends on both the position rk and spin variable sk = ±1/2. The trial wave function is then ψσ1,sσ1 (r1 ) ⊗ ... ⊗ ψσn,sσn (rn ) ψ(r1 , s1 , ..., rn , sn ) = (n!)−1/2 σ∈Sn

Note that the constraints are < ψk,s |ψk ,s >= δkk δss [40] The Born-Oppenheimer approximate method for computing the wave functions of electrons and nuclei in a lattice.

Quantum Mechanics for Scientists and Engineers 46

47

Nucleon positions are Rk , k = 1, 2, ..., N . electron positions associated with the k th nucleus are rkl , l = 1, 2, ..., Z. Nucleon mass is M , electron mass is m. Total Hamiltonian of the system is H = TN + Te + Vee + VN N + VeN where

TN = TN (R) = −

is the total kinetic energy operator of all the nucleons. Te = Te (r) = −

k

k,l

∇2Rk /2M

∇2rkl /2m

is the total kinetic energy of all the electrons. Vee =

(k,l)�=(m,j)

e2 /2|rkl − rmj |

is the total electron-electron interaction potential energy. Z 2 e2 /|Rk − Rm | VN N = k�=m

is the total nucleon-nucleon interaction potential energy. VeN = − Ze2 /|Rk − rml | k,m,l

is the total electron-nucleon interaction potential energy. We ﬁrst solve (Te + Vee + VeN )Φ(r, R) = Ee (R)Φ(r, R) ie, the eigenfunctions for the electrons with ﬁxed values of the nuclear positions. The energy levels of the electrons then depend on the nucleon positions R. We then Assume that the total wave function of the electrons and nucleons is Ψ(r, R) = Φ(r, R)χ(R) Substituting this into the complete electron-nucleon eigenvalue equation gives (Te + TN + Vee + VN N + VeN )(Φ(r, R)χ(R)) = EΦ(r, R)χ(R) or using the above electron eigenvalue equation, (TN + VN N + Ee (R))(Φ(r, R)χ(R)) = E(Φ(r, R)χ(R)) or equivalently,

Φ(r, R)−1 TN (Φ(r, R)χ(R)) + (VN N + Ee (R))χ(R) = Eχ(R)

Now,

= −(2M )−1

k

TN (Φ(r, R)χ(R)) = ∇2Rk /2M )(Φ(r, R)χ(R)) (− k

[χ(R)(∇2Rk Φ(r, R)) + Φ(r, R)∇2Rk χ(R) + 2(∇Rk Φ(r, R), ∇Rk χ(R))]

[41] The performance of quantum gates in the presence of classical and quantum noise. Suppose we design a quantum gate by perturbing the Hamiltonian H0 of a quantum system to H0 + δ k fk (t)Vk and 2 running the unitary evolution for a duration of T seconds. Upto O(δ ), the evolution gate at time T is given by U (T ) = U0 (T )W (T ), U0 (T ) = exp(−iT H0 ),

Chapter

5

Quantum Gates, Scattering THeory, Noise in Schrodinger and Dirac’s Equations k

[41] The performance of quantum gates in the presence of classical and quantum noise. Suppose we design a quantum gate by perturbing the Hamiltonian H0 of a quantum system to H0 + δ k fk (t)Vk and Quantum Mechanics 47 2 running the unitary evolution for a duration of T seconds. Upto O(δ ), the evolution gate at time T is given by U (T ) = U0 (T )W (T ), U0 (T ) = exp(−iT H0 ), T W (T ) = I − iδ fk (t)V˜k (t)dt − δ 2 fk (t)fk (s)V˜k (t)V˜m (s)dtds 0

k

where

k,m

0 −exp(−itH0 )|φi >= 0 so that

|ψi >= Ω− |φi >, Ω− = s.limt→∞ exp(itH).exp(−itH0 )

Quantum Mechanics for Scientists and Engineers

56 54

Likewise |φo > is the out state ie, the free projectile state at t = +∞ projected to time t = 0. It also evolves according to the free particle Hamiltonian H0 . The out scattered state |ψo > from which it evolves, evolves according to exp(−itH). Thus, limt→∞ exp(−itH)|ψo > −exp(−itH0 )|phio >= 0

or equivalently,

|ψo >= Ω+ |φo >, Ω+ = s.limt→∞ exp(itH).exp(−itH0 )

We deﬁne the scattering operator by

S = Ω∗+ Ω−

It follows that < φo |S|φi >=< ψo |ψi >

ie, the probability of scattering from an in scattered state |ψi > to an out scattered state |ψo > equals | < ψo |ψi > |2 = | < φo |S|φi > |2 where |φi > is the input free state of the projectile and |φo > is the output free state of the projectile. The domain of Ω+ is not the whole of H but rather contained in the absolutely continuous spectrum of H0 . We denote the spectral measure of H0 by E0 and that of H by E. We then have ∞ d (exp(itH).exp(−itH0 )dt Ω+ = I + dt 0 ∞ =I +i exp(itH)V.exp(−itH0 )dt 0

and hence

Ω∗+ = I − i Likewise, we have Ω− = I − =I −i

∞

0

d (exp(itH).exp(−itH0 ))dt dt

−∞

exp(itH0 )V.exp(−itH)dt

0

0

exp(itH)V.exp(−itH0 )dt

−∞

=I −i

∞

exp(−itH)V.exp(itH0 )dt

0

and hence

Ω∗− = I + i It is clear that Ω+ and Ω− are isometries, ie,

∞

exp(−itH0 )V.exp(itH)dt

0

Ω∗+ Ω+ = I, Ω∗− Ω− = I deﬁned respectively on the domains of Ω+ and Ω− . We have R = S − I = Ω∗+ Ω− − Ω − −∗ Ω− Now, (I − i

0

= Ω− − i where we have used the identity

∞

0

S = Ω∗+ Ω− = exp(itH0 )V.exp(−itH)dt).Ω−

∞

exp(itH0 )V.Ω− .exp(−itH0 )dt

exp(−itH)Ω− = lims→∞ exp(−itH).exp(isH).exp(−isH0 ) = lims→∞ exp(i(s − t)H).exp(−i(s − t)H0 ).exp(−itH0 ) Ω− .exp(−itH0 )

Quantum Mechanics for Scientists and Engineers

Note that likewise, we have

57 55

exp(−itH)Ω+ = Ω+ exp(−itH0 )

We now also have I = Ω∗− Ω− = (I + i = Ω− + i Hence,

−i[

∞

0

= −i

−

= −i R3 ×R+

−2π

0

0

exp(−itH0 )V.exp(itH)dt)Ω−

exp(−itH0 )V.Ω− .exp(itH0 )dt

0

∞

−∞

R3

E0 (dλ)V (I − i

R3

∞

∞

R=S−I = ∞ exp(itH0 )V.Ω− .exp(−itH0 )dt + exp(−itH0 )V.Ω− exp(itH0 )dt] = −i

= −i

E0 (dλ)V Ω− E0 (dμ).exp(it(λ − μ))dt

∞

0

R3

E0 (dλ)V exp(−isH)V exp(isH0 )E0 (dμ).exp(it(λ − μ))dtds

R3 ×R+

+2πi

exp(−isH)V.exp(isH0 )ds)E0 (dμ).exp(it(λ − μ))

E0 (dλ)V.E0 (dμ)exp(it(λ − μ))

= −2πi

exp(itH0 )V.Ω− .exp(−itH0 )dt

= −2πi

E0 (dλ)V.E0 (dμ)δ(λ − μ)

E0 (dλ)V.exp(−is(H − x))V E0 (dx)E0 (dμ)δ(λ − μ)dxds = −2πi

R3

E0 (dλ)V.E0 (dμ)δ(λ − μ)

E0 (dλ)V.(H − x − i0)−1 V E0 (dx)E0 (dμ)δ(λ − μ)dxds

R2

E0 (dλ)(V − V (H − λ − i0)−1 V )E0 (dμ)δ(λ − μ)

From this it follows that the initial projectile energy μ is conserved after the scattering process, ie, λ = μ and if |φi > is a state of the incident projectile having energy in the range λ + dλ and likewise for |φo >, then < φo |S − I|φi >≈ −2πi < φo |V − V (H − λ − i0)−1 V |φi > In other words, if R(λ) = (S − I)(λ) denotes the scattering matrix elements for the energy λ between two free particle states having momenta along the directions ω and ω � (∈ S 2 ) respectively, then this matrix element is given by −2πi times the diagonal matrix D√V minus the√product of √ the matrices DV , (K +DV −λI)−1 and DV where DV is the √ diagonal � ω � )δ( 2mλ� ω � − 2mλ�� ω �� ). K is the non-diagonal matrix (−1/2m)∇2 δ( 2mλ� ω � − matrix having elements V ( 2mλ √ 2mλ�� ω �� ) The ﬁnal matrix DV − DV (K + DV − λI)−1 DV is to be evaluated at the row index (λ, ω � ) and column index (λ, ω) with λ being ﬁxed. [48] Design of quantum gates using time dependent scattering theory. H(t) = H0 + V (t) Ω(+) = limt→∞ T {exp(−i

t

H(s)ds)}−1 .exp(−itH0 )

0

Ω(−) = limt→∞ T {exp(−i

0

−t

H(s)ds)}.exp(itH0 )

Quantum Mechanics for Scientists and Engineers

58 56

S = Ω(+)∗ Ω(−) = limt→∞ exp(itH0 ).T {exp(−i

t

0

= limt→∞ exp(itH0 ).T {exp(−i

H(s)ds)}.T {exp(−i

t

0

H(s)ds)}.exp(itH0 )

−t

H(s)ds)}.exp(itH0 )

−t

For large t, the approximate value of the transition probability from an initial state |i > to a ﬁnal state |f > is given by (assuming V (t) is an operator valued stochastic process), P2t (|i >→ |f >) = t E[| < f |exp(itH0 ).T {exp(−i H(s)ds)}.exp(itH0 )|i > |2 ] −t

We have upto linear orders in V , the truncated Dyson series t H(s)ds)} = exp(−itH0 )(I − i T {exp(−i −t

where

t

V˜ (s)ds).exp(−itH0 )

−t

V˜ (s) = exp(isH0 )V (s).exp(−isH0 )

Thus in this approximation, the scattering operator is approximately given by t S =I −i V˜ (s)ds −t

and the transition probability from the state |i > to the state |f > is given by t E| < f | V˜ (s)ds|i > |2 −t

The exact expression for the matrix elements of the scattering operator is given by ∞ (−i)n V˜ (s1 )...V˜ (sn )ds1 ...dsn |i > |2 E| < f | −∞ is the exponential vector in the Boson Fock space. Then we claim that E(jt (X)|ηt ) = U (t)∗ Et (X|ηti )U (t) Note that ηti commutes with X and hence ηt commutes with jt (X). To prove the above formula, we observe that if if Z ∈ ηti , then U (t)∗ ZU (t) ∈ ηt and hence E[(jt (X) − U (t)∗ Et (X|ηti )U (t))U (t)∗ ZU (t)] = E[U (t)∗ XZU (t) − U (t)∗ Et (X|ηti )ZU (t)]

= E[U (t)∗ (XZ − Et (X|ηti )Z)U (t)] = Et (XZ − Et (XZ|ηti )) = 0

by the deﬁnition of conditional expectation. This proves the claim. Now suppose F (t) is a system operator, ie, in L(h) such that Et (X) = E(U (t)∗ XU (t)) = E(F (t)∗ XF (t)) = EF (t) (X) Note that [F (t), ηti ] = 0. It follows clearly that E(F (t)∗ F (t)) = E(U (t)∗ U (t)) = 1 Then, we claim that

EF (t) (X|ηti ) = E(F (t)∗ XF (t)|ηti )/E(F (t)∗ F (t)|ηti )

To prove this, we observe that the lhs and the numerator and denominator of the rhs all commute with each other since ηti is an Abelian algebra. Further, we have for any operator Z ∈ ηti that EF (t) [(X − (E(F (t)∗ XF (t)|ηti )/BbbE(F (t)∗ F (t)|ηti ))Z] = E[F (t)∗ XZF (t) − F (t)∗ (E(F (t)∗ XF (t)Z|ηti )/E(F (t)∗ F (t)|ηti ))F (t)]

= E[F (t)∗ XZF (t)] − E[E(F (t)∗ XZF (t)|ηti )F (t)∗ F (t)/E(F (t)∗ F (t)|ηti )]

Quantum Mechanics for Scientists and Engineers F (t)∗ F (t)|ηti ))F (t)]

62

since F (t) commutes with ηti and Z ∈ ηti . Further, conditioning the second term above on ηti and then taking the expectation gives E[E(F (t)∗ XZF (t)|ηti )F (t)∗ F (t)/E(F (t)∗ F (t)|ηti )] = E[E(F (t)∗ XZF (t)|ηti )E(F (t)∗ F (t)|ηti )/E(F (t)∗ F (t)|ηti )] = E(F (t)∗ XZF (t)) We have thus proved that EF (t) [(X − (E(F (t)∗ XF (t)|ηti )/BbbE(F (t)∗ F (t)|ηti ))Z] = 0 and hence the claim. The Hudson-Parthasarathy noisy Schrodinger evolution is described by the qsde β dU (t) = (Lα β dΛα (t))U (t)

where summation over the repeated indices α, β ≥ 0 is understood and the basic processes Λα β satisfy quantum Ito’s formula μ μ α dΛα β .dΛν = �ν dΛβ where �μν is zero if either μ = 0 or ν = 0 or μ �= ν and is one otherwise. The system operators Lα β satisfy the following conditions for U (t) to describe a unitary evolution: 60

0 = d(U ∗ U ) = dU ∗ .U + U ∗ .dU + dU ∗ .dU = β α β β∗ μ β ν U ∗ (Lβ∗ α dΛα + Lβ dΛα + Lα Lν dΛα dΛμ )U

so that

β α β β∗ μ β ν (Lβ∗ α dΛα + Lβ dΛα + Lα Lν dΛα dΛμ )U

which gives

α ν∗ μ ν Lβ∗ α + Lβ + Lα Lβ �μ = 0

The Evans-Hudson ﬂow corresponding to this HP equation is obtained by taking a self-adjoint operator X in the system Hilbert space h and setting jt (X) = U (t)∗ XU (t) Then application of quantum Ito’s formula gives djt (X) = dU ∗ XU + U ∗ XdU + dU ∗ XdU = μ α ν ν∗ β U ∗ (Lβ∗ α X + XLβ + �μ Lα XLβ )U.dΛα

= jt (θβα (X))dΛβα where the structure maps θβα are given by

θβα (X) = μ α ν ν∗ Lβ∗ α X + XLβ + �μ Lα XLβ

We note that they satisfy the structure equations since jt as deﬁned is a ∗ unital algebra homomorphism. The Belavkin input measurement processes are taken as β Yin,k (t) = cα β [k]Λα , k = 1, 2, ..., r, t ≥ 0

where the α = β = 0 term is omitted. These processes jointly generate an Abelian family of Von-Neumann algebras.The corresponding output processes are Yout,k (t) = Yk (t) = U (t)∗ Yin,k (t)U (t) and since U (t) is unitary, and Yin,k (t) commute with the system operators, it follows that Yk (t) = U (T )∗ Yin,k (t)U (T ), T ≥ t and hence

[Yk (t), js (X)] = 0, s ≥ t

ie the output measurements are non-demolition processes. Let ηt = σ(Yk (s) : s ≤ t, k = 1, 2, ..., r). Then ηt is an Abelian Von-Neumann algebra and belongs to the commutant of the algebra generated by jt (X) as X ranges over the system operators. We write πt (X) = E(jt (X)|ηt )

Quantum Mechanics for Scientists and Engineers

63

where the expectation is taken in the state |f φ(u) > with |f >∈ h, < f, f >= 1 and φ(u) = exp(− � u �2 /2)|e(u) > where u ∈ L2 (R+ ) ⊗ Cd and |e(u) > is the corresponding exponential vector in Γs (L2 (R+ ) ⊗ Cd ). Now we can write dπt (X) = Ft (X)dt + Gkmt (X)(dYm (t))k where the summation in the last term is over m ≥ 1, k ≥ 1 and Ft (X), Gkmt (X) are all ηt measurable operators, ie, they can be regarded as commutative stochastic processes. We can write dYm (t) = dYin,m (t) + dU (t)∗ .dYin,m (t).U (t) + U (t)∗ dYin,m (t).dU (t)∗ = +jt (Sβα [m])dΛβα and hence for k ≥ 1

(dYm (t))k = jt (Sβα [m, k])dΛβα (t)

where Sβα [k] are system operators, ie, operators in h and S00 [k] = 0, k ≥ 2. The equations E[(djt (X) − dπt (X))(dYm (t))k |ηt ] + E[(jt (X) − πt (X))(dYm (t))k ] = 0, k ≥ 1 and

E[(djt (X) − dπt (X))|ηt ] = 0

follow by taking the diﬀerential of the expression E[(jt (X) − πt (X))C(t)] = 0 where C(t) satisﬁes the qsde dC(t) =

fm,k (t)C(t)(dYm (t))k , C(0) = 1

m,k≥1

and using the arbitrariness of the complex valued functions fm,k (t). We have E[djt (X)(dYm (t))k |ηt ] = E[jt (θβα (X))jt (Sνμ [m, k])dΛβα (t).dΛνμ (t)|ηt ] = �βμ πt (θβα (X).Sνμ [m, k])uν (t)¯ uα (t)dt E[dπt (X)(dYm (t))k |ηt ] =

Grst (X)E[(dYs (t))r (dYm (t))k |ηt ]

= Grst (X)jt (Sβα [s, r]Sνμ [m, k])E[dΛβα (t).dΛνμ (t)|ηt ] = Grst (X)πt (Sβα [s, r]Sνμ [m, k])�βμ uν (t)¯ uα (t)dt [53] Classical control of a stochastic dynamical system by error feedback based on a state observer derived from the EKF. [54] Quantum control using error feedback based on Belavkin quantum ﬁlters for the quantum state observer. Let Y (t) be a measurement noise process and V (t) a system operator process say like A(t) + A(t)∗ . Consider a qsde dUc (t) = (−iV (t)dY (t) − V (t)2 dt/2)Uc (t)

We are assuming that V (t) is Hermitian. If V (t) = V does not vary with time, we can write the solution to the above qsde as Uc (t) = exp(−iV Y (t)) Uc (t) is a unitary matrix and its application to the state evolved from a qsde can remove the eﬀect of noise if Y (t) is present as a noise in the qsde. For example, consider the following qsde dU (t) = (−(iH + V 2 /2)dt − iV dY (t))U (t)

We can write this approximately as U (t + dt) = (I − dt(iH + V 2 /2) − iV dY (t))U (t) We now apply the control unitary (I − iV (t)dY (t) − V 2 (t)dt/2)−1 = I + iV (t)dY (t) − V 2 (t)dt/2 to U (t + dt) to get (I + iV (t)dY (t) − V 2 (t)dt/2)(I − iHdt − iV (t)dY (t) − V 2 (t)dt/2)U (t) = (I − iHdt)U (t)

∗

to 64

Quantum Mechanics for Scientists and Engineers − 2

ie, the eﬀect of the noise is cancelled out. We note that the output measurement is given by Yo (t) = U (t)∗ Y (t)U (t) and it is this process which follows the non-demolition property. We can measure only Yo without disturbing the dynamics generated by U (t) on the system Hilbert space h. We cannot measure the input measurement Y without disturbing the system dynamics. Now Belavkin’s quantum ﬁltering equation can be expressed as dπt (X) = Ft (X)dt + Gt (X)dYt 62 where

jt (X) = U (t)∗ XU (t), πt (X) = E[jt (X)|ηt ] ηt = σ{Ys : s ≤ t}

is the output measurement Abelian algebra at time t. U (t) is generated by the Hudson-Parthasarathy noisy Schrodinger equation and expectations are calculated in the pure state |f φ(u) > where f is a normalized system vector and φ(u) is a normalized coherent vector (See the paper by John Gough and Kostler). Ft (X), Gt (X) are linear functions of the system observable and belong to the measurement algebra ηt . More precisely, we can express the above ﬁltering equation as dπt (X) = πt (θ0 (X))dt + [πt (L1 X + XL∗1 ) − π(L2 )π(X)](dYt − πt (L3 X + XL∗3 )dt)

t where L2 is a Hermitian matrix. For quadrature measurements, Zt = Yt − 0 πs (L3 X + XL∗3 )ds is a Brownian motion process. θ0 is the Gorini-Kossakowski-Sudarshan-Lindblad (GKSL) generator on the system operator space. More generally, for general non-demolition measurements, like photon counting and a combination of quadrature and photon counting measurements, the Belavkin equation has the form dπt (X) = πt (θ0 (X))dt + [ fk (πt (Mk ))πt (θk (X))](dYt − gk (πt (Nk ))πt (φk (X))dt) k≥1

k≥1

where θk , φk , k ≥ 1 are linear operators on the Banach space of system operators. If we write ρt for the density operator on the system space conditioned on the measurements upto time t, we have πt (X) = T r(ρt X) ρt should be viewed as a random density matrix on the system operator algebra where the randomness comes from conditioning on the measurements upto time t. Thus, the above Belavkin equation reads dρt = θ0∗ (ρt )dt + [ fk (T r(ρt Mk ))θk∗ (ρt )][dYt − gk (T r(ρt Nk ))φ∗k (ρt )dt] k≥1

k≥1

It is to be noted that the process Mt = Yt −

t

0 k≥1

gk (T r(ρs Nk ))φ∗k (ρs )ds

is a Martingale (Ph.d thesis of Luc Bouten). The above equation for ρt is called a Stochastic Schrodinger Equation. Now, we wish to remove the noise from the above Belavkin equation by an appropriate control so that the evolution equation of the density has just the ﬁrst term θ0∗ (ρt ), ie, we wish to recover the GKSL equation. Consider now the control unitary Uc (t) = exp(iW Yt ) where W is a system observable and Yt the above output measurement. Assume for simplicity, that we take quadrature measurements, ie, Mt is a Wiener process. We apply Uc (t) to ρt to get ρc,t = Uc (t)ρt Uc (t)∗ Then, by Quantum Ito’s formula, dρc,t = dUc (t)ρt Uc (t)∗ + Uc (t)dρt Uc (t)∗ + Uc (t)ρt dUc (t)∗ + dUc (t)dρt Uc (t)∗ + Uc (t)dρt dUc (t)∗ [55] Lyapunov’s stability theory with application to classical and quantum dynamical systems. [56] Imprimitivity systems as a description of covariant observables under a group action. Construction of imprimitivity systems, Wigner’s theorem on the automorphisms of the orthogonal projection lattice. (Ω, F, pμ) is a measure space. P is a spectral measure on this space, ie, for each E ∈ F, P (E) is an orthogonal projection operator in a Hilbert space H. The set of all orthogonal projections on H is denoted by P(H). It is also called the projection lattice. Let τ be an automorphism of P(H), ie, if τ : P(H) → P(H) is such that if P1 , P2 , ... are mutually orthogonal, then τ ( j Pj ) = j τ (Pj ). More generally, we require that τ (max(P1 , P2 )) = max(τ (P1 ), τ (P2 )) and τ (min(P1 , P2 )) = min(τ (P1 ), τ (P2 )) for any two P1 , P2 ∈ P(H). Then, Wigner proved that there exists a unitary or antiunitary operator U in H such that τ (P ) = U P U ∗ , P ∈ P(H)

= min(τ (Pand (P2 and τ (min(P 1 , P2 )) for 1 ), τEngineers Quantum Mechanics Scientists

65

It follows that if G group that acts on the measure space (Ω, F, P ) and g ∈ G → τg is a homomorphism from G into aut(P(H)) such that τg (P (E)) = P (g.E), then there exists a projective unitary-antiunitary representation g → Ug of G Quantum Mechanics 63 into U A(H) such that ∗ P (gE) = Ug P (E)Ug , g ∈ G, E ∈ F We note that

τg1 τg2 (P (E)) = P (g1 g2 E) = τg1 g2 (P (E))

so the requirement that g → τg be a homomorphism is natural from the viewpoint of covariant transformation of observables. Let now H be a subgroup of G. Consider the homogeneous space X = G/H. G acts transitively on X. Let μ be a quasi-invariant measure on X, ie, for any g ∈ G, the measures μ.g −1 and μ are absolutely continuous with respect to each other. Consider f ∈ L2 (X, μ). Consider for f ∈ L2 (X, μ), Ug f (x) = (dμ.g −1 /dμ)1/2 f (g −1 x), x ∈ X Then,

� Ug f (x) �2 dμ(x) ==

X

|f (g −1 x)|2 dμ.g −1 (x) =

X

|f (x)|2 dμ(x)

This proves that Ug is a unitary operator. It is easy to see that U is also a representation: Ug1 g2 f (x) = (dμ.(g1 g2 )−1 (x)/dμ)f (g2−1 g1−1 x) On the other hand,

Ug1 (Ug2 f )(x) = (dμ.g1−1 (x)/dμ)1/2 (Ug2 f )(g1−1 x) = (dμ.g1−1 (x)/dμ)1/2 (dμ.g2−1 g1−1 (x)/dμ.g1−1 x)1/2 f (g2−1 g1−1 x) = (dμ.g2−1 g1−1 (x)/dμ)f (g2−1 g1−1 x)

Thus,

Ug1 g2 = Ug1 .Ug2 , g1 , g2 ∈ G

Let A(g, x) be a map from G × X into the algebra of linear operators in a Hilbert space � such that A(g1 g2 , x) = A(g1 , g2 x)A(g2 , x) Then consider the operator Ug deﬁned on L2 (μ, h) Ug f (x) = (dμ.g −1 (x)/dμ)1/2 A(g, g −1 x)f (g −1 x) We see that

Ug1 (Ug2 f )(x) = (dμ.g1−1 (x)/dμ)1/2 A(g1 , g1−1 x)(Ug2 f )(g1−1 x) −1.g1−1

= (dμ.g1−1 (x)/dμ)1/2 A(g1 , g1−1 x)(dμ.g2

(x)/dμ.g1−1 )1/2 A(g2 , g2−1 g1−1 x)f (g2−1 g1−1 x)

= (dμ.g2−1 g1−1 (x)/dμ)1/2 A(g1 g2 , g2−1 g1−1 x)f ((g1 g2 )−1 x) = Ug1 g2 f (x) Thus, g → Ug is a representation of G in L (X, h). 2

[57] Schwinger’s analysis of the interaction between the electron and a quantum electromagnetic ﬁeld. Let A(t, r) denote the vector potential corresponding to a quantum electromagnetic ﬁeld. The dynamical variables q, p of the electron bound to the nucleus commute with A. Let Φ(t, r) denote the quantum scalar potential of the quantum t electromagnetic ﬁeld. Note that if we adopt the Lorentz gauge, Φ = −c2 0 divAdt while if we adopt the Coulomb gauge, then divA = 0, Φ = 0 where for the latter, we are assuming that there is no externally charged matter to generate the scalar potential. The Hamiltonian of the atom interacting with the quantum electromagnetic ﬁeld is given by H(t) = (p + eA)2 /2m + V (q) − eΦ + Hem where the ﬁeld Hamiltonian Hem has also been added. H(t) is thus the total Hamiltonian of the atom interacting with the quantum em ﬁeld. We have H(t) = p2 /2m + V (q) + ((p, A) + (A, p))/2m − eΦ + Hem We note that

(p, A) + (A, p) = 2(A, p) − i.div(A) = −2i(A, ∇) − idiv(A)

Schrodinger’s equation for the wave function ψ(t) of the atom and ﬁeld is given by iψ � (t) = H(t)ψ(t)

Quantum Mechanics for Scientists and Engineers

66 64

Making the transformation

ψ(t) = exp(−itHem ))φ1 (t)

and assuming the Coulomb gauge gives ˜ p)/m)φ1 (t) iφ�1 (t) = (p2 /2m + V (q) + (A, where

A˜ = exp(itHem ).A.exp(−itHem )

Making another transformation

φ1 (t) = exp(−iS(t))φ2 (t)

where S(t) is linear in the creation and annihilation operators of the em ﬁeld, we get φ�1 (t) = exp(−iS(t))(−iS � (t) + [S(t), S � (t)]/2)φ2 (t) + exp(−iS(t))φ�2 (t) since [S(t), S � (t)] is a c-number function of time owing to the commutation relations between the creation and annihilation t t ˜ ˜ operators of the em ﬁeld. Taking S(t) = (p, 0 Adt)/m = (p, Z(t)) where Z(t) = 0 Adt/m gives iφ�2 (t) = exp(iS(t))(p2 /2m − S � (t) − i[S(t), S � (t)]/2 + V (q) + S � (t))exp(−iS(t))φ2 (t) = (p2 /2m + V (q + Z(t)))φ2 (t) where the c-number function −i[S(t), S � (t)] has been neglected as it only gives an additional phase factor to the wave function. (1)

[58] Quantum control: The system observable X evolves to jt (X) after time t. The desired system observable Xd (2) evolves to jt (Xd ). Here, (k) jt = (Hk (t), Lk ), k = 1, 2 which means that

(k)

jt (Z) = Uk (t)∗ ZUk (t), k = 1, 2, Z = X, Xd

where Uk (t) satisﬁes the HP equation dUk (t) = (−(iHk (t) + Qk )dt + Lk dA(t) − L∗k dA(t)∗ )Uk (t), Qk = Lk L∗k Deﬁne ˜= X ˜ = jt (X)

X 0

(1)

0 Xd

,

jt (X) 0 (2) 0 jt (Xd )

Let h denote the system Hilbert space and Γs (L2 (R+ )) the Boson Fock space. jt is a ∗ unital homomorhism from the algebra B(h) ⊕ B(h) into the algebra (B(h ⊗ Γs (L2 (R+ ))) ⊕ B(h × Γs (L2 (R+ ))). Deﬁne the operators H(t) = diag[H1 (t), H2 (t)], L = diag[L1 , L2 ], U (t) = diag[U1 (t), U2 (t)], Q = diag[Q1 , Q2 ] Then, the HP equations for Uk , k = 1, 2 can be expressed as a single qsde: dU (t) = [−(iH(t) + Q)dt + LdA(t) − L∗ dA(t)∗ ]U (t) ˜ A non demolition measurement for the ˜ in B(h) ⊕ B(h) evolve to U (t)∗ XU ˜ (t) = jt (X). The corresponding observables X ˜ in the sense of Belavkin is given by process jt (X) Y o (t) = U (t)∗ Y i (t)U (t), Y i (t) = A(t) + A(t)∗ It satisﬁes the sde

dY o (t) = dY i (t) + dU (t)∗ dY i (t)U (t) + U (t)∗ dY i (t).dU (t) = dY i (t) − U (t)∗ (L + L∗ )U (t)dt = dY i (t) + jt (S)dt = dA(t) + dA(t)∗ + jt (S)dt

Quantum Mechanics Mechanics for Scientists and Engineers Quantum

6567

where

S = −(L + L∗ )

Let ηt = σ(Yso : s ≤ t). Then ηt is an Abelian Von-Neumann algebra and we deﬁne the conditional expectation ˜ t ) = πt (X) ˜ Ejt (X)|η We can assume that

dπt (Z) = Ft (Z)dt + Gt (Z)dY o (t)

where Ft (Z) and Gt (Z) are in ηt . Then, its evident [See the paper by Gough and Koestler] that E[(jt (Z) − πt (Z))dY o (t)|ηt ] + E[(djt (Z) − dπt (Z))dY o (t)|ηt ] = 0 − − − (1) E[(djt (Z) − dπt (Z))|ηt ] = 0 − − − (2)

We can write

djt (Z) = jt (θ0 (Z))dt + jt (θ1 (Z))dA(t) + jt (θ2 (Z))dA(t)∗

where for Z = diag[Z1 , Z2 ], we have (1)

(2)

jt (Z) = diag[jt (Z1 ), jt (Z2 )] and

(1)

(1)

(1)

(1)

(2)

(2)

(2)

(2)

djt (Z1 ) = jt (θ10 (Z1 ))dt + jt (θ11 (Z1 ))dA(t) + jt (θ12 (Z1 ))dA(t)∗ djt (Z2 ) = jt (θ20 (Z2 ))dt + jt (θ21 (Z1 ))dA(t) + jt (θ22 (Z2 ))dA(t)∗

Thus,

θk (Z) = diag[θ1k (Z1 ), θ2k (Z2 )], k = 0, 1, 2

[59] Quantum error correcting codes: Let H = CN be the Hilbert space of the quantum system and let C be a subspace of H. C is called the code subspace. If ρ is a density operator in H, we say that ρ ∈ C if Range(ρ) ⊂ C. If N is a code, if there exist subspace of the space CN ×N of all linear operators in H, then we say C is an N error correcting operators R0 , ..., Rr ∈ CN ×N such that k Rk∗ Rk = I and whenever E1 , ..., Er ∈ N are such that k Ek Ek∗ = I, and ρ ∈ C, then Rk E(ρ)Rk∗ = cρ k

where c ∈ C and

E(ρ) =

Ek ρEk∗

k

is the output state of the noisy channel {Ek }. {Rk } are called recovery operators for the noise subspace N and the code subspace C. If there exist such recovery operators, then we say that C is an N error correcting code. Theorem (Knill-Laﬂamme): C is an N error correcting code iﬀ P N2∗ N1 P = λ(N2 ∗ N1 )P for all N1 , N2 ∈ N and some complex scalar λ(N2∗ N1 ) (dependent on N2∗ N1 ). Here, P is the orthogonal projection onto C. Proof: Suppose ﬁrst that C is an N error correcting code with recovery operators R0 , ..., Rr . Let Ek , k = 1, 2, ..., m be a quantum channel in N . Then we have for all |ψ >∈ C the relation Rk Es |ψ >< ψ|Es∗ Rk∗ = λ(ψ)|ψ >< ψ| k,s

where λ(ψ) is a complex scalar possibly dependent on |ψ >. It follows that for all |ψ >⊥ |ψ >, ie, < φ|ψ >= 0, we have | < φ|Rk Es |ψ > |2 = λ| < φ|ψ > |2 = 0 k,s

Thus, ie,

Rk Es |ψ >⊥ |φ > ∀|φ >⊥ |ψ Rk Es |ψ >= βks (ψ)|ψ >, ∀|ψ >∈ C

Quantum Mechanics for Scientists and Engineers

68 66

and some complex numbers βks (ψ). By linearity of the operators, it is clear that βks cannot depend on |ψ >. We get therefore, Rk Es P = βks P, ∀k, s

Thus,

P Eq∗ Rk∗ Rk Es P = βks β¯kq P ∗ and summing this equation over k and using k Rk Rk = I, we get P Eq∗ Es P = aqs P, aqs ∈ C

We have thus proved that

P N2∗ N1 P =∝ P, ∀N1 , N2 ∈ N

Conversely, suppose that this relation holds. Then, let

N0 = {N ∈ N : λ(N ∗ N ) = 0} It is clear that N0 is a subspace of N . Indeed, suppose N1 , N2 ∈ N0 . Then, P Nk∗ Nk P = λ(Nk∗ Nk )P = 0, k = 1, 2 and hence

Nk P = 0, k = 1, 2

Thus,

P Nj∗ Nk P = 0, j, k = 1, 2

and hence,

P (c1 N1 + c2 N2 )∗ (c1 N1 + c2 N2 )P = 0, .c1 , c2 ∈ C

Now, deﬁne for N ∈ N ,

[N ] = N + N0 ∈ N /N0

We deﬁne

< [N1 ], [N2 ] >= λ(N1∗ N2 ), N1 , N2 ∈ N

This deﬁnition is valid since for N0 ∈ N0 and N ∈ N we have

λ(N ∗ N0 ) = λ(N0∗ N ) = 0 This is because,

P N0∗ N0 P = λ(N0∗ N0 )P = 0

implies

P N0∗ = N0 P = 0

and hence,

0 = P N0∗ N P = λ(N0∗ N )P, 0 = P N ∗ N0 P = λ(N ∗ N0 )P

It is clear therefore that < ., . > deﬁnes an inner product on N /N0 . So, we can choose an onb {[Nk ], k = 1, 2, ..., r} for N /N0 w.r.t. this inner product. Now, deﬁne Rk = P Nk∗ , Pk = Nk P Nk∗ , k = 1, 2, ..., r, R0 = I − Then,

r

Pk

k=1

Pk Pj = Nk P Nk∗ Nj P Nj∗ = λ(Nk∗ Nj )Nj P Nj∗ = δkj Pj

proving that Pj , j = 1, 2, ..., r for a set of mutually orthogonal orthogonal projections in H. Thus R0 is also an orthogonal projection. Now, suppose ρ ∈ C. Then ρ = P ρ = ρP = P ρP . Then, for N ∈ N , Rk N ρN ∗ Rk∗ = P Nk∗ N P ρP N ∗ Nk P + R0 N ρN ∗ R0 k

k≥1

Quantum Mechanics for Scientists and Engineers

69 67

=

k≥1

|λ(Nk∗ N )|2 ρ + R0 N ρN ∗ R0

= λ(N ∗ N )ρ + R0 N ρN ∗ R0

Here, we have used the Bessel equality/Parseval relation for orthonormal bases for a Hilbert space. Further, R0 N P = N P − Pk N P = N P − Nk P Nk∗ N P k

= NP −

k

λ(Nk∗ N )Nk P

k

= NP − NP = 0

by the generalized Fourier series in a Hilbert space and the relation N0 P = 0 for all N0 ∈ N . Thus, we ﬁnally get r

Rk∗ N ρN ∗ Rk∗ = λ(N ∗ N )ρ

k=0

and further,

r

k=0

Rk∗ Rk =

Nk P Nk∗ + R0 =

k≥1

k≥1

Pk + I −

Pk = I

k≥1

This completes the proof of the Knill-Laﬂamme theorem. Construction of quantum error correcting codes using imprimitivity systems: Let A be a ﬁnite Abelian group and deﬁne the onb |x >, x ∈ A for L2 (A) so that |x >= [δx,0 , ..., δx,N −1 ]T where we are assuming

A = {0, 1, ..., N − 1}

with addition modulo N . with each n ∈ A, deﬁne the character

χn (x) = exp(2πinx/N ), x ∈ A We may identify this character with n ∈ A. In this way, we get an isomorphism of Aˆ with A and we write < n, x >=< x, n >= χn (x) Deﬁne the unitary operators Ux , Vx on L2 (A) by Ua |x >= |x + a >, Va |x >=< a, x > |x > Then deﬁne the Weyl operator

W (x, y) = Ux Vy , x, y ∈ A

on L (A). These are N unitary operators and in fact, as we shall soon see, they form an orthogonal basis for L2 (A × A), for the space of all linear operators in L2 (A). We derive the Weyl commutation relations: 2

2

W (x, y)|a >= Ux Vy |a >= Ux < y, a > |a >=< y, a > |a + x > Thus, Also,

Vy Ux |a >= Vy |a + x >=< y, a + x > |a + x >=< y, a >< y, x > |a + x > Vy Ux =< y, x > Ux Vy =< x, y > Ux Vy W (x, y)W (u, v) = Ux Vy Uu Vv =< y, u > Uu+x Vv+y =< y, u > W (u + x, v + y)

So (x, y) → W (x, y) is a projective unitary representation of the Abelian group G = A × A into the space of operators in L2 (A). Now, W (x, y)|a >=< y, a > |a + x >

Quantum Mechanics for Scientists Engineers Quantumand Mechanics

70 68

Hence, T r(W (x, y)) =

< a|W (x, y)|a >=

a

< y, a > δa+x,a

a

It follows that T r(W (x, y)) = 0 if x �= 0 and if x = 0, then T r(W (x, y)) = T r(W (0, y)) = for all x, y ∈ A, T r(W (x, y)) = N δx,0 δy,0

a

< y, a >= N δy,0 Thus,

It follows that W (x, y)∗ W (u, v) = V−y U−x Uu Vv = V−y Uu−x Vv =< y, x − u > Uu−x Vv−y =< y, x − u > W (u − x, v − y) Thus,

T r(W (x, y)∗ W (u, v)) =< y, x − u > T r(W (u − x, v − y)) = 0, (u, v) �= (x, y)

and further,

T r(W (x, y)∗ W (x, y)) = T r(I) = N

√

Thus, {W (x, y)/ N : x, y ∈ A} forms an orthonormal basis for L2 (A × A) and in particular, this is a set of N 2 linearly independent operators. We can express this orthonormality relations as T r(W (x, y)∗ W (u, v)) = N δ[x − u]δ[y − v] Now let H be a Gottesman subgroup of G = A × A. This means that for each pair (u, v), (x, y) ∈ H, the identity < u, y >∗ < v, x >= 1 holds. For arbitrary (u, v) ∈ G, we deﬁne the map ωH (u, v) : H → C such that ωH (u, v)(x, y) =< u, y >∗ < v, x > Then we have

ωH (u, v) = 1, (u, v) ∈ H

In other words, H ⊂ Ker(ωH ) = K say. Now we have that for (x, y), (u, v) ∈ H, W (x, y)W (u, v) =< y, u > W (x + u, y + v), W (u, v)W (x, y) =< v, x > W (u + x, v + y) and since

< y, u > / < v, x >=< y, u >< v, x >∗ = 1

it follows that

[W (x, y), W (u, v)] = 0∀(x, y), (u, v) ∈ H

Thus, there exists an onb for L2 (A) such that relative to this basis, W (x, y) is diagonal for all (x, y) ∈ H. W (x, y) = diag[αk (x, y), k = 1, 2, ..., N ], (x, y) ∈ H

˜ (x, y), (x, y) ∈ G by the equation Deﬁne W

˜ (x, y) W (x, y) = α1 (x, y)W

Note that |αk (x, y)| = 1∀k and also for (x, y), (u, v) ∈ H we have

˜ (x, y)W ˜ (u, v) = (α1 (x, y)α1 (u, v))−1 W (x, y)W (u, v) = W (α1 (x, y)α1 (u, v))−1 < y, u > W (x + u, y + v) =

Now the relation

< y, u > α1 (x + u, y + v) ˜ W (x + u, y + v) α1 (x, y)α1 (u, v)

W (x, y)W (u, v) =< y, u > W (x + u, y + v)

implies using the diagonal representation when restricted to H that α1 (x, y)α1 (u, v) =< y, u > α1 (x + u, y + v)

Quantum Quantum Mechanics Mechanics for Scientists and Engineers

and hence

6971

˜ (x, y)W ˜ (u, v) = W ˜ (x + u, y + v), (x, y), (u, v) ∈ H W

ie, the projective unitary representation W of G reduces to an ordinary unitary representation when restricted to H. An example: Let F be a ﬁnite ﬁeld and consider the vector spaces V = Fp . Let L : V → V and M : V → V be two linear transformations such that LT M : V → V is symmetric. Deﬁne N : V → V by LT M = N + N T . For x, y ∈ V , we deﬁne the Weyl operator W (x, y) : L2 (V ) → L2 (V ) in the usual way. We note that V is a ﬁnite vector space over F and if F consists of a elements, then V will consist of ap elements and dimL2 (V ) = ap . Now, let χ0 be a character of the ﬁeld F viewed as an Abelian group under addition. Consider for a ﬁxed a ∈ V ˜ (u) = χ0 (aT u + uT N u)W (Lu, M u), u ∈ V W

Now

˜ (u + v) = χ0 (aT (u + v) + uT N u + v T N v)W (Lu, M u)W (Lv, M v) W χ0 (aT (u + v) + uT N u + v T N v)χ0 (uT M T Lv)W (L(u + v), M (u + v)) = χ0 (aT (u + v) + uT N u + v T N v + uT (N + N T )v)W (L(u + v), M (u + v)) = χ0 (aT (u + v) + (u + v)T N (u + v))W (L(u + v), M (u + v)) ˜ (u + v) =W

provided that we assume that the Weyl operator W (x, y) has been deﬁned so that in the expression W (x, y)W (u, v) =< y, u > W (x + u, y + v), < y, u >= χ0 (y T u) or in other words, the character of V corresponding to any y ∈ V when V is viewed as an Abelian group under addition, ˜ (u) is a unitary representation of V . We note that H = {(Lu, M u) : u ∈ V } is a is given by u → χ0 (y T u). Thus u → W Gottesman subgroup of V × V . This is because, for u, v ∈ V , ¯0 (uT LT M v)χ0 (uT M T Lv) = χ0 (uT (M T L − LT M )v) = 0 < Lu, M v >∗ < M u, Lv >= χ since

χ ¯0 (x) = χ0 (x)−1

Construction of quantum error correcting codes using the Imprimitivity theorem of Mackey: Let G be a ﬁnite group acting on a ﬁnite set X. For each x ∈ X let Px be an orthogonal projection onto a Hilbert space H such that Px Py = δx,y I, x∈X Px = I. Assume that g → Ug is a unitary representation of G in H satisfying the Imprimitivity condition Ug Px Ug∗ = Pgx , g ∈ G, x ∈ X

Let E ⊂ X and then we have

Ug P (E)Ug∗ = P (gE), g ∈ G

Now let N be a linear space of linear operators in H spanned by {Ug P (E) : g ∈ G} where E ⊂ X is ﬁxed. We choose x ∈ X and ask the question when does the quantum code Px detect N . This happens iﬀ Px Ug P (E)P (E)Ug∗ Px is proportional to Px for all g, h ∈ G. This is the same as saying that Px P (gE)Px = λPx This happens iﬀ either x ∈ / gE in which case, the lhs is zero, or if gE = x for all g ∈ G which is impossible. Thus, Px corrects N iﬀ g −1 x ∈ / E. Now we take F ⊂ X and derive the conditions for P (F ) to detect N . This happens iﬀ P (F )Ug P (E)P (E)Ug∗ P (F ) is proportional to P (F ), ie, iﬀ P (F ∩ gE) is proportional to P (F ). This happens iﬀ either gE ∩ F = φ or gE ⊂ F . Now we consider the correction problem. P (F ) corrects N iﬀ for all g, h ∈ G, P (F )Ug P (E)P (E)Uh∗ P (F ) = λP (F )

Quantum Mechanics for Scientists and Engineers

72

where λ may depend on g, h. This happens iﬀ P (F )P (gE)P (gh−1 F )Uhg−1 = λP (F ) or equivalently, iﬀ

P (F ∩ gE ∩ gh−1 F )Uhg−1 = λP (F )

Post-multiplying both sides of this equation by their adjoints gives us

P (F ∩ gE ∩ gh−1 F ) = |λ|2 P (F ) This is a necessary condition for P (F ) to correct N and happens only if for each g, h either F ∩ gE ∩ gh−1 F = φ or / gE or F ⊂ gE ∩ gh−1 F . As a special case, choosing F = {x}, we see that Px corrects N only if for each g, h either x ∈ or x = gh−1 x. We also derive the following conclusions from the above discussion. Px corrects N iﬀ for each g, h, either x∈ / gE or x = gh−1 x and Ugh−1 Px = λPx We also observe that if Ua Px = λPa for some a ∈ G, then, Ua Px Ua∗ = |λ|2 Px or equivalently,

Pax = |λ|2 Px

which implies that |λ| = 1 and ax = x. Thus if we deﬁne

Gxeig = {g ∈ G : Ug Px = λPx } and then we ﬁnd that

Gxiso = {g ∈ G : gx = x} Gxeig ⊂ Gxiso

and that Px corrects N = span{Ug P (E) : g ∈ G} iﬀ for each g, h ∈ G (either x ∈ / gE or gh−1 ∈ Gxeig }) In particular, Px corrects span{Ug Py : g ∈ G} iﬀ for each g, h ∈ H, (either x �= gy or gh−1 ∈ Gxeig ). As a special case, Px corrects span {Ug Px : g ∈ G} if for each g, h ∈ G either g ∈ / Gxiso or gh−1 ∈ Gxeig . Let C be a cross section of G/Gxiso . Let c1 , c2 ∈ C, g, h ∈ Gxeig . Then Px Uc∗1 g Uc2 h Px = Px Ug−1 c−1 c2 h Px = λPx 1

since ⊂ . Hence, Px corrects span{Ug : g ∈ for each x ∈ G. A linear algebraic example of a quantum error correcting code. Let the code subspace projection P be given by Ir 0 P = ∈ Cn×n 0 0 Gxeig

Gxiso

CGxeig }

Let U be a ﬁxed n × n unitary matrix and let the noise subspace of operators N be the span of the adjoints of all n × n matrices Nk having the block structure λkj Ak λkj Bk Nkj = , k = 0, 1, ..., n/k − 1 Ckj Dkj where [Ak |Bk ] ∈ Cr×(n−r) is obtained as the kr+1 to (k+1)r rows of U arranged one below the other (k = 0, 1, ..., (n/k)− 1) and the λkj are arbitrary complex numbers. We ﬁnd that since ∗ = δkm Ir Ak A∗m + Bk Bm

(obtained using U U ∗ = In ), we have ∗ Nkj Nml =

¯ kl δkm Ir λkj λ F2

F1 F3

Quantum Mechanics for Scientists and Engineers

73

where F1 , F2 , F3 are matrices of size r × (n − r), (n − r) × r and (n − r) × (n − r) respectively. Thus, ∗ ¯ kl P P = δkm λkj λ P Nkj Nml

and hence the quantum code P corrects N . [60] Quantum hypothesis testing: Let A, B be two density matrices and let 0 ≤ T ≤ I. The probability of making a correct decision when the density is A and the measurement is T is given according to quantum mechanical rules by T r(AT ). The probability of making a correct decision when the density is B is given by T r(B(I − T )). Now we wish to choose T such that T r(AT ) is a maximum subject to the constraint T r(BT ) ≤ α where α ∈ [0, 1) is given. Such a test corresponds to minimizing the error probability under A subject to the constraint that the error probability under B is smaller than a prescribed threshold. The function c → T (c) = {A ≥ cB} is assumed to be continuous on [0, ∞). We note that T (0) = I, T (∞) = 0 (By {U ≥ V } for Hermitian operators U, V , we mean the orthogonal projection onto the space spanned by all vectors v for which (U − V )v = λv for some λ ≥ 0.) Hence, T r(BT (c)) takes all values in [0, 1) as c varies over [0, ∞). Let α ∈ [0, 1) and choose c such that T r(BT (c)) = α. Then, we claim that T (c) is an optimal test. To see this, suppose 0 ≤ T ≤ I is any other test (ie measurement or positive operator) such that T r(BT ) ≤ α. Then T r(AT ) = T r((A − cB)T ) + cT r(BT ) ≤ T r((A − cB)T ) + cα = T r((A − cB)T (c)T ) + T r((A − cB){A < cB}T ) + cα ≤ T r((A − cB)T (c)) + cα = T r(AT (c))

Here, we make use of the fact that since A − cB commutes with T (c) and T (c)2 = T (c), we have T r((A−cB)T (c)T ) = T r(T (c)(A−cB)T (c)T ) = T r(T 1/2 T (c)(A−cB)T (c)T 1/2 ) ≤ T r(T (c)(A−cB)T (c)) = T r((A−cB)T (c))

since 0 ≤ T 1/2 ≤ I and T (c)(A − cB)T (c) ≥ 0. Now, we can derive bounds on the error probabilities. We have for s ≥ 0, P1 (c) = T r(A(I − T (c)) = T r(A{A < cB}) ≤ cs T r(A1−s B s ) Thus,

log(P1 (c)) ≤ s.log(c) + log(T r(A1−s B s ) = s(R + log(A1−s B s )/s)

where

R = log(c)

Consider now the function

f (s) = sR + log(T r(A1−s B s ), s ≥ 0

We wish to select R so that f (s) has a minimum at s = 0. For this, we require that f � (0) = 0, f �� (0) ≥ 0. Now, f � (0) = R − D(A|B) where

D(A|B) = T r(A(logA − logB))

is the relative entropy between A and B. Thus, the ﬁrst condition gives R = D(A|B). The second condition f �� (0) ≥ 0 gives d [T r(−A1−s log(A)B s + A1−s B s log(B))/T r(A1−s B s )]|s=0 ≥ 0 ds or equivalently, T r(A(log(A))2 − 2Alog(A)log(B) + A(log(B)2 ) + (T r(Alog(A) − Alog(B)))2 ≥ 0 which is true since by the Schwarz inequality, |T r(Alog(A)log(B))|2 ≤ T r(A(log(A))2 ).T r(A.(log(B))2 ) It follows that

P1 (eR ) = 0

if R = D(A|B) and for the second kind of error probability, we get P2 (c) = T r(BT (c)) = T r(B{A < cB}) = 1 − T r(B{A > cB})

Quantum Mechanics for Scientists and Engineers

74

Also, for s ≤ 1

T r(B{A > cB}) ≤ cs−1 T r(A1−s B s )

so We now ﬁnd on letting s = 0 that

log(1 − P2 (c)) ≥ (s − 1)log(c) + log(T r(A1−s B s )) log(1 − P2 (c)) ≥ −log(c) + 1

so if we put R = log(c), we get

log(1 − P2 (eR )) ≥ 1 − R

or equivalently,

P2 (eR ) ≤ 1 − exp(1 − R)

We also note that So, taking log(c) = D(A|B) − δ gives

lims→0 log(T r(A1−s B s )/s = −D(A|B)

log(1 − P2 (c)) ≥ −log(c) + s(log(c) + log(T r(A1−s B s ))/s) ≥ −D(A|B) + δ + s(D(A|B) − δ − D(A|B)) = −D(A|B) + (1 − s)δ

for s ≤ 1. We also recall that

log(P1 (c)) ≤ s(log(c) + log(T r(A1−s B s ))/s)

= s(D(A|B) − δ + D(A|B) + �(s)) = −s(δ − �(s))

where

�(s) → 0, s → 0

Hence, if s is suﬃciently small, say s ≤ s0 , then �(s) < δ/2 and for all such s, we have log(P1 (c)) ≤ −sδ/2 ≤ −s0 δ/2 Note: Consider the function We have

g(s) = log(T r(A1−s B s ))/s

g � (s) = T r(−A1−s log(A)B s + A1−s B s log(B))/(s.T r(A1−s B s )) − log(T r(A1−s B s ))/s2

The limit of this as s → 0 is the same as the limit of

−D(A|B)/s + D(A|B)/s as s → 0 which is zero. We also note that the limit of g �� (s) as s → 0 is the same as the limit of T r(A(log(A))2 + B(log(B))2 − 2Alog(A)log(B))/s + D(A|B)/s2 + (D(A|B)2 /s) + (D(A|B)/s2 ) + (2/s3 )log(T r(A1−s B s ) which is same as the limit of T r(A(log(A))2 + B(log(B))2 − 2Alog(A)log(B))/s + D(A|B)/s2 + (D(A|B)2 /s) + (D(A|B)/s2 ) − 2D(A|B)/s2 The above equals

T r(A(log(A))2 + B(log(B))2 − 2Alog(A)log(B))/s + (D(A|B)2 /s

which is positive. Hence, g(s) for s ≥ 0 attains its minimum value of D(A|B) at s = 0.

[61] The Sudarshan-Lindblad equation for observables on the quantum system has the following general form: X � = i[H, X] −

1 ∗ (Lk Lk X + XL∗k Lk − 2L∗k XLk ) 2

Assume that H=

k

1 T 1 p F1 p + q T F2 q 2 2

Chapter

)/s2 ) − 2D(A|B)/s2

7

Master Equations, Non-Abelian Gauge Fields T r(A(log(A))2 + B(log(B))2 − 2Alog(A)log(B))/s + (D(A|B)2 /s

[61] The Sudarshan-Lindblad equation for observables on the quantum system has the following general form: X � = i[H, X] −

1 ∗ (Lk Lk X + XL∗k Lk − 2L∗k XLk ) 2 k

Assume that H= and

1 T 1 p F1 p + q T F2 q 2 2

Lk = λTk p + μTk q, k ≥ 1

We have

p = ((pn )), q = ((qn )), [qn , pm ] = iδn,m L∗k Lk X

say. We thus have

+ XL∗k Lk − 2L∗k XLk = L∗k [Lk , X] + [X, L∗k ]Lk = θk (X) [Lk , qn ] = [λTk p, qn ] = −iλk [n], ¯ T p] = iλ ¯ k [n] [qn , L∗k ] = [qn , λ k

so so

¯T p + μ ¯ k [n](λT p + μT q) ¯Tk q) + iλ θk (qn ) = −iλk [n](λ k k k

θk (q) =

k

¯ k λT )p + (−iλμ∗ + iλ ¯ k μT )q] [(−iλk λ∗k + iλ k k k

k

=

[2Im(λk λ∗k )p + 2Im(λk μ∗k )q] k

Likewise, so so

¯Tk q] = −i¯ μk [n] [Lk , pn ] = [μTk q, pn ] = iμk [n], [pn , L∗k ] = [pn , μ ¯T p + μ ¯Tk q) − i¯ μk [n](λTk p + μTk q) θk (pn ) = iμk [n](λ k θk (p) = −2Im(μk λ∗k )p − 2Im(μk μ∗k )q

We write

θ=

θk

k

Then,

[H, qn ] = [pT F1 p/2, qn ] = −i ie, Likewise,

F1 [n, m]pm

m

[H, q] = −iF2 p [H, p] = iF1 q

[62] The Yang-Mills ﬁeld and its quantization using path integrals: Let G be a ﬁnite dimensional Lie group and g its Lie algebra. For simplicity, assume that G is a subgroup of U (n, C). Then g consists of n × n complex skew-Hermitian matrices. We can thus choose a basis {iτa : a = 1, 2, ..., n} for g with the τa� s Hermitian matrices. Thus, the commutation relations of these basis elements has the form [iτa , iτb ] = −C(abc)iτc

where the C(abc) are real constants and summation over the repeated index c is understood. Equivalently, these commutation relations can be expressed as [τa , τb ] = iC(abc)τc We let Aμ : R4 → g be the gauge ﬁelds. Thus, the covariant derivatives in terms of these gauge ﬁelds are deﬁned by where e is a real constant. We can write

∇μ = ∂μ − eAμ

Aμ (x) = iAaμ (x)τa

Quantum Mechanics for Scientists and Engineers

76

where the Aaμ (x)� s are now real valued ﬁelds. There are in all 4n of such ﬁelds. We assume that the wave function ψ(x) ∈ R4n satisﬁes an equation of the form [γ μ i∂μ ⊗ In + eAaμ (x)γ μ ⊗ τa − mI4n ]ψ(x) = 0 where γ μ are the four Dirac γ matrices forming a basis for the Cliﬀord algebra in C4×4 . Formally, we can write the above wave equation as γ μ (i∂μ − ieAμ ) − m)ψ = 0 or equivalently as

[γ μ (i∂μ + eAaμ τa ) − m]ψ = 0

This eqations can be derived from the variational principle

δψ S = 0 where S[ψ] =

ψ¯∗ γ 0 (γ μ (i∂μ + eAaμ τa ) − m)ψd4 x =

Ld4 x

Note that γ 0 γ μ are Hermitian matrices and so is γ 0 γ μ ⊗ τa . Hence using integration by parts, it follows that S[ψ] is real. In terms of the gauge covariant derivative deﬁned above, we have L = ψ ∗ γ 0 (iγ μ ∇μ − m)ψ We consider the following transformation of the wave function ψ(x) → ψ � (x) = g(x)ψ(x) where g(x) ∈ G is to be intepreted as I4 ⊗ g(x). This is called a local gauge transformation of the wave function. Then we consider a corresponding transformation of the gauge ﬁeld Aμ (x): Aμ (x) → A�μ (x) so that if then

∇�μ = ∂μ − eA�μ ∇�μ ψ � = g(x)∇μ ψ

If this happens, the the above Lagrangian density L will be invariant under a local gauge transformations of the wave function and the gauge ﬁelds. To get this, we must satisfy g(x)(∂μ − eAμ (x))ψ(x) = (∂μ − eA�μ (x))ψ � (x) which is equivalent to

= (∂μ − eA�μ (x))g(x)ψ(x) A�μ = gAμ g −1 + e−1 (∂μ g)g −1

Note that we get a gauge invariant Lagrangian by considering ψ ∗ γ 0 γ μ (∂μ −eAμ )ψ which is the same as ψ ∗ gamma0 γ μ ∇μ ψ or more precisely, ψ ∗ (γ 0 γ μ ⊗ ∇μ )ψ

we note that the gauge transformation g(x) is actually to be interpreted as I4 ⊗ g(x) and it acts only on the second component in the tensor product of C4 ⊗ Cn . [63] A general remark on path integral computations for gauge invariant actions. Suppose that we have an action integral I(φ) of the ﬁelds φ(x) and we compute a path integral of the form S = exp(iI(φ))B[φ]dφ

Quantum Mechanics for Scientists and Engineers

77

Suppose that the action integral I(φ) is invariant under a Gauge transformation φ → φΛ . Suppose also that the path integral measure dφ = Πx∈R4 dφ(x) is invariant under the same Gauge transformation. Then, we can write S = exp(iI(φΛ )B[φΛ ]dφΛ

=

exp(iI(φ))B[φΛ ]dφ

If follows that for an function C(Λ) on the gauge group (Λ ∈ G), we have S C(Λ)dΛ = exp(iI(φ))dφ B[φΛ ]C(Λ)dΛ Now let Λ, λ be two gauge group transformations. Then, we have B[φΛ ]C(Λ)dΛ =

B[φΛoλ ]C(Λoλ)dλ

assuming that the measure dλ on the gauge group is a left invariant Haar measure so that dΛoλ = dλ. Deﬁne for any functional F of the ﬁelds φ F(φ, x) = (δF (φΛ) )/δΛ)(x)|Λ=Id We have

(δF (φΛoλ )/δλ)(x) = (δF (φΛoλ )/δ(Λoλ))(x)(δ(Λoλ)/δλ)(x)

It follows that on evaluating both sides at λ = Id, the identity gauge transformation, we get F˜ (φ, Λ, x) = F (φ, Λ, x)G(Λ) where

F˜ (φ, Λ, x) = (δF (φΛoλ )/δλ)(x)|λ=Id ,

and

G(Λ) = (δ(Λoλ)/δλ)|λ=Id F (φ, Λ, x) = (δF (φΛ )/δΛ)(x)

Now consider S= =

exp(iI(φ))B[F [φ]]F (φ, x)Πdφ(x)

exp(iI(φ))B[F [φΛ ]]Πx F (φΛ , x)dφ(x)

provided that we assume invariance of the path measure exp(iI(φ))dφ under gauge transformations Λ, ie, exp(iI(φΛ ))dφΛ = exp(iI(φ))dφ It follows that S

= So by choosing C = G, we get

C(Λ)dΛ =

exp(iI(φ))B[F [φΛ ]]Πx F (φΛ , x)dφ(x)C(Λ)dΛ exp(iI(φ))B[F [φΛ ]](δF [φΛ ]/δΛ)G(Λ)−1 C(Λ)dΛ

S

C(Λ)dΛ =

Quantum Mechanics for Scientists and Engineers

78 76

exp(iI(φ))B[F [φ]]dF [φ] =

exp(iI(φ))B[φ]dφ

establishing the invariance of the scattering matrix S under the gauge ﬁxing functional F . [64] Calculation of the normalized spherical harmonics. Ylm (θ, φ) = Plm (cos(θ))exp(imφ) where f (x) = Plm (x) satisﬁes the modiﬁed Legendre equation ((1 − x2 )f � )� + (l(l + 1) − m2 /(1 − x2 ))f = 0 or equivalently, or

(1 − x2 )2 f �� − 2x(1 − x2 )f � + (l(l + 1)(1 − x2 ) − m2 )f = 0 (1 − x2 )2 f �� − 2x(1 − x2 )f � + (l(l + 1) − m2 − l(l + 1)x2 )f = 0

For simplicity of notation, let

λ = l(l + 1)

Then, the above equation is the same as (1 + x4 − 2x2 )f �� − 2x(1 − x2 )f � + (λ − m2 − λx2 )f = 0 Let

f (x) =

c(n)xn

n≥0

Then, we get on substituting this into the above diﬀerential equation, c(n)n(n − 1)(xn−2 + xn+2 − 2xn ) − 2 nc(n)(xn − xn+2 ) + (λ − m2 ) c(n)xn − λ c(n)xn+2 = 0 n

n

n

n

Equating coeﬃcients of xn gives (n + 2)(n + 1)c(n + 2) + (n − 2)(n − 3)c(n − 2) − 2n(n − 1)c(n) − 2nc(n) + 2(n − 2)c(n − 2) + (λ − m2 )c(n) − λc(n − 2) = 0 or or

(n + 2)(n + 1)c(n + 2) + ((n − 1)(n − 2) − λ)c(n − 2) + (λ − m2 − 2n2 )c(n) = 0 (n + 4)(n + 3)c(n + 4) + (n(n + 1) − l(l + 1))c(n) + (l(l + 1) − m2 − 2(n + 2)2 )c(n + 2) = 0

If l is not a non-negative integer, then there will be nonzero c(n)� s for arbitrarily large n and as n → ∞, we would get c(n + 4) + c(n) − 2c(n + 2) ≈ 0, n → ∞ or equivalently,

c(n + 4) − c(n + 2) − (c(n + 2) − c(n)) ≈ 0

which would imply that c(n + 2) − c(n) converges to a constant, say K. Then c(2n) or c(2n + 1) for large n behaves as Kn and since the series n≥0 nx2n behaves as x/(1 − x2 )2 which is not integrable over x ∈ [0, 1] since it has a singularity at x = 1 (ie, at θ = 0), it follows that the series has to terminate at some ﬁnite N , ie, there must exist a ﬁnite integer N such that c(n) = 0 for all n > N which is equivalent to saying that f (x) must be a polynomial. This can happen only if we impose the condition l = N and c(N + 2) = 0. Assume ﬁrst that N = 2r is even. Then, we may assume that c(2n + 1) = 0 for all n and (2n + 4)(2n + 3)c(2n + 4) + (2n(2n + 1) − 2r(2r + 1))c(2n) + (2r(2r + 1) − m2 − 4(n + 1)2 )c(2n + 2) = 0 for n = 0, 1, ..., r − 1. Putting n = r we then get c(2r + 2) = 0 since we are assuming that c(2r + 4) = 0. Thus, c(2n) = 0 for all n > r and we get that r c(2n)x2n f (x) = n=0

Quantum Mechanics for Scientists and Engineers

79

where c(0) is arbitrary, c(2) is obtained by putting n = −1 in the above diﬀerence equation and using c(k) = 0, k < 0: c(2) = (m2 − 2r(2r + 1))c(0)/2 and (2n + 4)(2n + 3)c(2n + 4) = −[(2n(2n + 1) − 2r(2r + 1))c(2n) + (2r(2r + 1) − m2 − 4(n + 1)2 )c(2n + 2)]/(2n + 4)(2n + 3) for n = 0, 1, ..., r − 2. In a similar way, we can describe the polynomials Plm (x) for l odd, ie l = 2r + 1. We assume that c(2n) = 0 for all n and get using the recursion (n + 4)(n + 3)c(n + 4) + (n(n + 1) − (2r + 1)(2r + 3))c(n) + ((2r + 1)(2r + 3) − m2 − 2(n + 2)2 )c(n + 2) = 0 at n = −1,

6c(3) + ((2r + 1)(2r + 3) − m2 − 2)c(1) = 0

so that

c(3) = (m2 + 2 − (2r + 1)(2r + 3))c(1)/6

and then replacing n by 2n + 1 in the above recursion,

c(2n+5) = −[(2n+5)(2n+4)]−1 [((2n+1)(2n+3)−(2r+1)(2r+3))c(2n+1)+((2r+1)(2r+3)−m2 −2(2n+3)2 )c(2n+3)] for n = 0, 1, ..., r − 2 and we may assume that c(2n + 1) = 0 for n = r + 1, r + 2, .... The values of c(0) and c(1) for the two cases are determined by the normalization condition 1 Plm (x)2 dx = 1 2π −1

This guarantees that

0

π

2π

0

|Ylm (θ, φ)|2 sin(θ)dθdφ = 1

A technique for calculating the representation matrix πl (R) for R ∈ SO(3). Here, πl (R) is deﬁned by l

Ylm (R−1 n ˆ) =

[πl (R)]m m Ylm (ˆ n)

m =−l

From this equation, it is clear that [πl (R)]m m =

For example, taking we have

Ylm (R−1 n ˆ )Y¯lm (ˆ n)dΩ(ˆ n)

R = Rx (β) R−1 [cos(φ)sin(θ), sin(φ)sin(θ), cos(θ)]T =

[cos(φ)sin(θ), sin(φ)sin(θ)cos(β) − cos(θ)sin(β), sin(φ)sin(θ)sin(β) + cos(θ)cos(β)]T

Denoting, this new point by (θ� , φ� ), we get

θ� = cos−1 (sin(φ)sin(θ)sin(β) + cos(θ)cos(β)),

We then have

φ� = tan−1 [(sin(φ)sin(θ)cos(β) − cos(θ)sin(β))/cos(φ)sin(θ)] [πl (Rx (β))]m m =

0

π

0

2π

Y¯lm (θ, φ)Ylm (θ� , φ� )sin(θ)dθdφ

A MATLAB programme for tabulating the matrix elements [πl (Rx (−β))]m m would then proceed along the following lines: We store [πl (Rx (−2πk/N ))]m m with k = 0, 1, ..., N − 1, m� , m = −l, −l + 1, ..., l as a two dimensional array A with matrix elements A[k + 1, (2l + 1)(m� + l) + m + 1] of size N × (2l + 1)2 . Thus,

Quantum Mechanics for Scientists and Engineers

80

for k = 0 : N − 1 for m� = −l : l for m = −l : l sum = 0; for r = 0 : N − 1 for s = 0 : N − 1 θ = πr/N φ = 2πs/N β = 2πk/N θ� = cos−1 (sin(φ)sin(θ)sin(β) + cos(θ)cos(β)), φ = tan−1 [(sin(φ)sin(θ)cos(β) − cos(θ)sin(β))/cos(φ)sin(θ)] �

sum = sum + conj(Ylm (θ, φ)) ∗ Ylm (θ� , φ� ) ∗ (π/N ) ∗ (2π/N ); end; end; A[k + 1, (2l + 1)(m� + l) + m + 1] = sum; end; end; end;

[65] Volterra systems in quantum mechanics: The Hammiltonian has the form H(t) = H0 + f (t)V0 Let U (t) be the Schrodinger evolution operator: U � (t) = −iH(t)U (t), U (0) = I Then,

U (t) = U0 (t)W (t), U0 (t) = exp(−itH0 )

and W (t) has the Dyson series W (t) = I +

(−i)n

f (t1 )...f (tn )V (t1 )...V (tn )dt1 ...dtn

0 ua (t) =< f φ(u)|jt (θ˜ba (X)θ˜dc (X))|f φ(u) > �bc ud (t)¯ uk (t) =< f φ(u)|jt (θjk (X)θpj (X)|f (φ(u) > up (t)¯ ¯k (t) + < f φ(u)|jt (θjk (X)θ0j (X))|f φ(u) > u + < f φ(u)|jt (θj0 (X)θkj (X))|f φ(u) > uk (t)

[66] Quantum scattering theory, the wave operators and the scattering matrix. A is the free particle Hamiltonian and B = A + V is the Hamiltonian after the free particle starts interacting with the scatterer. Let |φi > be the ”in state” ie the free particle state at time t = 0 after it arrives from an inﬁnite distance from the scatterer at time t = −∞. Then, we see that if |ψi > is the scattered state at time t = 0 ie after the particle interacts with the scatterer, we must have limt→−inf ty exp(−itB)|ψi > −exp(−itA)|φi >= 0 so where

|ψi >= Ω− |φi > Ω− = limt→−∞ exp(itB)exp(−itA)

Likewise let |φo > be the ”out-state”, ie the free particle state at time t = 0 which evolves under the free particle Hamiltonian A to the scattered state at time t = ∞. Let |ψo > be the corresponding scattered state at time t = 0. Then we must have limt→∞ exp(−itB)|ψo > −exp(−itA)|φo >= 0

Quantum Quantum Mechanics Mechanics for Scientists and Engineers

8385

and hence

|ψo >= Ω+ |φo >

where

Ω+ = limt→∞ exp(itB)exp(−itA)

It should be noted that the wave operators Ω± must be deﬁned on appropriate domains. We have < ψo |ψi >=< φo |Ω∗+ Ω− |φi > and the magnitude square of this quantity gives the probability that the free particle starting at t = −∞ in the state |phio > interacts with the scatterer and then gets scattered at time t = ∞ to the state |φo > . This proability can be expressed as P (φi → φo ) = | < φo |S|φi > |2 where

S = Ω∗+ Ω−

is the scattering matrix or scattering operator. We have the following interesting relations: ∞ ∞ d (exp(itB)exp(−itA))dt = i exp(itB)V.exp(−itA)dt Ω+ − I = dt 0 0 0 d Ω− − I = − (exp(itB)exp(−itA))dt −∞ dt 0 = −i exp(itB)V.exp(−itA)dt −∞

= −i

∞

exp(−itB)V.exp(itA)dt

0

We can derive some formal expressoins for the wave operators based on the above formulas: Let A= λEA (dλ), B = λEB (dλ) R

R

be the spectral representations of the self adjoint operators A and B respectively. Then, we get from the above exp(it(B − lambda + i�))V EA (dλ)dt Ω+ − I = i [0,∞)×R

=− It follows that Ω+ = =

R

R

R

(B − λ + i�)−1 V EA (dλ)

(I − (B − λ + i�)−1 V EA (dλ)

(B − λ + i�)−1 ((A − λ + i�)−1 EA (dλ)

= (i�)−1

R

(B − λ + i�)−1 EA (dλ)

Example: Let A be self-adjoint and B = A + V where V is a bounded operator. Consider W (t) = exp(−itA).exp(itB) = exp(−itA).exp(it(A + V )) We know that W (t) has the Dyson series W (t) = I + U0 (−t1 )V U0 (t2 − t1 )V U0 (t3 − t2 )...U0 (tn − tn−1 )V U0 (−tn )dt1 ...dtn n≥1

where

0 ∀y ∈ D(S) where we have used D(S) ⊂ D(S ∗ ). Now, we show that zn is a convergent sequence. We have that lim(S−i)zn = (S ∗ −i)x exists and hence (S − i)zn is a Cauchy sequence, ie (S − i)(zn − zm ) → 0, n, m → ∞ and hence or equivalently, Now

� (S − i)(zn − zm ) �2 → 0 � S(zn − zm ) �2 + � (zn − zm ) �2 +2Im(< S(zn − zm ), zn − zm >) → 0 < Sz, z >=< z, Sz > ∀z ∈ D(S)

Quantum Mechanics for Scientists and Engineers 90

93

since S ⊂ S ∗ . Thus, Im(< Sz, z >) = 0∀z ∈ D(S). Thus, Im(< S(zn − zm ), zn − zm >) = 0. This proves that � S(zn − zm ) �2 + � (zn − zm ) �2 → 0 and hence

� zn − zm �→ 0

Thus, zn is Cauchy and hence convergent. Let zn → z. Then since (S +i)zn converges, it follows that Szn also converges. ¯ and we get Thus, z ∈ D(S) < x, (S + i)y >=< z, (S + i)y > ∀y ∈ D(S)

Thus,

x − z ⊥ R(S + i)

¯ Thus, we have proved that and since R(S + i) is dense in H, it follows that x = z ∈ D(S). ¯ D(S ∗ ) ⊂ D(S)

¯ ⊂ D(S ∗ ) ⊂ D(S) ¯ from which we deduce that D(S ∗ ) = D(S) ¯ Now S ⊂ S ∗ implies S¯ ⊂ S ∗ since S ∗ is closed. Thus, D(S) and therefore S¯ = S ∗ . Further, S ⊂ S ∗ impliesS ∗∗ ⊂ S ∗ = S¯ and since S ∗∗ is a closed extension of S, it follows that ¯ Thus, S ∗ = S¯ = S ∗∗ which proves that S¯ is self-adjoint, ie, (S) ¯ ∗ = S. ¯ S ∗∗ = S. Remarks: (a) If A, B are operators in H such that A ⊂ B, then B ∗ ⊂ A∗ . Indeed, let x ∈ D(B ∗ ). Then < B ∗ x, y >=< x, By >=< x, Ay >, ∀y ∈ D(A) ⊂ D(B). Hence, x ∈ D(A∗ ) and < B ∗ x − A∗ x, y >= 0∀y ∈ D(A). This proves that B ∗ x − A∗ x ⊥ D(A) and since A is densely deﬁned, it follows that B ∗ x − A∗ x = 0, ie, B ∗ x = A∗ x, proving the claim. (Note that we are assuming without any loss in generality that all operators are densely deﬁned). (b) If A is any operator, then A∗ is closed. Indeed, let xn ∈ D(A∗ ), xn → x, A∗ xn → y. Then for all z ∈ D(A), we have < A∗ xn , z >→< y, z >, < A∗ xn , z >=< xn , Az >→< x, Az > and hence, < y, z >=< x, Az > This proves that y ∈ D(A ) and A x = y, proving that A∗ is closed. ¯ then B = A. ¯ Indeed, it suﬃces (c) If A is any closable operator and B is a closed operator such that A ⊂ B ⊂ A, ¯ and ¯ Then there exists a sequence xn ∈ D(A) such that xn → x, Axn → Ax to show that A¯ ⊂ B. So let x ∈ D(A). ¯ and since B is closed, it follows that x ∈ D(B), Bx = Ax. ¯ since A ⊂ B, we have xn ∈ D(B), xn → x, Bxn = Axn → Ax ¯ then A¯ ⊂ B. ¯ This This proves the claim. A related statement is that if B is any closed operator such that A ⊂ B, ¯ implies Gr(A) ⊂ Gr(B) ¯ ¯ ¯ follows from the implications A ⊂ B and hence Gr(A) ⊂ Gr(∗B). Another related remark is that if A is any closable operator then A∗∗ is a closed extension of A and hence by the above A¯ ⊂ A∗∗ . For suppose ¯ y = Ax ¯ and we have for z ∈ D(A∗ ), xn ∈ D(A), xn → x, Axn → y. Then, x ∈ D(A), ∗

∗

< y, z >= lim < Axn , z >= lim < xn , A∗ z >=< x, A∗ z > Hence, x ∈ D(A∗∗ ) and y = A∗∗ x. This proves that A¯ ⊂ A∗∗ . Now we come to the last statement. Let S ⊂ S ∗ and R(S ± i) = H. Then, we have to show that S ∗ = S. Indeed, let x ∈ D(S ∗ ). Then there exists a y ∈ D(S) such that (S ∗ − i)x = (S − i)y since R(S − i) = H. Thus for any z ∈ D(S) ⊂ D(S ∗ ), it follows that < (S ∗ − i)x, z >=< x, (S + i)z >=< (S − i)y, z >=< y, (S ∗ + i)z >=< y, (S + i)z > and hence, ie and therefore, This proves that and hence

< x − y, (S + i)z >= 0, z ∈ D(S) x−y ⊥H x = y ∈ D(S) D(S ∗ ) ⊂ D(S) ⊂ D(S ∗ ) D(S) = D(S ∗ )

Quantum Mechanics for Scientists and Engineers 91

94

so that

S = S∗

The proof is complete. A.3.Stochastic processes Here we discuss various kinds of noise processes that arise in perturbed quantum systems and explain how to compute transition probabilities of quantum systems in the presence of such noise processes. [1] Brownian motion [2] Poisson process [3] Compound Poisson processes [4] Levy processes [5] Reﬂected and absorbed Brownian motion Let B(t), t ≥ 0 be a standard Brownian motion starting at any given point and let T0 = inf (t ≥ 0 : B(t) = 0) Obviously, the statistics of T0 will depend on B(0). Absorbed Brownian motion is the Brownian motion process upto time T0 and after time T0 it is set equal to zero. We denote this process by X(t), t ≥ 0. We can show that X is a Markov process by the following simple intuitive reasoning: If at time t0 the process X(t0 ) = 0, then obviously X(t) = 0 for all t > t0 . On the other hand, if at time t0 , X(t0 ) = x > 0, then obviously B(t0 ) = x and by the Markovian property of B(.), the statistics of X(t) for all t > t0 will depend only on (t0 , x). It remains to compute the transition probability of X(.). For x, y > 0, we have P (X(t) > x|X(0) = y) = Py (B(t) > x, T0 > t) = Py (B(t) > x, mins≤t B(s) > 0) = P0 (B(t) > x − y, mins≤t B(s) > −y)

= P0 (−B(t) > x − y, mins≤t (−B(s)) > −y)

= P0 (B(t) < y − x, −maxs≤t (B(s)) > −y) = P0 (B(t) < y − x, St < y) = P0 (B(t) < y − x) − P0 (B(t) < y − x, St > y)

where

St = maxs≤t B(s)

By the reﬂection principle, Hence,

P0 (B(t) < y − x, St > y) = P0 (B(t) > y + x) P (X(t) > x|X(0) = y) = P0 (B(t) < y − x) − P0 (B(t) > y + x)

and hence the transition density of X(t) (taking values in [0, ∞)) is given by qt (x|y) = − Also

d P (X(t) > x|X(0) = y) = (2πt)−1/2 (exp(−(x − y)2 /2t) − exp(−(x + y)2 /2t)), x, y > 0 dx P (X(t) = 0|X(0) = y) = Py (T0 < t) = Py (mins≤t B(s) ≤ 0) = P0 (mins≤t B(s) ≤ −y)

= P0 (St > y) = 2P0 (B(t) > y) the last step following from the reﬂection principle. We thus verify that ∞ qt (x|y)dx + 2P (B(t) > y) P (X(t) ≥ 0|X(0) = y) = 0

= P (B(t) > −y) − P (B(t) > y) + 2P (B(t) > y) = P (B(t) > −y) + P (B(t) > y) = P (B(t) < y) + P (B(t) > y) = 1

[6] Bessel processes [7] The Brownian local time process

Quantum Mechanics for Scientists and Engineers 92

95

[1] Let θ(x) be the unit step function. Let B(t) be Brownian motion and L(t) its local time at zero, ie, dL(t) = δ(B(t))dt Let

τ (s) = inf {t > 0 : L(t) ≥ s}

then, clearly τ (s) is a stopping time for each s. By Ito’s formula,

1 d(B(t)θ(B(t)) = θ(B(t))dB(t) + δ(B(t))dt + B(t)δ � (B(t))dt 2 Now, Hence, and hence

xδ � (x) = (xδ(x))� − δ(x) = −δ(x) 1 θ(B(t))dB(t) = d(B(t)θ(B(t)) − dL(t) 2

t

0

1 θ(B(s))dB(s) = B(t)θ(B(t)) − L(t) 2

Therefore applying Doob’s optional stopping theorem to the exponential martingale t t θ(B(s))dB(s) − (a2 /2) θ(B(s))ds) M (t) = exp(−a 0

0

and the stopping time τ (t) gives 1 = E[M (τ (t))] = exp(at − (a2 /2)

τ (t)

θ(B(s))ds)

0

(Note that B(τ (t)) = 0, L(τ (t)) = t). Thus, E[exp(−s

τ (t)

√ θ(B(s))ds)] = exp(− 2st)

0

(Reference: Marc Yor, Some aspects of Brownian motion, Birkhauser). [8] Stochastic integration with respect to continuous semi-Martingales. Let Xt be continuous semimartingale. By the Doob-Meyer theorem, it can be decomposed as Xt = At + Mt where At is a process of bounded variation and Mt is a martingale. We can write − At = A+ t − At − where A+ t and At are increasing processes. The integral of a bounded adapted process Yt w.r.t At is deﬁned as an ordinary Riemann-Stieltjes integral or equivalently as a Lebesgue integral. Since Mt is not a process of bounded variation but has a well deﬁned ﬁnite quadratic variation, its integral must be deﬁned in the Ito sense. Speciﬁcally, deﬁning T n Yt dMt as the mean square limit of partial sums of the form Sn = k=1 Ytn,k (Mtn,k+1 −Mtn,k ) as maxk |tn,k+1 −tn,k | → 0 0. We have for n > m, E(Sn − Sm )2 = E(Yt2n,k )E(Mtn,k+1 − Mtn,k )2

+

k

k

E(Yt2m,k )(E(Mtm,k+1 − Mtm,k )2 ) −2E(Sn Sm )

Suppose we assume that the partition {tn,k } is a reﬁnement of the partition {tm,k }. Consider for example a term like 3 Ys1 ((Ms2 − Ms1 ) in Sm and a term like k=1 Ytk (Mtk+1 − Mtk ) in Sn where s1 = t1 < t2 < t3 < t4 = s2 . The expected value of their product is an example of a term in E(Sn Sm ). This term can be expressed as E(Yt21 (Mt2 − Mt1 )2 ) + E(Yt1 Yt2 (Mt3 − Mt2 )2 )

Quantum Mechanics for Scientists and Engineers 93

96

+E(Yt1 Yt3 (Mt4 − Mt3 )2 )

From this observation, it is clear that if the quantity T E Yt2 d < M >t < ∞ 0

then,

E((Sn − Sm )2 ) → 0, n, m → ∞

and hence by the fact that L2 (Ω, F, P ) is a Hilbert space, it follows that there exits a random variable S∞ which we T denote as 0 Yt dMt and call it the stochastic integral of Y w.r.t M over the interval [0, T ]. We are assuming that t the increasing function t →< M >t = 0 (dMs )2 deﬁnes a ﬁnite measure on a bounded interval like [0, T ] and that the adapted process Yt is Riemann integrable w.r.t this measure. More precisely, the stochastic integral w.r.t a martinagle can be deﬁned for almost surely progressively measurable processes Yt that are integable w.r.t the random measure d < M >t over ﬁnite intervals. For a detailed discussion of this construction see the book by Karatzas and Shreve on ”Brownian motion and stochastic calculus” or the book by Revuz and Yor on ”Continuous martingales and Brownian motion. Remark: Consider the quantum system dU (t) = [−(iH(t)dt +

p p 1 Vk (t)Vm (t)d < Mk , Mm > (t)) − i Vk (t)dMk (t)]U (t) 2 k,m=1

k=1

where Mk (t), k = 1, 2, ..., p are Martingales. It is easily veriﬁed using Ito’s formula that U (t) is unitary for all t if U (0) = I. We introduce a perturbation parameter � into the martingale terms and obtain dU (t) = [−(iH(t)dt + �2

p p 1 Vk (t)Vm (t)d < Mk , Mm > (t)) − i� Vk (t)dMk (t)]U (t) 2 k,m=1

k=1

We solve for U (t) using perturbation theory: U (t) =

�m Um (t)

m≥0

Equating same powers of � gives successively

dU0 (t) = −iH(t)U0 (t)dt, Vk (t)U0 (t)dMk (t), dU1 (t) = −iH(t)U1 (t) − i k

1 Vk (t)Vm (t)U0 (t)d < Mk , Mm > (t) − i Vk (t)U1 (t)dMk (t) dU2 (t) = −iH(t)U2 (t) − 2 k,m

k

and in general,

dUr (t) = −iH(t)Ur (t) −

1 Vk (t)Vm (t)Ur−2 (t)d < Mk , Mm > (t) − i Vk (t)Ur−1 (t)dMk (t), r ≥ 2 2 k,m

k

After calculating U0 (t) +

N

�r Ur (t)

r=1

we calculate the transition probablity from an initial state |i > to a ﬁnal state |f > in time T : E[| < f |U0 (T ) +

N r=1

�r Ur (T )|i > |2 ]

where the average is taken over the probability distribution of the martingale processes over [0, T ]. Examples of Martingales and the Ito formula for them (a) t M (t) = f (s, ω)dB(s, ω) + g(s, x, ω)(N (ds, dx, ω) − λ(s)dF (x)ds) 0

s≤t,x∈E

Quantum Mechanics for Scientists and Engineers 94

97

where f, g are progressively measurable functions with B being Brownian motion and N (t, ., ω) a Poisson ﬁeld with EN (ds, E) = λ(s)dsF (E). Ito’s formula for this martingale is g(t, x)2 N (dt, dx) (dM (t))2 = d < M > (t) = f (t)2 dt + x∈E

A.4.Syllabus for a short course on Linear algebra and its application to classical and quantum signal processing. [1] Vector space over a ﬁeld, linear transformations on a vector space. [2] Finite dimensional vector spaces, basis for a vector space, matrix of a linear transformation relative to a basis, Similarity transformation of the matrix of a linear transformation under a basis change. [3] Examples of ﬁnite and inﬁnite dimensional vector spaces. [4] range, nullspace, rank and nullity of a linear transformation. [5] Subspace, direct sum decompositions of vector spaces, projection operators. [6] Linear estimation theory in the language of orthogonal projection operators. [7] Statistics of the estimation error of a vector under a small random perturbation of the data matrix, statistics of the perturbation of the orthogonal projection operator under a small random perturbation of the data matrix. [8] Primary decomposition theorem, Jordan decomposition theorem, functions of matrices. [9] Cauchy’s residue theorem in complex analysis and its approach to the computation of functions of a matrix. [10] Norms on a vector space, norms on the space of matrices, Frobenius norm, spectral norm. [11] Notions of convergence in a vector space, calculating the exponential function and inverse of a perturbed matrix using a power series. [12] Recursive least squares lattice algorithms for time and order updates of prediction error ﬁlters based on appending rows and columns to matrices and computing functions of the appended matrices, RLS lattice for second order Volterra systems, Statistical properties of the prediction ﬁlter coeﬃcients under the addition of a small noise process to the signal process: A statistical perturbation theory based approach. [13] The MUSIC and ESPRIT algorithms for direction of arrival estimation based on properties of signal and noise eigensubspaces. [14] Computing the solution to time varying linear state variable systems using the Dyson series. Convergence of the Dyson series. [15] Computing the approximate solution to nonlinear state varable systems using Dyson series applied to the linearized system. [16] Dyson series in quantum mechanics. [17] Computing transition probabilities for quantum systems with random time varying potentials using the Dyson series. [18] Approximate solution to stochastic diﬀerential equations driven by Brownian motion and Poisson ﬁelds using linearization combined with Dyson series. Mean and variance propagation equations based on linearization. [19] The spectral theorem for ﬁnite dimensional normal operators and inﬁnite dimensional unbounded self-adjoint operators in a Hilbert space. [20] Properties of spectral families in ﬁnite and inﬁnite dimensional Hilbert spaces. [21] The general theory of estimating parameters in linear models for Gaussian and non-Gaussian noise. [22] The quantum stochastic calculus of Hudson and Parthasarathy and its application to the modeling of a quantum system coupled to a photon bath. [23] Kushner nonlinear ﬁlter and its linearized EKF version. [24] The Belavkin quantum ﬁlter based on non-demolition measurements and its application to quantum control. Application to quantum control: The Belavkin equation can be expressed as dρt = Lt (ρt )dt + Mt (ρt )dWt where Wt is a classical Wiener process arising from the innovations process of the measurement dWt = dYt − πi (St + St∗ )dt = dYt − T r(ρt (St + St∗ ))dt Note that ρt can be viewed as a classical random process with values in the space of signal space density matrices. St is a system space linear operator. The Belavkin equation is a commutative equation since all the terms appearing in it like ρt , Lt (ρt ), Wt etc. are signal space operator valued functionals of the commutative noise processs {Yy }. Now let Uc (t) be the control unitary satisfying the sde dUc (t) = (−(iH1 (t) + Q1 (t))dt − iK(t)dYt )Uc (t)

Quantum Mechanics for Scientists and Engineers 95

98

We have

Y (t) = U (t)∗ Yi (t)U (t) = U (T )∗ Yi (t)U (T ), T ≥ t

Here Yi (t) is an operator on the Boson Fock space and is thus independent of the system Hilbert space operators. We have taking Yi (t) = A(t) + A(t)∗ , dY (t) = dYi (t) + jt (Zt )dt where Zt is a system space operator and jt (Z) = U (t)∗ ZU (t). Thus, dUc (t) = (−(iH1 (t) + Q1 (t))dt − iK(t)(jt (Zt )dt + dYi (t)))Uc (t) We have

d(Uc∗ Uc ) = dUc∗ Uc + Uc∗ dUc + dUc∗ dUc = Uc (t)∗ (−(Q∗1 + Q1 )dt + idYt K(t) − iK(t)dYt + dY (t)K(t)2 dY (t))Uc (t)

If K(t) commutes with dY (t) and Q1 + Q∗1 = K(t)2 , then we would get d(Uc∗ Uc ) = 0 and Uc will be a control unitary operator. Taking K(t) = jt (Pt ) where Pt is a system operator, we have K(t)dY (t) = jt (Pt )(jt (Zt )dt + dYi (t)) = jt (Pt Zt )dt + jt (Pt )dYi (t) and

dY (t)K(t) = jt (Zt Pt )dt + jt (Pt )dYi (t)

So for Uc (t) to be unitary, we require that [Zt , Pt ] = 0 for all t. Note that Zt , Pt are system Hilbert space operators. We can now deﬁne ρc (t) = Uc (t)ρ(t)Uc (t)∗ Now,

dρc (t) = dUc (t)ρ(t)Uc (t)∗ + Uc (t)dρ(t).Uc (t)∗ + Uc (t)dρ(t)dUc (t)∗ + dUc (t)dρ(t)Uc (t)∗ +Uc (t)dρ(t).dUc (t)∗ + dUc (t)ρ(t)dUc (t)∗

[25] Linear algebra applied to the study of the linearized Einstein ﬁeld equations in the presence of matter and radiation for the study of galactic evolution as the propagation of small non-uniformities in matter and radiation propagating in an expanding universe. (0) gμν (x) is the background Robertson Walker (RW) metric. Its perturbation is (0) + δgμν gμν = gμν

The coordinate system can be chosen so that Then, the linearized Ricci tensor is

δg0μ = 0 α δRμν = δΓα μα,ν − δΓμν,α β(0)

β α(0) −(δΓα μν )Γαβ − Γμν δΓαβ α(0)

β(0)

+Γμβ δΓβνα + (δΓα μβ )Γαβ This expression can be expressed as

α δRμν = (δΓα μα ):ν − (δΓμν ):α

where the covariant derivatives are computed using the unperturbed RW metric. The energy momentum tensor of the matter ﬁeld is Tμν = (ρ + p)vμ vν − pgμν and that of the radiation ﬁeld is

Sμν =

1 Fαβ F αβ gμν − Fμα Fνα 4

For example, computing in a ﬂat space-time S00 =

1 Fαβ F αβ − F0α F0α 4

Quantum Mechanics for Scientists and Engineers 96

Now

99

Fαβ F αβ = −2F0r F0r + Frs Frs = −2|E|2 + 2|B|2 F0α F0α = −F0r F0r = −|E|2

so S00 =

1 1 (−|E|2 + |B|2 ) + |E|2 = (|E|2 + |B|2 ) 2 2

which is the correct expression for the energy density of the electromagnetic ﬁeld. Likewise, S 0r = S r0 deﬁnes the energy ﬂux as well as the momentum density and S rs = S sr deﬁnes the momentum ﬂux. We can write the expression for δRμν in the general form δRμν = C1 (μναβρ, x)δgαβ,ρ (x) + C2 (μναβρσ, x)δgβρ,σ (x) (0)

where C1 , C2 are functions of x determined completely from the background gravitational ﬁeld gμν (x) The perturbation to the energy momentum tensor of matter is given by δTμν = (δρ + δp)Vμ(0) Vν(0) + (ρ(0) + p(0) )(Vμ(0) δvμ + Vν(0) δvμ ) (0) −δpgμν + p(0) δgμν

We note that

Vμ(0) = (1, 0, 0, 0),

and p(0) , ρ(0) are functions of time only. Further, (0)

(0)

(0)

(0)

g00 = 1, g11 = −S 2 (t)f (r), g22 = −S 2 (t)r2 , g33 = −S 2 (t)r2 sin2 (θ), 1 1 − kr2 with k = 1 for a spherical universe, k = 0 for a ﬂat universe and k = −1 for a hyperbolic universe. We now consider a ﬂat unperturbed universe for which the metric has the form f (r) =

dτ 2 = dt2 − S 2 (t)(dx2 + dy 2 + dz 2 ) Thus,

g00 = 1, grr = −S 2 (t), r = 1, 2, 3

The Ricci tensor components are:

α α β R00 = Γα 0α,o − Γ00,α − Γ00 Γαβ β +Γα 0β Γ0α

Now, r Γα 0α = Γ0r =

So

1 rr g grr,0 = S /S, r = 1, 2, 3 2 Γα 0α,0 = (S /S)

Γα 00,α = 0 β Γα 00 Γαβ = 0

r

So

β Γα 0β Γ0α = (Γr0r )2 = (g rr grr,0 /2)2 = 3S 2 /S 2 r

R00 = (S /S) + 3S 2 /S 2 = S /S + 2S 2 /S 2 β α α β α Rkk = Γα kα,k − Γkk,α − Γkk Γαβ + Γkβ Γkα Γ0kk,0 − Γ0kk Γr0r + 2Γ0kk Γkk0 r

Quantum Mechanics for Scientists and Engineers 97

100

(No summation over k)

= −gkk,00 /2 + (gkk,0 /2)(S /S) + 2(−gkk,0 /2)(gkk,0 /2gkk )

We have in fact

= S /2 − S 2 /2S + S (S /2S) = S /2 Rkm = (S /2)δkm

Now let us study the perturbed Einstein ﬁeld equations w.r.t. the above ﬂat space-time metric. First note that we can choose our coordinate system so that δg0μ = 0 and hence δg 0μ = 0. Raising and lowering of indices are carried out w.r.t. the above ﬂat space time metric which is diagonal. The unperturbed space-time is comoving, ie, v μ(0) = (1, 0, 0, 0) deﬁne geodesics in the unperturbed space-time. The unperturbed pressure and density p(0) (t), ρ(0) (t) are functions of t only. The unperturbed energy momentum tensor of matter is T μν(0) = (ρ(0) + p(0) )v μ(0) v ν(0) − p(0) g μν(0) so that

T 00(0) = ρ(0) , T kk(0) = p(0) /S 2

with the other components of the energy momentum tensor being zero. The unperturbed Einstein ﬁeld equations 1 (0) Rμν = K.(Tμν(0) − T (0) gμν(0) ), K = −8πG 2 thus give after noting that

(0) μν(0) T (0) = gμν T =

(0)

and hence the unperturbed ﬁeld equations are

ρ(0) − 3p(0) ,

Tkk = −S 2 T kk(0) = −p(0)

1 S /S + 2S 2 /S 2 = K.(ρ(0) − (ρ(0) − 3p(0) ), 2

S /2 = K(−p(0) +

S 2 (0) (ρ − 3p(0) )) 2

These two equations along with an equation of state: p(0) = f (ρ(0) ) determing the three functions of time S(t), ρ(0) (t), p(0) (t). The perturbed equations are δRμν = KδTμν where

Now,

δTμν = (ρ(0) + p(0) )(vμ(0) δvν + vν(0) δvμ ) + (δρ + δp)vμ(0) vν(0) (0) −δpgμν − p(0) δgμν α(0)

β α δRkm = δΓα kα,m − δΓkm,α − Γkm δΓαβ β(0)

α(0)

β −Γαβ δΓα km + Γkβ δΓmα α +Γβ(0) mα δΓkβ

The perturbed ﬁeld equations taking into account contributions from the electromagneti ﬁeld are expressible in the form C1 (μναβγ, x)δgαβ,γ (x) + C2 (μναβγσ, x)δgαβ,γσ (x) = C3 (μνα, x)δvα (x) + C4 (μν, x)δρ(x) + C5 (μν, x)δp(x) +C6 (μναβ, x)δAα,β (x) = 0

Quantum Mechanics for Scientists and Engineers 98

101

The equations implied by the Bianchi identity are (T μν + S μν ):ν = 0 This is the same as

(T μν + S μν ),ν + Γμαν (T αν + S αν ) + Γναν (T μα + S μα ) = 0

We calculate is ﬁrst order perturbed version: αν + δS αν )+ (δT μν + δS μν ),ν + Γμ(0) αν (δT μα Γν(0) + δS μα )+ αν (δT

δΓμαν (T αν(0) + S αν(0) )+ δΓναν (T μα(0) + δS μα(0) ) = 0 This can be put in the form C7 (μαβ, x)δvα,β (x) + C8 (μα, x)δvα (x) + C9 (μαβγ, x)δAα,βγ (x) +C10 (μαβγ, x)δgαβ,γ (x) + C11 (μ, x)δρ(x) + C12 (μ, x)δp(x) = 0 We note that this last equation contains 3 equations for the velocity components δvr , r = 1, 2, 3 and one equation for δρ(x). δp(x) is determined from δρ(x) using the equation of state. Also δv0 is determined from δvr using 0 = δ(gμν v μ v ν ) (0) μ(0) = (δgμν )v μ(0) v ν(0) + gμν (v δv ν + v ν(0) δv μ

using that the unperturbed dynamics is comoving,ie, v μ(0) = (1, 0, 0, 0) we get from this equation δg00 + 2δv 0 = 0 so

1 δv 0 = − δg00 2

and hence,

δv0 = δ(g0μ v μ ) = δg00 (0)

since v r(0) = 0 and g0r = 0. [26] Perturbation theory applied to electromagnetic problems. Here, we discuss the rudiments of the theory of time independent perturbation theory in quantum mechanics and explain how the same techniques can be used to solve waveguide and cavity resonator problems having almost aribtrary cross sections by transforming the boundary into a simpler boundary using the theory of analytic functions of a complex variable. Let D be a ﬂat connected region parallel to the xy plane. D represents the cross section of a waveguide or a cavity resonator. The boundary ∂D is a closed curve that represents the boundary of the guide or resonator. The z direction is orthogonal to the D plane and for the guide, it represents the direction along which the em waves propagate. For the resonator case, we assume that 0 ≤ z ≤ d with the surfaces z = 0 and z = d being perfectly conducting surfaces as is the case with the side walls. We choose a system (q1 , q2 , z) of coordinates so that (q1 , q2 ) are functions of (x, y) alone. Being more speciﬁc, we assume that w = q1 (x, y) + jq2 (x, y) = f (z) = f (x + jy) with an inverse

z = x + jy = g(w) = g(q1 + jq2 )

and assume that f (z) is an analytic function of the complex variable z and its inverse g(w) is also an analytic function of the complex variable w. We can regard (z, z¯) as independent variables just as (x, y) are. The relation between the two pairs of variables is z = x + jy, z¯ = x − jy, x = (z + z¯)/2, y = (z − z¯)/2j We have

∂ ∂ ∂ = z,x + z¯,x ∂x ∂z ∂ z¯

Quantum Mechanics for Scientists and Engineers 99

102

= and likewise,

∂ ∂ + ∂z ∂ z¯

∂ ∂ ∂ = j( − ) ∂y ∂z ∂ z¯

Thus, ∇2⊥ = =(

∂2 ∂2 + 2 2 ∂x ∂y

∂ 2 ∂ 2 ∂ ∂ + ) −( − ) ∂z ∂ z¯ ∂z ∂ z¯ ∂2 =4 ∂z∂ z¯

Now,

∂ ∂ = w� (z) ∂z ∂w ∂ ∂ =w ¯ � (z) ∂ z¯ ∂w ¯ also, we clearly have since w� (z) is an analytic function of z and w ¯ � (z) is an analytic function of z¯ and hence that ¯ � (z) is an analytic function of w ¯ that 1/w� (z) = dz/dw is an analytic function of w and 1/w ∇2⊥ = 4

∂2 ∂2 = 4|z � (w)|−2 ∂z∂ z¯ ∂w∂ w ¯

= |z � (w)|−2 (

∂2 ∂q 2 + 2 ∂q12 )

Thus, the eigenvalue problem (∇2⊥ + h2 )ψ(x, y) = 0 with the Dirichlet boundary condition ψ = 0 on ∂D becomes (∇2⊥,q + h2 F (q1 , q2 ))ψ(q1 , q2 ) = 0 with

ψ(a, q2 ) = 0

where the boundary ∂D is the same as q1 = a. For example, if we take w = log(z) = log(ρ) + jφ, then q1 = a is the circle q1 = ρ = ea which is a circle and (q1 , q2 ) = |z � (w)|2 = |g � (q1 + jq2 )|2 Also, we have deﬁned ∇2⊥,q =

∂2 ∂2 + 2 =L 2 ∂q1 ∂q2

say. Solution by perturbation theory: Let h2 = −λ. Then, we have to solve (L − λ(1 + �G(q1 , q2 )))ψ(q1 , q2 ) = 0 where we are assuming that

F (q) = 1 + �G(q)

so that the boundary is a small perturbation of the rectangular boundary. [27] Perturbation theory applied to general nonlinear partial diﬀerential equations with noisy terms. The gravitational wave equations, ﬂuid dynamical equations, Klein-Gordon and Dirac equations are special cases of this. The ﬁeld φ : Rn → Rp satisﬁes a nonlinear pde n

bk (x)φl,k (x) +

k=1

p

akm (x)φl,km (x) + δ.Fl (φl (x), φl,k (x), φl,km (x), x)

k,m=1

+δ.

m

Glm (φl (x), φl,k (x), φl,km (x), x)wm (x)

Quantum Mechanics for Scientists and Engineers 100

103

where wm (x) are Gaussian noise processes. [28] The Knill-Laﬂamme theorem for quantum error correcting codes: Explicit construction of the recovery operators for a noisy quantum channel in terms of the code subspace and the noise subspace. [29] Post-Newtonian equations of hydrodynamics. The perturbations are carried out in powers of the velocity. The mass parameter is of the order of the square of the velocity (v 2 = GM/r for the orbital velocity) and the following expansions are valid ρ = ρ2 + ρ4 + ..., p = p4 + p6 + ..., v r = v1r + v3r + ..., v 0 = 1 + v20 + v40 + ... g00 = 1 + g00(2) + g00(4) + ..., g0r = g0r(3) + g(0r(5) + ... grs = −δrs + grs(2) + grs(4) + ... g 00 = 1 + g200 + g400 + ... g 0r = g30r + g50r + ..., g rs = −δ rs + g2rs + g4rs + ... The equation can be expressed as so the O(1) equation is

T μν = (ρ + p)v μ v ν − pg μν gμν v μ v ν = 1

g00 v 02 + 2g0r v 0 v r + grs v r v s = 1 v00 = 1

The O(v 2 ) equation is v20 + g00(2) − or equivalently,

v20 = −g00(2) + We have

v1r2 = 0

r

v1r2

r

T 00 = T200 + T400 + ... T 0r = T30r + T50r + ... T rs = T2rs + T4rs + ...

where

T200 = ρ2 , T400 = 2ρ2 v20 , T600 = ρ2 v202 + 2ρ4 v20 + 2p4 v20 − p4 g200 T30r = 2ρ2 v20 v1r , T50r = 2ρ2 v20 v1r + 2ρ2 v3r + 2ρ4 v1r + 2p4 v1r , T2rs =

[30] Lab problems on linear algebra based signal processing: [1] If X is an m × n matrix, then calculate δPX upto O(� δX) �) where PX is the orthogonal projection onto R(X) and X gets perturbed to X + δX where δX is a small random perturbation of X. Calculate using this formula, the second order statistics of δPX , ie, E(δPX ⊗ δPX ) in terms of E(δX ⊗ δX). Calculate δPX upto O(||δX||2 ) [2] Take an n × n matrix A. Add a row and a column to this matrix at the end and express the inverse of this matrix B in terms of the inverse of A. Assume now that A gets perturbed to A + δA and correspondingly, the appended row and column get perturbed by small amounts. Then calculate the inverse of the perturbed appended matrix in terms of A−1 and the appended rows and columns and their perturbations upto linear orders in the perturbations.

Quantum Mechanics for Scientists and Engineers 101

104

[3] Generate some functionals of the Brownian motion process B(t) like M (t) = max(B(s) : s ≤ t), m(t) = min(B(s) : s ≤ t), Ta = min(t > 0 : B(t) = a), |B(t)| (reﬂected Brownian motion), Absorbed Brownian motion (B(min(t, T0 ) : t ≥ 0 where B(0) = a > 0 and T0 = min(t > 0 : B(t) = 0, Local time process La (t) of the Brownian motion process at the t t level a. This is deﬁned as La (t) = 0 δ(B(s) − a)ds and is approximated by (2�)−1 0 χ[a−�,a+�] (B(s))ds where � is a very small positive number. For a bivariate standard Brownian motion process (B1 (t), B2 (t)), t ≥ 0, simulate the area t process A(t) = 0 B1 (s)dB2 (s) − B2 (s)dB1 (s) and calculate its statistics. Simulate the Bessel process of order d, ie d X(t) = ( k=1 Bk (t)2 )1/2 , t ≥ 0 and verify by numerical simulations that it satisﬁes its standard stochastic diﬀerential equation. [4] Veriﬁcation of the Knill-Laﬂamme theorem for quantum error correcting codes. Generate a set of r < n linearly independent column vectors {f1 , ..., fr } in H = Cn . Denote the subspace spanned by these r vectors by C. Calculate the orthogonal projection P onto C using the standard formula P = A(A∗ A)−1 A∗ where A = [f1 , ..., fr ] = Cn×r . Generate K n × n matrices having the block structure C1k C2k Nk = , k = 1, 2, ..., K C3k C4k where C1k is an r × r matrix for each k such that for 1 ≤ k, j ≤ K, we have ∗ ∗ C1j C1k + C3j C3k = λjk Ir ∗ ∗ for some complex numbers λjk . This can be achieved by choosing the r × n matrices [C1j |C3j ], 1 ≤ j ≤ K as nonoverlapping rows of an n × n unitary matrix U and then multiply the resultant matrices by complex constants. Thus, we must have K ≤ [n/r]. The matrices C2k , C3k , C4k can be chosen arbitrarily. Now take the orthogonal projection P onto C as Ir 0 P = 0 0

It is then easily seen that

P Nk∗ Nj P = λkj P

and hence the quantum code P can correct the noise subspace {Nk : 1 ≤ k ≤ K}. [5] (a) Study waves in a plasma inﬂuenced by a strong gravitational ﬁeld and electromagnetic ﬁelds by the method of linearization: The Boltzmann particle distribution function f (t, r, v) where v = (v r = dxr /dt)r = 13 satisﬁes (after approximating the collision term by a linear relaxation term k f,vk (t, r, v) = (f0 (r, v) − f (t, r, v))/τ (v) f,t (t, r, v) + v k f,xk (t, r, v) + v,0

Here the velocity v k satisﬁes the geodesic equation in an electromagnetic ﬁeld: k + γγ,0 v k dxk /dτ = γv k , dv k /dτ = γd/dt(dxk /dτ ) = γd/dt(γv k ) = γ 2 v,0

where

γ = dt/dτ = (g00 + 2g0r v r + grs v r v s )−1/2

or equivalently,

k γ 2 v,0 + γγ,0 v k + γ 2 Γk00 + 2γ 2 Γk0m v m + γ 2 Γkmp v m v p = eγ(F0m + Fsm v s ) − − − (a) k v,0 + (γ,0 /γ)v k + Γk00 + 2Γk0m v m + Γkmp v m v p = eγ −1 (F0m + Fsm v s )

k is substituted into the above Boltzmann equation to get This value of v,0

f,t (t, r, v)+v k f,xk (t, r, v)−(γ,0 /γ)v k +Γk00 +2Γk0m v m +Γkmp v m v p −eγ −1 (F0m +Fsm v s ))f,vk (t, r, v)−(f0 (r, v)−f (t, r, v))/τ (v) = 0 k k We note that γ,0 is a function of v,0 , v k , x. We need to get a Boltzmann equation that does not involve v,0 . For this purpose, we go back to the equation of motion of the charged particle (a). First observe that −1/2

γ,0 = (g00 + 2g0r v r + grs v r v s ),0

r = (−γ 3 /2)(g00,0 + g00,k v k + 2g0r,0 v r + 2g0r,s v r v s + 2g0r v,0

s +grs,0 v r v s + grs,m v r v s v m + 2grs v,0 ) − − − (b)

k 3 k )k=1 which is inverted to get v,0 as a function (v k ), Substituting (b)into (a) gives us a linear algebraic equation for (v,0 xμ , the electromagnetic ﬁeld F μν (x) and of course the metric gμν (x) and its ﬁrst order partial derivatives gμν,α . Thus,

Quantum Mechanics for Scientists and Engineers 102

105

we get a well deﬁned Boltzmann equation. Now given the particle distribution function f (t, r, v), we need to calculate the energy momentum tensor of matter. We have T μν = (ρ + p)V μ V ν − pg μν where ρ=m Ur = V 0 is calculated using

v r f d3 v/

f d3 v,

f d3 v, V r = γ(U )U r

gμν V μ V ν = 1

Equivalently,

γ(U )2 g00 + 2γ(U )2 U r g0r + γ(U )2 grs U r U s = 1

or

γ(U ) = (g00 + 2g0r U r + grs U r U s )−1/2 , V 0 = γ(U )

The average internal kinetic energy per particle is K(t, r) = (m/2)

( (v r − U r )2 )f (t, r, v)d3 v r

and the pressure ﬁeld is given by p(t, r) = nm < |v − U |2 > /3 = (m/3)

(v r − U r )2 f (t, r, v)d3 v r

where m is the mass of a plasma particle and n is the number of plasma particles per unit volume. This energy momentum tensor of the plasma can be added to the energy momentum tensor of the electromagnetic ﬁeld and substituted into the right side of the Einstein ﬁeld equations. Thus, we get a couple system of pde’s for f, gμν , Aμ . An alternate way to deﬁne the energy momentum tensor is as T μν = m f (t, r, v)γ(t, r, v)v μ v ν d3 v − p(t, r)g μν (t, r) where

v μ = dxμ /dt

so that

v 0 = 1, v r = dxr /dt

and γ = γ(t, r, v) is deﬁned by the equation γ = (g00 + 2g0k v k + gkm v k v m )−1/2 The pressure p is as deﬁned earlier. [31] Quantum image processing: The image ﬁeld is obtained by passing a quantum em ﬁeld through a spatio temporal ﬁlter having impulse response h(t, τ, r, r� ). Assume the Coulomb gauge with zero charge density. Then the scalar potential A0 = 0 and the vector potential is given by Ar = [(2|K|)−1/2 a(K, σ)er (K, σ)exp(−i(|K|t − K.r)) + (2|K|)−1/2 a∗ (K, σ)¯ er (K, σ)exp(i(|K|t − K.r))]d3 K The Coulomb gauge condition divA = Ar,r = 0 implies K r er (K, σ) = 0 or equivalently, (K, e(K, σ)) = 0 which means that there are only two degrees of polarization which are indexed by σ = 1, 2. The electric ﬁeld is er (K, σ)exp(i(|K|t − K.r))]d3 K Er = −Ar,0 = i [(|K|/2)1/2 a(K, σ)er (K, σ)exp(−i(|K|t − K.r)) − (|K|/2)1/2 a∗ (K, σ)¯

Quantum Mechanics for Scientists and Engineers 103

106

or equivalently in three vector notation, e(K, σ)exp(i(|K|t − K.r))]d3 K E = i [|K|/2)1/2 a(K, σ)e(K, σ).exp(−i(|K|t − K.r)) − (|K|/2)1/2 a∗ (K, σ)¯ The magnetic ﬁeld is given by B = curlA = i [(2|K|)−1/2 a(K, σ)K×e(K, σ)exp(−i(|K|t−K.r))−(2|K|)−1/2 a∗ (K, σ)K×e(K, σ)exp(i(|K|t−K.r))]d3 K The em ﬁeld of the image can be regarded as the output of a spatio-temporal ﬁlter with this free em ﬁeld as input. Thus, the output ﬁeld is E o (t, r) = h(t, τ, r, r� )E(τ, r� )dτ d3 r� , B o (t, r) = h(t, τ, r, r� )B(τ, r� )dτ d3 r� and hence the output ﬁeld can be expressed as ¯ E (t, r, K, σ)a∗ (K, σ))d3 K E o (t, r) = (HE (t, r, K, σ)a(K, σ) + H B o (t, r) =

¯ B (t, r, K, σ)a∗ (K, σ)]d3 K [HB (t, r, K, σ)a(K, σ) + H

where HE , HB are 3 × 1 functions constructed from the image impulse response h(t, τ, r, r� ). The energy of the em ﬁeld coming from the image is HF = (|E o |2 + |B o |2 )d3 r/2 and this can be expressed in the form after discretization of the spatial frequencies HF (θ) = (1/2)

p

¯ 2 (k, m, θ)a∗ a∗ ) (Q1 (k, m, θ)a∗k am + Q2 (k, m, θ)ak am + Q k m

k,m=1

where

¯ 1 (k, m, θ) = Q1 (m, k, θ) Q

Here θ is a parameter vector upon which the image impulse response h(t, τ, r, r� ) depends. θ is the parameter which contains all information about the image ﬁeld and is to be estimated by exciting an atom with the output image ﬁeld and and taking measurements on the state of the atom at diﬀerent times. We can simplify the form of the image ﬁeld Hamiltonian as p HF (θ) = Q(k, m, θ)a∗k am k,m=1

Here,

[ak , a∗m ] = δkm , [ak , am ] = 0, [a∗k , a∗m ] = 0

The Hamiltonian of the atom assumed to be an N state system, is an N × N Hermitian matrix HA and the interaction Hamiltonian between the image ﬁeld and the atom has the form Hint (t) =

p

k=1

(Fk (t, θ) ⊗ ak + Fk (t, θ)∗ ⊗ a∗k )

[32] Performance analysis of the MUSIC algorithm. X = AS + W, X ∈ CN ×K , A ∈ CN ×p , S ∈ Cp×K , W ∈ CN ×K 2 Rxx = K −1 E(XX ∗ ), Rss = K −1 E(SS ∗ ), σw I = K −1 E(W W ∗ ), E(S ⊗ W ) = E(S ⊗ W ∗ ) = 0

All signals are complex Gaussian. Thus,

E(X ⊗ X) = 0, E(S ⊗ S) = 0, E(W ⊗ W ) = 0

Quantum Mechanics for Scientists and Engineers 104

107

The stochastic perturbation in the array signal correlation matrix is given by δRxx = K −1 XX ∗ − Rxx = 2 K −1 ASS ∗ A∗ + K −1 W W ∗ + K −1 ASW ∗ + K −1 W S ∗ A∗ − ARss A∗ − σw I

= AδRss A∗ + δRww + K −1 (ASW ∗ + W S ∗ A∗ )

where

2 δRss = K −1 SS ∗ − Rss , δRww = K −1 W W ∗ − σw I

To calculate the mean and covariance of the DOA estimates, we need the mean and covariance of the statistical perturbation δRxx of Rxx . Now, E(δRss ⊗ δRss ) = K −2 E(SS ∗ ⊗ SS ∗ ) − Rss ⊗ Rss Now,

E(SS ∗ ⊗ SS ∗ ) = E[(S ⊗ S)(S ∗ ⊗ S ∗ )]

This is equivalent to calculating

E(si sj s¯k s¯m ) = E(si s¯k )E(sj s¯m ) + E(si s¯m )E(sj s¯k ) = [Rss ]ik [Rss ]jm + [Rss ]im [Rss ]jk Also since S and W are independent random matrices, it follows that δRss and δRww are independent zero mean random matrices. The second order moments of δRxx are thus computed as E(δRxx ⊗ δRxx ) = E[AδRss A∗ + δRww + K −1 (ASW ∗ + W S ∗ A∗ ))⊗2 ] = (A ⊗ A)E(δRss ⊗ δRss )(A∗ ⊗ A∗ ) +K

−2

where F denotes the ﬂip operator:

+E(δRww ⊗ δRww )

(I + F )E(ASW ∗ ⊗ W S ∗ A∗ ) F (x ⊗ y) = y ⊗ x

Computing the last expectation is equivalent to computing

E(aij sjk w ¯lk wi j s¯k j a ¯ k l ) = aij a ¯lk ) ¯k l E(sjk s¯k j )E(wi j w Thus the second order moments of δRxx are easily computed and this can be combined with matrix perturbation theory ˆ xx = Rxx + δRxx . These covariances to obtain the covariance of the signal and noise eigenvalues and eigenvectors of R can in turn be used to calculate the error covariances in the DOA estimates using the MUSIC pseudospectrum. [33] Estimating the quantum image parameters from measurements on the state of an atom excited by the quantum em ﬁeld coming from the image in the interaction representation. The image em ﬁeld interacts with an atom described by an N × N Hamiltonian matrix HA . This interaction Hamiltonian can be expressed as Hint (t|θ) =

p

k=1

(Gk (t|θ) ⊗ ak + Gk (t|θ)∗ ⊗ a∗k )

where Gk (t|θ) ∈ C , θ is the image parameter vector. The joint density of the atom and the image em ﬁeld at time t can be expressed using the Glauber-Sudarshan representation: ρ(t) = C exp(−|z|2 )A(t, z) ⊗ |e(z) >< e(z)|d2p z N ×N

where |e(z) >=

n≥0

z n a∗n |0 > /n! =

n≥0

√ z n |n > / n!

Quantum Mechanics for Scientists and Engineers 105

108

since

√ |n >= a∗n |0 > / n!

is the normalized state of the ﬁeld in which a∗k ak has the eigenvalue nk and n = (nk ). Further C = π −p We have ak |e(z) >= zk |e(z) >, a∗k |e(z) >=

∂ |e(z) > ∂zk

Thus, ak |e(z) >< e(z) >= zk |e(z) >< e(z)|, a∗k |e(z) >< e(z)| = |e(z) >< e(z)|ak =

∂ |e(z) >< e(z)|, ∂zk

∂ |e(z) >, e(z)|, ∂ z¯k

|e(z) >< e(z)|a∗k = z¯k |e(z) >< e(z)|

Note that z → |e(z) > is an analytic function of z and so z¯ →< e(z)| is an analytic function of z¯. Now, the joint density ρ(t) satiﬁes Schrodinger’s equation in the interaction picture: iρ� (t) = [Hint (t|θ), ρ(t)] and this translates to i

A,t (t, z) ⊗ |e(z) >< e(z)|exp(−|z|2 )d2p z =

( (Gk (t|θ)A(t, z) ⊗ ak |e(z) >< e(z)| + Gk (t|θ)∗ A(t, z) ⊗ a∗k |e(z) >< e(z)| k

−A(t, z)Gk (t|θ) ⊗ |e(z) >< e(z)|ak − A(t, z)Gk (t|θ)∗ ⊗ |e(z) >< e(z)|a∗k )exp(−|z|2 )d2p z) ∂ ( (Gk (t|θ)A(t, z)zk ⊗ |e(z) >< e(z)| + Gk (t|θ)∗ A(t, z) ⊗ ( |e(z) >< e(z)|) = ∂zk k

∂ |e(z) >< e(z)|) − A(t, z)Gk (t|θ)∗ z¯k ⊗ |e(z) >< e(z)|)exp(−|z|2 )d2p z) ∂ z¯k ∂ ( (zk Gk (t|θ)A(t, z) − (Gk (t|θ)∗ ( − z¯k )A(t, z) = ∂zk

−A(t, z)Gk (t|θ) ⊗ (

k

∂ +( − zk )A(t, z)Gk (t|θ) − z¯k A(t, z)Gk (t|θ)) ⊗ |e(z) >< e(z)|d2p z) ∂ z¯k Thus, we get

iA,t (t, z) = (T (t|θ)A)(t, z)

where T (t|θ) is a diﬀerential operator acting on the space of matrix valued functions of the complex variable z ∈ Cp deﬁned by ∂ T (t|θ)X(z) = (zk Gk (t|θ)X(z) − z¯k X(z)Gk (t|θ) − Gk (t|θ)∗ ( − z¯k )X(z) ∂zk k

+(

∂ − zk )X(z)Gk (t|θ)) ∂ z¯k

The formal solution to this partial diﬀerential equation is A(t, z) = τ {exp(−i

t

T (s|θ)ds)}(A(0, z))

0

where τ is the time ordering operator. The atomic (system) density at time t is given by ρA (t) = T r2 (ρ(t)) = π −p A(t, z)d2p z A(t, z) and ρA (t) are N × N matrices.

Quantum Mechanics for Scientists and Engineers 106

109

[34] Existence and uniqueness of solutions to stochastic diﬀerential equations. (a) Kolmogorov’s inequality for discrete time sub-martingales. Let Mk , k = 0, 1, 2, ... be a non-negative sub-martingale. Let τa = min(k ≥ 0 : Mk ≥ a), a > 0 Then,

{τa = m} = {M0 < a, M1 < a, ..., Mm−1 < a, Mm ≥ 0}

Thus, if Fn is the underlying ﬁltration for {Mn }, it follows that

{τa = m} ∈ Fm In particular, τa is a stop-time. Thus, by the submartinagle property, for n ≥ m, E(Mn χτa =m ) ≥ E(Mm χτa =m ) ≥ a.P (τa = m) and summing over m gives or

E(Mn ) ≥ E(Mn χτa ≤n ) ≥ a.P (τa ≤ n) P (τa ≤ n) = P (max0≤k≤n Mk ≥ a) ≤ E(Mn )/a

This can be generalized to the continuous time scenario. Speciﬁcally, if Mt , t ≥ 0 is a continuous martingale, then P (max0≤s≤t |Ms | ≥ a) ≤ E|Mt |/a since |Mt | is a submartingale by Jensen’s inequality. Using this, we can deduce a version Doob’s inequality: E(max0≤t≤T |Mt |2 ) ≤ C1 (E(MT2 ) Now consider the sde

dXt = f (t, Xt )dt + g(t, Xt )dMt , X0 = x

where f, g satisfy appropriate Lipshitz conditions which will be speciﬁed later. We wish to prove the existence and (n) uniqueness of the solution to this sde. We deﬁne the processes Xt , n = 1, 2, ... recursively as t t (n+1) =x+ f (s, Xs(n) )ds + g(s, Xs(n) )dMs Xt 0

0

These processes are all adapted to the underlying ﬁltration on which the process M is deﬁned and we have by Doob’s in T T (n+1) (n) (n) (n−1) 2 − Xt |2 ) ≤ KT E |Xs(n) − Xs(n−1) |2 ds + KC1 (E |Xt − Xt | d < M >t ) E(max0≤t≤T |Xt 0

0

If we assume that the measure < M > is absolutely continuous w.r.t. the Lebesgue measure, with bounded RadonNikodym derivative, then we derive from the above that T (n) (n) |Xt − X (n−1 )t |2 dt E(max0≤t≤T |X (n+1) − Xt |2 ) ≤ (K0 T + K1 ) 0

from which the existence of a solution to the sde can be inferred using standard arguments based on Gronwall’s inequality. [35] Statistical analysis of the RLS lattice algorithm. Let x[n] be a random process and we form the vector ξn = [x[n], x[n − 1], ..., x[0]]T ∈ Rn+1×1 z −p ξn = [x[n − p], x[n − p − 1], ..., x[−p]]T ∈ Rn+1×1

where x[k] = 0f ork < 0. Deﬁne the data matrix

Xn,p = [z −1 ξn , ..., z −p ξn ] ∈ Rn+1×p Let Pn,p = PR(Xn,p ) . The forward and backward prediction error sequences or order p at times n and n − 1 are respectively deﬁned by ⊥ ⊥ −p−1 ξn , eb [n − 1|p] = Pn,p z ξn ef [n|p] = Pn,p

Quantum Mechanics for Scientists and Engineers 107

110

We have the obvious formula based on orthogonal direct sum decompositions: ⊥ ⊥ = Pn,p − Psp{eb [n−1|p]} Pn,p+1

and hence

ef [n|p + 1] = ef [n|p] − Kf [n|p]eb [n − 1|p] − − − (1)

and likewise,

e˜b [n|p + 1] = eb [n − 1|p] − Kb [n|p]ef [n|p − − − (2)]

where

⊥ −p−1 e˜b [n|p + 1] = Psp{ξ ξn −1 ξ ,...,z −p ξ } z n ,z n n

It is easy to see that

eb [n|p + 1] = Here, the forward reﬂection coeﬃcient is

e˜b [n|p + 1] 0

Kf [n|p] =< eb [n − 1|p], ef [n|p > /Eb [n − 1|p] and the backward reﬂection coeﬃcient is Kb [n|p] =< eb [n − 1|p], ef [n|p] > /Ef [n|p] where We thus have

Ef [n|p] =� ef [n|p] �2 , Eb [n − 1|p] =� eb [n − 1|p] �2 0 ≤ Kf [n|p]Kb [n|p] = | < eb [n − 1|p], ef [n|p] > |2 /Ef [n|p]Eb [n − 1|p] ≤ 1

From (1) and (2), we get

Ef [n|p + 1] = Ef [n|p] + Kf2 [n|p]Eb [n − 1|p] − 2Kf [n|p]2 Eb [n − 1|p] = Ef [n|p] − Kf2 [n|p]Eb [n − 1|p] and likewise,

= (1 − Kf [n|p]Kb [n|p])Ef [n|p]

Eb [n|p + 1] = (1 − Kf [n|p]Kb [n|p])Eb [n − 1|p]

The time update formulas for Kf and Kp and of Ef , Ep require time update formulas for Pn,p . We have T T Pn,p = Xn,p (Xn,p Xn,p )−1 Xn,p

Let

T Rn,p = Xn,p Xn,p

Then since Xn+1,p = where we get

T ηn,p Xn,p

ηn,p = x[n], x[n − 1], ..., x[n − p + 1] T Rn+1,p = ηn,p ηn,p + Rn,p

and hence −1 −1 Rn+1,p = Rn,p −

−1 T −1 ηn,p ηn,p Rn,p Rn,p T R−1 η 1 + ηn,p n,p n,p

Thus, T T −1 ] (Rn,p − Pn+1,p = [ηn,p |Xn,p T ] .[ηn,p |Xn,p

−1 T −1 ηn,p ηn,p Rn,p Rn,p T R−1 η 1 + ηn,p n,p n,p

).

Quantum Mechanics for Scientists and Engineers 108

⎛

111

T −1 ηn,p Rn,p ηn,p

⎜ ⎝

T −1 T ηn,p Rn,p Xn,p

−1 1+etaT n,p Rn,p ηn,p −1 Xn,p Rn,p n,p T R−1 η 1+ηn,p n,p n,p

η

Pn,p −

T R−1 η 1+ηn,p n,p n,p −1 T −1 T Xn,p Rn,p ηn,p ηn,p Rn,p Xn,p −1 T 1+ηn,p Rn,p ηn,p

⎞ ⎟ ⎠

[36] Electric dipole moment and magnetic dipole moment of an atom with an electron in a constant electromagnetic ﬁeld. The unperturbed Hamiltonian of the atom is given by H0 = p2 /2m − eV (r) The perturbing Hamiltonian is −eV1 + e2 V2 where V1 = −(r, E) + (Lz + gσz )B0 /2m V2 = (B0 zˆ × r)2 /2m = B02 (x2 + y 2 )/2m

To calculate the eigenfunctions of H0 − eV1 + e2 V2 upto O(e2 ), we need to develop second order time independent perturbation theory for degenerate unperturbed systems. Consider therefore a Hamiltonian H = H0 + δH1 + δ 2 H2 (0)

with the eigenvalues of H0 being En , n = 1, 2, ... and an orthonormal basis for the eigenspace N (H0 − En ) being (0) |ψnk >, k = 1, 2, ..., dn . Let dn � (0) c(n, k)|ψnk > |ψn(0) >= k=1

be the unperturbed state of the system. We note that this state has an eigenvalue En for H0 . The constants c(n, k) are yet to be determined. We write for the perturbed state |ψn >= |ψn(0) > +δ|ψn(1) > +δ 2 |ψn(2) + O(δ 3 ) and correspondingly for the perturbed energy level En = En(0) + δEn(1) + δ 2 En(2) + O(δ 3 ) Substituting these expansions into the eigenequation and equating coeﬃcients of δ m , m = 0, 1, 2 successively gives (H0 − En(0) )|ψn(0) >= 0 − − − (1) which is already known,

(H0 − En(0) )|ψn(1) > +H1 |ψn(0) > −En(1) |ψn(0) >= 0 − − − (2)

(1) (0) (1) (1) (2) (0) (H0 − En(0) )|psi(2) n > +H1 |ψn > +H2 |ψn > −En |ψn > −En |ψn >= 0 − − − (3) (0)

From (2), we get on formijng the bracket with < ψmk | from the left, (0)

(0) (Em − En(0) ) < ψmk |ψn(1) > +

� l

(0)

(0)

< ψmk |H1 |ψnl c(n, l)

−En(1) c(n, k)δmn = 0 − − − (4) (1)

Setting m = n gives us the secular equation for the possible values of En that lift the degeneracy of the unperturbed state: � (0) (0) < ψnk |H1 |ψnl > c(n, l) = En(1) c(n, k), 1 ≤ k ≤ dn l

(1) Thus, En assumes the (0) (0) ψnk |H1 |ψnl >))1≤k,l≤dn

1 values Enj , k = 1, 2, ..., dn which are the eigenvalues of the dn × dn secular matrix ((< (1)

n with the eigenvector corresponding to the eigenvalue Enj being denoted by ((cj (n, k)))dk=1 . dn We may assume that these dn eigenvectors form an orthonormal basis for C . From (4) with m �= n, we get

(0)

< ψmk |ψn(1) >=

� < ψ (0) |H1 |ψ (0) > c(n, l) mk nl (0)

l

(0)

E n − Em

Quantum Mechanics for Scientists and Engineers 109

112

(0)

and hence the ﬁrst orde perturbation to the eigenvector |ψn

eigenvalue

(0) En

+

(1) δ.Enj

(1)

>, namely δ.|ψnj > corresponding to the perturbed

is given by (1)

|ψnj >=

(0)

mkl,m�=n

(1)

(0)

(0)

(0) |ψmk >< ψmk |H1 |ψnl > cj (n, l)/(En(0) − Em )

(1)

(1)

(1)

(0)

Turning now to (3), we assume En = Enj and |ψn >= |ψnj >. Taking the bracket of this equation with < ψmk |, we get (0)

(0)

(1)

(0)

(1)

(0)

(1)

(1)

(0)

(1)

(0) (0) (2) (Em − En(0) ) < ψmk |psi(2) n > + < ψmk |H1 |ψnj > + < ψmk |H2 |ψn > −Enj < ψmk |ψnj > −En cj (n, k)δmn = 0

For m = n, this gives (0)

(1)

< ψnk |H1 |ψnj > +

l

(0)

< ψnk |H2 |ψnl > cj (n, l) − Enj < ψnk |ψnj >

−En(2) cj (n, k) = 0 − − − (5)

(2)

Actually, these equations are not all linearly independent. The only linearly independent equation for En which (0) (0) emerges from this is obtained by forming the bracket of (3) with + < ψn | = k c¯j (n, k) < ψnk |. In fact, if we extend (2) dn the conjugate of this vector to an onb for C and form the bracket with these vectors, then the term involving En disappears. Thus we infer from from (5) that (2) En(2) = Enj = (0) (1) (0) (0) ( |cj (n, k)|2 )−1 [ c¯j (n, k) < ψnk |H1 |ψnj > + < ψnk |H2 |ψnl > c¯j (n, k)cj (n, l) k

k

(1) −Enj

k,l

c¯j (n, k)
]

[37] Induced characters: Let G be a ﬁnite group and H a subgroup of G. Select one element x in each left coset of H. Let I denote the set of all such x� s. Thus, we have x∈I xH = G and xH yH = φ, x �= y, x, y ∈ I. Let L be a representation of H acting in the vector space V . We shall formally write x.V for the vector space V attached to the element x. The representation space for the representation π = IndG H L (ie π is the representation of G induced by the x.V . π(g) acts on this space by mapping x.v to [gx].v where representation L of H) may be denoted by U = x∈I g ∈ G, x ∈ I, v ∈ V and [gx] ∈ I is the element for which [gx]H = gxH. We may use the notation gxV = [gx]V . Let χU denote the character of π and χV that of L. It is clear that g ∈ G will map x.V onto itself iﬀ gxV = xV iﬀ x−1 gxV = V iﬀ x−1 gx ∈ H. For a given g ∈ G, let Xg denote the set of all such x� s. In other words, Xg = {x ∈ I : x−1 gx ∈ H} It follows that

χU (g) =

χV (x−1 gx)

x∈Xg

We note that for any x, g ∈ G, x−1 gx ∈ H iﬀ y −1 gy ∈ H for all y ∈ G(g).x and in this case, x−1 gx = y −1 gy, where G(g) is the centralizer of g in G. Thus, we can write for x ∈ Xg , χV (y −1 gy) χV (x−1 gx) = μ(G(g))−1 y∈G(g)x

where μ(G(g)) denotes the number of elements in G(g). It follows that χV (y −1 gy) χU (g) = μ(G(g))−1 x∈Xg ,y∈G(g)x

Further, it is clear that h ∈ H and y gy ∈ H implies (yh)−1 gyh = h−1 y −1 gyh ∈ H and hence χV ((yh)−1 gyh) = χV (y −1 gy). Thus, the above formula can also be expressed as χV (y −1 gy) χU (g) = μ(G(g))−1 μ(H)−1 −1

x∈Xg H,y∈G(g)x

Quantum Mechanics for Scientists and Engineers 110

113

= (μ(G(g))μ(H))−1

χV (y −1 gy)

y∈G(g)Xg H

Now, consider the set of conjugacy classes of g ∈ G:

C(g) = {xgx−1 : x ∈ G} C(g) contains

μ(C(g)) = μ(G)/μ(G(g))

distinct elements. Reference: Claudio Procesi, ”Lie groups, an approach through invariants and representations”, Springer. A.5. Some more problems in group theory and quantum mechanics. [1] The basic observables of non-relativistic quantum mechanics obtained from the unitary representations of the Galilean group based on Mackey’s theory of semidirect products. Suppose N is a vector space regarded as an Abelian group under addition. Let H be a group that acts on N as n → τh (n). Let G = N ⊗s H. Thus, any element g ∈ G can be uniquely expressed uniquely as g = nh, n ∈ N, h ∈ H and the composition law in G is given by n1 h1 n2 h2 = n1 τh1 (n2 )h2 . We have τh1 h2 = τh1 oτh2 , τh (n + n� ) = τh (n) + τh (n� ) In other words, h → τh is an homomorphism of H into aut(N ) with aut(N ) being the same as End(N ) (aut(N ) is a group theoretic notation while End(N ) is a vector space theoretic notation. The composition of n1 h1 and n2 h2 : Then we may regard N as being normalized by the action τ of H. Equivalently, via the construction of a group isomorphism, we can regard τh as begin given by τh (n) = hnh−1 , so that n1 h1 n2 h2 = (n1 + τh1 (n2 )).h2 Now let B(n1 , n2 ) be a skew symmetric real bilinear form on N that is H-invariant, ie, B(τh (n1 ), τh (n2 )) = B(n1 , n2 ) Then consider

σ(n1 h1 , n2 h2 ) = exp(iB(n1 , τh1 (n2 )))

We claim that σ satisﬁes the conditions for a multiplier on G, ie, for a projective unitary (pu) representation U of G, ie, σ(g1 , g2 )σ(g1 g2 , g3 ) = σ(g1 , g2 g3 )σ(g2 , g3 ) − − − (1) We leave this veriﬁcation to the reader. This follows from the fact that if U satsiﬁes by the deﬁnition of a pu representation U (g1 )U (g2 ) = σ(g1 , g2 )U (g1 g2 ) and hence by associativity of linear operator multiplication, (U (g1 )U (g2 ))U (g3 ) = U (g1 )(U (g2 )U (g3 )) we get or

σ(g1 , g2 )U (g1 g2 )U (g3 ) = U (g1 )U (g2 g3 )σ(g2 , g3 ) σ(g1 , g2 )σ(g1 g2 , g3 )U (g1 g2 g3 ) = σ(g1 , g2 g3 )σ(g2 , g3 )U (g1 g2 g3 )

ie (1). [2] Let f (r1 , ..., rN ) and g(r1 , ..., rN ) be two functions on (R3 )N . Let R ∈ SO(3) and σ, τ, ρ ∈ SN . Let χλ (σ) be an irreducible character of Sn corresponding to the Young tableaux (ie a partition of n) λ and consider I(f, g, r1 , ..., rN ) = f (rσ1 , ..., rσn )¯ g (rτ 1 , ..., rτ n )χλ (στ −1 ) σ,τ ∈Sn

We have

I(f, g, rρ1 , ..., rρN ) =

σ,τ

f (rρσ1 , ..., rρσn )¯ g (rρτ 1 , ..., rρτ n )χλ (στ −1 )

Quantum Mechanics for Scientists and Engineers 111

114

=

f (rσ1 , ..., rσn )¯ g (rτ1 , ..., rτ n )χλ (ρστ −1 ρ−1 )

σ,τ

= I(f, g, r1 ‘, ..., rN )

since

χ(ρσρ−1 ) = χ(σ)

for any character χ of Sn . More generally, we can deﬁne � )= f (rσ1 , ..., rσn )¯ g (rτ� 1 , ...., rτ� n )χλ (στ −1 ) I(f, g, r1 , ..., rN , r1� , ..., rN σ,τ ∈Sn

Then, we get � � I(f, g, rρ1 , ..., rρn , rρ1 , ..., rρn )=

� � f (rρσ1 , ..., rρσn )¯ g (rρτ 1 , ..., rρτ n )

σ,τ

=

.χλ (στ −1 ) = f (rσ1 , ..., rσn )¯ g (rτ� 1 , ..., rτ� n )

σ,τ

� ) χλ (στ −1 ) = I(f, g, r1 , ..., rN , r1� , ..., rN � ∈ R3 . Now,let R ∈ SO(3). The rotated and permuted image ﬁelds obtained from f and g are ∀r1 , ..., rN , r1� , ..., rN � � � f1 (r1 , ..., rN ) = f (R−1 rρ1 , ..., R−1 rρN ), g1 (r1� , ..., rN ) = g(R−1 rρ1 , ..., R−1 rρN )

and we get

� � ) = I(f, g, R−1 r1 , ..., R−1 rN , R−1 r1� , ..., R−1 rN )= I(f1 , g1 , r1 , ..., rN , r1� , ..., rN

and hence if χl denotes an irreducible character of SO(3), we get � I(f1 , g1 , S1 r1 , ...S1 rN , S2 r1� , ..., S2 rN )χl (S1 S2−1 )dS1 dS2 SO(3)×SO(3)

=

SO(3)×SO(3)

=

� I(f, g, R−1 S1 r1 , ..., R−1 S1 rN , R−1 S2 r1� , ..., R−1 S2 rN )χl (S1 S2−1 )dS1 dS2

SO(3)×SO(3)

=

� I(f, g, S1 r1 , ..., S1 rN , S2 r1� , ..., S2 rN )χl (RS1 S2−1 R−1 )dS1 dS2

SO(3)×SO(3)

It follows that

� I(f, g, S1 r1 , ..., S1 rN , S2 r1� , ..., S2 rN )χl (S1 S2−1 )dS1 dS2

� I0 (f, g, r1 , ..., rN , r1� , ..., rN )=

SO(3)×SO(3)

� I(f, g, S1 r1 , ...S1 rN , S2 r1� , ..., S2 rN )χl (S1 S2−1 )dS1 dS2

is invariant under permutations and rotations. References for A.4 and A.5.: [1] Hoﬀman and Kunze, Linear Algebra, Prentice Hall. [2] T.Kato, Perturbation theory for linear operators, Springer. [3] K.R.Parthasarathy, ”An introduction to quantum stochastic calculus”, Birkhauser. [4] S.J.Orfanidis, ”Optimum signal processing”, Prentice Hall. [5] C.R.Rao, ”Linear statistical inference and its applications, Wiley. [6] J.Gough and Koestler, ”Quantum ﬁltering in coherent states”. [7] Lec Bouten, ”Filtering and control in quantum optics”, Ph.D thesis. [8] Leonard Schiﬀ, ”Quantum mechanics”. [9] W.O.Amrein, ”Hilbert space methods in quantum mechanics”. [10] K.R.Parthasarathy, ”Coding theorems of classical and quantum information theory”. Hindustan Book Agency. [11] Naman Garg, H.Parthasarathy and D.K.Upadhyay, ”Estimating parameters of an image ﬁeld modeled as a quantum electromagnetic ﬁeld using its interaction with a ﬁnite state atomic system”, Technical Report, NSIT, 2016.

Quantum Mechanics for Scientists and Engineers 112

115

[12] Naman Garg, H.Parthasarathy and D.K.Upadhyay, ”MATLAB implementation of Hudson-Parthasarathy noisy Schrodinger equation and Belavakin’s quantum ﬁltering equation with analysis of entropy evolution”, Technical Report, NSIT, 2016. A.6. New Syllabus for a short course on transmission lines and waveguides. [1] Study of non-uniform transmission lines by expanding the distributed parameters and the line voltage and current as Fourier series in the spatial variable z. The modes of propagation (propagation constants) appear as the eigenvalues of an inﬁnite matrix deﬁned in terms of the spatial Fourier series coeﬃcients of the non-uniform line impedance and admittance. [2] Study of the statistics of the line voltage and current (spatial correlations) when the distributed parameters of the line have small random ﬂuctuations. The study is based on perturbation theory applied to matrix eigenvalue problems and is very similar to time independent perturbation theory used in quantum mechanics. [3] Analysis of transmission lines when the distributed parameters are randomly ﬂuctuating functions of both space and time. We focus on estimating the distributed parameter statistical correlations from measurements of the line voltage and current and applying the ergodic hypothesis for estimating the line voltage and current correlations and then matching these correlations with the theoretically derived correlations. [4] Analysis of transmission lines with random loading along the line using inﬁnite dimensional stochastic diﬀerential equations. The voltage and current loading along the line are assumed to be expandable in terms of basis functions of the spatial variable with the coeﬃcients being white noise processes in time, ie, derivatives of Brownian motion processes. We then calculate the probability law of the line voltage and current [5] Equivalence of transmission lines and waveguides obtained by expanding the guide electric and magnetic ﬁelds in terms of basis functions of (x, y) and the coeﬃcients being functions of z. From the Maxwell equations, we derive an inﬁnite series of ﬁrst order linear diﬀerential equations for the coeﬃcient functions of z and compare these equations with an inﬁnite sequence of coupled transmission lines. The basis functions of (x, y) used in the expansion of the electric and magnetic ﬁelds must satisfy the boundary conditions, namely, that Ez and the normal derivative of Hz vanish on the boundary. [6] Study of nonlinear hysteresis and nonlinear capacitive eﬀects on the dynamics of a transmission line. Hysteresis is related to a nonlinear B − H curve having memory and is a consequence of the Landau-Lifshitz theory of magnetism in which the magnetic moment of an atom precesses in an external magnetic ﬁeld due to the M × H torque on it. The solution to this equation is a Dyson series for the magnetization M in terms of H and truncated upto second degree terms, this leads to a quadratic expression for the hysteresis voltage term as a function of the line current. In other words, we have a second order Volterra relation between the hysteresis voltage and current. This is a consequence of magnetic properties of the material of which the line is made. Nonlinear capacitive eﬀects can be explained from the nonlinear-memory relation between the dipole moment/polarization of an electron relative to its nucleus when an external electric is incident on it. The binding of the electron to the atom has harmonic as well as anharmonic terms which causes the diﬀerential equation satisﬁed by the position of the electron to be nonlinear and hence solving this equation using perturbation theory, we obtain the dipole moment as a Volterra series in the external electric ﬁeld. When applied to transmission lines, this manifests itself as a Volterra relation between the line charge (which is proportional to the electric displacement vecto D = �0 E + P where P is the polarization/dipole moment per unit volume). The time derivative of this charge is the capacitor current and this component is incorporated into the line current equation and analysis is done using perturbation theory. [7] Quantization of the line equations using the Gorini-Kossakowski-Sudarshan-Lindblad (GKSL) formalism. The line equations for a lossless line are distributed parameter analogs of LC circuits. The lossless line equations like an LC circuit can be derived from a Lagrangian and hence from a Hamiltonian that is a quadratic function of the phase variables. The eﬀect of noise on such a system is obtained by adding a GKSL term to the dynamics of states and observables. These GKSL terms can natually be obtained using the Hudson-Parthasarathy quantum stochastic calculus by tracing out over the bath variables. By choosing our GKSL operators L as complex linear functions of the phase variables, we obtain resistive damping terms in the dynamical equations and hence we are able to obtain a quantum mechanical model for a lossy line. A.7. Creativity in the mathematical, physical and the engineering sciences. In this section, we give a brief history of the various intellectual achievements in the mathematical, physical and engineering sciences showing how creativity in these sciences very often comes from a need to understand nature and the working of the world around us. The examples we choose are Newton, Maxwell, Einstein, Planck, Bose, Rutherford, Bohr, Heisenberg, Schrodinger, Dirac, Dyson, Feynman, Schwinger, Tomonaga, Weinberg, Salam, Glashow and Hawking in the physical sciences, Faraday and Edison and the inventor Esaki of the tunnel diode in the engineeering sciences and in the mathematical sciences, Euler, Gauss, Fourier, Fermat, Galois, Abel, Cauchy, Hilbert, Poincare, Kolmogorov,

Quantum Mechanics for Scientists and Engineers 113

116

Ramanujan, Harish-Chandra and more recently, Edward Witten, a pioneer in mathematical string and superstring theory. The creation of quantum electrodynamics: After the creation of the quantum theory of atoms and molecules, there remained several gaps in our understanding of the physical world. For example, it was not clear how to explain various experimental observations like the force of an electron on itself, the electron self energy ie, the movement of an electron produces an em ﬁeld which acts back on the electron, the phenomenon of vacuum polarization according to which a photon propagates in vacuum to produce an electron-positron pair which propagate and again annihilate each other to produce once again a photon, the anomalous magnetic moment of the electron which gives radiative corrections to the magnetic moment caused once again by the the em ﬁeld generated by the electron acting back on itself. Compton scattering of an electron/positron by a photon also remained unexplained. In other words, a satisfactory quantum theory describing the interactions of electrons, positrons and photons and how to calculate probabilities of scattering processes of these particles remained to be carried out. Thus because of the need to understand these unexplained physical processes, Feynman, Schwinger and Tomonaga created new mathematical tools which eventually were sharpened by Dyson and the succeeding generation of theoretical physicists like Wienberg, Salam, Glashow, Witten, and more recently by the Indian physicists Sen and Ashtekar. Feynman proposed a path integral formulation for calculating the scattering matrix for particles. This involved identifying the Lagrangian density L0 [φ] of the electron-positron Dirac ﬁeld without their interactions and that of the electromagnetic ﬁeld and then evaluating the action S[φ] = Ld4 x for these ﬁelds. Feynman followed it with evaluation of the Gaussian path integrals exp(iS[φ])Dφ taking into account the Berezin change for Fermionic path integrals. Then the interaction term Sint [φ] = J μ Aμ d4 x = ψ ∗ γ 0 γ μ ψAμ d4 x (φ = (Aμ , ψ)) is considered and its contribution was evaluated by expanding the exponential exp(iSint [φ]) in as a power series in S[φ] and Feynman associated a diagram with each term in this series explaining how each term gives rise to a diﬀerent order term in the scattering matrix and how these terms can be calculated easily by a diagrammatic algorithm. On the other hand Schwinger and Tomonaga proposed an operator theoretic approach to calculating the scattering matrix. Their algorithm was based on ﬁrst quantizing the electromagnetic ﬁeld using creation and annihilation operators. This had already been observed by Paul Dirac when he wrote down the energy of the electromagnetic ﬁeld as a quadaratic function of the four vector potential in the spatial frequency domain and deduced that this quadratic structure meant that the em ﬁeld should be considered as an ensemble of harmonic oscillators with two oscillators associated with each spatial frequency (two degrees of polarization arise from the fact that in the coulomb gauge, in the absence of charges, the electric scalar potential is zero while divA = 0 for the Coulomb gauge implies that in the spatial frequency domain, the magnetic vector potential is orthogonal to the wave vector). The next idea of Schwinger was to substitute this quantum electromagnetic ﬁeld consisting of a superposition of operators into the atomic Hamiltonian described by position and momentum operators and obtain an interaction term between the position-momentum pair for the atom with the quantum electromagnetic ﬁeld operators. This interaction Hamiltonian was then used to calculate the scattering matrix elements and deduce corrections to the magnetic moment of the electron. Schwinger and Tomonaga also proposed a Lorentz invariant scheme for writing down the Schrodinger equation (which is not Lorentz covariant) for ﬁeld theories. The idea basically involved replacing the time t variable by a three dimensional surface variable σ. In other words, just as the t = constt surface is the three dimensional Euclidean space R3 as a subspace of R4 , likewise, σ = constt. could be an arbitrary three dimensional submanifold of R4 . The Schrodinger equation i ∂ψ(t) = H(t)ψ(t) ∂t was replaced by the Lorentz covariant equation δψ(σ) = H(σ)ψ(σ) δσ and this formalism was used with great power by Schwinger and Tomonaga to calculate the scattering matrix element between two three dimensional surfaces. Uniﬁcation of the Feynman and Schwinger-Tomonaga theory was performed by Freeman Dyson who simply showed why one should obtain the same results using Feynman diagrams and the operator theoretic approach. For a very long time, Dyson’s notes on this was the standard textbook for all courses in quantum ﬁeld theory all over the world. Dyson’s work led to renormalization theory developed by himself and subsequently othe researchers. This involved getting rid of the inﬁnities in quantum ﬁeld theory by renormalizing charge, mass and ﬁelds, ie, scaling these quantities with numbers depending on an ultraviolet and infrared cutoﬀ while integrating in the four frequency domain. After the great riddle of construcing a cogent quantum theory of electrodynamics was solved almost completely by these four powerful mathematical physicists, the problem of understanding nuclear weak and strong forces remained and also how to unify these with quantum electrodynamics. The problem of unifying quantum electrodynamics with the weak forces, called the Electroweak theory was successfully solved by Weinberg, Salam and Glashow using group theoretic formalism, more precisely the SU (2) × U (1) formalism. Both the weak forces and electromagnetic forces appeared as gauge ﬁelds in this theory. Principles of symmetry breaking were used in this uniﬁcation giving rise to mass of electrons and other nuclear particle. Goldstone had a say in this uniﬁcation when he proposed the idea of how when a Lagrangian of ﬁelds that is invariant under a group G has a vacuum state that is not G-invariant because of degeneracies of the vacuum state, the symmetry of the Lagrangian as viewed from the i

Quantum Mechanics for Scientists and Engineers 114

117

vacuum state is broken to s smaller group H ⊂ G and associated with each degree of broken symmetry is a massless particle called a massless Goldstone boson. The unbroken symmetries correspond to massive particles. Symmetry can also be broken by adding a term to the G-invariant Lagrangian density. This is what happens in the electroweak theory. The electroweak-strong uniﬁcation was achieved by Gell-Mann and Nee-Mann who based their theory on the group SU (3) × SU (2) × U (1). The idea is to derive all the coupling constants of electrdynamics, the weak and the strong theories from one uniﬁed theory. The entire idea of unifying gauge ﬁelds is based on the basic principle of Yang and Mills, namely that one can construct a covariant derivative ∇μ = ∂μ + iAμ (x) acting on a vector space (Cn ) valued function of x in such a way that the gauge ﬁeld Aμ (x) takes values in a Lie algebra g of a subgroup G of the unitary group U (n). The wave function on which ∇μ acts is Cn . Further, this covaraiant derivative satisﬁes the property that under a local G-transformation by g(x) ∈ G, the gauge ﬁeld Aμ (x) which takes values in g transforms in such a way to A�μ (x) so that g(x)(∂μ + A�μ (x)) = (∂μ + Aμ (x))g(x) or equivalently as

g(x)∇�μ = ∇μ g(x)

where both sides are regarded as ﬁrst order diﬀerential operators acting on wave functions ψ(x) ∈ Cn . This gives iA�μ (x) = g(x)−1 ∂μ g(x) + ig(x)−1 Aμ (x)g(x) This idea of gauge transformation in which the massive ﬁeld wave function ψ(x) ∈ Cn transforms to g(x)ψ(x) while the massless gauge ﬁeld Aμ (x) transforms in the above way leads to the conclusion that given a Lagrangian density of the form L(ψ(x), ∇μ ψ(x)) that is G − invariant, ie, L(gψ(x), g∇μ ψ(x)) = L(ψ(x), ∇μ ψ(x)) for all g ∈ G, it follows that L is invariant also under local G-actions g(x) provided that the Aμ (x) sitting inside the covariant derivative ∇μ is also subject to the above gauge transformation. When the group G is the Abelian group U (1), Aμ (x) is simply a real valued function for each μ and the above gauge transformation reduces to the Lorentz gauge transformation for the electromagnetic potentials. Thus, the non-commutative Yang-Mills theory provides a sweeping generalization of the commutative em ﬁeld theory. It can also be applied to describe the interaction of the Dirac ﬁeld ψ(x) with a noncommutative gauge ﬁeld with the electromagnetic potentials also coming as an extra component of the gauge potential. Although this idea of Yang and Mills is a purely group theoretic construction, it turned out to be one of the most fruitful constructs for unifying almost all the quantum particle ﬁelds. What remains now is the development of a quantum theory of gravity which would enable one to associate a particle which we may call a graviton and to describe its interaction with other quantum particles like the photon, electron, positron, and the propagators of the weak and strong forces. It should be borne in mind that the action of a classical gravitational ﬁeld on any quantum particle described by a relativistic wave equation like the Dirac equation can be achieved using the Idea of Yang and Mills, namely by introducing a gravitational connection Γμ (x) which is a matrix. For example, if the gravitational ﬁeld is described by a tetrad Vaμ (x) so that the metric is g μν (x) = η ab Vaμ (x)Vbν (x) with η ab being the Minkowski metric of ﬂat space-time, then this tetrad can be understood as a local transformation of curved space-time to an inertial frame. We then construct a covariant derivative ∂μ + Γμ (x) and transform it locally to an inertial frame by using Vaμ (x)(∂μ + Γμ (x)) and setting up the Dirac equation in a gravitational ﬁeld as [iγ a Vaμ (x)(∂μ + Γmu (x)) − m]ψ(x) = 0

where γ a , a = 0, 1, 2, 3 are the usual Dirac gamma matrices. To qualify as a valid general relativistic wave equation, this must be invariant under local Lorentz transformations Λ(x). Under such a local Lorentz transformation, ψ(x) transforms to D(Λ(x))ψ(x) where D is the Dirac representation of the Lorentz group and its Lie algebra generators are J ab = 14 [γ a , γ b ]. Suppose that under such a local Lorentz transformation Γμ (x) which is a 4 × 4 matrix changes to Γ�μ (x). Then we should have with ψ � (x) = D(Λ(x))ψ(x), the equation [iγ a Λba (x)Vbμ (x)(∂μ + Γ�μ (x) − m]ψ � (x) = 0 where the change of space-time coordinates has been accounted for by the factor matrix elements Λba (x) of the local Lorentz transformation of the inertial frame index a in the tetrad frame Vaμ (x). Using the identity D(Λ)γ b D(Λ)−1 = Λba γ a we get from the above [D(Λ(x))(iγ b Vbμ (x)(D(λ(x))−1 (∂μ + Γ�μ (x)) − m]D(Λ(x))ψ(x) = 0 This is equivalent to iγ b Vbμ (x)((D(Λ(x))−1 ∂μ D(Λ(x))) + ∂μ + D(Λ(x))−1 Γ�μ (x)D(Λ(x))) − m]ψ(x) = 0

Quantum Mechanics for Scientists and Engineers 115

118

It follows that for this to coincide with the untransformed Dirac equation in curved space-time, the connection Γμ (x) of the gravitational ﬁeld in the Dirac representation should transform to Γ�μ (x) where D(Λ(x))−1 Γ�μ (x)D(Λ(x)) + (D(Λ(x))−1 ∂μ D(Λ(x))) = Γμ (x) or equivalently,

Γ�μ (x) = D(Λ(x))Γμ (x)D(Λ(x))−1 − (∂μ D(Λ(x)))D(Λ(x))−1

Such a connection has been constructed and is given by

Γμ (x) =

1 ab ν J Va Vbν:μ 2

(Reference: Steven Weinberg, ”Gravitation and Cosmology, Principles and Applications of the General Theory of Relativity”, Wiley.) Gravity was uniﬁed with classical electromagnetism by Einstein in this beautiful theory the general theory of relativity. This theory said that gravity is not a force, it is simply a curvature of space-time and when matter moves in such a curved space time, follows geodesics which are shortest paths on the curved four dimensional manifold of space-time. These shortest paths are curved because any path on a curved surface is curved. By saying that geodesics are curved, we mean that the relation between the spatial and time coordinates of a moving particle is nonlinear and hence the motion appears to us as being accelerated motion. A.8 Classiﬁcation and representation theory of semisimple Lie algebras. By Serre’s theorem, a complex semisimple Lie algebra L is generated by 3n generators ei , hi , fi , i = 1, 2, ..., n satisyfying the commutation relations [ei , ej ] = [fi , fj ] = [hi , hj ] == [ei , fj ] = 0, [ei , fj ] = δij hi , [hi , ej ] = aij ej , [hi , fj ] = −aij fj

where aij are integers called the Cartan integers. This result of Serre follows from the Cartan’s theory which says that L has a maximal Abelian subalgebra h (Called a Cartan Algebra) and that any two maximal Abelian subalgebras are mutually conjugate. This result is not true for real semisimple Lie algebras where there can be more than one non-conjugate Cartan algebras. The reprsentation theory for real semisimple Lie algebras was developed almost single handedly by the great Indian mathematician Harish-Chandra who derived generalizations of the character formula of H.Weyl using the theory of distributions and also obtained the complete Plancherel formula for such algebras by introducing in addition to the principal and supplementary series of irreducible representations (introduced by Gelfand for obtaining the Plancherel formula for complex semisimple Lie groups), the discrete series of irreducible representations. Coming back to the theme of complex semisimple Lie algebras, Cartan proved that the elements of h in the adjoint representation, act in a semsimple way on L, ie, each operator ad(H), H ∈ h acting on the vector space L is diagonable. It follows from basic Linear algebra, that the operators ad(H), H ∈ h are simulatneously diagonable and hence we have the following direct sum decomposition of L: Lα L=h⊗ α∈Φ

where Φ is the set of all roots of L, ie, for each α ∈ Φ and X ∈ Lα , we have [H, X] = α(H)X

We are here deﬁning

Lα = {X ∈ L : [H, X] = α(H)X∀H ∈ h}

α is a non-zero linear functional on h, ie, α ∈ h∗ and all the α� s are distinct linear functionals. Moreover, Cartan’s theory states that there exists a subset Δ ⊂ Φ called a set of simple roots such that any α ∈ Φ is either a purely positive or purely negative integer linear combination of elements of Δ. This means that writing Δ = {α1 , ..., αp }, we have that for any α ∈ Φ, there exist integers m1 , ..., mp such that mj ≥ 0∀j or mj ≤ 0∀j and α=

p

mj αj

j=1

and further, no α ∈ Δ has such a decomposition. Actually, there exist many such sets Δ. Further, dimLα = 1∀α ∈ Φ. This follows from the following elementary argument, From the Jacobi identity, [Lα , Lβ ] ⊂ Lα+β , α, β ∈ Φ. Thus

Quantum Mechanics for Scientists and Engineers 116

119

[Lα , L−α ] ⊂ h. Choose an eα ∈ Lα , fα ∈ L−α such that B(eα , fα ) = b(α) where B(X, Y ) = T r(ad(X)ad(Y )) and b(α) is a constant to be chosen appropriately. Cartan proved that B deﬁnes an non-degenerate symmetric bilinear form on L (only if L is semisimple). Now deﬁne tα = a(α)[eα , fα ] where a(α) is a constant to be chosen appropriately. Then, tα ∈ h and [tα , eα ] = α(tα )eα , [tα , fα ] = −α(tα )fα and more generally, for any α, β ∈ Φ,

[tα , eβ ] = β(tα )eβ , [tα , fβ ] = −β(tα )fβ

Consistency is checked as follows using B([X, Y ], Z) = B(X, [Y, Z]):

= b(β)β(tα ) = β(tα )B(eβ , fβ ) = B([tα , eβ ], fβ )

We further have

= −B(eβ , [tα , fβ ]) = β(tα )B(eβ , fβ ) = b(β)β(tα ) B(tα , tβ ) = a(α)B([eα , fα ], tβ ) = a(α)B(eα , [fα , tβ ]) = a(α)α(tβ )B(eα , fα ) = a(α)b(α)α(tβ )

and we denote this by (α, β). The above formula implies that (α, β) = (β, α) = a(α)b(α)α(tβ ) = a(β)b(β)β(tα ) Now deﬁne Then,

hα = 2tα /(α, α) [hα , eα ] = 2α(tα )eα /(α, α) = 2eα

provided that we choose the a(α)� s and the b(α)� s so that α(tα ) = (α, α) For such a choice of the constants, we also have [hα , fα ] = −2fα , and

[eα , fα ] = tα /a(α) = (α, α)hα /(2a(α)) = hα

provided that we choose the a(α)� s and the b(α)� s so that (α, α) = 2a(α) = α(tα ) In other words, we have found eα ∈ Lα , fα ∈ L−α , hα ∈ h so that {eα , fα , hα } satisfy the same commutation relations as the standard generators of sl(2, C). We denote the Lie algebra generated by these three elements by slα (2, C). We note that in the adjoint representation, slα (2, C) is (a module for) an irreducible representation of the Lie algebra slα (2, C). We are assuming here that relative to the set of simple roots Δ, α is a positive root, ie it is expressible as a positive integer linear combination of the elements of Δ. We note that if X ∈ Lα , Y ∈ Lβ , then B([H, X], Y ) = −B(X, [H, Y ]) implies

(α(H) + β(H))B(X, Y ) = 0, H ∈ h

and hence B(X, Y ) = 0 unless β = −α. It is therefore clear from the non-degeneracy of B that for each X ∈ Lα , B(X, Y ) �= 0 for some Y ∈ L−α It is also clear that dimLα = 1 This can be seen as follows: Consider the sum Mα = h ⊕

c

Lcα

Quantum Mechanics for Scientists and Engineers 117

120

the sum being over all the irreducible representations of slα (2, C) in which the weights are multiples of α. (By weight β, here we mean that if X is a weight-vector with weight β where β is a linear functional on h, then ad(H)(X) = β(H)X∀H ∈ h. We note that Mα is a module for slα (2, C). When H = hα , then β(hα ) becomes an eigenvalue of ad(hα ) and hence β can be regarded as a weight for the Lie algebra slα (2, C). Since slα (2, C) as a Lie algebra is isomorphic to sl(2, C), in any irreducible representation of slα (2, C), either a weight zero or a weight one will occur with a unique weight vector. Clearly, ad(hα ) when acting on Mα has weight zero iﬀ the weight vector is in h. Mα contains the module h + slα (2, C) of slα (2, C) and hence it cannot contain any irreducible even module that has zero intersection with h + slα (2, C) appearing in Mα as a direct summand for any even irreducible module for slα (2, C) must necessarily contain a zero weight vector which must be an element of h and hence will intersect the module h + slα (2, C)(Note that h + slα (2, C) is a module for slα (2, C), ie, it is left invariant by the adjoint action of the latter and hence this module can be decomposed into irreducible modlues for slα (2, C)). Therefore, we must have Mα = h + slα (2, C) and in particular,

dimLα = 1, α ∈ Φ

By an even module g of slα (2, C), we mean that ad(hα ) has an even eigenvalue when operating on the root vectors in g in the adjoint representation. For example, if the eigenvalues {−2q, −2q + 2, ..., 0, 2, 4, ..., 2p}} occured in an irreducible submodule of Mα for the Lie algebra slα (2, C) as a direct summand diﬀerent from slα (2, C) in the adjoint representation, then the zero weight vector in this represenation would be hα which is a contradiction. Further, an odd summand (ie in which a vector having weight one occurs) also cannot occur, for then α/2 would be a root ((α/2)(hα ) = 1) and hence α = 2(α/2) cannot be a root by the above argument(Note that in an irreducible representation of slα (2, C), only the weights from the set 2Z or only weights from 2Z + 1 can occur. We have thus proved that if α is a root and cα is also a root, then c = ±1. Remark: If g is a semisimple Lie algebra and, then we can decompose g=

N

gi

i=1

as a direct sum of vector spaces gi where each gi is an ideal in g, ie [g, gi ] ⊂ gi ∀i and hence [gi , gj ] ⊂ gi

gj = {0}, i �= j.

A.8.Schrodinger wave equations for quantum general relativity. A manifold speciﬁed by space-time coordinates x ˜μ is given. Another coordinate system for this manifold is X μ . The metric tensor relative to the former is g˜μν and the metric tensor for the latter is gμν . Thus, we have μ ν X,σ = g˜ρσ gμν X,ρ μ

a b μ By X,ρ we mean ∂X ∂xρ . We denote the spatial coordinates of the former system by x , x ,etc, where a, b = 1, 2, 3. Thus, the spatial components of the metric in the former system are μ ν X,b g˜ab = gμν X,a

We deﬁne

qab = g˜ab

We denote by ((q )) the 3 × 3 matrix that is the inverse of ((qab )). Now, write ab

μ = T μ = N μ + N nμ X,0

where N μ is purely spatial, ie, of the form

μ N μ = N a X,a

and the vectors N μ and nμ are orthogonal with nμ normalized by the factor N . This means that N a is selected so that μ ν , gμν nμ X,a =0 N μ = N a X,a

The ﬁrst is the condition for N μ to be a spatial vector and the second is the condition for nμ to be orthogonal to all spatial vectors. We can visualize this by saying that the three dimensional spatial manifold Σt deﬁned by x0 = t = constt is embedded in the four dimensional manifold speciﬁed by the coordinates X μ . The vector (nμ ) is the unit normal to the

Quantum Mechanics for Scientists and Engineers 118

121

μ 3 manifold Σt and the vectors (X,a )μ=0 , a = 1, 2, 3 are tangential to the manifold Σt . We thus get the following equations for N a : μ μ ν − N a X,a )X,b =0 gμν (X,0

or

g˜0b − N a g˜ab = 0

or equivalently,

qab N b = g˜0a

We now prove the following decomposition:

g μν = q μν + nμ nν

where

q μν nν = 0

In other words, g μν can be decomposed as a sum of a purely spatial part and a purely normal part with regard to the surface Σt . To see this, we write μ ν g μν = g˜αβ X,α X,β = ν g 0a (N μ + N nμ )X,a + g˜00 (N μ + N nμ )(N ν + N nν ) + 2˜ μ ν g˜ab X,a + X,b μ We have to show that the cross term in this expansion is zero, ie, terms involving products of spatial parts X,a and the normal part nμ . The cross part here is ν g 0a N nμ X,a 2N g˜00 N μ nν + 2˜

To prove that this is zero amounts to proving that ν =0 g˜00 N μ nν + g˜0a nμ X,a

To prove this it suﬃces to show that or equivalently,

g˜00 N a + g˜0a = 0

Proving this is equivalent to proving that which is the same as (since g˜ba g˜

0a

μ =0 g˜00 N μ + g˜0a X,a

+ g˜b0 g˜

00

=

δba

g˜ba g˜0a + g˜00 g˜ab N a = 0 −˜ gb0 g˜00 + g˜00 g˜ab N a = 0

= 0). Thus, we have to show that

qab N a = g˜b0 μ But this has already been established using the orthogonality of the normal vector nμ with the spatial vectors X,a .

A.9. Time travel in the special and general theories of relativity and the revised notions of space-time in quantum general relativity. Gravitational red-shift: Let U (r) be the gravitational potential. Then the approximate (Newtonian) metric of spacetime is given by dτ 2 = (1 + 2U (r)/c2 )dt2 − c−2 (dx2 + dy 2 + dz 2 )

We are assuming that U depends only on the radial coordinate relative to a system. The radial null geodesic (radial propagation of photons) is given by 0 = dτ 2 = (1 + 2U (r)/c2 )dt2 − dr2 or equivalently,

dr/dt = (1 + 2U (r)/c2 )1/2

Thus assuming r1 < r2 , a photon pulse starting from r1 at time t1 arrives at r2 at time t2 given by r2 t2 − t1 = (1 + 2U (r)/c2 )−1/2 dr r1

119 Quantum Mechanics for Scientists and Engineers

122

In this expression, t1 , t2 are coordinate times, ie, times measured by a clock at a large distance from the gravitiational ﬁeld, ie, at a point where the gravitatitional ﬁeld is zero. Now if another pulse starts from r1 at coordinate time t1 + δt1 , then it will arrive at r2 at time t2 + δt2 where r2 t2 + δt2 − t1 − δt1 = (1 + 2U (r)/c2 )−1/2 dr r1

Note that we are assuming a static gravitational ﬁeld. It thus follows that δt2 = δt1 The proper time intervals measured by clocks static at r1 and r2 for the pulses are respectively given by dτ1 = (1 + 2U (r1 )/c2 )1/2 dt1 , dτ2 = (1 + 2U (r2 )/c2 )1/2 dt2 It follows therefore that dτ1 /dτ2 = [ where

1 + 2U1 /c2 1/2 ] 1 + 2U2 /c2

U1 = U (r1 ), U2 = U (r2 )

Hence, if U2 < U1 , we get dτ1 > dτ2 or in terms of frequencies,

ν1 = 1/dτ1 < 1/dτ2 = ν2

or more precisely,

1 + 2U2 /c2 1/2 ν1 =[ ] dτ2 , it follows that clocks run slower in a strong gravitational ﬁeld, ie when the gravitational potential is more negative. A.10[a].Transmission lines with random ﬂuctuations in the parameters and random line loading: v,z (t, z) + (R0 (z) + δR(t, z))i(t, z) + L0 (z)i,t (t, z) + (δL(t, z)i(t, z)),t = wv (t, z) i,z (t, z) + (G0 (z) + δG(t, z))v(t, z)) + C0 (z)v,t (t, z) + (δC(t, z)v(t, z)),t = wi (t, z) In these equations, δR(t, z), δL(t, z), δC(t, z), δG(t, z), wv (t, z), wi (t, z) are random Gaussian space-time Gaussian ﬁelds. We wish to solve this system of pde’s approximately using ﬁrst order perturbation theory and hence calculate the approximate statistical correlations of the ﬂuctuations in the line voltage and line current in terms of the correlations in the parameter ﬂucutations and the voltage and current loading terms wv and wi . A.10[b]. Taking non-linear hysteresis and nonlinear capacitive eﬀects into account, generalize the problem of A.9. A.11. Estimating parameters in statistical image models described by linear and nonlinear partial diﬀerential equations: First consider the linear case: The model for the image ﬁeld φ(x, y), (x, y) ∈ D is given by p

a,b=1

A(a, b, θ)∂xa ∂yb φ(x, y) = s(x, y) + w(x, y), (x, y) ∈ D

where w(x, y) is zero mean coloured Gaussian noise. θ ∈ Rm is the parameter vector to be estimated from measurements on φ. Here, s(x, y) is a given input non-random signal ﬁeld. We assume that an initial guess estimate θ0 of θ is known and that the correction δθ to this estimate is to be made. We write φ(x, y) = φ0 (x, y) + δφ(x, y)

Quantum Mechanics for Scientists and Engineers 120

123

where φ0 is the solution with the guess parameter θ0 and zero noise and δφ is the ﬁrst order correction to φ0 arising from the parameter estimate correction term δθ and the noise w. We regard δφ, δθ, w all as being of the ﬁrst order of smallness. We deﬁne A(a, b, θ)(jω1 )a (jω2 )b H(ω1 , ω2 , θ) = a,b

Then if two dimensional spatial Fourier transforms are denoted by placing a hat on top of a signal/noise ﬁeld, we get ˆ 1 , ω2 ) = sˆ(ω1 , ω2 ) + w(ω H(ω1 , ω2 , θ)φ(ω ˆ 1 , ω2 ) Thus to zeroth order, we get

H(ω1 , ω2 , θ0 )φˆ0 (ω1 , ω2 ) = sˆ(ω1 , ω2 ), ˆ 1 , ω2 ) + (Hr (ω1 , ω2 , θ0 )δθr )φˆ0 (ω1 , ω2 ) H(ω1 , ω2 , θ0 )δ φ(ω = w(ω ˆ 1 , ω2 )

where Hr (ω1 , ω2 , θ0 ) = Thus, if

∂H(ω1 , ω2 , θ0 ) ∂θr

g(x, y) = F −1 (H(ω1 , ω2 , θ0 )−1 )

it then follows that

φ0 (x, y) = g(x, y) ∗ s(x, y) = Likewise, if we write

g(x − x� , y − y � )s(x� , y � )dx� dy �

hr (x, y) = F −1 (Hr (ω1 , ω2 , θ0 )) Ar (a, b, θ0 )δ (a) (x)δ (b) (y) = a,b

where

Ar (a, b, θ0 ) = then we get

∂A(a, b, θ0 ) ∂θr

δφ(x, y) = −g(x, y) ∗ hr (x, y)δθr + g(x, y) ∗ w(x, y) − − − (2)

where summation over the repeated index r is implied. We measure δφ(x, y) as follows: First since we know θ0 and the input signal ﬁeld s(x, y), we can calculate φ0 (x, y) using (1) and then we measure the actual noise perturbed image ﬁeld φ(x, y) and calculate δφ(x, y) = φ(x, y) − φ0 (x, y). Now using (2), we calculate the maximum likelihood estimator of δθ as ˆ = δθ argminδθ ( (δφ(x, y) + gr (x, y)δθr )Q(x, y|x� , y � )(δφ(x� , y � ) + gs (x� , y � )δθs )dxdydx� dy � ) where Q(x, y|x� , y � ) is the inverse Kernel of

ie,

and

We write

R(x, y|x� , y � ) = E[(g(x, y) ∗ w(x, y)).(g(x� , y � ) ∗ w(x� , y � ))] g(x − x1 , y − y1 )g(x� − x�1 , y � − y1� )E(w(x1 , y1 )w(x�1 , y1� ))dx1 dy1 dx�1 dy1�

Q(x, y|x� y � )R(x� , y � |x�� , y �� )dx� dy � = δ(x − x�� )δ(y − y �� )

gr (x, y) = g(x, y) ∗ hr (x, y) =

Ar (a, b, θ0 )∂xa ∂yb g(x, y)

a,b

g(x, y) = ((gr (x, y)))

121 Quantum Mechanics for Scientists and Engineers

124

Then,

ˆ = [ Q(x, y|x� , y � )g(x, y)g(x� , y � )T dxdydx� dy � ]−1 [ Q(x, y|x� , y � )g(x, y)δφ(x� , y � )dxdydx� dy � ] δθ

A simple computation gives us the covariance of this parameter vector estimator: ˆ = [ Q(x, y|x� , y � )g(x, y)g(x� , y � )T dxdydx� dy � ]−1 Cov(δ θ)

Wavelet based image parameter estimation: Let ψn (x, y), n = 1, 2, ... be a wavelet orthonormal basis for L2 (R2 ). Here, the index n consists of the scaling and translational index in both the dimensions, ie, n corresponds to four ordered integer indices. We deﬁne c(n, φ) =< φ, ψn >=

Then,

φ(x, y) =

φ(x, y)ψn (x, y)dxdy

c(n, φ)ψn (x, y)

n

Substituting this into the image pde model gives c(n, φ)A(a, b, θ)∂xa ∂yb ψn (x, y) = s(x, y) + w(x, y) n,a,b

Taking the inner product on both sides with ψm (x, y) gives c(n, φ)A(a, b, θ) < ψm , ∂xa ∂yb ψn >= c(m, s) + c(m, w) n,a,b

Deﬁne

P (m, n|θ) =

A(a, b, θ) < ψm , ∂xa ∂yb ψn >

a,b

The above equation can then be expressed as P (m, n|θ)c(n, φ) = c(m, s) + c(m, w) n

Writing

θ = θ0 + δθ, φ(x, y) = φ0 (x, y) + δφ(x, y)

gives us on applying ﬁrst order perturbation theory, P (m, n|θ0 )c(n, φ0 ) = c(m, s), n

(( Pr (m, n|θ0 )δθr )(¸n, φ0 ) + P (m, n|θ0 )δc(n)) = c(m, w) n

where

r

Pr (m, n|θ0 ) = and We write

∂P (m, n|θ0 ) ∂θr

δc(n) = c(n, φ0 + δφ) − c(n, φ0 ) c0 (n) = c(n, φ0 ), Pr (m, n) = Pr (m, n|θ0 ), P0 (m, n) = P (m, n|θ0 )

and thus get n,r

P0 (m, n)c0 (n) = c(m, s)

n

Pr (m, n)δθr c0 (n) +

n

P0 (m, n)δc(n) = c(m, w)

Quantum Mechanics for Scientists and Engineers 122

125

A.12 Mackey’s theory on the construction of the basic quantum observables from unitary representations of the Galilean group. (a, v)inV = R3 × R3 . V is the Abelian group of translations and uniform velocity motions acting on the space-time manifold M = {(t, x) : t ∈ R, x ∈ R3 }. This action is given by (a, v)(t, x) = (t + x + vt + a) (τ, g) ∈ R × SO(3). R × SO(3) is the non-Abelian group of time translations and rotations acting on M: (τ, g)(t, x) = (t + τ, gx) The Galiean group G is the semidirect product of V and H: G = V ⊗s H where H acts on V as follows:

(τ, g).(a, v).(τ, g)−1 = (a� , v � )

or equivalently,

(τ, g).(a, v) = (a� , v � ).(τ, g)

Acting both sides on (t, x) ∈ R gives 4

(τ, g)(t, x + vt + a) = (a� , v � )(t + τ, gx) or equivalently,

(t + τ, gx + tgv + ga) = (t + τ, gx + v � t + a� + v � τ )

or equivalently,

v � = gv, a� = g(a − vτ )

Thus, the semimdirect product structure is given by

(τ, g).(a, v).(τ, g)−1 = (g(a − vτ ), gv) ∈ V Any element of the Galilean group G can be expressed in two ways, one as an element (a, v, τ, g) deﬁned by its action on R4 by (a, v, τ, g)(t, x) = (t + τ, gx + vt + a) and in another way as the element (a, v).(τ, g). The action of this on R4 is given by (a, v).(τ, g)(t, x) = (a, v)(t + τ, gx) = (t + τ, gx + vt + vτ + a) It follows that the relationship between these two methods of expressing an element of the Galilean group is given by (a + vτ, v, τ, g) = (a, v).(τ, g) or equivalently by

(a − vτ, v).(τ, g) = (a, v, τ, g)

2j+1 Let now h be a Hilbert space (like for a spin j particle) and H = L2 (R3 , h) the Hilbert space of all measurable C functions f : R3 → h for which R3 � f (x) �2 d3 x < ∞. The projective unitary representations of G are obtained by using the multiplier m((a, v).(τ, g), (a� , v � ).(τ � , g � )) = exp(iB((a, v), (τ, g)[(a� , v � )])

where B : V × V → R (with V = R3 × R3 ) being a skew symmetric bilinear form invariant under H = {(τ, g) : τ ∈ R, g ∈ SO(3)}. that is invariant under H or equivalently under (τ, g). Note that the action of H on V is deﬁned by (τ, g)[(a, v)] = (τ, g).(a, v).(τ, g)−1 = (g(a − vτ ), gv) We thus ﬁnd that

m((a, v).(τ, g), (a� , v � ).(τ � , g � )) = exp(iB((a, v), (g(a − vτ ), gv))) = exp(iλ((a, gv) − (v, g(a − vτ ))))

for some λ ∈ R where (u, v) = u v. Let U be a projective unitary representation of G with this multiplier. Then, T

U ((a, v).(τ, g)) = U (a, v)U (τ, g)

Quantum Mechanics for Scientists and Engineers 123

126

We can write

U (a, 0) = V1 (a), U (0, v) = V2 (v), U (τ, g) = W1 (τ )W2 (g)

where V1 , V2 are unitary representations of the Abelian group V = R3 × R3 and W1 and W2 are unitary representations of R and SO(3). By Stone’s theorem on unitary representations of Abelian groups, it follows that there exist Hermitian operators Q = (Q1 , Q2 , Q3 ) and P = (P1 , P2 , P3 ) in L2 (R3 , h) such that V1 (a) = exp(−ia.P ), V2 (v) = exp(−iv.Q) and also a Hermitian operator H in the same space such that W1 (τ ) = exp(−iτ H) We have In particular,

U (a1 , v1 )U (a2 , v2 ) = exp(iλ(aT1 v2 − aT2 v1 ))U (a1 + a2 , v1 + v2 ) V1 (a)V2 (v) = U (a, 0)U (0, v) = exp(iλaT v)U (a, v), V2 (v)V1 (a) = U (0, v)U (a, 0) = exp(−iλaT v)U (a, v)

Thus, we get the Weyl commutation relations V1 (a)V2 (v) = exp(i2λaT v)V2 (v)V1 (a) which can be expressed as exp(−ia.P ).exp(−iv.Q) = exp(i2λaT v)exp(−iv.Q).exp(−ia.P ) and hence by considering inﬁnitesimal a and v in R3 . we get −Pi Qj + Qj Pi = 2iλδij or equivalently,

[Qi , Pj ] = 2iλδij

To get agreement with quantum mechanics that the momentum operators P generate translations and the position operators Q generate uniform velocities, we must take λ = 1/2 and thus, we get [Qi , Pj ] = iδij The Stone-Von-Neumann theorem then implies that the Hilbert space � can be chosen so that the actions of the Qi s and the Pj s in L2 (R3 , h) are such that (Qj f )(x) = xj f (x), (Pj f )(x) = −i

∂f (x) ∂xj

In other words, regarding L2 (R3 , h) as L2 (R3 ) ⊗ h (This is a Hilbert space isomorphism), we have that Qj = xj ⊗ Ih ∂ and Pj = −i ∂x ⊗ Ih . We note compute j U (τ, g)U (a, v) = U ((τ, g).(a, v)) Now,

(τ, g).(a, v)(t, x) = (τ, g)(t, x + vt + a) = (t + τ, gx + tgv + ga) = (ga, gv, τ, g)(t, x)

ie, On the other hand,

(τ, g).(a, v) = (ga, gv, τ, g) (a, v).(τ, g)(t, x) = (a, v).(t + τ, gx) = (t + τ, gx + vt + vτ + a) = (a + vτ, v, τ, g)(t, x)

Quantum Mechanics for Scientists and Engineers 124

Thus,

127

(τ, g).(a, v) = (g(a − vτ ), gv).(τ, g)

So we get

U (τ, g)U (a, v)U (τ, g)−1 = U (g(a − vτ ), gv)

Taking g = I, this gives

W1 (τ )U (a, v)W1 (−τ ) = U (a − vτ, v)

Setting a = 0 in this formula gives

W1 (τ )V2 (v)W1 (−τ ) = U (−vτ, v)

while setting v = 0 gives

W1 (τ )V1 (a)W1 (−τ ) = V1 (a)

The second equation implies

[H, Pj ] = 0, j = 1, 2, 3

We now note that Thus, we get from the ﬁrst equation

V1 (−vτ )V2 (v) = exp(−iτ |v|2 /2)U (−vτ, v)

W1 (τ )V2 (v)W1 (−τ ) = exp(iτ |v|2 /2)V1 (−vτ )V2 (v) = exp(iτ |v|2 /2)exp(iτ v.P )V2 (v) For inﬁnitesimal τ , this gives or equivalently,

−i[H, exp(−iv.Q)] = (i|v|2 /2 + iv.P )exp(−iv.Q) H − exp(−iv.Q).H.exp(iv.Q) = −|v|2 /2 − v.P − − − (1)

The O(v) term of this equation gives

i[v.Q, H] = −v.P

or equivalently,

i[H, Qj ] = Pj

Combining this with the equation

[H, Pj ] = 0

we may conclude using the commutation relations [Qi , Pj ] = iδij that 3

H = P 2 /2 + E =

1 2 P + E − − − (2) 2 j=1 j

where E is an operator of the form I ⊗ E1 in L2 (R3 ) ⊗ h, ie, for f (x) ∈ h, x ∈ R3 , we have (Ef )(x) = E1 f (x) By considering the O(v 2 ) term in (1), we get [v.Q, [v.Q, H]] = −v 2 or equivalently, This is veriﬁed by (2):

[[H, Qi ], Qj ] = −δij /2 [P 2 /2, Qi ] = −iPi , [[P 2 /2, Qi ], Qj ] = −i[Pi , Qj ] = −δij

We now consider the equation U (τ, g)U (a, v)U (τ, g)−1 = U (g(a − vτ ), gv) with τ = 0. We get W2 (g)U (a, v)W2 (g)−1 = U (ga, gv) In particular, we get

W2 (g)V1 (a)W2 (g)−1 = V1 (ga), W2 (g)V2 (v)W2 (g)−1 = V2 (gv)

Quantum Mechanics for Scientists and Engineers 125

128

These equations are the same as W (2(g)Pi W2 (g)−1 =

3

gji Pj ,

3

gji Qj

j=1

W2 (g)Qi W2 (g)−1 =

j=1

Here, g ∈ SO(3). Thus, W2 (g) has the eﬀect of rotating the position and momentum operators. N-particle system: We assume that Hi is the Hilbert space for the ith particle and that the projective unitary representation U of the Galilean group G acting in H = ⊗N i=1 Hi has the form U (a, v, 0, g) = ⊗N i=1 Ui (a, v, 0, g) In other words, as regards translation, motion with uniform velocities and rotations, the actions of these on each particle in the system is the same. The above discussion for a single particle implies that each Ui (a, v, 0, g) acts in the same way on the corresponding particle. However, time evolution described by the operator U (0, v, τ, I) acts on the entire system and may not be factorizable into a tensor product of single particle operators. This is because, the3 generator of this group which is the energy/Hamiltonian H is the sum of the individual kinetic energies and an interaction potential energy and the latter depends on some complex combination of all the particle position operators. So for the present, we can let U (0, 0, τ, I) = exp(−iτ H) where H is a Hermitian operator acting in H. We also assume the existence of velocity operators Vi = (Vi1 , Vi2 , Vi3 ), i = 1, 2, ..., N acting in H satisfying the following properies: U (0, v, 0, I)Vij U (0, v, 0, I)−1 = Vij + vj , 1 ≤ j ≤ 3, i = 1, 2, ..., N, Since

Ui (0, v, 0, I) = exp(−iv.Q)

it follows that i[v.

k

Qk , Vij ] = −vj

where Qk = (Qk1 , Qk2 , Qk3 ) are the position operators acting in Hk and likewise Pk = (Pk1 , Pk2 , Pk3 are the momentum operators acting in Hk . It should be noted that by the theory for one particle discussed above and the separability of U (a, v, 0, g) we have that U (a, 0, 0, I) = exp(−ia.P ) = ⊗k exp(−iaPk ), P = k Pk and likewise for Q. Thus, we get Qkl , Vij ] = −δjl i[ k

We also have from the one particle theory and separability of U (a, v, 0, g) that i[Qkl , Pij ] = −δki δlj So, if we postulate that

[Qkl , Vij ] = 0, k �= i

(this is true if we assume that Vij acts in Hi ), then we get

i[Qkl , Vij ] = −δki δij and hence we derive

[Qkl , Pij − Vij ] = 0

which implies that

Pij − Vij = Aij (Q)

where Aij (Q) is a function of Q = (Qij : 1 ≤ i ≤ N, 1 ≤ j ≤ 3) only. We deﬁne H0 =

N

N

k=1

k=1

3

1 2 1 2 Vk = Vki 2 2 i=1

Quantum Mechanics for Scientists and Engineers 126

129

From the composition theory of Galilean group representations developed above, we have that U (0, 0, τ, I)U (a, 0, 0, I) = U (a, 0, τ, I) = U (a, 0, 0, I)U (0, 0, τ, I) and hence

[H,

Pki ] = 0

k

Note that

U (a, 0, 0, I) = exp(−ia.

Pk ) = exp(−i

ai Pki )

k,i

This means that the total momentum of the system of N particles is conserved. Now, we consider 3 3 2 [H0 , Qki ] = [ Vkr /2, Qki ] = [ (Pkr − Akr (Q))2 /2, Qki ] = r=1

r=1

−i(Pki − Aki (Q)) = −iVki

We also assume (by deﬁnition of the velocity as the time derivative of the position), d U (0, 0, −τ, I).Qki U (0, 0, τ, I)|τ =0 = Vki dt This gives

[H, Qki ] = −iVki

and hence

[H − H0 , Qki ] = 0

and therefore,

H − H0 = V0 (Q) + E

where E is of the form I ⊗ E1 with E1 acting in h and V0 (Q) and arbitrary function of the positions Q = (Qki : 1 ≤ k ≤ N, i = 1, 2, 3). Thus, we ﬁnally get the general form of the total system Hamiltonian: H=

1 (Pki − Aki (Q))2 + V0 (Q) + E 2 k,i

(Ref: K.R.Parthasarathy, ”Mathematical Foundations of Quantum Mechanics”, Hindustan Book Agency) A.13. A remark on quantum stochastic calculus. Let ut ∈ H, Ut ∈ U(H), t ≥ 0 Let P be a spectral measure on [0, ∞) with values in P(H) and assume that P commutes with all the Ut s. We can write Ut = exp(iHt ) where Ht is a Hermitian operator in H. Suppose that the Ht s commute with each other and that Ht acts in Pt H where Pt = P [0, t] (For example we can choose a Hermitian operator H in H that commutes with P and then deﬁne Ht = Pt H = HPt ). More generally, we shall assume that for s < t, Ht − Hs acts in P [s, t]H. Then we have dUt = iUt dHt and since dHt acts in dPt H = P [t, t + dt]H while Ut acts in Pt H, it follows that for v, w ∈ H, we have with Γ(Ut ) denoting the second quantization of Ut (ie, Γ(Ut ) = W (0, Ut )), d < e(v)|Γ(Ut )|e(w) >=< e(v)|dΓ(Ut )|e(w) >=

and hence Γ(Ut ) satisﬁes the qsde

i < e(v)|Γ(Ut )|e(w) > .d < v|Ht |w > dΓ(Ut ) = Γ(Ut )dΛ(Ht )

where Λ(X) is the second quantization of X deﬁned by < e(v)|exp(Λ(X))|e(w) >=< e(v)|e(exp(X)w) >= exp(< v|exp(X)|w >)

127 Quantum Mechanics for Scientists and Engineers

130

A.14. Scattering theory with time dependent interactions with the scattering centre. The free particle Hamiltonian is H0 and the Hamiltonian after the particle starts interacting with the scattering centre is H(t) = H0 + δ.V (t). We assume that V (t) = V0 is a constant operator for |t| > T . Let |φ1 > be a free particle state evolving according to H0 in the remote past (the ”in state”) Let |ψ1 > be the corresponding scattered state evolving according to H(t). Likewise, let |φ2 > be a free particle state evolving according to H0 in the future, ie, as t → ∞ (ie, the ”out state”) and |ψ2 > the corresponding scattered state evolving according to H(t). Deﬁne for t2 > t1 , U0 (t2 − t1 ) = exp(−i(t2 − t1 )H0 ), U (t2 , t1 ) = T {exp(−i where T {.} denotes the time ordering operator. We must have

t2

H(t)dt)}

t1

limt→∞ (U (t, 0)|ψ2 > −U0 (t)|φ2 >) = 0 and hence

|ψ2 >= limt→∞ U (t, 0)−1 U0 (t)|φ2 >

We write

Ω2 (t) = U (t, 0)−1 U0 (t), t ≥ 0,

Then on an appropriate domain of out states, we have the operator

Ω2 = limt→∞ Ω2 (t) Thus,

|ψ2 >= Ω2 |φ2 >

Likewise,

limt→−∞ (U (0, t)−1 |ψ1 > −U0 (−t)|φ1 >) = 0

or equivalently,

|ψ1 >= limt→−∞ U (0, t)U0 (−t)|φ1 > = limt→∞ U (0, −t)U0 (t)|φ1 > = Ω1 |φ1 >

where

Ω1 = limt→∞ Ω1 (t)|φ1 >

where The scattering matrix is given by

Ω1 (t) = U (0, −t)U0 (t) S = Ω∗2 Ω1 = limt→∞ Ω2 (t)∗ Ω1 (t) = limt→∞ U0 (−t)U (t, 0)U (0, −t)U0 (t)

= limt→∞ U0 (−t)U (t, −t)U0 (t) t = limt→∞ exp(itH0 ).T {exp(−i H(s)ds)}.exp(−itH0 ) −t

Note: In discussing scattering theory with noise, we assume that {V (t)} is an operator valued random process and then compute the average value of the scattering matrix with respect to the probability distribution of {V (t)}. By the Dyson series expansion, t

T {exp(−i

exp(−2itH0 ) + exp(−itH0 )(

∞

n=1

where

(−i)n

H(s)ds)} =

−t

V˜ (s1 )...V˜ (sn )ds1 ...dsn )exp(−itH0 )

−t= k,m

where

k,m

|I[k, m]|2 = 1

This means that given that we measure pixel number m, the probability of getting the grey scale amplitude and phase level k is given by PI (k|m) = I[k, m]|2 More generally, the image can be in a mixed state ρI deﬁned by ρI = I[k, m, r, s]|k, m >< r, s| 1≤k,r≤p,1≤m,s≤N

where the condition implies that

T r(ρI ) = 1

I[k, m, k, m] = 1

k,m

Then if the image is in the mixed state ρI , the probability of getting the grey scale level amplitude speciﬁed by the index k given that the mth pixel is measured is given by < k, m|ρI |k, m > PI (k|m) = r < r, m|ρI |r, m >

Quantum Mechanics for Scientists and Engineers 129

132

I[k, m, k, m] = r I[r, m, r, m]

We shall now express ρI in the frequency domain of the grey scale amplitudes: The quantum Fourier transform of the grey scale state |k > is given by p exp(−i2πkr/p)|r > |k˜ >= p−1/2 r=1

The inverse quantum Fourier transform is thus given by

p

|k >= p−1/2 Then, ρI =

where

exp(i2πkr/p)|˜ r>

r=1

I[k, m, r, s]|k, m >< r, s| = p−1 I[k, m, r, s]exp(i2π(kk � − rr� )/p)|k˜� >< r˜� | ⊗ |m >< s| ˆ � , m, s, r� ]|k˜� >< r˜� | ⊗ |m >< s| = I[k ˆ � , m, s, r� ] = p−1 I[k

k,r

We may express this as

I[k, m, r, s]exp(i2π(kk � − rr� )/p)

ˆ m, r, s]|k˜ >< r˜| ⊗ |m >< s| I[k, ˜ m >< r˜, s| ˆ m, r, s] = |k, = I[k,

ρI =

The average image energy at pixel number m and frequency k˜ is thus given by

˜ m|ρI |k, ˜ m> ˆ m, k, m] < k, I[k, = ˆ ˜, m|ρI |˜ r, m > r < s| m,s

where Tms is a p × p matrix for each m, s ∈ {1, 2, ..., N }. We assume that T is a unitary matrix, ie, T ∗ T = IpN This condition is equivalent to requiring that ∗ Tms Tm s ⊗ |s >< m|m� >< s� | = IpN or equivalently,

m,s,s

or equivalently,

∗ Tms Tms ⊗ |s >< s� | = IpN

∗ Tms Tms = δss Ip

m

Applying the unitary operator T to ρI gives a transformed image ﬁeld speciﬁed by the density matrix ρ�I = T ρI T ∗ = ρI = I[k, m, r, s]T (|k >< r| ⊗ |m >< s|)T ∗ =

Now,

ˆ m, r, s]T (|k˜ >< r˜| ⊗ |m >< s|)T ∗ I[k, T (|k >< r| ⊗ |m >< s|) =

Quantum Mechanics for Scientists and Engineers 130

(

jl

133

Tjl ⊗ |j >< l|)(|k >< r| ⊗ |m >< s|)(

=

j l

Tj∗ l ⊗ |l� >< j � |)

Tjl |k >< r|Tj∗ l ⊗ |j >< l|m >< s|l� > |m >< j � | Tjm |k >< r|Tj∗ s ⊗ |j >< j � | = jj

A.16. Quantization of the em ﬁelds inside a rectangular waveguide. (∇2⊥ + h2 )Ez = 0, (∇2⊥ + h2 )Hz = 0 jωμ γ ∇⊥ Ez − 2 ∇⊥ Hz × zˆ, h2 h γ jω� H⊥ = − 2 ∇⊥ Hz + 2 ∇⊥ Ez × zˆ h h We wish to select potentials Φ, A such that when the em ﬁelds satisfy the above, then E⊥ = −

E = −nablaΦ − jωA, μH = ∇ × A or equivalently,

Ez = γΦ − jωAz , E⊥ = −∇⊥ Φ − jωA⊥ , μHz zˆ = ∇⊥ × A⊥ ,

μH⊥ = ∇⊥ Az × zˆ − γ zˆ × A⊥

In addition, we wish that the potentials Φ, A satisfy the Lorentz gauge condition divA = −jω�μΦ, or equivalently,

∇⊥ .A⊥ − γAz = −jω�μΦ

It is easily seen that the most general potentials satisfying the above requirements are given by A⊥ = −(∇⊥ Φ + E⊥ )/jω, Az = (γΦ − Ez )/jω,

where Φ is any scalar ﬁeld that satisﬁes the Helmholtz equation

(∇2⊥ + h2 )Φ = 0 In particular, we can take Φ = 0 and then

A = −E/jω

The general solution for the electromagnetic ﬁelds in the guide with the boundary conditions that Ez and the normal components of H vanish on the side boundaries is given by Ez = C(n, m)exp(−γnm z)unm (x, y), Hz = D(n, m)exp(−γnm z)vnm (x, y) where

These functions are normalized:

γnm = (h2nm − ω 2 μ�)1/2 , √ unm (x, y) = (2/ ab)sin(nπx/a)sin(mπy/b), √ vnm (x, y) = (2/ ab)cos(nπx/a)cos(mπy/b)

0

a

0

b

u2nm dxdy =

0

a

0

b

2 vnm dxdy = 1

Quantum Mechanics for Scientists and Engineers 131

134

and further they are orthogonal

We thus ﬁnd that zˆ

C(n, m)unm (x, y)exp(−γnm z)−

We note that

unm un m dxdy = 0, (n, m) �= (n� , m� ) vnm vn m dxdy = 0, (n, m) �= (n� , m� ) E=

[(γnm /h2nm )C(n, m)∇⊥ unm (x, y)+(jωμ/hnm2 )D(n, m)∇⊥ vnm (x, y)×ˆ z ]exp(−γnm z) =

=

(∇⊥ unm , ∇⊥ vn m × zˆ)dxdy

(∇⊥ unm × ∇⊥ vn m , zˆ)dxdy

(unm,x vn m ,y − unm,y vn m ,x )dxdy = 0

on integration by parts (we are left with only boundary terms which vanish). We note that E⊥ = − [C(n, m)γnm /h2nm )∇⊥ unm (x, y) + D(n, m)(jωμ/hnm2 )∇⊥ vnm (x, y) × zˆ]exp(−γnm z)

and hence

= where

a

0

b

|E|2 dxdy

0

(|C(n, m)|2 (1 + |γnm |2 /h2nm ) + |D(n, m)|2 (μω)2 /h2nm )exp(−2αnm z) γnm (ω) = αnm (ω) + jβnm (ω)

Thus, �

a 0

b

0

0

d

|E|2 dxdydz =

(λ(n, m)|C(n, m)|2 + μ(n, m)|D(n, m)|2 )

n,m

where

λ(n, m) = (1 + |γnm |2 /h2nm )(1 − exp(−2αnm d))/2αnm μ(n, m) = ((μω)2 /h2nm )(1 − exp(−2αnm d))/2αnm

Note that the energy in the magnetic ﬁeld is given by |∇ × A|2 dxdydx/2μ = and

(∇ × A, B) = ∇.(A × B) − (A, ∇ × B)

The ﬁrst term on the rhs is a perfect divergence and by Gauss’ theorem, its volume integral over the guide volume is zero if assuming that the ﬁelds vanish on the surface. Further, ∇ × B = ∇ × (∇ × A) = ∇(divA) − nabla2 A = −∇2 A since divE = 0 implies divA = 0. Further, ∇2 A = −∇2 (E/jω) = ω 2 �μE/jω = −jω�μE and hence, (2μ)−1 |∇ × A|2 dxdydz = (�/2)

|E|2 dxdydz

Quantum Mechanics for Scientists and Engineers 132

135

In other words, the energy in the magnetic ﬁeld is same as the energy in the electric ﬁeld. Thus, the total ﬁeld energy is given by � |E|2 dxdydz

=

(λ(n, m)|C(n, m, ω)|2 + μ(n, m)|D(n, m, ω)|2 )

n,m

and this energy can be quantized using creation and annihilation operators in place of C(n, m, ω), D(n, mω) and their conjugates. A.17. Image processing for non-Gaussian noise models √ based on the Edgeworth expansion. The Edgeworth expansion: Let φ(x) = exp(−x2 /2)/ 2π, the standard normal density. Deﬁne the Hermite polynomials by Hn (x) = (−1)n exp(x2 /2)Dn exp(−x2 /2), D = d/dx The Edgeworth expansion of a density f (x) for which all the moments R |x|k f (x)dx, k = 1, 2, ..., are ﬁnite is given by f (x) = φ(x)(1 +

c[n]Hn (x)) = φ(x) + (2π)−1/2

n≥1

c[n]Dn exp(−x2 /2)

n≥1

Generating function and orthogonality of the Hermite polynomials: tn Hn (x)/n! = exp(x2 /2)( (−t)n Dn /n!)exp(−x2 /2) = exp(x2 /2)exp(−tD)exp(−x2 /2) n≥0

Thus,

n≥0

= exp(x2 /2)exp(−(x − t)2 /2) = exp(tx − t2 /2)

n,m≥0

tn sm

R

Hn (x)Hm (x)φ(x)dx =

φ(x)exp((t + s)x − t2 /2 − s2 /2)dx

= exp(ts)

and hence

Hn (x)Hm (x)φ(x)dx = n!δ[n − m], n, m ≥ 0

√ Thus {Hn (x)/ n! : n ≥ 0} forms an orthnormal basis for the Hilbert space L2 (R, φ(x)dx). It therefore follows that the coeﬃcients c[n], n ≥ 0 for the Edgeworth expansion of f (x) are given by f (x)Hn (x)dx/n!, n ≥ 0 c[n] = R

Note that H0 (x) = 1. Now consider a multivariate Edgeworth pdf deﬁned by ψ(x) = |A|ΠM i=1 f ((Ax)i ) where A is an M × M matrix and x ∈ RM . We have ψ(x) = (2π)−M/2 |A|exp(−xT AT Ax/2)ΠM i=1 (1 +

c[n]Hn ((Ax)i )

n≥1

We shall calculate is moment generating function: Let X ∈ RM have ψ as its pdf. Then ˆ = Eexp(< t, X >) = exp(< t, x >)ψ(x)dx ψ(t) = (2π)−M/2

RM

RM

exp(< t, A−1 y >)exp(−y T y/2)ΠM i=1 (1 +

= (2π)−M/2 ΠM i=1

c[n]Hn (yi ))dy

n≥1

exp((A−T t)i ξ)exp(−ξ 2 /2)(1 +

n≥1

c[n]Hn (ξ))dξ

Quantum Mechanics for Scientists and Engineers 133

136

To calculate this integral, we ﬁrst observe that (2π)−1/2

R

exp(tξ)exp(−ξ 2 /2)Hn (ξ)dξ

= (2π)−1/2 (−1)n = tn

exp(tξ).Dξn exp(−ξ 2 /2)dξ

φ(ξ)exp(tξ)dξ = tn exp(t2 /2)

where integration by parts has been used. Thus for the above multivariate case, we get ˆ = exp(tT A−1 A−T t/2)ΠM (1 + c[n]((A−T t)i )n ) ψ(t) i=1 n≥1

= exp(tT (AT A)−1 t/2)ΠM i=1 (1 +

c[n](A−T t)i )n )

n≥1

The approximate maximum likelihood estimator for an Edgeworth distribution: Suppose y = Ax + w where w has a multivariate Edgeworth expansion and x also has a multivariate Edgeworth expansion. We wish to estimate x based on y by maximizing p(x|y), ie, the MAP estimate. We have p(x|y)py (y) = p(y|x)px (x) = pw (y − Ax)px (x) Discrete time non-linear ﬁltering theory applied to real time image parameter estimation. The parameter vector v[n] at time n satisﬁes the stochastic diﬀerence equation v[n + 1] = f (n, v[n]) + �v [n + 1] where �v [n] is an iid sequence. Thus, v[n] is a discrete time Markov process with transition density p(v[n + 1]|v[n]) = p�v (v[n + 1] − f (n, v[n])) The image vector x[n] is partitioned into patches Pi x[n] with each patch given by Pi x[n] = vi [n] + �i [n], i = 1, 2, ..., L or equivalently,

P x[n] = v[n] + �[n]

where P is a non-singular square matrix and �[n] is an iid sequence independent of the sequence �v [n], n ≥ 1. Finally, the measurement model for the image vector is given by y[n] = x[n] + w[n] where w[n] is again an iid sequence independent of both the sequences �v and �. We assume that all the three random sequences �v , �, w have multivariate Edgeworth probability densities with possibly diﬀerent linear combination coeﬃcients. The aim is to dynamically estimate v[n] based on Yn = {y[k] : k ≤ n} We have p(v[n + 1]|Yn+1 ) =

We now observe that where

p(y[n + 1]|v[n + 1])p(v[n + 1]|v[n])p(v[n]|Yn )dv[n] p(y[n + 1]|v[n + 1])p(v[n + 1]|v[n])p(v[n]|Yn )dv[n]dv[n + 1]

y[n] = P −1 (v[n] + �[n]) + w[n] = P −1 v[n] + d[n] d[n] = P −1 �[n] + w[n]

Quantum Mechanics for Scientists and Engineers 134

137

is again an iid vector valued noise. Thus, the MAP estimate of v[n + 1] given Yn+1 is given by vˆ[n + 1] = argmaxv pd (y[n + 1] − Av � )p�v (v � − f (n, v))p(n, v|Yn )dv where A = P −1 . We assume that vˆ[n + 1] = f (n, vˆ[n]) + δv � = vˆ0 [n + 1] + δv � where vˆ0 [n + 1] = f (n, vˆ[n]), expand the above integral upto O((δv � )2 ) and then maximize this w.r.t δv � to get the extra correction. We have v0 [n + 1] + δv � )) = pd (y[n + 1] − A(ˆ 1 v0 [n + 1]) − p�d (y[n + 1] − Aˆ v0 [n + 1])T Aδv � + δv �T AT p��d (y[n + 1] − Aˆ v0 [n + 1])Aδv � pd (y[n + 1] − Aˆ 2 with neglect of O(|δv � |3 ) terms. Likewise, v0 [n + 1] + δv � − f (n, v)) = p�v (ˆ v0 [n + 1] − f (n, v)) + p��v (ˆ v0 [n + 1] − f (n, v))T δv � p�v (ˆ 1 v0 [n + 1] − f (n, v))δv � + δv �T p��v (ˆ 2 with neglect of O(|δv � |3 ). We thus obtain upto O(|δv � |2 ), v0 [n + 1]) − p�d (y[n + 1] − Aˆ v0 [n + 1])T Aδv � vˆ[n + 1] = vˆ0 [n + 1] + argmaxδv (pd (y[n + 1] − Aˆ 1 + δv �T AT p��d (y[n+1]−Aˆ v0 [n+1])Aδv � )(p�v (ˆ v0 [n+1]−f (n, v))+p��v (ˆ v0 [n+1] 2 T � 1 v0 [n+1]−f (n, v))δv � )p(n, v|Y −f (n, v)) δv + δv �T p��v (ˆ 2 = vˆ0 [n+1]+argmaxδv [(

+δv �T (

(pd (y[n+1]−Aˆ v0 [n+1])p��v (ˆ v0 [n+1]−f (n, v))−p�v (ˆ v0 [n+1]

−f (n, v))AT p�d (y[n+1]−Aˆ v0 [n+1]))p(n, v|Yn 1 (AT p�d (y[n+1]−Aˆ v0 [n+1])p��v (ˆ v0 [n+1]−f (n, v))T + (p��v (ˆ v0 [n+1] 2

This equation is of the form

v0 [n+1])A)p(n, v|Yn )dv)δv −f (n, v))+AT p��d (y[n+1]−Aˆ

1 vˆ[n + 1] = vˆ0 [n + 1] + argmaxδv [f (n, Yn+1 , vˆ0 [n + 1])T δv � + δv �T F (n, Yn+1 , vˆ0 [n + 1])δv � 2 = vˆ0 [n + 1] − F (n, Yn+1 , vˆ0 [n + 1])−1 (f (n, Yn+1 , vˆ0 [n + 1]))

and provides the desired recursion. Remark: The following approximation provides an alternate technique for improving the speed of the recursion: ψ(n, y[n + 1], v)p(n, v|Yn )dv ≈

1 v [n])Cov(v[n]|Yn )) ψ(n, y[n + 1], vˆ[n]) + T r(ψv�� (n, y[n + 1]ˆ 2 To apply this formula, we note that the vector and matrices f (n, Yn+1 , vˆ0 [n + 1]), F (n, Yn+1 , vˆ0 [n + 1]) can be expressed as integrals of the form f (n, Yn+1 , vˆ0 [n + 1]) = intψ1 (n, y[n + 1], vˆ0 [n + 1], v)p(n, v|Yn )dv F (n, Yn+1 , vˆ0 [n + 1]) = ψ2 (n, y[n + 1], vˆ0 [n + 1], v)p(n, v|Yn )

A.18.Cartan’s classiﬁcation of the simple Lie algebras and the Weyl character formula for the irreducible representations of Compact semisimple Lie groups. A scheme S is a ﬁnite set linearly independent vectors (elements) α1 , ..., αn in a real vector space with an inner product (., .) such that a(α, β) = 2(α, β)/(α, α)

Quantum Mechanics for Scientists and Engineers 135

138

is a non-positive integer for all α �= β, α, β ∈ S. The Cauchy Schwarz inequality then implies that 0 ≤ a(α, β)a(β, α) ≤ 3, α, β ∈ S, α �= β ie, the product a(α, β)a(β, α) assumes only the values 0, 1, 2, 3. It is known from the general theory of semisimple Lie algebras, that a set of simple positive roots of a semisimple Lie algebra forms a scheme. The numbers a(α, β) are called the Cartan integers. Obviously a(α, α) = 2. To pictorially display a scheme S having n elments, we arrange these elements as vertices with the weight of each vertex α marked by a number proportional to (α, α) = |α|2 . Theorem 1: A connected scheme with n elements cannot have more than n − 1 links. For suppose that the elments of the scheme are αk , k = 1, 2, ..., n. Then consider n n 0 for all k �= m forms a dense subset of H. Hence, there exists a vector u such that < u, uk >, k = 1, 2, ..., p are all distinct. Hence the Vand-der-Monde matrix ((< u, uk >n ))1≤k,m≤p is non-singular implying that c(k) = 0, k = 1, 2, ..., p. Now, since √ e(u) = 1 ⊕ tn u⊗n / n! �

n≥1

we get

√ dn e(tu)|t=0 = n!u⊗n dtn and since any symmetric tensor can be expressed as a linear combination of tensors of the form u⊗n , it follows that the exponential vectors e(u), u ∈ H span a dense subspace of Γs (H). An adapted process Xt , t ≥ 0 is a family of ˜ t |e(ut] )) ⊗ |e(u(t ) > for all t ≥ 0 where we have used the isomorphism that operators in Γs (H) such that Xt |e(u) >= (X identiﬁes |e(u ⊕ v) > with |e(u) > ⊗|e(v) >. Here ut] = u ⊗ χ[0,t] and u(t = u ⊗ χ (t,∞) . Ideally speaking, if T denotes the isomorphism that identiﬁes the vector e( i ui ) in Γs ( i Hi ) with ⊗i e(ui ) in i Γs (Hi ), then we should write the deﬁnition of an adapted process as ˜ t |e(ut] ) >) ⊗ e(u(t ) T (Xt |e(u) >) = (X Let P : 0 = t0 < t1 < ... < tn = T be a partition of [0, T ]. Its size is deﬁned as |P | = max0≤k≤n−1 (tk+1 − tk ) and we deﬁne the partial sum I(X, A, P ) =

n−1 k=0

Xtk (Atk+1 (m) − Atk (m))

Note that X(t) is adapted so it acts in the Fock space Γs (Ht] ) while Atk+1 − Atk acts in the Fock space Γs (H(tk ,tk+1 ] since (Atk+1 (m) − Atk (m))|e(u) >= (

tk+1

¯ dt)|e(u) >

tk

Note that time unfolds in quantum stochastic calculus as a continuous tensor product of Hilbert spaces. This can be visualized also in the classical probabilisitc setting by noting that if Ft , t ≥ 0 is a ﬁltration on a probability space (Ω, F, P ) generated by a stochastic process X(t), t ≥ 0, then for any t1 < t2 < t3 , if F(t1 ,t2 ] denotes the σ ﬁeld σ(X(t) : t1 < t ≤ t2 ), we can write L2 (F(t1 ,t3 ] ) = L2 (F(t1 ,t2 ] ) ⊗ L2 (F(t2 ,t3 ] ) in the sense that any measurable functional of X(t), t1 < t ≤ t3 can be expressed as a sum (poissbily inﬁnite) of products of functions of {X(t) : t1 < t ≤ t2 } and of {X(t) : t2 < t ≤ t3 }. More generally, we can write 2 L2 (F(0,∞) = ⊗∞ i=1 L (F(ti ,ti+1 ] )

where We now have

0 = t0 < t1 < ...., tn → ∞ I(X, A, P )|e(u) >=

n−1 k=0

X(tk )|e(u) >> ((tk , tk+1 ])

Quantum Mechanics for Scientists and Engineers 138

141

where > ((s, t]) =

t

s

< m(t� ), u(t� ) > dt� , s ≤ t

Note that > can be extended to a complex measure on (R+ , B(R+ )). If Q is a partition ﬁner that P , then it is clear that for each k = 0, 1, ..., n − 1, there exist integers a(k) < b(k) such that b(k)

(tk , tk+1 ] =

(sl , sl+1 ]

l=a(k)

and the points sl , l = 0, 1, ...m − 1 all form the partition Q. Thus, we have X(tk ) > ((tk , tk+1 ]) −

=

b(k)

l=a(k)

So

b(k)

X(sl ) > ((sl , sl+1 ])

l=a(k)

(X(tk ) − X(sl )) > ((sl , sl+1 ])

� (I(X, A, P ) − I(X, A, Q))|e(u) >�≤

max|t−s|≤|P |,s,t∈[0,T ] � (X(t) − X(s))|e(u) >� | > ([0, T ])|

Assume that X(t) is strongly uniformly continuous on [0, T ]. Then

limδ→0 max|t−s|≤δ,s,t∈[0,T ] � (X(t) − X(s))|e(u) >�= 0 and hence it follow that if Pn , n = 1, 2, ... is an increasing sequence of partitions, ie, Pn+1 > Pn ∀n and |Pn | → 0 as n → ∞, then |(I(X, A, Pn+m ) − I(X, A, Pn ))|e(u) >�→ 0, n → ∞, m = 1, 2, ...

which implies that I(X, A, Pn )|e(u) >, n = 1, 2, ... is a Cauchy sequence in the Boson Fock space Γs (H ⊗ L2 (R+ )). and hence converges to an element of this space. Further, the limit is independent of the sequence of partitions Pn for if Qn , n = 1, 2, ... is another increasing sequence of partitions such that |Qn | → 0, then � (I(X, A, Pn ) − I(X, A, Qn ))|e(u) >�≤� (I(X, A, Pn ) − I(X, A, Pn ∪ Qn ))|e(u) >� + � (I(X, A, Pn ∪ Qn ), I(X, A, Qn )|e(u) >�

and by the above logic, both of the terms on the rhs converge to zero, proving that the lhs also converges to zero and hence the strong limits of I(X, A, Pn ) and of I(X, A, Qn ) are the same. A.20. Hartree-Fock equations: ψa (x), x ∈ R3 are Fermionic operator ﬁelds and they satisfy the standard anticommutation relations {ψa (x), ψb∗ (x� )} = δab δ 3 (x − x� ), {ψa (x), ψb (x� )} = 0, {ψa (x)∗ , ψb (x)∗ } = 0

Let

T (x) = −∇2x /2m

and let V (x, x� ) = V (x� , x) be a scalar potential ﬁeld. The Hartree Fock seconed quantized Hamiltonian is deﬁned by H = H0 + H1 , H0 =

ψa (x)∗ T (x)ψa (x)d3 x

with summation over the repeated index a being implied, H1 = V (x, x� )ψa (x)∗ ψa (x)ψb (x� )∗ ψb (x� )d3 xd3 x� Note that

H0∗ = H0

Quantum Mechanics for Scientists and Engineers 139

142

follows by integration by parts and

H1∗ = H1

so that

H∗ = H

and hence H is a valid Hamiltonian. The Fermionic ﬁelds at time t are given by the rules of Heisenberg’s matrix mechanics: ψa (t, x) = exp(itH)ψa (x)exp(−itH), and its adjoint

ψa (t, x)∗ = exp(itH)ψa (x)∗ exp(−itH)

Now,

∂ψa (t, x) = iexp(itH)[H, ψa (x)]exp(−itH) ∂t [H, ψa (x)] = [H0 , ψa (x)] + [H1 , ψa (x)]

We have [H0 , ψa (y)] =

=

[H1 , ψa (y)] = Now,

=−

[ψb (x)∗ T (x)ψb (x), ψa (y)]d3 x

{ψb (x)∗ , ψa (y)}T (x)ψb (x)d3 x

δab δ 3 (x − y)T (x)ψb (x)d3 x = −T (x)ψa (x)

V (x, x� )[ψb (x)∗ ψb (x)ψc (x� )∗ ψc (x� ), ψa (y)]d3 xd3 x� ψa (y)ψb (x)∗ ψb (x)ψc (x� )∗ ψc (x� ) =

(δab δ 3 (y − x) − ψb (x)∗ ψa (y))ψb (x)ψc (x� )∗ ψc (x� ) =

δ (y − x)ψa (y)ψc (x� )∗ ψc (x� ) + ψb (x)∗ ψb (x)ψa (y)ψc (x� )∗ ψc (x� ) 3

So

[ψb (x)∗ ψb (x)ψc (x� )∗ ψc (x� ), ψa (y)] = −ψb (x)∗ ψb (x){ψc (x� )∗ , ψa (y)}ψc (x� ) − δ 3 (y − x)ψa (y)ψc (x� )∗ ψc (x� ) = −δac δ 3 (y − x� )ψb (x)∗ ψb (x)ψc (x� ) − δ 3 (y − x)ψa (y)ψc (x� )∗ ψc (x� ) = −δ 3 (y − x� )ψb (x)∗ ψb (x)ψa (y) − δ 3 (y − x)ψa (y)ψc (x� )∗ ψc (x� )

= −δ 3 (y − x� )ψb (x)∗ ψb (x)ψa (y) − δ 3 (y − x)(δac δ 3 (y − x� ) − ψc (x� )∗ ψa (y))ψc (x� )

Thus, we get

= −δ 3 (y − x� )ψb (x)∗ ψb (x)ψa (y) − δ 3 (y − x)ψc (x� )∗ ψc (x� )ψa (y) − δ 3 (y − x)δ 3 (y − x� )ψa (y)

= −iT (t, y)ψa (t, y) − 2i We write this equation as i

∂ψa (t, y) = ∂t V (y, x)ψb (t, x)∗ ψb (t, x)d3 x − iV (y, y)ψa (t, y)

∂ψa (t, y) ˜ y)ψa (t, y) = H(t, ∂t

˜ y) is the eﬀective Hamiltonian operator deﬁned by where H(t, ˜ y) = T (t, y) + 2 V (y, x)ψb (t, x)∗ ψb (t, x)d3 x + V (y, y) H(t, and This is the Hartree-Fock equation.

T (t, y) = exp(itH)T (y)exp(−itH)

Quantum Mechanics for Scientists and Engineers 140

143

A.21. Quantum scattering theory. Explicit determination of the scattering operator. H0 = P 2 /2m, H = H0 + V . exp(−itH0 )ψ(Q) = ψt (Q) say. Then,

idψt (Q)/dt = H0 ψt (Q) = −∇2Q ψt (Q)/2m ψt (Q) = exp(it∇2Q /2m)ψ(Q) / exp(it∇Q 2m) = (2πσ 2 )−3/2 exp(−|x|2 /2σ 2 )exp((x, ∇Q))d3 x

Then,

exp(σ 2 ∇2Q /2) = exp(it∇2Q /2m)

Hence,

σ 2 = it/m, σ =

So ψt (Q) = (2πit/m)−3/2

= (2πit/m)−3/2 = (2πit/m)−3/2 Deﬁne the Kernel function

exp(−m|x|2 /2it)exp((x, ∇Q ))ψ(Q)dx exp(−m|x|2 /2it)ψ(Q + x)d3 x exp(−m|Q − x|2 /2it)ψ(x)d3 x

Kt (Q) = (2πit/m)−3/2 exp(−m|Q|2 /2it)

Let Then,

it/m

W (t) = exp(itH)exp(−tH0 ) W � (t) = iexp(itH).V (Q).exp(−itH0 )

(V is assumed to be a function of Q only. Thus, W � (t) = iW (t)exp(itH0 )V (Q)exp(−itH0 ) Deﬁne Then, and

Z(t) = exp(itH0 ).V (Q).exp(−itH0 ) Z � (t) = iexp(itH0 )[H0 , V (Q)]exp(−itH0 ) [H0 , V (Q)] = [P 2 , V (Q)]/2m = (2m)−1 ([Pa , V (Q)]Pa + Pa [Pa , V (Q)]) = −i(2m)−1 ((∇Q V (Q), P ) + (P, ∇Q V (Q))) (−i/m)(V � (Q), P ) − (1/2m)∇2Q V (Q)

Another way to evaluate this is to note that

Z(t) = V (exp(itH0 )Q.exp(−itH0 )) Now,

exp(itH0 )Q.exp(−itH0 ) = exp(itad(H0 ))(Q) = Q + it[H0 , Q] + (it)2 [H0 , [H0 , Q]] + ... = Q + tP/m

Thus, Thus,

Z(t) = V (Q + tP/m) W � (t) = iW (t)V (Q + P t/m)

Quantum Mechanics for Scientists and Engineers 141

144

Now suppose |f >, |u >∈ Hac (H0 ). Then, by deﬁnition of the absolutely continuous spectrum of an operator, the Radon-Nikodym derivatives d < u|E0 (λ)|u > /dλ and d < f |E0 (λ)|f > /dλ exist and are ﬁnite. We have for V = |u >< u| with < u|u >= 1, W (t)|f >= exp(itH)exp(−itH0 )|f >= (i

t

exp(itH)V.exp(−itH0 )dt)|f >

0

and for Ω+ |f >= limt→∞ W (t)|f > to exist, it is suﬃcient that ∞ X= � V.exp(−itH0 )|f >� dt < ∞ 0

Now,

V.exp(−itH0 )|f >= |u >< u|exp(−itH0 )|f >

so

� V.exp(−itH0 )|f >�= | < u|exp(−itH0 )|f > | = | (d < u|E0 (λ)|f > /dλ)exp(−iλt)dλ| R

This is the magnitude of the Fourier transform of the function λ → d < u|E(λ)|f > /dλ. We note that | < u|dE(λ)|f > | ≤< u|dE) (λ)|u >1/2 < f |dE0 (λ)|f >1/2 Hence,

|d < u|E0 (λ)|f > /dλ| ≤ (d � E0 (λ)|u >�2 /dλ)1/2 (d � E0 (λ)|f >�2 /dλ)1/2

A necessary condition for the magnitude of the Fourier transform of a function to be integrable is that the Fourier transform be ﬁnite. Thus, a necessary condition for X < ∞ is satisﬁed since the Radon-Nikodym derivatives d � E(λ)|u >�2 /dλ and d � E0 (λ)|f >�2 /dλ are ﬁnite because both |f > and |u > belong to Hac (H0 ). Suppose H0 = P (in one dimension) andV = V (Q). H = H0 + V = P + V (Q). Then, V.exp(itP )f (x) = V (x)f (x + t). So, Ω+ |f > will exist if the function t → ( R V (x)2 |f (x + t)|2 dx)1/2 is integrable on R+ . Consider now two Hamiltonians H0 , H = H0 + V and let φ : R → R. Consider now the Hamiltonians φ(H0 ), φ(H). Let Wφ (t) = exp(itφ(H))exp(−itφ(H0 )) Then, Wφ (t) = I + i

t

0

So Wφ (∞)|f > will exist if

We note that

∞ 0

exp(isφ(H))(φ(H) − φ(H0 ))exp(−isφ(H0 ))ds

� (φ(H) − φ(H0 ))exp(−isφ(H0 ))|f >� ds < ∞

< u|(φ(H) − φ(H0 ))exp(−isφ(H0 ))|f >= −

R

R

exp(−isφ(λ))d < u|φ(H)E0 (λ)|f >

exp(−isφ(λ))φ(λ)d < u|E0 (λ)|f >

In particular, if φ is an invertible function we can write

< u|(φ(H) − φ(H0 ))exp(−isφ(H0 ))|f >= (exp(−isλ)d < u|φ(H)E0 (φ−1 (λ))|f > /dλ)dλ

−

exp(−isλ)λ(d < u|E0 (φ−1 (λ))|f > /dλ)dλ

provided we assume that the concerned Radon-Nikodym derivatives exist. These will exist provided that all |u > , φ(H)|u > and |f > belong to the absolutely continuous parts of the spectral measure E0 oφ−1 ie they belong to Hac (φ(H0 )).

Quantum Mechanics for Scientists and Engineers 142

145

A.22.Hartree Fock approximation to the two electron problem of the Helium atom. The Hamiltonian is H = H1 + H2 + V12 where

H1 = −∇21 /2m − 2e2 /r1 , H2 = −∇22 /2m − 2e2 /r2 , V12 = e2 /r12

Let us try the wave function (antisymmetric because the two electrons form a Fermionic pair) √ ψ = (ψ1 ⊗ ψ2 − ψ2 ⊗ ψ1 )/ 2 with the constraints

< ψ1 |ψ1 >=< ψ2 |ψ2 >= 1, < ψ1 |ψ2 >= 0

As in all eigenvalue problems we extremize

S =< ψ|H|ψ > −2E1 (< ψ1 |ψ1 > −1) − 2E2 (< ψ2 |ψ2 >) − 2λ1 Re(< ψ1 |ψ2 >) − 2λ2 Im(< ψ2 |ψ1 >)) We ﬁrst observe that taking into account the constraints, S =< ψ1 ⊗ ψ2 − ψ2 ⊗ ψ1 |(H1 + H2 + V12 )|ψ1 ⊗ ψ2 − ψ2 ⊗ ψ1 > −2E1 (< ψ1 |ψ1 > −1) − 2E2 (< ψ2 |ψ2 >) − 2λ1 Re(< ψ1 |ψ2 >) − 2λ2 Im(< ψ2 |ψ1 >))

=< ψ1 |H1 |ψ1 > + < ψ2 |H1 |ψ2 > + < ψ1 |H2 |ψ1 > + < ψ2 |H2 |ψ2 > + < ψ1 ⊗ ψ2 |V12 |ψ1 ⊗ ψ2 > < ψ2 ⊗ ψ1 |V12 |ψ2 ⊗ ψ1 > − < ψ1 ⊗ ψ2 |V12 |ψ2 ⊗ ψ1 >

− < ψ2 ⊗ ψ1 |V12 |ψ1 ⊗ ψ2 > −2E1 (< ψ1 |ψ1 > −1) − 2E2 (< ψ2 |ψ2 >) − 2λ1 Re(< ψ1 |ψ2 >) − 2λ2 Im(< ψ2 |ψ1 >))

Now,

δS/δ ψ¯1 = 0

gives

2H1 ψ1 (r1 ) + 2 < I ⊗ ψ2 |V12 |ψ1 ⊗ ψ2 > −2 < I ⊗ ψ2 |V12 |ψ2 ⊗ ψ1 >

−2E1 ψ1 − λ1 |ψ1 > −iλ2 |ψ2 >= 0

and likewise another equation for δS/δ ψ¯2 = 0. Expanding the above, we get 2(−∇21 /2m − 2e2 /r1 − E1 − λ1 )ψ1 (r1 ) + 2 ψ¯2 (r2 )(e2 /r12 )(ψ1 (r1 )ψ2 (r2 ) − ψ2 (r1 )ψ1 (r2 ))d3 r2 −iλ2 ψ2 (r1 ) = 0

We note that the second term can be expressed as ψ¯2 (r2 )(e2 /r12 )(ψ1 (r1 )ψ2 (r2 ) − ψ2 (r1 )ψ1 (r2 ))d3 r2 =[

(|ψ2 (r2 )|2 (e2 /r12 )d3 r2 ]ψ1 (r1 ) − (

(e2 /r12 )ψ¯2 (r2 )ψ1 (r2 )d3 r2 )ψ2 (r1 )

The ﬁrst term in this expression represents the potential energy produced by a smeared second electron charge on the ﬁrst charge, the charge density of this smeared distribution being given by e|ψ2 (r2 )|2 . The second term in the above expression represents the eﬀect of the spin interaction between the two electrons, the interaction caused by the fact that both the electrons cannot occupy the same state. Remark: The constraint < ψ1 |ψ2 >= 0 need not be introduced. Neither do the constraints < ψ1 |ψ1 >=< ψ2 |ψ2 >= 1 need to be imposed. We only need to introduce the constraint that the overall wave function be normalized, ie, 1 =< ψ|ψ > which is equivalent to

2 =< ψ1 ⊗ ψ2 − ψ2 ⊗ ψ1 |ψ1 ⊗ ψ2 − ψ2 ⊗ ψ1 >= 2 < ψ1 |ψ1 >< ψ2 |ψ2 > −2| < ψ1 |ψ2 > |2

Quantum Mechanics for Scientists and Engineers 143

146

or equivalently,

1 =< ψ1 |ψ1 >< ψ2 |ψ2 > −| < ψ1 |ψ2 > |2

This results in the following version of the Hartree Fock equation 2(−∇21 /2m − 2e2 /r1 )ψ1 (r1 ) + ψ¯2 (r2 )(e2 /r12 )(ψ1 (r1 )ψ2 (r2 ) − ψ2 (r1 )ψ1 (r2 ))d3 r2 +

ψ¯2 (r1 )(e2 /r12 )(ψ2 (r1 )ψ1 (r2 ) − ψ1 (r1 )ψ2 (r2 ))d3 r2

−E(< ψ2 |ψ2 > ψ1 (r1 )− < ψ2 |ψ1 > ψ2 (r1 )) = 0

with another equation of the same type, ie, with the same value of E for ψ2 . We can generalize this to a system of N interacting Fermions. The Hamiltonian of such a system is given by H=

N

Hi +

i=1

Vij

1≤i= N !det((< ψa |ψb >)) = N !

σ

sgn(σ)ΠN k=1 < ψk |ψσk >= Λ[ψ]

say. We have S1 [ψ] =< ψ|

N

k=1

=

Hk |ψ >=

σ,τ,k

=

σ,τ,k

σ,τ ∈SN

sgn(στ ) < ψσ1 ⊗ ...ψσN |Hk |ψτ 1 ⊗ ...ψτ N >

sgn(στ )(ΠN j=1,j�=k < ψσj , ψτ j >) < ψσk |Hk |ψτ k >

sgn(στ )(ΠN j=1,j�=k < ψj , ψτ σ −1 j >) < ψk , |Hσ −1 k |ψτ σ −1 k >

=

ρ,σ,k

and

sgn(ρ)(ΠN j=1,j�=k < ψj , ψρj >) < ψk |Hσk |ψρk > S2 [ψ] =< ψ|

i) < ψσi ⊗ ψσj |Vij |ψρi ⊗ ψρj >

σ,ρ,i,j:σ −1 i=

sgn(ρ)(ΠN k=1,k�=i,j < ψk |ψρk >) < ψi ⊗ ψj |Vσ −1 i,σ −1 j |ψρi ⊗ ψρj > S[ψ] = S1 [ψ] + S2 [ψ]

The variational principle

δψ¯k (S[ψ] − E(Λ[ψ] − 1)) = 0

thus gives

ρ,σ

+

ρ,σ,j

sgn(ρ)(ΠN j=1�=k < ψj |ψρj >)Hσk |ψρk >

sgn(ρ)|ψρk > (ΠN l=1,l�=k,j < ψl |ψρl >) < ψj |Hσj |ψρj >

N

i=1

Hi .

Quantum Mechanics for Scientists and Engineers 144

+

σ,ρ,j:σ −1 k

sgn(ρ)(ΠN m=1,m�=i,k < ψm |ψρm >)|ψρk >< ψi ⊗ I|Vσ −1 i,σ −1 k |ψρi ⊗ I >

sgn(ρ)(ΠN m=1,m�=i,j,k < ψm |ψρm >)|ψρk >< ψi ⊗ ψj |Vσ −1 i,σ −1 j |ψρi ⊗ ψρj >

= N !E

σ

sgn(σ)|ψσk > (ΠN j=1,j�=k < ψj |ψσj >), k = 1, 2, ..., N

A.23.Plasmonic waveguides. A rectangular waveguide is ﬁlled with a plasma. The charge per particle of the plasma is q and the particle distribution function is f (t, r, v). This distribution function is assumed to vanish on the boundaries of the guide and hence its Fourier transform w.r.t time can be expanded as f f (ω, r, v) = fnm (ω, v)unm (x, y)exp(−γnm (ω)z) n,m≥1

where

√ unm (x, y) = (2/ ab)sin(nπx/a)sin(mπy/b), 0 ≤ x ≤ a, 0 ≤ y ≤ b

For a ﬁxed n, m, we write f (ω, v) for fnm and γ for γnm . The Maxwell equations curlE = −jωμH, curlH = J + jω�E where J(ω, x, y) =

qvδf (ω, x, y, v)d3 v

where the f (ω, x, y, v)exp(−γz) is a component of the particle density corresponding to propagation constant γ. In component form, we have Ez,y (x, y) + γEy (x, y) = −jωμHx (x, y), γEx + Ez,x = jωμHy , Ey,x − Ex,y = −jωμHz ,

Hz,y + γHy = Jx (x, y) + jω�Ex , γHx + Hz,x = −Jy (x, y) − jω�Ey ,

Hy,x − Hx,y = Jz (x, y) + jω�Ez (x, y)

where the frequency argument ω has been omitted. Here, Jx (x, y) = qvx δf (x, y, v)d3 v, Jy (x, y) = qvy δf (x, y, v)d3 v, Jz (x, y) = qvz δf (x, y, v)d3 v − − − (a) These equations can be solved for Ex , Ey , Hx , Hy in terms of J(x, y) and the partial derivatives of Ez (x, y), Hz (x, y). Having done so, we substitute these into the Boltzmann kinetic transport equation with the unperturbed Maxwell distribution function f0 (v) being taken in place of f where multiplication with the em ﬁelds is concerned. Thus, we get the approximate equation jωδf (x, y, v) + vx δf,x (x, y, v) + vy δf,y (x, y, v) − γvz δf (x, y, v) + q(E + μv × H, ∇v )f0 (v) = −δf (x, y, v))/τ (v) − − − (b) where the relaxation time approximation for the collision term has been used. Here, δf (x, y, v) = f (x, y, v) − f0 (v). To proceed further, we ﬁrst solve the above equations for Ex , Ey , Hx , Hy : γ −jωμ (Ex , Hy )T = (−Ez,x , Hz,y − Jx (x, y))T , jω� −γ γ jωμ (Ey , Hx )T = (−Ez,y , −Hz,x − Jy (x, y))T , jω� γ

Quantum Mechanics for Scientists and Engineers 145

148

Solving these gives

Ex = −(γ/h2 )Ez,x − (jωμ/h2 )(Hz,y − Jx ) − − − (1) Ey = −(γ/h2 )Ez,y + (jωμ/h2 )(Hz,x + Jy ) − − − (2) Hx = (jω�/h2 )Ez,y − (γ/h2 )(Hz,x + Jy ) − − − (3)

Hy = −(jω�/h2 )Ez,x − (γ/h2 )(Hz,y − Jx ) − − − (4)

where

h2 = γ 2 + ω 2 �μ

Substituting these into the third components of the Maxwell curl equations gives (Hz,xx + Jy,x ) + (Hz,yy − Jx,y ) + h2 Hz = 0 (Ez,xx + Ez,yy ) − (γ/jω�)(Jx,x + Jy,y ) + h2 Ez = 0

or equivalently,

(∇2⊥ + h2 )Ez = (γ/jω�)(Jx,x + Jy,y ), (∇2⊥ + h2 )Hz = Jx,y − Jy,x

In accordance with the boundary conditions on the tangential components of E and the normal components of H, we have the expansions Enm unm (x, y)exp(−γnm z), Hz (x, y, z) = Hnm wnm (x, y)exp(−γnm z) Ez (x, y, z) = n,m

n,m

where

√ wnm (x, y) = (2/ ab)cos(nπx/a)cos(mπy/b)

Formally, we can write

Ez = (∇2⊥ + h2 )−1 (γ/jω�)(∇⊥ .J⊥ ) − − − (5) Hz = (∇2⊥ + h2 )−1 (Jx,y − Jy,x ) − − − (6)

and express Ex , Ey , Hx , Hy in terms of J using eqns. (1)-(6). Finally, these expressions are substituted into the linearized Boltzmann eqn. (b) by making use of (a). The result is a linear integro-partial diﬀerential equation for the Boltzmann particle distribution function f (ω, x, y, v) with γ as a parameter. The solutions of this equation will generally lead to discrete values of the propagation constant γ. A.24. Winding number of planar Brownian motion. Let Zt = Xt + iYt be a complex Brownian motion, ie, X, Y are independent real standard Brownian motion processes. We write ρt = Xt2 + Yt2 = |Zt |, θt = T an−1 (Yt /Xt ) = Arg(Zt ) Then,

log(ρt ) = RelogZt , θt = Imlog(Zt )

Away from the origin, z → logz is an analytic function of a complex variable and hence its Laplacian vanishes. Thus, from Ito’s formula, dlog(Zt ) = dZt /Zt and hence log(Zt ) is a Martingale. Writing log(z) = u(x, y) + iv(x, y), z = x + iy it follows that u(Xt , Yt ) and v(Xt , Yt ) are both real Martingales. We have u(Xt , Yt ) = log(ρt ), v(Xt , Yt ) = θt Further, the quadratic variation of the Martingales u(Xt , Yt ) and v(Xt , Yt ) are the same processes. They are equal to

0

t

|∇u(Xs , Ys )|2 ds =

0

t

|∇v(Xs , Ys )|2 ds =

0

t

ds/|Zs |2

Quantum Mechanics for Scientists and Engineers 146

149

where we make use of the Cauchy-Riemann equations, u,x = v,y , u,y = −v,x and of course

u,x + iu,y = u,x − iv,x , v,x + iv,y = −u,y + iv,y

while on the other hand, writing f (z) = logz = u + iv, we have

f � (z) = u,x + iv,x , if � (z) = u,y + iv,y Thus, Thus, writing

|f � (z)|2 = |∇u|2 = |∇v|2 = 1/|z|2 = 1/(x2 + y 2 ) = 1/ρ2 C(t) =

t

ds/|Zs |2 =

0

t

0

ds/ρ2s

it follows that there exists a planar Brownian motion β(t) + iγ(t) such that log(ρt ) + iθt = logZt = β(C(t)) + iγ(C(t)) so that

β(C(t)) = log(ρt ), γ(C(t)) = θt

In fact, if τ () is the inverse function of C, then we have that u(Xτ (t) , Yτ (t) ) and v(Xτ (t) , Yτ (t) ) are independent Brownian motion processes which we denote by β(t) and γ(t) respectively. We have f (Zτ (t) ) = β(t) + iγ(t) and

df (Zτ (t) ) = f � (Zτ (t) )dZτ (t)

(since (dZ) = 0). Thus, 2

(df (Zτ (t) ))2 = f � (Zτ (t) )2 (dZτ (t) )2 = 0

which proves independence of the Brownian motions β and γ. Note that we can write β(t) = log(ρτ (t) ), γ(t) = θτ (t) Remark: The real parts β(t) and γ(t) of f (Zτ (t) ) are continuous martingales with quadratic variation and imaginary dt 0 matrix equal to = dt.I2 which proves by Levy’s theorem for vector valued Martingales that these two 0 dt processes are independent standard Brownian motion processes. Now, the angle turned by the planar Brownian motion process Zt around the origin respectively inside and outside a circle of radius r are t t θr− (t) = Im χ|Zs |r dlog(Zs ) 0

0

and these can be expressed as θr− (t) = = Im θr+ (t) =

t

χρs r dZs /Zs

C(t)

χβ(s)log(r) dγ(s)

Quantum Mechanics for Scientists and Engineers 147

150

Now deﬁne

Ta = min(t ≥ 0 : β(t) = a)

and

σr = min(t ≥ 0 : ρt = r)

Then, Thus,

σr = min(t : log(ρt ) = log(r)) = min(t : β(C(t)) = log(r)) C(σr ) = min(C(t) : β(C(t)) = log(r)) = min(t : beta(t) = log(r)) = Tlog(r)

Now for a set E in C, consider

I(t) =

t

σ√ t

We write

Zt = a + Zˆt , a �= 0

We assume that Z0 = a so that Zˆ0 = 0. We have I(t) =

t

χZˆs ∈E−1 dZˆs /(a + Zˆs )

σ√t

Now, the process Zˆts , s ≥ 0 has the same law as ˜ = I(t)

√

tZˆs , s ≥ 0 and hence I(t) has the same distribution as

1

� t−1 σ√

χZ(s)∈E dZs /Zs

√ √ χZˆs ∈(E−1)/√t tdZˆs /(a + tZˆs )

t

√ where σr� is the same as σr but with the process a + tZˆs/t , s ≥ 0 used in place of Zˆs , s ≥ 0 (Note that these two processes have the same law). Now √ √ � t−1 σ√ = min(s/t : |a + tZˆs/t | = t) t √ √ = min(s : |a + tZˆs | = t) √ = min(s : |a/ t + Zˆs | = 1)

� converges as t → ∞ to the random variable min(s : |Zˆs | = 1) = σ say. It follows on This gives the result that t−1 σ√ t taking lim t → ∞ that the limit in law of I(t) as t → ∞ is given by the random variable

1

σ

where

χZˆs ∈E0 dZˆs /Zˆs

√ E0 = limt→∞ (E − 1)/ t

which is a √ ﬁnite random variable since dZˆs /Zˆs = dlog(Zˆs ) and σ equals the limit as t → ∞ of the random variable min(s : |1/ t + Zˆs | = 1). In particular, we get that limt→∞ |θr− (t) − θr− (σ√t )|/logt = 0 and likewise for θr+ . Thus, it follows that if limt→∞ θr− (σ√t )/log(t) exists in law, then this limit coincides in law with the limit in law of θr− (t)/log(t) and likewise for θr+ (t). Now, θr− (σ√t ) = =

C(σ√t )

χβ(s)

where the expected value is taken in the ground state of HF , ie vacuum state |0 > in which each an has eigenvalue zero. T is the time ordering operator. We can write G(t, r, t� , r� ) = θ(t − t� ) < ψ(t, r)ψ(t� , r� )∗ > +θ(t� − t) < ψ(t� , r� )∗ ψ(t, r) >= Now we note that < ψ(t, r)ψ(t� , r� )∗ >=

n,m

cn c¯m exp(−i(En t − Em t� ))un (r)¯ um (r� ) < an a∗m >

and since

< an a∗m >= δn,m

(since an a∗m + a∗m an = δn,m and an |0 >= 0), it follows that exp(−iEn (t − t� )un (r)¯ um (r� ) < ψ(t, r)ψ(t� , r� )∗ >= n

We note that

< ψ(t� , r� )∗ ψ(t, r) >= 0

since ψ(t, r)|0 >= 0. Thus, G(t, r, t� , r� ) = θ(t − t� ) where We then get

n

exp(−iEn (t − t� ))un (r)¯ un (r� ) = G0 (t − t� , r, r� )

G0 (t, r, r� ) = G(t, r, 0, r� )

R

G0 (t, r, r� )exp(i(ω + i�)t)dt =

un (r)¯ un (r� ) i(ω − E n + i�) n

Quantum Mechanics for Scientists and Engineers 156

We have

159

G,t (t, r, t� , r� ) = δ(t − t� ) < ψ(t, r)ψ(t, r� )∗ > +θ(t − t� ) < ψ,t (t, r)ψ(t� , r� )∗ > = δ(t − t� )δ 3 (r − r� ) − iθ(t − t� ) < H(r)ψ(t, r)ψ(t� , r� )∗ > = δ(t − t� )δ 3 (r − r� ) − iH(r)G(t, r, t� , r� )

A.28.Design of quantum gates using scattering theory. Let H0 , H be two self-adjoint operators in the same Hilbert space H. We deﬁne W (t) = exp(itH)exp(−itH0 ), t ∈ R The limits

Ω+ = slimt→∞ W (t), Ω− = slimt→−∞ W (t)

may exist on diﬀerent domains. Let D+ be the domain of Ω+ and D− that of Ω− . Then Ω∗+ : H → D+ is deﬁned and hence S = Ω∗+ Ω− : D− → D+ is deﬁned. It is clear that W (t) is a unitary operator on H and hence

< f |Ω∗+ Ω+ |g >=< Ω+ f |Ω+ g >= limt→∞ < W (t)f |W (t)g >=< f |g >, f, g ∈ D+ Thus Ω+ is unitary when restricted to D+ . Thus, Ω+ Ω∗+ : H → H is an orthogonal projection of H onto D+ . We write V = H − H0 and then ﬁnd that t t W (t) − I = W � (s)ds == i exp(isH)V.exp(−isH0 )ds 0

0

In particular,

Ω+ = I + i

∞

exp(isH)V.exp(−isH0 )ds

0

Let E0 be the spectral measure of H0 and E that of H. We have ∞ d/dt(exp(itH0 ).exp(−itH))dt = I − i Ω∗+ = I + i 0

exp(itH0 )V.exp(−itH)dt

0

Thus, I = Ω∗+ Ω+ = Ω+ − i Now,

∞

∞

exp(itH0 )V.exp(−itH)Ω+ dt

0

exp(−itH)Ω+ = lims→∞ exp(i(s − t)H)exp(−isH0 ) = lims→∞ exp(isH).exp(−i(s + t)H0 ) = Ω+ exp(−itH0 )

Thus, I = Ω+ − i or equivalently, Ω+ = I + i

∞

[0,∞)×R

=I−

exp(itH0 )V Ω+ exp(−itH0 )dt

0

R

exp(it(H0 − λ + i�))V Ω+ dtE0 (dΛ)

(H0 − λ + i�)−1 V Ω+ E0 (dλ)

The right side integral is to be interpreted as the limit of � → 0+. This is the rigorous statement of the ﬁrst LippmanSchwinger equation in scattering theory. Roughly speaking, if |φ+ > is an output free particle state corresponding to an ”eigenfunction” of H0 with ”eigenvalue” λ and |ψ+ >= Ω+ |φ+ > is the corresponding output scattered state, which is an ”eigenfunction” of H with the same eigenvalue λ (energy is conserved during the scattering process), then we have |ψ+ >= |φ+ > −(H0 − λ + i�)−1 V |ψ+ >

Quantum Mechanics for Scientists and Engineers 157

160

In a similar way, we get that if |φ− > is an input free particle state corresponding to an energy eigenfunction of H0 with eigenvalue λ and |ψ− >= Ω− |φ− > the corresponding input scattered state corresponding to an eigenfunction of H with eigenvalue λ, then we get the second Lippman-Schwinger equation |ψ− >= |φ− > −(H0 − λ − i�)−1 V |ψ− > This is seen as follows: Ω∗− = limt→−inf ty exp(itH0 )exp(−itH) = I + i =I +i

∞

0

exp(itH0 )V.exp(−itH)dt

−∞

exp(−itH0 )V.exp(itH)dt

0

Thus,

I = Ω∗− Ω− = Ω− + i

= Ω− + i or equivalently, Ω− = I − i

∞

0

∞

0

=I−

∞

0

exp(−itH0 )V.exp(itH)Ω− dt

exp(−itH0 )V Ω− .exp(itH0 )dt

exp(−it(H0 − λ − i�))V Ω− dtE0 (dλ) (H0 − λ − i�)−1 V Ω− E0 (dλ)

R

which gives the second Lippman-Schwinger equation. Now, we derive an explicit form of the scattering matrix. We deﬁne R0 (λ) = (H0 − λ)−1 , R(λ) = (H − λ)−1 respectively for λ belonging to the resolvent set of H0 and of H. We have

(I − i

S = Ω∗+ Ω− = ∞

exp(itH0 )V.exp(−itH)dt)Ω−

0

= Ω− − i

= Ω− +

∞

0

R

(H0 − λ + i�)−1 V.Ω− E0 (dλ)

= Ω− + Now,

=I −i =I−

R

Ω− = I − i

exp(itH0 )V.Ω− exp(−itH0 )dt

R0 (λ − i�)V Ω− E0 (dλ)

0

exp(itH).V.exp(−itH0 )dt

−∞

∞

exp(−itH)V exp(itH0 )dt

0

×R

(H − λ − iδ)−1 V E0 (dλ)

Substituting this into the previous expression gives R0 (λ − i�)V E0 (dλ) − R0 (λ − i�)V R(λ + iδ)V E0 (dλ) S − I = − R(λ + iδ)V E0 (dλ) + R

R

A.29. Perturbed Einstein ﬁeld equations with electromagnetic interactions. The unperturbed metric is g00 = 1, grs = −S(t)2 δrs , g0r = 0,

Quantum Mechanics for Scientists and Engineers 158

161

A small change in the coordinate system can be made such that the ﬁrst order perturbations in the metric δgμν satisﬁes δg0μ = 0. The unperturbed energy-momentum tensor of the matter ﬁeld is Tμν = (ρ(t) + p(t))Vμ Vν − p(t)gμν where Vr = 0, V0 = 1 (This is possible because the above unperturbed metric satisﬁes the comoving condition, ie, particles with Vr = 0 satisfy the geodesic equation). Thus, T00 = ρ(t), Trs = p(t)S 2 (t)δrs , g0r = 0 We also deﬁne the tensor We have so that since δg00 = 0,

Sμν = Tμν − T gμν /2, T = g μν Tμν = ρ + p − 4p = ρ − 3p δSμν = δTμν − T δgμν /2 − gμν δT /2 δS00 = δT00 − δT /2 = δρ − δρ/2 + 3δp/2 = (3δp + δρ)/2

Note that δρ, δp, δvμ , δgμν are in general functions of space and time, ie, of x = (t, r). δSrs = δTrs − T δgrs /2 − grs δT /2 = −δpgrs − (ρ − 3p)δgrs /2 − grs (δρ − 3δp)/2 = (3p − ρ)δgrs /2 + S 2 (δρ − δp)δrs /2

Note that we have the sequence of implications

vμ v μ = 1, Vμ δv μ = 0, δv 0 = 0 More precisely, gives which implies

δ(gμν v μ v ν ) = 0 V μ V ν δgμν + gμν (V μ δv ν + V ν δv μ ) = 0 δg00 + 2δv 0 = 0

and since by the coordinate condition, δg00 = 0, it follows that δv 0 = 0. Hence, δv0 = (δg0μ v μ ) = δg0μ V μ + g0μ δv μ = 0 which gives

δv0 = 0

since by the coordinate condition, δg0μ = 0 and g0r = 0. So δT00 = (p + ρ)(2V0 δv0 ) + V02 (δp + δρ) − pδg00 − g00 δp = δρ Further,

δSr0 = δS0r = (ρ + p)δvr

since δg0r = δgr0 = 0. Finally, we need to compute δRμν and equate this to K.δSμν . We have α δRμν = (δΓα μα ):ν − (δΓμν ):α

(δΓα μα ):ν = α (δΓα μν ):β = (δΓμν ),β ρ ρ α +Γα ρβ δΓμν − Γμβ δΓρν

Thus,

−Γρνβ δΓα ρμ (δΓα μν ):α

Quantum Mechanics for Scientists and Engineers 159

162

= (δΓα μν ),α ρ +Γα ρα δΓμν

− 2Γρμα δΓα ρν

Thus ﬁnal form of the ﬁrst order perturbation to the Ricci tensor is given by δRμν = ρ α (δΓα μα ),ν − Γμν δΓρα

−(δΓα μν ),α

ρ ρ α −Γα ρα δΓμν + 2Γμα δΓρν

The perturbed Einstein ﬁeld equations have the general form

A1 (μν, rs, t)δgrs (t, r) + A2 (μν, rsl, t)δgrs,l (t, r) + A3 (μν, rslp, t)δgrs,lp (t, r) +A4 (μν, rs, t)δgrs,0 (t, r) + A5 (μν, rsl, t)δgrs,l0 (t, r) + A6 (μν, rs, t)δgrs,00 (t, r) +A7 (μν, t)δρ(t, r) + A8 (μν, r, t)δvr (t, r) = KδSμν (t, r) where Sμν (t, r) is the energy-momentum tensor of the electromagnetic ﬁeld. We have, Sμν = (−1/4)Fαβ F αβ gμν + Fμα Fνα √ Fμν = Aν,μ − Aμ,ν , (F μν −g),ν = 0

F0r = Ar,0 − A0,r = −(S 2 (t)Ar ),0 − A0,r Frs = As,r − Ar,s = S 2 (t)(Ar,s − As,r )

The gauge condition is

√ (Aμ −g),μ = 0

which reads

(A0 S 3 (t)),0 + S 3 (t)Ar,r = 0

The unperturbed Maxwell equations are F,r0r = 0, (F r0 S 3 ),0 + S 3 F,srs = 0 F 0r = g 00 g rr F0r = (1/S 2 )((S 2 Ar ),0 − A0,r )

and so the ﬁrst Maxwell equation gives

(S 2 Ar ),0r − ∇2 A0 = 0

or which reads on using the gauge condition,

(S 2 )� Ar,r + S 2 Ar,0r − ∇2 A0 = 0

−(S 2 )� S −3 (A0 S 3 ),0 − S 2 (S −3 (A0 S 3 ),0 ),0 − ∇2 A0 = 0 and the second Maxwell equation gives −S(As,r − Ar,s ),s − ((A0,r + Ar,0 )S),0 = 0 which simpliﬁes to

S∇2 Ar − SAs,rs − (S(A0,r + Ar,0 )),0 = 0

and this becomes on using the gauge condition,

S∇2 Ar + S −2 (A0 S 3 ),0r − (SA0,r ),0 − (SAr,0 ),0 = 0 or

(∇2 Ar − Ar,00 ) + S −3 (A0 S 3 ),0r − S −1 (SA0,r ),0 − S −1 S � Ar,0 = 0

Quantum Mechanics for Scientists and Engineers 160

163

Since the radiation ﬁeld is homogeneous and isotropic, we assume that the vector potential has mean zero and statistical correlations of the form < Ar (t, r)As (t� , r� ) >= P (t, t� , r − r� )δrs We ﬁnd that since

(A0 S 3 (t)),0 + S 3 (t)Ar,r = 0

we have

A0,0 (t, r)S 3 (t) + 3S 2 (t)S � (t)A0 (t, r) + S 3 (t)Ar,r (t, r) = 0

and hence we can assume that A0 = 0 provided that we admit the Coulomb gauge constraint Ar,r (t, r) = 0 which gives

∂P (t, t� , r) =0 ∂xs � and so we can assume that P (t, t , r) is independent of the spatial vector r = (xs ). Thus, we can write < Ar (t, r)As (t� , r� ) >= P (t, t� )δrs

Now, the Maxwell equation is automatically satisﬁed since A = 0 and 0

Ar,r

(S 2 Ar ),0r − ∇2 A0 = 0

= 0. The second Maxwell equation

(∇2 Ar − Ar,00 ) + S −3 (A0 S 3 ),0r − S −1 (SA0,r ),0 − S −1 S � Ar,0 = 0 gives using A0 = 0,

∇2 Ar − Ar,00 − S −1 S � Ar,0 = 0

and hence taking correlations with As (t� , r� ), we get

∇2 P (t, t� ) − P,tt (t, t� ) − S −1 (t)S � (t)P,t (t, t� ) = 0 and likewise, with t and t� interchanged. We assume that we have solved this equation to obtain P (t, t� ). Now, the average energy-momentum tensor of the radiation ﬁeld before the metric has been perturbed by δgμν (t, r) is given by Sμν = (−1/4) < Fαβ F αβ > gμν + < Fμα Fνα > Now,

Fαβ F αβ = 2F0r F 0r + Frs F rs = −2S −2 (Ar,0 − A0,r )2 + S −4 (As,r − Ar,s )2

= −2S −2 ((−S 2 Ar ),0 )2 − A0,r )2 + (As,r − Ar,s )2 = 2S −2 ((S 2 Ar ),0 )2 + (As,r − Ar,s )2 and its average value is

= 2S 2 (Ar,0 )2 + 2S −2 ((S 2 )� )2 (Ar )2 + (As,r − Ar,s )2 < Fαβ F αβ >= 6S P,tt (t, t� ) + 2S −2 ((S 2 )� )2 P (t, t� ) 2

since P does not depend on the spatial coordinates. Note that by P,tt (t, t), we mean that P,tt (t, t) = limt →t P,tt (t, t� ) Thus,

< Fαβ F αβ >= 6S 2 (t)P,tt (t, t) + 8(S � )2 P (t, t� )

We next compute < Fμα Fνα >. For μ = ν = 0 this is < F0r F0r >= −S −2 < (Ar,0 )2 >= −S 2 < (Ar,0 )2 >= −3S 2 (t)P,tt (t, t)

Quantum Mechanics for Scientists and Engineers 161

164

For μ = r, ν = 0, we get < Frα F0α >=
= −S

−2

< (As,r − Ar,s )As,0 >= −S 2 < (As,r − Ar,s )As,0 >= 0

since P is independent of the spatial coordinates. Thus,

< S00 >= (−3/2)S 2 (t)P,tt (t, t) − 2(S � )2 P (t, t� ) − 3S 2 P,tt (t, t) = (−9/2)S 2 (t)P,tt (t, t) − 2S 2 P (t, t� ) < Sr0 >=< S0r >= 0 Finally,

< Srs >= (3/2)S 4 (t)P,tt (t, t)δrs + < Fr0 Fs0 + Frm Fsm > < Fr0 Fs0 >=< Ar,0 As,0 >= S 4 (t) < Ar,0 As,0 >= S 4 (t)P,tt (t, t)δrs < Frm Fsm >= (−1/S 2 ) < Frm Fsm >= (−1/S 2 ) < (Am,r − Ar,m )(Am,s − As,m ) >= 0

since P is independent of the spatial coordinates. Thus,

< Srs >= (−1/2)S 4 (t)P,tt (t, t)δrs The perturbed Maxwell equations and the perturbed energy-momentum tensor of the electromagnetic ﬁeld: The perturbed Maxwell equations are √ (δ(F μν −g),ν = 0

The perturbed gauge condition is

√ (δ(Aμ −g)),μ = 0

which gives

√ (S 3 (t)δAμ + Aμ δ −g),μ = 0

If we assume δA0 = 0, then this gauge condition gives

√ S 3 (δAr ),r + (Ar δ −g),r = 0 Now,

δg = g(−S −2 )δgrr

(Note that our coordinate system is chosen so that δg0μ = 0. Thus, we get √ √ δ −g = −δg/2 −g = −Sδgrr /2 Thus the gauge condition with δA0 = 0 gives S 2 (δAr ),r − Ar δgss,r /2 = 0 where we have used Ar,r = 0 ,ie, the Coulomb gauge for the unperturbed em ﬁeld. This gives us the following gauge condition for the perturbed em ﬁeld (δAr ),r = S −2 Ar δgss,r /2 We now write down the ﬁrst perturbed Maxwell equation using δA0 = 0 as S 2 δF,r0r − (F 0r δgss /2),r = 0 or which is since Ar,r = 0, the same as

S 2 δAr,0r − (Ar,0 δgss /2),r = 0 S 2 δAr,0r − Ar,0 δgss,r /2 = 0

Substituting for δAr,r from the perturbed gauge condition, we get

S 2 (S −2 Ar δgss,r ),0 − Ar,0 δgss,r = 0 or equivalently

Ar (S −2 δgss,r ),0 = 0

Quantum Mechanics for Scientists and Engineers 162

165

and this condition is impossible to fulﬁll in general. So we cannot assume the Coulomb condition δA0 = 0 for the perturbed situation. Remark: We have assumed that A0 = 0 and hence that the gauge condition reads Ar,r = 0. We have also assumed that < Ar (t, r)As (t , s) >= P (t, t , r − r )δrs and have hence derived the condition that P (t, t , r) is independent of r. This is rather unrealistic since P being independent of r implies that the vector potentials between all the spatial points have the same correlation. A more realistic assumption would be to take < Ar (t, r)As (t , s) >= P (t, t )f r (r)f s (r ) and hence derive from the gauge condition that

r divf = f,r =0

A more general approach: Assume that the unperturbed metric is gμν (x) and the unperturbed vector potential is Aμ (x). Their perturbed versions are gμν (x) + δgμν (x) and Aμ (x) + δAμ (x) respectively. The unperturbed velocity ﬁeld of matter is v μ (x) and it gets perturbed by δv μ (x). Likewise the unperturbed density and pressure are ρ(x) and p(x) and they get perturbed by δρ(x) and δp(x) respectively. The unperturbed four vector potential correlations are Lμν (x, x ) =< Aμ (x)Aν (x ) > The Einstein ﬁeld equations are

Rμν = K(SM μν + SEμν ))

where K = −8πG, where so that

SM μν = TM μν − TM gμν /2 TM μν = (ρ + p)vμ vν − pgμν , TM = g μν TM μν = ρ − 3p SM μν = (ρ + p)vμ vν − (p + ρ)gμν /2

and

SEμν = (−1/4)Fαβ F αβ gμν + Fμα Fνβ g αβ < SEμν >= gμν g αρ g βσ < Fαβ Fρσ > +g αβ < Fμα Fνβ >

Now,

< Fμα Fνβ >=< (Aα,μ − Aμ,α )(Aβ,nu − Aν,β ) >

=< (gαρ Aρ ),μ − (gμρ Aρ ),α )((gβσ Aσ ),ν − (gνσ Aσ ),β ) > = (gαρ,μ − gμρ,α )(gβσ,ν − gνσ,β )Lρσ (x, x)

+(gαρ gβσ < Aρ,μ Aσ,ν > +gαρ gβσ,ν < Aρ,μ Aσ > +(gαρ,μ gβσ < Aρ Aσ,ν > −gαρ gνσ,β < Aρ,μ Aσ > −gαρ,μ gνσ < Aρ Aσ,β > −gαρ gνσ < Aρ,μ Aσ,β > −gμρ gβσ < Aρ,α Aσ,ν >

−gμρ,α gβσ < Aρ Aσ,ν > −gμρ gβσ,ν < Aρ,α Aσ >

+gμρ gνσ < Aρ,α Aσ,β > +gμρ,α gνσ < Aρ Aσ,β > +gμρ gνσ,β < Aρ,α Aσ > Using this formula, we can express < SEμν (x) >= C1 (μναβ, x) < Aα (x)Aβ (x) > +C2 (μναβρ, x) < Aα (x)Aβ,ρ (x) > β +C3 (μναβρσ, x) < Aα ,ρ (x)A,σ (x) >

where the functions Ck are constructed using the unperturbed metric gμν (x) and its ﬁrst order partial derivatives gμν,α (x). The perturbation δ < SEμν (x) > of the average Maxwell energy momentum tensor is to be evaluated in terms of the metric perturbations. For evaluating this, we have to ﬁrst express using the Maxwell equations, the perturbation δAμ (x) of the electromagnetic four vector potential in terms of the unperturbed potentials Aμ (x) and the metric perturbations δgμν (x). The Maxwell equations are √ (F μν −g),ν = 0

Quantum Mechanics for Scientists and Engineers 163

166

and its perturbation is We have

√ √ ( −gδF μν + F μν δ −g),ν = 0 √ √ √ √ δ −g = −δg/2 −g = −gg μν δgμν /2 −g = ( −g/2)g μν δgμν δF μν = δ(g μα g νβ Fαβ )

= δ(g μα g νβ )Fαβ + g μα g νβ δFαβ Now,

δ(g μα g νβ ) = −g μα g νρ g βσ δgρσ −g νβ g μρ g ασ δgρσ

δFαβ = δ(Aβ,α − Aα,β ) = (δAβ ),α − (δAα ),β =

δAβ = δ(gβmu Aμ ) = Aμ δgβμ + gβμ δAμ The perturbation of the gauge condition is given by Now

√ (Aμ −g),μ = 0 √ √ ( −gδAμ ),μ + (Aμ δ −g),μ = 0

√ √ √ √ δ −g = −δg/2 −g = −gg μν δgμν /2 −g = −gg μν δgμν /2

Combining all these equations, it follows that δAμ satisﬁes an equation of the form

D1 (μνρσ, x)δAν,ρσ (x) + D2 (μνρ, x)δAν,ρ (x) + D3 (μνραβ, x)Anu ,ρ (x)δgαβ (x) +D4 (μναβ, x)Aν (x)δgαβ (x)+ D5 (μνραβσ, x)Aν,ρ (x)δgαβ,σ (x) +D6 (μναβσ, x)Aν (x)δgαβ,σ (x) = 0 This equation can formally be solved to give δAμ (x) = M (μν, αβ, x, x� , x�� )Aν (x� )δgαβ (x�� )d4 x� d4 x�� The perturbation to δ < SEμν (x) >=< δSEμν (x) > to the average energy momentum tensor of the electromagnetic ﬁeld can be expressed as follows: δSEμν (x) = (−1/4)δ(Fαβ F αβ gμν ) + δ(Fμα Fνβ g αβ ) = (E1 (μνρσαβ)(x)Aρ,σ (x) + E2 (μνραβ, x)Aρ (x))δAα ,β (x) +(E3 (μνρσα)(x)Aρ,σ (x) + E4 (μνρα, x)Aρ (x))δAα (x) +(E5 (μνρδαβσ, x)Aδ (x)Aρ (x) + E6 (μνργδαβσ, x)Aδ (x)Aρ,γ (x))δgαβ,σ (x) +(E7 (μνρδχαβ, x)Aδ,χ Aρ (x) + E8 (μνρδχγαβ, x)Aδ,χ (x)Aρ,γ (x))δgαβ (x)

A.30. BCS theory of superconductivity. The second quantized Hamiltonian is given by K = ψa (x)∗ (−∇2 /2m + μ)ψa (x)d3 x + V (x, y)ψa (x)∗ ψa (x)ψb (y)∗ ψb (y)d3 xd3 y with the Fermionic ﬁelds ψa satisfying the anticommutation relations {ψa (x), ψb (y)∗ } = δab δ 3 (x − y), {ψa (x), ψb (y)} = 0

Quantum Mechanics for Scientists and Engineers 164

We deﬁne

167

ψa (tx) = exp(itK).ψa (x).exp(−itK)

Then,

ψa,t (tx) = iexp(itK)[K, ψa (x)]exp(−itK)

The Green’s function is deﬁned as Gab (t, x|s, y) =< T (ψa (tx)ψb (s, y)∗ ) >= T r(ρG T (ψa (tx)ψb (sy)∗ )) where

ρG = exp(−βK)/T r(exp(−βK))

We observe that for t > s, Gab (tx|sy) = T r(exp(−βK).exp(itK)ψa (x)exp(i(s − t)K)ψb (y)∗ .exp(−isK)) = T r(exp((i(t − s) − β)K)ψa (x)exp(i(s − t)K))ψb (y)∗ )

We make the approximation of replacing by

V (x, y)ψa (x)∗ ψa (x)ψb (y)∗ ψb (y)d3 xd3 y

V (x, y)(2 < ψa (y)∗ ψa (y) > ψb (x)∗ ψb (x)d3 xd3 y + +

V (x, x)ψa (x)∗ ψa (x)d3 x +

V (x, y) < ψa (x)ψb (y)∗ > ψa (x)∗ ψb (y)d3 y

V (x, y) < ψa (x)ψb (y) > ψa (x)∗ ψb (y)∗ d3 xd3 y

plus other quadratic terms. We can write down the general form of the above approximation to the potential energy integral as F1 (x, y) < ψb (y)∗ ψb (y) > ψa (x)∗ ψa (x)d3 xd3 y + + +

F4 (x, y) < ψb (y)∗ ψa (x)∗ > ψa (x)ψb (y)d3 xd3 y = V1 + V2 + V3 + V4

[V1 , ψc (x� )] =

F3 (x, y) < ψb (y)ψa (x) > ψa (x)∗ ψb (y)∗ d3 xd3 y

say. We have

=−

F2 (x, y) < ψb (x)∗ ψa (y) > ψa (y)∗ ψb (x)d3 xd3 y

F1 (x, y) < ψb (y)∗ ψb (y) > [ψa (x)∗ ψa (x), ψc (x� )]d3 xd3 y �

F1 (x, y) < ψb (y)∗ ψb (y) > δac δ 3 (x − x� )ψa (x)d3 xd3 y = −(

[V2 , ψc (x� )] = =−

F2 (x, y) < ψb (x)∗ ψa (y) > [ψa (y)∗ ψb (x), ψc (x� )]d3 xd3 y

F2 (x, y) < ψb (x)∗ ψa (y) > δac δ 3 (y − x� )ψb (x)d3 xd3 y

= −(

F1 (x� , y) < ψb (y)∗ ψb (y) > d3 y)ψc (x� )

F2 (x, x� ) < ψb (x)∗ ψa (x� ) > ψb (x)d3 x [V3 , ψc (x� )] =

F3 (x, y) < ψb (y)ψa (x) > [ψa (x)∗ ψb (y)∗ , ψc (x� )]d3 xd3 y

Quantum Mechanics for Scientists and Engineers 165

168

Now,

ψa (x)∗ ψb (y)∗ ψc (x� ) = ψa (x)∗ (δbc δ 3 (y − x� ) − ψc (x� )ψb (y)∗ )

= δbc δ 3 (y − x� )ψa (x)∗ − (δac δ 3 (x − x� ) − ψc (x� )ψa (x)∗ )ψb (y)∗

= δbc δ 3 (y − x� )ψa (x)∗ − δac δ 3 (x − x� )ψb (y)∗ + ψc (x� )ψa (x)∗ ψb (y)∗

Thus,

[V3 , ψc (x� )] = = Finally,

F3 (x, y) < ψb (y)ψa (x) > (δbc δ 3 (y − x� )ψa (x)∗ − δac δ 3 (x − x� )ψb (y)∗ )d3 xd3 y

F3 (x, x� ) < ψc (x� )ψa (x) > ψa (x)∗ d3 x − [V4 , ψc (x� )] =

F3 (x� , y) < ψb (y)ψc (x� ) > ψb (y)∗ d3 y

F4 (x, y) < ψb (y)∗ ψa (x)∗ > [ψa (x)ψb (y), ψc (x� )]d3 xd3 y = 0

A.31. Maxwell’s equations in the closed isotropic model of the universe. The metric is dτ 2 = dt2 − f 2 (r)S 2 (t)dr2 − r2 S 2 (t)(dθ2 + sin2 (θ)dφ2 ) where

f (r) = (1 − kr2 )−1/2

Thus,

g00 = 1, g11 = −f 2 (r)S 2 (t), g22 = −r2 S 2 (t), g33 = −r2 S 2 (t)sin2 (θ) √ −g = r2 f (r)S 3 (t)sin(θ)

Assume A0 = 0. The gauge condition then gives

√ (Aμ −g),μ = 0

sin(θ)(A1 r2 f ),1 + r2 f (A2 sin(θ)),2 + r2 f sin(θ)A3,3 = 0

or equivalently,

r−2 (r2 f A1 ),1 + (sin(θ))−1 (A2 sin(θ)),2 + A3,3 = 0

The Maxwell equation under the assumption A0 = 0 gives or equivalently,

√ (F 0k −g),k = 0 √ (g kk Ak,0 −g),k = 0 √ (g kk −g(gkk Ak ),0 ),k = 0

or equivalently,

(r2 f sin(θ)(S 2 Ak ),0 ),k = 0

The above two equations are not compatible. So we assume that A0 �= 0 and then the above two equations, ie, the gauge condition and Gauss’ law respectively become sin(θ)r2 f (S 3 A0 ),0 + sin(θ)(A1 r2 f ),1 + r2 f (A2 sin(θ)),2 + r2 f sin(θ)A3,3 = 0 and

This further simpliﬁes to

√ 0 = (F 0k −g),k √ = (g kk F0k −g),k = √ (g kk (Ak,0 − A0,k ) −g),k = √ √ (g kk (gkk Ak ),0 −g),k − (g kk A0,k −g),k ((S 2 Ak ),0 r2 f sin(θ)),k − S 2 (g kk A0,k r2 f sin(θ)),k = 0

Quantum Mechanics for Scientists and Engineers 166

The Maxwell equation gives taking r = 1,

169

√ √ (F rs −g),s + (F r0 −g),0 = 0 √ √ √ (F 12 −g),2 + (F 13 −g),3 + (F 10 −g),0 = 0

and likewise for r = 2, 3. This equation can be expressed as √ √ √ 0 = (F 12 −g),2 + (F 13 −g),3 + (F 10 −g),0 √ √ = (g 11 g 22 (A2,1 − A1,2 ) −g),2 + (g 11 g 33 (A3,1 − A1,3 ) −g),3 √ +(g 11 (A0,1 − A1,0 ) −g),0 = (g 11 g 22 ((g22 A2 ),1 − g 11 A1,2 ),2 + √ (g 11 g 33 (g33 A3 ),1 − (g11 A1 ),3 ) −g),3 √ +(g 11 (A0,1 − (g11 A1 ),0 ) −g),0

A.32. Intuitive proof of Cramer’s theorem for large deviations for iid random variables. Let Xi , i = 1, 2, ... be iid random variables with Eexp(λX1 ) = exp(Λ(λ)) Deﬁne Also deﬁne

I(x) = supλ∈R (λx − Λ(λ)) Sn =

n

Xi

i=1

Then, we have for any Borel set E and any λ ∈ R, the following: Let x ∈ E be arbitrary. Then, 1 = E[exp(λSn − nΛ(λ))] ≥ E[exp(λSn − nΛ(λ))χSn /n∈E ] ≥ E[infx∈E exp(nλx − nΛ(λ))χSn /n∈E ]

and therefore, or equivalently,

= infx∈E exp(n(λx − Λ(λ)))P (Sn /n ∈ E) n−1 .log(P (Sn /n ∈ E) + infx∈E (λx − Λ(λ)) ≤ 0 n−1 .log(P (Sn /n ∈ E) ≤ −infx∈E (λx − Λ(λ))

If we assume that for each x ∈ E the function λ → (λx − Λ(λ)) attains it supremum at some λ(x) ∈ R, then we can conclude that n−1 .log(P (Sn /n ∈ E) ≤ −infx∈E I(x) where

I(x) = supλ∈R (λx − Λ(λ)) = λ(x)x − Λ(λ(x))

This is called the Large deviations upper bound. To get the lower bound, we deﬁne a new probability measure P˜n by the prescription, P˜n = exp(λSn − nΛ(λ)).Pn where Pn is the distribution of (X1 , ..., Xn ), then we have 1 = P˜n (Rn ) = exp(λξ − nΛ(λ))dPn (ξ)

and we obtain on diﬀerentiating both sides of this equation w.r.t. λ, 0 = (ξ − nΛ� (λ))dP˜n (ξ)

Quantum Mechanics for Scientists and Engineers 167

170

and hence if η denotes the mean of X1 under P˜1 , then we have 0 = η − Λ� (λ) ie,

η = Λ� (λ)

˜ 1 (x) = exp(xλ − Λ(λ))dP1 (x). Thus, we get Note that X1 , ..., Xn under P˜n are iid random variables with distribution dP for λ > 0 and η > 0, Pn (E) = exp(−λξ + nΛ(λ))dP˜n (ξ) = E

EP˜ [exp(−n(λSn /n − Λ(λ)))χSn /n∈E ]

≥ EP˜ [exp(−n(λSn /n − Λ(λ)))χSn /n∈E χ|Sn /n−η|≤� ]

We are assuming that λ is a function of η determined by the condition Λ� (λ) = η. It follows that Pn (E) ≥ supη+�∈E exp(−n(λ(η + �) − Λ(λ)))P˜n (|{Sn /n ∈ E} {|Sn /n − η| ≤ �})

Now assume that E = (−∞, a]. By the law of large numbers for iid random variables, we have limn→∞ P˜n (|Sn /n − η| ≤ �) = 1

∀� > 0. Note that P˜ is the probability measure on RZ+ having marginals P˜n , n = 1, 2, .... If we further assume that η < a, the it follows by the law of large numbers that P˜n ({Sn /n ∈ E} ∩ {|Sn /n − η| < �) → 1 and hence We can write the above as

liminfn→∞ n−1 .log(Pn (E)) ≥ supη∈E (−λη + Λ(λ))

liminfn→∞ n−1 .log(Pn (E)) ≥ supη∈E (−λη + Λ(λ)) = −infη∈E I(η) A similar argument works for η > a. A.33. Current in the BCS theory of superconductivity: ψα (x) are the Fermionic ﬁeld operators. They satisfy the anticommutation rules {ψα (x), ψβ (y)∗ } = δαβ δ 3 (x − y), {ψα (x), ψβ (y)} = 0, {ψα (x)∗ , ψβ (y)∗ } = 0

The BCS Hamiltonian has the form H = (−1/2m) + + + +

ψα (x)∗ (∇ + ieA(x))2 ψα (x)d3 x + μ

ψα (x)∗ ψα (x)dx

V1 (x, y) < ψα (x)ψβ (y) > ψα (x)∗ ψβ (y)∗ dxdy V¯1 (x, y) < ψβ (y)∗ ψα (x)∗ > ψβ (y)ψα (x)dxdy V2 (x, y) < ψα (x)ψβ (x)∗ > ψα (y)∗ ψβ (y)dxdy V3 (x, y) < ψα (x)ψβ (y)∗ > ψα (x)∗ ψβ (y)dxdy

where for Hermitianity of H, we require that V¯2 (x, y) = V2 (x, y), V¯3 (x, y) = V3 (y, x)

Quantum Mechanics for Scientists and Engineers 168

171

The Gibbs density operator for this Hamiltonian is ρG = exp(−βH)/T r(exp(−βH)) and averages are deﬁned w.r.t this density, ie for any operator X constructed out of the quantum ﬁelds ψα (x), ψα (x)∗ , α = 1, 2, x ∈ R3 , we deﬁne < X >= T r(ρG X) We write T = (−1/2m) Then,

ψα (x)∗ (∇ + ieA(x))2 ψα (x)dx

[T, ψα (x)] = (1/2m)(∇ + ieA(x))2 ψα (x)

Also [

+ + + =

psiβ (y)∗ psiβ (y)d3 y, ψα (x)] = −ψα (x)

[V, ψγ (z)] = V1 (x, y) < ψα (x)ψβ (y) > [ψα (x)∗ ψβ (y)∗ , ψγ (z)]dxdy V¯1 (x, y) < ψβ (y)∗ ψα (x)∗ > [ψβ (y)ψα (x), ψγ (z)]dxdy V2 (x, y) < ψα (x)ψβ (x)∗ > [ψα (y)∗ ψβ (y), ψγ (z)]dxdy V3 (x, y) < ψα (x)ψβ (y)∗ > [ψα (x)∗ ψβ (y), ψγ (z)]dxdy

δ1αγ (x, z)V1 (x, z)ψα (x)∗ dx − +

ie,

Δ1γα (z, x)V1 (z, x)ψα (x)∗ dx

V2 (x, z)Δ2γα (x, x)ψα (z)dx

A.34. Some aspects of nonlinear ﬁltering theory. Let Xt be a Markov process with transition generator kernel Kt (x, y), E(f (Xt+dt )|Xt = x) − f (x) = dt Kt (x, y)f (y)dy + o(dt) = (Kt f )(x)dt + o(dt)

Assume that the measurement process is zt deﬁned by

dzt = ht (Xt )dt + dvt where vt is a Levy process, ie, an independent increment process with moment generating function given by Eexp(svt ) = exp(tψ(s)) Let Zt = σ(zs , s ≤ t), ie, the measurement process upto time t and for any function φ(x) deﬁned on the state space of the Markov process Xt , we deﬁne πt (φ) = E(φ(Xt )|Zt ) We can write a stochastic diﬀerential equation for the process πt (φ) as dπt (φ) = Ft (φ)dt +

∞

Gkt (φ)(dzt )k

k=1

where Ft (φ), Gkt (π) are Zt measurable functions. The processes Ft (φ) and Gkt (φ) need to be calculated. We deﬁne a process yt via the sde dyt = fk (t)(dzt )k yt , y0 = 1 k≥1

Quantum Mechanics for Scientists and Engineers 169

172

where the fk� s are arbitrary non-random functions. Then yt is a process adapted to Zt and we have the relation E[(φ(Xt ) − πt (φ))yt ] = 0 by the deﬁnition of conditional expectation. We write φt = πt (φ). It follows that E[(dφt − dπt (φ))yt ] + E[(φt − pit (φ))dyt ] + E[(dφt − dπt (φ))dyt ] = 0 by Ito’s formula. This equation can be expressed as E[(dφt − dπt (φ))yt ] + fk (t)E[(φt − πt (φ))(dzt )k yt ] + E[(dφt − dπt (φ))(dzt )k yt ] = 0 k≥1

k

and from the arbitrariness of the functions fk , we get E[(dφt − dπt (φ))|Zt ] = 0 − − − (1) We note that

E[(φt − πt (φ))(dzt )k |Zt ] + E[(dφt − dπt (φ))(dzt )k |Zt ] = 0, k ≥ 1 − − − (2) (dzt )k = (dvt )k , k ≥ 2

and hence writing

E[(dvt )k ] = μk dt

we get from (1) and (2), assuming μ1 = 0, Ft (φ) +

k≥1

μk Gkt (φ) = πt (Kt φ) − − − (3),

(πt (ht φ) − πt (φ)πt (ht )) + πt ((Kt φ)ht ) − Ft (φ)πt (ht ) − μk πt (Kt (φ)) −

r≥1

μr+1 Grt (φ)πt (ht ) = 0,

r≥1

μr+k Grt (φ) = 0, k ≥ 2

A.35. Some aspects of supersymmetry. Assume that xμ , μ = 0, 1, 2, 3 are the Bosonic spatial variables and θμ , μ = 0, 1, 2, 3 are the Fermionic anticommuting variables. A super ﬁeld S(x, θ) can be expanded as S(x, θ) = S0 (x) + S1μ (x)θμ + S2μν (x)θμ θν + S3μνρ (x)θμ θν θρ +S4 (x)θ1 θ2 θ3 θ4 Note that a product of more than four θ s is zero. The sum in each term is over the repeated variables and we may assume that the summation is over μ < ν < ρ < σ. We now need to introduce an inﬁnitesimal supersymmetry transformation as a super vector ﬁeld and describe the transformation laws of the component ﬁelds S0 , S1μ , S2μν , S3μνρ , S4 under such a supersymmetry transformation. The inﬁnitesimal super-symmetry transformation may be derived by assuming the following group conservation law for Bosonic and Fermionic variables (x, θ), x = (xμ ), θ = (θμ ): �

(xμ , θα ).(x� μ, θ α ) = (xμ + x μ + θT γ μ θ� , θα + θ alpha ) We express this equation as (x, θ).(x� , θ� ) = m((x, θ), (x� , θ� )) = (m1 ((x, θ), (x� , θ� )), m2 ((x, θ), (x� , θ� ))) where

m1 ((x, θ), (x� , θ� )) = (xμ + x μ + θT γ μ θ)

m2 ((x, θ), (x� , θ� )) = (θα + θ α ) Then to get left and right-invariant vector ﬁelds on the supermanifold described by Bosonic and Fermionic coordinates (x, θ), we deﬁne ∂mν1 ∂ ∂mν ∂ Dxμ = + μ2 ν μ ν ∂x ∂x ∂x ∂θ

Quantum Mechanics for Scientists and Engineers 170

173

∂ = ∂xμ ∂xμ ν ∂m1 ∂ ∂mν ∂ = + α2 ν α ν ∂θ ∂x ∂θ ∂θ =

Dθα

= (θT γ ν )α ∂xμ + ∂θα = (θT γ μ )α

∂ ∂ + α ∂xμ ∂θ

A.36. Comparison between the classical and quantum motions of a simple pendulum perturbed by white Gaussian noise. Classical case: θ�� (t) = −a.sin(θ(t)) + σw(t) w(t) = B � (t)

B(t) is standard Brownian motion. We solve this by perturbation theory: Let θ(t) = θ0 (t) + σθ1 (t) + σ 2 θ2 (t) + ... Then substituting this expression into the equation of motion and equating coeﬃcients of σ m , m = 0, 1, 2 successively gives us θ0�� (t) = −a.sin(θ0 (t)), θ1�� (t) = −a.cos(θ0 (t))θ1 (t) + B � (t), Deﬁne the matrix

θ2�� (t) = −a.cos(θ0 (t))θ2 (t) + (a/2)sin(θ0 (t))θ1 (t)2 A(t) =

and the 2 × 2 state transition matrix Φ(t, τ ) by

0 −a.cos(θ0 (t))

1 0

,

∂Φ(t, τ ) = A(t)Φ(t, τ ), t ≥ τ, Φ(τ, τ ) = I2 ∂t Then, θ1 (t) = θ2 (t) = (a/2)

t

Φ12 (t, τ )dB(τ ),

0

Φ12 (t, τ )sin(θ0 (τ ))θ1 (τ )2 dτ

Using these formulas, the statistics of θ(t) can be computed upto O(σ 2 ). We leave the problem of calculating the mean and autorcorrelation of θ(t) upto O(σ 2 ) as an exercise to the reader. The quantum case: Unitarity of the Schrodinger evolution operator U (t) is guaranteed if we take into account an Ito correction term in the Hamiltonian. Thus, the Schrodinger evolution is given by dU (t) = [−(iH0 + σ 2 V 2 /2)dt − iσV dB(t)]U (t) where

H0 = −∂θ2 /2ml2 − mgl.cos(θ)

We note that in this formalism, U (t) should be regarded as an integral kernel U (t, θ, θ� ) so that (U (t)ψ)(θ) = U (t, θ, θ� )ψ(θ� )dθ� Heisenberg matrix mechanics: Let X be an observable and deﬁne X(t) = U (t)∗ XU (t) Then, assuming that V = V (θ) is a multiplication operator, dX(t) = dU (t)∗ XU (t) + U (t)∗ XdU (t) + dU (t)∗ XdU (t)

Quantum Mechanics for Scientists and Engineers 171

174

= U (t)∗ (i[H0 , X]dt − (σ 2 /2)(V 2 X + XV 2 − V XV )dt + iσ[V, X]dB(t))U (t)

For example, taking X = θ, pθ respectively gives

dθ(t) = U (t)∗ (pθ /ml2 − (σ 2 /2)(V 2 θ + θV 2 − V θV ) + iσ[V, θ]dB(t))U (t) = pθ (t)/ml2 , dpθ (t) = U (t) (i[−mglcos(θ), pθ ] − (σ 2 /2)(V 2 pθ + pθ V 2 − V pθ V ) + iσ[V, pθ ]dB(t))U (t) ∗

= U (t)∗ (−mglsin(θ) − (σ 2 /2)(V [V, pθ ] + [pθ , V ]V ) + iσ[V, pθ ]dB(t))U (t) = −mglsin(θ(t)) − σV � (θ(t))dB(t)

Thus, we get the quantum analogue of the classical sde. However, to be more precise and obtain further generalizations, we must take our noise processes as the creation, annihilation and conservation processes of the Hudson-Parthasarathy quantum stochastic calculus. HP calculus generalizations: Consider the qsde dU (t) = (−(iH0 + P )dt + L1 dA(t) − L2 dA(t)∗ + SdΛ(t))U (t) where L1 , L2 , P, S are system operators chosen to make U (t) unitary for all t. Here, H0 = −∇2 /2m + U (r) acts in the system Hilbert space h = L2 (R3 ). Let X be a system observable. At time t it evolves under HP noisy Heisenberg dynamics to X(t) = U (t)∗ XU (t) X(t) is deﬁned on h ⊗ Γs (L2 (R+ )) and A(t), A∗ (t), Λ(t) are the HP noise operator processes satisfying the quantum Ito formula dA(t)dA(t)∗ = dt, dA∗ dA = 0, (dA)2 = 0, (dA∗ )2 = 0, (dΛ)2 = dΛ, dA.dΛ = dA, dΛ.dA∗ = dA∗ , dΛ.dA = 0, dA∗ dΛ = 0 Remark: Formally, we can write

dΛ = dA∗ .dA/dt

and hence using dA.dA∗ = dt, we get We get

dΛ.dA∗ = dA∗ , dA.dΛ = dA

dX(t) = dU (t)∗ XU (t) + U (t)∗ XdU (t) + dU (t)∗ XdU (t) =

U (t)∗ (i[H0 , X]dt−(P X +XP )dt+(L∗1 X −XL2 +S ∗ XL∗1 )dA∗ (−L∗2 X +XL∗1 −L∗2 XS)dA+(S ∗ X +XS ∗ +S ∗ XS)dΛ)U (t)

Exercise: Evaluate the Heisenberg equations of motion taking X = qk , X = pk , k = 1, 2, 3 where r = (q1 , q2 , q3 ), p = (p1 , p2 , p3 ). Assume that the system operators L1 , L2 , S, P are arbitrary functions of q, p subject to the constraint that makes U (t) unitary for all t. Specialize to the case when L1 , L2 , S, P are functions of q alone. A.37. The magnetic ﬁeld produced by a transmission line current when hysteresis eﬀects are taken into account. The line equations are V � (ω, z) + Z(ω, z)I(ω, z) + δ H1 (ω1 , ω − ω1 , z)I(ω1 , z)I(ω − ω1 , z)dω1 = 0 I � (ω, z) + Y (ω, z)V (ω, z) + δ Here,

H2 (ω1 , ω − ω1 , z)V (ω1 , z)V (ω − ω1 , z)dω1 = 0

Z(ω, z) = R(z) + jωL(z), Y (ω, z) = G(z) + jωC(z)

We can expand in a Fourier series Z(ω, z) =

Zn (ω)exp(jnβz), β = 2π/d

n∈Z

Y (ω, z) =

n

Yn (ω)exp(jnβz)

Quantum Mechanics for Scientists and Engineers

175

V (0) (ω, z) =

Vn(0) (ω)exp((γ + jnβ)z),

n

I (0) (ω, z) =

In(0) (ω)exp((γ + jnβ)z)

n

V (ω, z) = V (0) (ω, z) + δV (1) (ω, z) + O(δ 2 ) I(ω, z) = I (0) (ω, z) + δI (1) (ω, z) + O(δ 2 ) Then, substituting these expressions into the above line diﬀerential equations and equating coeﬃcients of δ 0 and δ 1 respectively gives us (0) (γ + jnβ)Vn(0) (ω) + Zn−m (ω)Im (ω) = 0, m

(γ +

jnβ)In(0) (ω)

+

Yn−m (ω)Vm(0) (ω) = 0

m

∂V (1) (ω, z) + Z(ω, z)I (1) (ω, z) + H1 .(I (0 ⊗ I (0) )(ω, z) = 0, ∂z

Let Φ(ω, z, z � ) ∈ C2×2

∂I (1) (ω, z) + Y (ω, z)I (1) (ω, z) + H2 (V (0) ⊗ V (0) )(ω, z) = 0 ∂z denote the state transition matrix corresponding to the forcing matrix 0 −Z(ω, z) A(ω, z) = −Y (ω, z) 0

Then, the solutions to the ﬁrst order line voltage and current perturbations are given by z V (1) (ω, z) = − Φ11 (ω, z, z � )H1 .(I (0 ⊗ I (0) )(ω, z � )dz � 0

−

z

0

Φ12 (ω, z, z � )H2 1.(V (0 ⊗ V (0) )(ω, z � )dz �

I (1) (ω, z) = − −

0

z

0

z

Φ21 (ω, z, z � )H1 .(I (0) ⊗ I (0) )(ω, z � )dz �

Φ22 (ω, z, z � )H2 .(V (0) ⊗ V (0) )(ω, z � )dz �

Denoting the inﬁnite dimensional matrix ((Zn−m (ω))) by Z(ω), ((Yn−m (ω))) by Y(ω), the diagonal matrix diag[n : n ∈ (0) (0) Z] by D, the unperturbed voltage and current Fourier coeﬃcient vectors ((Vn (ω))) and ((In (ω))) by V(0) (ω) and (0) I (ω) respectively, the above unperturbed eigenvalue equations can be expressed as T(ω)u(ω) = −γ(ω)u(ω) − − − (1) where

jβD Z(ω) Y(ω) jβD (0) V (ω) u(ω) = (0) I (ω)

T(ω) =

,

Let −γn (ω), un (ω), n = 1, 2, ... be the complete set of eigenvalues and corresponding eigenvectors of T(ω) as in (1). Then we can write cn (ω)un (ω) u(ω) = n∈Z

and we have on deﬁning the inﬁnite dimensional column vector

en (ω, z) = ((exp((γn (ω) + jkβ)z)))k∈Z , the following expression for the unperturbed line voltage and current: V (0) (ω, z) = cn (omega)en (ω, z)T vn (ω), n

Quantum Mechanics for Scientists and Engineers 173

176

I (0) (ω, z) =

cn (ω)en (ω, z)T wn (ω)

n

where

un (ω) =

vn (ω) wn (ω)

The far ﬁeld magnetic vector potential produced by this unperturbed line current is given by Az (ω, r) = (μ.exp(−jKr)/r)

d

I (0) (ω, ξ)exp(jKξ.cos(θ))dξ

0

= (μ.exp(−jKr)/r)P (ω, θ) where K = ω/c and P (ω, θ) =

d

I (0) (ω, ξ)exp(jkξ.cos(θ))dξ

0

A.38. Problems in quantum mechanics with solutions: [1] Obtain the supersymmetry generators in the form of super vector ﬁelds for four Boson and four Fermionic variables (xμ ), (θμ ). hint: Consider the generators of the form Dα = (Aμ θ)α ∂/partialxμ + Bαβ ∂/∂θβ and

¯ α = (C μ θ)α ∂/partialxμ + F β ∂/∂θα D α

The matrices A , C and the complex numbers Bαβ and Fαbeta are to be chosen so that the supersymmetric commutation relations hold in the form: ¯ β ] = K μ ∂/∂xμ [Dα , D αβ μ

μ

[2] Consider a periodic potential V (r), r ∈ R3 with three linearly independent period vectors, a1 , a2 , a3 ∈ R3 so that V (r + n1 a1 + n2 a2 + n3 a3 ) = V (r), n1 , n2 , n3 ∈ Z Expand V as a 3-D Fourier series using the reciprocal lattice vectors and hence formulate the stationary state Schrodinger equation −∇2 ψ(r)/2m + V (r)ψ(r) = Eψ(r), r ∈ R3 so that in view of the periodicity of V , we have

ψ(r + n1 a1 + n2 a2 + n3 a3 ) = C1n1 C2n2 C3n3 ψ(r), n1 , n2 , n3 ∈ Z where |Ck | = 1. If there are Nk atoms along ak k − 1, 2, 3, then we may apply the periodic boundary conditions on ψ without loss of generality in the form ψ(r + Nk ak ) = ψ(r), k = 1, 2, 3 This leads to so that

CkNk = 1, k = −1, 2, 3 Ck = exp(2πilk /Nk ), k = 1, 2, 3

for some lk ∈ {0, 1, ..., Nk − 1}. Now deﬁne the Bloch wave functions ul1 l2 l3 (r) by ψ(r) = exp(2πi(l1 b1 /N1 + l2 b2 /N2 + l3 b3 /N3 , r))ul1 l2 l3 (r) where {b1 , b2 , b3 } are the reciprocal lattice vectors corresponding to {a1 , a2 , a3 }. In other words, (bi , aj ) = δij Then, we have

ul1 l2 l3 (r + m1 a1 + m2 a2 + m3 a3 ) = ul1 l2 l3 (r), m1 m2 , m3 ∈ Z

Quantum Mechanics for Scientists and Engineers 174

177

ie, ul1 l2 l3 is also periodic with periods a1 , a2 , a3 like V (r) and hence it can also be developed into a Fourier series: Ul1 l2 l3 [n1 n2 1n3 ]exp(2πi(n1 b1 + n2 b2 + n3 b3 , r)) ul1 l2 l3 (r) = n1 n2 n3 ∈Z

V (r) =

V [n1 n2 n3 ]exp(2πi(n1 b1 + n2 b2 + n3 b3 , r))

n1 n2 n3 ∈Z

The problem is to subsitute these two Fourier series expansions for the wave function and the potential into the stationary state Schrodinger wave equation and derive a diﬀerence equation for Ul1 l2 l3 [n1 n2 n3 ]. [3] Quantum control of the Hudson-Parthasarathy equation based on the Belavkin ﬁlter observer. HP equn: dU (t) = (−(iH + P )dt + L1 dA + L2 dA∗ + SdΛ)U (t) For unitarity of U (t), we require

0 = d(U ∗ U ) = dU ∗ .U + U ∗ .dU + dU ∗ .dU

This gives using the quantum Ito formula dAdA∗ = dt, dΛ.dA∗ = dA∗ , dA.dΛ = dA, (dΛ)2 = dΛ, (all the other products of diﬀerentials are zero),

P = L∗2 L2 /2, L∗2 + L1 + L∗2 S = 0, L∗1 + L2 + S ∗ L2 = 0, S + S∗ + S∗S = 0

L1 , L2 , S, H, P are system operators with H, P self-adjoint. We have (I + S)∗ (I + S) = I + S ∗ + S + S ∗ S = I so we can write

I + S = exp(iλ(t)Z)

where Z is selfadjoint and λ(t) is a real valued controllable function of time. Then, we have L1 = −L∗2 (I + S) = −L∗2 exp(iλ(t)Z) We write L2 = L =

p

ck (t)Nk = L({ck (t)})

k=1

Then

L1 = −(

c¯k (t)Nk∗ ))exp(iλ(t)Z) = L1 ({ck (t)}, λ(t)),

k

S = exp(iλ(t)Z) − 1 = S(λ(t))

ck (t) are complex valued controllable functions of time. The real time control algorithms for controlling the ck (t)� s and λk (t) are based on minimizing the expected value of the Belavkin observer error E(t) = Xd (t) − πt (X) where Xd (t) is the desired system state at time t. This is a system observable and πt (X) is a measurable function of the noise algebra σ(Y )t upto time t. The noise is taken as Y (t) = U (t)∗ Yi (t)U (t), Yi (t) = a1 B(t) + a2 Λ(t), B(t) = A(t) + A(t)∗ , a1 , a2 ∈ R It is easily veriﬁed that Y statisﬁes the non-demolition abelian property, ie, [Y (t), Y (s)] = 0∀t, s ≥ 0

Quantum Mechanics for Scientists and Engineers 175

178

and

[Y (s), jt (X)] = 0, t ≥ s, jt (X) = U (t)∗ XU (t) = U (t)∗ (X ⊗ I)U (t)

We have

Y (t) = jt (Yi (t)) = U (t)∗ Yi (t)U (t), dY (t) = dYi (t) + dU (t)∗ dYi (t).U (t) + U (t)∗ dYi (t)dU (t) = dYi (t) + U (t)∗ (L∗1 dA∗ + L∗2 dA + S ∗ dΛ)dYi + dYi (L1 dA + L2 dA∗ + SdΛ))U (t)

Now,

dYi (L1 dA + L2 dA∗ + SdΛ) = (a1 (dA + dA∗ ) + a2 dΛ)(L1 dA + L2 dA∗ + SdΛ) = a1 L2 dt + a2 SdΛ + a2 L2 dA∗ + a1 SdA

We thus get dY (t) = dYi (t) + jt (a1 (L2 + L∗2 ))dt + jt (a2 (S + S ∗ ))dΛ + jt (a2 L2 + a1 S ∗ )dA∗ + jt (a2 L∗2 + a1 S)dA write

πt (X) = E(jt (X)|σ(Y )t )

where the expectation is taken in the state |f φ(u) > with f ∈ h, the system Hilbert space and |φ(u) >= exp(− � u �2 /2)|e(u) > the normalized exponential vector in the Boson Fock space Γs (L2 (R+ )). We assume that the optimal ﬁlter is described by the following qsde: dπt (X) = Ft (X)dt + Gt (X)dY (t) where Ft (X), Gt (X) ∈ σ(Y )t . Then applying the orthogonality principle E[(jt (X) − πt (X))C(t)] = 0 for C(t) deﬁned via the qsde

dC(t) = f (t)C(t)dY (t), C(0) = 1

and using the arbitrariness of f (t) gives us E[(djt (X) − dπt (X))|σ(Y )t ] = 0, E[(djt (X) − dπt (X))dY (t)|σ(Y )t ] + E[(jt (X) − πt (X))dY (t)|σ(Y )t ] = 0

These two equations give us two equations for the observables Ft (X) and Gt (X), solving which the Belavkin ﬁlter is obtained. To derive these two equations, we compute using quantum Ito’s formula, djt (X) = dU ∗ XU + U ∗ XdU + dU ∗ XdU = = U ∗ (i[H, X]−P X −XP +L∗2 XL2 )dt+(L∗2 X +XL1 +L∗2 XS)dA+(L∗1 X +XL2 +S ∗ XL2 )dA∗ +(S ∗ X +XS +S ∗ XS)dΛ)U = jt (θ0 (X))dt + jt (θ1 (X))dAt + jt (θ2 (X))dA∗t + jt (θ3 (X))dΛt

where

θ0 (X) = i[H, X] − L∗ LX/2 − XL∗ L/2 + L∗ XL∗ θ1 (X) = L∗ X − XL∗ (I + S) + L∗ XS,

θ2 (X) = −(1 + S)∗ LX + XL − S ∗ XL, θ3 (X) = S ∗ X + XS + S ∗ XS

Note that and The equation

L2 = L = L({ck (t)}), S = S(λ(t)) L1 = −L∗ (1 + S) = L1 ({ck (t), λ(t)) E[(djt (X) − dπt (X))|σ(Y )t ] = 0,

Quantum Mechanics for Scientists and Engineers 176

thus gives

179

u(t) + πt (θ3 (X))|u(t)|2 πt (θ0 (x)) + πt (θ1 (X))u(t) + πt (θ2 (x))¯

−Gt (X)[a1 (u(t)+¯ u(t))+a2 |u(t)|2 +πt (a1 (L2 +L∗2 ))+πt (a2 (S+S ∗ ))|u(t)|2 +pit (a2 L2 +a1 S ∗ )¯ u(t)+πt (a2 L∗2 +a1 S)u(t)] = 0 or equivalently,

Ft (X) = u(t) + πt (θ3 (X))|u(t)|2 πt (θ0 (x)) + πt (θ1 (X))u(t) + πt (θ2 (x))¯

−Gt (X)[a1 (u(t) + u ¯(t)) + πt (a1 (L2 + L∗2 )) + πt (a2 (S + S ∗ ))|u(t)|2 + pit (a2 L2 + a1 S ∗ )¯ u(t) + πt (a2 L∗2 + a1 S)u(t)]

The equation gives

E[(djt (X) − dπt (X))dY (t)|σ(Y )t ] + E[(jt (X) − πt (X))dY (t)|σ(Y )t ] = 0 u(t) + πt (θ3 (X))|u(t)|2 πt (θ0 (X)) + πt (θ1 (X))u(t) + πt (θ2 (X))¯

−Ft (X) − Gt (X)E[(dY (t))2 |σ(Y )t ] + πt (X)(a1 (u(t) + u ¯(t)) + a2 |u(t)|2 + πt (a1 (L2 + L∗2 )) + πt (a2 (S + S ∗ ))|u(t)|2 +πt (a2 L2 + a1 S ∗ )¯ u(t) + πt (a2 L2 + a1 S)u(t) = 0

Now, from the equation dY (t) = dYi (t) + jt (a1 (L2 + L∗2 ))dt + jt (a2 (S + S ∗ ))dΛ + jt (a2 L2 + a1 S ∗ )dA∗ + jt (a2 L∗2 + a1 S)dA = jt (a1 (L2 + L∗2 ))dt + jt (a2 (S + S ∗ + 1))dΛ + jt (a2 L2 + a1 (S ∗ + 1))dA∗ + jt (a2 L∗2 + a1 (S + 1))dA we get using quantum Ito’s formula and the homomorphism property of jt , E[(dY (t))2 |σ(Y )t ] = u(t) πt (a22 (S + S ∗ + 1)2 )|u(t)|2 + πt ((a2 L∗2 + a1 (S + 1))(a2 L2 + a1 (S ∗ + 1))) + πt ((a2 (S + S ∗ + 1)(a2 L2 + a1 (S ∗ + 1)))¯ +πt (a2 (a2 L∗2 + a1 (S + 1))(S + S ∗ + 1))u(t)

[4](Reference: S.Wienberg, The quantum theory of ﬁelds, vol.II, Cambridge University Press)Evaluate approximately the path integeral Z(J) = exp(iI(φ) − i J(x)φ(x)d4 x)Dφ

where J(x) is an external current source and I(φ) = (∂μ φ)(∂ μ φ)/2 − m2 φ2 /2 − �V (φ)d4 x

ie, the Klein-Gordon action functional with a small perturbative correction. Calculate using this expression the propagators exp(iI(φ)φ(x1 )...φ(xn )Dφ/Z(0) by using the formula

= in

∂ n bZ(J) |J=0 ∂J(x1 )...∂J(xn ) exp(iI(φ))φ(x1 )...φ(xn )dφ

Now derive the equation for J at which log(Z(J)) − this stationary solution by J0 (x), show that

J(x)φ0 (x)d4 x becomes stationary for a ﬁxed ﬁeld φ0 . Denoting

−φ0 (x) + δlogZ(J0 )/δJ(x) = 0 or equivalently, φ0 (x) + iZ(J0 )−1

exp(i

(I(φ) − J0 (x)φ(x))d4 x)φ(x).Dφ = 0

Quantum Mechanics for Scientists and Engineers 177

180

We write

J0 (x) = J0 (φ0 )(x)

or inverting this,

φ0 (x) = φ0 (J0 )(x)

It follows that

(δφ0 (x)/δJ0 (y) = δ 2 log(Z(J0 ))/δJ(x)δJ(y)

We deﬁne Γ(φ0 ) = log(Z(J0 )) − Thus, δΓ(φ0 )/δφ0 (y) =

J0 (x)φ0 (x)d4 x

φ0 (x)(δJ0 (x)/δφ0 (y))d4 x − = −J0 (y)

(δJ0 (x)/δφ0 (y))φ0 (x)d4 x − J0 (y)

Γ(φ0 ) is called the quantum eﬀective action and the above equation is called the equation of motion for the quantum eﬀective ation. [5] Calculation of path integrals for gauge invariant theories: Let φ(x) be the set of ﬁelds and f [φ] a gauge ﬁxing functional. We consider a path integral of the form X = G[φ]B[f [φ]]F [φ]Dφ where F [φ] is the Jacobian determinant:

F [φ] = det(df [φΛ ]/dΛ)|Λ=id

where φ → φΛ is the gauge transformed ﬁeld. Λ is the gauge transorming function. We wish to show that in a certain sense, X does not depend on the gauge ﬁxing functional f . We assume that the combined action functional G[φ] along with the measure Dφ, ie, G[φ]Dφ is invariant under the gauge transformation, ie, for all gauge transformations Λ, we have G[φΛ ]DφΛ = G[φ]Dφ Then, it follows that X= =

G[φΛ ]B[f [φΛ ]]F [φΛ ]DφΛ G[φ]B[f [φΛ ]]F [φΛ ]Dφ

Now, if Λ and λ are two gauge transformations, then det(df [φΛoλ ]/dλ)|λ=id = F [φΛ ]/ρ(Λ) where

ρ(Λ)−1 = det(d(Λoλ)/dλ)λ=id

So, X=

G[φ]B[f [φΛ ]]ρ(Λ)det(df [φΛ ]/dΛ)Dφ

Integrating this equation w.r.t Λ over the gauge group gives us after appropriately normalizing the Haar measure, X = G[φ].B[f ]]df.Dφ = C G[φ]Dφ where C is a constant deﬁned by

C=

B[f ]Df =

B[f [φΛ ]]det(df [φΛ ]/dΛ)ρ(Λ)dΛ

where we use the fact that ρ(Λ) as deﬁned above is the left invariant Haar density on the gauge group and also the assumption that the gauge group acts transitively on the matter ﬁelds φ.

Quantum Mechanics for Scientists and Engineers 178

181

[6] Quantum teleportation: The simplest version of this idea involves transmitting one qubit of information by transmitting just two classical bits of information. Alice and Bob share an entangled state |00 > +|11 > apart from normalization factor of 2−1/2 . Alice prepares another state |ψ >= c1 |1 > +c2 |0 > where c1 , c2 ∈ C and |c1 |2 + |c2 |2 = 1 which she wishes to transmit to Bob by making use of the entangled state that she shares with Bob. The overall state of Alice and Bob is thus |ψ > (|00 > +|11 >) = (c1 |1 > +c2 |0 >)(|11 > +|00 >) = c1 |111 > +c1 |100 > +c2 |011 > +c2 |000 >

The ﬁrst two qubits of this state can be controlled only by Alice and the last only by Bob. Alice applies a phase gate to her two qubits thus obtaining the overall state as |φ >= c1 |111 > +c1 |100 > +c2 |011 > −c2 |000 > We express this state in terms of the orthogonal one qubit state |+ >= |1 > +|0 >, |− >= |1 > −|0 > or equivalently,

|1 >= |+ > +|− >, |0 >= |+ > −|− >

Thus the overall state of Alice and Bob is given by

|φ >= c1 (|+ > +|− >)(|+ > +|− >)|1 > +c1 (|+ > +|− >)(|+ > −|− >)|0 > +c2 (|+ > −|− >)(|+ > +|− >)|1 > −c2 (|+ > −|− >)(|+ > −|− >)|0 >

Alice now performs a measurement on her two qubits in this shared state using the orthonormal basis | + + >, | + − > , | − + >, | − − >. If she measures | + + >, then clearly from the above expression for |φ >, Bob’s state collapses to c1 |1 > +c1 |0 > +c2 |1 > −c2 |0 >= c1 (|1 > +|0 >) + c2 (|1 > −|0 >) If Alice measures | + − >, then Bob’s state collapses to c1 |1 > −c1 |0 > +c2 |1 > +c2 |0 >= c1 (|1 > −|0 >) + c2 (|1 > +|0 >) If Alice measures | − + >, then Bob’s state collapses to c1 |1 > +c1 |0 > −c2 |1 > +c2 |0 >= c1 (|1 > +|0 >) + c2 (|0 > −|1 >) Finally, if Alice measures | − − >, then Bob’s state collapses to c1 |1 > −c1 |0 > −c2 |1 > −c2 |0 >= c1 (|1 > −|0 >) − c2 (|1 > +|0 >) Using two classical bits, Alice reports to Bob the outcome of her measurements. If she reports | + + >, then Bob applies the unitarty gate U1 to his state deﬁned by √ √ U1 (|1 > +|0 >) 2 = |1 >, U1 (|1 > −|0 >)/ 2 = |0 >, to recover |ψ >. If Alice reports | + − >, then Bob applies the unitary gate U2 to his state deﬁned by √ √ U2 (|1 > −|0 >)/ 2 = |1 >, U2 (|1 > +|0 >)/ 2 = |0 > to recover |ψ >. If Alice reports | − + >, then Bob applies the unitary gate U3 to his state deﬁned by √ √ U3 (|1 > +|0 >)/ 2 = |1 >, U3 (|1 > −|0 >)/ 2 = −|0 > to recover |ψ >. Finally, if Alice reports | − − >, then Bob applies the unitary gate U4 to his state deﬁned by √ √ U4 (|1 > −|0 >)/ 2 = |1 >, U4 (|1 > +|0 >)/ 2 = −|0 >

Quantum Mechanics for Scientists and Engineers 179

182

to recover the state |ψ >. [7] Quantum Boltzmann equation. N identical particles. ρ(t) is the joint density matrix of the particles. The k th particle acts in the Hilbert space Hk . The Hilbert space of the whole system is H=

N k=1

Hk

Each Hk is an identical copy of a ﬁxed Hilbert space H0 . ρ satisﬁes the Schrodinger-Von-Neumann-Liouville equation iρ� (t) = [H, ρ(t)] where H=

N

k=1

Hk +

Vkj

1≤k)W (z)W (u) where W (z) = exp(

N j=1

(¯ zj aj − zj a∗j )), z = [z1 , ..., zN ]T

Let ρ be a state in the Hilbert space L2 (RN ) = Γs (C). Then its quantum Fourier transform is given by ρˆ(z) = T r(ρW (z)) We have

ρˆ(za − zb ) = T r(ρW (za − zb )) =

T r(ρW (za )W (zb )∗ )exp(iIm(< za , zb >)) and hence the matrix ((ˆ ρ(za − zb )exp(−iIm(< za , zb >)))1≤a,b≤p is positive deﬁnite for any p complex vectors za , a = 1, 2, ..., p in CN . Let ρ be a Gaussian state, ie, its quantum Fourier transform has the form ρˆ(x + iy) = exp(lT x + mT y − [xT , y T ]S[xT , y T ]T ) where l, m are complex vectors and S is a symmetric complex matrix. We write R(z) = [xT , y T ]T and hence, we can write ρˆ(z) = exp(pT R(z) − R(z)T SR(z)/2) We have for z, u ∈ CN ,

Im(< z, u >) = R(z)T JR(u)

where J=

0N −IN

IN 0

∈ R2N ×2N

Quantum Mechanics for Scientists and Engineers 184

187

Thus, the requirement of the the above positive deﬁnitivity amounts to requiring that 2S − iJ ≥ 0 When this condition is satisﬁed, we call ρ a quantum Gaussian state. [13] Problem: Consider Dirac’s equation in a noisy electromagnetic ﬁeld described by the qsde β α β dU (t) = [(α, pdt + e(Lα β dΛα (t))) + βmdt) + (V (q)dt + eMβ dΛα (t))U (t)

where the system Hilbert space is h = L2 (R3 ) and the noise Boson-Fock space is Γs (L2 (R+ )⊗Cd ). The system operators α Lα β andMβ are assumed to be functions of q, p only. The electron charge e is a perturbation parameter. Formally, the noisy magnetic vector potential is β At = Lα β dΛα (t)/dt If we assume that Lα β are functions of the position vector observable q only, then At becomes a a multiplication operator when restricted to the system Hilbert space h only. The noisy electric scalar potential is Φt = Mβα dΛβα (t)/dt and if we assume that Mβα are functions of q only, then the electric scalar potential becomes a multiplication operator when restricted to the system Hilbert space only. To get an approximate solution for U (t), we write U (t) = U0 (t) + eU1 (t) + e2 U2 (t) + O(e3 ) Problem: Calculate the matrix elements of U (t) between the states |f φ(u) > and |gφ(v) > upto O(e2 ) where |f >, |g >∈ h and u, v ∈ L2 (R+ ) ⊗ Cd , |φ(u) >= exp(− � u �2 /2)|e(u) > and likewise for |φ(v) >. Interpret this result in terms of transition probabilities for the system between two energy levels by taking an average over the noise space assuming that the noise space remains in the coherent state |φ(u) > before and after the transition. [14] Alternative description of the induced representation of a semidirect product: G = N ⊗s H, hN h−1 = N, ∀h ∈ H, N ˆ . Let is Abelian. Choose χ0 ∈ N H0 = {h ∈ H : h.χ0 = χ0 } where

ˆ h.χ(n) = χ(h−1 nh), h ∈ H, n ∈ N, χ ∈ N

Let Oχ0 denote the orbit of χ0 under H, ie,

ˆ Oχ0 = {hχ0 : h ∈ H} ⊂ N The manifolds H/H0 and Oχ0 are isomorphic under the H-group action. This follows by deﬁning the map φ : H/H0 → Oχ0 by φ(hH0 ) = hχ0 , h ∈ H. The map is well deﬁned since hH0 = h� H0 implies h −1 h ∈ H0 implies h −1 hχ0 = χ0 � � � implies hχ0 = h χ0 . The map φ is a bijection since φ(hH0 ) = φ(h H0 ) implies hχ0 = h χ0 implies h −1 hχ0 = χ0 implies h −1 h ∈ H0 implies hH0 = h� H0 . This proves the injectivity of φ. Surjectivity is obvious. Let L be a representation of H0 in a vector space V . Now Let Y denote the vector space of all functions f : H → V such that L L : H → Y by U L (h1 )f (h) = f (h−1 is a f (hh−1 0 ) = L(h0 )f (h), h0 ∈ H0 , h ∈ H. Deﬁne U 1 h), h1 , h ∈ H. Then, U representation of H in Y and is called the representation of H induced by the representation L of H0 . Now we give an alternate description of the induced representation. Let Y˜ denote the vector space of all functions f : Oχ0 → V . Deﬁne T : Y˜ → Y by the formula T f (h) = A(h)f (hχ0 ), h ∈ H where A(h) is an appropriate matrix, ie A : H → GL(V ). In order that T f ∈ Y , we require that T f (hh−1 0 ) = L(h0 )T f (h), h0 ∈ H0 , h ∈ H, ie, −1 A(hh−1 0 )f (hh0 χ0 ) = L(h0 )A(h)f (hχ0 ), h ∈ H, h0 ∈ H0

or equivalently, since h−1 0 χ0 = χ0 ,

A(hh−1 0 ) = L(h0 )A(h), h0 ∈ H0 , h ∈ H

This can be satisﬁed for example by choosing

A(h) = L(γ(hχ0 )−1 h)−1 = L(h−1 γ(hχ0 ))

Quantum Mechanics for Scientists and Engineers 185

188

We then check that

−1 −1 = L(h0 h−1 γ(hh−1 h) = L(h0 ) A(hh−1 0 )A(h) 0 χ0 )L(γ(hχ0 )

Thus, We have for h, h1 ∈ H,

(T f )(h) = L(h−1 γ(hχ0 ))f (hχ0 ), h ∈ H −1 −1 (U L (h1 )T f )(h) = T f (h−1 h1 γ(h−1 1 h) = L(h 1 hχ0 ))f (h1 hχ0 )

If we equate this to T V L (h1 )f (h) = L(h−1 γ(hχ0 ))(V L (h1 )f )(hχ0 ), then we must have (V L (h1 )f )(hχ0 ) = L(γ(hχ0 )−1 hh−1 h1 γ(h−1 1 hχ0 )) = L(γ(hχ0 )−1 h1 γ(h−1 1 hχ0 )) or equivalently, writing χ = h.χ0 , we get V L (h1 )f (χ) = L(γ(χ)−1 h1 γ(h−1 1 χ)) V L therefore deﬁnes a representation of H in Y˜ that is equivalent to the induced representation U L of H in Y . as

[15] EM wave propagation in an inhomogeneous waveguide: The permittivity and permeability ﬁelds are represented �[n, x, y]exp(−nβz), �(x, y, z) = n

μ(x, y, z) =

μ[n, x, y]exp(−nβz)

n

For example, if the length of the guide is d, we can take β = j2π/d, ie, develop � and μ as Fourier series in the z variable. We also expand the electric and magnetic ﬁelds as Fourier series in the z variable with an additional complex propagation constant γ: E(x, y, z) = E[n, x, y]exp(−(γ + nβ)z), n

H(x, y, z) =

H[n, x, y]exp(−(γ + nβ)z)

n

We also expand the current density J(x, y, z) within the guide as J[n, x, y]exp(−jnβz) J(x, y, z) = n

Substituting these into the Maxwell curl equations curlE(x, y, z) = −jωμ(x, y, z)H(x, y, z), curlH(x, y, z) = J(x, y, z) + jω�(x, y, )E(x, y, z) and equating coeﬃcients of exp(−jnβz) for each integer n gives us z × E⊥ [n, x, y] = −jω ∇⊥ Ez [n, x, y] × zˆ − (γ + nβ)ˆ z × H⊥ [n, x, y] = jω ∇⊥ Hz [n, x, y] × zˆ − (γ + nβ)ˆ +J⊥ [n, x, y]

m

m

for the x − y components. For the z component, we have (ˆ z , ∇⊥ × H⊥ ) = Jz + jω�Ez , (ˆ z , ∇⊥ × E⊥ ) = −jωμHz

μ[m, x, y]H⊥ [n − m, x, y], �[m, x, y]E⊥ [n − m, x, y]

Quantum Mechanics for Scientists and Engineers 186

189

or in the transform domain, (ˆ z , ∇⊥ × E⊥ [n, x, y]) = −jω

m

μ[m, x, y]Hz [n − m, x, y],

(ˆ z , ∇⊥ × H⊥ [n, x, y]) = Jz [n, x, y] + jω

m

�[m, x, y]Ez [n − m, x, y]

To cast these equations in the format of perturbation theory, we assume that �[n, x, y] and μ[n, x, y] for n �= 0 are small compared to �[0, x, y] and μ[0, x, y] and hence attach a small pertrubation parameter δ to the former terms. We then get z × E⊥ [n, x, y] + jωμ[0, x, y]H⊥ [n, x, y] = −jδω ∇⊥ Ez [n, x, y] × zˆ − (γ + nβ)ˆ μ[m, x, y]H⊥ [n − m, x, y], m�=0

z × H⊥ [n, x, y] − jω�[0, x, y]E⊥ [n, x, y] − J⊥ [n, x, y] = jδω ∇⊥ Hz [n, x, y] × zˆ − (γ + nβ)ˆ or in the transform domain, (ˆ z , ∇⊥ × E⊥ [n, x, y]) + jωμ[0, x, y]Hz [n, x, y] = −jδω

m�=0

m�=0

�[m, x, y]E⊥ [n − m, x, y]

μ[m, x, y]Hz [n − m, x, y],

(ˆ z , ∇⊥ × H⊥ [n, x, y]) − Jz [n, x, y] − jω�[0, x, y]Ez [n, x, y] = jωδ

m�=0

�[m, x, y]Ez [n − m, x, y]

[16] Symmetry breaking: Let ψ(x) ∈ CN be the matter ﬁeld. It transforms under gauge group G ⊂ U (N ). Under a local gauge transformation, this is given by ψ(x) → g(x)ψ(x), where g(x) ∈ G. Now, assume that we can write ˜ ˜ ψ(x) = γ(x)ψ(x) where γ(x) is a representative element of a coset in G/H with H a subgroup of G. Thus, ψ(x) ˜ transforms according to the ”broken subgroup” H and g(x) ∈ G transforms ψ(x) to g(x)ψ(x) = g(x)γ(x)ψ(x) and / H. Assume that Aμ (x) are we can write g(x)γ(x) = γ � (x)h(x) where h(x) ∈ H and either γ � (x) = e or else γ � (x) ∈ the gauge ﬁelds taking values in the Lie algebra of G, ie in g. ψ(x) transforms according to G and the corresponding transformation law of Aμ (x) under G is obtained by using the fact that g(x)(∂μ + ieAμ (x))ψ(x) = (∂μ + A�μ (x))g(x)ψ(x) for this ensures that if the Lagrangian density L(ψ(x), ∇μ ψ(x)) is globally G-invariant, ie L(gψ(x), g∇μ ψ(x)) = L(ψ(x), ∇μ ψ(x)) for all g ∈ G, where

∇μ = ∂μ + ieAμ (x)

is the gauge covariant derivative, then it would follow that L is also locally G-invariant, ie, L(ψ � (x), ∇�μ ψ � (x)) = L(ψ(x), ∇μ ψ(x)) where

∇�μ = ∂μ + ieA�μ (x)

It follows that the transformation law of the matter ﬁeld ψ(x) under the gauge transformation is ψ � (x) = g(x)ψ(x) while the corresponding transformation law of the gauge ﬁeld Aμ (x) under the gauge transformation is given by (∂μ + ieA�μ (x)) = g(x)(∂μ + ieAμ (x))g(x)−1 (as an operator equation), or equivalently, ieA�μ (x) = ieg(x)Aμ (x)g(x)−1 + g(x)∂μ (g(x)−1 ) or

A�μ (x) = g(x)Aμ (x)g(x)−1 + (i/e)(∂μ g(x))g(x)−1

Quantum Mechanics for Scientists and Engineers 187

190

This equation is the non-Abelian generalization of the Abelian U(1) gauge transformation of the Dirac ﬁeld in an electromagnetic ﬁeld where the wave function ψ(x) ∈ C4 transforms to ψ � (x) = exp(iφ(x))ψ(x) where φ(x) ∈ R and correspondingly the electromagnetic four potential Aμ (x) changes to A�μ (x) so that (∂μ + ieA�μ )exp(iφ) = exp(iφ)(∂μ + ieAμ ) or equivalently,

ieA�μ = ieAμ − i∂μ φ

or equivalently,

A�μ (x) = Aμ (x) − e−1 ∂μ φ(x)

which is the standard Lorentz gauge transformation. Coming back to the general non-Abelian case, we assume that if {ta } are the generators of the Lie algebra of G, then Dba (g)tb gta g −1 = b

In other words, D(g) is the adjoint representation of G acting in g. Then, Aaμ ta )g −1 gAμ (x)g −1 = g( a

=

Aaμ gta g −1 =

a

This means that

A�μ (x) =

Aaμ Dba (g)tb

a

Dba (g)Aaμ (x)tb + (i/e)(∂μ g(x))g(x)−1

b

Now suppose that the matter Lagrangian density in the absence of the gauge ﬁeld has the form L=

1 (∂μ ψ(x))∗ (∂ μ ψ(x)) − m2 ψ(x)∗ ψ(x) 2

This Lagrangian density is invariant under global G-transformations since g ∈ U (N ). Then since ˜ ψ(x) = γ(x)ψ(x) we get so that

∂μ ψ(x) = (∂μ γ)ψ˜ + γ∂μ ψ˜ ˜ μ ψ)∗ + ψ˜∗ (∂μ γ)∗ (∂ μ γ)ψ˜ L = (∂μ ψ)(∂ ˜ − m2 ψ(x) ˜ ∗ ψ(x) ˜ +2Re(ψ˜∗ ∂μ γ)∗ γ∂ μ ψ)

It is clear that This Lagrangian density of the matter part ie those terms invoving only ψ˜ and not γ is invariant under ˜ ˜ ψ(x) → hψ(x), h ∈ H and further that the above Lagrangian does not contain terms that are quadratic in γ(x) not ˜ involving their derivatives. This means that ψ(x) has mass while the other part γ(x) does not have mass. γ(x) is called the Goldstone-massless part of the ﬁeld and ψ˜ is called the massive part. Thus, broken symmetries are associated with ˜ the productions of massless Goldstone Bosons. Just as ψ was separated into a massive part ψ(x) and a massless part γ(x), we can also separate the gauge ﬁeld Aμ (x) into two parts: ˜ ψ(x) = γ(x)−1 ψ(x), A˜μ (x) = γ(x)−1 Aμ (x)γ(x) or equivalently, Then we ﬁnd that

A˜aμ (x) =

Dab (γ(x)−1 )Abμ (x)

˜ ψ � (x) = g(x)ψ(x) = g(x)γ(x)ψ(x), g(x)γ(x) = γ � (x)h(x),

Quantum Mechanics for Scientists and Engineers 188

191

(∂μ g)γ + g∂μ γ = (∂μ γ � )h + γ � ∂μ h and hence

Thus, implies

γ −1 g −1 (∂μ g)γ + γ −1 ∂μ γ = h−1 γ � − 1(∂μ γ � )h + h−1 partialμ h A�μ (x) = g(x)Aμ (x)g(x)−1 + (i/e)(∂μ g(x))g(x)−1 − − − (1) A˜�μ = γ −1 A�μ γ � =

γ

−1

g(x)Aμ (x)g(x)−1 γ � + (i/e)γ

Noting that γ

−1

it follows that this equals

−1

(∂μ g(x))g(x)−1 γ �

g = hγ −1

A˜�μ = hγ −1 Aμ γh−1 + (i/e)γ

−1

((−g(∂μ γ)γ −1 g −1 + (∂μ γ � )γ

+γ � (∂μ h)h−1 γ

−1

−1

)γ �

= hA˜μ h−1 + (i/e)(−hγ −1 (∂μ γ)h−1 + γ or equivalently,

−1

∂μ γ � + (∂μ h)h−1 )

A˜�μ − (i/e)γ −1 ∂μ γ � =

h(A˜μ (−i/e)γ −1 ∂μ γ)h1 + (i/e)(∂μ h)h−1 − − − − − (2)

where we have used the fact that the equation

γ −1 g −1 (∂μ g)γ + γ −1 ∂μ γ =

implies which again implies

h−1 γ � − 1(∂μ γ � )h + h−1 partialμ h (∂μ g)γ + g∂μ γ = (∂μ γ � )h + γ � (∂μ h) (∂μ g)g −1 + g(∂μ γ)γ −1 g −1 = (∂μ γ � )hγ −1 g −1 +γ � (∂μ h)γ −1 g −1

which implies

(∂μ g)g −1 + g(∂μ γ)γ −1 g −1 = (∂μ γ � )γ +γ � (∂μ h)h−1 γ

−1

−1

Equation (2) should be compared to equation (1). (1) tells us how the gauge ﬁelds Aμ transform under a local gauge transformation g(x) coming from the big group G. (2) tells us how the massless components of the gauge ﬁelds A˜μ transform locally under the broken subgroup H. It tells us that when symmetry breaking takes place, then the massless gauge ﬁeld should be taken as A˜μ − (i/e)γ −1 ∂μ γ rather than as A˜μ . In other words, the massless Goldstone components γ of the matter ﬁeld ψ also becomes a part of the massless component of the gauge ﬁeld A˜μ and the sum total of these two determines the eﬀective gauge ﬁeld which transforms under the broken subgroup H just as the original gauge ﬁeld Aμ transformed under the larger group G. It is possible to express all these formulae in terms of functions of the space-time coordinates by ﬁrst choosing a set of generators ti , i = 1, 2, ..., p of the Lie algebra of H and then extending this to a basis of the Lie algebra of G, ie, {t1 , ..., tp , x1 , ..., xq }, p + q = n = dimG. Then, we can write γ(x) = exp(

q

(ξi (x)xi )

i=1

ξi (x) can be taken as the Goldstone Boson ﬁelds and quantities like (∂μ γ(x))γ(x)−1 can be expressed as nonlinear functions of the ξi (x). In this way, the Lagrangian density can be expressed in terms of the massless part A˜μ of the Gauge ﬁelds, the massive part ψ˜ of the matter ﬁeld ψ and the massless Goldstone Boson ﬁelds ξi (x).

Quantum Mechanics for Scientists and Engineers 189

192

[17] Supersymmetric ﬁeld theories: Let θa , a = 0, 1, 2, 3 be anticommuting Majorana Fermions: Writing θ = (θa ), we have 0 e θ θ∗ = −e 0 where

e= with

0 iσ 2

iσ 2 0

σ μ , μ = 0, 1, 2, 3

begin the Pauli spin matrices:

0 1 , 1 0 0 −i 1 0 , σ3 = σ2 = i 0 0 −1 σ 0 = I2 , σ 1 =

We write

σμ = ημν σ ν

so that The Dirac gamma matrices are deﬁned as

σ0 = σ 0 , σr = −σ r , r = 1, 2, 3 γμ =

They satisfy

0 sigmaμ

σμ 0

{γ μ , γ ν } = 2η μν I4

Deﬁne the left Chiral and right Chiral fermionic variables

θL = ((1 + γ5 )/2)θ, θR = ((1 − γ5 )/2)θ where γ5 = Clearly,

I2 0

0 −I2

γ 0 γ 1 γ 2 γ 3 = iγ5

and hence from the anticommutativity of the γ , we deduce that μ

{γ μ , γ5 } = 0 We also have

θ = θ L + θR , ∂/∂θ = ((1 + γ5 )/2)(∂/∂θL ) + ((1 − γ5 )/2)(∂/∂θR )

Deﬁne the supersymmetry generators as where Also deﬁne where

L = γ5 �∂/∂θ + γ μ θ∂μ ∂μ = ∂/∂xμ ¯ = −γ5 �L L �=

Clearly,

e 0

0 e

e2 = −I2 , �2 = −I4 , eT = −e, �T = −�, γ5T = γ5 , [γ5 , �] = 0

Quantum Mechanics for Scientists and Engineers 190

and hence

193

(γ5 �)T = −γ5 �, (γ5 �)2 = −I4

so we can also write

¯ T = LT γ5 � L

We get

¯= L ∂/∂θ − γ5 �γ μ θ∂μ

Then using

{∂/∂θa , θb } = δba

we get

¯ b } = {(γ μ θ)a ∂μ , ∂/∂θb } {La , L

+{(γ5 �)ac ∂/∂θc , −(γ5 �γ μ θ)b ∂μ }

= (γ μ )ab ∂μ − (γ5 �)ac (γ5 �γ μ )bd δcd ∂μ = (γ μ )ab ∂μ − (γ5 �)ac (γ5 �γ μ )bc ∂μ

= (γ μ )ab ∂μ − [(γ5 �)γ μT (γ5 �)T ]ab ∂μ μ = (γab + γ5 �γμT γ5 �)∂μ μ = 2γab ∂μ

since γ5 �γ μT γ5 � =

e 0

0 −e

(

0 σ

μT

σμT 0

Formally, we can write this anticommutation relation as

e 0

0 −e

= γμ

¯ = 2γ μ ∂μ {L, L} It is easy to see that the six matrices �, γ5 �, �γ μ , μ = 0, 1, 2, 3 form a basis for the space of all skew-symmetric 4 × 4 matrices. We can thus expand any such matrix X as X = a0 � + a1 γ5 � + bμ �γ μ where a0 , a1 , bμ are complex constants. To evaluate these constants, we note that T r(�2 ) = −4, T r(�γ5 �) = 0, T r(γμ �2 ) = 0, T r((γ5 �)(γ5 �)) = −4, T r((γ5 �)γμ �) = 0,

It follows that

T r(γν )�2 γ μ ) = −4δνμ a0 = −T r(�X)/4, a1 = −T r(γ5 �X)/4, bμ = −T r(γμ �X)/4

In particular, we ﬁnd that since θθT is a 4 × 4 skew-symmetric matrix,

θθT = (−1/4)((θT �θ)� + (θT γ5 �θ)γ5 � + (θT γμ �θ)�γ μ ) Now let S(x, θ) be any superﬁeld. It can be expanded in powers of θ upto fourth degree with the coeﬃcients being functions of x only. It follows that we can expand it as S(x, θ) = C(x) + θT �ω(x) + θT �θM (x) + θT γ5 �θN (x) + θT �γ μ θVμ (x)+ (θT �θ)θT γ5 �β(x) + (θT �θ)2 K(x) This is a consequence of the fact that any ﬁrst degree polynomial in θ can be expressed as θT �ω where ω has four components, any second degree polynomial in θ is a linear combination of θT �θ, θT γ5 �θ and θT �γ μ θ, μ = 0, 1, 2, 3 since if M is any symmetric matrix, then θT M θ = 0 and further that as noted above, �, γ5 � and �γ μ , μ = 0, 1, 2, 3 is a basis for

Quantum Mechanics for Scientists and Engineers 191

194

4 × 4 skew symmetric matrices. We now evaluate the change in the superﬁeld S under a supersymmetry transformation αT L where α is a Fermionic 4-vector. We have αT L(C(x)) = αT (γ5 �∂/∂θ + γ μ θ∂μ )(C(x)) = αT γ μ θC,μ (x) αT L(θT �ω(x)) = α (−γ5 ω(x) + γ μ θθT �ω,μ (x)) T

αT L(θT �θM (x)) = = −2αT γ5 θM (x) + θT �θαT γ μ θM,μ (x))

Note that we have used the fact that θ �θ commutes with θ (two anticommutations make one commutation). T

αT L(θT γ5 �θN (x)) = −2αT θN (x) + θT γ5 �θαT γ μ θN,μ (x) αT L(θT �γ μ θVμ (x))

= αT (−2γ5 γ μ θVμ (x) + θT �γ μ θγ ν θVμ,ν (x)) = −2αT γ5 γ μ θVμ + θT �γ μ θαT γ ν θVμ,ν αT L(θT �θθT γ5 �β(x))

= −2αT γ5 θθT γ5 �β(x)) −θT �θαT β(x)

+θT �θγ μ θθT γ5 �β,μ (x) We now recall that

θθT = (−1/4)(θT �θ� + θT γ5 �θγ5 � + θT �γ μ θγμ �)

and hence αT L(θT �θθT γ5 �β(x)) = −2αT γ5 θθT γ5 �β(x)) −θT �θαT β(x)

−(1/4)θT �θαT γ μ (θT �θ� + θT γ5 �θγ5 � + θT �γ μ θγμ �)γ5 �β,μ (x) = −2αT γ5 θθT γ5 �β(x)) −θT �θαT β(x)

Now using

−(1/4)θT �θ(θT �θαT γ μ � + θT γ5 �θαT γ μ γ5 � + θT �γ ν θαT γ μ γν �)γ5 �β,μ (x) θT �θ.θT γ5 �θ = 0, θT �θ.θT �γ ν θ = 0

we get

αT L(θT �θθT γ5 �β(x)) = −2αT γ5 θθT γ5 �β(x)) −θT �θαT β(x)

+(1/4)(θT �θ)2 αT γ μ γ5 β,μ (x) Finally, we ﬁnd that αT L((θT �θ)2 K(x)) = αT (−4γ5 θθT �θK(x))

Quantum Mechanics for Scientists and Engineers 192

195

Now denoting the inifnitesimal supersymmetry transformation by δ = αT L we obtain on comparing the coeﬃcients of diﬀerent powers of θ, the following transformation rules for the components of the superﬁeld under a supersymmetry transformation: δC(x) = −αT γ5 ω(x) θT �δω(x) = αT γ μ θC,μ (x) −2αT γ5 θM (x) − 2αT θN (x) −2αT γ5 γ μ θVμ (x)

or equivalently,

δω(x) = −γ μ �αC,μ (x) +2γ5 �αM (x) +2�αN +2γ μ �γ5 αVμ (x)

where we have used the fact that γ μ � and � are skew symmetric. Further, θT �θδM (x) + θT γ5 �θδN (x) + θT �γ μ θδVμ (x) = αT γ μ θθT �ω,μ (x)) −2αT γ5 θθT γ5 �β(x)) from which we deduce by expanding θθ as earlier, T

−θT �θαT β(x) δM (x) =

(1/4)αT γ μ ω,μ (x) −(3/2)αT β(x),

δN (x) = (1/4)θT γ μ γ5 ω,μ (x) +(1/2)αT γ5 β(x) δVμ (x) = (1/4)αT γ ν γμ ω,ν (x)) (1/2)αT γμ β(x) where we have used the fact that γ5 commutes with � and anticommutes with γ μ . Note that if we have an equation of the form a(x)θθT =a quadratic polynomial in θ, then we can expand θθT as a linear combination of θT �θ, θT γ5 �θ and θT �γ μ θ and equate the corresponding coeﬃcients. We also have the useful fact that if any two of these six distinct quadratic polynomials are multiplied, we get zero. Next, equating cubic terms in θ, (θT �θ)θT γ5 �δβ(x) = θT �θαT γ μ θM,μ (x)) +θT γ5 �θαT γ μ θN,μ (x) +θT �γ μ θαT γ ν θVμ,ν −4αT γ5 θθT �θK(x))

Quantum Mechanics for Scientists and Engineers 193

196

from which we deduce again by the above mentioned principles, (multiply both sides by θ, expand θθT as above and use the fact that the product of this quantity with θT γ5 �θ is zero and also its product with θT �γ μ θ is zero: (1/4)(θT �θ)2 γ5 δβ(x) = −(1/4)θT �θ)2 �γ μT αM,μ (x)

−(1/4)(θT γ5 �θ)2 γ5 �γ μT αN,μ (x)

(−1/4)(θT �γ μ )θ)(θT �γ ρ θ)γρ �γ νT αVμ,ν +(θT �θ)2 �γ5 αK(x) Equivalently,

γ5 δβ(x) = −�γ μT αM,μ (x) + 4�γ5 αK(x) +γ5 �γ μT αN,μ (x)

−η μρ γρ �γ νT αVμ,ν

Here, we make use of the following easily veriﬁable identities:

(θT �θ)2 = 8θ0 θ1 θ2 θ3 (θT γ5 �θ)2 = −8θ0 θ1 θ2 θ3 (θT �γ μ θ)(θT �γ ν θ) = = 8θ0 θ1 θ2 θ3 η μν Multiplying both sides of the above identity by γ5 gives us δβ(x) = −γ5 �γ

μT

αM,μ (x) + 4�αK(x)

+�γ μT αN,μ (x)

−γ5 γ μ �γ νT αVμ,ν

Finally, equating the fourth degree term in θ gives us

(θT �θ)2 δK(x) = (1/4)(θT �θ)2 αT γ μ γ5 β,μ (x) or equivalently, Now writing and

δK(x) = (1/4)αT γ μ γ5 β,μ (x) β(x) = λ(x) + a.γ μ ω,μ (x) K(x) = D(x) + b∂ μ ∂μ C(x)

and using {γ5 , γ μ } = 0 and {γ μ , γ ν } = η μν I4 , we get δD(x) + b.∂ μ ∂μ δC(x) = (1/4)αT γ ν γ5 (λ,ν + a.γ μ ω,μν ) So we would expect that

= (−1/4)αT γ5 γ ν λ,ν − (a/8)γ5 ∂ μ ∂μ ω δD(x) = (−1/4)αT γ5 γ ν λ,ν , bδC(x) = (−a/8)αT γ5 ω(x)

Quantum Mechanics for Scientists and Engineers 194

197

On the other hand, we have already seen that δC(x) = −αT γ5 ω(x) So we must choose

a = 8b

We note that δD(x) is a perfect four divergence and hence its integral over the whole of four dimensional space-time is zero. Thus, D(x)d4 x is invariant under the supersymmetry transformation δ = αT L and hence is a candidate Lagrangian density for supersymmetric theories. We also have δβ(x) = δλ(x) + a.γ μ δω,μ (x) −γ5 �γ μT αM,μ (x) +�γ μT αN,μ (x)

−γ5 γ μ �γ νT αVμ,ν

+4�α(D(x) + b∂ μ ∂μ C(x)) So,

δλ(x) = −γ5 �γ μT αM,μ (x) +�γ μT αN,μ (x)

−γ5 γ μ �γ νT αVμ,ν

+4�α(D(x) + b∂ μ ∂μ C(x)) −a.γ μ (−γ ν �αC,ν (x)

+2γ5 �αM (x) + 2�αN +2γ ν �γ5 αVν (x)),μ or equivalently using

γ5 γ μ = −γ μ γ5 ,

and

�γ μT = γ μ �,

we get

δλ = (1 − 2a)(γ μ γ5 �αM,μ +(1 − 2a)γ μ �αN,μ

+(γ γ − 2aγ ν γ μ )�γ5 αVμ,ν μ ν

+4�α(D + b∂μ ∂ μ C) + aγ μ γ ν C,μν and hence, it follows by taking a = 1/2 and noting that (γ μ γ ν C,μν = (1/2)∂μ ∂ μ C, that δλ = [γ μ , γ ν ]�γ5 αVμ,ν + 4�αD Suppose we choose D = 0, λ = 0. Then, It follows then that if Vμ,ν − Vν,μ = 0, then

δD = 0, δλ = [γ μ , γ ν ]�γ5 αVμ,ν δλ = 0

This means that a superﬁeld S[x, θ] with the constraints D = 0, λ = 0 after a supersymmetry transformation, again maintains the same constraints provided that Vμ,ν − Vν,μ = 0, ie, provided that Vμ (x) = ∂μ Z(x)

Quantum Mechanics for Scientists and Engineers 195

198

for some scalar ﬁeld Z(x). A superﬁeld S with these constraints, ie, D = 0, λ = 0 is called a Chiral ﬁeld. We denote such a superﬁeld by Φ(x, θ). Remark: Apparently, some errors in the signs of the variables occur above. These are easily rectiﬁed if we take into account that α is a Fermionic parameter vector which anticommutes with θ. We would then have αT θ = −θT α and more generally for a bosonic matrix A, we would have αT Aθ = −θT Aα. [18] Quantum Control in the sense of Belavkin and Luc Bouten: The HP equation describing unitary evolution in the tensor product of the system Hilbert space and the Boson Fock space of the noisy bath is given by dU (t) = (−i(H + L∗ L/2)dt + LdA∗ (t) − L∗ SdA(t) + (S − I)dΛ(t))U (t) where S ∗ S = I, H ∗ = H. Using quantum Ito’s formula dA.dA∗ = dt, dAdΛ = dA, dΛ.dA∗ = dA∗ , (dΛ)2 = dΛ∗ = dΛ with all the other product diﬀerentials zero (ie, of o(dt)). The corresponding quantum process equation describing noisy evolution of a system observable X coupled to the bath is given by jt (X) = U (t)∗ XU (t) = U (t)∗ (X ⊗ I)U (t) Application of quantum Ito’s formula gives djt (X) = jt (θ0 (X))dt + jt (θ1 (X))dA(t) + jt (θ2 (X))dA(t)∗ + jt (θ3 (X))dΛ(t) where the structure maps θk , k = 0, 1, 2, 3 of this Evans-Hudson ﬂow are given by θ0 (X) = i[H, X] − (1/2)(L∗ LX + XL∗ L − 2L∗ XL) θ1 (X) = [L∗ , X]S, θ2 (X) = S ∗ [X, L], θ3 (X) = S ∗ XS − X

The measurement process is assumed to be

Yout (t) = U (t)∗ Yin (t)U (t), Yin (t) = A(t) + A(t)∗ This is a non-demolition measurement in the sense of Belavakin: [Yout (t), Yout (s)] = [Yout (t), js (X)] = 0, s ≥ t We have by quantum Ito’s formula, dYout (t) = dYin (t) + jt (S − I)dA + jt (S ∗ − I)dA∗ + jt (L + L∗ )dt = jt (S)dA + jt (S ∗ )dA∗ + jt (L + L∗ )dt Let and

ηout (t) = {Yout (s) : s ≤ t} πt (X) = E(jt (X)|ηout (t))

where expectations are taken w.r.t. the state |f ⊗ ψ(β) > where |f >∈ h, < f, f >= 1 with h being the system Hilbert space in which the operators H, S, L, X are deﬁned and |ψ(β) >= exp(− � β �2 /2)|e(β) > with |e(β) > being the exponential vector ∞ √ β ⊗n / n! |e(β) >= n=0

where β ∈ L2 (R+ ). If β = 0, then the bath is in the vacuum state. Belavkin’s ﬁltering equation is dπt (X) = πt (Lt X) + (πt (XMt + Mt∗ X) − πt (Mt + Mt∗ )πt (X))dW (t) where

¯ dW (t) = dYout (t) − (πt (Mt + Mt∗ ) + β(t) + β(t))dt

Quantum Mechanics for Scientists and Engineers 196

and

199

Mt = L + (S − I)β(t),

2 ¯ Lt X = θ0 (X) + β(t)θ1 (X) + β(t)θ 2 (X) + |β(t)| θ3 (X)

In the special case when β = 0 (vacuum state of the bath), we get

Mt = L, Lt X = θ0 (X) = i[H, X] − (1/2)(L∗ LX + XL∗ L − 2L∗ XL) Writing

πt (X) = T r(ρ(t)X)

where ρ(t) can be regarded as a classical random process with values in the space of density operators in the system Hilbert space h, we can express the Belavkin equation as dρ(t) = L∗t (ρ(t))dt + ((Mt ρ(t) + ρ(t)Mt∗ ) − T r(ρ(t)(Mt + Mt∗ ))ρ(t))(dYout (t) − T r(ρ(t)(Mt + Mt∗ )) + 2Re(β(t)))dt) It is easy to see that where

∗ 2 ∗ ¯ L∗t (ρ) = θ0∗ (ρ) + β(t)θ1∗ (ρ) + β(t)θ 2 (ρ) + |β(t)| θ3 (ρ)

θ0∗ (ρ) = −i[H, ρ] − (1/2)(L∗ Lρ + ρL∗ L − 2LρL∗ ) θ1∗ (ρ) = SρL∗ − L∗ Sρ = [Sρ, L∗ ], θ3∗ (ρ) = SρS ∗ − ρ

Note that for any map θ in B(h), the Banach space of bounded operators in h, we deﬁne its dual map θ∗ by the prescription T r(Y θ(X))T r(θ∗ (Y )X) Sometimes it is more convenient to deﬁne the dual map by the prescription T r(Y ∗ θ(X)) = T r(θ∗ (Y )∗ X) This is true especially if the operators in question are non-Hermitian. This latter deﬁnition can be expressed as < Y, θ(X) >=< θ∗ (Y ), X > where < ., . > is the inner product on B(h) deﬁned by < X, Y >= T r(X ∗ Y ) We note that

and hence

dW (t) = dYout (t) − (πt (Mt + Mt∗ ) + 2Re(β(t)))dt

= jt (S)dA(t) + jt (S ∗ )dA(t)∗ + jt (L + L∗ )dt − (πt (Mt + Mt∗ ) + 2Re(β(t))dt E(dW (t)|ηout (t)) = ¯ πt (S)β(t)dt + πt (S ∗ )β(t)dt − πt (L + L∗ )dt − πt (Mt + Mt∗ )dt − 2Re(β(t))dt =0

and further,

(dW (t))2 = jt (SS ∗ )dt = dt

by quantum Ito’s formula. Hence, the innovations process {W (t) : t ≥ 0} is a classical Brownian motion w.r.t. the ﬁltration {ηout (t), t ≥ 0}. We now assume that t = 0 so that t + dt = dt. We choose a Hermitian system observable Z and apply the control unitary c = exp(iZdY (t)) Udt where Y = Yout . Since t = 0, jt = j0 = I and hence dY (t) = SdA(t) + S ∗ dA(t)∗ + (L + L∗ )dt − ((Mt + Mt∗ ) + 2Re(β(t))dt ¯ = S(dA(t) − β(t)dt) + S ∗ (dA(t)∗ − β(t)dt)

Quantum Mechanics for Scientists and Engineers 197

200

Note that in the general case, ie, t > 0, we have dW (t) = jt (S)dA(t) + jt (S ∗ )dA(t)∗ + jt (L + L∗ )dt − (πt (Mt + Mt∗ ) + 2Re(β(t))dt ¯ = (jt (S)dA − πt (S)β(t)dt) + (jt (S ∗ )dA∗ − πt (S ∗ )β(t)dt) +(jt (L + L∗ ) − πt (L + L∗ ))dt

We also note that

dYout (t) = dW (t)

in the special case when β = 0 and L = −L. The resulting state at time t + dt = dt after applying the control under the condition β = 0 (so that Mt = L) is given by ∗

ρc (dt) = Uc (dt)ρ(dt)dUc (dt)∗ (I + iZdY − Z 2 dt/2)(ρc (0) + L∗ (ρc (0))dt + ((Lρc (0) + ρc (0)L∗ ) − T r(ρc (0)(L + L∗ ))ρc (0))dW (t))(I − iZdY − Z 2 dt/2) [19] Proof of the Shannon Cq coding theorem. A is a ﬁnite alphabet and for each x ∈ A, ρ(x) is a density matrix in the ﬁnite dimensional Hilbert space H = CN . Let P be a probability distribution on A and deﬁne the set of all δ-typical sequences of length n: T (P, n, δ) = {u ∈ An : |N (x|u) − nP (x)| ≤ δ nP (x)(1 − P (x))∀x ∈ A}

Here, N (x|u) is the number of times x occurs in the sequence u. It is clear that u ∈ T (P, n, δ) implies 2−nS(P )−Kδ where

S(P ) = −

√ n

≤ P ⊗n (u) = Πx∈A P (x)N (x|u) ≤ 2−nS(P )+Kδ

P (x)log(P (x)), K =

x∈A

√

n

P (x)(1 − P (x))log(P (x))

x∈A

The greedy algorithm: Choose the maximal integer M such that for a given probability distribution Pon A and M given � > 0 there exist sequences u1 , ..., uM ∈ T (P, n, δ) and operators D1 , ..., DM ≥ 0 in H such that i=1 Di ≤ I, T r(ρ(ui )Di ) ≥ 1 − �∀i and Di ≤ E(n, ui , δ)∀i. Here, E(ρ(x)⊗N (x|u) , δ) E(n, u, δ) = x∈A

where if ρ is any density matrix and n any positive integer, then if ρ= |i > p(i) < i| i

is the spectral representation of ρ, then

E(ρ⊗n , δ) =

N (i|i1 ,...,in )−np(i)|≤δ

It is easily seen that where

2−nS(ρ)−δK1 S(ρ) = −

i

This simple fact is based on the identity

√

n

√

np(i)(1−p(i))∀i

≤ ρ⊗n E(ρ⊗n , δ) ≤ 2−nS(ρ)+δK1

p(i)log(p(i)), K1 =

i

√

n

p(i)(1 − p(i))log(p(i))

p(i1 )...p(in ) = Πi p(i)N (i|i1 ,...,in ) = 2 and

|i1 , ..., in >< i1 , ..., in |

i

N (i|i1 ,...,in )log(p(i))

ρ⊗n |i1 , ..., in >= p(i1 )...p(in )|i1 , ..., in >

Quantum Mechanics for Scientists and Engineers 198

201

It follows that on setting ρ⊗n (u) =

ρ(x)⊗N (x|u)

x∈A

we get 2−n

x∈A

Pu (x)S(ρ(x))−δ

x∈A

K1 (x)

√

N (x|u)

≤ ρ⊗n (u)E(n, u, δ) ≤ 2−n

where

x∈A

Pu (x)S(ρ(x))+δ

x∈A

K1 (x)

√

N (x|u)

Pu (x) = N (x|u)/n

and

K1 (x) = Pρ(x) (i)(1 − Pρ(x) (i))log(Pρ(x) (i)) i

where

ρ(x) =

i

is the spectral representation of ρ(x). Writing

|i > Pρ(x) (i) < i|

K0 =

K1 (x)

x∈A

it follows that

2−n

x∈A

Pu (x)S(ρ(x))−δK0

√

n

Remark: If u ∈ T (P, n, δ), then

≤ ρ⊗n (u)E(n, u, δ) ≤ 2−n

|N (x|u) − nP (x)| ≤ δ and hence and so

P (x) − δ

x∈A

Pu (x)S(ρ(x))+δK0

√

n

nP (x)(1 − P (x))∀x ∈ A

P (x)(1 − P (x))/n ≤ Pu (x) = N (x|u)/n ≤ P (x) + δ

P (x)(1 − P (x))/n, x ∈ A

√ √ P (x) − δ/ 2n ≤ Pu (x) ≤ P (x) + δ/ 2n, x ∈ A

In other words, for any � > 0, there exists n0 such that n ≥ n0 implies

|Pu (x) − P (x)| ≤ �, x ∈ A, u ∈ T (P, n, δ) where δ > 0 is given. It follows from this observation and the above inequality that if u ∈ T (P, n, δ), then there is a K2 independent of n such that 2−n

x∈A

P (x)S(ρ(x))−δK2

√

n

≤ ρ⊗n (u)E(n, u, δ) ≤ 2−n

For example, we can take K2 = K0 +

x∈A

P (x)S(ρ(x))+δK2

√

n

(P (x)(1 − P (x))1/2 S(ρ(x))

x∈A

A lower bound on M in the greedy algorithm: M Lemma: In the greedy algorithm, deﬁne i=1 Di = D. Then, T r(ρ(u)D) ≥ γ∀u ∈ T (P, n, δ). For suppose T r(ρ(u)D) ≤ γ for some u ∈ T (P, n, δ). Then since γ ≤ 1 − �, it follows that u ∈ / {u1 , ..., uM }. Now deﬁne D� = (1 − D)1/2 E(n, u, δ)(1 − D)1/2 Then, (writing ρ(u) for ρ⊗n (u) = ⊗x∈A ρ(x)⊗N (x|u) ), we have T r(ρ(u)D� ) = T r(ρ(u)E(n, u, δ)) − T r(ρ(u)(E(n, u, δ) − (1 − D)1/2 E(n, u, δ)(1 − D)1/2 )) ≥ T r(ρ(u)E(n, u, δ))− � ρ(u) − (1 − D)1/2 ρ(u)(1 − D)1/2 �1 ≥ T r(ρ(u)E(n, u, δ)) − α.T r(ρ(u)D) ≥ T r(ρ(u)E(n, u, δ)) − αγ

Now observe that for any density ρ,

{|N (i|i1 , ..., in ) − nPρ (i)| ≤ δ nPρ (i)(1 − Pρ (i)))

T r(ρ⊗n E(ρ⊗n , δ)) = Pρ⊗n (

i

Quantum Mechanics for Scientists and Engineers 199

202

≥1−

i

P (|N (i|i1 , ..., in ) − nPρ (i)| ≥ δ

nPρ (i)(1 − Pρ (i))

≥ 1 − N/δ 2 − − − (1)

where we have used the union bound, the Chebyshev inequality and the fact that for each i, N (i|i1 , ..., in ) is a random variable having mean nPρ (i) and variance nPρ (i)(1 − Pρ (i)). Note that we also have P (T (P, n, δ)c ) = P (u ∈ An : |N (x|u) − nP (x)| > δ nP (x)(1 − P (x))f orsomex ∈ A) ≤ P (u ∈ An : |N (x|u) − nP (x)| > δ nP (x)(1 − P (x))) x∈A

by the union bound and Chebyshev’s inequality. Thus,

≤ a/δ 2

P (T (P, n, δ)) ≥ 1 − a/δ 2 From the above arguments (ie equn(1)), T r(ρ(u)E(n, u, δ)) = Πx∈A T r(ρ(x)⊗N (x|u) E(ρ(x)N (x|u) , δ)) ≥ (1 − N/δ 2 )a

Note:If δ = O(n ) where 0 < s < 1, then (1 − N/δ 2 )a , then it follows that T r(ρ(u)E(n, u, δ)) can be made as close √ θ to unity as possible by choosing n suﬃciently large and quantities like 2−nS(ρ)±K δ n are of the order 2−nS(ρ)±K”n where 0 < θ < 1. Equivalently, the logarithm of these quantities divided by n would converge to −S(ρ) as n → ∞ with the probabilities T r(ρ(u)E(n, u, δ)) converging to zero. We have now derived the inequality s/2

T r(ρ(u)D� ) ≥ T r(ρ(u)E(n, u, δ)) − αγ ≥ (1 − N/δ 2 )a − αγ ≥ 1 − � if for suﬃciently large n provided δ is allowed to vary with n as in the above note and γ < �/α is assumed. We thus get a contradiction to the maximality of M on using the fact that 0 ≤ D� ≤ 1 − D ie

D + D� ≤ 1

This completes the proof of the lemma. A lower bound on M: Lemma: Let 0 ≤ Z, T ≤ be operators and ρ a density operator in a Hilbert space such that for some θ > 0, we have ρT ≤ θT . Then, T r(Z) ≥ θ−1 (T r(ρZ) − T r(ρ(1 − T )))

An intuitive interpretation of this inequality: Z, T are viewed as events. If Z occurs, then T may or may not occur. If Z and T occur, then the probability of this is the trace of ρT ∩ Z which is smaller than the trace of θT ∩ Z which in turn is smaller than the trace of θZ. If Z occurs and T does not occur then the probability of this is the trace of ρZ ∩ (1 − T ) which is smaller than the trace of ρ(1 − T ). Combining these two facts, we get T r(ρZ) ≤ θT r(Z) + T r(ρ(1 − T )) and the proof of the Lemma is complete. Now deﬁne

ρ¯ =

P (x)ρ(x)

x∈A

Then, we have where

¯ 3δ ρ⊗n , δ) ≤ 2−nS(ρ)+K ρ¯⊗n E(¯

K3 =

√ n

E(¯ ρ⊗n , δ)

Pρ (i)(1 − Pρ (i))log(Pρ (i)) j

Quantum Mechanics for Scientists and Engineers 200

203

Hence by the above lemma, ¯ 3δ T r(D) ≥ 2nS(ρ)−K

√

n

(T r(¯ ρ⊗n D) − T r(¯ ρ(¯ ρ⊗n (1 − E(¯ ρ⊗n , δ)))

Now, since

ρ¯⊗n =

P ⊗n (u)ρ(u)

u∈An

it follows that

and further,

Thus, On the other hand,

T r(¯ ρ⊗n D) ≥

P ⊗n (u)T r(ρ(u)D)

u∈T (P,n,δ)

≥ γP ⊗n (T (P, n, δ)) ≥ γ(1 − N/δ 2 ) T r(¯ ρ⊗n E(¯ ρ⊗n , δ)) = Pρ¯⊗n (T (Pρ¯, n, δ)) ≥ 1 − N/δ 2 ¯ 3δ T r(D) ≥ 2nS(ρ)−K

T r(D) =

M

T r(Di ) ≤

i=1

≤ M.2n

Combining the above two inequalities gives us

x∈A

¯ M ≥ (1 − 2N/δ 2 )2n(S(ρ)−

√

n

(1 − 2N/δ 2 )

M

T r(E(n, ui , δ))

i=1

√ P (x)S(ρ(x))+K5 δ n

x∈A

√ Pu (x)S(ρ(x)))−K5 δ n

− − − (2)

which is the desired lower bound on M . Remark: Here we have made used of the following fact: ρ(x)⊗N (x|u) E(ρ(x)⊗N (x|u) , δ) ρ(u)E(n, u, δ) = x∈A

≥ (Πx∈A 2

−N (x|u)S(ρ(x))−δ

where as deﬁned earlier

K1 (x) =

√

N (x|u))K1 (x)

E(n, u, δ)

Pρ(x) (i)(1 − Pρ(x) (i))log(Pρ(x) (i) i

It follows that

T r(E(n, u, δ)) ≤ 2n

x∈A

√ Pu (x)S(ρ(x))+K5 δ n

and hence (2) is obtained. Note that Pui (x) diﬀers from P (x) by an amount of order K1 (x) K5 =

√

n since ui ∈ T (P, n, δ). Here,

x∈A

An upper bound on M: Consider the spectral representation of ρ¯: ρ¯ = |i > Pρ¯(i) < i| i

and with respect to the onb {|i >: 1 ≤ i ≤ N } of this representation, deﬁne the density operators |i >< i|ρ(x)|i >< i|, x ∈ A ρ˜(x) = i

˜ Then, ρ˜(x), x ∈ A form a commuting family of operators in H = CN . Deﬁne the typical projections E(n, u, δ) in the usual way corresponding to this family ρ˜(x), x ∈ A. In other words, if |j1 , ..., jN (x|u) > are the eigenvectors of ρ˜(x)⊗N (x|u) , then ρ˜(x)⊗N (x|u) ρ˜(u) = x∈A

Quantum Mechanics for Scientists and Engineers 201

204

˜ u, δ) = E(n,

E(˜ ρ(x)⊗N (x|u) , δ)

x∈A

=

x∈A (j1 ,...,jN (x|u) )∈T (j,x)∀j

where with

|j1 , ..., jN (x|u) >< j1 , ..., jN (x|u) |

T (j, x) = {(j1 , ..., jN (x|u) ) : N (j|j1 , ..., jN (x|u) ) − N (x|u)p(j|x)| ≤ δ

ie the probability of the j

p(j|x) =< j|ρ(x)|j > th

N (x|u)p(j|x)(1 − p(j|x))}

eigenvector of ρ¯ occurring given that the system is in the state ρ(x). Note that we can write ˜ u, δ) = E(n, |v >< v|

where the summation is over all n-length sequences v of the form v= (j1 , ..., jN (x|u) ) x∈A

with

(j1 , ..., jN (x|u) ) ∈ T (j, x)∀j = 1, 2, ..., N

It immediately follows that

N (j|v) =

N (j|j1 , ..., jN (x|u) )

x∈A

Suppose Pu (x) = P (x), x ∈ A. Then |N (j|v) − nPρ¯(j)| = | ≤

x∈A

≤

x∈A

N (j|j1 , ..., jN (x|u) ) −

nP (x)p(j|x)|

x∈A

|N (j|j1 , ..., jN (x|u) − p(j|x)N (x|u)|

δ N (x|u)p(j|x)(1 − p(j|x))

x∈A

√ P (x)p(j|x)(1 − p(j|x)) =δ n x∈A

√ P (x))1/2 .( p(j|x)(1 − p(j|x)))1/2 ≤ δ n( x

x

√ ≤ δ an

From this, it follows that

√ v ∈ T (Pρ¯, n, δ a)

Thus, we have proved that

√ ˜ u, δ) ≤ E(¯ E(n, ρ⊗n , δ a)

whenever Pu (x) = P (x)∀x ∈ A. Now deﬁne

˜ ui , δ)Di E(n, ˜ ui , δ), i = 1, 2, ..., M Di� = E(n, Then, Di� ≤

M

√ √ E(¯ ρ⊗n , δ a)Di E(¯ ρ⊗n , δ a)

i=1

√ √ ρ⊗n , δ a) = E(¯ ρ⊗n , a δ)DE(¯ ≤I

Quantum Mechanics for Scientists and Engineers

205

Now, we observe that

ρ(u)E(n, u, δ) =

ρ(x)⊗N (x|u) E(ρ(x)⊗N (x|u) , δ)

x∈A

≤ (Πx∈A 2−N (x|u)S(ρ(x))+δ

where as deﬁned earlier

K1 (x) =

√

N (x|u))K1 (x)

E(n, u, δ)

Pρ(x) (i)(1 − Pρ(x) (i))log(Pρ(x) (i) i

and hence

ρ(u)E(n, u, δ) ≤ 2−n

x

√ Pu (x)S(ρ(x))+K5 δ n

E(n, u, δ)

and if we assume further that u ∈ T (P, n, δ), then we get ρ(u)E(n, u, δ) ≤ 2−n since for such u,

x

√ P (x)S(ρ(x))+K6 δ n

E(n, u, δ) − − − (3)

√ |Pu (x) − P (x)| ≤ δ/ 2n

as observed earlier. It follows from (3) and the above lemma that T r(Di� ) ≥ 2n Now we have

x

√ P (x)S(ρ(x))−K6 δ n

(T r(ρ(ui )Di� ) − T r(ρ(ui )(1 − E(n, ui , δ)))

˜ ui , δ)ρ(ui )E(n, ˜ ui , δ))Di ) T r(ρ(ui )Di� ) = T r(ρ(ui )Di ) − T r((ρ(ui ) − E(n, ˜ ui , δ)ρ(ui )E(n, ˜ ui , δ) �1 ≥ 1 − �− � ρ(ui ) − E(n, ˜ ui , δ))) ≥ 1 − � − βT r(ρ(ui )(1 − E(n,

Let π denote any speciﬁc rank one projection of the form |i1 , ..., in >< i1 , ..., in | where |i1 , ..., , in > are eigenvectors of ρ¯⊗n obtained by tensoring the orthonormal basis of eigenvectors of ρ¯. Then, we have ˜ ui , δ)) = ˜ ui , δ)π) T r(ρ(ui )π E(n, T r(ρ(ui )E(n, π

=

˜ ui , δ)) T r(πρ(ui )π E(n,

π

˜ ui , δ)) = T r(˜ ρ(ui )E(n, ≥ 1 − N/δ 2

Thus,

˜ ui , δ)) ≤ N/δ 2 T r(ρ(ui )E(n,

and also Thus, we get

T r(ρ(ui )E(n, ui , δ)) ≤ N/δ 2 T r(Di� ) ≥ 2n = 2n

It follows that T r(

M i=1

x

x

√ P (x)S(ρ(x))−K6 δ n √

P (x)S(ρ(x))−K6 δ n

Di� ) ≥ M.2n

x

(1 − � − βN/δ 2 − N/δ 2 )

(1 − � − (β + 1)N/δ 2 )

√ P (x)S(ρ(x))−K6 δ n

(1 − � − (β + 1)N/δ 2 )

on the one hand and on the other,

T r(

M i=1

Di� ) =

M

˜ ui , δ)Di E(n, ˜ ui , δ)) T r(E(n,

i=1

We have already noted that if P = Pu , then √ ˜ u, δ) ≤ E(¯ E(n, ρ⊗n , δ a)

Quantum Mechanics for Scientists and Engineers 203

206

so if we assume that Pui = P ∀i, then we get T r(

M i=1

and hence we get

√ √ Di� ) ≤ T r(E(¯ ρ⊗n , δ a)DE(¯ ρ⊗n , δ a) √ ≤ T r(E(¯ ρ⊗n , δ a)) ¯ 7δ ≤ 2nS(ρ)+K

¯ 7 2nS(ρ)+K

√

an

≥ M.2n

x

√

an

√ P (x)S(ρ(x))−K6 δ n

(1 − � − (β + 1)N/δ 2 )

from which we get the upper bound on M in the special case when Pui = P ∀i:

¯ M ≤ (1 − � − (β + 1)N/δ 2 )−1 2n(S(ρ)−

x

P (x)S(ρ(x)))+(K6 +K7

√

√ a)δ n

Suppose now that u1 , ..., uM , D1 , ..., DM are as in the greedy algorithm. Let Q be any empirical probability distribution on A corresponding to the integer n. This means that there is a sequence u ∈ An such that Q(x) = N (x|u)/n, x ∈ A. It is clear that the number of empirical probability distributions on A corresponding to n cannot exceed (n + 1)a . (Each symbol x ∈ A can occur in a sequence of length n, k times where k = 0, 1, ..., n). We now consider the subset FQ of all integers 1, 2, ..., M such that ui is of empirical type Q, ie, N (x|ui )/n = Q(x), ∀x ∈ A. Let MQ denote the cardinality of FQ . Then it is clear that M= MQ Q

a where the summation is over all empirical distributions Q, there being atmost (n + 1) of them. By the above inequality, we have since T r(ρ(ui )Di ) ≥ 1 − �, Di ≤ E(n, ui , δ), ∀i ∈ FQ and i∈FQ Di ≤ I,

MQ ≤ (1 − � − (β + 1)N/δ 2 )−1 2n(S(ρ¯Q )−

x

Q(x)S(ρ(x)))+(K6 +K7

≤ 1 − � − (β + 1)N/δ 2 )−1 2nC+(K6 +K7

where

ρQ =

√

√

√ a)δ n

√ a)δ n

Q(x)ρ(x)

x

and

C = maxP (S(

x

P (x)ρ(x)) −

P (x)S(ρ(x)))

x

the maximum being taken over all probability distributions P on A. Summing this over all empirical distributions Q of length n on A gives us √ √ M= MQ ≤ (n + 1)a .(1 − � − (β + 1)N/δ 2 )−1 2nC+(K6 +K7 a)δ n Q

which is the desired upper bound on M . √ A remark: Now suppose we do not assume Pui = P ∀i. Since ui ∈ T (P, n, δ), we have |Pui (x) − P (x)| ≤ δ/ 2n∀i, x. ˜ u, δ) in the same way as above except that we replace P by Pu . Then, by the same arguments For any u, we deﬁne E(n, as above, we have √ ˜ u, δ) ≤ E(¯ E(n, ρ⊗n u , δ a) where

ρ¯u =

Pu (x)ρ(x)

x∈A

˜ leading to the results: We proceed in the same way as above by deﬁning Di� in terms of the new E ˜ ui , δ)Di E(n, ˜ ui , δ) D� = Di� = E(n, i

i

= E(¯ ρ

⊗n

√ √ ρ⊗n , δ a)+ , δ a)DE(¯

Quantum Mechanics for Scientists and Engineers

i

where

207

¯ i (Ei − E) ¯ + (Ei − E)D

i

¯ iE ¯ ¯ + ED ¯ i (Ei − E) (Ei − E)D

√ ˜ ui , δ), E ¯ = E(¯ Ei = E(n, ρ⊗n , δ a)

Thus, in terms of norms,

¯ + 2.maxi � Ei − E ¯ �2 +maxi � Ei − E �1 T r(D� ) ≤ T r(E) 1

˜ u, δ) Note that ui ∈ T (P, n, δ). Now if u ∈ T (P, n, δ), then Pu is close to P , hence ρ¯u will be close to ρ¯ and so E(n, ¯ and we would get that T r(D� ) cannot exceed T r(E) ¯ by an order of an ¯ Thus, Ei will be close to E will be close E. ¯ �2 cannot grow faster than an exponential of np where exponential of n provided that we are able to show that � Ei − E √ ¯ x P (x)S(ρ(x))+Kδ n would follow. To make this more p < 1). Hence, the desired lower bound of the form M ≤ 2n(S(ρ)− precise, we observe that u ∈ T (P, n, δ) implies |Pu⊗n (v) − P ⊗n (v)| = |Πx∈A Pu (x)N (x|v) − Πx∈A P (x)N (x|v) |

= |e

x

=≤ |exp( We can write

N (x|v)log(Pu (x))

x

−e

x

N (x|v)log(P (x))

|

N (x|v)(log(Pu (x)) − log(P (x))) − 1|

√ Pu (x) = P (x) + f (x)/ n, x ∈ A, |f (x)| ≤ 1, f (x) = 0 x

Hence,

|exp(

x

|Pu⊗n (v) − P ⊗n (v)| ≤ √ N (x|v)log(1 + f (x)/P (x) n)) − 1| = |exp( N (x|v)(f (x)/P (x) n − f (x)2 /2P (x)2 n + ...)) − 1| √

x

√ If v ∈ / T (P, n, δ), then N (x|v) < nP (x) + O( n) in which case we see that √ √ N (x|v)(f (x)/P (x) n − f (x)2 /2P (x)2 n + ...) < O(1/ n) x

since

√ f (x) = 0. If v ∈ T (P, n, δ), then N (x|v) = nP (x) + g(x) n where |g(x)| < 1. Hence, in this case, √ √ N (x|v)(f (x)/P (x) n − f (x)2 /2P (x)2 n + ..) = (g(x)f (x)/P (x) − f (x)2 /2P (x)) + O(1/ n) x

x

√ = c0 + O(1/ n)

Other useful inequalities are =|

x

|log(Pu⊗n (v)) − log(P ⊗n (v))| = N (x|v)log(Pu (x)) − N (x|v)log(P (x))| = |

x

√ N (x|v)(log(1 + f (x)/P (x) n)|

x

[20] Restricted quantum gravity in one spatial dimension and one time dimension. The metric is dτ 2 = (1 + 2U (t, x))dt2 − (1 + 2V (t, x))dx2 The position ﬁelds are U (t, x) and V (t, x) and to ﬁnd the momentum ﬁelds, we must ﬁrst evaluate the Lagrangian density √ β β α L = g μν −g(Γα μν Γαβ − Γμβ Γαβ )

This Lagrangian density is a function of U, V, U,μ , V,μ . Deﬁne the position ﬁelds as U, V and the canonical momentum ﬁelds as ∂L ∂L , πV = πU = ∂U,0 ∂V,0

Quantum Mechanics for Scientists and Engineers

208

Then, apply the Legendre transformation after solving for U,0 , V,0 in terms of U, V, ∇U, ∇V to get the Hamiltonian density as H(U, V, ∇U, ∇V, πU , πV ) = πU U,0 + πV V,0 − L Then, set up the Schrodinger wave equation ( H(U (x), V (x), ∇U (x), ∇V (x), −iδ/δU (x), −iδ/δV (x))dx)ψt ({U (x), V (x) : x ∈ R) =i

∂ ψt ({U (x), V (x) : x ∈ R) ∂t

References [1] Rohit Singh, Naman Garg and H.Parthasarathy, ”Statistical modeling of transmission lines using stochastic differential equations with random loading eﬀects along the line”, Technical report, NSIT, 2016. [2] Rohit Singh, Naman Garg and H.Parthasarathy, ”Quantization of transmission line equations using the GKSL formalism”, Technical report, NSIT, 2016. [3] Naman Garg, H.Parthasarathy and D.K.Upadhyay, ”Real time simulation of Hudson-Parthasarathy noisy Schrodinger equation and Belavkin equation”, Technical report, NSIT, 2016. [4] Naman Garg, H.Parthasarathy and D.K.Upadhyay, ”Real time control of the the Hudson-Parthasarathy noisy Schrodinger equation with observer obtained from Belavkin’s ﬁlter, Technical report, NSIT, 2016. [5] D.W.Stroock and S.R.S.Varadhan, Multidimensional Diﬀusion Processes, Springer, 1997. [6] L.D.Landau and E.M.Lifshitz, ”The classical theory of ﬁelds”, Butterworth and Heinemann. [7] Rohit Singh, Jyotsna Singh and H.Parthasarathy, ”Application of wavelets to partial diﬀerential equation based modeling of images, Technical Report, NSIT, 2016. [8] H.Parthasarathy, ”Time travel in the special and general theories of relativity and the notions of space and time in quantum gravity, Lecture delivered at the G.B.Pant University, 15th September, 2016. [9] Rohit Singh, Jyotsna Singh and H.Parthasarathy, ”Image parameter estimation using Edgeworth models for nonGaussian noise and nonlinear ﬁltering theory in discrete time”, Technical Report, NSIT, 2016. [10] Rohit Singh, Naman Garg, Kumar Gautam and H.Parthasarathy, ”Design of quantum gates using scattering theory”, Technical Report, NSIT, 2016. [11] Lalit Kumar and H.Parthasarathy, ”The magnetic ﬁeld produced by a non-uniform transmission line carrying current when nonlinear hysteresis and nonlinear capacitive eﬀects are taken into account, Technical report, NSIT, 2016. [12] K.R.Parthasarathy, ”An introduction to quantum stochastic calculus”, Birkhauser, 1992. [13] K.R.Parthasarathy, ”Coding theorems of classical and quantum information theory”, Hindustan Book Agency. [14] K.R.Parthasarathy, ”Mathematical foundations of quantum mechanics”, Hindustan Book Agency. [15] J.Gough and Koestler, Quantum Filtering in Coherent States”. [16] S.Weinberg, The quantum theory of ﬁelds, Vols.1,2,3, Cambridge University Press. [17] Leonard Schiﬀ, Quantum mechanics, TMH. [18] P.A.M.Dirac, ”The principles of quantum mechanics”, Oxford University Press. [19] S.Weinberg, Gravitation and Cosmology:Principles and applications of the general theory of relativity. [20] Mandel and Wolf, Optical coherence and quantum optics. [21] W.O.Amrein, A Hilbert space approach to quantum mechanics, CRC press. [22] T.Kato, Perturbation theory for linear operators, Springer.